BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] (420 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 269 bits (687), Expect = 6e-70, Method: Composition-based stats. Identities = 420/420 (100%), Positives = 420/420 (100%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA Sbjct: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN Sbjct: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL Sbjct: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD Sbjct: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ Sbjct: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN Sbjct: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG Sbjct: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 237 bits (604), Expect = 3e-60, Method: Composition-based stats. Identities = 51/472 (10%), Positives = 129/472 (27%), Gaps = 79/472 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + +I+ +M + + G + + +++ +++ ++ A+LA A Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA- 82 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +L ++ D + ++ ++ + + + + + N Sbjct: 83 ----DLDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASK-------NMDT 131 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + L+ ++ L + ++ + L + Sbjct: 132 QFIGRFGFPTLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQNAADIFLDTVLK 191 Query: 187 ERP-----IFLIELVVDLSGS---MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + L+ ++ M + D + + +A + Sbjct: 192 DENQDLISVSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQ 251 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + + E+ S ++ + D + T M Sbjct: 252 MQHFQWNYDGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQID---DLVPRAGTSIFLGM 308 Query: 299 KQAYQILTSDKKRSFFT---------NFFRQGVKIPSLPFQKFIIFLTDGENNNFK---- 345 K A +L + + F+ + K +I +TDG N+N Sbjct: 309 KWAAAMLDPAFRDINNSLVNAGYVDREFYNRPASYTDSETLKTVILMTDGANDNSYRIRS 368 Query: 346 ------------------------------------------SNVNTIKICDKAKENFIK 363 N ICD AK I Sbjct: 369 NYYDSDSEYVHWNKYNLWWYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIV 428 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I +I ++ C SSP + + V L F+ I++ + + + Sbjct: 429 IWSIGFEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 237 bits (603), Expect = 3e-60, Method: Composition-based stats. Identities = 50/472 (10%), Positives = 127/472 (26%), Gaps = 79/472 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + +I+ +M + + G + + +++ +++ ++ A+LA A Sbjct: 24 LRKRLTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA- 82 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +L ++ D + ++ ++ + + + + + + Sbjct: 83 ----DLDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASK-------DMDT 131 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + L+ ++ L + ++ L + Sbjct: 132 QFIGRFGFPTLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQDAADIFLDTVLK 191 Query: 187 ERP-----IFLIELVVDLSGS---MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + L+ ++ M + D + + +A + Sbjct: 192 DENEDLISVSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQ 251 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + + E+ S ++ + D + T M Sbjct: 252 MQHFQWNYDGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQID---DLVPRAGTSIFLGM 308 Query: 299 KQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN------ 343 K A +L + F+ + K +I +TDG N+N Sbjct: 309 KWAAAMLDPAFRDINNSLVNAGHVDREFYNRPASYTDSETLKTVILMTDGANDNSFRISN 368 Query: 344 ----------------------------------------FKSNVNTIKICDKAKENFIK 363 N ICD AK I Sbjct: 369 TYYNEDSEYVHWNRYNLWWYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIV 428 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I +I ++ C SSP + + V L F+ I++ + + + Sbjct: 429 IWSIGFEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 53/472 (11%), Positives = 116/472 (24%), Gaps = 81/472 (17%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + + K+ E F I + LL G + ++ + + ++ + A+LA A Sbjct: 21 KRFVKEFARDEDGAFIIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAA-- 78 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 +L + ++++ +A + + S Q + Sbjct: 79 ---DLDQTLSPAAVVTDYFAKAGLSSFLTSTTVDQGLNYRIIS-------AQGNMTMPTT 128 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 M L + + + + + + + Sbjct: 129 FMRLSGQTELAIRGDATAEERVSNVEISLVVDISGSMGRNNKLSTLRTASHTFIDTVIRP 188 Query: 188 RPI-----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 +I ++ + ++ C D + + Sbjct: 189 ETEDLISLNIIPYTAQVNAGPDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPVSTRT 248 Query: 243 DLLSHVKEDVYMGLI-----GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + E+ + S ++ VT T Sbjct: 249 YKQMQHFQYGWSSSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSL---RARANTAIHLG 305 Query: 298 MKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN------- 341 MK +L + F + V K I+ +TDG+N Sbjct: 306 MKWGVSMLDPTFRPIVTAMIANNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRIS 365 Query: 342 ---------------------------------------NNFKSNVNTIKICDKAKENFI 362 +++ ICD AK I Sbjct: 366 DEFYSTPSQIAHWDRYQLFFFTNNYIDRDIDQNYYYKKFTATQADTMLQSICDAAKAEGI 425 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + TI S + + C SSP + + V L F +I++ + + Sbjct: 426 LVWTIGFEVSNHAAGEMLDCASSPSHFFRVEGV-ELSEAFASIARQINQLRL 476 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 218 bits (555), Expect = 1e-54, Method: Composition-based stats. Identities = 50/467 (10%), Positives = 120/467 (25%), Gaps = 79/467 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ E + + + + M ++ G I ++ ++ +++ + A+LA A + Sbjct: 28 RRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAA-----D 82 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L + + ++ + + DA + + A Y T M Sbjct: 83 LDNELEAQGVVEDYMDKMALADALISVDVDEGLNYRTVVAEGYKT-------MPSNFMQI 135 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE---- 187 + + L + L + + + + + E Sbjct: 136 LGVDNLQAYGLAEATERINKVEVSLVLDISGSMDDNDKLANMQDAAGTFIDTLLAEGNED 195 Query: 188 -RPIFLIELVVDLSGSM---HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 I L+ ++ + + + A + + Sbjct: 196 LVSISLVPYSEQVNAGPEILSYLSANWKHGYSHCIEMPNSVFGSAALDFSRTYEQMQHYQ 255 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + E+ S ++ V + T MK Sbjct: 256 WNYDGYNNTLSDTVCPRYGYERIQAWSHDASALKAQVNQL---QPRAGTSIFMGMKWGTA 312 Query: 304 ILTSD---------KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-------------- 340 +L + S F + V K ++ +TDG+ Sbjct: 313 LLDPSTRPIASGMIARGSVDQVFEGRPVAYDDTDVLKTVVLMTDGQHDRSYRIQDWAYNS 372 Query: 341 --------------------------------NNNFKSNVNTIKICDKAKENFIKIVTIS 368 N + IC AK I I ++ Sbjct: 373 ESEYAHWNRYNLWYYLSRYVSSYERSSFYYQKYNADLGDALLGSICAAAKAQGIIIWSVG 432 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +G ++++C SSP + + V + F I+ + + + Sbjct: 433 FEVGDHGADVMESCASSPAHFFRVEGV-EITEAFSTIAHTLNQLRLT 478 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 217 bits (552), Expect = 2e-54, Method: Composition-based stats. Identities = 59/463 (12%), Positives = 130/463 (28%), Gaps = 68/463 (14%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + +E N ++I AL+++ + + GF I +K ++ A ++A+LA M Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDG 65 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 R E+ + +I + E + + Sbjct: 66 KDRAYSLKEANDYFKGILNQSNNSGLNCTNIDLVYIDETEE---LEGHVECSQNTTLSKV 122 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN--------------------- 170 LD + ++ + + N Sbjct: 123 AGIRHLDFNVSSAATYGIGKLEIAFVFDVSGSMANDNRMGNLKVAAREAVNTLLPVEGYA 182 Query: 171 ----QKIVSFIPALLRIEMG-----------ERPIFLIELVVDLSGSMHCAMNS----DP 211 ++ + + G ER + + + N Sbjct: 183 GDPEDVRLAMVSYDTMVNAGPYFKAVTNQDPERTEPFYGYIRERTTCRRYRNNGTCREWN 242 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVYMGLIGYTTRVEKN- 266 + ++ + + D L V ++T + + Sbjct: 243 YEWRGPYHRSYTIKSTCVWEREGAERYTDASPGHNRWLPPVSATFDSYNDSWSTDHQTDP 302 Query: 267 -------IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 I ++ K+ ++ D + T + +++ + + Sbjct: 303 WCNDNTPIPLTYNRNKLHDFID---DMTPRRNTAGHIGQAWGWYLVSPEWNSVWPAGSKA 359 Query: 320 QGVKIPSLPFQKFIIFLTDGENN-------NFKSNVNTIKICDKAKENFIKIVTISINAS 372 K +I ++DG+ N S ICDK KE + I T+ +A Sbjct: 360 LP--YDEPDATKVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDAG 417 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 GQ +L C S+P + Y N L +++I++ + + S Sbjct: 418 -YGQDVLNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRIS 459 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats. Identities = 58/475 (12%), Positives = 125/475 (26%), Gaps = 79/475 (16%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L R R + +E + +++ + + G + V+ ++ ++ +A+ L Sbjct: 14 LRDRVRRHVVDFARAEDGVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDASTL 73 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A A K D + + ++ +A + + + + T Sbjct: 74 AAAHK-----DNALDPKQVVLDYFDKAALASYISADDILVGGGETSTAVEVNLT-----A 123 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + H+ N + + L + + R ++ + Sbjct: 124 QVKTPFIRHLGNESFNVPARGRAEQAYGNSEVSLVLDISGSMDDNRRMSRLHRAANEFVD 183 Query: 183 I-----EMGERPIFLIELVVDLSGSMHCAMNSDP---EDVNSAPICQDKKRTKMAALKNA 234 + + LI D++ + D + + A Sbjct: 184 TVLTPDSVDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFSTTAIDPED 243 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + + Y E S + + R T Sbjct: 244 AYIQGQHFSHVDARFNYISCPTQSY----ETVTPFSQNNAALEAQINRLTGRER---TSI 296 Query: 295 TPAMKQAYQILTSDKK--------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK- 345 +K +L + S FR + K I+ +TDG N K Sbjct: 297 HIGIKWGAAMLDEAFRPLVNDLVDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKR 356 Query: 346 --------------------------------------------SNVNTIKICDKAKENF 361 N IC+ AK N Sbjct: 357 IKEFAYDTPDMRAHWARHAMDDWDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANG 416 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I I +I + + + ++ C SSP + Y V + F +I+Q + + ++ Sbjct: 417 IIIYSIGFEINNDAAQEMEDCASSPSHFYRVEGV-QISEAFSSIAQQLKQLRLTL 470 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. Identities = 58/477 (12%), Positives = 131/477 (27%), Gaps = 92/477 (19%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ E ++ L ++ +LL G + ++ ++ +++ + A+LA A Sbjct: 19 LRRFAREEDGLVTLFAILMILLMILLGGVGVDLMRHERERARVQAVADRAVLAAA----- 73 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + ++ ++ + D + + + + + + Sbjct: 74 DLDQTLSPEAVARDYFDKSGLADYISSVTVEEGLNYRRVTVDASR-------DLKTMFID 126 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI----------------- 173 +L L + + Sbjct: 127 KFGQEKLHVPAKATAEEKVAKVEISMVLDISGSMRENDKMNNLHDASNVFIDTVIQTDTE 186 Query: 174 ----VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 +S +P ++ +G + I V L HC D D N I Q + M Sbjct: 187 DLISISVVPYTAQVNVG-KDIMDELNVTQLHSYSHCVDFED-SDFNLTTISQTRSYEHMQ 244 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + D H G + E+ S ++ + Sbjct: 245 HFEAGYYWNGNDRDRTGHYDNISNPGCPKQS--YEEIETFSQNAAALKSRIANF---QPR 299 Query: 290 KPTDSTPAMKQAYQILTSDKKRS-----FFTNFFRQGVKIPSLPFQKFIIFLTDGEN--- 341 T +K +L + F + + + K +I +TDG N Sbjct: 300 ANTAIHLGLKWGVALLDPSFRAINEAIGGDAVFRGRPAEYNDIDTLKTVILMTDGVNVTT 359 Query: 342 -------------------------------------------NNFKSNVNTIKICDKAK 358 +++ ICD AK Sbjct: 360 RRIAPEAYSNRDHYRHWSDYPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAK 419 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I +I + +G ++K C SS + + V ++ F+ I++ + + + Sbjct: 420 AKGIVIWSIGFEVTDHGAAVMKNCASSDSHFFRVEGV-EIVDAFEAIARQINQLRLT 475 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 210 bits (533), Expect = 4e-52, Method: Composition-based stats. Identities = 51/465 (10%), Positives = 131/465 (28%), Gaps = 80/465 (17%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 +E + +I+ + ++ G I ++ ++ +++ + A+LA A +L Sbjct: 29 FRRAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAA-----DLD 83 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 D + ++ + + DA + + +A Y T M + Sbjct: 84 NELDAQGVVEDYMSKMSLADALVSVNVDEGLNYRTVTADGYRT-------MPSNFMQLIG 136 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE-----R 188 + + ++ L + + ++ + + + + Sbjct: 137 IENMQAGGHSQAMERINKVEVSMVLDISGSMDDGDKMAELQTAASDFVDTLLDDGSEDLV 196 Query: 189 PIFLIELVVDLSGSMHCAMNSDP---EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I L+ ++ + D + + A + + Sbjct: 197 SISLVPYSEHVNAGPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTYDQMQHFQW- 255 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + + E+ S ++ +++ T MK A +L Sbjct: 256 NYSGSNSLTDTVCPRYAYEQIRPWSQDAGALKTQISQL---QPRAGTSIFMGMKWASALL 312 Query: 306 TSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF------------ 344 + + F + V K I+ +TDG+++ Sbjct: 313 DPSTRPIASGMIADGTVDAVFEGRPVAYSDTDVLKTIVLMTDGQHDRSFRIQNWAYNDEN 372 Query: 345 ----------------------------------KSNVNTIKICDKAKENFIKIVTISIN 370 + +C AK I I +I Sbjct: 373 EVEHWSQYNLWHYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIWSIGFE 432 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S +G ++++C SSP + + V + F I+Q + + + Sbjct: 433 VSDHGANVMESCASSPAHFFRVEGV-EISEAFSTIAQTLNQLRLT 476 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 209 bits (531), Expect = 8e-52, Method: Composition-based stats. Identities = 47/471 (9%), Positives = 126/471 (26%), Gaps = 80/471 (16%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R E + I V+ L + G + ++ + ++ + AIL+ A Sbjct: 26 RSPLASFKDDESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAA-- 83 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 +L + + ++ + A ++ + + A + + Sbjct: 84 ---DLDQTQTPQAVVDDYFETAGLESFLSNVTVDQGINYKTVGAEAQSIT-------TTA 133 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 M + L+++ + L + ++ S + + + Sbjct: 134 FMKMAGVDTLNATAAGVAEERIANVEISMVLDISGSMGIGSKMTQLRSAATSFVNTVLSP 193 Query: 188 RP-----IFLIELVVDLSGS---MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + L+ ++ + + + + D ++ + + Sbjct: 194 ENEDLVSVSLVPYSQHVNAGPKIYNELNTNHRHNYSHCVEMADSAYSETELDLSVTYDQM 253 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ + I E+ S + + + T MK Sbjct: 254 QHFQW-NYSGANQLTDTICPRYSYERITAFSQDASALNAQIAQL---QPRAGTQIFMGMK 309 Query: 300 QAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF------ 344 A +L + F + K ++ +TDG+N++ Sbjct: 310 WAAAMLDPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKTVVLMTDGKNSSSMRIKSW 369 Query: 345 ----------------------------------------KSNVNTIKICDKAKENFIKI 364 + + IC+ +K+ I I Sbjct: 370 AYDSSSDYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAAQGDALLDDICNASKDAGIVI 429 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +I +G ++ C SSP + + V + F I++ + + + Sbjct: 430 WSIGFEVDDHGADVMANCASSPSHFFRVEGI-EISEAFDAIARQINQLRLT 479 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 207 bits (526), Expect = 3e-51, Method: Composition-based stats. Identities = 62/483 (12%), Positives = 144/483 (29%), Gaps = 82/483 (16%) Query: 7 FRFYFKK---GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R + ++ E +I + V+ LL+ G + ++ + +++ + AILA Sbjct: 13 LRGFCRRLMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILA 72 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + +L + E + ++ +A + + ++ L + F + + Sbjct: 73 AS-----DLDQPLPADEVVDDYFAKAGMTEFLDDVQITPGAHLP--TTNFRVVQAEARTR 125 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + I M L T L + N + + + Sbjct: 126 TPSIYMAMTGVRSLPVYVAGTAEETIENTEISLVLDISGSMRNNGKIGNLRTAAKDFIGA 185 Query: 184 -----EMGERPIFLIELVVDLSGSM------------HCAMNSDPEDVNSAPICQDKKRT 226 + ++ + +SD ++ D + Sbjct: 186 VLEGNAANTTSLNIVPYAGQTNPGPIVFQRAGGRPFATFIEDSDGNEILYGQTFVDDEGN 245 Query: 227 KMAALKNALLLFLD-SIDLLSHV--------------KEDVYMGLIGYTTRVEKNIEPSW 271 + N + LD + ++ + + + + + Sbjct: 246 SIDVPYNTMSSCLDLTNGDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNKSSIRY 305 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT---------NFFRQGV 322 Q D + T + MK +L + +F F + Sbjct: 306 AQNNAGQLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRDTFVALNAAGLVPDGFKDRPA 365 Query: 323 KIPSLPFQKFIIFLTDG---------ENNNFKSNVNTIK--------------------- 352 + +KFI+ +TDG + N+ K++ + Sbjct: 366 DFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTNVANFYS 425 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 IC+KAK I + TI+ A N ++TC +SP + Y V + F++I++ + Sbjct: 426 ICNKAKAAGITVYTIAFEAPANAITQMRTCATSPAFFYKVEGV-EIKTAFKSIARQINEL 484 Query: 413 KYS 415 + + Sbjct: 485 RLT 487 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 207 bits (525), Expect = 4e-51, Method: Composition-based stats. Identities = 82/402 (20%), Positives = 164/402 (40%), Gaps = 54/402 (13%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R +K + +E NF++I A +V + + + K +++A ++A LA Sbjct: 4 TRITSKIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALAT 63 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+++ + + ++ A +F+ +KE ++ F T NI Sbjct: 64 AARLSEEENLTAAQAQA-----------FALKFVNGQVKEDFGAFNG-FSVTPTVNIDPV 111 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + V L + ++ ++ VS + Sbjct: 112 ET--------------GGRTVWKVAVSMEGSQSLTPMARIMG--KDKLTVSVVGKSESAG 155 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + F + LV+D SGSM +N + K+ LK A+ ++ + Sbjct: 156 -EAQGAFSMALVLDRSGSMDWNLNG---------------QKKINVLKTAVGGLIEQFEE 199 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ V +G Y +++ + + W K +++V TDST A AY Sbjct: 200 ADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKEFVDAL---PASGGTDSTDAFDWAYTA 256 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +T ++ + +KFI+F+TDG+NN ++ +T +CD AK++ I++ Sbjct: 257 VTHKRENN-------THDAKSGQVPKKFIVFMTDGDNNYSSADSSTKHLCDDAKDDGIEV 309 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T++ A G++LL C S+ E+ ++ N+ LI F+NI Sbjct: 310 YTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIG 351 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 205 bits (520), Expect = 2e-50, Method: Composition-based stats. Identities = 59/462 (12%), Positives = 129/462 (27%), Gaps = 59/462 (12%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 YF K + E+ + + A + L++ G + ++ + +A +AA L A ++ Sbjct: 8 SYFHKFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAALTVAKQL 67 Query: 69 VSN-LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L+ R + +++ + +I ++ I Sbjct: 68 STTVLTENQIRTGLKNAFRANLNAAGINSQGIDNLDFKVDPGEGTLDVWSSVDI-QTNFI 126 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN------QKIVSFIPALL 181 + + +L+ + L + + +P Sbjct: 127 KLGGIGPEKLEVGAASQVNYSRFDVELALVLDVTGSMRPDMNALKEASKSIVNILLPDDS 186 Query: 182 RIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + I L+ ++ GS + + + + + A N D Sbjct: 187 NSRESKVRISLVPYSQGVNLGSYATRVTNGGSTWRNCVNERSGPQKFTDAPYNYAGSRSD 246 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 Y + TR E + K+ + ++ D Sbjct: 247 FFHGKPKQFVWDYGWTEQWQTRPEACPKTAVEPLTADRTKLLRAISGLKDGGGT---GGQ 303 Query: 296 PAMKQAYQILTSDKKRSF--FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-------- 345 + + L+ K + + G + +KF + +TDG+ N Sbjct: 304 TGIAWGWYTLSPKWKNLWPRDSAPATYGTGSHTDDTKKFALIMTDGDFNAAYGWDCGCRK 363 Query: 346 ------------------------------SNVNTIKICDKAKENFIKIVTISINASPN- 374 K+CD+ K I+I T+ + Sbjct: 364 IRDKPLYCRKKSNKKSWIERYFSPSKISHAPAQRAKKLCDEMKSKNIEIFTVYFDTGGAT 423 Query: 375 -GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 G L+ C S +Y N++ LI F NI+ + + Sbjct: 424 FGDDLMSYCASGSRNYYRADNSNELIQAFSNIANEIQSIYIA 465 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 202 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 54/438 (12%), Positives = 124/438 (28%), Gaps = 46/438 (10%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + N +++FA+ + ++ +G + ++SA +A LA AS Sbjct: 20 MRSLLRHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAAS 79 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDD--AKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 LS+ D + N+ AL + + + L+ S +++ Sbjct: 80 -----LSQGEDPETIVRNYITAALSEHNGVLERLNVQVSSDLAINSREVT---ADAVISV 131 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + L + + L + + + + + Sbjct: 132 PTLMLGIIGYDALTLNRVSEANERVRNLEISLVLDISGSMSGSKITALRDAAEEFVGVMM 191 Query: 185 MGE----RPIFLIELVVDL--SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + +I + ++ + + + M N Sbjct: 192 DPDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCLELGVSDPVTMDLAANGY--- 248 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 D +D + + V D T A Sbjct: 249 -DWLDWQDRDQRGWRSSAFCPEENEATVF-LEQTPSVL---VNLIRDLDAGGNTGLDVAT 303 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---------NFKSNVN 349 + L + +F + K ++ +TDG Sbjct: 304 AWGARALDPAWRGRLGGDFASRPAAYDDPSTMKVLVVMTDGAATAQIRRAQNWYGDWYSY 363 Query: 350 TIKICDKAKEN-----------FIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADS 397 I +A++N + I TI+ + L++ C S PE +Y V N D Sbjct: 364 EIYSASQARDNMADACDAAEAEGVHIYTIAFQVSGSTNRNLMRDCASRPENYYAVENLD- 422 Query: 398 LIHVFQNISQLMVHRKYS 415 + F +I+ + + + + Sbjct: 423 ISAAFNSIAADLNNLRLA 440 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 202 bits (512), Expect = 1e-49, Method: Composition-based stats. Identities = 93/462 (20%), Positives = 184/462 (39%), Gaps = 60/462 (12%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 ++F FY KK I S +F II AL + L + G L+ V+ W Y +++++ A AI+ Sbjct: 5 TKFIFYSKKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITA 64 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + ++ +L + R ++ K+ + + R +N++K++ + + +N Sbjct: 65 SVPLIQSLEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNP 124 Query: 125 SRISMT-------------------------HMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + + + + ++ + Y+ + S + + Sbjct: 125 RKSAYQVVLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVID 184 Query: 160 FIEHLLN---QRYNQKIVSFIPALLRI---------EMGERPIFLIELVVDLSGSMHCAM 207 F +L+ Q + F R ++G R L +V + S++ + Sbjct: 185 FSRSMLDYQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYML 244 Query: 208 NSDPEDVNSAPICQDKKRT------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 P D + + + K +++AL + SI + +V + V MG + Sbjct: 245 YPGPLDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFND 304 Query: 262 RVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 RV + SWG K+ + + + +D + T AM+ AY + S + Sbjct: 305 RVISDPSFSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEV------ 358 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR-- 377 +L +K+I+ LTDGEN + N I IC+KAK I+I+TI+ + + Q Sbjct: 359 -HRMKNNLEAKKYIVLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKA 415 Query: 378 --LLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSV 416 L C S P + + L +F++ I + R + Sbjct: 416 RYFLSNCAS-PNSFFEANSTHELNKIFRDRIGNEIFERVIRI 456 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 198 bits (503), Expect = 1e-48, Method: Composition-based stats. Identities = 56/466 (12%), Positives = 129/466 (27%), Gaps = 60/466 (12%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + K ++ + IFA V+ +++ G I + + + A +AA L+ A+ Sbjct: 2 IKALISKFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVAT 61 Query: 67 KMVSNLSRLGDRFESISN-HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI---- 121 + + + R I N + +++ + + + + Sbjct: 62 DLSTTVLRDNQIKTRIENSFRANLSDAEFLDQAIDNLDFDVDSNAGTVTVSSSAGLNNYF 121 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ------RYNQKIVS 175 +N + + + + + + + + Sbjct: 122 LNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAGDMGALRDAAEEVVDI 181 Query: 176 FIPALLRIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 I + I L+ ++ GS + + + ++ ++ A+ N Sbjct: 182 LIEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVNEREGQQKYTDAVYNY 241 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTT-----RVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + L Y +++ + + + Sbjct: 242 DGTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKNTLISDIRNLSSGGGT 301 Query: 290 KPTDSTPAMKQAYQILTSDKKRSF--FTNFFRQGVKIPSLPFQKFIIFLTDGENN----- 342 + + L+ + + ++ G P +KF + +TDG+ N Sbjct: 302 ---GGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKFALIMTDGDFNAQYGK 358 Query: 343 -------------------------------NFKSNVNTIKICDKAKENFIKIVTISI-- 369 N +CD K I+I T+ Sbjct: 359 EERTTCTGRGRNRVCTTNEYWVERYHRYSDYNDPPATRARTLCDAMKAENIEIFTVFFDT 418 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S G L+ C S +Y+Y N D LI F NI++ + S Sbjct: 419 GGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRIQQIYLS 464 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 198 bits (502), Expect = 2e-48, Method: Composition-based stats. Identities = 50/471 (10%), Positives = 116/471 (24%), Gaps = 84/471 (17%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + E + +I+ ++ +++ G + + ++ +++A A ++L Sbjct: 21 RFARHEDGSITILTIFIIIMMVMVGGIQLDFMRHEMERTKLQAAA-----DRAVLAAADL 75 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + ++ + D + + + N + Sbjct: 76 DQTLAPADVVDEYFAKSGMSDYLSSVTIENGLNFRTVTVKANN-------EMKTQFLGRF 128 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI-----EMGE 187 LD + L + N + + Sbjct: 129 GFPTLDVPALSKAEERVEKVEISLVLDVSGSMKNNSKLTTMKDAAKTFIDTVLRPETKNN 188 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + LI ++ + + C D + D Sbjct: 189 VSLSLIPYSEQVNVGPDIFNALWVDTRHDFSYCIDVPDGHFVQTQMTPGFPWDQTQHFQW 248 Query: 248 V--------KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +++ + E+ S ++ + T MK Sbjct: 249 NTYSIESGYQQNTLHDTVCPRAVYERVRPISQDGPSLKAQIDLF---QPRAGTAIYMGMK 305 Query: 300 QAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK----- 345 +L + + F + K I+ +TDG+N+N + Sbjct: 306 WGTALLDPSFRETTASLVSDSVVESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTA 365 Query: 346 -----------------------------------------SNVNTIKICDKAKENFIKI 364 N IC AK+ I I Sbjct: 366 YYNSSSEVVHWSKWNFNYYLSQYIKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVI 425 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 TI + G ++K C SSP + + V L F I+ + + + Sbjct: 426 WTIGFEVNDTGADVMKKCASSPSHFFRVEGV-ELTDAFSAIASQINQLRLT 475 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 198 bits (502), Expect = 2e-48, Method: Composition-based stats. Identities = 56/473 (11%), Positives = 137/473 (28%), Gaps = 79/473 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E +I + V+ LL+ G + ++ + +++ + AILA + +L Sbjct: 3 FKREEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAAS-----DLD 57 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + E + ++ +A + + ++ L + F + + + I M Sbjct: 58 QPLPADEVVDDYFAKAGMTEFLNDVRITPGSDLP--TTNFRIVQAEARTRTPSIYMAMTG 115 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI-----EMGER 188 L + L + N + + + Sbjct: 116 VRTLPVYVSGTAEETIEKIEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAAKTT 175 Query: 189 PIFLIELVVDLSGSM------------HCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + ++ + +S+ +++ D + + N + Sbjct: 176 SLNIVPYAGQTNPGRIVFERAGGLPFATFIEDSNGDEILYGQTIVDDEGNSIDVPYNTMS 235 Query: 237 LFLDSIDLLSHV---------------KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 LD + + + + + N + + Sbjct: 236 SCLDLTNSDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNNSSIRYAQNDAGRLQD 295 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSF---------FTNFFRQGVKIPSLPFQKF 332 D + T + MK +L + +F F + + +KF Sbjct: 296 FIDDMRLHDGTGTQYGMKYGVALLNPSSRNTFLALNAAGLVPDGFKNRPADFGTTDTRKF 355 Query: 333 IIFLTDGENNN------------------------------FKSNVNTIKICDKAKENFI 362 I+ +TDG+ + + N +C+KAK I Sbjct: 356 IVLMTDGQITDQFRPEDKNDPKNDEIALNQRTGDRDTYSTQSTNVTNFYSVCNKAKAEGI 415 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + TI+ A + ++TC +SP + Y V + F++I++ + + + Sbjct: 416 TVYTIAFEAPADAVTQMRTCATSPAFFYKVEGV-QIKTAFKSIARQINELRLT 467 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 197 bits (500), Expect = 3e-48, Method: Composition-based stats. Identities = 60/445 (13%), Positives = 133/445 (29%), Gaps = 43/445 (9%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +S F + + N +II AL + +G + + ++SA ++ LA Sbjct: 1 MSVLERIFSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 AS L++ + + + + + AL D + + ++ S V Sbjct: 61 AAS-----LTQDRNPEDVVRAYVEAALADHPQLLASLQLDV-VADISLNSRVVNATASVA 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + N L + + L + + N + I + + Sbjct: 115 MPTTMLGLVGINTLTLEHASEAIEQVRDVEISLVLDVSGSMGGSKINALQDAAIEFVEIV 174 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSD---PEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + V+ +G + + N+ L + + Sbjct: 175 LAADAAERTSISVIPYNGGVRTPREVNQDIVSGNNNHRRQSGCVDMGTDYPVEMTLPYRE 234 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + E + R E S ++R + T A Sbjct: 235 MEFTEYYGSEQTGNSSSAFCPRSNMESEFLSQNEGRMRGLINSL---RAEGNTGLDVATM 291 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--------------- 344 + L + + +F + K ++ +TDGE Sbjct: 292 WGARALDPAWRGNLGGSFSDRPASYDDRDTIKILVVMTDGEATAQIRSEEYTYYDWWGRE 351 Query: 345 -------------KSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSPEYHY 390 ++ N + CD A+ N ++I TI+ + + L++ C + P+ +Y Sbjct: 352 RTGTRSYELYSARQARENMAEACDIAEGNGVQIYTIAFQLSGQTNRDLMRNCANKPQNYY 411 Query: 391 NVVNADSLIHVFQNISQLMVHRKYS 415 V N D + F +I+ + + + Sbjct: 412 QVENLD-IAEAFSSIAADINRLRLT 435 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 196 bits (499), Expect = 4e-48, Method: Composition-based stats. Identities = 242/429 (56%), Positives = 317/429 (73%), Gaps = 13/429 (3%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+LL F F FKK I S KANFSI+F++ ++S LL IG LIYVLD+++KKN+ME+AN +A Sbjct: 1 MNLLRTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 IL+GASK++S +S +++S+H RA++DD RFIK++IKESL S+VF +E Sbjct: 61 ILSGASKIISRISY---FGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNI 117 Query: 121 IVNSSRISMTHMANNRL--------DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 I +S++S+T + + N FY++ V T YDY ++F ++LLN++ N K Sbjct: 118 ISQNSKVSITREPHPNVFHEFNNQSILQNKKTFYHISVETFYDYHIKFFDNLLNKKINSK 177 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ-DKKRTKMAAL 231 I+SF+PAL++I+ GE P F ++LVVDLS SM C MNSDPE +C KK +KM AL Sbjct: 178 IISFVPALVKIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDAL 237 Query: 232 KNALLLFLDSIDLLSHV-KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K A+LLFLDS+D S K+ Y+GL GYTTRVEKNIEPSWGT KVR+Y+ ++D +L Sbjct: 238 KKAVLLFLDSVDRGSKTQKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLG 297 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 TDSTPAMK+AYQILTSDKKR+F N + +KIP LPFQKF+IFLTDGENN+ KS+V T Sbjct: 298 QTDSTPAMKKAYQILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKT 357 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 IKIC+KAK+N IKI+TISINAS NG+RLLK CVS+PEY+YNVV+ SL+ VFQ+IS L+ Sbjct: 358 IKICEKAKKNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLIT 417 Query: 411 HRKYSVILK 419 H KY VILK Sbjct: 418 HYKYQVILK 426 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 196 bits (499), Expect = 4e-48, Method: Composition-based stats. Identities = 80/416 (19%), Positives = 140/416 (33%), Gaps = 42/416 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L G I V D K ++ A ++A LA A+ + + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAADSAALATATALGN 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIK------NHIKESLSGYSAVFYNTEIQNIVNS 124 + N + + + + +++ S SG SA + T + + Sbjct: 70 GTIQTSQAEAFARNFVAGQMANYLQNGVDIKNATAVNVQTSNSGKSASYQVTVTPSYDLT 129 Query: 125 SRISMTHMANNRLDSSNNTIFYN---------MDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 M + + S ++ + S L + + Sbjct: 130 VNPLMQAVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFLALDKSGSMGDPTETV---- 185 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + D + ++ ++ + TK+ ALK A Sbjct: 186 -----------NKDQPTETFTYDCNPHLNKK---GKWVYDTCTGSRTNYYTKIEALKMAA 231 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + V G + Y +WGT V YV T+S+ Sbjct: 232 GNLFGQLTSADPDAQYVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNAL---QAGGGTNSS 288 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNT 350 AM AY LT+ +K+I+F+TDG+NNN S + T Sbjct: 289 GAMGTAYSSLTAKNAAGNDAEDA-AHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLT 347 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 CD AK I+I TI+ A P GQ LL+ C S +++ + L+ F+ I Sbjct: 348 KATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIG 403 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 196 bits (498), Expect = 6e-48, Method: Composition-based stats. Identities = 77/411 (18%), Positives = 131/411 (31%), Gaps = 37/411 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L G I V D K ++ A ++A LA A+ + + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAALATATALAN 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIK------NHIKESLSGYSAVFYNTEIQNIVNS 124 + N + + + + ++ + SG S + T + + Sbjct: 70 GTIQTSQAEAFARNFVAGQMANYLQSGVDIKSATGVTVQTNTSGNSTSYQVTVSPSYDLT 129 Query: 125 SRISMTHMANNRLDSSNNTIFY----NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 M + S + S L + Sbjct: 130 VNPLMQAVGFTTQHLSTSGTTIGGHSQTQGSISMYLALDKSGSMGEDTATV--------- 180 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 D +G + ++ + TK+ ALK A Sbjct: 181 ------NEEDPTESYTYDCNGHYNKK---GKWIYDTCTGSRANYYTKIEALKMAAGNLFG 231 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + V G + Y +WGT V YV T+S+ AM Sbjct: 232 QLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTYVNAL---QAGGGTNSSGAMST 288 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICD 355 AY LT+ +K+I+F+TDG+NN+ S + T CD Sbjct: 289 AYSSLTAKNAAGNDAEDA-AHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCD 347 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 AK I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 348 TAKSKGIEIYTIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIG 398 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 195 bits (496), Expect = 9e-48, Method: Composition-based stats. Identities = 83/415 (20%), Positives = 138/415 (33%), Gaps = 36/415 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R ++ + NF I+ A+ L G I V D K ++ A ++A LA A+ Sbjct: 6 LRPCLRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAALATAT 65 Query: 67 KMVSNLSRLGDRFESISNHAKRALID------DAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + + + N + + D K +++ + SG S + T Sbjct: 66 ALANGTIQTTEAEAFARNFVAGQMANYLQSGTDIKSTTSVNVQTTTSGKSTSYQVTVSPA 125 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFY----NMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 V + M + S + S L + S Sbjct: 126 YVLTVNPLMQAVGFTTQHLSTSGTTIGGHSQTQGSISMFLALDKSGSMGEDTATVNEESP 185 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + D + +H ++ + + TK+ ALK A Sbjct: 186 TES---------------YTYDCN--LHYNTKNNKWVYDKCTGSRTNYYTKIEALKMAAG 228 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + V G + Y +WG V YV T+S+ Sbjct: 229 NLFSQLNSADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNAL---QANGGTNSSG 285 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-----KSNVNTI 351 AM AY LT+ + +K+I+F+TDG+NNN + T Sbjct: 286 AMNTAYTSLTAKNAAGNDVEN-SAHQQKTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATK 344 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 K CD AK I+I TI+ A GQ LL C S +++ + L+ FQ I Sbjct: 345 KTCDDAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQAIG 399 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 195 bits (495), Expect = 1e-47, Method: Composition-based stats. Identities = 58/479 (12%), Positives = 139/479 (29%), Gaps = 70/479 (14%) Query: 1 MHLLSRF--RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 M R R K KA+ +F L V+ +++ G I V + + A + Sbjct: 1 MQYFRRLIGRPLLKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAID 60 Query: 59 AAILAGASKMVSNL-SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 AA L+ A+ + +++ S + + + ++ + + + Sbjct: 61 AAALSVAADLSTSVMSDEQIKAALADSFKANLADVEFLDEAIKNLSFVVDAENGTIKVSS 120 Query: 118 IQNIVN----SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ-- 171 + N M + + ++ + + N Sbjct: 121 FATLDNYFIDMGGYGMQALGPETFNFGTSSQVTYSRFDVELALVVDVTGSMRNDMDTLRD 180 Query: 172 ----KIVSFIPALLRIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRT 226 + IP + I L+ ++ G+ + + +C +++ Sbjct: 181 ASKGLVNILIPETTEEADSKVRISLVPYSQGVNLGTYAAKVKGGVYGYADSSVCVTERQD 240 Query: 227 KMA-----ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++ + + + K+ Y G + K I + + + + Sbjct: 241 YDDGEDIYKVRYTDMPYNYYVKTDPPPKDVFYGGGSNRCSGTSKMIPLTADRDTL---LD 297 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D T + + ++ + + + + KF I +TDG+N Sbjct: 298 AIADLDDNGGTAGQTGVVWGWNSISPNYSDVWPLASKPEP--YDNDDVLKFAIIMTDGDN 355 Query: 342 N-------------------------------------------NFKSNVNTIKICDKAK 358 N N S+ +C K Sbjct: 356 NRFYEFVKEREECDWVYSRRYGWQWTCEMVSVNQWQERSESESYNNNSSKAQRALCQAMK 415 Query: 359 ENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I I + + + + +++C S+ +Y ++D LI+ F NI++ + S Sbjct: 416 DEGISIFGVYFGTNDSSAGSKNMQSCAST-GNYYKATSSDELINAFANIAKKIQQIYVS 473 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats. Identities = 79/411 (19%), Positives = 135/411 (32%), Gaps = 36/411 (8%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ L + G I V D K ++ A ++A LA A+ + + Sbjct: 10 LRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAALATATALAN 69 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIK------NHIKESLSGYSAVFYNTEIQNIVNS 124 + N + + + + +++ S SG S + T + + Sbjct: 70 GTIQTSQAEAFAQNFVAGQMANYVQSGVDFKSGTSVNVQTSTSGKSTSYQVTVSPSYDLT 129 Query: 125 SRISMTHMANNRLDSSNNTIFY----NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 M + S + S L + Sbjct: 130 VNPLMQAVGFKTQHLSTSGTTVGGHSQTQGSISMFLALDKSGSMG--------------- 174 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 D + +H ++ + + TK+ ALK A Sbjct: 175 EATATVNADDPTESYTYDCN--LHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFG 232 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ E V G + Y +WGT V YV T+S+ AM Sbjct: 233 QLNSADPNAEYVRTGAVSYDINQYTPSNLAWGTAGVTSYVNAL---QANGGTNSSGAMST 289 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICD 355 AY LT+ +K+I+F+TDG+NN+ S + T CD Sbjct: 290 AYSSLTAKNAAGNDAED-SAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCD 348 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 AK I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 349 TAKSKGIEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIG 399 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 76/406 (18%), Positives = 136/406 (33%), Gaps = 38/406 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ + G I V D K ++ A ++A LA A+ + + Sbjct: 11 LRRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAADSAALATATALAN 70 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + + + I + + + + S ++ Sbjct: 71 GTIQTSQAEAFARDFVAGQMANYLQSGIDIKSTTGVDVRTTTSGKSTSYQVTVSPDYNIA 130 Query: 131 HMA----------NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 N S+ + S L + Sbjct: 131 VNPLMQTIGFTTQNISTSSTTTSGNSQTQGSVSMFLVLDRSGSMGEDTATV--------- 181 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 E D S + + + TK+ ALK A+ Sbjct: 182 ------NASDPTEEYNYDCS---EKDRYGNVTKKKTCTDTRPHYYTKIEALKLAVGTLTG 232 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +D + KE V G + Y ++K WGT V +YV + TDS A K Sbjct: 233 ELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNKLT---ATDGTDSGEAFKT 289 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY L + V K+I+F+TDG+NN ++ T CDKA++ Sbjct: 290 AYNKLA-------DAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSADTETKTWCDKARDA 342 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +++ TI+ A GQ LL C ++P ++ + +L+ F+ I Sbjct: 343 KMQVYTIAFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIG 388 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 190 bits (483), Expect = 3e-46, Method: Composition-based stats. Identities = 87/411 (21%), Positives = 152/411 (36%), Gaps = 27/411 (6%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F + + NF ++ AL L + G + V + KN ++ A +AA LA AS +VS Sbjct: 11 FMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVS 70 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + ++ + A S+ + E+ ++ T Sbjct: 71 DARPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNAT 130 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + + + V + + + LL I + + Sbjct: 131 EIDITATPNGAKGKSFQVTVANKHLLQFNAMTRLLGPE---SIEIETRSTAESATESKNA 187 Query: 191 FLIELVVDLSGSMHCAMNS--------------DPEDVNSAPICQDKKRTKMAALKNALL 236 + LV+D SGSM N+ + TK+ ALK A+ Sbjct: 188 LSMYLVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDALKTAVG 247 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L + V G I Y + + SWGT YV + + T S Sbjct: 248 DLLAQLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYVDALV---AIGGTASGN 304 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K A+Q +T + K+I+F+TDGENN+ + T + CD Sbjct: 305 AFKTAFQKVT-------NAAEDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDT 357 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 AK + ++I +++ A GQ+LLK+C SS +++ A L+ F+ I + Sbjct: 358 AKASKVQIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 190 bits (482), Expect = 4e-46, Method: Composition-based stats. Identities = 58/467 (12%), Positives = 128/467 (27%), Gaps = 63/467 (13%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +++R R + + + N ++IFAL+++ L IG I + +M++A ++ L Sbjct: 6 IITRSRKVASRFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTAL 65 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + ++ + + E + R + + S + Sbjct: 66 MVSKDLGADKIKTSEVSEKAQTYFNSLYTGTEARGVTLTTNYTAKDDSGSSTVVVNGDGA 125 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-------------- 168 S M L + L ++ Sbjct: 126 -VSTHFMKMFGFPSLAIGSAATATWGGTRLRVAMALDVTGSMVLNGSTKLAEMKKAASAL 184 Query: 169 ----------YNQKIVSFIPALLRIEMGERPIFLIELVVDL-----SGSMHCAMNSDPED 213 + +S +P + +G I + D+ + + Sbjct: 185 VDTLRASAQSKDDLYISVVPFAQMVNVGSSNIDASWIKWDVWDETEGSCSKSKFKTKTDC 244 Query: 214 VNSAPICQDKKRTKM---AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 ++ R+K ++ + L+G T+ + + Sbjct: 245 EDNGRTWTVTDRSKWKGCVTDRDQPADTTKDAPTSDDTRFPALRTLLGTTSCPAQIFPMT 304 Query: 271 WGTEKV--RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 ++ D + T+ M A+ L N K P+ Sbjct: 305 SAYAATDAQKIKDVIDDLVADGGTNQPIGMAWAWMSLQQG-------NPLNTPAKDPNYK 357 Query: 329 FQKFIIFLTDGENN--------------NFKSNVNTIKICDKAK-----ENFIKIVTISI 369 + II L+DG N + K + +CD K + TI + Sbjct: 358 YTDAIILLSDGLNTMDRWPDYGDGQRQFDGKIDARQKLLCDNIKLPDSNGKRPVVYTIQV 417 Query: 370 NASPN-GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 N + + +LK C + A + F I + + + Sbjct: 418 NTTGDPESTILKYCA-DGGNFFATTTASGIGTAFAQIGSSLSKLRIA 463 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. Identities = 57/454 (12%), Positives = 117/454 (25%), Gaps = 55/454 (12%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + + ++ N +IIFA++++ + IG I + + SM++A ++A L + Sbjct: 11 RDVSSRFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKD 70 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + S + G ++ + I + + ++ + S Sbjct: 71 LASGVITAGQVSAKAQSYFASLYNNTEAPNITVTATYTAKDSTGSSTVL-LKGTGDISTE 129 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN--------------------- 166 M L + L + + Sbjct: 130 FMNMFGFPTLGIGSAATATWGGTRLRVAIALDVTGSMASAGKMPAMQSAAKTLVDNLRAN 189 Query: 167 -QRYNQKIVSFIPALLRIEMG----ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 Q + +S IP + +G + D +GS + + SA Sbjct: 190 AQTADDLYISIIPFAQMVNVGKSNKNASWIKWDYWEDTTGSCNWWWLTTKSSCESAGRTW 249 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV- 280 + Y+ E+ + + Sbjct: 250 SSTNQSQWGGCVTDRDQPADTTKDAPTTAATRFPAANYSACPEQILPMTSAYSSSNATTI 309 Query: 281 -TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ M A+ L K + II L+DG Sbjct: 310 KDKIDALSPNGGTNQPIGMHWAWMSLQDG-------APLNTPAKDADYKYTDAIILLSDG 362 Query: 340 ENN-----------NFKSNVNTIKICDKAKEN------FIKIVTISINASPN-GQRLLKT 381 N + + +CD + I TI +N + +LK Sbjct: 363 MNTIDRWYGNGSSWSKDVDARQKLLCDNIRAASAASTTKTVIYTIQVNTDGDPESEVLKY 422 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C S + A + F I + + + Sbjct: 423 CADS-GNFFATTTASGISTAFAQIGASLSKLRIA 455 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 186 bits (471), Expect = 7e-45, Method: Composition-based stats. Identities = 59/477 (12%), Positives = 124/477 (25%), Gaps = 92/477 (19%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + + +IIFAL+++ L +G + + K+ + A +AA LA S + Sbjct: 3 LRRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGSSTGT 62 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + K + +D A F S M Sbjct: 63 AAELQQIGQKFFDANFKDSGLDAAGSF----------SVSVTGDVVSANGSAQVQTTLMQ 112 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY--------------------- 169 + + + S + L + Sbjct: 113 LVGIDTIAVSESAQVIRSIKGLELALVLDNTGSMTTSDNIGALRDAAQELVDILFGGRAD 172 Query: 170 -NQKIVSFIPALLRIEMGERPIFLIELVVDLS-------GSMHCAMNSDPEDVNSAPICQ 221 V+ +P + G LI + + + A Sbjct: 173 HPTLRVAVVPYSASVNPGPIAPTLISGNDAYAPTNLLGWKGCVIERVGRAMEDSPASTAP 232 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + A+ N S + + G + V + Sbjct: 233 WLRYQWLPAIDNYYDATKASTVRADPSQGN--GGTGPNLGCPTPITPLTGVKATVDSAIQ 290 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T M ++L+ + F +G+ + + K +I +TDG+N Sbjct: 291 AL-RAWSRGGTMGDIGMAWGLRVLSPE-------PPFTEGLAWNTPKWAKAVILMTDGDN 342 Query: 342 NNFKS-----------------------------------------NVNTIKICDKAKEN 360 +K N ++C K+ Sbjct: 343 QFYKLTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLTQVCQAMKDK 402 Query: 361 FIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + TI+ + + + K C SS ++ + L F+ I+ + + S Sbjct: 403 GITVYTITFTSGINQATKDIYKACASSTAKWFDSPSQADLRASFRAIATELSQLRVS 459 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 185 bits (470), Expect = 1e-44, Method: Composition-based stats. Identities = 47/408 (11%), Positives = 127/408 (31%), Gaps = 15/408 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I ++ ++ F + + + L+ ++ + A ++A+LA A + Sbjct: 4 IQRQRGAITLTFTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACAYQQNDTGVV 63 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + ++ + + N+ + + Y + S + + Sbjct: 64 TEGILDYYRPNFVLPELNKSVKLNSNNGCQISAQYRFEPAMVNALPVAIDSDTEVVSNSQ 123 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPALLRIEMGERPIFLI 193 + N + + L + +KI++ + + + + ++ Sbjct: 124 SSAKLVQNVNVNGIQNPVDFSLVLDISGSMTWHLPELKKIITDVISDIVPSSNQVRFSIV 183 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + S E + + A K L+ E Sbjct: 184 PFQTGVGVSGA-PWLLSSEASPKCVDGLVYRNGNLDADKTV--QSLNYSSDRLDFNEVTP 240 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + + + + +V +YV S T S + LT ++ + Sbjct: 241 GRWLDRCSETSFILPLTNNLNRVIRYVESLDTS--GGSTASYQGFIWGVRTLTDQWQKEW 298 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKA-KENFIKIVTISIN 370 + S + +I TDG++N ++ +++ + +CD ++ I++ I Sbjct: 299 QVTPVQ------SSSLTQRLILFTDGDDNRRDYFNDLMSAGLCDVIQQDLNIQVSFIGFG 352 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 S + + K C ++ N L F++ + + K ++L Sbjct: 353 VSADRIKQFKQCAGRNGSVFDANNTAELADYFEDAININIETKVRIVL 400 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 57/484 (11%), Positives = 137/484 (28%), Gaps = 92/484 (19%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 Y E + I+ L +++ L++ G + + + ++ +++S ++ A+LA AS Sbjct: 19 IRYLHAFGRDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAAS- 77 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 L++ D + + ++ +A F + + + +++ ++ + Sbjct: 78 ----LNQTLDSRDVVEDYFAKAG------FPNALVGAPIVVDNGNSRTVTVRSALDVNTF 127 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY------------------ 169 + +RL + + L + Sbjct: 128 YLRLAGMDRLTAPARSSATEGVGKVEISLVLDISGSMRFSNRFVNMQAAAIAFAEEVLDP 187 Query: 170 ---NQKIVSFIPALLRIEMGERPIFLIELV----VDLSGSMHCAMNSDPEDVNSAPICQD 222 ++ IP G + V L+G D C + Sbjct: 188 ANGGTVSLTIIPYAGATNPGPEMFAFMGGVRYPDTLLAGDDGILGTEDDYFFPQVSSCVE 247 Query: 223 KKRTKMAA--LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + ++ L A + + + ++ + + + Q Sbjct: 248 MVGSDWSSAGLPGAGRAQVPHFQVWDIARS-----VMDWGWCPQDRSSIQYAMATPAQAR 302 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF-----------FTNFFRQGVKIPSLPF 329 + + T + AMK A L + +F F + Sbjct: 303 SFINGLRMHDGTGTHYAMKYALATLDPSSQPAFMHLSHPGRGLVPPQFANRPAAWDDPET 362 Query: 330 QKFIIFLTDGENNNFKSN---------------------------------VNTIKICDK 356 +K I+ +TDG+ + IC Sbjct: 363 KKIIVLMTDGDITQQERPRIAQQERDIDYIISRSINGRDNRGQFVDAATNVGRFEAICTL 422 Query: 357 AK--ENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 A + + T++ PN ++ C S P + + LI VF I++ + Sbjct: 423 ANQPARSVDVYTVAFEVQPNSAADLQMRNCASDPSMFFR-TSGAELIDVFSGIAERITDL 481 Query: 413 KYSV 416 + ++ Sbjct: 482 RLNL 485 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 183 bits (464), Expect = 4e-44, Method: Composition-based stats. Identities = 62/490 (12%), Positives = 133/490 (27%), Gaps = 102/490 (20%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + ++ + ++ AL ++ +++ G I ++ ++ ++ + +LA AS Sbjct: 13 SSFLRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAAS----- 67 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L++ E+I + K A+ + + + + E + + Sbjct: 68 LTQTRPPRETIESFLKAAVTKGSYAL------DVKADELSNAKRVEASATAVTDTAFLRL 121 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL-------------------------- 165 + ++L + L + Sbjct: 122 IGIDKLTVEAHAEAEEKRKNIEISLLLDMSGSMRFDKSGSYPGPSGAMRINYLRPAAKSF 181 Query: 166 ------NQRYNQKIVSFIPALLRIEMGERPIFLIELV---VDLSGSMHCAMN----SDPE 212 + + VS +P ++ +G + D S N P+ Sbjct: 182 MDMVLADGAEDYTTVSIVPYAGQVSIGPVLFDALARNRRQHDRSSCFQFGRNDFTLGVPD 241 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY-------TTRVEK 265 N + ALK A + D G T Sbjct: 242 FANLPQTQHFTQANHHDALKKAGEAQITEPWWCPDDPHDPRPGTTPDFVAGEGKDTDRTS 301 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR-------------- 311 S E +++ + T + A+K +L + Sbjct: 302 VSFLSNDREYLKRQIDNYKL---YDGTGTPIALKWGLLLLDPAIQPMLREAARYRALSEE 358 Query: 312 -SFFTNFFRQGVKIPSLPFQKFIIFLTDGE----------------NNNFKSNVNT---- 350 F + KF++ +TDG NN N + Sbjct: 359 LDIDARFSNRPASFTDPDTMKFLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDLYSVG 418 Query: 351 ------IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +C AK+ + + TI + + + + C S E Y V NA + F++ Sbjct: 419 DAERFAAALCTAAKQKNVIVFTIGFDVNDTAAKQMSNCASGAERFYRV-NALDIQDAFKS 477 Query: 405 ISQLMVHRKY 414 I+ + K Sbjct: 478 IATAIQKIKL 487 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 182 bits (461), Expect = 1e-43, Method: Composition-based stats. Identities = 52/456 (11%), Positives = 120/456 (26%), Gaps = 50/456 (10%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + L + +K + + +I+F L + +IG + + ++ +A +AA Sbjct: 19 VGLKAAGSRAIEKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAA 78 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LAGA + +N ++ +A + + I +++ Sbjct: 79 VLAGARALQTNGGDQAAALRVAQSYYAQATKNR-LSLSNDTINFAIADN---ATAMVTTG 134 Query: 121 IVNSSRISMTHMANNRLDS----------SNNTIFYNMDVMTSYDYRLQFIEHLLNQR-- 168 + M L + + N ++ L + Q+ Sbjct: 135 NAVITTPFMGLAGTGSLPILRKDGSDYSKAVLAVGGNAELNLEIAMMLDITGSMRGQKLT 194 Query: 169 -----YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + + + I V L + + + Sbjct: 195 DMKAAASDLLNIVVWTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPCVVERTG 254 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 A + + + Y + E + + + V Sbjct: 255 TEKYTDAAPATGKYVMVHNTSSTKKNKTTYSPTCDVASSAE-VLPLTSDKSTLLAKVNGL 313 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T A+ +L + + + + +K + +TDGE N Sbjct: 314 S---TAGSTAGHIGTAWAWYMLAPNWSSLWTSASST-PAAYNADNLRKIAVLMTDGEYNT 369 Query: 344 FK-----------------------SNVNTIKICDKAKENFIKIVTISINASP-NGQRLL 379 S+ + C K I++ T+ L Sbjct: 370 QYTTNGVPDDSSSLTRCPNAANGVCSSAQAVSQCTAMKAKGIEVYTVGFQLDNQTAIDTL 429 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C + + YN D+L F++I+ + S Sbjct: 430 SQCATDSSHFYNSTTGDALKAAFRDIALKISTLYLS 465 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 180 bits (455), Expect = 5e-43, Method: Composition-based stats. Identities = 57/488 (11%), Positives = 134/488 (27%), Gaps = 86/488 (17%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + R R + + N + IFA++++ L +G + + ++SM++A ++A L Sbjct: 6 IRERIRSSAVRFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAAL 65 Query: 63 AGASKMVSNLSRLGD---RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A + ++ + DA+ + + + Sbjct: 66 MVAKDANAASPQMTADQVTAAAQKYFNALYHNTDAQ--GASVSAVYTPYNNGTPATVVLS 123 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP- 178 N M + ++ N+ + L + + ++ Sbjct: 124 GSGNVQTDFMKVVGFPQISFKTNSTATWGNTKLRVAMALDVTGSMSSAGKLVQMKIAAKK 183 Query: 179 -----ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-- 231 G+ I +I V ++ + S E + + + Sbjct: 184 LIDTLKASATAEGDVYISIIPFNVMVNVGANNNTASWLEWEDGSYDNSSSNYGSCSGSGK 243 Query: 232 --KNALLLFLDSIDLLSHVKEDVYMGLIG------------------------------- 258 N + + + + G + Sbjct: 244 SKPNTKSSCIAAGKTWTPKNISSWKGCVTDRGPVSKPGSGDYDTTKDEPVASTPYTLYLA 303 Query: 259 --YTTRVEKNIEPSWGTEKVRQYVT--------RDMDSLILKPTDSTPAMKQAYQILTSD 308 Y+T + + + + + + + T+ AM+ A+ +L Sbjct: 304 RNYSTCPSSILPMTSAYDSKESDSSTDDSTLKGKINNLVANGATNQAIAMQMAWMMLQP- 362 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-----------NFKSNVNTIKICDKA 357 T F K + II L+DG N + + + +C+ Sbjct: 363 ------TAPFPAPAKDEKYKYTDAIILLSDGLNTQDRWYGNGSDWSSQVDTRQALLCNNI 416 Query: 358 KENFI---------KIVTISINASPN-GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K + I +I TI +N + +LK C + + A + F I Sbjct: 417 KNDPISKTDPTRRTRIYTIQVNTDGDPESTVLKNCATDG--FFPTSTASGIASAFAQIGA 474 Query: 408 LMVHRKYS 415 + + + Sbjct: 475 SLSQLRIA 482 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 178 bits (451), Expect = 1e-42, Method: Composition-based stats. Identities = 52/460 (11%), Positives = 117/460 (25%), Gaps = 54/460 (11%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + L ++ ++ + N ++IFAL+++ L IG + + + S+++A ++A Sbjct: 4 ISLAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L + + + + + + S Sbjct: 64 ALMLSRDLGVGTITPDQVSSKAQTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSG- 122 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ--------- 171 M + + ++T L + + Sbjct: 123 QGAVQTQFMKILGFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNL 182 Query: 172 -------------KIVSFIPALLRIEMGERPIFLIELVVDL----SGSMHCAMNSDPEDV 214 +S +P + +G + DL +GS S Sbjct: 183 VDSLRASAQTVDDVYISVVPFAQMVNVGSSNRNASWVRWDLWDESNGSCSSWWYSTKSSC 242 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT- 273 A A + Y ++ + + Sbjct: 243 EYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYS 302 Query: 274 -EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T+ M A+ L + + K + + Sbjct: 303 LSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTG-------DPLNTPAKDSNYKYTDA 355 Query: 333 IIFLTDGENN-----------NFKSNVNTIKICDKAKE-----NFIKIVTISINASPN-G 375 II L+DG N + + + +CD + N + I TI +N + Sbjct: 356 IILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPE 415 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +LK C S + + + F I + + + Sbjct: 416 SAVLKYCADS-GNFFATTTSSGIGTAFAQIGSSLSKLRVA 454 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. Identities = 66/432 (15%), Positives = 144/432 (33%), Gaps = 50/432 (11%) Query: 6 RFRFYFKKG---IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 RF + ++ + N +IFALSV +G I Y + ++ A ++A+L Sbjct: 10 RFAGFRRRLGESCRDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAVL 69 Query: 63 -AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A A K ++ SR ++ + + + D +K + E++ + + T Sbjct: 70 RAMALKSATDESRGVAADKAFAENFGHPGVYDLNGALKREVNENIISQTYTVHAT----- 124 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 S + + + + + L + + S + + + Sbjct: 125 --VSSYFGAFFGKDSYPVTVVSQAKTSLDVFEIAFVLDTTGSMAEANKMPNLKSSVDSAM 182 Query: 182 ------RIEMGERPIFLIELVVDLSGSMHC----AMNSDPEDVNSAPICQDKKRTKMAAL 231 + I ++ + S + + +D + + Sbjct: 183 AGLLQNGKNLSGSKIAVVPFNTQVRLSDATVTTMSSQGLSSGWGNCVHDRDLATSHDVSA 242 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 A ++ L ++ S R ++ Sbjct: 243 SAAQKGKAQTLYPLE----------TCDEASLKPVQGLSDNISSARNFIK---TLQPGGY 289 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-------NF 344 T+ T ++ ++L+ ++ S T F S +KF+I +TDG+N Sbjct: 290 TNVTMGVQWGMEVLSPNQPFSDATEF-------GSTKARKFMIVVTDGDNTKSFTSWSAS 342 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + T C+ AK I + T+ I + +L+ C S+PEY Y++ +A+ L Sbjct: 343 VIDKRTALACENAKAKGITVYTVKIIQGNSN--MLRKCASAPEYFYDLTSANQLNAAMSG 400 Query: 405 ISQLMVHRKYSV 416 I + + + S+ Sbjct: 401 IFKSINKTRLSM 412 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 177 bits (448), Expect = 3e-42, Method: Composition-based stats. Identities = 46/460 (10%), Positives = 119/460 (25%), Gaps = 54/460 (11%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + L ++ ++ + N ++IFAL+++ L IG + + + S+++A ++A Sbjct: 4 ISLAAQLTRAARRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L + + + + + + S Sbjct: 64 ALMLSRDLGVGTITPDQVSSKAQTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSG- 122 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-- 178 M + + ++T L + + + + Sbjct: 123 QGAVQTQFMKILGFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNL 182 Query: 179 ---------ALLRIEMGERPIFLIELVVDLSGSMHC-AMNSDPEDVNSAPICQDKKRTKM 228 + + P + V + + + + E S ++ Sbjct: 183 VDSLRASAQTADDVYISVVPFAQMVNVGSSNRNANWVRWDLWDESNGSCSSWWYSTKSSC 242 Query: 229 AAL-----KNALLLFLDSIDLLS---------HVKEDVYMGLIGYTTRVEKNIEPSWGT- 273 + + + + Y ++ + + Sbjct: 243 EYAGRTWTATSHNQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYS 302 Query: 274 -EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T+ M A+ L + + K + + Sbjct: 303 LSNATTIKNKIDALSPNGGTNQAIGMHWAWMSLRTG-------DPLNTPAKDSNYKYTDA 355 Query: 333 IIFLTDGENN-----------NFKSNVNTIKICDKAKE-----NFIKIVTISINASPN-G 375 II L+DG N + + + +CD + N + I TI +N + Sbjct: 356 IILLSDGLNTVDRWYGNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPE 415 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +LK C S + + + F I + + + Sbjct: 416 STVLKYCADS-GNFFATTTSSGIGTAFAQIGSSLSKLRVA 454 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 177 bits (448), Expect = 3e-42, Method: Composition-based stats. Identities = 81/405 (20%), Positives = 163/405 (40%), Gaps = 43/405 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + K +I+ A+ + L++G +I V + K + S + +++ A+++++ Sbjct: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI-QNIVNSSRISM 129 E N+ K+ D IKN S +IV S+ + + Sbjct: 68 ---------EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + N Y++ ++ Y L+F + ++ IV I + +++ Sbjct: 119 VVVPQNEG--------YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + +V+D+S SM D TK+ ++ L+ + L+ V Sbjct: 171 RLDMMIVLDVSRSMESFF--------------DSSITKIDMAIKSINAMLEEVKLIPDVN 216 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V GL+ ++ ++E+ WG +++ + + T+STP +K AY + Sbjct: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV--STNSTPGLKYAYNQI---- 270 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 F RQ ++K I+F+TDGEN + K + ++ C++AK+ + I I Sbjct: 271 ---FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 L+ C S P Y V N S+ F +I + +V ++ Sbjct: 328 RV-IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 176 bits (447), Expect = 4e-42, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 99/363 (27%), Gaps = 33/363 (9%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + E +++ ++ + G + ++ + K+ +++ + A+LA A Sbjct: 19 RLKDRANAFAREEDGLMTVMALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLAAA 78 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +L + + ++ ++A + + + SA Sbjct: 79 -----DLDQQVSPETVVRSYLEKAGLLEYLSSVTVQEGLGYRKVSAT-------ATAELP 126 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR--- 182 M + L + L + + + + + Sbjct: 127 THFMKLSGYDSLTIPAASTAEESIGNVEISLVLDVSGSMNSNSRLYNLKNAAKEFVDHML 186 Query: 183 --IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL---KNALLL 237 E G I ++ ++ + ++ C + + + + L Sbjct: 187 SATEPGTVSISIVPYATQVNAGADILSYYNVSTEHNYSHCVNFIDDEFSQPGLSRVTPLE 246 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D S+ K+ + + + + S + Y+ T Sbjct: 247 RTMHFDPFSYTKDPISTP-VCPVRASTEILPFSNDQTVLNNYIDGLTG---RGNTSIDIG 302 Query: 298 MKQAYQILTSDKKRSF---------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K +L + +F + S K +I ++DGEN N Sbjct: 303 TKWGVVMLDPGTQSVISGLISDNKVPASFQGRPSAYDSGDVLKVLIVMSDGENTNQYMLN 362 Query: 349 NTI 351 ++ Sbjct: 363 PSL 365 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T +CD AK+ I + T+ A +G+R+LK C SS ++Y+ + F +I+ Sbjct: 490 DQRTDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDADGL-EISDAFTSIA 548 Query: 407 QLMVHRKYS 415 + + + Sbjct: 549 SSIRKLRLT 557 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 176 bits (447), Expect = 5e-42, Method: Composition-based stats. Identities = 61/457 (13%), Positives = 136/457 (29%), Gaps = 67/457 (14%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + NF+ IFA++++ + G + + + ++ +++A +A+ LA Sbjct: 1 MRRIQKLLGALWQDRRGNFAAIFAIAIIPVVAAAGATVDISRAYIVESRLKAALDASALA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 S++ +S N A S + Sbjct: 61 VGGATGMTTSQMQAMAQSFFNANYPASKLGV---------PGTLSVSQSGNVVSLSVHAQ 111 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP----- 178 M + N L+ S + M L + + + + Sbjct: 112 LPTTLMGVVGINTLNVSATSQVTRMGKKLEVALVLDNTGSMASGGRMTVLKTAAKNLITT 171 Query: 179 -ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----------- 226 + G+ + ++ VD++ S P Sbjct: 172 VSAAATNPGDVKVAIVPFNVDVNIGTTNENVSWLHWDEFTPSGGGGNGNGNCNIIQILLG 231 Query: 227 ---------------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEK 265 ++ ++ + + + ++ Sbjct: 232 LCNNNNNSNSHAGWEGCVMDRDQNYDAQNTFPPPNPGGSNATRYPASNSDSDNSNCNLQT 291 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + S + ++ + T++T + + +LT P Sbjct: 292 IMPLSTNWSALNSHIDAMASA---GNTNTTIGLAWGWNMLT--------QGGPLSSAAAP 340 Query: 326 SLPFQKFIIFLTDGE-------NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + K I+FLTDG+ NN+ N T IC+ K IK+ ++ + L Sbjct: 341 AANLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSVRVIEGN--ATL 398 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ C + P +Y+V A L VF +I+Q + + + S Sbjct: 399 IRNCATEPGMYYSVTTASELTSVFASIAQSLSNLRIS 435 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 176 bits (447), Expect = 5e-42, Method: Composition-based stats. Identities = 51/434 (11%), Positives = 126/434 (29%), Gaps = 35/434 (8%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 A + + +I+ ALS + L ++G I + + + A +AA LA + + + Sbjct: 22 AADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLAVSRENFQVSTN 81 Query: 75 LGDRFESISNHAKRALIDD---------AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + S A ++ +A + Sbjct: 82 DQLKALAQSYFDANFPPGTMGATTSLSVATSGTPPTVQGFTVTVTATLPLVFAPLVETLG 141 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 ++ + ++ + T + L + + + + + Sbjct: 142 GPTIGSVGISKASGAVFTTQTSNQGGMELVIVLDNSASMKGSQEDLRGGVKALLDMLYGN 201 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSID 243 + L +V SG+++ ++ + A N L + Sbjct: 202 ADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCPMATVNGKLNGSRLSNAP 261 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + G+ S + + T + ++ Sbjct: 262 PKTFKFDSTTDGVEIQYCGASTLGTSSALSPNRGDADKAIKSYVAGGDTLIGEGLVWGWR 321 Query: 304 ILTSDKKRSF---FTNFFRQGVKIPSLPFQKFIIFLTDGEN------------------- 341 +LT + + + +K ++ +TDG N Sbjct: 322 MLTPSWRGLWNTKDQPGASLPLDYDLPYMKKVLVLMTDGVNHIAGRNYTAYYSDPYQTVA 381 Query: 342 NNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + K++ + + IC+ A K++ + + TI+ + + Q+ + C S P HY+ L Sbjct: 382 DASKADADLMTICNAAKKDHNVVLYTITYGSDTDEQQ-MSDCASDPSKHYHAALPQDLAK 440 Query: 401 VFQNISQLMVHRKY 414 F + + K Sbjct: 441 AFTQVGTDLTTMKL 454 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 59/398 (14%), Positives = 124/398 (31%), Gaps = 42/398 (10%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I+ + I+ A+ + L + F + V + + A ++A AGA + Sbjct: 15 ISRRRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGARTLSLEQDA 74 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 +I AK + + + ++ LS + + V S Sbjct: 75 SLAHKSAIEYAAKNNVAGNTLTLADSDVQIGLSVRT---------DDVGRFTFSSGGKLL 125 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 N ++ + D F + + + I Sbjct: 126 NSVNVTGRRTQQAPDGAVRLYLTPIF---------GHEFFQPVADA----TASQIDRDIA 172 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LVVD SGSM +N + + R + AL +++ FL + ++ + Sbjct: 173 LVVDRSGSMTFRINRNSYESGWRNNDPVPSRARWWALVDSVDGFLTELGSTPQLE---LV 229 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L Y + + + + + ++ + T+ T M + L + Sbjct: 230 SLSTYNSSAKIDEQLTDKYSRIEDALDDYSRRYPDGSTNITAGMDRGISTLQNK------ 283 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K K ++ +TDG +N S A + I + TI+ + N Sbjct: 284 --------KYARPYASKTMVVMTDGNHNYGSSPT---NAAYDAASDDIVVHTITYSDGAN 332 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + H++ + D L +F+ I++ Sbjct: 333 QSLMREVARIGGGQHWHAPDGDELEEIFREIARNAPTL 370 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 174 bits (440), Expect = 3e-41, Method: Composition-based stats. Identities = 67/394 (17%), Positives = 132/394 (33%), Gaps = 48/394 (12%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + +F + +M L ++ I + ++ +++A + A + + Sbjct: 76 KERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRAL------- 128 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E I+ A DA I + + S G S + Sbjct: 129 --LAEEITIAQANAYAKDAFNRIASGLSASGDGSSGTSIFGTMTVKPAVQITETVDGNTT 186 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LIE 194 N S+ + + + +S E + Sbjct: 187 NYVVKVNGTAKIPASPLSFMF--------FDGETGKNTISLGFESETTAKAEAGASLSMA 238 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 LV+D SGSM ++ ++M+ LK A+ + + + + + Sbjct: 239 LVLDRSGSMGW-----------------ERPSRMSELKKAVRSLIKELQTVDPDDQFTRL 281 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G Y E +W VR +V T + PA+++A L + Sbjct: 282 GAYAYHWYYAGKKELTWNKNSVRSWVNSL---PASGGTRAAPAIQKAKNDLLT------- 331 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + FI+++TDG + + + C AK I I T++ A + Sbjct: 332 NSELNAHINKNEQEPDLFILYMTDGIDGD---PNWAKRECTSAKNAGITIYTVAFKAPAS 388 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G+ LLK C +S ++Y+ NA+ L VF++I++ Sbjct: 389 GRNLLKACATSDAHYYDAKNANELNKVFKDIARE 422 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 172 bits (436), Expect = 8e-41, Method: Composition-based stats. Identities = 55/502 (10%), Positives = 130/502 (25%), Gaps = 106/502 (21%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++E N ++I A + L L G I + + +K+ +++A ++A+LAGA + + Sbjct: 14 LRSAEGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGALGRQAGNT 73 Query: 74 RLGDRFESISNH-----------------------------------AKRALIDDAKRFI 98 + + + Sbjct: 74 AAETTLDVQTYALALFTDQGGGLDCDPVAVTFDETNLDILGTVRCRQPTYLSSLIGHDEL 133 Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN----------------- 141 + ++ + + I ++ S + Sbjct: 134 EFNVASTSTYGVGKLDVAFIFDVSGSMNSYNRLAQLKTAAVAAVDELLPDSRERDGTVRL 193 Query: 142 ---------NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK--IVSFIPALLRIEMGERPI 190 N Y V + RYN + + Sbjct: 194 AIASYNHSLNAGAYIGAVTETVTLSADGSNSTALSRYNSHNTKRMIDQDSGKRFFYYQSG 253 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 S D +RT A +A I + Sbjct: 254 TCSSWNCGKYSSWSWDTKRRFFDDTGLADACVYERTGTQAATDAAPGSGAWIGAGNPRWS 313 Query: 251 DVYMGLIGYTTRVEKN------------------------IEPSWGTEKVRQYVTRDMDS 286 Y + + ++ +V + Sbjct: 314 FYAGSSSKYDGWQNVENQNATGYGVGAYEGRHGTCMPSGPVPLTEDKTVLKDHVNALV-- 371 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---- 342 T + + +++ + + + + K +I +TDG+ N Sbjct: 372 -AEGGTAGHLGIAWGWYLVSPEWAAIW--PEASEPLPYRQPQTSKAVILMTDGDFNIEHP 428 Query: 343 --NFKSNVNTIKICDKAKE--NFIKIVTISINASP------NGQRLLKTCVSSPEYHYNV 392 + S ++ +CD K I+I T+ +G+ +L+ C +SP + ++ Sbjct: 429 TASRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQRTGDGRTILEYCATSPSHAFSA 488 Query: 393 VNADSLIHVFQNISQLMVHRKY 414 + + LI V+++I++ + + Sbjct: 489 DSGEELIEVYRSIARSISDLRL 510 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 48/400 (12%), Positives = 123/400 (30%), Gaps = 26/400 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I +++ + SI+F + + L ++ I + A++ + +A A +N+S Sbjct: 5 IKAQQGSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACAYSQRANVSL 64 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + F+ + + + T + + + A+ Sbjct: 65 TQGFAQYYKPNFISHVNAQ-STFLSGQKQCKIQI---GYAFTPLLKDLLPASSQNKVHAS 120 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP----- 189 ++ S++ ++ L + + K + + + Sbjct: 121 VQIQSTSTLTVHSEIKPMDLSLVLDISGSMSGRIGLLKRIINQAIQNIEQQNTKNNTQIR 180 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 ++ +S S + + + + + +D+ +++ Sbjct: 181 FSIVPFSSGVSIS-NAPWLAKSKGKALCVDAMSYPGNVLNTAQTVAD--IDTHPSKLNIR 237 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + LI + + KVR++V T S + L + Sbjct: 238 AKEPLSLINDCNVYSLLLPLTNNLSKVRKHVDSLSIL---GSTASYQGFIWGVRTLLPNW 294 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKEN-FIKIVT 366 ++++ S + +I TDGE+++ + +C + +++ I I Sbjct: 295 QKAWNLQPET------SSLLSQRLILFTDGEDDSRDQFDKLVRSGMCQRIQDDFNIDISF 348 Query: 367 ISINASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQN 404 I SP K C+ S Y+ N L F Sbjct: 349 IGFGLSPRRLDQFKKCIGSNGKGVVYDAKNGSDLEKFFAE 388 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 39/370 (10%), Positives = 103/370 (27%), Gaps = 41/370 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + E + + ++ L + G + ++ + ++ + A+LA A Sbjct: 23 VRSFLREEDGVLAKPMIGTFLAMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAA----- 77 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + D + ++ +A ++ + A + Sbjct: 78 DLDQSLDADAVVLDYLTKAGLEQYYSDPDDQKGLGYKSVEATIDTDF-------EAYLLK 130 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR-----IEM 185 + + N+ + L + + + + + + ++ Sbjct: 131 FAGGDNMSLYANSRAEEIIGSVEISMVLDISGSMNSGNRLVNLQAAAKSFVTQITSNTDV 190 Query: 186 GERPIFLIELVVDLSGS----MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I +I ++ S D + + +K +N L+ Sbjct: 191 SNLSISIIPYATQVNAGEKLLSKYTKVSQEHDYSYCVNFIKDQFSKHTLNQNEDLIRTAH 250 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 D ++ + + T + + K+ Y+ T MK Sbjct: 251 FDTFTYSMNMIDRP-VCPTRPGSAILPFTNDAAKLHAYIDSLT---ASGNTSIDIGMKWG 306 Query: 302 YQILTSDKKRSFFT---------NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +L + NF + S K II ++DG+N N Sbjct: 307 SALLDPTAQPVVNALVDDKVISENFRGRPKAYGSGDTLKIIILMSDGQNTNQY------- 359 Query: 353 ICDKAKENFI 362 + ++ + + I Sbjct: 360 MVNEHRRDGI 369 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 39/82 (47%), Gaps = 3/82 (3%) Query: 336 LTDGENNNFKS--NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 T G N++ + +T ICD K+ + + ++ A G ++L+ C SSP + ++V Sbjct: 469 FTAGRTVNYRVAKDQHTKTICDITKDQGVIVYSVGFEAPSAGIKVLEDCASSPAHFFDVE 528 Query: 394 NADSLIHVFQNISQLMVHRKYS 415 + F +I+ + + + Sbjct: 529 GL-EISDAFSSIATSIRQLRLT 549 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 43/370 (11%), Positives = 105/370 (28%), Gaps = 46/370 (12%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ SE + I ++ L++ G + V+ + +++ ++ + A+LA AS Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS-- 68 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 L++ E + ++ +A ++D + + + + + Sbjct: 69 ---LTQSRSPAEVVRDYVAKAGLEDYLDEPVVN------ANTLNVRSVTATAAYSMPTVF 119 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN------QKIVSFIPALLR 182 M + +RL++ + L ++ N V+ + Sbjct: 120 MKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGTNPRDRLDNLKVAARDFIDI 179 Query: 183 IEMGER---------PIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + G + ++ ++ G+ A + C + + Sbjct: 180 VMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTTA 239 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-------KNIEPSWGTEKVRQYVTRDMD 285 A L Y E S E ++ + R Sbjct: 240 LANGATLTGSGNSELFSSSSSTQTPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLSG 299 Query: 286 SLILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFL 336 T MK +L + F + + S K ++ + Sbjct: 300 E---GSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLM 356 Query: 337 TDGENNNFKS 346 TDG++ N + Sbjct: 357 TDGQHVNQEP 366 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 55/478 (11%), Positives = 121/478 (25%), Gaps = 78/478 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R K A++ N +++F ++V+ + +G + +++M+ A ++A L + Sbjct: 10 LRRTAKAFHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSK 69 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + R D + + K + + + + Sbjct: 70 DYAAGVIRASDIQATAEKYFKALYTSPGINNVTVTATYTARSANGSSTVVM-NTSGSMPT 128 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI------PAL 180 + L + ++ L + + + Sbjct: 129 SFLKVAGFTALPFTASSTSTWGATRLRVAMALDVTGSMDWDDKLTAMKTAAIKLVNTLKA 188 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT--KMAALKNALLLF 238 + I +I V ++ + C+ + T A + Sbjct: 189 TASTDADVYISIIPFNVMVNVGTANKDAEWLDWDTDYGSCKSNRTTQNSCQAAGETWSWW 248 Query: 239 LDSIDLLSHVKEDVYMGLIG-------------------------------------YTT 261 +S K G + Sbjct: 249 ANSCTSRYTRKSTCVAGGETWIPSGVSNWKGCVTDRTTSNDYDVIKTPPTTATPATLFLA 308 Query: 262 RVEKNIEPS------------WGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQILTSD 308 + S T + + T+ + A+ L + Sbjct: 309 KSYSACPLSLLPMKAAYSSNESDTSTAESTLKGKINKLDAEGNTNQPIGLFWAWMSLQTG 368 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-------NVNTIKICDKAKE-- 359 K + II L+DG+N + + K+CD K+ Sbjct: 369 -------VPLNTPAKDTEYKYTDAIILLSDGDNTQSGNSNSVSAIDARQKKLCDNIKDPL 421 Query: 360 NF-IKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 N I TI +N + +LK C S + AD + FQ+I + + + Sbjct: 422 NGTTTIFTIQVNTDGDDESAVLKYCASD-GQFFQSTTADQIEIAFQSIGSSLTKLRLA 478 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 44/370 (11%), Positives = 105/370 (28%), Gaps = 46/370 (12%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ SE + I ++ L++ G + V+ + +++ ++ + A+LA AS Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS-- 68 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 L++ E + ++ +A ++D + + + + + Sbjct: 69 ---LTQSRSPAEVVEDYVTKAGLEDYLDEPVVN------ANTLNVRSVTATAAYSMPTVF 119 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN------QKIVSFIPALLR 182 M + +RL++ + L ++ N V+ + Sbjct: 120 MKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGTNPRDRLDNLKVAARDFIDI 179 Query: 183 IEMGER---------PIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + G I ++ ++ G+ A + C + + Sbjct: 180 VMAGANSGLDGAPVISISIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTTA 239 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-------KNIEPSWGTEKVRQYVTRDMD 285 A L Y E S E ++ + R Sbjct: 240 LANGATLTGSGNSELFSSSSSTQTPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLSG 299 Query: 286 SLILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFL 336 T MK +L + F + + S K ++ + Sbjct: 300 E---GSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLM 356 Query: 337 TDGENNNFKS 346 TDG++ N + Sbjct: 357 TDGQHVNQEP 366 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 170 bits (430), Expect = 4e-40, Method: Composition-based stats. Identities = 51/496 (10%), Positives = 117/496 (23%), Gaps = 107/496 (21%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + N +++F L ++ + + + +++ + A +AA LA A + + Sbjct: 5 DFFRDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVAVLRPATV 64 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + L D+ + + + +++ + Sbjct: 65 EQAQAAVKLRLD---KELGDNPDKVVIGQFNYDTKTRTYYVTAKGTYKPFLLGVVNIKEI 121 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE-------- 184 + + L + + + Sbjct: 122 PYEVIS----ETIQAANGTLELALVLDNTDSMGQILNGSSTRLDVLKTAATNLVNTVMTS 177 Query: 185 -------MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + P V + S V +A TK + Sbjct: 178 ANKDYVKVAVVPYADYVNVGLANRSQSWVSVGADYTVPAAAKTCTTISTK-QVCTGGVYG 236 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTT------------------------------------ 261 DSI +K + TT Sbjct: 237 TCDSIKDGVPIKVGCWKTPQTCTTVNITPYQSCNNPQPTYYKWYGCVRHQVDSKTKMLVL 296 Query: 262 ----------------RVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQ 303 S V + ++S+ T + Sbjct: 297 PDPLTAYTGVLETAQKCPTAIQPLSNDKTVVTNSIKGLVNSIGSYKPDTFIPGGLHWGVN 356 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN---------------- 347 L+ + +K I+ +TDG N + ++ Sbjct: 357 TLSPPAPFK-----EGMAYDSKNKEPKKVIVLMTDGANTLYTNSSGQIVSAATGSPPTIS 411 Query: 348 --------VNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 C AK I++ I + P LK+C + +++++ NA+ L Sbjct: 412 SSLVAPTYTAQDNACKYAKGKNIEVFVIGLGVTDPTALSALKSCATDAQHYFDAQNANDL 471 Query: 399 IHVFQNISQLMVHRKY 414 I F+ I + + Sbjct: 472 IEAFEIIGGKLSVVRL 487 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 170 bits (429), Expect = 5e-40, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 127/409 (31%), Gaps = 25/409 (6%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ + S++F + + L +I I + + A++AA LA ++ L Sbjct: 10 QRGDISLMFVICLPFILTMIAVSILLAMYLLTVTRAGQASDAASLACGYSQRADQDLLVG 69 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + ++ D + + K S + +N + ++ S + ++++ Sbjct: 70 I---LDYYRPGFVVHDGEALVSIDGKNRCSIEATYRFNPTMMALLPESARTHVSLSSDTG 126 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF-----IPALLRIEMGERPIFL 192 +S+ I + + L + Q K++ I E+G L Sbjct: 127 ATSHLVI-NSTPLPMDLALVLDISSSMSAQLPQLKLIINGALEEIRQQDPNEVGGVRFSL 185 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + G ++ + + + + + + L+ Sbjct: 186 VPFETGV-GVLNAPWMPKSAAKVTCVDGLSYGQHSVDYART-VDDLAEPAANLNIKSVFA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L + + + V+Q V + T S + + L + Sbjct: 244 SQWLDACSMDA-TILPLTQDLNLVKQRVDALV---TSGTTSSYQGLIWGVRTLLPQWQEE 299 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-FKSNVNTIKICDKAKEN-FIKIVTISIN 370 + I + ++ TDG + ++ +C ++ I++ I Sbjct: 300 WQIPPVESPALI------QRLVLFTDGADQGFHLDDLIEQGLCRVIQDKHHIEMSFIGFG 353 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 S + + C Y+ N L F+ Q K S++L+ Sbjct: 354 VSDRRLQQFRECAGDKGKVYDAQNTQELEAFFREALQ--TDTKASLVLR 400 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 170 bits (429), Expect = 6e-40, Method: Composition-based stats. Identities = 42/365 (11%), Positives = 98/365 (26%), Gaps = 41/365 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 SE +I+ +M + GF + V+ + ++ ++ A + A+LA A +L Sbjct: 25 FATSESGAMTILTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAA-----DLD 79 Query: 74 RLGDRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + ++ K+ +D ++ + S + + E ++ M Sbjct: 80 QELCPKDVVIDYLKKEGLDKYLTGDPKVEPDVCGSTAAVLKGYRRVEANADMDIEMHFMK 139 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM----G 186 + S+ ++ L + + + + + G Sbjct: 140 WRGIETIASAATSVAEESIGNVEISLVLDVSGSMRGSKLENLKKAANLFIDDMFAKTEDG 199 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID--- 243 + I ++ +S + + + NS C D A + D + Sbjct: 200 KVSISIVPYSEQVSIPDYLMNKLNTQGTNSIANCVDFASADFATTRFTAFDVTDPVTGIV 259 Query: 244 -----------------LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 I + E + + Sbjct: 260 TPGTTLARTIHHDIGDGSDRRPYNGFVSSTICRPNTSTNHREITILQKDPVALKKEINLL 319 Query: 287 LILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T K +L + + F + + K +I +T Sbjct: 320 NASGWTSIDVGAKWGVTLLDDSFQPLTKKLVTESKVPSIFKDRPDQNKGYDTMKVMILMT 379 Query: 338 DGENN 342 DGEN Sbjct: 380 DGENT 384 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 65/231 (28%), Gaps = 25/231 (10%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I D + P+ + R ++ D Sbjct: 393 YNHGTSDIWWNADKEKYSVYDREAGNFIWIDVPMAEKHSRRDWYWVRLYRQ------DHA 446 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + Y+ V +I S EK + S + Sbjct: 447 YGQGTYMQYKCNDYSYGVCHDINFSRSREKKDEGSGAVELSWPALWERTPKG-------- 498 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + G + + L N + + IC KAK+ I I Sbjct: 499 -----KIYDIFKTAFGTSYANEWYNTSTTVL-----NQVQKDPRLTSICQKAKDEKIIIF 548 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQLMVHRKYS 415 +I+ +A + LLK CVS +Y + D +I VF +I + + + + Sbjct: 549 SIAFDAPDGVKPLLKGCVSDDGAYYEAKDNDKDIISVFSSIGSTIQNLRLT 599 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 169 bits (428), Expect = 7e-40, Method: Composition-based stats. Identities = 33/357 (9%), Positives = 91/357 (25%), Gaps = 36/357 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + E + ++ L + G + ++ + ++ + A+LA A Sbjct: 23 LRSFRSDESGALAYPTIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAA----- 77 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + + + ++ +A + + + A T + Sbjct: 78 DLDQTQEPAVVVQDYLNKAGLGEYYEAPEVETGLGYKKVKATIDATFDA-------HLLQ 130 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP- 189 + L ++ L + + + + + Sbjct: 131 FAGGSDLPVYASSTAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAARDFIDTMVENTTD 190 Query: 190 ----IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL-------F 238 I ++ +S S N+ C + + + + + F Sbjct: 191 GRMSISIVPYATQVSVSDELFDEYTTSGTNNFANCINFETSDYSTTALSTTSERERTMHF 250 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + I + + ++ ++T T M Sbjct: 251 SPWYTSNTRASGSPIDYEICDDRSSREILPLQKDATTLKSFITNLT---AWGNTSIDIGM 307 Query: 299 KQAYQILTSDKKRSF---------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K +L + + + F + V K I+ +TDG+N + Sbjct: 308 KWGVALLDPSARPAISSLASGASVPSEFSVRPVDYSDPDTLKIIVLMTDGQNTSQYY 364 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 169 bits (428), Expect = 7e-40, Method: Composition-based stats. Identities = 43/369 (11%), Positives = 105/369 (28%), Gaps = 46/369 (12%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 ++ SE + I ++ L++ G + V+ + +++ ++ + A+LA AS Sbjct: 12 ALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS--- 68 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 L++ E + ++ +A ++D + + + + + M Sbjct: 69 --LTQSRSPAEVVRDYVAKAGLEDYLDEPVVN------ANTLNVRSVTATAAYSMPTVFM 120 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN------QKIVSFIPALLRI 183 + +RL++ + L ++ N V+ + + Sbjct: 121 KLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGTNPRDRLDNLKVAARDFIDIV 180 Query: 184 EMGER---------PIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 G + ++ ++ G+ A + C + + Sbjct: 181 MAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTTAL 240 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-------KNIEPSWGTEKVRQYVTRDMDS 286 A L Y E S E ++ + R Sbjct: 241 ANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLSGE 300 Query: 287 LILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T MK +L + F + + S K ++ +T Sbjct: 301 ---GSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLMT 357 Query: 338 DGENNNFKS 346 DG++ N + Sbjct: 358 DGQHVNQEP 366 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 169 bits (427), Expect = 9e-40, Method: Composition-based stats. Identities = 42/419 (10%), Positives = 115/419 (27%), Gaps = 57/419 (13%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 AL + L + G + + + + +++A +A LAG+ ++ + Sbjct: 1 MALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELPYDPDLSKGIV------ 54 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 +A+ D + ++ + +S + + ++ + M + + + Sbjct: 55 -TQAVNDMVETNMEEAVVTEISAGT-EIRSVKVTAQAEVRMLLMEVLGMADKTVEASAMA 112 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL----RIEMGERPIFLIELVVDLSG 201 + + + + + + + L+ + Sbjct: 113 --GFNKLEVVFVIDNSGSMKGTPIDLVKQASEELTDLLIPDGTTPDTKVGLVPFRGKIRL 170 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + T + F+D + L + + + + Sbjct: 171 GEAVDGYA-----EGCVNADGSLNTGIN------EEFMDEYNALPYYYKRYIT--LDTCS 217 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK----------- 310 + + S + + + T + +K ILT D Sbjct: 218 DIPTVLPLSKNKSTIIAAIGSQTATGAASGTVISEGIKWGRNILTPDAPFTQAGSKEDFR 277 Query: 311 ------RSFFTNFFRQGVKIPSLPFQKFII--------FLTDGENNNFKSNVNTIKICDK 356 T G + T N+ N + + Sbjct: 278 KIMIVLTDGDTEDGECGGTYRATYRPNNYWTNAYYGMGVDTAHCNDGGVLNADMLSEAQL 337 Query: 357 AKENFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLMV 410 AK+ I+I +I +S L+K SS +++++ + + +F+ I + + Sbjct: 338 AKDAGIEIFSIRFGSSDTTDINLMKEIASSKAGTDDHYFDAPSVYDIPDIFKQIGKQLG 396 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 169 bits (427), Expect = 1e-39, Method: Composition-based stats. Identities = 74/399 (18%), Positives = 125/399 (31%), Gaps = 40/399 (10%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 A+ L G ++V D K ++ A ++A LA A+ + + + + N Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSEAEAYARNF 60 Query: 86 AKRALIDDAKRFIK------NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + + + + ++ + SG S + T + + M + Sbjct: 61 VAGQMANYLQSGVDIKSATGVSVQTNTSGNSTSYQVTVSPSYDLTVNPLMQAVGFTTQHL 120 Query: 140 SNNTIF-------YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 S + S L + Sbjct: 121 STSGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATV---------------NEEDPT 165 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + E ++ + TK+ ALK A + + V Sbjct: 166 ESYTYPCNPHYNRK---GKEVWDTCTGSRANYYTKIEALKMAAGNLFAQLSGADPNAQYV 222 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G + Y +WG V YV T+S+ AM AY LT+ Sbjct: 223 RTGAVSYDIVQYAPSSLAWGAIGVSSYVNAL---QAGGGTNSSGAMSTAYLSLTAKNAAG 279 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTI 367 QK+I+F+TDG+NNN S + T CD AK I+I TI Sbjct: 280 NDAED-SAHKLKSGQIPQKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTI 338 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + A P GQ LL+ C S +++ + L F+ I Sbjct: 339 AFMAPPGGQALLQYCASDASHYFQAEKMEDLFAAFKAIG 377 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 32/359 (8%), Positives = 87/359 (24%), Gaps = 38/359 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + ++S L + G + ++ + ++ + A+LA A Sbjct: 27 LRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA----- 81 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + + ++ +A ++ + T + Sbjct: 82 DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKKVQSTIDTTF-------ETHMLK 134 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-- 188 + + + L + + + + + Sbjct: 135 FSSGQDMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTMIANTID 194 Query: 189 ---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I +I +S + D ++ C + + + LD Sbjct: 195 NKMSISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTEELDRTMHF 254 Query: 246 ------SHVKEDVYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + +G R + + + ++ ++ T M Sbjct: 255 SVWSGSDYRASANPLGSPTCEDRADREILPFQKDANTLKGFIDGLS---AKGNTSIDVGM 311 Query: 299 KQAYQILTSDKKRSF-----------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K +L + + F + K I+ +TDG+N N Sbjct: 312 KWGTALLDPSARPAISALASGGGAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQYY 370 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + + D N + T +C+ AK I + TI A +G +LK C SS +H++ Sbjct: 478 WYYGVYDY-WNTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFD 536 Query: 392 VVNADSLIHVFQNISQLMVHRKYS 415 V + F +I+ + + + Sbjct: 537 VRGL-EIRDAFASIATSIRQLRLT 559 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 56/475 (11%), Positives = 128/475 (26%), Gaps = 82/475 (17%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 A+ N ++IF ++++ L +G + + +M+SA ++ L A + S Sbjct: 17 FPAASGGNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSGKI 76 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + + + I S+ + + + M M Sbjct: 77 TAENVQSAANTYFTSLYKNTDAPSIDVTAT-YTPKTSSENAKLTVGGTGSINTEFMKVMN 135 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP------ALLRIEMGE 187 +++ ++ L + + + + + Sbjct: 136 ISQMSLGASSTTTWGGTRLRVALALDVTGSMDSAGKLSAMKTAAKQLIDTLKATSTTKED 195 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK----------------MAAL 231 I ++ V ++ + + S C+ K TK + Sbjct: 196 VYISIVPFNVMVNVGPGNKNATWLDWDTSYGSCKSKYTTKNACQAGGDSWNYWSNTCQSQ 255 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIG--------------------------YTTRVEK 265 K + + + G + Y+ + Sbjct: 256 KTLKSACQAGGHTWTASNVNSWKGCVTDRTQNYDTTKTEPTSATPDTLFLAQNYSDCMAS 315 Query: 266 NIEPSWGTEKVRQYVT--------RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + E + R T+ M A+ L + T Sbjct: 316 LLPMKSAYEATESDSSTDATTLKGRINTLDAQGGTNQGIGMFWAWMTLQA-------TAP 368 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENN-----------NFKSNVNTIKICDKA--KENFI-- 362 K + I+ L+DG N + + + +CD K N + Sbjct: 369 LYTPAKDSEYKYTDAIVLLSDGMNTKNRWYGNGSNWSPQVDDRQKILCDNITTKVNGVPE 428 Query: 363 -KIVTISINASPN-GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I TI +N S + +LK C S+ ++ A + FQ + + + + Sbjct: 429 TTIYTIQVNTSGDPESSVLKYCGSTGG-FFSTTTASGIQSAFQEVGASLTKLRIA 482 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 58/471 (12%), Positives = 131/471 (27%), Gaps = 75/471 (15%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 NF+I+F + L GF + + + K+ ++ +AA+ + A + + + Sbjct: 14 FARHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGVI 73 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D +++ N + + + I + V+ + Sbjct: 74 KEADASKAVQNFLVANSMAGILQ--PDQIVLDRLVVDRTANTVQADAHVDVALFFPVFGM 131 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR------------------------- 168 N + +T D L + Sbjct: 132 GNTQRVTASTTSLYSDKTIEVAMMLDVTGSMAANWWAKTDKIGDLQAAASTAVENLLDNN 191 Query: 169 ----YNQKIVSFIPALLRIEMG---------ERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + V+ +P + G + + V +G+ +S + Sbjct: 192 IDPNNPRVRVAIVPYAEAVNTGGLADSVFVEQAGGSNLPPPVPSAGAPIPVGSSVTLRPD 251 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + K + + ++ + +G + E + +K Sbjct: 252 KCATERKDKDGYADYSSDGPSELRRNNQNQEYLAKVNRDDRMGTCPKPELI-PLTADKQK 310 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ-GVKIPSLPFQKFII 334 + + D T A++ Y +L+ + + + K I Sbjct: 311 L---LDTIADFKAAGVTAGGIAVQWGYYMLSPSWRSTIVNARLGSGPANFDNRKVGKVAI 367 Query: 335 FLTDGENN---------------NFKSNVNTIKICDKAKENFIKIVTISINAS------- 372 +TDG+ N S N ICD K + I+I TI + Sbjct: 368 LMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDNMKRDGIEIFTIGFDLDDPSMTST 427 Query: 373 --PNGQRLLKTCV----SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +L+ C S+ +++Y L F I Q + + ++ Sbjct: 428 ERDQAKSVLQDCSTADTSTLKHYYEAATGPELDEAFNAIVQNI--ERLTIA 476 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 32/359 (8%), Positives = 84/359 (23%), Gaps = 38/359 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + ++S L + G + ++ + ++ + A+LA A Sbjct: 25 LRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA----- 79 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +L + + ++ +A ++ + T + Sbjct: 80 DLDQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKKVQSTIDTTF-------ETHMLK 132 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-- 188 + + + L + + + + + Sbjct: 133 FSSGQDMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTMIANTID 192 Query: 189 ---PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I +I +S + D ++ C + + + LD Sbjct: 193 NKMSISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTQELDRTMHF 252 Query: 246 SHVKEDVYMGLIGYTTRVE-------KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 S Y + + ++ ++ T M Sbjct: 253 SVWSGSDYRASANPLDSPTCEDSANREILPFQKDANTLKGFIDGL---QAEGNTSIDVGM 309 Query: 299 KQAYQILTSDKKRSF-----------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 K +L + + F + K I+ +TDG+N N Sbjct: 310 KWGTALLDPSARPAISALASGGGAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQYY 368 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + + D N + T +C+ AK I + TI A +G +LK C SS +H++ Sbjct: 476 WYYGVYDY-WNTSTKDARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFD 534 Query: 392 VVNADSLIHVFQNISQLMVHRKYS 415 V + F +I+ + + + Sbjct: 535 VRGL-EIRDAFASIATSIRQLRLT 557 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 42/370 (11%), Positives = 104/370 (28%), Gaps = 46/370 (12%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ SE + I ++ L++ G + V+ + +++ ++ + A+LA AS Sbjct: 11 SALRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAAS-- 68 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 L++ E + ++ +A + D + + + + + Sbjct: 69 ---LTQSRSPAEVVRDYVTKAGLADYLDEPVVN------ANTLNVRSVTATAAYSMPTVF 119 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN------QKIVSFIPALLR 182 M + +RL++ + L ++ N V+ + Sbjct: 120 MKLLDIDRLEAPAVSTAEERVSNVEISLVLDMSNSMVTDGTNPRDRLDNLKVAARDFIDI 179 Query: 183 IEMGER---------PIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + G + ++ ++ G+ + + C + + Sbjct: 180 VMAGANSGLDGAPVISVSIVPYTGQVNAGADLLSTYPNVSHRQPYSSCVEFAASDFTTTA 239 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-------KNIEPSWGTEKVRQYVTRDMD 285 A L Y E S E ++ + R Sbjct: 240 LANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETAAGNPTVTPFSHDPEALKLAIDRLSG 299 Query: 286 SLILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFL 336 T MK +L + F + + S K ++ + Sbjct: 300 E---GSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLM 356 Query: 337 TDGENNNFKS 346 TDG++ N + Sbjct: 357 TDGQHVNQEP 366 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N T +ICD A+ I + +++ A GQ LL+ C S+ ++Y + VF +I+ Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSIA 556 Query: 407 QLMVHRKYS 415 + + + Sbjct: 557 SHITQLRLT 565 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 167 bits (423), Expect = 3e-39, Method: Composition-based stats. Identities = 51/482 (10%), Positives = 122/482 (25%), Gaps = 93/482 (19%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + K N +II A+ ++ + IG + V + K+ M +A A + Sbjct: 2 IRNSLSRLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRMSAAL---DAAALA 58 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 S+ + + +K + + Sbjct: 59 VGSSSGTDAQLSAVAQKFFDANYPTGALGAHPSVAVKVTG-------DVISASAVAEVDT 111 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY----------------- 169 + M + N + ++ L + Sbjct: 112 VFMKVVGLNDVPVHADSTVNRQIAGLELAMVLDNTGSMTTNNNIQAVRDAANQLTDILFG 171 Query: 170 -----NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 ++ +P + +G LI + + + + V A Sbjct: 172 TATVHPYLKIALVPYSAAVNVGSVAPSLI--TTGDTYAPNDLLGWKGCVVERAGANGVGD 229 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--------RVEKNIEPSWGTEKV 276 + A + K + + + Sbjct: 230 TSAATAPWTRYKWLPAVDNNYDATKSSTVLANPSNGNASTGPNLGCPTAITPLTNVKATL 289 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ T S M ++L+ + F +G+ + + K +I + Sbjct: 290 TPAINAM-EAWSRGGTLSDVGMAWGLRVLSPE-------PPFTEGLPWGTPKWSKAVILM 341 Query: 337 TDGENNNFKS-----------------------------------------NVNTIKICD 355 TDG+N +K N +C+ Sbjct: 342 TDGDNQFYKLTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSVCN 401 Query: 356 KAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 K I + T++ + + + K C + +++ + D L F+ I+ + + + Sbjct: 402 AMKAKNIIVYTVTFTSGINQATKDIYKACATDASKYFDSPSQDELKSAFRAIATSLSNLR 461 Query: 414 YS 415 S Sbjct: 462 VS 463 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats. Identities = 67/489 (13%), Positives = 143/489 (29%), Gaps = 89/489 (18%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +LL RF + + S+ + +MS L+ +++ V+ + N M +A +AA+ Sbjct: 4 NLLLAMRFTKGDNMRKQTGGISVFMLVLLMSMLVFAAWVMDVMRIYSVHNQMANATDAAL 63 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + V + ++ A +++ L + + Sbjct: 64 ASAIISEVPEST-------AVELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVVLDFV 116 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL----------------- 164 NS + A + N + L + Sbjct: 117 PNS----LNIAAQESVPIRTNAKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFAD 172 Query: 165 ----LNQRYNQKIVSFIPALLRIEMG--ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 R +VS +PA + G E D + SD D S Sbjct: 173 KLYARGNRNQNYVVSIVPASGNVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDRASGR 232 Query: 219 ICQDKKRTKMAALKN----------------------------------------ALLLF 238 R + A ++ A+L F Sbjct: 233 TPAVPGRQRNAMCRDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHF 292 Query: 239 LDSIDLLSHVKEDVYMGLIG---------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 D L V + + R + + + Sbjct: 293 DDGTPLDPPVYPSTNPSNNYRPFHEDKAIFDDIECHVNPIVPFITERRHFESTVQRLVPG 352 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--N 347 T++ M A ++L+ + + + K+++ +DG + + + Sbjct: 353 MNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRD 412 Query: 348 VNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 IC + K+ +K++T++ +RL+++C S PE +Y+V + S+ VF+ I Sbjct: 413 KKMKLICTQLKQPGRGVKVMTVNFG-GAASERLMQSCASGPE-YYHVASLFSVEKVFEQI 470 Query: 406 SQLMVHRKY 414 ++ ++ Sbjct: 471 AEQVISSSL 479 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 166 bits (421), Expect = 4e-39, Method: Composition-based stats. Identities = 46/461 (9%), Positives = 115/461 (24%), Gaps = 66/461 (14%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 A N +I+ AL+ + +L+ + V+ + ++S ++A LA AS + + Sbjct: 1 MGADRSGNVAILTALAFVPLMLITIGSLDVVRMTTAQAKLQSTLDSATLAAAS-LSNTAD 59 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 E I + + + +SL + EI V+ + Sbjct: 60 IEDTVDEYIQANLPDTAPWTTLKLTMGDVTDSL-----NAKSVEITATVDIEMTILKLAG 114 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP---- 189 ++ +++ L + + + + + + Sbjct: 115 IDKTSVLASSVAQQAAQNIEVSVVLDISSSMGGSKITSLREAAKGFIDTMLKEDEDKEYT 174 Query: 190 -IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL------------ 236 + +I ++ + + Sbjct: 175 SLSIIPFGGTVNIGDFYDTYAVNSSTPGVIDSPSSANYYVNKNVPYGKFMFSTEREGCIE 234 Query: 237 ------LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 D + ++ + D + Sbjct: 235 YTDDDFDMAAIPANSRPQVPDFTKWVATNPWCPSEDSAMVLNSNNTTDLKALIDDMDLSD 294 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN------- 343 T ++L+ + +F + K + +TDG Sbjct: 295 GTGMDIGALWGAKVLSGSMRGQLGGDFSDRPADFNDEDTLKVAVIMTDGAITAQFRPRDY 354 Query: 344 ---------------------------FKSNVNTIKICDKAKENFIKIVTISINASPNG- 375 + ++C+ +N +++ TI + Sbjct: 355 TTTGKIKNKTQQTIVSKGNINTASTKADDAVAYFKRVCEYLNDNNVQVYTIGFQINSGSL 414 Query: 376 -QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +LLK C SS +Y V + + F I+ + + + S Sbjct: 415 PDQLLKYCASSLSNYYFVEGLN-IEDAFNAIASAVNNLRVS 454 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 166 bits (420), Expect = 5e-39, Method: Composition-based stats. Identities = 33/371 (8%), Positives = 91/371 (24%), Gaps = 43/371 (11%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L R + + + ++ L + G + ++ + ++ + A+LA Sbjct: 16 LRRGAQSLRAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLA 75 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A +L + + ++ +A + + + A T Sbjct: 76 AA-----DLDQTQPPAVVVQDYLNKAGLGEYYQEPIVESGLGYKRVQATIDATF------ 124 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + N L + L + + + + Sbjct: 125 -EAHLLRFSNGNDLPVFATSKAEESIDGLEISLVLDVSGSMNSNSRLSNLKVAAKDFIDT 183 Query: 184 EMGERP-----IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + I ++ +S N C + + + + + L Sbjct: 184 MVANTTDGKMSISVVPYATQVSLPDDLIDQYTTVGENPYSNCINFEAAEYNSASLSTLDT 243 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTR------------VEKNIEPSWGTEKVRQYVTRDMDS 286 L+ + Y++ + + ++ ++ Sbjct: 244 LERSMHFTPWGYSNRDMRTYYSSPRLVRSPVCDERASREVLPLQKDATTLKNFIQNLS-- 301 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFT-----------NFFRQGVKIPSLPFQKFIIF 335 T MK +L + + +F + + K I+ Sbjct: 302 -AGGNTSIDVGMKWGTALLDPSARPAISAISTGIGASVPGDFSDRPAEYSDSDTIKIIVL 360 Query: 336 LTDGENNNFKS 346 +TDG+N + Sbjct: 361 MTDGQNTSQYY 371 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 36/390 (9%), Positives = 100/390 (25%), Gaps = 12/390 (3%) Query: 37 IGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 I ++ V + + + AA + + + D + Sbjct: 154 ISLVLDVSGSMNSNSRLSNLKVAAKDFIDTMVANTTDGKMSISVVPYATQVSLPDDLIDQ 213 Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + + + ++ + + S+ + Y Sbjct: 214 YTTVGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPWGYSNRDMRTYYSSPRLVRSP 273 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-------GSMHCAMNS 209 + + + + G + + + ++ Sbjct: 274 VCDERASREVLPLQKDATTLKNFIQNLSAGGNTSIDVGMKWGTALLDPSARPAISAISTG 333 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 V + + + +K +L+ +V +D G G Sbjct: 334 IGASVPGDFSDRPAEYSDSDTIKIIVLMTDGQNTSQYYVDDDHRDGPSGVWYNSHYKSYS 393 Query: 270 SWGTEKVRQYV---TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 ++ + +Y + + + S + + + Sbjct: 394 TYDSRYGGRYFFHYNNNWYNEPYGNGSGQSGTAVELNYAELFARTSLKYIYRYIFYEWMN 453 Query: 327 LPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 + + ++ N N T +C+ AK I + TI A NG +L+ C SS Sbjct: 454 FYDARDDWYYGIYSSHGNSTKNARTRSVCEAAKAKGIVVYTIGFEAPSNGVAVLRDCASS 513 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++++V + F +I+ + + + Sbjct: 514 DAHYFDVDGL-EIKDAFASIATSIRQLRLT 542 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 166 bits (419), Expect = 7e-39, Method: Composition-based stats. Identities = 55/440 (12%), Positives = 125/440 (28%), Gaps = 56/440 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K +K + +I+FA+ + + + K +E A A LA ++ N Sbjct: 1 MKLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAHNDPN 60 Query: 72 LSR-------LGDRFESISNHAKRALIDDAKRFIKNH--------------IKESLSGYS 110 ++ D+ IK + + Sbjct: 61 QDYGGGGSPSSANQQIVTDYINAYISDVDSINEIKVYKRNCEEIPECKAGLAVGEPRYFE 120 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL--NQR 168 T Q ++ M ++ S ++ + F + Sbjct: 121 HEVGVTTSQKSWFPGNDAIVGMGDSFSTSGHSLARKYQSEAVDVMFAADFSGSMGDRWTG 180 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLS----------GSMHCAMNSDPEDVNSAP 218 N+K I + I + +E + + + +S + Sbjct: 181 GNKKYEDLIDIIDSISKELQKFNDLEHNDNDNTMGITAYNEYTYSQYSGSSGGWWGDDCY 240 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + Q + + ++ +D + ++ Y + +I + + V Q Sbjct: 241 LSQAESDGFWGGV--SISKTIDGL----WNEKSKDHCNNSYNSGRFNDIPLTSNFDVVNQ 294 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF--- 335 V+R T S A+ + Q+LT G+ + + Sbjct: 295 DVSRF---WPEGGTSSYQALIRGAQLLTYGTNSRRLLIVLSDGMDTDNNLTSSLVNAGMC 351 Query: 336 --LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + G ++ + I+ ++ I + P+ + LK CV + + Sbjct: 352 RDIQQGLESDKTLDNRPIRA---------QMAVIGFDYEPSENQALKDCVGAENVYKAEN 402 Query: 394 NADSLIHVFQNISQLMVHRK 413 + D L + + IS+ + H K Sbjct: 403 SDDILNTILELISEEIGHLK 422 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 165 bits (418), Expect = 1e-38, Method: Composition-based stats. Identities = 68/434 (15%), Positives = 130/434 (29%), Gaps = 75/434 (17%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-------------SANNA- 59 + NF ++ AL L + G + V + KN ++ ++A Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 60 ----AILAGASKMVSNLSRLGDRFES--ISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A A K + + + A A + + E + + Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETPNG 122 Query: 114 Y------NTEIQNIVNSSRISMTHMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHL 164 + + + ++ + S L + Sbjct: 123 TKGKSFQVSVANKHLLQFNAMTRLLGKESIELETRSTADSATESKNAISMYLVLDRSGSM 182 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + VD S + ++ Sbjct: 183 AWKTDT--------------------------VDTSRPRCINWTASNWGESNVRATSPCY 216 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ LK+A+ + + E + G Y R ++ + +WGT+ +V Sbjct: 217 VDKITTLKSAVDKLFTPLAKMDPGNEYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLD 276 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 TDS+ A A + L + +K+I+F+TDGEN ++ Sbjct: 277 ---ATGGTDSSSAFAAAVEEL-------LLDGENEAHLAKNGQTPEKYIVFMTDGENTSY 326 Query: 345 K----------SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 ++ T C AK N I I T++ A G+ LLK C +SP+++ + Sbjct: 327 NGKTSPRDLEKADSVTKAACTTAKNNGIAIFTVAFMAPQRGKDLLKACATSPDHYKEADD 386 Query: 395 ADSLIHVFQNISQL 408 A +L+ F+ I Q Sbjct: 387 AAALVSEFEKIGQK 400 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 38/366 (10%), Positives = 95/366 (25%), Gaps = 28/366 (7%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R F++ E I + ++ L G + ++ + + ++ + + A LA A Sbjct: 16 RGIFQRFRKDEDGALIIFSLMMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA-- 73 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKR-FIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +L ++ + + ++ +A + + + +A + + Sbjct: 74 ---DLDQVMAPADVVRDYMDKAGMLHFLQGDPIVDQGINYRIVTANASAPMPLFFYDLPK 130 Query: 127 ISMTHM--ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + L S ++ L + + + + Sbjct: 131 VFSSPFTPGMSSLTVSGSSTAEERVSDVEISLVLDVSSSMNSNNRMTNLRPAAREFVTTV 190 Query: 185 MGER--------PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + I +I ++ A + + + C T+ L Sbjct: 191 LANNTNAPQGLITISMIPYSAVVNPGTDIAPHLNINRTHEYSTCPMFDDTEFTTTALNLG 250 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + S+ + YT T T + T Sbjct: 251 ASYDHVSHFSYGGSNDMPINPNYTWCFAGDLNAIKPHTTNEADLHTAINNLHAYGNTAID 310 Query: 296 PAMKQAYQILTSDKK-----------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 +K +L S + + K ++ +TDG+N Sbjct: 311 MGVKWGVALLDSSTQSLISSLAGASGTGVPAIANGRPELHTQADVLKVLVLMTDGQNTQQ 370 Query: 345 KSNVNT 350 V Sbjct: 371 WDLVEP 376 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 65/189 (34%), Gaps = 10/189 (5%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + ++ + D + GY+ + ++ + + + S + T Sbjct: 402 KTSVQYQGEATNSRWDDWFYWNGYSGTLRYRDYPNGFNSRLAYVNASPVDASGPGEGTRY 461 Query: 295 ---TPAMKQA-YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN----NNFKS 346 + A +Q L +D+ N + + + N ++ Sbjct: 462 VDNGDELYHASWQQLFADRSYFLINNHYFLDAYYAGAWSWNEYWGTDNSIDHLIVNGSEA 521 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N IC A+ I I T++ A GQ L+ C SS ++++V D + F I+ Sbjct: 522 NTRLSNICAAARAQGIVIYTVAFEAPSGGQTALQDCASSSSHYFDVDGTD-ISGAFSAIA 580 Query: 407 QLMVHRKYS 415 + + K + Sbjct: 581 SDIRNLKLT 589 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. Identities = 75/405 (18%), Positives = 158/405 (39%), Gaps = 44/405 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + K + SI+ A+ + +++G +I + K + + ++L A+K+++ Sbjct: 6 IRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILN 65 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + N+ K+ D + R IKN + + N + S Sbjct: 66 ---------QENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSI 116 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + YN+ ++ Y+ F + I + ++I Sbjct: 117 IIDDQ-------HKDYNLSAVSRYEMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIG 168 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +V+D+S SM+ K+ ++ LD I + V Sbjct: 169 LDMMMVLDVSLSMNDHFGPG--------------MDKLGVATRSIREMLDIIKSIPDVNN 214 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 V GL+ +++++ + +WG + +++ + R + K T P ++ AY + Sbjct: 215 VVRSGLVTFSSKIVQTFPLAWGVQHIQEKINRLIFGSTTKST---PGLEYAYNKI----- 266 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKENFIKIVTISI 369 F + + ++K+IIFLTDGEN++ + ++ C++AK + I + Sbjct: 267 --FDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNIDNKESLFYCNEAKRRGAIVYAIGV 324 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 A + LK C S P+ Y+V N+ L F I + MV ++ Sbjct: 325 QAEA-ADQFLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 164 bits (414), Expect = 3e-38, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 143/429 (33%), Gaps = 55/429 (12%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 S ++ ++ +A NF ++ A+ + L G + + + K +++ ++A LA Sbjct: 4 SAYQKMTRRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAALAA 63 Query: 65 AS-------KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 A+ + + + S K ++ K KN + A Sbjct: 64 ATEARLKEGALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARGKTYT 123 Query: 118 IQNIVNSS---RISMTHMANNRLDSSNNTIFYNM---DVMTSYDYRLQFIEHLLNQRYNQ 171 IQ +N + LD + + S L + + Sbjct: 124 IQTTINYQMQLNPLLGFFGAKTLDLAATGTAVSTVNKGAPISMYLVLDRSGSMSFKTDT- 182 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 L + + +++ + K +L Sbjct: 183 -----------------------LNTKKTSCQNYTVDNWGS-YPNLKNTSPCYVNKATSL 218 Query: 232 KNALLLFLDSIDLLSHVKED------VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K A+ + +++ V G YT +WGT V YV + + Sbjct: 219 KTAVGYLVATLNKADPTYTANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIP 278 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 TD+ ++ AY L + T ++ S F+++I+ +TDGE Sbjct: 279 EFPSGGTDARSSLNAAYNAL-----KKANTVEAKEHKDKKSESFERYIVLMTDGEMTGNS 333 Query: 346 S------NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 S + CD AK++ IKI +++ A G+ LL+ C SS + +Y N + ++ Sbjct: 334 SSWSSSIDQTVRNTCDTAKKDGIKIFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIV 393 Query: 400 HVFQNISQL 408 F I++ Sbjct: 394 TAFGEIARK 402 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 67/531 (12%), Positives = 136/531 (25%), Gaps = 139/531 (26%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L +R + +E + +IFAL+++ L G I + + A +A L Sbjct: 8 LTARLA----RFRHTESGSVLVIFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATAL 63 Query: 63 AGAS-KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A A + + L + E+ A + L DA H E+ Sbjct: 64 ALAKLPLDTTDKDLAAKAEAQVLTALKGLPIDALTVTMRH----------NGDLIEVAAK 113 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-------------- 167 + + S + I L + Sbjct: 114 GATPTSLTRLAGFMSMPLSVSAISNRSMTNLEIALVLDNTGSMKGTKLTNLKAAARDLVT 173 Query: 168 ---------RYNQKIVSFIPALLRIEMGE---------------------------RPIF 191 + N + +P + + +G F Sbjct: 174 SLFQQADPAKPNALKIGVVPFSMTVNVGSGFAGSDWLDINAKSPIHQQIFNAQGVPANRF 233 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM---------AALKNALLLFLDSI 242 + + + P DV Q T A+ ++ + Sbjct: 234 SLFADMGKPWAGCVESRPAPYDVQDTAPSQATPSTLFVPFFAPDESDNDSRAVNDYMADL 293 Query: 243 DLL-------------------------SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 S + + + T Sbjct: 294 PSGGSAGGASNRQLQGMTAKYDKNAFKVSTTARQDGTNYLFGPNAGCEIQPLTRLTTSQT 353 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q ++ T+ + + +L+ F+ GV + +KFI+ +T Sbjct: 354 QLTNAIAAMTVIGDTNIPIGLAWGWHLLSP-------NGPFKDGVAYGEIKTKKFIVLMT 406 Query: 338 DGENNNFKS---------------------------------NVNTIKICDKAKENFIKI 364 DG+N + S + +CD ++ I++ Sbjct: 407 DGQNQSAVSSSDNRSYYSGLGFIWQNRIGTTSNDNAVRTKAIDTRLTLLCDNIRKARIQV 466 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + +LK C +SP ++V N+ L VF+ I+ + + S Sbjct: 467 FAVRVEVNDGDSAVLKACATSPNMFFDVKNSSGLPAVFRAIADQISELRIS 517 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 38/390 (9%), Positives = 97/390 (24%), Gaps = 42/390 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + E+ + + + + G I V+ + +++ ++ + A+LA AS Sbjct: 42 RQFMRREEGTITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAAS----- 96 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L++ D + ++ A ID + ++ + + I M Sbjct: 97 LTQPYDPEGVVRDYFAIAGIDGYRLDVRVEEGLNFR-------RVHAYAELEVRSIFMQM 149 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA------LLRIEM 185 + S L + + + Sbjct: 150 FGVRAMTSPAIGAAEERVRRIEVSMVLDISGSMGENNRMTNMRPAAREFVTEVLSANENV 209 Query: 186 GER---PIFLIELVVDLSGSM---HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + ++ ++G D ++ + T A L + Sbjct: 210 NNELLVSVSIVPYNGRVNGGDLIESVFTYDDLHSESNCTRFAEADFTSTAIDPAVPLQRI 269 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D + +++ + T + + + ++ T M Sbjct: 270 AHWDRGNEEEDESFQWAHCQTDQYGAILPWQHTEAALHAHIDSLN---TGGWTAIDLGMN 326 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A +L + V DG+ + Sbjct: 327 WAVGLLDPAAAPALTGLIASGHVHPEFSDRPAPYR---DGDR------------ATTIDD 371 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 IK+V + + Q L+ S + Sbjct: 372 ETIKVVVLMTDGDNTRQYDLRDIYDSAGRY 401 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 25/254 (9%), Positives = 68/254 (26%), Gaps = 17/254 (6%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ I + + + I + ++ + Sbjct: 368 TIDDETIKVVVLMTDGDNTRQYDLRDIYDSAGRYVGFREGYAPIFRNTSTGQYSIWWEDQ 427 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRD 283 ++ + + G++ +E + + + Sbjct: 428 GA--------FWIPTGNTRDPGGSWQAQPDGGWSRYGMTALEFTEDRANAFDPAEAGNGE 479 Query: 284 MDSLI---LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T A + + ++ F+ + DG Sbjct: 480 VLLWADLFSDHTAGYIAAEWFHAPADESEQWDFYNQLAENPS--HNYIGWDSDGVRPDGV 537 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +++ N + ICD A I + I A GQR+++ C S +++V + Sbjct: 538 VGQSQADTNLMAICDVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVE-GREISE 596 Query: 401 VFQNISQLMVHRKY 414 F +I++ + + Sbjct: 597 AFASIARSINQLRL 610 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 163 bits (411), Expect = 6e-38, Method: Composition-based stats. Identities = 45/446 (10%), Positives = 116/446 (26%), Gaps = 47/446 (10%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + S NF ++ L +++ + + G + + K + + +A +L + Sbjct: 4 HSVSQYLRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDI 63 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + E+ + K + A + + V ++ +S Sbjct: 64 IEEG---KTLAEAELSARKYFDAELAFSSGVGLEVSTFTLTQGVDGIVKLGVSGKTSTSL 120 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL---NQRYNQKIVSFIPALLRIEM 185 + + + S + + + +++ L+ Sbjct: 121 LKAVGREEIPVSVDAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALNALKANS 180 Query: 186 GERPIFLIELVVDLSGSM--HCAMNSDPEDVNSAPICQDKKRTKMAALKN---------- 233 G + I ++ M +N + + KN Sbjct: 181 GSMALTFIPFTDRVNVGMGRANLLNPGDQTAVKKGGWGGCVDVRATKKKNKGETEYFMPD 240 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + D + + + I P+ V + + T Sbjct: 241 SAPEKGDRFTKFDNGTPAAHKSGYKLACNPQSIIGPTSNVSDVTSQLGKL----TKGGTG 296 Query: 294 STP-AMKQAYQILTSDKKRSF------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + L+ + K + S +K + TDG N + Sbjct: 297 RFDLGFAWLWYALSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAYVY 356 Query: 347 -----------------NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEY 388 N + IC I++ + +N + + + C S + Sbjct: 357 EYGKTNLAGWNTGSKDHFENVVAICKSMAAQKIEVHVMHVNGNDKAEPYFRECASATGGG 416 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKY 414 +Y V + +L+ I+ + + Sbjct: 417 YYKVASKQTLVDALTGITNGGGNLRL 442 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 162 bits (410), Expect = 8e-38, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 150/426 (35%), Gaps = 57/426 (13%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ +A NF ++ A+ + L + G + + + K M++ + Sbjct: 8 KSKRRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTAD---------- 57 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + L+ + + + + AK FI ++++L+ + V ++ Sbjct: 58 SAALAAATEARLREGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKNSPTRVTTTE-- 115 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + Y ++ + +L +L + + + + + + Sbjct: 116 -----------NARGKTYAVETTIKHQIQLNP---MLGFIGAKTLDLSVTGTAKSTINKG 161 Query: 189 PIFLIELVVDLSGSMHCAMNS--------------DPEDVNSAPICQDKKRTKMAALKNA 234 + L +D SGSM ++ + + K A+LK A Sbjct: 162 APISMYLALDRSGSMSFKTDTVDTTKTSCQNYTSDNWSKYPNLAKTSPCYVNKAASLKTA 221 Query: 235 LLLFLDSIDLLSHVKED------VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + +++ V G YT WGT V YV + + Sbjct: 222 VGFLVATLNKADPTYTVNGGSELVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFP 281 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN------ 342 TD+ ++ AY L R + S F+++I+ +TDGE Sbjct: 282 SGGTDARSSLNAAYNALKKANPD-----EARYHKEKGSESFERYIVLMTDGEMTGNSAAW 336 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 N + + C+ AK++ IKI +++ A G+ LL+ C SS + +Y N + ++ F Sbjct: 337 NSSIDQSVRTTCETAKKDGIKIFSVAFMAPDKGKSLLQYCASSADNYYAPENMEQIVTAF 396 Query: 403 QNISQL 408 I++ Sbjct: 397 GEIARK 402 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 162 bits (410), Expect = 8e-38, Method: Composition-based stats. Identities = 45/401 (11%), Positives = 114/401 (28%), Gaps = 25/401 (6%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 +F + I+ + + +I + ++ ++ I + N A +AA LA Sbjct: 2 HFNQSISKQHGSVAISYLAMLIPMIIAAASTIVIGYQVQLSNRGMQATDAASLACEFSGE 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + + I ++ Y+ + + Sbjct: 62 YDGTMAQGYLDYYRPKIDK---------VSGQIGTHSGCNVSLSYSLSTIFT-SLTLSDA 111 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + ++ + L + + + K + E Sbjct: 112 SFVVSSTANEKAYVTEDVASEPLELILVLDISGSMASDLDDLKAILKRGLASLKEQQNNA 171 Query: 190 IF-LIELVVDLSGSMHCAMNSDPEDVNSAPIC---QDKKRTKMAALKNALLLFLDSIDLL 245 + V + S ++N+ P + C + K +A LD Sbjct: 172 LSKDHIKVSIVPFSDGVSVNNAPWLNETGTFCVEGITESGGKFSAAHTVAN--LDITHDQ 229 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + VK + + + + + +V V T S + + L Sbjct: 230 TPVKTFQPDKWLMDCSAMSVTLPLTADLNQVTNAVDSL---RTEGGTASYQGLIWGLRQL 286 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKI 364 T + ++++ R K+ ++ +TDG + + +CD+AK+ + + Sbjct: 287 TPNWQKAWEVGPNRNFDKVERK-----LVLMTDGADYGSHFDELINAGLCDRAKDYGVAL 341 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + C ++ N L F + Sbjct: 342 NFVGFGVYGARLEQFTRCAGDANGVFSASNTQELDSYFSQL 382 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 161 bits (408), Expect = 1e-37, Method: Composition-based stats. Identities = 72/488 (14%), Positives = 152/488 (31%), Gaps = 85/488 (17%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 RF+ K IA + NF+I+ A+ +M LL +G + +N ++ ++AILA A Sbjct: 2 RFKDLLSKFIADDNGNFAIMSAILLMPLLLAVGAAVDYSSARDHRNDIQVTADSAILAAA 61 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV--N 123 S S+ S + + + + LSG + EI +V Sbjct: 62 SSYSSSSGVDSLAAGIDSYLDSKLTDQGSNDVDTAAVPKRLSGPTLSADGKEICIVVGEG 121 Query: 124 SSRISMTHMANNRLDSSNNT-IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M +D S + ++ L ++ + + + + + L Sbjct: 122 VPTSFMQLAGVKTVDVSAKSCAALPGNIDLEVSLVLDVSSSMIEEGRFVPMQTAVKSFLT 181 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL----- 237 + + + S + +D A D ++ K+ Sbjct: 182 SFANDATVAKRSKIAIAPFSSRFNIGLTHKDWLKAYGGNDAVPSRWTDPKSYYKDSKYSF 241 Query: 238 --FLDSIDLLSH-VKEDVYMGLI------------------------------------- 257 ++D++ L++ ++G + Sbjct: 242 SQWIDNVTTLAYTSSNYYWIGCVEPRADVEMKDNGAIGTYGLSDAPPSTEAFVAQDYNTG 301 Query: 258 -GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + + + ++ + D T M + L+ + ++ Sbjct: 302 SSTSFCPPPIVPLTSSFSTLQSAIA---DMTSEGSTRLDAGMLAGWYTLSPKWRSAW--G 356 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENN----------------------------NFKSNV 348 S +K I+F+TDGE N N + Sbjct: 357 GGTAPADY-SEKVKKVIVFMTDGEMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATN 415 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + CD K N I+I IS ++ + Q L+TC S +Y+++ + + V+ IS+ Sbjct: 416 ALLTTCDSIKSNNIEIYAISYSSEADVQN-LQTCSSGTKYYFSASTTN-IKDVYTAISKN 473 Query: 409 MVHRKYSV 416 ++ + Sbjct: 474 IIGSTVRL 481 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 161 bits (408), Expect = 2e-37, Method: Composition-based stats. Identities = 47/405 (11%), Positives = 120/405 (29%), Gaps = 49/405 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + S ++ + + L + + I V+ + ++ + + A A + Sbjct: 26 KARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVT 85 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + +D I S + E + +S+ Sbjct: 86 GDKAEAIEAAERLLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVSLRA 145 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + + I + + Sbjct: 146 FGADDVPM--------------------LFPTMGVPIEFRPIKQAV--------ATQVEL 177 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM----AALKNALLLFLDSIDLLSH 247 I +V+D SGSM + + ++ + + K A + + +D++ Sbjct: 178 DIAIVLDRSGSMAFSHDEVAKNGSPSSAPPGWKMGHAVPENARWLDTVAAVNGFLDIMED 237 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 D + L Y+ + + +++ + ++R + T+ + + L Sbjct: 238 SSHDERVSLSTYSDKSKADVKLTGDYTEIRAAMNAHSTKFKGGATNIGSGILEGGATLGD 297 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K+ + +I ++DG +N I + I I T+ Sbjct: 298 K--------------KLARSWASRVLIVMSDGIHNTGI---EPIPAAQQVANEKIMIFTV 340 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + N Q + K VS H++ ++ L F+ I++ + Sbjct: 341 TFSDEANVQEMEKVAVSGGGQHFHAKDSQQLTEAFRKIAKSLPTL 385 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 36/361 (9%), Positives = 91/361 (25%), Gaps = 43/361 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E + + +++ G I ++ + + +++ ++ A+LA A +L Sbjct: 24 FCRDEAGVLTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAA-----DLD 78 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + ++ +A + ++ + A T M Sbjct: 79 QTTNAKAVVEDYFAKAGMSQYLDGVQVSKGLNFKEVEAQVSAT-------IPTWFMNMSG 131 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP---- 189 LD+ + L + + + +R M Sbjct: 132 IETLDAFARSKAEERIQNIEVSLVLDISGSMGWDGKLANMRTAADQFVRTMMAGNDNVAA 191 Query: 190 -------IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +I ++ + C T ++ LD + Sbjct: 192 DGTGLTSVSIIPYHAVVNVPDELLDEYAVSTQQTVSNCVRFTATDFQSISIDRTKTLDRL 251 Query: 243 DLLSHVKEDVYMGL--------IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 +++ + S V + + T + Sbjct: 252 AHFDRNNSNLHTFNGDRLIGRPWCQVGTYGAILPWS---TSVTDLTNKVAELGASGNTAT 308 Query: 295 TPAMKQAYQILTSDKKRSFF---------TNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 MK A +L + + + V K ++ +TDGEN + Sbjct: 309 DIGMKWAAALLDPGTQNIVDDMIDGGHLEADLAGRPVLYSDPETIKVVVLMTDGENTSQY 368 Query: 346 S 346 Sbjct: 369 D 369 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 75/212 (35%), Gaps = 8/212 (3%) Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 SDPE + + D + T LKN + + + + R + Sbjct: 347 YSDPETIKVVVLMTDGENTSQYDLKNEFKGTMSPVWWDEASDSYF----VYFQNRANNDK 402 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDS---TPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 +P W + + T DS A + ++ L F + FF + Sbjct: 403 KPWWNVGRTPESDTNGNDSDKGSWEWELGEPSARQLSHAELFGTFSVRFISEFFFRVKND 462 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 ++ N ++ T +ICD+ K + I TI A GQ L++ C S Sbjct: 463 NQKTIREKYRNAVTRYTGNSTADGYTEQICDQLKAQDVVIFTIGFEAPQRGQDLMRYCAS 522 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 S ++++V + F +I+ + + S+ Sbjct: 523 SSGHYFDVEGV-EISEAFSSIANTIQQLRLSL 553 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 53/437 (12%), Positives = 115/437 (26%), Gaps = 74/437 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I+ NF I+ AL ++ + G + + + +A A + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLS--GYSAVFYNTEIQN--------IVN 123 + N DA+ + + L+ T+ N Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANKLNSQVSFTAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S Y + L + Sbjct: 125 VPTTFMQIFGRDSITISGTATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +E V + + H D N ++ ++ A D+ Sbjct: 168 VGATPSDVSKLEAKVGCAFACHQM---DKSTNNYTIAKSLGVAMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + MG+ + T+ E + KV+ Y +D + + + Sbjct: 225 KTERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKSYTDA-VDLMTIPYQNYNSD 283 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 + + G + +K + F++DG +++K + T K Sbjct: 284 QITNFDSAMTQMNTIID----PAGDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGR 339 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ +KI T + N ++ C S P + Sbjct: 340 CQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGF 398 Query: 389 HYNVVNADSLIHVFQNI 405 ++ V D + + + Sbjct: 399 YFEVSPTDGITDAMKAL 415 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 49/419 (11%), Positives = 108/419 (25%), Gaps = 48/419 (11%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 +++ A+ + L L G I + N ++ A +AA LAG+ ++ + + Sbjct: 1 MAMLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDPQLDKGLVKG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 N A + +K G + + V+ I M + Sbjct: 61 AVNQYMAA------NYPAAVLKGVTPGTEERSVTVKAEATVD--TIFMGALGIGSSTV-- 110 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 + + + Q + L + + V + Sbjct: 111 RAQATAGYNNLEVVFVIDNTGSMKGTAIQQ--ANAAATQLAELIMPDGMETSVKVGLVPF 168 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + + + + D L+ + + Sbjct: 169 RGKVHIPAGVDGLADGCRNADGTLAPSWILEEY-----KQTKYRYPTGSSLNVPKGT-CD 222 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK----------- 310 + + + + + + T + +K +LT + Sbjct: 223 SIPRVQALTSNRTTIVSAIAKQDALGDASGTVISEGIKWGRHVLTPEAPFTQGSSNKDMR 282 Query: 311 ------RSFFTNFFRQGVKIPSLPFQKFIIF--------LTDGENNNFKSNVNTIKICDK 356 T + G + N K N + Sbjct: 283 KVMIVLTDGDTEDGKCGGNYALNYTPNAYWTNAYYGMFDMNTHCENGGKLNAAMLSEAQI 342 Query: 357 AKENFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLMV 410 AK+ I+I I S + L+K SS +++YN +A L +F+ I + + Sbjct: 343 AKDKGIEIFAIRYGDSDSTDISLMKAIASSKAGTDDHYYNAPSAYDLEEIFKKIGRQLG 401 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 38/375 (10%), Positives = 96/375 (25%), Gaps = 43/375 (11%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + E + + ++ L G + ++ + +++ + A+LA A Sbjct: 20 FVDELRAFRSDESGVLAKPMIMILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAA- 78 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN--- 123 +L + D + ++ ++ + D + + + A +N Sbjct: 79 ----DLDQPLDPEAVVIDYMSKSGLSDYTTVVVPEVSPTAKRVKASVDTEFTAGWMNSIF 134 Query: 124 ----SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + L ++ L + + + Sbjct: 135 YEDYMRNPDTYELEPITLPLLASSTAVESIGNVEISLVLDVSGSMRSNNRLVNLKRAAKE 194 Query: 180 LLRIE-----MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL--- 231 ++ G+ I ++ +S D +S C + + Sbjct: 195 FVQTMDDNTEDGKMSISIVPYSTQVSMPAAFLDEMRVSDEHSYSNCINFDGSDFNTTGLN 254 Query: 232 ------KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDM 284 + + D + E+ S +++ Y+ Sbjct: 255 LSREYERTMHFSVWNYYDYRDDDEHVRQPTCASDADNPERTALLMSDNVAQLQSYIDAFE 314 Query: 285 DSLILKPTDSTPAMKQAYQILTS-------------DKKRSFFTNFFRQGVKIPSLPFQK 331 S T MK +L + +S + + V K Sbjct: 315 HSE---NTSIDLGMKWGTALLDPSVQPVIATLANDANPNQSIEARYANRPVSYQDTETLK 371 Query: 332 FIIFLTDGENNNFKS 346 I+ +TDG+N Sbjct: 372 VIVMMTDGQNTAQYY 386 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 46/319 (14%), Positives = 102/319 (31%), Gaps = 16/319 (5%) Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + ++ + M++N + + TS D +++ L Sbjct: 272 DYRDDDEHVRQPTCASDADNPERTALLMSDNVAQLQSYIDAFEHSENTSIDLGMKWGTAL 331 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH-CAMNSDPEDVNSAPICQDK 223 L+ I + + E + + ++ M +D ++ I D Sbjct: 332 LDPSVQPVIATLANDANPNQSIEARYANRPVSYQDTETLKVIVMMTDGQNTAQYYIKNDY 391 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + N+ + D + + + + N R+Y R Sbjct: 392 REGLTPVWYNSEENVYSTYDPNRYGSDKYYWHQTGQWEDHPYGNGTYQETYCDGREYYGR 451 Query: 283 DMDSLILKPTDS----TPAMKQAYQILTS--DKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 D T + A++ +Y L + + + F + + ++ Sbjct: 452 CYDG--SWKTRTVDEPGEALQLSYADLFAETSLRYLYRDLFGDWMSNASWYWYNRLYSYV 509 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 D + T+ +CD AKE I + TI A GQ++L+ C SS ++Y+V Sbjct: 510 GD-----STKDSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASHYYDVDGL- 563 Query: 397 SLIHVFQNISQLMVHRKYS 415 + F +I+ + + + Sbjct: 564 EISDAFASIASAIRQLRLT 582 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 161 bits (406), Expect = 2e-37, Method: Composition-based stats. Identities = 52/437 (11%), Positives = 120/437 (27%), Gaps = 74/437 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES----------ANNAAILA 63 I+ NF I+ AL ++ + G + + + + A + +A Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A M N + + ++ + + A+ + I + + + Sbjct: 68 AAMAMNGNGTISLGKTDARDIFMSQVSGELAEVHVDLGIDVTKTANKLNSQVSF---TAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S Y + L + Sbjct: 125 VPTTFMRIFGRDSITISGTATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +E V + + H D N ++ ++ A D+ Sbjct: 168 VGATPSDVSKLEAKVGCAFACHQM---DKSTNNYTIAKSLGVAMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + MG+ + T+ E + KV+ Y +D + + + Sbjct: 225 KTERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDA-VDLMTIPYQNYNSD 283 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 + + G + +K + F++DG +++K + T K Sbjct: 284 QITNFDSAMTQMNTIID----LAGDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGR 339 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ +KI T + N ++ C S P + Sbjct: 340 CQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGF 398 Query: 389 HYNVVNADSLIHVFQNI 405 ++ V D + + + Sbjct: 399 YFEVSPTDGITDAMKAL 415 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 160 bits (405), Expect = 3e-37, Method: Composition-based stats. Identities = 34/371 (9%), Positives = 97/371 (26%), Gaps = 43/371 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ E + + V+ ++ G + ++ + ++ + A+LA A Sbjct: 24 MRQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAA----- 78 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN------- 123 +L + D + ++ ++ + D + + + A + +N Sbjct: 79 DLDQPLDPEAVVLDYMSKSGLGDYTTVVVPEVSPTAKRVKASVDTNFTASWMNNVFYDDY 138 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + L ++ L + + + ++ Sbjct: 139 IRNPDTYQLEPITLPLLASSTAVESIGNVEISLVLDVSGSMRSNDRLVNLKRAAKEFVQT 198 Query: 184 E-----MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 G+ I ++ +S + + C + + + Sbjct: 199 MDDNTEDGKMSISIVPYSTQVSMPEAFLDELNVSSEHDYSHCINFSGSDFNNAGISTTQA 258 Query: 239 LDSIDLLSHVKEDVY----------MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + Y + S +++ Y+ + Sbjct: 259 YERTMHFTVWNSGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYIDAFV---P 315 Query: 289 LKPTDSTPAMKQAYQILTS-------------DKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T MK +L + +S + F + V K I+ Sbjct: 316 SENTSIDLGMKWGSALLDPSVQPVIASLADDANPNQSIASRFANRPVPYTDTETLKVIVM 375 Query: 336 LTDGENNNFKS 346 +TDG+N + Sbjct: 376 MTDGQNTSQYY 386 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 91/317 (28%), Gaps = 9/317 (2%) Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + + ++ + +++N N + TS D +++ Sbjct: 270 SGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYIDAFVPSENTSIDLGMKW-G 328 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFL---IELVVDLSGSMHCAMNSDPEDVNSAPI 219 L Q +++ + F + D M +D ++ + + Sbjct: 329 SALLDPSVQPVIASLADDANPNQSIASRFANRPVPY-TDTETLKVIVMMTDGQNTSQYYL 387 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + NA + D K + L + N Y Sbjct: 388 RNSYREGDSPVWYNAQERVYSTYDPNRGSKPYYWDNLQRWADHPYGNGTYEETYCTGTLY 447 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD- 338 T + A Q+ D + R + + + F Sbjct: 448 YGNCYYG--SWQTRTVDEPGTAVQLSYPDLFADTSLRYLRDRLFGDWMSNANYYWFSGLF 505 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + T+ IC+ AK + + TI A GQ +L+ C SS ++Y+V + Sbjct: 506 SSVGSTTKDARTLDICEAAKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDVDGL-EI 564 Query: 399 IHVFQNISQLMVHRKYS 415 F +I+ + + + Sbjct: 565 SDAFASIASAIRQLRLT 581 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 160 bits (404), Expect = 4e-37, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 120/406 (29%), Gaps = 51/406 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + S ++ + + L + + I V+ + ++ + + A A + Sbjct: 26 KARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVT 85 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + +D I S + E + + + Sbjct: 86 GDKAEAIEAAERLLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVGLRA 145 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + + I + + Sbjct: 146 FGADDVPM--------------------LFPTMGVPIEFRPIKQAV--------ATQVEL 177 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----LLLFLDSIDLLS 246 I +V+D SGSM + + ++ + + K A KNA + +D++ Sbjct: 178 DIAIVLDRSGSMAFSHDEVAKNGSPSSAPPGWKMGH-AVPKNARWLDTVAAVNGFLDIME 236 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 D + L Y+ + + +++ + ++R + + T+ + + L Sbjct: 237 DSSHDERVSLSTYSDKSKADVKLTGDYTEIRAAMNAHSTNFKGGATNIGSGILEGGATLG 296 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +I ++DG +N I + I I T Sbjct: 297 DKNL--------------ARSWASRVLIVMSDGIHNTGI---EPIPAAQQVANEKIMIFT 339 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + N Q + K VS H++ ++ L F+ I++ + Sbjct: 340 VTFSNEANVQEMEKVAVSGGGQHFHAKDSQQLAEAFRKIAKSLPTL 385 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 159 bits (402), Expect = 6e-37, Method: Composition-based stats. Identities = 70/420 (16%), Positives = 152/420 (36%), Gaps = 52/420 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS--RLGDRFESI 82 + A+ + L I + I + Y +N M+SA +AA+L+G + +VS+ + + + Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 83 SNHAKRALIDDAKR-----------FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM-- 129 S K+ + K+ K I + + + Y E + ++ Sbjct: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Query: 130 -THMANNRLDSSNNTIFYNMDVM----TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + S + S L + + + + + Sbjct: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK-- 178 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 L + S + K+ L + ++SI Sbjct: 179 ------------YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 Query: 245 LSHVKEDVYMGLIGYTTRVEKN----IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 K+++ + + + S +V+ + + + T++ PAM Sbjct: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHH 283 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---FKSNVNTIKICDKA 357 AY+ L + I S +KF+IF+TDGEN+ +++ +NT++IC+ Sbjct: 284 AYREL--------YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +KI +++++A P GQ LL+ C S + V ++ L+ F I+ + + + Sbjct: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 159 bits (402), Expect = 7e-37, Method: Composition-based stats. Identities = 55/450 (12%), Positives = 129/450 (28%), Gaps = 76/450 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ N ++IFA++++ L IG I + + ++SA +AA+L S N Sbjct: 15 RRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVSNSEIN 74 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + ++ IK + E+ SA T + + Sbjct: 75 RTTAQAKADAEQFFNATIGAYGLTATIKIEVTENDGKRSATADFTST-----VTTNFLNL 129 Query: 132 MANNRLDSSNNTIFYNMDVMT-SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + L N + + + L + + I Sbjct: 130 IGYPTLAIGNRSTSTVSRPIYQDFYLLLDNSPSMG-----------------VAATTADI 172 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + + C SD + + + + ++ ++ A+ + L++ V Sbjct: 173 ATMVGNTSDKCAFACHDLSDSNNYYNL-AKKLGVKMRIDVVRQAVQQLTSTATLMTAVNN 231 Query: 251 DVYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 M + + S V+ D + + + + Sbjct: 232 QFRMAVYTLGGSCASLGLTTIASLSSAMSSVQTAAGAI-DLMSIPKQNYNNDQCTDFNSA 290 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-------------- 351 + + ++ QK++ F++DG + + T Sbjct: 291 LAAMNTTIPSSGTGTAA-----QPQKWLFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTV 345 Query: 352 KICDKAKENFIKIVTIS------------------INASPNGQRL-------LKTCVSSP 386 C K+ I+I + NA P G + +K+C S P Sbjct: 346 TQCKAMKDRGIQIAVLYTTYLALPTNQWYNDHIAPFNAGPYGPSVNSQIAAKMKSCAS-P 404 Query: 387 EYHYNVVNADSLIHVFQNISQL-MVHRKYS 415 ++++ V + + + + + S Sbjct: 405 DFYFEVSPTQGISEAMDALFKKAVAKARLS 434 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 159 bits (402), Expect = 8e-37, Method: Composition-based stats. Identities = 52/448 (11%), Positives = 122/448 (27%), Gaps = 75/448 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I+ NF I+ AL V+ G + + + +A A + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY--SAVFYNTEIQN--------IVN 123 + N DDA+ + + L+ T+ N Sbjct: 65 AVAAAMTMSGNGTISLGKDDARNIFMSQMSGELTDVHIDLGINVTKTANKLNSQVSFSAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + + S Y + L + Sbjct: 125 VPTTFMRILGRDSITISGAATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + ++ + + H N ++ ++ A D+ Sbjct: 168 VGATANDVSKLQAKTGCAFACHQMDQ---STNNYTIAKGLGVAMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + MG+ + T+ E P+ KV+ Y +D + + + Sbjct: 225 KTERVSSDQFRMGVYTFGTKAEDAKLTTISSPTSDLTKVKNYTD-TVDLMTIPYQNYNQD 283 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 ++ + G ++ +K + F++DG +++K + T K Sbjct: 284 QLTSFDSALTQMNTIID----PAGDGTSNISPEKILFFVSDGVGDSYKPSTCTKKTTGGR 339 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ +KI T + N ++ C S P + Sbjct: 340 CQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNDWYNKWISPFQSEIPTKMQACAS-PGF 398 Query: 389 HYNVVNADSLIHVFQNIS-QLMVHRKYS 415 ++ V + + + + +++ + + Sbjct: 399 YFEVTPTEGITDAMKALFLKVIRSPRIT 426 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 63/476 (13%), Positives = 141/476 (29%), Gaps = 89/476 (18%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + S+ + +MS L+ ++ V+ + N M +A +AA+ + V + Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQMANATDAALASAIISEVPESTA 60 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + ++++ A +++ + +E + F S+ A Sbjct: 61 VELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFV-----------PNSLNIAAQ 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---------------------LNQRYNQKI 173 + N + L + R + Sbjct: 110 ESVPIRTNAKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYV 169 Query: 174 VSFIPALLRIEMG--ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 VS +PA + G E D + SD D S R + A Sbjct: 170 VSIVPASGNVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMC 229 Query: 232 KN----------------------------------------ALLLFLDSIDLLSHVKED 251 ++ A+L F D L V Sbjct: 230 RDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPS 289 Query: 252 VYMGLIG---------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + R + + + T++ M A Sbjct: 290 TNPSNNYRPFHEDKAIFDDIECHVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAM 349 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDKAKE- 359 ++L+ + + + K+++ +DG + + + IC + K+ Sbjct: 350 RLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQP 409 Query: 360 -NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +K++T++ +RL+++C S PE +Y+V + S+ VF+ I++ ++ Sbjct: 410 GRGVKVMTVNFG-GAASERLMQSCASGPE-YYHVASLFSVEKVFEQIAEQVISSSL 463 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 49/514 (9%), Positives = 123/514 (23%), Gaps = 120/514 (23%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 LL R + + N +++F+L ++ +L G + + + Sbjct: 5 LLDRLA----AFPRNRRGNVAMLFSLMLIPITVLSGGAVDINQALNAR------------ 48 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH--------IKESLSGYSAVFY 114 +++ + L ++ ++ A FI + + + Sbjct: 49 ---ARLSAALDAAALAVGVHTSVSETEAAGIASEFIAANYPDRELGLVGNIVVQLDPDQD 105 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR------ 168 + I + + + + L + + Sbjct: 106 RVTVGAESRVETIVLGLIGIEYITVHWESEVQRARSSLELVMVLDNTGSMGGSKISSLRS 165 Query: 169 --------------YNQKIVSFIPALLRIEMGE--------------------------- 187 N+ + +P + +G Sbjct: 166 AGLLLTDILFDGADPNRLKIGLVPFSATVNVGTWHERAWWLDANAQSPLHAENFDPAANR 225 Query: 188 -------RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + V + + + NA Sbjct: 226 WDLYDSLQNRAWEGCVEARAIPHDIEDTAPDTGYPETLFLPYFAPDESNYANNAGYANSY 285 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIE------------PSWGTEKVRQYVTRDMDSLI 288 D + E M T N + T + D + Sbjct: 286 LNDGMGGSNERARMRNTPKYTNAWINSSSRGPEWGCTARPITPLTNQRNVIDDAIEDMIA 345 Query: 289 LKPTDSTPAMKQAYQILTSDKKRS-----------------FFTNFFRQGVKIPSLPFQK 331 T+ + ++L+ + + G P+ Sbjct: 346 SGTTNIPIGISWGVRVLSPGMPFTEGVSYDEEGTIKAMVVLTDGENYLDGRNNPNYSHYS 405 Query: 332 FIIFLTDGENNNFKS---------NVNTIKICDKAKENFIKIVTISINASPNG-QRLLKT 381 ++ DG S N T C+ AK I++ TI+ + + + +++ Sbjct: 406 GYGYMRDGRLGIQTSSDSTIRNALNDRTEAACEYAKSLGIRVYTITFQVNSSSTRDMMRD 465 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C + P +++ + D+L F+ I+ + + + S Sbjct: 466 CATHPTLYFDSPSDDALRSAFEMIAGDLTNLRLS 499 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 52/439 (11%), Positives = 114/439 (25%), Gaps = 52/439 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + ++ + +I+FA+ + + + K +E A+ A LA ++ Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDP 60 Query: 71 NLSRLGDRF-------ESISNHAKRALIDDAKRF--------------IKNHIKESLSGY 109 + G + DA + K Sbjct: 61 DQPDNGSYTPSTRNRQIVVDYVNAYISDIDAVTDIKVAKRRCELISGCVAGLYKGDARYL 120 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 T QN ++ M + + F +L+ Sbjct: 121 EHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDVMFAADFSGSMLDTWS 180 Query: 170 NQ---KIVSFIPALLRIEMGERPIFLIELVVDLS--GSMHCAMNSDPEDVNSAPICQDKK 224 K + I + I + + + D S G + ++ ++ C + Sbjct: 181 GSSNPKYIDLIEIIRNISVELQKFNDLPENRDKSTMGISAFSTFTNSFTSDTGIQCSLSQ 240 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT----TRVEKNIEPSWGTEKVRQYV 280 KN + + + V Y ++ + + V Sbjct: 241 G---VNSKNKPGNWFRPVKPANTVANIWNEKTEDYCKSGAYAGFHDVNLTSNFNSLNGQV 297 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI-----IF 335 T S A+ + Q+L + G+ + Sbjct: 298 GSF---YAGGGTASYQALIRGAQLLDRGRNSRRLLIVLSDGMDNDRNLANGLVSNGMCRE 354 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + G ++ + I K+ I + P + LK CV + Y +A Sbjct: 355 IQAGLESDRTPDGRPIAA---------KMAVIGFDYDPFANKALKDCV-GEKNVYKAEDA 404 Query: 396 DSLIH-VFQNISQLMVHRK 413 D + + + I++ + H K Sbjct: 405 DEVEDIILELINEEVGHLK 423 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 158 bits (399), Expect = 2e-36, Method: Composition-based stats. Identities = 62/476 (13%), Positives = 141/476 (29%), Gaps = 89/476 (18%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + S+ + +MS L+ ++ V+ + N + +A +AA+ + V + Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQIANATDAALASAIISEVPESTA 60 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + ++++ A +++ + +E + F S+ A Sbjct: 61 VELLHANLTSGAASPYVEEVRLTHLRDEQEESLQVALDFV-----------PNSLNIAAQ 109 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---------------------LNQRYNQKI 173 + N + L + R + Sbjct: 110 ESVPIRTNAKAGISSNKAEIVFMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYV 169 Query: 174 VSFIPALLRIEMG--ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 VS +PA + G E D + SD D S R + A Sbjct: 170 VSIVPASGNVNTGPMEEIYLGSFRRYDHAQVKRENRWSDMFDKASGRTPAVPGRQRNAMC 229 Query: 232 KN----------------------------------------ALLLFLDSIDLLSHVKED 251 ++ A+L F D L V Sbjct: 230 RDLDFEGNNPATLGLRYFRNLEKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPS 289 Query: 252 VYMGLIG---------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + R + + + T++ M A Sbjct: 290 TNPSNNYRPFHEDKAIFDDIECHVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAM 349 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--NVNTIKICDKAKE- 359 ++L+ + + + K+++ +DG + + + IC + K+ Sbjct: 350 RLLSPYWQGIWDKTRPELPRRYSDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQP 409 Query: 360 -NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +K++T++ +RL+++C S PE +Y+V + S+ VF+ I++ ++ Sbjct: 410 GRGVKVMTVNFG-GAASERLMQSCASGPE-YYHVASLFSVEKVFEQIAEQVISSSL 463 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 51/433 (11%), Positives = 118/433 (27%), Gaps = 39/433 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + + +++FA+ + + + K +E A+ AA+LA ++K + Sbjct: 5 MRKQSGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAKDEQDHQL 64 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + L + K+ + + E ++ + + N Sbjct: 65 AERYIQHYLYDMDSILDIEVKKLGCDEMPECIAATERGEARYFEYRVAGQTLHKSWFPGN 124 Query: 135 -------NRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRY-------NQKIVSFIPA 179 + + + ++ + + F E + + N Sbjct: 125 DVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGGRHSKLNDLKDIIEDV 184 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + V L+G +N D D + R K+ + F Sbjct: 185 ADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVSREG-EYDKDDTVNFN 243 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +I VK + G ++ + + V + T S + Sbjct: 244 KTIAQQFIVKGEASRVPNGDDDARFYDLYFTTDFSSFTKKVKKFK---AGGGTASLQGII 300 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAK 358 +A QI+TS K + + + ++ G +N D Sbjct: 301 RAGQIVTSMSKNQKQLIIILSDGEDWNHYAGQTNKLVSKGMCSNILNMVNGGKVSADNTH 360 Query: 359 EN-------------------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ ++ I + N L+ CV + D L Sbjct: 361 DDVEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVGLRNCVGRDNVYKAENKEDILN 420 Query: 400 HVFQNISQLMVHR 412 + I++ + H Sbjct: 421 KILGLITEEVGHL 433 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 157 bits (397), Expect = 3e-36, Method: Composition-based stats. Identities = 43/459 (9%), Positives = 116/459 (25%), Gaps = 73/459 (15%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + + N ++ FA+ + + +G + + + ++SA +AA + Sbjct: 1 MRMLSSLLARFRSDIQGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASVG 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVF 113 S+ G ++ A DDA+ ++ + + Sbjct: 61 AVSRTSPAFIAAGA---MTADGIITAGNDDARNIFNGNMNGTTGYTLNSVTPEVKKTGSV 117 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQK 172 + + + M + + + + L + Sbjct: 118 LTATVSFSASVPMMFMNIVGIKTMTLQGMSKATASMPKYIDFYLLLDNSPSMG------- 170 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + + C +D + + T++ L+ Sbjct: 171 ---VAATPDDVTKMVNATSDAKYGSNRYCAFACHDYNDSNNFYNLAKSI-GVTTRIDVLR 226 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-----IEPSWGTEKVRQYVTRDMDSL 287 +A +D+ M + + + S + Sbjct: 227 SATQQLMDTATQTQTYPNQFRMAIYDFGAASKTIGLRALFALSANLSSAKSAAGNIDLMG 286 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS- 346 + D+ A + N + K++ F++DG + + Sbjct: 287 VYGNNDAY----TADKDTPFTAVFPAVNNEISTPGDGTTGSPLKYLFFVSDGVADESNAA 342 Query: 347 ------------NVNTIKICDKAKENFIKIVTIS-----------------------INA 371 + +C K IKI + Sbjct: 343 CLKPKASGNRCQSPINPALCTTLKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGP 402 Query: 372 SPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 SPN + + ++ C S P +++ V + + + Sbjct: 403 SPNSEIAQNMQACAS-PGFYFEVSPTQGIADAMNALFKK 440 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 157 bits (396), Expect = 4e-36, Method: Composition-based stats. Identities = 69/424 (16%), Positives = 136/424 (32%), Gaps = 65/424 (15%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN--------------NA 59 + NF ++ A++ L G I + + KN ++ A Sbjct: 14 MLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAAALAAASALVSDEQ 73 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALID------------DAKRFIKNHIKESLS 107 +A A ++ + + + + D ++ +I E+ + Sbjct: 74 PDIAAAKEIARKFLKTQAGGTTTPDAPADSGEGASSGAASSTPDWDDVNTLEVNITETPN 133 Query: 108 GYSAVFYNTEIQNIVNSSRISMT-HMANNRLDSSNNTIF---YNMDVMTSYDYRLQFIEH 163 G + + N + +MT + + ++ ++ S L Sbjct: 134 GTKGKIFQVTVINKRVTEFNAMTRLLGTDSIELEASSTAESATESKNALSMYLVLDRSGS 193 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + ++ + + + + + Sbjct: 194 MAWK---TNTINAAKKSCPNYTES----------------NWSRYPNLWASSPCYV---- 230 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 TK+ ALK A+ L + + + V I Y + + +WGT YV Sbjct: 231 --TKIDALKTAVTDLLAQLLVADPDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYVNAL 288 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T S A K AYQ + K+++F+TDGENN Sbjct: 289 V---ATGGTASAGAFKTAYQ-------KVIAATENTAHAAKNGQVPSKYMVFMTDGENNY 338 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + T + CD AK N ++I +++ A GQ LLK C SS +++ L+ F+ Sbjct: 339 ANDDTVTKQWCDTAKANKVEIYSVAFMAPERGQALLKYCASSSSHYFEAEEVTDLVAAFK 398 Query: 404 NISQ 407 I + Sbjct: 399 AIGE 402 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 156 bits (395), Expect = 4e-36, Method: Composition-based stats. Identities = 42/404 (10%), Positives = 106/404 (26%), Gaps = 34/404 (8%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + ++ ++ + +I + ++ ++ I + N A + A LA + Sbjct: 2 HSSRLLSKQRGSVAISYLAMLIPMIIAAASTIVIGYQVLLSNRAMQAVDTASLACEFRGE 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + S + + ++ S S T + + Sbjct: 62 YDRSIAQGYLDYYKPKIDKVTATLGASSGC-KVELGYSYSSIFTSLTFSDASYVAGVTAS 120 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER- 188 + + L ++ K + Sbjct: 121 QKVYVTEVT---------DSDPIELVLVLDISGSMMGALDELKSILNRGLTTLRSQQANV 171 Query: 189 ------PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + ++ +S + + S A + + + N + Sbjct: 172 AGQDHIKVSIVPFSNGVSVTDAPWLKSGGTLCVDATVNSGGSFSPANTVAN--------L 223 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D+ + + + V V R + T S + Sbjct: 224 DVTHDQAPVTTSSSSSDCSLTSVILPLTSNLNDVVDAVNRL---QTIGSTASYQGLLWGL 280 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENF 361 + LT + + ++ R G Q+ ++ +TDG ++N + +C +AK+ Sbjct: 281 RQLTPNWQSAW-----RVGPNRNQDNVQRKLVLMTDGMDDNSHLDELINAGLCTRAKDLG 335 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I++ I C S ++ N L F + Sbjct: 336 IELNFIGFGVQSWRLEQFTRCAGSAGAVFSANNTQDLDDYFSQL 379 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 156 bits (395), Expect = 4e-36, Method: Composition-based stats. Identities = 53/476 (11%), Positives = 119/476 (25%), Gaps = 104/476 (21%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 AL+++ LL +G + V ++ A + A L A + D + I ++ Sbjct: 1 MALAILPMLLAVGGTVDVGRQSSLATDLQEAIDIAALHIAKAPSDAIPGEEDVLQLIKSN 60 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS-NNTI 144 A + + + + + L + Sbjct: 61 ITTKDSRIALK-----------KLDVTEKDVSLHATAEITPFFLGLAGIKNLTAQRATKT 109 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYN-------------QKIVSFIPALLRIEMGERPIF 191 L + + + + + + + + P Sbjct: 110 AREARGEIEVALVLDTTWSMSEKDSSGKSRLDSLKGAAAKLVDTIFTEDGKTRVAVVPYA 169 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-LLLFLDSIDLLS---- 246 V + + + + T+ A ++D +S Sbjct: 170 DYVNVGTQHRNQSWLDVPPSYSTTPSERRCETRTTRTQCTSYAPTYQCTRTVDGVSESTT 229 Query: 247 -----HVKEDVYMGLIGYTT---------------------------------------- 261 E V + Y T Sbjct: 230 CGGGCTSSETVQVAPYEYCTGGGSSRTYDWYGCVASRTVGDYRLTDARPDIRYPGFLGTS 289 Query: 262 --RVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + S V+ ++ T + +L+ F Sbjct: 290 RECPGPLLSLSTREADVKTSISNLSYGGGGYRPSTFIPAGLIWGLNVLSP------PAPF 343 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENN------------------NFKSNVNTIKICDKAKE 359 Q + +K ++ +TDG N +S+ +TI IC+ K Sbjct: 344 EEQAYDPNNKLPRKALVLMTDGANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKR 403 Query: 360 NFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + I+I T+ + + LLK C + E++++ + + L F I+ + + Sbjct: 404 SGIEIFTVGFMVNSSSALDLLKECATDGEHYFDATSPEELHSAFGRIADGLTQIRL 459 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 156 bits (395), Expect = 5e-36, Method: Composition-based stats. Identities = 60/491 (12%), Positives = 131/491 (26%), Gaps = 95/491 (19%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R++ + S N ++IF L++ + + +G I K S + A ++A+LA A Sbjct: 8 MRYFVNAFLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAAI 67 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 SN L + + + D + A + + Sbjct: 68 VNNSNEGDL--KKLAADVFKENLGAADLDAKVTAFKY------DAKARTVKATAQGSYDP 119 Query: 127 ISMTHMANNRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + M L + + D L + + + + + Sbjct: 120 VIMQLFGFKNLPYAVTSDAIKAADGTLEVALVLDNTWSMSATVNGTPKIDILKTAAQGLV 179 Query: 186 GE---------RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-- 234 I ++ ++ M K ++ Sbjct: 180 STILTKDNKDYVKIAVVPYADYVNVGMANRNMPWVSVAADYSTTSTKTCKTVSTATQCTG 239 Query: 235 --------------------------------------------------LLLFLDSIDL 244 + + S + Sbjct: 240 GTKGTCTGNQDGVPYTYSCWIVAQTCKTVNVTPYQSCSGGGTTNYKWYGCVKNQVASSKV 299 Query: 245 LSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQA 301 + Y GL+ + T + + S V + + ++ T M Sbjct: 300 VMPDPTTPYGGLVQTSQTCLNPILPLSNDATVVTNTIKGLVVNIGGYKPETYIPGGMIWG 359 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF----------------K 345 LT + + + +K I+ +TDG N + Sbjct: 360 VNALTPPAPFT-----EGKPYDANNKEPRKTIVLMTDGANTLYANTSGGIAVANATQVAV 414 Query: 346 SNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + I++CD AK I+I TI + LK C + +++++ ++ LI F+ Sbjct: 415 TYSDQIRVCDYAKSKKIEIYTIGFDVTDSKALSTLKACATDAQHYFDAKSSADLIKAFET 474 Query: 405 ISQLMVHRKYS 415 I + + + Sbjct: 475 IGGKLSKVRLT 485 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 156 bits (393), Expect = 7e-36, Method: Composition-based stats. Identities = 60/446 (13%), Positives = 123/446 (27%), Gaps = 54/446 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K +++ + SI+FA+ + + K +E A AA LA A+ N Sbjct: 1 MKLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAHNDPN 60 Query: 72 LSR-------------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 ++ D ++ R I E SG + Sbjct: 61 VNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFFE 120 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNM--------DVMTSYDYRLQFIEHL--LNQR 168 + + + NN + +T + F + + Sbjct: 121 YEVEALTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSMEEPWTG 180 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLS------GSMHCAMNSDPEDVNSAPICQD 222 QK + + + I + + + S ++ NS + QD Sbjct: 181 GRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSCFMKQD 240 Query: 223 -KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYV 280 ++ K + +++ + K + G V +I + + + + Sbjct: 241 YFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDAVFHDIYLTNDFDTFNKEI 300 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T S + ++ Q+L G + TD E Sbjct: 301 RKF---RPGNGTASCQGIIRSAQMLRKGTNSRRLLIIISDG---NDWYYPYSGYKETDKE 354 Query: 341 NNNFKSNVNTIKICDKAKEN------------FIKIVTISINASPNGQRLLKTCVSSPEY 388 N N +C+K +E +I I + N + L C + Sbjct: 355 IANKLVN---AGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCA-GEDN 410 Query: 389 HYNVVNADS-LIHVFQNISQLMVHRK 413 + D L + I++ + H K Sbjct: 411 VFKAQYRDELLDQILSLITEEIGHLK 436 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 156 bits (393), Expect = 7e-36, Method: Composition-based stats. Identities = 82/463 (17%), Positives = 158/463 (34%), Gaps = 61/463 (13%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F FK+ +F +I AL + ++IG LI ++ W Y NS+ A N A L+ + Sbjct: 7 LFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSASV 66 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS---------GYSAVFYNTE 117 ++++++ S + + +K + + YN + Sbjct: 67 QLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIYNRK 126 Query: 118 IQNIVNS-----------SRISMTHMANNRLDSSNNTIFYNMDVM-------TSYDYRLQ 159 +I+N S M + + S + + Sbjct: 127 GTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQWLID 186 Query: 160 FIEHLL--------NQRYNQKIVSFIPALLRIEMGERPIFLIEL-------VVDLSGSMH 204 + + + + + + D S Sbjct: 187 DSGSMGSIIDRACFGSKQLKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCDRSLYYV 246 Query: 205 CAMN----SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 D + + K +++AL F+ + + ++K+ + M + + Sbjct: 247 LNDKKILEDDDLEEKNLDNHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLRMSFMYFN 306 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 R++ +WG ++ +Q V+ TD P +++AY L S + Sbjct: 307 ERIDHYFPMTWGIKEFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNEDD------- 359 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPN---- 374 + K S+ +KFI+ LTDG N + + +KICD AKE IKI TIS + + Sbjct: 360 EHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAKEEGIKIFTISYSVDSSERKK 419 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSV 416 L C S P+ + +AD L +F+ I + R + Sbjct: 420 ANDFLSRCAS-PDKFFEAYDADKLNMIFKEHIGDAIFERLVKI 461 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 156 bits (393), Expect = 8e-36, Method: Composition-based stats. Identities = 55/476 (11%), Positives = 127/476 (26%), Gaps = 86/476 (18%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 NF+++F + L GF + + + K+S++ +AA+ + A + + + Sbjct: 14 FARDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGVI 73 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D S+ + + + + V+ + Sbjct: 74 KEADADNSVKAFLVANSAAGILQ--PDQVVLDKLIVDKTAKTVQANVHVDVALYFPLFGI 131 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL--------------------------NQ 167 + + +T D L + + Sbjct: 132 GDMQRVAASTTALYSDKTVEVAMMLDITGSMAKRGNVDKIGDLRAAARNAVQTMLQNQDP 191 Query: 168 RYNQKIVSFIPALLRIEMG---------ERPIFLIELVVDL-------------SGSMHC 205 + + V+ +P + G + + V S S + Sbjct: 192 KRPRIRVAIVPYASGVNAGKLAENVYAETQGSSELPPVAGSSLLVAKTGKALLPSFSDYI 251 Query: 206 AMNSDPEDVN-SAPICQDKKRTKMAALKNALLLF-LDSIDLLSH---VKEDVYMGLIGYT 260 ++ + + K + D + ++D G Sbjct: 252 SIVGAAMPHPDNCTTERKNKNGDADLSADGPDTVRTDRNGKKYYALVNRDDHLDGGGMNR 311 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + I + ++ + + D T A++ Y +L+ + + Sbjct: 312 CPDAEVIPLTADSDAL---LDSIDDFRAAGYTAGAIAIQWTYYMLSPQWRAAIKNVGLGN 368 Query: 321 GVKIPS-LPFQKFIIFLTDGENN-------------NFKSNVNTIKICDKAKENFIKIVT 366 G + K I +TDG+ N + N +C K + I+I T Sbjct: 369 GASDANAKKIAKVAILMTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGIEIFT 428 Query: 367 ISINAS---------PNGQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 I + + + +LK C S +++ L FQ I + Sbjct: 429 IGFDLNDKDMSATERDQAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEIIRN 484 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 59/476 (12%), Positives = 131/476 (27%), Gaps = 86/476 (18%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 NF+I+F LS L +GF + V + ++S++ +AA+ + A + + Sbjct: 14 FARDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGAI 73 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D +S+ + + I + + V+ Sbjct: 74 KEADANKSVQAFLDANSQAGILQ--ADQIVLDRLIVNRTAKTVQADAHVDVGLYFPIFGT 131 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK---IVSFIPALLRIEMGERP- 189 + + +T D L + + K + + ++ + ++ Sbjct: 132 GDMKRVAASTTALYSDKTVEVAMMLDITGSMAKRGKVDKIGDLKTAAKNAVQTMLQKQDP 191 Query: 190 ------IFLIELVVDLSGS-MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + ++ ++ + + ++ + P KN L F D I Sbjct: 192 QNPRIRVAIVPYASGVNAGKLAENVYAEKQASTELPPVAGSPLLVAKTGKNLLPSFSDYI 251 Query: 243 DLLSHVKEDVYMGLIGYTTRVEK------------------------------------- 265 ++ + Sbjct: 252 SIVGAAMPRPDNCATERKDKNGNADMSADGPDTVRTDGNGKKFYALVNRDDHLGDGDMNR 311 Query: 266 -----NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 I + ++ + + D T A++ Y +L+ + + + Sbjct: 312 CPDAKVIPLTADSDAL---LESIEDFRANGFTAGAIAIQWTYYMLSPQWRTAIRNAGLGK 368 Query: 321 GVKIPSL-PFQKFIIFLTDGENN-------------NFKSNVNTIKICDKAKENFIKIVT 366 G K I +TDG+ N + N +CD K + I+I T Sbjct: 369 GASDADPKKIAKVAILMTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIEIFT 428 Query: 367 ISINAS---------PNGQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 I + + +LK C S + ++V L FQ I + Sbjct: 429 IGFDLDDKDMSTTERDQAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEIIRN 484 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 52/448 (11%), Positives = 126/448 (28%), Gaps = 73/448 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM----------ESANNAAILA 63 I+ NF I+ AL ++ L G + + + A + +A Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVGSIAEKSGAVA 67 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A M N + + ++ S + + A + I + + + + Sbjct: 68 AAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANKLNSQVSFTAVV-- 125 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + + S + + L + Sbjct: 126 -PTTFMRVLGKDSITISGTATAEYLTASFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + +E S + C + + + ++ ++ A + Sbjct: 168 VGATAKDVATMEKNTSDSCAFACHETENKNNYYNLAKT-LGVSMRIDVVRQATKELTLTA 226 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 MG+ + T+ E +P+ +KVR Y +D + + Sbjct: 227 KSTRVSTNQFRMGVYTFGTKAEDANLTTISDPTDDLDKVRTYTDA-VDLMTIPKQGYNND 285 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 + ++ + G + QK + F++DG ++ K T K+ Sbjct: 286 QQTSF----DNALTQMKDIITTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNR 341 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ I+I T + N ++ C S P Sbjct: 342 CQEPIDTSFCKPLKDKGIRIAVLYTTYLPLPKNSWYNTWISPFQSQIPTKMQECAS-PGL 400 Query: 389 HYNVVNADSLIHVFQNIS-QLMVHRKYS 415 ++ V + + + + + + + + Sbjct: 401 YFEVTPTEGIADAMKALFLKAIRAPRIT 428 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 60/446 (13%), Positives = 123/446 (27%), Gaps = 54/446 (12%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K +++ + SI+FA+ + + K +E A AA LA A+ N Sbjct: 1 MKLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAHNDPN 60 Query: 72 LSR-------------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 ++ D ++ R I E SG + Sbjct: 61 VNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRFFE 120 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNM--------DVMTSYDYRLQFIEHL--LNQR 168 + + + NN + +T + F + + Sbjct: 121 YEVEALTTQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSMEEPWTG 180 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLS------GSMHCAMNSDPEDVNSAPICQD 222 QK + + + I + + + S ++ NS + QD Sbjct: 181 GRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSCFMKQD 240 Query: 223 -KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKVRQYV 280 ++ K + +++ + K + G V +I + + + + Sbjct: 241 YFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDPDAVFHDIYLTNDFDTFNKEI 300 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T S + ++ Q+L G + TD E Sbjct: 301 MKF---RPGNGTASYQGIIRSAQMLRKGTNSRRLLIIISDG---NDWYYPYSGYKETDKE 354 Query: 341 NNNFKSNVNTIKICDKAKEN------------FIKIVTISINASPNGQRLLKTCVSSPEY 388 N N +C+K +E +I I + N + L C + Sbjct: 355 IANKLVN---AGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCA-GEDN 410 Query: 389 HYNVVNADS-LIHVFQNISQLMVHRK 413 + D L + I++ + H K Sbjct: 411 VFKAQYRDELLDQILSLITEEIGHLK 436 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 155 bits (391), Expect = 1e-35, Method: Composition-based stats. Identities = 51/433 (11%), Positives = 117/433 (27%), Gaps = 39/433 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + + +++FA+ + + + K +E A+ AA+LA ++K + Sbjct: 5 MRKQFGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAKDEQDHQL 64 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + L + K+ + I E ++ + + N Sbjct: 65 AERYIQHYLYDMDSILDIEVKKLGCDEIPECIAATERGEARYFEYRVAGQTLHKSWFPGN 124 Query: 135 -------NRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRY-------NQKIVSFIPA 179 + + + ++ + + F E + + N Sbjct: 125 DVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGGRHSKLNDLKDIIEDV 184 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + V L+G +N D D + R K+ + F Sbjct: 185 ADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVSREG-EYDKDDTVNFN 243 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +I VK + ++ + + V + T S + Sbjct: 244 KTIAQQFIVKGEASRVPNSDDDARFYDLYFTTDFSSFTKKVKKFK---AGGGTASLQGII 300 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAK 358 +A QI+TS K + + + ++ G +N D Sbjct: 301 RAGQIVTSMSKNQKQLIIILSDGEDWNHYAGQTNKLVSKGMCSNILNMVNGGKVSADNTH 360 Query: 359 EN-------------------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ ++ I + N L+ CV + D L Sbjct: 361 DDIEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVGLRNCVGRDNVYKAENKEDILN 420 Query: 400 HVFQNISQLMVHR 412 + I++ + H Sbjct: 421 KILGLITEEVGHL 433 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 66/516 (12%), Positives = 143/516 (27%), Gaps = 107/516 (20%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + K+ + + A + +FAL + L +IGF I K +++A ++A+LA Sbjct: 1 MRSIQKTLKQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLA 60 Query: 64 GASKMVSNLSRLGDRFESISN--HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A +++ ++ + + A L + N + Sbjct: 61 AARAYLTSSVQVKETKRQEDSQKIASDYLTANLLSSSNNFENNQIQLVFKEDGEIVGNAS 120 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYN----QKIVSF 176 I + + D L + +Q ++F Sbjct: 121 TKIKLIFGGLFGKSDVVLPALAAATVGDSRKLEIVLVLDTSGSMSSQNRMKQLRTASINF 180 Query: 177 IPALLRIEMGERPI--FLIELVVDLSGSMH----CAMNSDPEDVNSAPICQDKKRTKMAA 230 + ++ + ER + ++ ++ +M + P NS + T Sbjct: 181 VNSVFDNAVYERTVQVGVVPWNATVNINMDRPGTWDASPGPAIHNSNYGNGTNQVTSFQD 240 Query: 231 LKN--ALLLFLD-----SIDLLSHVKEDVYMGLIGYT------TRVEKNIEPSW------ 271 F D D+ ++G I T + + Sbjct: 241 FTENLYPPGFSDFGSYSDSDIDDDFGSSGWLGCITATKDERKISSSGNVTPLTDVPPSKM 300 Query: 272 ----------------------GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT--S 307 ++ Q + + T + + Y++ + + Sbjct: 301 KWPARKVAGWDPNSDCPSPMLAMSQSRPQIIKKLNQLNPSGNTHADIGLMWGYRMFSQQA 360 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--------------------- 346 + F N + S +K +I LTDGEN S Sbjct: 361 NWNNFFGYNSDTKPDSFHSTKSRKIMIMLTDGENTATNSEGYSYYGWCTYTNHYNKWGRY 420 Query: 347 -----------------------NVNTIKICDKAKENFIKIVTISIN----ASPNGQRLL 379 N + C+ + +++ TI+++ LL Sbjct: 421 TGSTKDCEVPKGINKDEISNNDLNSLMLDACEVIRSKDVELFTIALDLHSYYDSTAIALL 480 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + C S + YN+ + L FQ ++ + S Sbjct: 481 RECAGSDSHAYNIK-GNELDETFQELASK--ALRLS 513 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 77/413 (18%), Positives = 131/413 (31%), Gaps = 49/413 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + NF I+ A+ + L G I + K ++ S Sbjct: 10 LRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQE-----------ATDS 58 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 S + AK F+ + + + S I Sbjct: 59 AALAAATALASGAASTTADAEAIAKDFVSGQMANYV-------------DTDAISSIKAG 105 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + ++ + Y + V TSY + Y + + + Sbjct: 106 TSVDIDVSATATSKSYKVTVATSYGIAA--TPFMSVLGYKTLNIGASTSTSSGTSDTKTA 163 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +ELV+D SGSM TK+ ALK A D++D Sbjct: 164 LSMELVLDQSGSMGEKT---TTCATYNGKNCKTYVTKIDALKKAADALFDALDTADPDHS 220 Query: 251 DVYMGLIGYTT---------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 V G Y +++ +WGT YV+ TD+T M+QA Sbjct: 221 LVRTGAYSYNNGLIYNSQKTQIKSMSGMAWGTATTATYVSGIT---ASGGTDATEPMRQA 277 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN------FKSNVNTIKICD 355 L+ K + + ++II +TDGE + N CD Sbjct: 278 --TLSIAKASDGSDVETQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCD 335 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K IKI T++ A G++LL+ C S +Y + L+ F +I++ Sbjct: 336 ATKTAGIKIFTVAFMAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAKE 388 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 52/437 (11%), Positives = 114/437 (26%), Gaps = 74/437 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I++ NF I+ AL ++ + G + V + + +A A + Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLS--GYSAVFYNTEIQN--------IVN 123 + N DA+ + L+ T+ N Sbjct: 65 AVAAAMTMNGNGTVSLGKTDARNIFMSQTSGELTDIHIDLGIDVTKTANKLNSQVSFTAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S Y + L + Sbjct: 125 VPTTFMRIFGRDSIIISGTATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + ++ + + H N ++ ++ A D+ Sbjct: 168 VGATASDVSKLQAKTGCAFACHQMDQ---STNNYTIAKSLGVTMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + MG+ + T+ E + KV+ Y +D + + + Sbjct: 225 KAERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTNA-VDLMTIPYQNYNSD 283 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 ++ T G ++ +K + F+ DG +++K + T K Sbjct: 284 QLTSF----DSAMTQINTIIDPAGDGTSNISPEKILFFVADGVGDSYKPSTCTKKTTGGR 339 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ +KI T + N ++ C S P Sbjct: 340 CQEPIDTTFCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQNEIPTKMQACAS-PGL 398 Query: 389 HYNVVNADSLIHVFQNI 405 ++ V D + + + Sbjct: 399 YFEVTPTDGIADAMKAL 415 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats. Identities = 39/432 (9%), Positives = 114/432 (26%), Gaps = 47/432 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + I + + +++FA+ + + K +E A+ A+LA A+ N Sbjct: 1 MRKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAHNDDN 60 Query: 72 LSR-------------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV------ 112 D + + + K++ + I E +G + Sbjct: 61 QDSQGAGSGSRVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAECRAGLARGEPRFFQ 120 Query: 113 --FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-- 168 + +Q+ S+ + + + + Sbjct: 121 YEIEVSSVQDTWFPGNDSIEGFGDTFSAKGAAVARKYQSEAVDIIFVSDYSGSMAWNWSG 180 Query: 169 -YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ-----D 222 N+K + + + + + + + + ++ Sbjct: 181 GRNRKYIDLRNIIQEVTDELQKFNDLNNTDNNTVGLTAFNYYTKTVPSNRSNHCFMTQLV 240 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + +A + +F++ + + + +++ + V Sbjct: 241 NPNGRFSASQTVRNIFVEK---------NNRYCVNHGDSSRFQDLPLTDNYSSFNNSVRS 291 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF-LTDGEN 341 T S + + Q+L + G + + + Sbjct: 292 F---YPNHGTASFQGIIRGAQMLRKGRNPRRLLIVLSDGEDGDPSRHMQLVNAGMCSTIV 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N ++ + ++ + + N R L+ CV + + D L + Sbjct: 349 NTLSGDLTPDG-----HKVKARLAVVGFDYDVNKNRALQKCVGAENVYKAQNRDDILNKI 403 Query: 402 FQNISQLMVHRK 413 + I++ + H K Sbjct: 404 LELITEEIGHLK 415 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 153 bits (387), Expect = 4e-35, Method: Composition-based stats. Identities = 51/454 (11%), Positives = 123/454 (27%), Gaps = 76/454 (16%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + + N ++IFA++++ L IG + + + +ESA +AA+L S Sbjct: 11 RGSLGRFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVSN 70 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 N + + +++ I + E+ + + Sbjct: 71 SAINKTVADAQADAVQFFNATLDGYGLSATIDLSVSEN-----DGKRSAVSSFSSSVKTH 125 Query: 128 SMTHMANNRLDSSNNTIFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + L N + V + L + + Sbjct: 126 FLDMIGYPTLAIGNRSTSTVSLPVYVDFYLLLDNSPSMG-----------------VAAT 168 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + + C S + + + ++ ++ A+ + +S Sbjct: 169 TSDIATMVANTSDQCAFACHDLSTSNNYYNL-AKKLGVTMRIDVVRQAVQRLTTTATAMS 227 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 V MG+ + + I + + Q +D + + Sbjct: 228 AVTNQFRMGVYTFGSS-CTAIGLTTVANLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTD 286 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---------- 351 + + + + S QK++ F++DG + + T Sbjct: 287 FDGSLTAINSAIPSPGSGI-----STQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQE 341 Query: 352 ----KICDKAKENFIKI---VTISINASPNG----------------------QRLLKTC 382 C K I+I T + N +++C Sbjct: 342 PLTVAQCTAIKSRGIQIAVLYTTYLALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSC 401 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL-MVHRKYS 415 S P +++ V + + + + + S Sbjct: 402 AS-PGFYFEVSPTQGIAEAMDALFKKAVAKARLS 434 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 153 bits (385), Expect = 6e-35, Method: Composition-based stats. Identities = 40/405 (9%), Positives = 119/405 (29%), Gaps = 47/405 (11%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ + + L+L ++I V + +AA+ A + + Sbjct: 34 KQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQTGDKN 93 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + + + ++ +S ++ ++ Sbjct: 94 AALAAARDAAERNPVAGKVVPINMSDLEFGISLRE----------SLDEGYSFQPLSDDD 143 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ ++ + F + + R+ + + + L Sbjct: 144 EFGNAVRLTTLSLSNSPQPVFSPLF-PTMGTNLEIRP--------QRVAVSTQSTMDVAL 194 Query: 196 VVDLSGSMHCAMNSDPEDVNS--------APICQDKKRTKMAALKNALLLFLDSIDLLSH 247 V+D SGSM A + P+ + ++ L ++ F + Sbjct: 195 VIDRSGSMAYANDEAPDPYVNPAAAPPGWTYGDPVPPNSRWLDLVASVNAFNGFLADSP- 253 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + L Y+ ++ + + ++ + T ++ +LT Sbjct: 254 --QYEKLCLATYSDNASRDCDLTHTYAEISNQLDAISYQFNGGGTSVGYGLEHGLAVLTD 311 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ +TDG +N KS + + + + + TI Sbjct: 312 --------------ATHARKFAVRVMVLMTDGHHNTGKS---PESMTYHLQNHGVTLFTI 354 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + R+ + +++ +A L + FQ I++ + Sbjct: 355 TFSDDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKLPSL 399 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 152 bits (384), Expect = 8e-35, Method: Composition-based stats. Identities = 54/437 (12%), Positives = 116/437 (26%), Gaps = 72/437 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I+ NF I+ AL V+ G + + + +A A + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQN--------IVN 123 + N DDA+ + I L+ T+ N Sbjct: 65 AVAAAMTMSGNGTISLGKDDARSIFMSQISGELTDVQVDLGIDVTKTANKLNSQVSFSAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + + S + L + Sbjct: 125 VPTTFMRVLGRDSITISGTATAEYQTASFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +E + + C + + + + ++ ++ A + Sbjct: 168 VGATATDVSTMEKNTSDTCAFACHETQNNNNYYNL-AKKLGVSMRIDVVRQATKELTVTA 226 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 MG+ + T+ E +P+ +KVR Y +D + + Sbjct: 227 KSTRVSSNQFRMGVYTFGTKAEDAKLTTISDPTDDLDKVRSYTDA-VDLMTIPFQGYNND 285 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 + ++ T G + QK + F++DG ++ K T K+ Sbjct: 286 QQTSF----DSALTQMKTIITTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNR 341 Query: 354 ---------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEY 388 C K+ I+I T + N ++ C S P Sbjct: 342 CQEPIDTSFCQPLKDKSIRIAVLYTTYLPLPKNSWYNTWIKPFQGEIPTKMQACAS-PGL 400 Query: 389 HYNVVNADSLIHVFQNI 405 ++ V + + + + Sbjct: 401 YFEVTPTEGIADAMKAL 417 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 152 bits (383), Expect = 1e-34, Method: Composition-based stats. Identities = 40/459 (8%), Positives = 115/459 (25%), Gaps = 80/459 (17%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + N ++ FA+ + + +G + + + +++A +AA + Sbjct: 1 MRSLSLLLSRFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVG 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY----------SAVF 113 S+ G ++ A DDA++ ++ + + Sbjct: 61 AVSRTSPAFIAAGA---MTTDGVIAAGNDDARKIFNGNMSGTTGYTLDSLTPEVKKTGSV 117 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQK 172 + + M+ + + ++ + L + Sbjct: 118 LTATVSFSATVPTLFMSIVGYKTMSLQGSSTAKASMPKYIDFYLLLDNSPSMG------- 170 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + + C +D + + T++ L+ Sbjct: 171 ----------VAATPADVTKMVSATSDKCAFACHDYNDANNYYNLAKT-LGVTTRIDVLR 219 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-----IEPSWGTEKVRQYVTRDMDSL 287 +A +D+ M + + + S + Sbjct: 220 SATQQLMDTAQQTQTYSNQFRMAIYDFGASSKTIGLRALFALSSSLTSAKSAAGNIDLMG 279 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS- 346 + + + + G P K++ F++DG + + Sbjct: 280 VYGN---NDSFTADKDTPYTTALPAINNEIATPGDGTSGSPL-KYLFFVSDGVADESNAA 335 Query: 347 ------------NVNTIKICDKAKENFIKIVTIS-----------------------INA 371 + +C K IKI + Sbjct: 336 CLKPKASGNRCQSPINPALCTALKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGP 395 Query: 372 SPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 SPN + + ++ C S +++ V + + + Sbjct: 396 SPNSEIAQNMQACASD-GFYFEVSPTQGIADAMNALFKK 433 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 43/416 (10%), Positives = 111/416 (26%), Gaps = 63/416 (15%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 L + G + + + + +++A +A LAG+ ++ + + + Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSLELPYDPDLSKGIVQQAVSDMIH--- 57 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 ++ G + VN + ++A+ +++ F Sbjct: 58 ---TNMPDAVVESVSPGTEVRSVVVTAKAKVNLLVMGFLNLADQWVEAGAAAGFN----K 110 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALL----RIEMGERPIFLIELVVDLSGSMHCA- 206 + + + N + I + + + L+ + Sbjct: 111 LEIVFVIDNSGSMKGTPINLVKEASIGLTDLLIPDGQQPDTKVGLVAFRGKVRLGGDVDG 170 Query: 207 --MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 D + + + AL + +D + + + Sbjct: 171 LEAGCRNADGSVNTGIHEDFMSMYWALSSYYRNQID----------------LDTCSSIP 214 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK-------------- 310 ++ S + + + T + +K A +LT + Sbjct: 215 ESRPLSQDKGDIVEGINSQTALGSASGTVISEGIKWARHMLTPEAPYTQAGDKKDFRKIM 274 Query: 311 ---RSFFTNFFRQGVKIPSLPFQKFII--------FLTDGENNNFKSNVNTIKICDKAKE 359 T G + T + N + + AK+ Sbjct: 275 IVLTDGDTEDGECGGSYRASFRPNNYWTNAYYGMGVDTAHCQDGGVLNQDMLAEAQLAKD 334 Query: 360 NFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLMV 410 I+I I S N L+K SS +++++ + + VF+ I + + Sbjct: 335 EGIEIFAIRFGVSDNTDISLMKQIASSKAGTNDHYFDAPSVYDIPDVFKKIGKQLG 390 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 50/439 (11%), Positives = 115/439 (26%), Gaps = 52/439 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + ++ + +I+FA+ + + + K +E A+ A LA ++ Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDP 60 Query: 71 NLSRLGDRF-------ESISNHAKRALIDDAKRF--------------IKNHIKESLSGY 109 + G + DA + + Sbjct: 61 DQPDNGSYTPSTRNRQIVVDYVNAYISDVDAVTDIKVAKRRCELIPECVAGLYDGDMRYL 120 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 T QN ++ M + + F +L+ Sbjct: 121 EHEIDVTTRQNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDAMFAADFSGSMLDTWS 180 Query: 170 NQ---KIVSFIPALLRIEMGERPIFLIELVVDLS--GSMHCAMNSDPEDVNSAPICQDKK 224 K V I + I + + + S G + ++ ++ C + Sbjct: 181 GSSNPKYVDLIEIIRNISAELQKFNDLPENRNKSTMGISAFSTFTNSFTSDTGIQCSLSQ 240 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT----TRVEKNIEPSWGTEKVRQYV 280 +N + + + V Y ++ + + V Sbjct: 241 G---VNGRNGPATWFRPVKAANTVANIWNPKTEDYCKSGAYAGFHDVNLTSNFNYLNGQV 297 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI-----IF 335 T S A+ + Q+L G+ + + Sbjct: 298 GSF---YAGGGTASYQALIRGAQLLRKGNNSRRLLIVLSDGMDNDTQLADGLVSAGMCRD 354 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + +G ++ + I K+ I + +P + LK CV + Y +A Sbjct: 355 IQNGLESDRTPDRRPIAA---------KMAVIGFDYNPFANKALKDCV-GEKNVYKAEDA 404 Query: 396 DSLIH-VFQNISQLMVHRK 413 D + + + I++ + H K Sbjct: 405 DEVEDIILELINEEIGHLK 423 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 149 bits (376), Expect = 8e-34, Method: Composition-based stats. Identities = 45/405 (11%), Positives = 118/405 (29%), Gaps = 33/405 (8%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 +F + I + +I + ++ ++ I + N A +AA +A K Sbjct: 2 HFNQSIVKHHGSVAISYLAMLIPMVIAAASTIVIGYQVQLSNRAMQAADAASIACEFKGE 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + ++ I+ + ++ Y+ + + Sbjct: 62 YDQALTQSYLDYYQPKIDK---------VRGQIRTNSGCNMSLGYSLSTIFT-SLTLSDT 111 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + ++ + + L + + K + + E Sbjct: 112 SFVVSSTANEKAYVTEDVVSDPLELVIVLDISTSMYGAINDLKAILKRGIVSLKEQQNNA 171 Query: 190 -------IFLIELVVDLSGSMHCAMNSDPEDVNS-APICQDKKRTKMAALKNALLLFLDS 241 + +I +S D + + K A + LD Sbjct: 172 QSEDHIKVSIIPFSTGVSV----NNAPWLNDARTFCVDGTTESEDKFYAARTVAN--LDI 225 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 VK + + + ++V V T S + Sbjct: 226 THDQISVKLSQPNKWRESCSAASFTLPLTADLDQVTNTVDSL---RTEGGTASYQGLIWG 282 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-FKSNVNTIKICDKAKEN 360 + LT + ++++ R K+ ++ +TDG + + ++ +CD+AK+ Sbjct: 283 LRQLTPNWQKAWEVGPNRNVDKVERK-----LVLMTDGNDYGRYFDDLINAGLCDRAKDY 337 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + + + C P+ ++ + L H F + Sbjct: 338 GIALNFVGFGVNGSRLEQFTRCAVDPKGVFSASDTQDLDHYFSQL 382 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 149 bits (375), Expect = 8e-34, Method: Composition-based stats. Identities = 68/427 (15%), Positives = 132/427 (30%), Gaps = 55/427 (12%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + NF ++ A+ + + + G + + +++++S+ +AA LA AS M + + Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHI-------------KESLSGYSAVFYNTEIQ 119 S + S + + A+ + + ++ S + Sbjct: 101 SEADAIALAKSFLSSQLANTMARDENTSSVDQITQAEPDISVKTTQVNSSSTSYDVELTG 160 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + + + + + S L + ++ + Sbjct: 161 SYTITMNPLSRVLGWETVTLKAYGKAQAATTASESPLSMYLVLDRSGSMNDETATTYTGT 220 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + K TK+ +LK A+ Sbjct: 221 CTKTTTS------------------------GYGWNKKTTTTSYSCTKNYTKIESLKLAV 256 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + E V G Y + SWGT V YV TD+ Sbjct: 257 ADLAAQLKKADPNSEYVRTGADSYNASADTAQAMSWGTANVVTYVNALS---ATGGTDAR 313 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN------FKSNVN 349 A+ AY L + K + KI ++I+F+TDGE + Sbjct: 314 GALSAAYSALQTSNKTEITAHNVSSVSKIG-----RYIVFMTDGEMTGNSSSWSSSIDSA 368 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 C K + I+I T++ A NG+ LL C S ++Y +A SL+ F I + Sbjct: 369 VRSQCTSIKADGIQIYTVAFMAPANGKSLLSACASDASHYYEATDAASLVAAFGEIGKKA 428 Query: 410 VHRKYSV 416 + Sbjct: 429 TSTSTRL 435 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 149 bits (375), Expect = 9e-34, Method: Composition-based stats. Identities = 47/426 (11%), Positives = 115/426 (26%), Gaps = 31/426 (7%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + + + +I+FA+ + L K +E A+ A+LA A+ N Sbjct: 1 MKALRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAHNDDN 60 Query: 72 LSR-------------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV------ 112 + D E+ + ++ + I E ++G + Sbjct: 61 KNSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECVAGLARGEPRFFQ 120 Query: 113 --FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY- 169 T S+ T + + + Sbjct: 121 YEVEATSRHVSWFPGDSSIPGFGKTFDAKGAATARKYQSEAVDILFVADYSGSMAGGWNG 180 Query: 170 --NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N+K + + + + + + + M + T+ Sbjct: 181 GSNRKYIDLRNIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYTKTKPTNRSNSCF--MTQ 238 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + N + + +++ + + K + Y ++R ++I+ + V Sbjct: 239 LVYNNNYNINYTKTVNNIFNEKNNKYCVSHSDSSR-FRDIDLTDNYSSFNTTVNGF---Y 294 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T S + + Q+L G + + G K Sbjct: 295 PNHGTASYQGIMRGAQMLKKGTNPRRLLIVLSDGDDSGTSQKNIHKQLVNAGMCTKIKQ- 353 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I + ++ + + + N L+ C + D L + + I++ Sbjct: 354 ELSTGISSSGQSIKARLAVVGFDYNVNNNTALRDCAGAENVFKAQNTDDILNKILELITE 413 Query: 408 LMVHRK 413 + H K Sbjct: 414 EIGHLK 419 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. Identities = 49/474 (10%), Positives = 127/474 (26%), Gaps = 85/474 (17%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + I +II L + LL +G I + + + +++A ++A+LA A+K S Sbjct: 1 LSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKYKS 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + + + + + + + M Sbjct: 61 GMPEATIAKTINAFLSAN---GEFETAVAGKPQVASDESELCLDVADA-----VPTTFMK 112 Query: 131 HMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + S + V L ++ + + + + L+ Sbjct: 113 LANIQSVPISIRSCAALPGVKQLEIALVLDVSSSMIEENRFTPMQTAVAGFLQAFSSNTS 172 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL-------FLDSI 242 + + + S + + ++ ++D + Sbjct: 173 LVDKTKISIVPFSSRVNFGLANTAWLKSYNGTAAVPKRWTDPESVYTSSGYKLSYWIDGV 232 Query: 243 DLL-SHVKEDVYMGLIGY-------------------------------------TTRVE 264 + S K +MG I + Sbjct: 233 TPVMSTSKNYYWMGCIEPRADVEVRDTGAIGDGMGDAPPSTSAFVAMDANPKSGTSFCPP 292 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + ++ V T + + L+ + + V Sbjct: 293 PVTPLTGDFAYLKSVVKNLTSE---GSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPV-- 347 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNT---------------------IKICDKAKENFIK 363 S K ++F+TDGE N + C K++ I+ Sbjct: 348 -SDSVHKVMVFMTDGEMNTKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIE 406 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV--HRKYS 415 I T+S +A + + + C ++ + + + ++ V++ I+ + + + Sbjct: 407 IYTLSYSADADVVNI-RNCATNTAHFFTA-SPATIKTVYETIAAAIRGDTLRLT 458 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 44/454 (9%), Positives = 119/454 (26%), Gaps = 76/454 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES--------ANN 58 F + + N ++IFAL ++ + IG + + + +++ A + Sbjct: 3 FAEAIARFHRDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVS 62 Query: 59 AAILA--GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 A A M ++ + +++ S SG + T Sbjct: 63 RTSPAYIAAGSMSTDGAITSGADDALRIFNGNLANLTGYTLDSVAATVSKSGEAVTSKVT 122 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVS 175 S + M +A + + + + L + + + Sbjct: 123 FS---AQISTMFMKAVAVSSMTVGGVSTATASMPKYIDFYLLLDNSPSMGVGATPTDVAA 179 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + A D + C ++ + + T++ L++A Sbjct: 180 MVAATANKSS------------DDHCAFACHDVNNKNNYYNL-AKALGITTRIDVLRSAT 226 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILK 290 +D+ + M + + + S + D + +K Sbjct: 227 QQLMDTAAATATYTNQFRMAIYDFGASAQTAGLRNLFALSASLSSAKTAAGAI-DLMTVK 285 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---- 346 + Y + + K++ F++DG + + Sbjct: 286 GQNDNNDQDTQYTAILPAIDKLIAAPGTGAAGS-----PLKYLFFVSDGVADEYNPACLK 340 Query: 347 --------NVNTIKICDKAKENFIKI---VTISINASPNG-------------------- 375 + +C K+ +++ T ++ N Sbjct: 341 PKTGNRCQSPINPALCKTLKDRGVRVAVLYTTYLDLPSNDWYKKWIAPFNEGPYGPSPNS 400 Query: 376 --QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ C S P +++ V + + + Sbjct: 401 EIAKNMEACAS-PGFYFEVSPTQGIAEAMNALFK 433 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 47/446 (10%), Positives = 116/446 (26%), Gaps = 51/446 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS-NLS 73 I ++K IIFAL ++ L + V W+ ++ + + +A +AGA + + L Sbjct: 5 IKNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPYLG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 G + + + + Y+ + ++ V+S Sbjct: 65 EDGHLRLAEEVARENFSAGYLMTPDSGERSATFTAYADEDHRIRVEGTVSSPGNLAGLFG 124 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIF 191 + + +S + + L + N + + + + Sbjct: 125 VDWVATSAMGVAKKNE--VEIMLVLDRSGSMDGTPMNDLKKAARSFVSFFEETQDQDKMG 182 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNS-APICQDKKRTKMAALKNALLLFLDSIDLLS--HV 248 L+ + + N + + +L A + S Sbjct: 183 LVSFATSVKVDVPLGNNYVSSMTSKINAMDAVGATNAEDSLSQAGNPAKGGLTDQSGVPG 242 Query: 249 KEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQIL 305 + V +I ++ + + V + T ++ + Sbjct: 243 NKRVQQFVIFFSDGNPTAFRGKFKYNGTDNIDAVVCGTGNDCGTVYTKLGKPEREEWLSY 302 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD--------------GENNNF------- 344 G + + + T G N Sbjct: 303 NPRFTGDGKPKPPGTGTSKCTTRYGGSYVNTTKWYVLDDPDYRLTYRGTTYNSESCFIPT 362 Query: 345 ----------------KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + ++ + K+N +KI TI + + L S P + Sbjct: 363 VGSSNTTAPLSTYICTTARGMAVEHAQELKDNNVKIYTIGLGNID--RDFLSQIASGPSF 420 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKY 414 + + L +F I++ + + Sbjct: 421 EFYAPTSGELQAIFNKIAKDI-KLRL 445 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 48/467 (10%), Positives = 123/467 (26%), Gaps = 72/467 (15%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L R +A+ + + V +++ G + + S+ A +AA+L Sbjct: 5 VLRRLPARLAAFGRDARASILPMVGVLVALMVVIGGAGLDYGRAIMLRASISHALDAAVL 64 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A A ++ ++ + ++I + + + + Sbjct: 65 AVARQLSVSIMTDSELDKAIKDAFAANMASAGLSG--ATLGDLTYVLDPDAGTISATATA 122 Query: 123 NSSRISM--THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ------RYNQKIV 174 + + + + + + + N + Sbjct: 123 LVPTYFIHVGGLGPENVAIAASADATYSRFDVELAMVVDVTGSMRNSMASLRTAAQSVVD 182 Query: 175 SFIPALLRIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 IP + + I L+ ++ G +++ + + A N Sbjct: 183 ILIPDGTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCVTERMGNEKYTDATYN 242 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + S+ + + + +++ D T Sbjct: 243 YNGTSSEFFGGGSNS-----------CASTPQMEPLTSKRNTLTSAISKLKD---NGRTA 288 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-------- 345 + + L+ + + KF + +TDG+ N + Sbjct: 289 GQTGIAWGWYALSPKWSNLWPNDSV--PGSYTDSDILKFALIMTDGDFNEYYDKATAQSN 346 Query: 346 ----------------------------------SNVNTIKICDKAKENFIKIVTISINA 371 S+ +C K+ I++ +I + Sbjct: 347 CKWQFNWSTFKWEQVCDSSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFGS 406 Query: 372 SPN--GQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + N G +++K C SS E + + LI F I+ + + S Sbjct: 407 NANSAGAKVMKDCASSTKETFFMATSDSELIAAFAKIANKIQNIYLS 453 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 64/327 (19%), Positives = 102/327 (31%), Gaps = 31/327 (9%) Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY-- 146 D K ++ + SG S + T + + M + S + Sbjct: 13 QSGVDIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPLMQAVGFTTQHLSTSGTTIGG 72 Query: 147 --NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 S L + D +G + Sbjct: 73 HSQTQGSISMYLALDKSGSMGEDTATV---------------NEEDPTESYTYDCNGHYN 117 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 ++ + TK+ ALK A + + V G + Y Sbjct: 118 KK---GKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQY 174 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 +WGT V YV T+S+ AM AY LT+ +K Sbjct: 175 TPSALAWGTSGVSTYVNAL---QAGGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKT 231 Query: 325 PSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 +K+I+F+TDG+NN+ S + T CD AK I+I TI+ A GQ LL Sbjct: 232 GQT-PKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALL 290 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNIS 406 C S +++ + L+ F+ I Sbjct: 291 HYCASDDSHYFQAEKMEDLLAAFKAIG 317 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 51/486 (10%), Positives = 117/486 (24%), Gaps = 123/486 (25%) Query: 40 LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + ++ K+ +++ + + LA AS L + D + ++ +A + + Sbjct: 56 ALDLMRHEQKRTTLQQTLDRSTLAAAS-----LQQSLDPESVVRDYFAKANMTQYLSGVT 110 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + +++ M + + D+ + L Sbjct: 111 VDEGMNYREV-------NALAAADTNPFFMQMVGIDSFDAKAASTAEQRISNVEVSMVLD 163 Query: 160 FIEHL---------------------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + +S +P ++ +G + + Sbjct: 164 ISGSMASNSRLTRLRPAAKEFIDTVINGSDPGRVSISVVPYNAQVNLGAGMMSQFNVNAL 223 Query: 199 LSGSMH-------------------------CAMNSDPEDVNSAPICQDKKRTKMAALKN 233 S S + + P + T M+ Sbjct: 224 HSTSYCVELPNSVFGSTGLSQATSFVHNGHFDPWGTGNSSNYNCPPTANVAVTPMSGDAA 283 Query: 234 ALLLFLDSIDL----------------LSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEK 275 L +D + L + + GL+G + Sbjct: 284 YLKGRVDLLASMGYTSIDVGVKWGTLLLDPSAQPLINGLVGLGQVDEDFTDRPLDPDEAN 343 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQIL--------------------------TSDK 309 V + + D + T + + S Sbjct: 344 VLKVLVVMSDGENTEEYKLTAPYRSGPSAIYKKTSNSKLTLYSDRASTTSDYYWFSDSKW 403 Query: 310 KRSFFTNFFRQGVKIP---------SLPFQKFIIFLTDGENN-----------NFKSNVN 349 + + G N + +V Sbjct: 404 HTTIDGGTTGSVQMTWPEVWAKWSVRYVAKDIYTKALGGSENSWFETFTDEISYGQKDVR 463 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +ICD AK++ I I +I A NG+ L+ C S P ++N + F+ I+ + Sbjct: 464 LQQICDAAKDSGIVIFSIGFEAPENGRNQLRDCASQPSNYFNATGV-QITTAFRAIATQL 522 Query: 410 VHRKYS 415 H + + Sbjct: 523 SHLRLT 528 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 47/454 (10%), Positives = 117/454 (25%), Gaps = 75/454 (16%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES---------ANN-AA 60 K + NF I+ AL ++ G + + K ++ + +A Sbjct: 1 MKNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAADSAALAAVADMSA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + A KM + E+ + D + G + Sbjct: 61 SVQAAKKMSGDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHGNVVESSVSF--- 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + S + + + + Y + + + L + Sbjct: 118 KASVSTTLSGLLGKDFVSVAGTATAKYETETFSDFYLLLDNTPSMG-------------- 163 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI---CQDKKRTKMAALKNALL 236 + + + + C + D ++ + T++ + A Sbjct: 164 ---VGATPTDVATLVANTGDKCAFACHIVKDGVADPNSYYFKAKKLGVTTRIDVVAKATA 220 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEK-----NIEPSWGTEKVRQYVTRDMDSLILKP 291 +D+ M + + R E + + + ++ + + Sbjct: 221 SLMDTAKSTRKSSNQYRMAVYTFGERAEDTKLLEVVSLTSDLDAAKKKAGEIN-LMSIPY 279 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + + G S K I F++DG +++K + T Sbjct: 280 QGYNNDQQTDF----DRALIQIGDKVGSSGTGASSANPDKVIFFVSDGVGDSYKPSSCTK 335 Query: 352 KI-------------CDKAKENFIKI---VTISINASPNG-------------QRLLKTC 382 K+ C K KE +I T + N +++C Sbjct: 336 KLTGGRCQEPIDIKDCTKLKEKGFRIAVLYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSC 395 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL-MVHRKYS 415 S P ++ V + + + + + + + Sbjct: 396 AS-PGLYFEVSPSQGISDAMTVLFKKAITSPRLT 428 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 39/361 (10%), Positives = 82/361 (22%), Gaps = 36/361 (9%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I E +++ + FL++ G + + + S+++ + A+LAGA+ + Sbjct: 19 FIQEEDGTVTVLSFFIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATA----ST 74 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 G R AK D I + ++ M Sbjct: 75 EAGARTIVEDYFAKSGQSDYLLAQKDGDIS-----TTLNAAKVTAGAELSLDTYLMKLAG 129 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL- 192 L +S L + + Q + + + M Sbjct: 130 VPTLSASGTATAEVRIPKLEAILVLDVSGSMASNSKIQNLQTAAKDFVTTVMNSSKPGDT 189 Query: 193 ----IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL----------LF 238 + ++ E+ ++ C + K + Sbjct: 190 VMSIVPFSFSVTPPQSVFDALAVEETHNYSTCLEFKENDYQHATLSSGSSSLSSGIPVNQ 249 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + YT + + S + + T M Sbjct: 250 MVYTSVYGDFDNLDSGWRSCYTDEYIRILPYSTSITDLHAKIDAL---QPAGNTSGNEGM 306 Query: 299 KQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 +L + K IIF+ DG N Sbjct: 307 NWGAALLDPTFREVTASMIAAGHLSETLANVPSDYDEPETLKAIIFMGDGANTTSYFFDR 366 Query: 350 T 350 + Sbjct: 367 S 367 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 36/128 (28%), Gaps = 6/128 (4%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFR-QGVKIPSLPFQKFIIFLTDGEN--NNFKSNVN 349 ++ + S + + + G + N Sbjct: 453 NTFNNFESTMDGFISRTQLDWEMAWGLMSPEYYGQTTGNWGPWNDYIGSEFVSGSMKNGL 512 Query: 350 TIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +C K + + +I NG + L C SSP +++ D + F I+ Sbjct: 513 MQNVCKATKTEGVVVYSIGFEVPVNGTAENQLSACASSPAHYFRASGTD-IKSAFSAIAA 571 Query: 408 LMVHRKYS 415 + + + Sbjct: 572 NVKQLRLT 579 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 42/462 (9%), Positives = 114/462 (24%), Gaps = 70/462 (15%) Query: 8 RFYFKKGI--------ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 +F++ I + N ++ A+ ++ +L +G + + ++ M+S +A Sbjct: 5 FGFFRQLIAGRMATMVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDA 64 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A++A ++ A+ + + Sbjct: 65 ALVAAIKEIDEYDEDEIAEKIKDWFDAQSEKQSATYDLTEIT-------VDKSGHTITAS 117 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIP 178 MT + + + + + Sbjct: 118 ASGTVPTTLMTLADIKTVPVGVISAIEGPATSYLEVYIVIDKSPSM---------LLAAT 168 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + E + + I + + +A+ Sbjct: 169 SEDQAMLRADANITCEFACHDTKDPVKKNGTVIASTYYNYIKSLGVKLRTDVALDAVEEV 228 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 LD +D + +GL + + +EP++ T R+ ++ D L + S Sbjct: 229 LDMVDAADEDHARIKVGLYSLGETISEVLEPTYSTSTARKKLSDDSSGLTSATSMSATYF 288 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVK----IPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + A + L + +K + + + C Sbjct: 289 QTALKALKKKVGTAGDGTSAASPLKLVLLLTDGVQSNRDWVIKWSGKYWGRVTPLNPDWC 348 Query: 355 DKAKENFIKIVTI------------------------------------SINASPNGQR- 377 D K+N + + +++S Sbjct: 349 DYLKDNDATMAVLYTEYLAIPADWGYNATLAKSMGNSDWTSTWGGTLHSGVSSSTTRHDY 408 Query: 378 ---LLKTCVSSPEYHYNVVNADSLIHVFQNISQL-MVHRKYS 415 L+ C SS + + + D + + + + + Sbjct: 409 IPIALQDCASSSDLFISAASEDEITAGLSTLFNQYLTSVRLT 450 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 147 bits (370), Expect = 4e-33, Method: Composition-based stats. Identities = 48/461 (10%), Positives = 119/461 (25%), Gaps = 69/461 (14%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ R F+ N +I+ ALS++ L+ +G + + + M+S +AA Sbjct: 19 LNSFGRLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAA 78 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++A ++ ++ + + + A I+ N Sbjct: 79 LIAAVKQINNSEDTDALKQKVYDWFHAQVENSYALGEIEIDTTNH---------NITATA 129 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 M + + S + + + +L Sbjct: 130 SGTVPTTFMKIANIDTVPVSVGSAVKGPATSYLNVYIVIDRSPSMLL------------- 176 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 I + N + + + +A+ L Sbjct: 177 AATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVL 236 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL----KPTDST 295 D ID E + +GL ++ + P+ T R+ ++ D L T Sbjct: 237 DMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFD 296 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +I+ + + N + + + + + + N + +N C Sbjct: 297 VALAALQKIVGTGGDGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNP-DWCG 355 Query: 356 KAKENFIKIVTI------------------------------------SINASPNGQR-- 377 K + + ++ S + Sbjct: 356 YVKNKSATMAVLYTEYLPITTDWGYNDTVGSTMASANWKSNWGGTMRSGVSTSITRRDYI 415 Query: 378 --LLKTCVSSPEYHYNVVNADSLIHVFQNISQL-MVHRKYS 415 L C SS + ++ + + + + + Sbjct: 416 PYALADCASSKSLFLSASSSTEITEGLSALFTQYLSSVRLT 456 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 146 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 51/454 (11%), Positives = 131/454 (28%), Gaps = 74/454 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++IFAL ++ + +G + + ++ +++A +AA + S Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--ESLSGYSAVFYNTEIQNIVN- 123 + + + DDA R ++ + S+ T+ ++V Sbjct: 63 R---TSPAYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLASSSATVTKASDVVTS 119 Query: 124 -------SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVS 175 S + M + + + + + L + + + Sbjct: 120 QVTFSAQISTMFMKVVGMSAMTVGGTSTATASMPKYIDFYLLLDNSPSMGVGATPTDVSA 179 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I A D + C ++ + + T++ L++A Sbjct: 180 MIAATANKSS------------DDHCAFACHDVNNKNNYYNL-AKALGITTRIDVLRSAT 226 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK----NIEPSWGTEKVRQYVTRDMDSLILKP 291 +D+ + M + + + N+ + + +D + +K Sbjct: 227 QQLMDTASATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKG 286 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS----- 346 + +Y + + QK++ F++DG + + Sbjct: 287 QNDNNDQDTSYTAIFPAINNEISSPGSGVSG-----SPQKYLFFVSDGVADEYNPSCLKP 341 Query: 347 -------NVNTIKICDKAKENFIKI---VTISINASPNG--------------------- 375 + +C K+ IKI T +N N Sbjct: 342 KTGNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSE 401 Query: 376 -QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ C S P +++ V + + + Sbjct: 402 IAQNMEACAS-PGFYFEVSPTQGIAEAMNALFKK 434 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 146 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 33/343 (9%), Positives = 89/343 (25%), Gaps = 37/343 (10%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 ++S + + G I ++ + ++ + A+LA A +L + + ++ Sbjct: 6 MIAFLLSMVAVGGIGIDLMRMERDRTILQYTLDRAVLAAA-----DLDQPLPPDVVVQDY 60 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 +A + + + + T + + N+ Sbjct: 61 LNKANLSEYYQPPIAETGIGYKRVESTIDTTF-------ETQWLDFSGGQDMPLYANSRA 113 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP-----IFLIELVVDLS 200 L + + + + + + + ++ +S Sbjct: 114 EESIDGLEISLVLDVSGSMNSNSRLYNLKNAARDFIDTMVANTADNKMSVSIVPYATQVS 173 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE---DVYMGLI 257 + D + C + T + + L+ + GLI Sbjct: 174 LPKDMLDQYNVTDEHEYSNCVNFTGTHFTSTGLSTTASLNRTMHFTPWWSGDARPSNGLI 233 Query: 258 GYTTRVE----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR-- 311 Y E + + +++ ++ T MK +L + Sbjct: 234 QYPVCDERAHREVMPFQKDANRLKDFIQNL---QAWGNTSIDVGMKWGTVLLDPSAQPVI 290 Query: 312 --------SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + F + K I+ +TDG+N + Sbjct: 291 SALTSSSVNVPGVFADRPAAYNDTETVKVIVLMTDGQNTSQYY 333 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 2/91 (2%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 S + + D + N T ICD AK I + TI A G +LK C S Sbjct: 434 NSAAKNSWYYGVYDSWGTS-TKNARTKAICDAAKARGIVVYTIGFEAPSGGVSVLKDCAS 492 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S ++++V + F +I+ + + + Sbjct: 493 SDAHYFDVQGL-EISDAFASIATSIRQLRLT 522 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 146 bits (368), Expect = 5e-33, Method: Composition-based stats. Identities = 48/472 (10%), Positives = 127/472 (26%), Gaps = 86/472 (18%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + S+ AL ++ L F++ V+ + + + A+L + + + Sbjct: 10 ARVQHGVISVTAALMLLGMLTFFSFVLLVIVLSTTDSRLSMLAD-AVLYSTTNSYNAKAD 68 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 E+ + + ++ + S + Sbjct: 69 AQQMSEANTPQP---------NLGLSSLQVDTGNNENAAQVQVSGRVDRGSLALTDTLGT 119 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN---------------------QRYNQKI 173 + + ++ L + +R K+ Sbjct: 120 SDVLVTHQAQSKIHQTTLEIVVMLDVSNSMKGEPMTQSIKGLRDFADILYAEERRDFSKV 179 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM----- 228 VS +PA + +G RP F + P + R M Sbjct: 180 VSIVPATGLVNIGHRPEFFSASAFAIPRDWRSLAKERGWKDLLHPEVPGRWRKAMCTALP 239 Query: 229 ------AALKNALLLFLDSIDLLSHVKE------------------------DVYMGLIG 258 ++ ++ ++L + Y G Sbjct: 240 EEQDELTSVSALTPNWIRRLELSPPDQNLRLHMEWMSKPAIEHYENDMPLFTYYYSGNPK 299 Query: 259 -------------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + + + R ++ T++ + A+++L Sbjct: 300 EKYSPNKHEQRGLFDSPDCGVSQIQPLLSTRRAFIKALDTLYPEFNTNNAEGVMWAWRLL 359 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIKICDKAKENFIK 363 + + + +K ++ TDG + + K + + +C + K+ I+ Sbjct: 360 SPHWRGYWDKGKSELPRDYQHPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIE 419 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I++I N +++K+C S +Y N + V + ++ + + + Sbjct: 420 IISIDF---NNRSQVMKSCAS-AGQYYIADNRT-IRSVLKQVATTLSKIELT 466 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 146 bits (368), Expect = 7e-33, Method: Composition-based stats. Identities = 42/467 (8%), Positives = 123/467 (26%), Gaps = 87/467 (18%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ ++ S FAL ++ L L G I ++ + ++ + +A + A++ A+ + L Sbjct: 3 LLSEQRGGLSPAFALMLVGVLALTGVAIELVRGYNGQSLLSAAAD-AVIYSAADSDTALQ 61 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 ++ + + + + L G + + Sbjct: 62 DAQALMQANLSGRNLQVATPRLSQGEQGAQVILQGKVPALMDLSAIDE-----------G 110 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN---------------------QRYNQK 172 + + + L + + +R ++ Sbjct: 111 EEGMPIAAAARASSARSRIEVALVLDISDSMSGAPMKAIKQGLVEFGEVLFGRERRNQER 170 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK----- 227 +VS IPA + +G+ P +++ + + ++R Sbjct: 171 VVSIIPATGLVNIGDHPELFHPESIEIPFGLRTLAEERGWLNLLSREVPGRQRKAFCARL 230 Query: 228 ------MAALKNALLLFLDSI-------------------------------------DL 244 + + ++ + + Sbjct: 231 PEHVDGIDRVAEVTPGWIRKLEQAPRDELRPHLFYSTKPPPIARYGDGTPLLAFAPKENP 290 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 L E+ L + Y T++ + +++ Sbjct: 291 LDRYLENRRDKLGIFDDPDCGVSPIQAHLSTRADYRQALDTLYAAFNTNTAEGVMWGWRL 350 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE--NNNFKSNVNTIKICDKAKENFI 362 L+ + + + +K ++ +DGE + + +C + K I Sbjct: 351 LSPEWQGRWRQGAAALPRPYELQDNRKIMVLFSDGEHMTEAALRDRKQLLLCREMKRKGI 410 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +I T++ + C S + ++ V ++ + Sbjct: 411 QIYTVAFEGDTR---FVAQCASDRSLAFKA-TKSNIRTVLTRLASSI 453 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 146 bits (368), Expect = 7e-33, Method: Composition-based stats. Identities = 51/454 (11%), Positives = 131/454 (28%), Gaps = 74/454 (16%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++IFAL ++ + +G + + ++ +++A +AA + S Sbjct: 3 FSRSIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVS 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--ESLSGYSAVFYNTEIQNIVN- 123 + + + DDA R ++ + S+ T+ ++V Sbjct: 63 R---TSPAYVAAGAMSGDGVISSGADDALRIFNGNLNGLTGYTLASSSATVTKASDVVTS 119 Query: 124 -------SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVS 175 S + M + + + + + L + + + Sbjct: 120 QVTFSAQISTMFMKVVGMSAMAVGGTSTATASMPKYIDFYLLLDNSPSMGVGATPTDVSA 179 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I A D + C ++ + + T++ L++A Sbjct: 180 MIAATANKSS------------DDHCAFACHDVNNKNNYYNL-AKALGITTRIDVLRSAT 226 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK----NIEPSWGTEKVRQYVTRDMDSLILKP 291 +D+ + M + + + N+ + + +D + +K Sbjct: 227 QQLMDTATATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKG 286 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS----- 346 + +Y + + QK++ F++DG + + Sbjct: 287 QNDNNDQDTSYTAIFPAINNEISSPGSGVSG-----SPQKYLFFVSDGVADEYNPSCLKP 341 Query: 347 -------NVNTIKICDKAKENFIKI---VTISINASPNG--------------------- 375 + +C K+ IKI T +N N Sbjct: 342 KTGNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSE 401 Query: 376 -QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ C S P +++ V + + + Sbjct: 402 IAQNMEACAS-PGFYFEVSPTQGIAEAMNALFKK 434 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 51/407 (12%), Positives = 121/407 (29%), Gaps = 35/407 (8%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + S+ F ++ ++L I + + A +AA LA A S+ S Sbjct: 1 MLKQHTGSVSLSFLALLIPLVVLSAATIMIGFQVQLSSRAMQAVDAASLACAFADYSDPS 60 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKN-HIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 E + K + ++ L+G + + S + H+ Sbjct: 61 VNQAYLEYYQPNVKLVKSEIYSASGCELNMGYQLTGLFSSLKFAQASYSAQSGSVEQAHV 120 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS--FIPALLRIEMGERPI 190 + T L + K + I + + + Sbjct: 121 NQS-----------ASVTPTEMTLVLDISSSMAGSIDTLKSILTRAIERIEQDNVQIDGR 169 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM--AALKNALLLFLDSIDLLSHV 248 I + + + S + + + C D + + L N + LD I V Sbjct: 170 RAISISI-VPFSDGVSARNADWLDDKGVFCIDGLTKESGGSVLVNETVQNLDRIHSEKAV 228 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + + + +V+ + T S + + L Sbjct: 229 SHRAPDEFLADCSASATLVPLTDNMSEVKTAINALT---TTGGTRSYQGVIWGARQLIPR 285 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKEN-FIKIVT 366 ++ + N + ++ +I +TDG ++ + + +CD+ I++ Sbjct: 286 WRQEWGYNPYSLA-------PKQKLILMTDGVDSGYVLDDLIDAGLCDRLANEFAIELNF 338 Query: 367 ISINASPNGQRLLKTCVSSP------EYHYNVVNADSLIHVFQNISQ 407 I N + ++C+++ ++ N + L F I + Sbjct: 339 IGFNVQDSRLAQFQSCINAANTDGIKGQVFSATNTEKLDEYFSKILE 385 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 40/347 (11%), Positives = 100/347 (28%), Gaps = 32/347 (9%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + A + S + + ++ G I ++ K++ +++ + A+LA A NL Sbjct: 25 RFWADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAA-----NL 79 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S D + ++ + ++D + + +G S + + + + + Sbjct: 80 SNTRDPQTVVEDYFRAMKLED-------TLGDVQTGDSLGAKRVRAEGNGSINSHFLGLI 132 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI-----EMGE 187 ++LD N L + Q+ + + + + Sbjct: 133 GVDQLDVYGAATAENATAPLEISLVLDVSGSMQGQKIRDLKEAAKAFVDAVLGEGGDNSR 192 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + LI ++ + + + + C + + +L L+ + Sbjct: 193 VTVSLIPYNATVNLGDDLSERFNLDRWQNYSSCAIFESSDYNSLSIDPNAGLEQLAHFDP 252 Query: 248 VKEDVYMGLIGYTTRVE----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + E + S + + + T MK Sbjct: 253 YDYSGNSPDLTAPWCAEGNNLAIVPHSSDADYLSDVIDSF---EAQGNTAIDLGMKWGLA 309 Query: 304 ILTSDKKRSFFT--------NFFRQGVKIPSLPFQKFIIFLTDGENN 342 +L + + R KF++ +TDGEN Sbjct: 310 LLDPAARPVIGDMQADGLVPSSARYRPSDYGTQTMKFVVVMTDGENT 356 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 90/256 (35%), Gaps = 19/256 (7%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 N +S + G+ V D G + N+ T+ A Sbjct: 367 NPNALSDVWVDDHGTPGKGDDRYSIRVKDNYGDSNDVFYWPHASRNNYRNGPYSWVTRTA 426 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGY---TTRVEKNIEPSWGTEKVRQYVTRDMDS 286 A ++ + ++ E Y + + D D Sbjct: 427 A------QMVNGVAVVDGDSETTKAKCSSYKGAGHNAGQETLIENVLGMDYGTLDLDGDG 480 Query: 287 LILKPTD--STPAMKQAYQIL-----TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + D + P ++ +Q L T+ +++ + G + + + + T Sbjct: 481 IAGANDDCSNYPPVRLTWQELFGNVKTTYYANAWYWQAYMDGRASYNDYYNAYYSWET-- 538 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + ++N N IC KAK+ + I TI + A G ++ C SS ++YNV + + L+ Sbjct: 539 TVDASQANTNLATICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYNVSS-NQLV 597 Query: 400 HVFQNISQLMVHRKYS 415 F++IS ++V + + Sbjct: 598 DTFRSISDVVVELRLT 613 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 48/455 (10%), Positives = 118/455 (25%), Gaps = 75/455 (16%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES------------ANNAA 60 + I NF I+ AL V+ L G + + + + + AA Sbjct: 4 RFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYAAADAAAVGSITPTSEAA 63 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A +L+ + I DA + +++ S+ Sbjct: 64 AQANTMSGDGSLTLGKSEAQKIFFSQMSKKQGDAPVTVDISVQKKGDTLSSTVS-----F 118 Query: 121 IVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 M M + + + Y + L + Sbjct: 119 NATMPTTFMQVMGFDEIAVTGAATAQYQTPSYMDFFMLLDNTPSMG---------VAATT 169 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALKNAL 235 M + + D + + C + S+ + + ++ + +A+ Sbjct: 170 DDITAMKKATANGHDGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVASAV 229 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--------NIEPSWGTEKVRQYVTRDMDSL 287 + + + + + ++ V + + Sbjct: 230 KALMAKAKDTQSMPSQFRVAAYTSGKTAQDAKAAKLFKVSDLNYDLGAVAAAAN-MIKLM 288 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + + ++ + N G + QK + F+ DG +++K Sbjct: 289 SIPYQNYYSDQQTSFDEALKGIEGEIKGNI---GTGTSNADRQKIVFFVADGVGDSYKPT 345 Query: 348 VNT---------------IKICDKAKENFIKI---VTISINASPNG-------------Q 376 T C K K+ IK+ T + NG Sbjct: 346 GCTSPKGANGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETRIA 405 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ C +P +++ V + + + + + +V Sbjct: 406 AKMEECA-TPGFYFAVSPTEGIEEAMEALFRKIVS 439 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 43/340 (12%), Positives = 91/340 (26%), Gaps = 23/340 (6%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R Y ++ E+ +++ + FL++ G I ++ + S+++ + A Sbjct: 1 MRETTRAAEYAERFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LAGA+ + + R AK D I L + Sbjct: 61 VLAGATAVNN----ATARAVIEDYFAKSGQSDYLAAQEAGDIDIRL-----NSSKVTARA 111 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 M + L S+ N+ L ++ R + + I + Sbjct: 112 TQTLDTYLMRLAGVDTLTSAGNSTAEVTIPKLEIAMALDVSGSMIGARIDALKPAAIEFV 171 Query: 181 LRIEMGERP----IFLIELVVDLSGS---MHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 I P I ++ ++ S ++ +S D T N Sbjct: 172 DSILDSTEPNDAVISVVPFSWGVTPSKEIYEALTVNETHKYSSCLELNDSHFTDTTIDPN 231 Query: 234 ALLLFL-------DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 L + L+ ++ T + + Sbjct: 232 TAYNQLIYTSREGVTFGDLTTTPLGDFLDTYNQTCYTQDYFNILPYATTKTALHDKINGL 291 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 T + +K A +L + + V Sbjct: 292 QAGGSTSNDEGVKWAAALLDPAFQPVVTSLQQPIQVPQDD 331 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 31/91 (34%), Gaps = 4/91 (4%) Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + ++ R + N K + IC +AK I I TI Sbjct: 477 SWETAWGLMSPRFYGNTTGNWGPWNNFL--NNPINRSKKDERLDDICREAKSEGIVIYTI 534 Query: 368 SI--NASPNGQRLLKTCVSSPEYHYNVVNAD 396 + + P G +K C SS +HYN + Sbjct: 535 AFEMGSQPTGADKIKKCASSVNHHYNATTVN 565 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 56/459 (12%), Positives = 126/459 (27%), Gaps = 74/459 (16%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R F + N +I+FA S++ L+ IG I ++ M+SA +AA + Sbjct: 4 RLSGLFDSFHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASVGSV 63 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI--KESLSGYSAVFYNTEIQNIVN 123 SK ++ DA ++ + + T+ + Sbjct: 64 SKASPAF---LAAGSMTTDGPIAVGSTDATNIFNGNMASQSGYTLSKLDAAVTKSGATLT 120 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 S+ +A L T S + ++ P + + Sbjct: 121 STVTFSASVATTFLTIIGKTALAIGGTSVS-------TSSMPVYIDFYLLLDNSP-SMGV 172 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + + C +D + +T++ L++A +D+ Sbjct: 173 GATPTDVATMVDNTSDKCAFACHDVNDEHNYYELAKT-LGVKTRIDVLRDATQQLMDTAA 231 Query: 244 LLSHVKEDVYMGLIGYTTRVEKN----IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + M + + + + + + +D + +K + Sbjct: 232 ATATYPNQFRMAIYDFGASAQSAALRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDDRD 291 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS------------- 346 +Y L + S QK+++F++DG + + Sbjct: 292 TSYSKLLPAIDKQITAAGAG-----TSDAPQKYLLFVSDGVADETNAGCAKTMKNAFWGN 346 Query: 347 ------NVNTIKICDKAKENFIKIVTIS------------------------INASPNG- 375 + +C + +K+ + N P G Sbjct: 347 KSPRCQSPIDPALCKAMTDRGVKVAVLYTTYLALPLKQANGDPSWYASWIAPFNVGPYGP 406 Query: 376 ------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +K C S P +++ V D + I + Sbjct: 407 SPNSEIANNMKACAS-PGFYFEVSPTDGIADAMNAIFRK 444 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 94/365 (25%), Gaps = 38/365 (10%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M L+SR E ++ + + G + + + + + ++ + A Sbjct: 1 MMLISRIAH----FRKDEGGGMLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNA 56 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LA AS LS+ D E + ++ A +D ++ +E G V E Sbjct: 57 VLAAAS-----LSQDLDAEEVVRSYLTSAGLDP--SEVEIETREEKIGGILVGRTVEASL 109 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV------ 174 + + L + ++ L + + ++ + Sbjct: 110 PAGLNTYFFRFFDIDTLGMTISSEATERVEDIEISLVLDVSGSMGDITSDRSGIKMDLLK 169 Query: 175 -----SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 L E G I ++ ++ +S C D Sbjct: 170 RAAGDFVETILSDAEEGRVSISIVPYSTKVNPGSALLGQYTVSQEHSYSHCVDFDADDFT 229 Query: 230 ALKNALLLFLDSIDLL----SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 L+ L + S +++ + Sbjct: 230 HLRIDTATELQRTGHFLIGSESTSNRTAGQWVCRFDSGFAVTPLSSSVAELKAQIAALT- 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFT---------NFFRQGVKIPSLPFQKFIIFL 336 L T K +L + F + + K ++ + Sbjct: 289 --PLGSTSIDMGAKWGLALLDPSAQTPIAAMIASGQVNRAFQGRPHVYGADNSMKVLVLM 346 Query: 337 TDGEN 341 TDGEN Sbjct: 347 TDGEN 351 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 20/191 (10%), Positives = 48/191 (25%), Gaps = 7/191 (3%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+ S + + Y Y Sbjct: 369 TRTTYGGYTYYSVASSESNYENDSDWTYPESNFYAIHPFGTQRMWSNYT----LANNSDF 424 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ + + ++ F ++ ++ + Sbjct: 425 RQARMSTEVRLDWPEVWAEMSPYYYGYNMYGRRYNSSWYWYQRATDFRDYIQWTVDSV-E 483 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +IC A + I +I ++ LLK C SS ++++V + F Sbjct: 484 KDRRLRQICGVANAAGVVIYSIGMDVDNTNSLNLLKDCASSESHYFDVEGL-EIQTAFDM 542 Query: 405 ISQLMVHRKYS 415 I+ + + + Sbjct: 543 IAASISMLRLT 553 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 41/464 (8%), Positives = 116/464 (25%), Gaps = 89/464 (19%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + S FAL + L L G +I ++ + ++ + +A + A+L A+ + Sbjct: 7 QGGGLSPAFALMLTGILALTGVVIELVRGYSGQSLLSAAAD-AVLYSAADSDTAAEDAVA 65 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 ++ + S +Q V + + Sbjct: 66 LVQANLAGR------------PLQVGPPSLSQSEQGARVILQGHVPALMDLSVIGEGGDM 113 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLN---------------------QRYNQKIVSF 176 + + L + +R ++VS Sbjct: 114 PVAAAARASSARTRIEIALVLDVSNSMSGAPMKAIKQGLTEFGEVLFGRERRNQDRVVSI 173 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--------- 227 IPA + +G+ P + + + ++R Sbjct: 174 IPATGLVNIGDHPELFHPESLAFPFGLQTLAHERGWSNLLTRDVPGRQRKAFCARLPEHV 233 Query: 228 --MAALKNALLLFLDSIDLLSHVKEDVYMGLIG--------------------------- 258 + L ++ ++ + + Sbjct: 234 DGIDRLAELTPGWIRKLEQAPVGETQPRLHYSTKPPAIKQYEDGTPLRAFAPRENPLERY 293 Query: 259 ----------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +Y T++ + +++L+ Sbjct: 294 LENRRDKLGIFDDADCGVSPIQAHLSTRAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQ 353 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---NVNTIKICDKAKENFIKIV 365 + + +K ++ +DGE+ ++ + + +C + K I++ Sbjct: 354 WQGRWGQGAAELPRPYGQADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVY 413 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T++ + C S + Y + + + V ++ + Sbjct: 414 TVAFEGDAR---FVAQCASDRSHAYKATSGN-IRTVLTRLASAI 453 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 143 bits (360), Expect = 5e-32, Method: Composition-based stats. Identities = 43/452 (9%), Positives = 116/452 (25%), Gaps = 70/452 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ + N +I L++ L IG + K ++ A + A + + Sbjct: 5 LKRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASVGSVAVNSY 64 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG-----YSAVFYNTEIQNIVNSS 125 S +AL K++ ++ + +EI + Sbjct: 65 AYKANTKGHSSFKTGENQALAIFNSNVKKHNDLNNIKVKAKIKKQSTNLVSEIGVTADYR 124 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + M N + + + + + L + + Sbjct: 125 PYLLGLMGMNTMPITIKSTSSSTFPPYIDFYLLLDNSPSMG-----------------VG 167 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I + + C + A + K T++ ++ A + + Sbjct: 168 ATTKDIDTMVANTSDKCAFACHQMDKAGNDYYALAKKLKVTTRIDVVRQATQNLMTTAKN 227 Query: 245 LSHVKEDVYMGLIGYTTRV----------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + M + + + + + D + + + Sbjct: 228 TQTLTDQYRMAIYHFGMAADQIDSKNPAPYEVSALTTNLSTSASNAAKI-DLMTIPYQNY 286 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---------- 344 + + G S Q+ + F++DG N+ + Sbjct: 287 NSDRQTNF----PSYLLGMNKVIPSSGDGSSSSKPQQVLFFVSDGANDGYDCAYSNGASC 342 Query: 345 -KSNVNTIKICDKAKENFIKI---VTISINASPNG----------------QRLLKTCVS 384 + + C K +KI T + N ++ C Sbjct: 343 RRISPLDTPQCKAMKARGVKIAVLYTTYLPLPTNAFYNSHLAKYVSPTSQLAAKMQECA- 401 Query: 385 SPEYHYNVVNADSLIHVFQNI-SQLMVHRKYS 415 + ++ V + + + ++++ + S Sbjct: 402 TEGLYFEVGPNEGISEAMNALFAKVISTVRIS 433 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 143 bits (359), Expect = 7e-32, Method: Composition-based stats. Identities = 43/373 (11%), Positives = 103/373 (27%), Gaps = 28/373 (7%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ R F+ N +I+ ALS++ L+ +G + + + M+S +AA Sbjct: 6 LNSFGRLVRAFRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAA 65 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++A ++ ++ + + + A I+ N Sbjct: 66 LIAAVKQINNSEDTDALKQKVSDWFHAQVENSYALGEIEIDTTNH---------NITATA 116 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 M + + S + + + +L Sbjct: 117 SGTVPTTFMKIANIDTVPVSVGSAVKGPATSYLNVYIVIDRSPSMLL------------- 163 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 I + N + + + +A+ L Sbjct: 164 AATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVL 223 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL----KPTDST 295 D ID E + +GL ++ + P+ T R+ ++ D L T Sbjct: 224 DMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFD 283 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +I+ + + N + + + + + + N + +N C Sbjct: 284 VALAALQKIVGTGGDGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNP-DWCG 342 Query: 356 KAKENFIKIVTIS 368 K + + Sbjct: 343 YVKNKSATMAVLY 355 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 23/314 (7%), Positives = 78/314 (24%), Gaps = 25/314 (7%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L ++ R + IA N +++F ++ + + +G + + +M+SA ++ L Sbjct: 5 LFTQLRKSAARFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTAL 64 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A N + + + + + + L+ + ++ Sbjct: 65 MVAKDYSLNKISASEIDGKAKSIFSALYTNKSANSV--EVVAVLTPNTGKGSTIKVDGTG 122 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP---- 178 M + +++D ++ L + + + + Sbjct: 123 KVPTDFMKLVNISQIDIGASSTTTWGSTRLRVALVLDTTGSMNDNGKIGALKTATQNLLT 182 Query: 179 --ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + +I D++ + + + + K+ Sbjct: 183 QLKDAAGKPEDVYVSIIPFSKDVNVGASNYTANWIDWTDWKSQPPV-----LDYAKSGSK 237 Query: 237 L-FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----------TRDM 284 +D++ V + + + Sbjct: 238 SGLVDNVAWKDVGPGSKCPFSDSKHGFVCTDRPATEANANKVSNIPSNGTRKGLICPSMD 297 Query: 285 DSLILKPTDSTPAM 298 T + Sbjct: 298 TGGPNGTTGPRNGL 311 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 57/177 (32%), Gaps = 30/177 (16%) Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 ++ K E + + V + T+ + +Q L F Sbjct: 412 SSCAPKVTEMNNNWATMNTTVDGL---FPVGGTNQPIGLVWGWQSL------VGGGPFPT 462 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVN----------------TIKICDKAKENFIK 363 VK +Q I+ ++DG N + N T C K IK Sbjct: 463 PPVKDEQYTYQDIIVLMSDGLNTVDRWYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIK 522 Query: 364 IVTISINASPN-GQRLLKTCVS----SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + T+ +N + + LLK C S + V +A L F +I+ + + + Sbjct: 523 VYTVHVNTNGSPESTLLKNCASPADDGGKEFQMVTSASGLNAAFNSIATKLTDLRVA 579 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 56/399 (14%), Positives = 142/399 (35%), Gaps = 52/399 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + +++ +I A+ ++L+ + + + + +K +++ + A+L Sbjct: 6 IRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALL-------- 57 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++I + K ++K L + + + + Sbjct: 58 -DTVTMIKLKNIEDVVKNV--GPVNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKLE--S 112 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 L + + + + + N+ + + + ++I E Sbjct: 113 DSNFKTLSITAISQYKMPFKICNIHLL---------CPKNKYVTVPVLSSMKIGRNEGSD 163 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +V+D+S SM ++ ++ +++ K ++ L+ + + Sbjct: 164 IDLMIVLDVSSSM----------DDNFMKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYAN 213 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G +G+ V+ + G +++ T+S MK A++ L Sbjct: 214 VFRTGSVGFNDMVQFPMPLKRG---LKRIYNDIKKYRAFGSTNSYVGMKYAWEQLY---- 266 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 G + +K +IFLTDGEN + TI++C+ K+ I +I++ Sbjct: 267 ----------GNPQDTKDRKKIVIFLTDGENMIINATRKTIELCNDMKKKKAVIYSIALA 316 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +L+ C SS Y +A SL+ + I + + Sbjct: 317 VDN--KEVLQGC-SSSGNVYAADDAQSLVQAYSLIGKDV 352 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 141 bits (356), Expect = 2e-31, Method: Composition-based stats. Identities = 54/443 (12%), Positives = 115/443 (25%), Gaps = 69/443 (15%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 +F +E+ I F + F+ LI + + +K + A A LA A + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALAIE 70 Query: 68 MVSNLSRLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 +S I N + S Sbjct: 71 NNEIPDEPQQIKNNALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVTMAYPAKFLS 130 Query: 125 SRISMTHMANNRLDSSNNT---IFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVSF 176 +++ + + T + + +L + + + Sbjct: 131 QSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDA 190 Query: 177 IPALLRI-------EMGERPIFLIELVVD----------LSGSMHCAMNSD--PEDVNSA 217 + + R I I H + N Sbjct: 191 LRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYL 250 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEKNIEPSW 271 + + L + + +D + + + I + ++ + S Sbjct: 251 SDEIKRSSNTL-LLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSL 309 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E+ + ++ T + + A I + K +K Sbjct: 310 EQEQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNG----------------HKK 353 Query: 332 FIIFLTDGENNNFKSNVN---------TIKICDKAKENFIKIVTISINASPNGQR----- 377 +I L+DG + + + +C++ KEN I++ I+I SP+ + Sbjct: 354 LMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYH 413 Query: 378 -LLKTCVSSPEYHYNVVNADSLI 399 K CV + +Y NA L Sbjct: 414 INWKKCV-GEDNYYEAHNAHELE 435 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 81/417 (19%), Positives = 144/417 (34%), Gaps = 46/417 (11%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 +FR + I NF I+ A+++ G + V + + ++ Sbjct: 7 KFRSLI-QLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQQ---------- 55 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + N A +F+ + LSG + + N + Sbjct: 56 --ATDAAALATATALANGNATTSNAQQLATQFVTGQMSNYLSGDTNTADALKAGTTANVT 113 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 +S+ Y + V SYD + + LL + + Sbjct: 114 ----------SATNSSGGTSYTVAVNASYDMSVNGMSQLLGIKTMHVSAASTSTSGSAAA 163 Query: 186 GERPIFLIELVVDLSGSMHCAMN--------------SDPEDVNSAPICQDKKRTKMAAL 231 ++ +E+ +D SGSM + K+AAL Sbjct: 164 AKQAALSMEIALDKSGSMLLNTDVIDTSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAAL 223 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ LD +D + V I +++ V+ + +WGT R Sbjct: 224 KTAVGTLLDQLDSADPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRS--NVISGLNANGG 281 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+S+ M AY+ +++ + + FQK I+ +TDGENN S+ T+ Sbjct: 282 TESSAPMALAYKNVSAS-------SEATAQAAKGNTTFQKIIVLMTDGENNATSSDTKTL 334 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 C AK+ + I +++ A GQ LLK C SSP +++ LI F+ I Sbjct: 335 ATCKAAKDAGVLIYSVAFMAPDRGQTLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQ 391 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 55/446 (12%), Positives = 118/446 (26%), Gaps = 64/446 (14%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 RF + S + +++ A + LL F I + K + A +AA AG Sbjct: 71 VRFALTQRDDSRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 130 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH--------IKESLSGYSAVFYNTE- 117 + + + A + + + + + G + FY T+ Sbjct: 131 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKV 190 Query: 118 ---------------------IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + + + + D S M V Sbjct: 191 PTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISL 250 Query: 157 RLQFIEHLLNQRYNQKIVSF-IPALLRIEMGERPIFLIELVV--DLSGSMHCAMNSDPED 213 L + + Y+ + I + I +E + + D Sbjct: 251 VLDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYD 310 Query: 214 VNSAPICQDK--------KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + T L A+ FL + ++ + + Y + Sbjct: 311 EYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVL---DQTPQNEQVSIASYNSHGTL 367 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + VR V + T M T + Sbjct: 368 DCWLLDDFDSVRAAVAQL---GPNGSTGIGNGMNSGKTAFTHEN---------------A 409 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 K ++ +TDG +N + + + I T++ + + + + V+ Sbjct: 410 RPYASKTMVVMTDGNHNYGTQPNTVAQ--QLMSSSNLNIQTVTFGGGADQETMQEVAVTG 467 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH 411 HY+ + D L+ F+ I+ + Sbjct: 468 LGRHYHADSGDELVSAFEEIANNLPT 493 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 141 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 52/443 (11%), Positives = 114/443 (25%), Gaps = 69/443 (15%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 +F +E+ I F + F+ LI + + +K + A A LA + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIE 70 Query: 68 MVSNLSRLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 +S I N + S Sbjct: 71 NNEIPDEPQQIKNNALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVTMAYPAKFLS 130 Query: 125 SRISMTHMANNRLDSSNNT---IFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVSF 176 +++ + + T + + +L + + + Sbjct: 131 QSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDA 190 Query: 177 IPALLRI-------EMGERPIFLIELVVD----------LSGSMHCAMNSD--PEDVNSA 217 + + R I I H + N Sbjct: 191 LRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYL 250 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEKNIEPSW 271 + + L + + +D + + + I + ++ + S Sbjct: 251 SDEIKRSSNTL-LLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSL 309 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E+ + ++ T + + A + + K +K Sbjct: 310 EQEQYINNIDNIIEMEPHGWTLISSGILSANNLFKNKAKNG----------------HKK 353 Query: 332 FIIFLTDGENNNFKSNVN---------TIKICDKAKENFIKIVTISINASPNGQR----- 377 +I L+DG + + + +C++ KEN I++ I+I SP+ + Sbjct: 354 LMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYH 413 Query: 378 -LLKTCVSSPEYHYNVVNADSLI 399 K CV + +Y NA L Sbjct: 414 INWKKCV-GEDNYYEAHNAHELE 435 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats. Identities = 53/443 (11%), Positives = 114/443 (25%), Gaps = 69/443 (15%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 +F +E+ I F + F+ LI + + +K + A A LA + Sbjct: 11 FNHFTLFKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIE 70 Query: 68 MVSNLSRLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 +S I N + S Sbjct: 71 NNEIPDEPQQIKNNALVLSYVNAYLPSKKFLVPIININDNTHYLEYNAAVTMAYPAKFLS 130 Query: 125 SRISMTHMANNRLDSSNNT---IFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVSF 176 +++ + + T + + +L + + + Sbjct: 131 QSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDA 190 Query: 177 IPALLRI-------EMGERPIFLIELVVD----------LSGSMHCAMNSD--PEDVNSA 217 + + R I I H + N Sbjct: 191 LRSAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYL 250 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEKNIEPSW 271 + + L + + +D + + + I + ++ + S Sbjct: 251 SDEIKRSSNTL-LLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSL 309 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E+ + ++ T + + A I + K +K Sbjct: 310 EQEQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNG----------------HKK 353 Query: 332 FIIFLTDGENNNFKSNVN---------TIKICDKAKENFIKIVTISINASPNGQR----- 377 +I L+DG + + + +C++ KEN I++ I+I SP+ + Sbjct: 354 LMIILSDGVDTDDFPSSKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYH 413 Query: 378 -LLKTCVSSPEYHYNVVNADSLI 399 K CV + +Y NA L Sbjct: 414 INWKKCV-GEDNYYEAHNAHELE 435 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats. Identities = 37/321 (11%), Positives = 79/321 (24%), Gaps = 23/321 (7%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L+R + N I+ ALS++ L+ +G + + + M+S +AA++A Sbjct: 100 LARITGALRSLERDRGGNVGIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA 159 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ + + + + I N Sbjct: 160 AVKQINNTEDTDALKEKVSDWFHAQVDNSYTLGDIDIDTVNH---------NITATANGT 210 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + S + + + +L Sbjct: 211 VPTTFMKIANIETVPVSVASAVKGPATSYLNVYVVIDTSPSMLL-------------AAT 257 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I + N A K + + +A+ LD I Sbjct: 258 TSGQSTMYSGIGCQFACHTGDAHTVGKTKYANNYAYSTAKKIKLRADVAGDAVREVLDMI 317 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D E + +GL + + + P+ T+ R + L + + + Sbjct: 318 DESDENHERIKVGLYSLGDTLSEVLAPTLSTDTARTRLADASYGLTSATSKAATYFDVSL 377 Query: 303 QILTSDKKRSFFTNFFRQGVK 323 L +K Sbjct: 378 ATLKQKVGAGGDGTSSGSPLK 398 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 140 bits (353), Expect = 4e-31, Method: Composition-based stats. Identities = 54/414 (13%), Positives = 131/414 (31%), Gaps = 61/414 (14%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + ++ A + L+L+ F V + + + +AA AG + SR Sbjct: 16 RRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQA 75 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 R + +K + + I+ + + + N I ++ ++ Sbjct: 76 RVVAKDIFSKNLIGGRELKLHNKDIEFGRTDANPDGTWEFLPNERPFQAIRISVNLDDN- 134 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 S + + A + + I L + Sbjct: 135 --------RQKGRNGSVPLLF-----------GKVLGQSSFATNHSSVAANLVHEIVLCL 175 Query: 198 DLSGSMHCAMNSDPEDVNSAPIC--------QDKKRTKMAALKNALLLFLDSIDLLSHVK 249 D S SM + ++ A L+ A+ +F+D++D L V Sbjct: 176 DRSHSMCFDETGVDYAYPPGTPSYPAGYITPPNPVGSRWAKLQGAIQVFVDTLDDLQIVP 235 Query: 250 E----------DVYMGLIGYTTRVEK----NIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + R ++ V + + +++ T+ + Sbjct: 236 DVGVVTWGSDITLSWSWYPFQGRSFPAVMVDVPLGQNLNLVSPAIAAKLGDIMMGGTNMS 295 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + ++ +LT++ QK II ++DG+ N + N + + Sbjct: 296 SGIDRSVSLLTANG---------------THSLAQKTIILMSDGQWNAGR---NPLDAAN 337 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 A + I I TI+ + + + + + +N + +SL F+ +++++ Sbjct: 338 DAADKNITIHTIAF-LNGDQSVMRQIAERTGGKFFNAPDGESLEDTFKELAKML 390 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 139 bits (350), Expect = 7e-31, Method: Composition-based stats. Identities = 44/464 (9%), Positives = 115/464 (24%), Gaps = 89/464 (19%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + S FAL + L L G +I ++ + ++ + +A + A+L A+ S Sbjct: 7 QGGGLSPTFALMLTGVLALTGVVIELVRGYSGQSLLSAAAD-AVLYSAADSDSAAEDAAA 65 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + A + + L G + L Sbjct: 66 LVRANLAGRHLQVGPPALSQNEQEAQVILQGEVPALMALSAIGT------------SGDL 113 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLN---------------------QRYNQKIVSF 176 + + L + +R ++VS Sbjct: 114 PVAAAARASSARTRIEIALVLDVSNSMSGAPMKAIKQGLAEFGEVLFGRERRNQDRVVSI 173 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--------- 227 IPA + +G+ P + + + ++R Sbjct: 174 IPATGLVNIGDHPELFHPESLTFPFGLQTLAHERGWSNLLTREVPGRQRKAFCARLPEHV 233 Query: 228 --MAALKNALLLFLDSIDLLSHVKEDVYMGLIG--------------------------- 258 + L ++ ++L + + Sbjct: 234 DGIDRLAELTPGWIRKLELAPRGEAQPRLHYSTKPPAIQQYEDGTPLRAFAPRENPLERY 293 Query: 259 ----------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + Y T++ + +++L+ Sbjct: 294 LENRRDKLGIFDDPDCGVSPIQAHLSTRAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQ 353 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---NVNTIKICDKAKENFIKIV 365 + + +K ++ +DGE+ ++ + + +C + K I++ Sbjct: 354 WQGRWQQGAAELPRPYGQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVY 413 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 T++ + C S Y + + + V ++ + Sbjct: 414 TVAFEGDAR---FVAQCASERSLAYKATSGN-IRTVLTRLASAI 453 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 139 bits (349), Expect = 9e-31, Method: Composition-based stats. Identities = 35/357 (9%), Positives = 92/357 (25%), Gaps = 32/357 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R E+ I + LL G + + + +++ + A+LA Sbjct: 1 MRRITTRAAALRRDERGGMIIFGLFVFLLLLLAGGMAVDFMRTETARGRLQATLDGAVLA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A +L + D E + ++ +A +D + + + Sbjct: 61 AA-----DLDQDKDPVEVVRDYVAKAGLDPFLIDVDVTEIAGQRIVT-------ASAKSD 108 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + M + + L + + L + + +Q + + + Sbjct: 109 VTMHFMKMVGIDFLPAPARSTASEAVSNLDVSLVLDMSGSMEGDKLDQLQAAAKNFVGIV 168 Query: 184 E----MGERPIFLIELVVDLSGSMHCAMNSD----PEDVNSAPICQDKKRTKMAALKNAL 235 + + ++ ++ ++ T+ + L+ A Sbjct: 169 YDTMGAEKILLNVVPYATQVAAPAGLLDMLGAFLREHSYSNCVSFSAADFTETSILEAAA 228 Query: 236 LLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 L D + + + + + ++ Y+ + T Sbjct: 229 LPQGGHFDPFYTWGPLRYDDVTFVCNPDPSTEVLTLASTQREIEDYIDGLV---AEGNTS 285 Query: 294 STPAMKQAYQILTSD-------KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 MK ++ D K I+ +TDG+N Sbjct: 286 IDVGMKWGAALIDPDLGSTLNEFANGPSAAGINPVALWGDRSTDKVIVLMTDGKNTT 342 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 51/179 (28%), Gaps = 1/179 (0%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++D V++ G R R + Sbjct: 357 YIDDARDEFWVRQTDDRDADDDNRYSNDTWFWVDGRVAGRNNFWRRDNRFPHGGGRIEDN 416 Query: 298 MK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ Y+ L S + + + D + N IC Sbjct: 417 VRRLTYEELFSRVSVYYNAYYHHYLQNFDRTELDTWYWDFLDMSLSTSAKNARLEAICTA 476 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 AK +++ T+ + +++ C SS + + V L F++I++ + + + Sbjct: 477 AKNQGVQVFTVGFEVEDDEAIIMEDCASSRAHFFRVSGGGDLTTAFESIARQITELRLT 535 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 139 bits (349), Expect = 1e-30, Method: Composition-based stats. Identities = 41/364 (11%), Positives = 88/364 (24%), Gaps = 39/364 (10%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R F SEK + + V + G I + + M+ + A+L+ AS Sbjct: 24 MRKVFSNWRKSEKGSMTAFGIFIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSAAS 83 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + LG +K + D + + A Sbjct: 84 LLQT----LGAEAVVTDYMSKANIDVDYNLSVNVSEGINFRAVDAT-------ATATLET 132 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + + + + L + + L + + + + + Sbjct: 133 LFLGLLNIDSLGITVTSGAEERIPNLEISLVLDVSGSMGSNSRLTNLKTAATQFVSTIIS 192 Query: 184 --EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 G + +I ++ S ED + C + ++ L Sbjct: 193 GGSGGTVAMSIIPFSSSVTPSQSVIDAITMEDNHDYSTCIEFADDDFSSSSLDLDSTYKR 252 Query: 242 I-----------DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + Y + + S + + L Sbjct: 253 AVFTSRYSDTGSGDFDDADDFNQDWRSCYMDEYFELLAYSDDETVLY---NKIQGLLAQG 309 Query: 291 PTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T MK +L + + V K I+F++DG N Sbjct: 310 STAGHTGMKWGTSLLDPEFQAVTNSMIAAGVVDAAHAGMPVAYSDTNTMKIIVFMSDGNN 369 Query: 342 NNFK 345 + + Sbjct: 370 HTQR 373 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 33/75 (44%), Gaps = 1/75 (1%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +S+ C AK+ I I TI+ A N + L C +S ++Y+ + V Sbjct: 503 TGSQSDTLMSANCTAAKDRGITIFTIAFEAPSNAETQLNNCATSDNHYYDAQGTS-ITSV 561 Query: 402 FQNISQLMVHRKYSV 416 F +I+ + K ++ Sbjct: 562 FSSIATTIQKLKLTL 576 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 42/419 (10%), Positives = 103/419 (24%), Gaps = 56/419 (13%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 A+ + L ++G + + + +++ S ++ Sbjct: 1 MAVLLPVILGIMGLGLDSGMLYLSHSRLQA--------AVDAAALAGSLQLPYDPAMDKG 52 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 RA +D+ A + + I M + Sbjct: 53 LVRAAVDEYMHANFPQAVVQSVLPGAEERSVTVNAEATVGTIFMGALGIGSSTV--RAQA 110 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI----EMGERPIFLIELVVDLSG 201 + + + N+ + + I I L+ + Sbjct: 111 SAGYNNLEVVFVIDNSGSMKGSPINETNAAATRLVDLIMPEGMATSVKIGLVPFRGKVRI 170 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 P DV+ P + ++ LL + + + + + Sbjct: 171 ---------PADVDGLPSGCRNADGSLN--EDGLLDEYKKPEYRYPYNDRLRVTPYS-CS 218 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK----------- 310 + + + Q + R T + +K A +LT + Sbjct: 219 SIPLTQGLTADRATITQAIGRQDARGDSSGTVISEGLKWARHVLTPEAPFTEGSSAKDMR 278 Query: 311 ------RSFFTNFFRQGVKIPSLPFQKFIIF--------LTDGENNNFKSNVNTIKICDK 356 T G + + N + Sbjct: 279 KVIILLTDGDTEDGNCGGNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVLNNAMLSEAAL 338 Query: 357 AKENFIKIVTISINASPN-GQRLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLMV 410 AK+ I+I I +S + L++ SS +++++ + + VF+ I + + Sbjct: 339 AKDAGIEIFAIRYGSSDAVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDVFKLIGRQLG 397 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 47/448 (10%), Positives = 110/448 (24%), Gaps = 58/448 (12%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + ++ +IF ++ + + + + A LA A+ + S+++ Sbjct: 8 SRQQGVAVVIFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVAANISSDITEN 67 Query: 76 GDRFESISNHAKRALIDDAKRFIK-----NHIKESLSGYSAVFY---NTEIQNIVNSSRI 127 ++ + + I S G + V+ Q V S Sbjct: 68 KTLAKNYVDGFVPDGTISLADINIERKSCDEIYGSQCGVAGVYDEEGLVFTQYKVTLSSE 127 Query: 128 SMTHMANNR--------LDSSNNTIFYNMDV-MTSYDYRLQFIEHL--LNQRYNQKIVSF 176 + + ++ + + F + R + Sbjct: 128 FESWYPEDDFAPGFEEIVELGGTAVARKYQGFTIDVAFVADFSGSMQQTWNREIKYKGVV 187 Query: 177 IPALLRIEMGE----------------RPIFLIELV-VDLSGSMHCAMNSDPEDVNSAPI 219 E + I +GS + + + Sbjct: 188 NVISDITRKLETFNDHTEQELNGKKVANKVAFIGYNFYPHNGSTFYSNVDYKANYSRLSY 247 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + ++ + A + + + + L + + + + R Sbjct: 248 KWQENIPEINYRRTARDPINNKRTPIIGRYVNNTIPLYSDDSYFYTL-DLTDNFTQFR-- 304 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII----- 334 T S + +A +I+ + + G + Sbjct: 305 -NTISTFYPDYGTASYEGIIEAAKIVNNGENIRKLIIVLSDGEDSINENNPYDNRYPGFI 363 Query: 335 ---FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 G N +++ + +I + KI I LK C Sbjct: 364 APLIYQSGLCQNIINDLESKEI--NGRNVEAKIFVIGFGYDLEKNPGLKICAGEEN---- 417 Query: 392 VVNADSLIHVF----QNISQLMVHRKYS 415 V +ADS +F Q IS+ + H Y Sbjct: 418 VQSADSYQEIFDTVLQLISEEVGHLYYR 445 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 55/446 (12%), Positives = 118/446 (26%), Gaps = 64/446 (14%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 RF + S + +++ A + LL F I + K + A +AA AG Sbjct: 8 VRFALTQRDESRRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGR 67 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH--------IKESLSGYSAVFYNTE- 117 + + + A + + + + + G + FY T+ Sbjct: 68 AFSEEQTVEAAKAAARLTAAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKV 127 Query: 118 ---------------------IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + + + + D S M V Sbjct: 128 PTSDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISL 187 Query: 157 RLQFIEHLLNQRYNQKIVSF-IPALLRIEMGERPIFLIELVV--DLSGSMHCAMNSDPED 213 L + + Y+ + I + I +E + + D Sbjct: 188 VLDRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYD 247 Query: 214 VNSAPICQDK--------KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + T L A+ FL + ++ + + Y + Sbjct: 248 EYDLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVL---DQTPQNEQVSIASYNSHGTL 304 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + VR V + T M T + Sbjct: 305 DCWLLDDFDSVRAAVAQLA---PNGSTGIGNGMNSGKTAFTHEN---------------A 346 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 K ++ +TDG +N + + + I T++ + + + + V+ Sbjct: 347 RPYASKTMVVMTDGNHNYGTQPNTVAQ--QLMSSSNLNIQTVTFGGGADQETMQEVAVTG 404 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH 411 HY+ + D L+ F+ I+ + Sbjct: 405 LGRHYHADSGDELVSAFEEIANNLPT 430 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 75/439 (17%), Positives = 162/439 (36%), Gaps = 71/439 (16%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L R F+ K ++ F I+ A + L+ + F+I + + KN ++S+ + AI+ Sbjct: 9 ILRRIYFFSK----NKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAII 64 Query: 63 AGASKMVSNLSRLG---------------DRFESISNHAKRALIDDAKRFIKNHIKESLS 107 +G S +VS+ + N + L+ Sbjct: 65 SGCSIVVSDPKINDLNPQEERIRDVIKKNAYVNMVQNFPA-EHAAYIIENANISFSKDLT 123 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS-----YDYRLQFIE 162 E ++ ++ + + N + ++ + + + L Sbjct: 124 NKYEYKITMEAKHQLSGKNFILGFLMPNVITHISSISTGIIQKPSDKKAFSVEMVLDCSG 183 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 +L+ C ++S + Sbjct: 184 SMLDSMQES---------------------------------CDLSSGRGGYYFYSKNNN 210 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDV--YMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K ++K+ ALK A F++ I + +GLI + + ++ + S +++ + Sbjct: 211 KPKSKIYALKTASSDFVNLIQETVQTFPQISARIGLITFNHYIMQDSKLSNNFNVIKKTI 270 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +R TD+ M AY+ L + N + ++P +++II +TDGE Sbjct: 271 SRMK---PKGGTDTFLPMNAAYEYLNN------IPNETKAHNISDNVPLKRYIILMTDGE 321 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSL 398 NN+ ++ TI +CD A++N I I +I +N +G L + C SS ++ + N +L Sbjct: 322 NNHPSYDLKTINVCDNARKNGIIIYSIFLNYYEYTDGYELARKCASSEKHFFYANNTKAL 381 Query: 399 IHVFQNISQLMVHRKYSVI 417 + F++I+ + + + Sbjct: 382 LDSFKSIAHAIQDKAVRIA 400 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 138 bits (346), Expect = 2e-30, Method: Composition-based stats. Identities = 50/438 (11%), Positives = 112/438 (25%), Gaps = 55/438 (12%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R+ + +E N +IIFAL+++ L +G I K + ++ +AA+LA Sbjct: 6 RYLGLLSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAAMLAAT 65 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + + + + N S V + Sbjct: 66 GYTAMRGTAADAKTSATNMYNGQMSSHKLTSNSLNITVTD----SVTARTVTGTASVVVN 121 Query: 126 RISMTHMANNRLDSSNNTIFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + + ++ + + Sbjct: 122 TAFMYMFGFPTMTVTASSSASASFPTYMDFYVLVDNSPSQGLG---------ATTADMTT 172 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + + ++ +++A D+ Sbjct: 173 LQNATSDKCAFACHDTYTSSTKKTLQTNSY-YQIAKNKGVTMRIDVVRSATQSLTDTATS 231 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR----DMDSLILKPTDSTPAMKQ 300 V M + + + + + +D + + + M Sbjct: 232 SQVVSNQYRMAVYSLGSDCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCT 291 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI------- 353 + S S QK++ F++DG + + + Sbjct: 292 DFDGAMSGMNGVIPAQGDGS-----STSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQ 346 Query: 354 -------CDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHY 390 CD K IKI T + + N ++K+C S P Y+Y Sbjct: 347 EPLNTTTCDTLKARGIKIAVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCAS-PGYYY 405 Query: 391 NVVNADSLIHVFQNISQL 408 V ++ S+ + Q Sbjct: 406 EVDSSGSIGAALTALFQQ 423 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 63/488 (12%), Positives = 123/488 (25%), Gaps = 105/488 (21%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I +E + F + F+ L+ + + KK + A A LA + Sbjct: 17 FIKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATLALTVDNDESPD 76 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS----RISM 129 + E S + +++ +S I + +I Sbjct: 77 DDNIKKEKNSKFIINYAKAYLPNEKFSKPVINITSHSDYINYQVDMTIYYPTKILNKIFQ 136 Query: 130 THMANNRLDSSNNTIFYNM--DVMTSYDYRLQFIEHLL------NQRYNQKIVSFIPALL 181 T + + + Y T + + + ++ +KIV+ Sbjct: 137 TVSPEVSISDNARALKYTTTDSKPTDVVFVADYSGSMNEYFDESDESDEKKIVALRRIFK 196 Query: 182 RI-------EMGERPIFLIELVVD-----------------LSGSMHCAMNSDPEDVNSA 217 I + I + S D + Sbjct: 197 DIQNEIKYNNVNIDIIGFVPFSWGTKNFYSSTLANMEKESFCHFPFVPNQFSPSSDYLAK 256 Query: 218 PICQDKKR----TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---- 269 +K K LKN L +D I + + ++ ++ Sbjct: 257 YNFSEKYNISDLKKFPELKN--LDIVDRIKNGECTYNNYRKIINEINDKLHYEVDLDESY 314 Query: 270 ------SWGTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 S Y D + + T + ++ +A + + +G Sbjct: 315 VNEIYDSLDLACNMAYFDEIADIVESNIDYTKTLESINKADNTINIKMVDMPNNSICLRG 374 Query: 322 VKI----------------------------------------PSLPFQKFIIFLTDGEN 341 K + K +I ++DG++ Sbjct: 375 SKAFTFDRHNRNNTSISKILGTRATGGTLISSGILTGNNIFLETDNSYNKLMIIISDGDD 434 Query: 342 ----------NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 N N+ +C+K K+N IK+ I+I P + CV E Y Sbjct: 435 SRQTDKEKRYYNISKNLIKDGMCEKIKDNGIKMAFIAIGYVPREDIDWRRCV-GEENFYF 493 Query: 392 VVNADSLI 399 NA L Sbjct: 494 AKNAHELE 501 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 52/441 (11%), Positives = 122/441 (27%), Gaps = 67/441 (15%) Query: 6 RFRFYFKK-GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 RF+ Y I E F + + F+ L+ + + K + A A LA Sbjct: 8 RFKQYSVGIFIKDEIGAILWPFIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLAL 67 Query: 65 ASKMVSNLSRLGDRFE----SISNHAKRALIDDAKRFIKNHIK----ESLSGYSAVFYNT 116 + +N + + + + + I + + Y Sbjct: 68 TIENNTNNPDETQTEKNISLVNAYARAYLPSESFSAPVIDIISHPNYIEYRAATTLNYTP 127 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + + I + ++ + F + +T + + + + ++K + Sbjct: 128 KFLTKELITNIDRRIIVSDNG-VAIKNKFTSPGEITDVVFVVDYSVSMDGNFGDEKKTTK 186 Query: 177 IPA----------LLRIEMGERPIFLIELVVD----------LSGSMHCAMNSDPEDVNS 216 I + I + H S Sbjct: 187 IQELRRIFEDLNNTILKNNNTHTIGFVPFSWGTKKIIGKGIHRKIYCHFPFVPKTPMPPS 246 Query: 217 APICQDKKRTKMAALKNALLLFLD---SIDLLSHVKEDVYMGL-----IGYTTRVEKNIE 268 + K L +A+ +D +I ++ + + + + + Sbjct: 247 YYLGDLKSYNPAKELTDAVKNNIDYDETIKSITANYNFINIPIDDIKPSSFCLKGSDAYT 306 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + +++ + T + + A I + S Sbjct: 307 LRSD-DITNDNIQENIEHEVNGLTLISSGILVANDIF-----------------RKDSKN 348 Query: 329 FQKFIIFLTDGENNNFKSNVNT---------IKICDKAKENFIKIVTISINASPNGQRLL 379 K +I ++DG + S++ +C++ KEN I++V I I + + Sbjct: 349 KDKLMIIISDGNDQEISSDLTQEKITKTLIEKGMCERIKENNIRMVFIGIAYTVKEIK-W 407 Query: 380 KTCVSSPEYHYNVVNADSLIH 400 + CV +Y NA L Sbjct: 408 EDCVGKRN-YYEAQNAHELEA 427 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 45/463 (9%), Positives = 115/463 (24%), Gaps = 64/463 (13%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + + +I+F + + + K +E A AA+LA ++ + Sbjct: 1 MKLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEED 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + ++++ + K E + ++ V+ ++ Sbjct: 61 EVSTQTGKDYVAHYLHDMSSLVDIKVEKLECSELPECTADDNDRPFVEYQVSGRTKHISW 120 Query: 132 MANNRLDS----------SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN---QKIVSFIP 178 N + S+ + + L F + + IP Sbjct: 121 FPGNDVTVGFGESFDVTGSSKARKFQSSQPMDITFILDFSGSMNYDWEGHAPSYMEEEIP 180 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + ++ VV + + ++ + + + Sbjct: 181 KVPGRYSPPSRLSDLKYVVQ-MVTDELQVYNNSTAGPKHRVAMTGYNRRTVNESSNGKFV 239 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + + G Y + G + + T + Sbjct: 240 IRDQRITKYNSDGYDAGDTFYPKKTINKQFMVKGAAARVPNGDEKAEFTDIMYTSDFASF 299 Query: 299 KQAYQILTSDKKRSFFTNFFRQG--------VKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + + + R ++ II L+DGE+ N Sbjct: 300 NHKIKSFEAFGGTASLQGIIRASQIVSYHITNDGEEANPKQLIIILSDGEDFNHYLGQTE 359 Query: 351 I----KICDKAKEN----------------------------------FIKIVTISINAS 372 +CD + ++I I Sbjct: 360 TLVDYGMCDNLRNAIEGGPVSSEDNKEDKIVFSSGSSSGLPTNTGEDPSVRIAMIGFGDG 419 Query: 373 PN--GQRLLKTCVSSPEYHYNVVNADSLIH-VFQNISQLMVHR 412 + L CV E ++ N D +++ + +S+ + H Sbjct: 420 YDIHANTGLLNCV-GEENAFSANNKDEILNLIMSLVSEEVGHL 461 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 136 bits (343), Expect = 4e-30, Method: Composition-based stats. Identities = 39/467 (8%), Positives = 105/467 (22%), Gaps = 97/467 (20%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ S + +I+ +M + GF + ++ + ++ ++ A + A+LA A Sbjct: 18 RSSRHLRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAA 77 Query: 66 -----------------------------------------SKMVSNLSRLGDRFESISN 84 S + + D S S+ Sbjct: 78 DLDQELCPRVVVNDYISKEGFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSDSD 137 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYN--------TEIQNIVNSSRISMTHMANNR 136 + A +G S E +N M + Sbjct: 138 PSDTASSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKWSGVDT 197 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLN--------------------QRYNQKIVSF 176 ++S+ + L + + +S Sbjct: 198 INSTAVSAAEESIGNVEISLVLDVSGSMEGAKLTNLQKAAKDFVKEMLEKSADDSLSISI 257 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 IP ++ + + + I + ++ ++ Sbjct: 258 IPYSEQVGVPDYMMDKINTTGGNKVANCIEFQPADFTAIPFTAFSIGAPSEATNPPPSVP 317 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-----------------EKVRQY 279 L + + + V + + Sbjct: 318 QSLHFTNRSNDFRRGGNRDHRSTNDVVSRFSPWDANFPCREDTPTDRREMVVIQNDLDTL 377 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSD---------KKRSFFTNFFRQGVKIPSLPFQ 330 + + + T +K +L + F + + Sbjct: 378 NKQINNLVAAGSTSINIGLKWGLALLDESIQPLIKTVANDTNVPKIFEDRPRPTNTTDTL 437 Query: 331 KFIIFLTDGENN--NFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 K ++ +TDG+N+ D + ++ ++ + Sbjct: 438 KVVVLMTDGKNDLQRAVVPPYNDGPSDVFWNAGAQEYSVLVDEDNSQ 484 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 41/107 (38%), Gaps = 1/107 (0%) Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + + I + + K N + +C KA+E + I +I+ Sbjct: 562 WAHTEIKAIESLFRRTKGDTYADDFIRNSIVTADISKKNEQVVSLCGKAEEKEVLIFSIA 621 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 A + +++LK C P +Y + VF +IS + + + + Sbjct: 622 FEAPSSVKQMLKDCAVKPARYYEATGT-QIERVFDSISTSIQNLRLT 667 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 136 bits (342), Expect = 6e-30, Method: Composition-based stats. Identities = 61/559 (10%), Positives = 149/559 (26%), Gaps = 144/559 (25%) Query: 1 MHLLSR----FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 M ++ R + + + + N +++FAL++ +L+ + + + +++ A Sbjct: 1 MRIMKRTVSALKTFGTRFSDDRRGNVAMMFALALPPMMLMTLGGVDIARVSTVRMNVQDA 60 Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH--IKESLSGYSAVFY 114 +AA LA A ++ R+ + D + + + + A Sbjct: 61 LDAATLAAARSQYTDNPRINAVGLAALQANLAPYGDVTLDTTQTNFRLNTATGAVEADAK 120 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR------ 168 + N +++L ++ ++ + + + + Sbjct: 121 VNVRALVANIFLPPYGQFFDDQLPANAHSEVLRSNNRIEVALVIDNTGSMDGAKLTNTKT 180 Query: 169 ------------------YNQKIVSFIPALLRIEMG----ERPIFLIELVVDLSGSMHCA 206 + +S +P + + + P + +G Sbjct: 181 AAIDLINRLEAADGRSIEQDAIKISLVPFSMTVRVAQGGTNTPPSFMSNADTHTGGGAWN 240 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALK---------------------NALLLFLDSIDLL 245 S+P + + ++ +F+ Sbjct: 241 SGSNPYSAFDTAVGRFTLFGRLNTTWGGCVESRPQPYDIRDTAPSSGTQATMFVPYFAPD 300 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWG-----------------------TEKVRQYVTR 282 + D + E N G V +Y T Sbjct: 301 EPDRADYPNHSTWQNWQYEGNDYLDDGRPGSNANSPFANTAARTTEWFARVRSVSRYSTT 360 Query: 283 DMDSLILK--------------PTDSTPAMKQAYQILTSD------------KKRSFFTN 316 ++L TD+ A++ A + + Sbjct: 361 PRNTLNTGFGPNRGCDLQPIIRLTDNYTALRTAVNNMIASGNTNVPLGTMWGWHTLSPNA 420 Query: 317 FFRQGVKIPSLPFQKFIIFLTDG--------ENNNFKSNVN------------------- 349 F G + +K II +TDG N+ N Sbjct: 421 PFGDGRPYGTERLKKIIIIMTDGANVMSDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRR 480 Query: 350 -------------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K+ I++ T+++ Q LL+ C + ++++ V +A Sbjct: 481 TRMDNRFDHATAATEDMCGNMKDKDIEVYTVAVQVDSTAQTLLRRCATDTDHYFPVDSAA 540 Query: 397 SLIHVFQNISQLMVHRKYS 415 + F I+ + + + S Sbjct: 541 GIGAAFDRIAGAIENLRIS 559 >gi|170724979|ref|YP_001759005.1| hypothetical protein Swoo_0614 [Shewanella woodyi ATCC 51908] gi|169810326|gb|ACA84910.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908] Length = 503 Score = 136 bits (341), Expect = 9e-30, Method: Composition-based stats. Identities = 54/489 (11%), Positives = 120/489 (24%), Gaps = 107/489 (21%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +I ++ ++ + + KKN + A AA LA + + Sbjct: 9 AKRQRGAAAIYLVFLLIPLFGMVFLALEGTRYIQKKNRLGDATEAASLAVSMANRDDKGY 68 Query: 75 LGDRFESISNH------AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR-- 126 + + + + K I ++ S + + + Sbjct: 69 ETQLAKDYISSYMRNIKEISQVKVERKEDIDHYPMADGSFEDREYTQYRVTAKTEHTSWL 128 Query: 127 -ISMTHMANNRLDSSNNTIF-----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + +N + Y D + F + R N + Sbjct: 129 HSDLIPSFKETETLANRALARAYPEYLGDRDVDIVFVSDFSGSMKGSRINSLKDAITEIS 188 Query: 181 LRIEMGERP-------IFLIELV-------VDLSGSM--------------------HCA 206 I + I L+ S M Sbjct: 189 NEILVPRDGETEIRNRIALVPYNMRVVEGDSGRSVCMTQLKYRNPSGKTGSNYTNYESIN 248 Query: 207 MNSDPEDVNSAPICQDKKRTK------------------------------------MAA 230 + K + Sbjct: 249 WREWANKSYNQVSSCVSNSRKCNGLPGPRADARTIKSVVNDAPSSYRSSKWPDSSNWIDY 308 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 ++ LF ++ + + H + + + + + + Sbjct: 309 SRSVDQLFNENSNNVQHHPSYQRLYSGSMCNGKFWTVPLT----NQKSEIMKVNQMSPDG 364 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN--- 347 T + + QIL + + + K K I+ L+DG + ++S Sbjct: 365 GTSVYQGLLRGAQILDKGRPVNPNEEELEEYNKR-----LKMILILSDGMESPYESTFSK 419 Query: 348 -VNTIKICDKAKEN----FIKIV--TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 VN +C+K + + + I I S +GQ K CV + +++ + L Sbjct: 420 LVNNYGMCNKIRAQFNDGELPLHMGVIGIKFSASGQNAFKNCVGADN----IIDVNDLDD 475 Query: 401 VFQNISQLM 409 + Q I L+ Sbjct: 476 LIQEILDLI 484 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 136 bits (341), Expect = 9e-30, Method: Composition-based stats. Identities = 48/455 (10%), Positives = 126/455 (27%), Gaps = 71/455 (15%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMV 69 K I + I+F + + ++ + F + + + + A A LA + K Sbjct: 1 MKSIKKNRGVAGILFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLALIASPKED 60 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + R+ + D F + + ++ +V+++ Sbjct: 61 EDKNVKYARYLVDRYILDNSEDVDVAVFTRKCEYKDGCVQASGELAPFSDFVVSATAKYT 120 Query: 130 THMANNRLDSSNNTIF-------YNMDVMTSYDYRLQFIEHLLNQRYNQKI-VSFIPALL 181 + ++ +D + + F + N N K+ + + + Sbjct: 121 SWISYEDVDLEPEFTVSGRAVTRKYLPQSVDVYFIGDFSGSMGNSWKNGKMKLDVVKETI 180 Query: 182 RIEMGE---------RPIFLIELV----------VDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + + L+ V ++ + N A Sbjct: 181 KRVVDDIEKFNTEEKSRVALLGYNPFHVKQTDKTVRVNAYGYYGSWRKKYAYNYARSSPG 240 Query: 223 KKRTKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQYV 280 +M K + + + E ++ + + +I + ++ R + Sbjct: 241 TTVRRMFDKPKLYNEILEPKRGMSRYEVERLHTHNVNFAKYYKFYDIPLTEDYDEFRSQL 300 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T S + A Q + K +L ++ I L+DG+ Sbjct: 301 MN-TKLQAGGGTSSWNGIIAAAQ----------------EANKATNLNPEQVFIVLSDGQ 343 Query: 341 NNNFKSNVNT--IKICDKAK-------------------ENFIKIVTISINASPNGQRLL 379 + + +C K + + + + I IN N Sbjct: 344 DGDKNYLQKLVDQGLCKKLRSTISAKRNRFQSNSPTEAEKTKVTMGVIGINYKVNESDGF 403 Query: 380 KTCVSSPEYHYNVVNADSLIH-VFQNISQLMVHRK 413 C + Y+ + + + + I++ K Sbjct: 404 GDC-FGKKNIYHAKDGEDVYKYILNLINEETGKLK 437 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 58/422 (13%), Positives = 132/422 (31%), Gaps = 70/422 (16%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES----ANNAAILAGAS 66 +K + + ++ A ++ + + F I V ++ + + A A LA A Sbjct: 9 SRKPRLARRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRAATDAAAKAGTLALAK 68 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ R +I A+ + A + ++ S A + N + Sbjct: 69 ---TDGDAASARTAAIQAAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQTPYTS 125 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + ++ + + F+ + + Q + Sbjct: 126 VKILSSMSDSTAAGS---------------VPLFLGTFMGRGSFQP--------AQSATA 162 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP--------ICQDKKRTKMAALKNALLLF 238 + I LV+D S SM M+ ++ AAL++++ LF Sbjct: 163 SQMEQEICLVIDRSHSMCFNMSGVEWSYPPGTKTTPHTICYPPHATLSRWAALQSSVNLF 222 Query: 239 LDSIDLLSHVKE----------DVYMGLIGYTTRVEK----NIEPSWGTEKVRQYVTRDM 284 +D+I ++ YT + E + S V+ + Sbjct: 223 MDTILETNNTPRVALITWGSTIGTNTAEYSYTKKTEVAVANELGLSTDYAAVKSKIAART 282 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 ++L T+ + + +L +K +I +TDG+ N Sbjct: 283 TKVMLGGTNMSAGIDAGRTLLN---------------GNTVRALAKKTMILMTDGQWNQG 327 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + A + I+I TI+ + + + + +Y N L F++ Sbjct: 328 RD---PIDAAEDAADEGIQIHTITFLSGSAQNTMRQVAEITGGKYYVSSNQAELEEAFRD 384 Query: 405 IS 406 ++ Sbjct: 385 LA 386 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 52/449 (11%), Positives = 131/449 (29%), Gaps = 69/449 (15%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R Y +K ++ N +++FA++ + L +G I + + ++ A + A+LA Sbjct: 1 MDRLLRYLRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + + L + +I+ + K +++ + + + + Sbjct: 61 GVAAGKAALDSGATQAAAIAMAQAASSSYFTGNTAKIDATPTINFTTMGRTLSGTGSATS 120 Query: 124 S-SRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + M + + + ++ + + +L Sbjct: 121 VMNTSFMRLVGFPTMTLNASSASSATMQPYLNVYLLVDISSSML---------------- 164 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I + + + H N + + ++ + + + L Sbjct: 165 -LPATQAGITQMRNGTGCALACHETTNG---TDSYSYALKNNVLLRYQVVNQGVQNLLTY 220 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ + K V +GL + ++ + + V + A A Sbjct: 221 LNSSAVYKNYVKVGLWSFDNQLTQLSSLTSSFSSVAA-------NFPAPGLAYNDA---A 270 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-----------NFKSNVNT 350 SF ++ G S QK +I TDG N+ + V Sbjct: 271 AATPFDSLIGSFVSSVGTAGDGSTSATPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFN 330 Query: 351 IKICDKAKENFIKIVTI-----------SINAS-------------PNGQRLLKTCVSSP 386 C+ K N + + I NA+ + LK+C Sbjct: 331 TAFCNTFKSNGVTVAIINTPYYPMTWDWGYNATLGQPGSLGGATRVDDIPIALKSCA--G 388 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + ++ + F + + + Sbjct: 389 SNFIIASDVATIQNAFTTLFNKASPVRLT 417 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 49/489 (10%), Positives = 119/489 (24%), Gaps = 90/489 (18%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +F FKK SE +++ A ++ ++ G I + A Sbjct: 6 MGYFKQFSDQFKKFNRSEDGVVAVLVAFLMVLLIVFAGMAIDFGLGFNTR--------RA 57 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + V ++ E SN + + +KN + + V + Sbjct: 58 VNQALDAAVLAVANKLSTTELSSNTVDSLIDQYFEENLKNSVGGDVVHTKPVVTYDPKGD 117 Query: 121 IVNS------SRISMTHM--------ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 V + + + L ++++ + + Sbjct: 118 TVAATATATVKTSFLPVLKLLNSESGDFGELTVTSSSTARFPKTKVEVAVVVDVTGSMSG 177 Query: 167 QRYNQK------IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + K + + +P I + V + D A Sbjct: 178 SIGSLKTASRDMLDTLLPDDNTRLQSRVRISYVPYNVGVK--------LDKTLARKATFE 229 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQY 279 + + A +++ + E V Y+ + + K+ Sbjct: 230 KSQYGCVHARVRDLAYSGENHDYEDEDDDERVDYIGTNYSWCPNAQMVPLTNDRTKIESS 289 Query: 280 VTRD--------------------------------------------MDSLILKPTDST 295 + + ++ Sbjct: 290 INALRASSATAGQIGIAWGWYTLSPEWRGFWPTESKPDFYDNNGVRKYAVLMTDGSFNAY 349 Query: 296 PAMKQA---------YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 A + + + + + + K + + Sbjct: 350 YAADYSKADAEHKKLIKNKSDVQNSQDPMDSGKLDADDHKKIASKVKWEYDYSSSLSGVP 409 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +C K+ I I T+ + G+++++ C S+ E Y+ N +LI F +I+ Sbjct: 410 FKTASNLCKNMKKEDIVIYTVFFGSDYKGKKIMEECASNSETFYHATNQSALIQAFSSIA 469 Query: 407 QLMVHRKYS 415 + S Sbjct: 470 NDIKSIYLS 478 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 25/216 (11%), Positives = 63/216 (29%), Gaps = 6/216 (2%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R R + +AN + IFA++++ L +G + + ++SM++A ++A+L Sbjct: 8 KRIRNSISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVLMV 67 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + +N + + +L +D F + S+ Sbjct: 68 SRDAAANPAMTSQQITDAVQRYFNSLYNDKSAFNVSVSAAYTPSTSSAAAKILASGQGAI 127 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP------ 178 M +L ++ + L + + + Sbjct: 128 ETDFMKIAGFPQLSFGTSSTSTWGNSRMRVALVLDNTGSMRDNGKMAALQRAAKDMIDSL 187 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + I +I D++ + Sbjct: 188 SAFAKTADDVYISIIPFAKDVNVDKSNYNAAWINWT 223 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 30/269 (11%), Positives = 75/269 (27%), Gaps = 36/269 (13%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S + + + + S + + T K+A Sbjct: 344 SVVDTDRSVVVSTGSGASCP-STTPNCSCTGSGRNRKCTQAKYKHYWRAHPTDTNQAKDA 402 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTT---------------RVEKNIEPSWGTEKVRQY 279 ++ + + ++ + + + Sbjct: 403 APAHSTWTGCINDRDQAYDISNADPSSGSSGTPSTKFYAEQLNGCLPATITPVSSQSSTL 462 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ + +Q L++ F K + +Q +++ L+DG Sbjct: 463 KNQIDSMSPSGSTNQAIGLAWGWQTLSTT------NGPFPAPAKDKAYVYQDYLVLLSDG 516 Query: 340 ENN-----------NFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSP 386 N + + +V +C K K++ I T+ +N ++L+ C S Sbjct: 517 LNTRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGNRDPLSQVLQDCAS-N 575 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +A+ FQNI + + + Sbjct: 576 GNFQMITSANQTADAFQNILTQISQLRIA 604 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 134 bits (337), Expect = 3e-29, Method: Composition-based stats. Identities = 60/483 (12%), Positives = 126/483 (26%), Gaps = 98/483 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + + + F + F+ LI + K+ + A A LA +++ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 74 RLGDRFE---SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R + S + + + + GY+ + ++ + + Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLK 136 Query: 131 HMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + D + N ++ + F + + ++ + I L + Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAI 196 Query: 184 ----------EMGERPIFLIELVVDL----------SGSMHCAMNSDPEDVNSAPICQDK 223 G I + S H + D N + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 224 KRT-----------KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + L +A+L ++ + + + + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 264 EKNIEPSW---------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKK 310 K + S ++ + R+ +++ + D + + L D Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPN 376 Query: 311 RSFFTNFFRQGVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN---- 349 N K +I L+DG++N K + Sbjct: 377 SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQK 436 Query: 350 -----------TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K K+N IK+V I I N K CV + Y NA Sbjct: 437 AGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAH 495 Query: 397 SLI 399 L Sbjct: 496 ELE 498 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 32/300 (10%), Positives = 84/300 (28%), Gaps = 17/300 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R ++ + N +IIFA+ + + L+G + + +++SA ++A L Sbjct: 6 ITRLTRLARRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAALM 65 Query: 64 GASKMVS-NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + + S++ R + A + F + + S + + Sbjct: 66 ISKDAATMSASQITTRARQYVDSLYTATDAPIQNFTA-----TYTPNSGSGASILLSANG 120 Query: 123 NSSRISMTHMA--NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-- 178 M + N L + ++ L + + S Sbjct: 121 TMPTYFMRVLGSNFNTLPVATSSTTKWGSTRMRVALVLDNTGSMAQNGKMAALQSAATDM 180 Query: 179 ----ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + G+ I ++ D++ S S + + Sbjct: 181 ITKLSAFNTTTGDVYISIVPFAKDVNVSTSNVSASWLNWTEWMAEPPYLSQNGYPNNLAS 240 Query: 235 LLLFLD---SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 L+ ID+ + + ++ S + + + +I Sbjct: 241 LVGITTPGVWIDMTGPGSNCPFTNSSYGFSCTDRPATLSNANSASKIPSSGNYKGMICPG 300 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/301 (10%), Positives = 78/301 (25%), Gaps = 29/301 (9%) Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLN-QRYNQKIVSFIPALLRIEMGERPIFLIELV 196 S Y TS L++ ++ V Sbjct: 306 KLSGKASIYYNGCYTSVSVTLKWTGSSATCSGKPSGCSCSGSGSSKVCTLPAYRHDWRNV 365 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTK------MAALKNALLLFLDSIDLLSHVKE 250 + + P ++ + N D+ + Sbjct: 366 LADPSATAPDPRVTPAPDSTLTQYAATNPSASVSNGGWTGCVNDRDQNFDTTNDAMTGSG 425 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + + + ++ T+ + +Q L + Sbjct: 426 IPSKLPYAEQWADCLPATITPMSNQWSTLKSQINAMTPSGNTNQAVGLFWGWQTLNTT-- 483 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-------NVNTIKICDKAKE---- 359 + F+ K P+ ++ +I+ L+DG N + + +C K+ Sbjct: 484 ----NDPFKAPAKDPNWVYKDYIVLLSDGLNTQNRWTQTVSDIDARQELLCKNIKDPAQN 539 Query: 360 --NFIKIVTISINASPNG--QRLLKTCVSSPEYHYN-VVNADSLIHVFQNISQLMVHRKY 414 N I + +I +N S ++L+ C + ++ + + F N+ + + Sbjct: 540 GGNQITVFSIQVNISSKDPTSKVLQDCATPGAGYFQMITQSSQTADAFNNVLATIAKLRI 599 Query: 415 S 415 S Sbjct: 600 S 600 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 39/376 (10%), Positives = 89/376 (23%), Gaps = 33/376 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R + N +I+ ALS++ L+ +G + + + M+S +AA++A Sbjct: 6 FGRCLHTLRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIA 65 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ + + + + A I+ N Sbjct: 66 AVKQINNTEDTDALKQKVSDWFHAQVENSYALGEIEIDTTNH---------NITATASGT 116 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S + + + +L Sbjct: 117 VPTTFMKIANIDTVPVSVASAVKGPATSYLNVYIVVDTSPSMLL-------------AAT 163 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I+ + N + + + +A+ LD I Sbjct: 164 TAGQSTMYSGIKCQFACHTGDTHTIGKKTYANNYDYSTEKGIKLRADVAGDAVREVLDMI 223 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D E + +GL G + + + P+ T+ R + L + + + Sbjct: 224 DESDSNHERIKVGLYGLGDTLTEVLAPTLSTDIARTRLADSSYGLTSATSKAATYFDVSL 283 Query: 303 QILTSDKKRSFFTNFFRQGVK----------IPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 L +K + K Sbjct: 284 ATLKQKVGAGGDGTTSGTPLKLVLLLTDGVQSQREWVTDKVNKNRTSGAYWNKVAPLNPD 343 Query: 353 ICDKAKENFIKIVTIS 368 C K+ + + Sbjct: 344 WCGYLKDQSNTMAVLY 359 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 43/445 (9%), Positives = 123/445 (27%), Gaps = 61/445 (13%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + A + ++F ++++ LLL+G + + + +++ + A+LA S Sbjct: 12 VTRSRRVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVS 71 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +S+ A A + + + + + + Sbjct: 72 AAGMKMSQADAEAAMAKLFTDAAA---ALPNVSASPRAATAPTTDGVRTASLTYSATIRT 128 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 M + + + + + T + L + + Sbjct: 129 GIMRLAGFSTVAFGGTATAASPNPIFTDFYLLLDNSPSMG-----------------VAA 171 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I + + C S + A + ++ +++A +D+ Sbjct: 172 TTADIATMVANTSDQCAFACHDMSAGGNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAK 231 Query: 246 SHVKEDVYMGLIGYTTR-----VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + M + + T + + + + D + + + Sbjct: 232 AIAAGQYRMAIYSFGTSCSGIGLNQVSALTANLSTSKTDAGAL-DLMTVPYQNYNNDQCT 290 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--------- 351 + + + + P QK + F++DG + + T Sbjct: 291 DFDGIFARLNSAVPNPGSGASAASP----QKVVFFVSDGVADANYPSTCTKPTTNGRCQE 346 Query: 352 ----KICDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYN 391 C K+ I++ T + NG + C S P+ ++ Sbjct: 347 PITLANCQALKDRGIRVAVLYTTYLPLPTNGWYNTWIAPFSSQIATNMAACAS-PDLYWP 405 Query: 392 VVNADSLIHVFQNISQLMVHRKYSV 416 V ++ + + + + +V + + Sbjct: 406 VSPSEGIADAMKGLFKKVVDSQRRI 430 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 43/464 (9%), Positives = 112/464 (24%), Gaps = 64/464 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K + + +I+F + + + K +E A AA+LA ++ Sbjct: 2 IMKLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEE 61 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++++ K E + ++ V+ + Sbjct: 62 DEVSTQTGKDYVAHYMHDMSNLVDIEVEKLECSELPECTADDNDRPFVEYQVSGRTKHKS 121 Query: 131 HMANNRLDSS----------NNTIFYNMDVMTSYDYRLQFIEHLLNQRYN---QKIVSFI 177 N + + + + L F + + + Sbjct: 122 WFPGNDVTVGFGESFDVTGMSKARKFQSSQPMDITFILDFSGSMNYDWEGHAPSYMEEEV 181 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + ++ VV + + ++ + + + Sbjct: 182 PKVPGRYSPPSRLSDLKDVVQ-MVTDELQVYNNSTTGPKHRVAMTGYNRRTVNESSNGKF 240 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + + G Y + G + + T + Sbjct: 241 VIRDQRITKYNSDGYDAGDKFYPKKTINKQFMVKGAAARVPNGDEKAEFTDIMYTSDFAS 300 Query: 298 MKQAYQILTSDKKRSFFTNFFRQG--------VKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + + + R ++ II L+DGE+ N Sbjct: 301 FNHKIKSFEAFGGTASLQGIIRASQIVSYHITNDGEEANPKQLIIILSDGEDFNHYLGQT 360 Query: 350 TI----KICDKAKEN----------------------------------FIKIVTISINA 371 +CD + ++I I Sbjct: 361 ETLVDYGMCDNLRNAIEGGPVSSEDNKADKIVFSSGSSSGLPTNTGEDPSVRIAMIGFGD 420 Query: 372 SPN--GQRLLKTCVSSPEYHYNVVNADSLIH-VFQNISQLMVHR 412 + L CV E ++ N D +++ + +S+ + H Sbjct: 421 GYDIHANTGLLNCV-GEENAFSANNKDEILNLIMSLVSEEVGHL 463 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 29/366 (7%), Positives = 85/366 (23%), Gaps = 31/366 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + E+ +I ++ L++ G + ++ + +++ + A+LA Sbjct: 42 MKRLTHFIADLAGDEEGTMTIFSTFMLVLILVITGASVDIMYQEAIRARLQATLDRAVLA 101 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A +L + D ++++ +A + + + + Sbjct: 102 AA-----DLDQQQDPVAVVNDYVTKAGLVEHLTDVIATPGLYDR-------TVAADAGLT 149 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + L + + + + + Sbjct: 150 LDTYFLRMSGWQTLPVIAASTAEERIANVEISLVMDISGSMRWNNRITNARNAAKDFVTK 209 Query: 184 -----EMGERPIFLIELVVDLSGSM---------HCAMNSDPEDVNSAPICQDKKRTKMA 229 G + LI ++ + + Sbjct: 210 VLTEDSAGVTTLNLIPFAGQVNPGDVMFDYFRGVRPKIQQGNNGWGNGDQDAPGGSLCTN 269 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR---VEKNIEPSWGTEKVRQYVTRDMDS 286 +NA +D G + + + + ++ Sbjct: 270 NAENADEGAIDPSCTDGTDTAATDTGGFFSPWPQPIANIVVYFDTNGDDIYNRAHKIIN- 328 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + Q D+ F G+ + + + G+ N S Sbjct: 329 FPDGSTRDIDDIYQGAVAFLIDRDPLLFHPDQFLGISVKGGQEKNRYFQV-KGDGNGPFS 387 Query: 347 NVNTIK 352 ++ K Sbjct: 388 DIGPTK 393 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 28/274 (10%), Positives = 76/274 (27%), Gaps = 47/274 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I V + + AP+ + ++ F ++ S Sbjct: 392 TKNNGKIPGDVIDFAFIDYDAWAAYYVAPVAPVEAVNVNMPSSCVEIYDTEFTNTDLPQS 451 Query: 247 HVKEDVYMGL------IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +M + + ++ + ++ + + T +K Sbjct: 452 DDYVPHFMFWPYVREVMDWGWCPGEDTAIQYYSDDAATLSAFIDNMRMHDGTGIQYGLKY 511 Query: 301 AYQILTSDKKRSF---------FTNFFRQGVKIPSLPFQKFIIFLTDGENNN-------- 343 A +L + + F + + +KFI+ ++DG + Sbjct: 512 ALALLDPATGSAVTELISAGLVDSRFLGRPIAWEDEETEKFIVVMSDGAVTDQYRPVDPF 571 Query: 344 -----------------------FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + N C A++ + + ++ + L+ Sbjct: 572 APLNGETELQTQGSGSYTTFSTRGNNLDNLHTQCQLARDLGVTVFAVAFETTDADADELR 631 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 C SS + ++V +I F I++ + + + Sbjct: 632 LCASSDSHFFHVQGT-EIIDAFDTIARQINNLRL 664 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 60/483 (12%), Positives = 124/483 (25%), Gaps = 98/483 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + + + F + F+ LI + K+ + A A LA +++ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 74 RLGDRFE---SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R + S + + + GY+ + ++ + + Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 136 Query: 131 HMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + D + N ++ + F + + + + I L + Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 196 Query: 184 ----------EMGERPIFLIELVVDL----------SGSMHCAMNSDPEDVNSAPICQDK 223 G I + S H + D N + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDRNGHYLQRYT 256 Query: 224 KRT-----------KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + L +A+L ++ + + + + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 264 EKNIEPSW---------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKK 310 K + S ++ + R+ +++ + D + + L D Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPN 376 Query: 311 RSFFTNFFRQGVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN---- 349 N K +I L+DG++N K + Sbjct: 377 SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQK 436 Query: 350 -----------TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K K+N IK+V I I N K CV + Y NA Sbjct: 437 AGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAH 495 Query: 397 SLI 399 L Sbjct: 496 ELE 498 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 60/483 (12%), Positives = 126/483 (26%), Gaps = 98/483 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + + + F + F+ LI + K+ + A A LA +++ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTENNYRND 76 Query: 74 RLGDRFE---SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R + S + + + + GY+ + ++ + + Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLK 136 Query: 131 HMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + D + N ++ + F + + ++ + I L + Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAI 196 Query: 184 ----------EMGERPIFLIELVVDL----------SGSMHCAMNSDPEDVNSAPICQDK 223 G I + S H + D N + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 224 KRT-----------KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + L +A+L ++ + + + + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 264 EKNIEPSW---------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKK 310 K + S ++ + R+ +++ + D + + L D Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPN 376 Query: 311 RSFFTNFFRQGVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN---- 349 N K +I L+DG++N K + Sbjct: 377 SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQK 436 Query: 350 -----------TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K K+N IK+V I I N K CV + Y NA Sbjct: 437 AGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAH 495 Query: 397 SLI 399 L Sbjct: 496 ELE 498 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 61/494 (12%), Positives = 130/494 (26%), Gaps = 92/494 (18%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +LLS F +K I ++K + F + + F+ L+ + KK + A A Sbjct: 21 NLLSLFIIISRKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQAT 80 Query: 62 LAGASKMVSNLSRLGDRFES--ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 LA + L R +SN + L + + IK + + T Sbjct: 81 LALTVENDDLPDELQMRKNVDLVSNFSSAYLPLEHFSVPEIDIKNNCGQLTYNAKITMSY 140 Query: 120 N----IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK--- 172 + ++T + ++ + D T + + + + + Sbjct: 141 FANFLSKTAMTNAITTIGTEDNGAAIKQVSTIQDKATDVIFVADYSGSMNEGFHGKVPRG 200 Query: 173 --------IVSFIPALLRIEMGERPIFLIELVVDLSG----------SMHCAMNSDPEDV 214 + + + + I + H Sbjct: 201 EKINALRDVFNRLNGSILKNSNINLIGFVPFSWGTKRIVIENSQEKKYCHFPFVPKQYRA 260 Query: 215 NSAPICQDKKR-----------TKMAALK---------NALLLFLDSIDLLSHVKEDVYM 254 ++ Q T + + N L + ++ + + + Sbjct: 261 DNNYFRQYTVSGLKKFPGLEGLTDIDKINYGELTLGEYNTLTNVIKNMAKQEYRNKALEF 320 Query: 255 GLIGYTTRVEKN----IEPSWGTEKVRQYVT--------------------RDMDSLILK 290 I I + + + + + ++ L Sbjct: 321 LRITLNIPTYMQQMIFITTTIDYDATIKSINSDAQYIDIPLDDIINESICLNNSNAYSLD 380 Query: 291 PTDSTPAM---KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 +S + A L S K + K ++ ++DG + Sbjct: 381 SHNSHDDLIDKMIAMSPLGQTLVSSGILYANTLFKKESNNSNNKLMVIISDGIDVFINDT 440 Query: 348 VNT-----------IKICDKAKENFIKIVTISINASPNGQRL------LKTCVSSPEYHY 390 +C++ KEN IK+V I+I N K CV + +Y Sbjct: 441 TIQQSIYISKTLIDKGMCERIKENNIKMVFIAIKDGSNETNEPANYIDWKKCV-GEDNYY 499 Query: 391 NVVNADSLIHVFQN 404 V +A L + Sbjct: 500 YVSDAHELEAALRQ 513 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 45/398 (11%), Positives = 107/398 (26%), Gaps = 24/398 (6%) Query: 15 IASEKANFSIIF-ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++ + +I F AL + + +G + +N AA A ++ + + Sbjct: 11 LKKQQGSMTISFLALLFPALFIAVG-------TLMVSAQVMVSNRAAQAADSAALACAFA 63 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ I S S + + + + A Sbjct: 64 DTATLPMMRAYQEYYKPTLKGVSGLEPEIIGSECRISLGYSLSPLLPNFQYESYATKVTA 123 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV-SFIPALLRIEMGERPIFL 192 + + T L + + + K + S ++ + Sbjct: 124 TG-GGYKSVVESKQSSIPTELVLVLDVSGSMGSNIQSLKSILSNALNTIQSQSNNANDLD 182 Query: 193 IELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-LFLDSIDLLSHVK 249 + + + + C D + AL L ++ VK Sbjct: 183 SVSISIVPFDSGVAAQRPPWLSKEAAGIYCIDGLNYRNGNFSAALTVDNLATLHSQQPVK 242 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G + + + + +VR + T S + + L Sbjct: 243 FAKPNGWLSDCNQSSPMLPLTSVFSRVRNSINSLT---ANGGTRSFHGLLWGVRQLIPSW 299 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKEN-FIKIVTI 367 ++++ N + ++ TDG + + C A I++ I Sbjct: 300 QQAWGINVSTVPETR------RKLVLFTDGADEGDTFDQLVNAGFCTTAINQYGIEMNFI 353 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 S + + C +P ++ N L F +I Sbjct: 354 GYGVSSSRIAQFERCAGNPSRVFSATNTTQLNEYFSDI 391 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 34/355 (9%), Positives = 83/355 (23%), Gaps = 43/355 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 + + + + G + ++ + K+ ++S ++A+LA A + R Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASAGAPYGSD----HRAIIEDY 56 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + D + I + + + M L ++ + Sbjct: 57 FRVANMTDYLAAEKEGEI-----VVTVNSASVTANADMTMDTYLMKLSGIKELRTTGGST 111 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR-----IEMGERPIFLIELVVDL 199 L + + + + E G I ++ + Sbjct: 112 AVRKVPKLEVVLVLDVSGSMGSNSKLVNLKKAAKEFVTSLLNGSEPGNTVISIVPFSWSV 171 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL----------------FLDSID 243 S S+ + + C K + A + D Sbjct: 172 SPSVATFEALAVDRKHEFSTCIRFKANDHSHASLATGNSGFSSGQPLDQMIYTALYGNFD 231 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 S E Y + + S ++ + T M Sbjct: 232 EFS-GSESSSDYRSCYANDYMEILPFSVSETELHAKIDSL---QASGNTSGNQGMIWGAA 287 Query: 304 ILTSDKKRSFF---------TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 +L ++ ++ + K + + DG+N N Sbjct: 288 LLDPSFRQITDDLIAAGEVASSQAAIPSNYGTAETLKVAVVMGDGQNTTSYFFSN 342 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 30/76 (39%), Gaps = 3/76 (3%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLI 399 + + + C K + + +I +++LK C SS +++ + + Sbjct: 475 DGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGIN-IN 533 Query: 400 HVFQNISQLMVHRKYS 415 F I+ +V+ + + Sbjct: 534 DAFSAIASNVVNLRLT 549 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 133 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 60/483 (12%), Positives = 124/483 (25%), Gaps = 98/483 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + + + F + F+ LI + K+ + A A LA +++ Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 76 Query: 74 RLGDRFE---SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R + S + + + GY+ + ++ + + Sbjct: 77 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 136 Query: 131 HMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + D + N ++ + F + + + + I L + Sbjct: 137 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 196 Query: 184 ----------EMGERPIFLIELVVDL----------SGSMHCAMNSDPEDVNSAPICQDK 223 G I + S H + D N + + Sbjct: 197 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 256 Query: 224 KRT-----------KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + L +A+L ++ + + + + Sbjct: 257 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 316 Query: 264 EKNIEPSW---------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKK 310 K + S ++ + R+ +++ + D + + L D Sbjct: 317 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPN 376 Query: 311 RSFFTNFFRQGVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN---- 349 N K +I L+DG++N K + Sbjct: 377 SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQK 436 Query: 350 -----------TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K K+N IK+V I I N K CV + Y NA Sbjct: 437 AGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAH 495 Query: 397 SLI 399 L Sbjct: 496 ELE 498 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 133 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 60/483 (12%), Positives = 124/483 (25%), Gaps = 98/483 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + + + F + F+ LI + K+ + A A LA +++ Sbjct: 12 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRND 71 Query: 74 RLGDRFE---SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R + S + + + GY+ + ++ + + Sbjct: 72 RASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLK 131 Query: 131 HMANNRLDSSNNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--- 183 + D + N ++ + F + + + + I L + Sbjct: 132 QTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAI 191 Query: 184 ----------EMGERPIFLIELVVDL----------SGSMHCAMNSDPEDVNSAPICQDK 223 G I + S H + D N + + Sbjct: 192 FVKLNNRIFSNDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYT 251 Query: 224 KRT-----------KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + L +A+L ++ + + + + Sbjct: 252 ASNLKNIPGLDNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKA 311 Query: 264 EKNIEPSW---------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKK 310 K + S ++ + R+ +++ + D + + L D Sbjct: 312 YKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPN 371 Query: 311 RSFFTNFFRQGVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN---- 349 N K +I L+DG++N K + Sbjct: 372 SKGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQK 431 Query: 350 -----------TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNAD 396 T +C K K+N IK+V I I N K CV + Y NA Sbjct: 432 AGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAH 490 Query: 397 SLI 399 L Sbjct: 491 ELE 493 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 133 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 49/458 (10%), Positives = 124/458 (27%), Gaps = 76/458 (16%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 +F +E+ I F + + F+ LI + + +K + A A LA + Sbjct: 11 FNHFTLFKKNEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIE 70 Query: 68 MVSNLSRLGDRFE-------SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + + + N + + Y + Sbjct: 71 NNAIPDEPQQIKNNALVLSYANAYLPSKEFSVPIININDNTYYLEYNAAVTMAYPAKFLT 130 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVS 175 + + ++T + + +T + + +L + + ++ Sbjct: 131 QTSLTN-AITDINITDNGVAIKNKAIEASDLTDVIFVADYSGSMLYNFDVNEPNDHERIN 189 Query: 176 FIPAL-------LRIEMGERPIFLIELVVD------------LSGSMHCAMNSDPEDVNS 216 + + + I I + + N Sbjct: 190 ALRSAFRKLHDIIMNNSNINAIGYIPFSWGTKRIVFENQQQKIYCHFPFSSKIYKPKGNY 249 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEKNIEPS 270 K + L + + +D + + + I + ++ + S Sbjct: 250 LSDEIKKSSNAL-LLLDYIGDIIDYDKTIESITGNAQPIDIPMSDVRTKNVCLQASNAYS 308 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 E+ + + T + + A + K + + Sbjct: 309 LEQEQYINNIDNIIKMEPYGWTLISSGILSANNLFK----------------KEANNRHR 352 Query: 331 KFIIFLTDGENNNFKSNVNT-----------IKICDKAKENFIKIVTISINASPNGQR-- 377 K +I L+DG + + + +C++ + I++ I+I SP+ Sbjct: 353 KLMIILSDGVDTYQDNFLPNKGLFISKTLVEKGMCERVISSGIQMAFIAIAYSPDDDVNE 412 Query: 378 ----LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + CV + +Y NAD L ++I Q + Sbjct: 413 PEYINWRQCV-GKDNYYEAHNADEL---MRDIQQAISK 446 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 51/483 (10%), Positives = 113/483 (23%), Gaps = 101/483 (20%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ I AL+++ +I + + + KKN + A AA LA + + + Sbjct: 5 RQQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTANQDDKTYE 64 Query: 76 GDRFE----------SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +I N K + + E + SS Sbjct: 65 NQLATNYVQTYIRNIAIINDIKVERSEGIDYYPTPDGNEEREYFQYRVTAKTDHTSWLSS 124 Query: 126 RISMTHMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 I + + + Y D + F + + + Sbjct: 125 DIIPSFSPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGNKIRALKDAIQAIANE 184 Query: 183 IEMG-------ERPIFLIELVVDL-------------------------SGSMHCAMNSD 210 I + I + + + S ++ Sbjct: 185 ILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCLTQLDYRPNFNGGNYSSYEDIDWSTW 244 Query: 211 PEDVNSAPICQD-----------------------------------KKRTKMAALKNAL 235 + + + + Sbjct: 245 STWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNTSKSETGSGWYFPDPYSYINFPDSVA 304 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F + L + + G + I + + T Sbjct: 305 KTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTTLSPIQNNMS---PDGGTSVY 361 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN---VNTIK 352 + + QIL + S K I+ L+DG+ + S + Sbjct: 362 QGLIRGAQILEQGRPTSPSPETSAAYNSR-----IKMILMLSDGQEMPYVSTFNQLVNQG 416 Query: 353 ICDKAK------ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +C+ K + + + + I GQ+ K CV + N D + + + I Sbjct: 417 LCNTIKAQFNDSDQQLYMGVLGIEFDAQGQQGFKNCVGQNN----ITNVDDVDDLIKEIL 472 Query: 407 QLM 409 +++ Sbjct: 473 EMI 475 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 83/265 (31%), Gaps = 26/265 (9%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 S L + + D + H Sbjct: 14 QSQGSISMFLALDKSGSMGDPTATV---------------NADDPTEPFTYDCN--PHLN 56 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ++ + TK+ ALK A ++ E V G + Y Sbjct: 57 KKGTKIIYDTCTGSRAHYYTKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSYDLVEYTP 116 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + +WG V YV T+S+ A+ AY LT+ Sbjct: 117 SKLAWGITAVTSYVNAL---ESGGGTNSSGAVNTAYTSLTAKNAAGNDAEDA-AHKLKTG 172 Query: 327 LPFQKFIIFLTDGENNNFK-----SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 +K+I+F+TDG+NN+ + T CD AK I+ TI+ A GQ LL Sbjct: 173 QLPKKYIVFMTDGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHY 232 Query: 382 CVSSPEYHYNVVNADSLIHVFQNIS 406 C S +++ + L+ F+ I Sbjct: 233 CASDDAHYFQAEKMEDLLAAFKAIG 257 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 106/428 (24%), Gaps = 38/428 (8%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++K I F +S+ F+ +I L K + A LA ++ + Sbjct: 25 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENNAKND 84 Query: 74 RLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-- 128 + R + N+ + Y I + + Sbjct: 85 TRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYHIDANIEQPTLFPF 144 Query: 129 -MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN-----------QRYNQKIVSF 176 + ++ + + F + + ++ F Sbjct: 145 TSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGDFHNPDDPEVLSKLDELKRIF 204 Query: 177 IPALLRIEMGERP--IFLIELVVD------LSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 I + I S+H + ++ Sbjct: 205 FKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPKEKNKIYPIPSNEIERNTEAH 264 Query: 229 AALKN-ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-----TEKVRQYVTR 282 A+ +D + + ++ + +I ++ S K + Sbjct: 265 QEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELCLYSSNAYPISLAKDIDDLNV 324 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP--FQKFIIFLTDGE 340 T + + Q +L + G P+ K I + Sbjct: 325 IKTMNEGGSTLVSSGIMQGADLLMDGVNHNKLMIILSDGHDYPTTAVVHDKTITSAKEVR 384 Query: 341 NNNFKSNVNTI-KICDKAKENFIKIVTISINASPNGQ---RLLKTCVSSPEYHYNVVNAD 396 N S +C K +E +IV I I +P+ + CV + Y +N Sbjct: 385 VNVDISRQLVQHGMCKKIRETVGRIVFIGIGYNPSANHYINWAEDCVDTEN-FYLAMNTK 443 Query: 397 SLIHVFQN 404 L ++ Sbjct: 444 ELEDSIRS 451 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 51/483 (10%), Positives = 113/483 (23%), Gaps = 101/483 (20%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ I AL+++ +I + + + KKN + A AA LA + + + Sbjct: 15 RHQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTANQDDKTYE 74 Query: 76 GDRFESI----------SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 N+ K + + E + SS Sbjct: 75 NQLATGYIQAYIRNITSINNIKIERSEGIDNYPTPDGNEEREYFQYRVTAKTNHISWLSS 134 Query: 126 RISMTHMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 I + + + Y D + F + + + Sbjct: 135 DIIPSFAPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGNKIRALKDAIQAIANE 194 Query: 183 IEMG-------ERPIFLIELVVDL-------------------------SGSMHCAMNSD 210 I + I + + + S ++ Sbjct: 195 ILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCITQLDYRPNFNGGNYSSYEDIDWSTW 254 Query: 211 PEDVNSAPICQD-----------------------------------KKRTKMAALKNAL 235 + + + + Sbjct: 255 STWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNASKSETGSGWYFPDPYSYINFPGSVA 314 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F + L + + G + I + + T Sbjct: 315 KTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTALSPIQNNMS---PDGGTSVY 371 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN---VNTIK 352 + + QIL + S T K I+ L+DG+ + S + Sbjct: 372 QGLIRGAQILEQGRPTSPSTETSAAYNSR-----IKMILMLSDGQEMPYVSTFNQLVNQG 426 Query: 353 ICDKAK----ENFIKIV--TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +C+ K ++ + + I GQ+ K CV + N D + + + I Sbjct: 427 LCNTIKAQFNDSDQPLYMGVLGIEFDAQGQQGFKNCVGQNN----ITNVDDVDDLIKEIL 482 Query: 407 QLM 409 +++ Sbjct: 483 EMI 485 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 34/321 (10%), Positives = 79/321 (24%), Gaps = 23/321 (7%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L R+ + N +I+ ALS++ L+ +G + + + M+S +AA++A Sbjct: 17 LGRYFHTLRGLRRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIA 76 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ + + + + I N Sbjct: 77 AVKQINNTADADALKAKVSDWFHAQVDNSYTLGEIDIDTANH---------NITATASGT 127 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S + + + +L Sbjct: 128 VPTTFMKIANIDTVPVSVGSAVKGPATSYLNVYIVIDTSPSMLL-------------AAT 174 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I + N + + +A+ L I Sbjct: 175 TSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYEYSTAKNIKLRADVAGDAVKDVLSLI 234 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D E + +GL + + + P+ T+ R ++ L + + + Sbjct: 235 DTSDSNHERIKVGLYSLGDTLTEVLAPTLSTDTARTRLSTASYGLTSATSKAATYFDVSL 294 Query: 303 QILTSDKKRSFFTNFFRQGVK 323 L +K Sbjct: 295 ATLKQKVGAGGDGTTSGTPLK 315 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 47/455 (10%), Positives = 118/455 (25%), Gaps = 75/455 (16%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 ++ A +++ + +G + + + N +++A +AA LAG+ ++ + + Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPD--------VD 52 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 N + K + + + + + + + Sbjct: 53 NGKVSQAVTTNLAANDPEAKATDISSGGATRSVCVTAEADVDMTLSKVVGLDATTVTA-- 110 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE-----RPIFLIELVVD 198 L + + + I I L+ Sbjct: 111 EACAGYNDIELVMVLDATGSMRGTPIANVKEAAANLVDLIMPDSGANTRSKIGLVPFQGK 170 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + + + ++ + C++ T + D+ S G+ Sbjct: 171 VRIDGNDPVTAERDPDGVGAGCRNADGTLNDG--KLKTEYSDTRSRNSIFYGYTISGVST 228 Query: 259 Y----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK---- 310 Y + + S E + + + T + +K +++L+ Sbjct: 229 YYDRTCSGMSPIRALSSDKEAILDNIGAINAGAVTSGTLISEGIKWGHKVLSPKAPYTEG 288 Query: 311 -------------RSFFTNFFRQGVKIPSLPFQKFIIFLTDG------------------ 339 T R G + S + Sbjct: 289 NTDKKVRKIMIVLTDGDTEDGRCGGRYASASRTVNAYWTNAYFGQGLRPNSASSPYDTLS 348 Query: 340 -----------ENNNFKSNVNTIKICDKAKENF---IKIVTISINA-SPNGQRLLKTCVS 384 + K N + D AK + ++I I L+K S Sbjct: 349 TASATLAQIPDCTDGGKLNQYVLDEADDAKNDADYPVEIFAIRFGDSDATDISLMKRIAS 408 Query: 385 S----PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S +++Y+ ++ + +F+ I Q + R S Sbjct: 409 SKSGTDDHYYDAPDSSDIKDMFKKIGQQLGQRLMS 443 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. Identities = 37/379 (9%), Positives = 96/379 (25%), Gaps = 37/379 (9%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L+R + N +I+ AL+++ ++ +G + + + M+S + A++A Sbjct: 17 LARVFKTLRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIA 76 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ ++ + K+ + D ++N + Sbjct: 77 AVKEIDTDDAVA----------LKQKVSDWFHAQVENSYTLGDINIDTSNHKITATASGT 126 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + +D S + + + +L Sbjct: 127 VPTTLMRIANIDTVDVSVASAVKGPATSYLNVYIVIDTSPSMLL-------------AAT 173 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I + + + N + + +A+ L I Sbjct: 174 TAGQATMYSGIGCQFACHTGDAHKIGNKTYNNNYEYSTAKNIKLRADVAGDAVKDVLALI 233 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D + + +GL + + + P+ T+ R +T L + + + Sbjct: 234 DTSDSNHQRIKVGLYSLGDTLTEVLAPTLSTDTARNRLTDASYGLTSATSKAATYFDVSL 293 Query: 303 QILTSDKKRSFFTNFFRQGVK-----------IPSLPFQKFIIFLTDGENNNFKSNVNTI 351 L +K K + + + V + Sbjct: 294 TTLKQKVGTGGDGTASNSPLKLVLLLTDGVQSQREWVTDKVTWSNNKAISGTYWNKVAPL 353 Query: 352 --KICDKAKENFIKIVTIS 368 C K + + Sbjct: 354 NPAWCAYLKNQSATMAVLY 372 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 106/428 (24%), Gaps = 38/428 (8%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++K I F +S+ F+ +I L K + A LA ++ + Sbjct: 38 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENNAKND 97 Query: 74 RLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-- 128 + R + N+ + Y I + + Sbjct: 98 TRNNELISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYHIDANIEQPTLFPF 157 Query: 129 -MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN-----------QRYNQKIVSF 176 + ++ + + F + + ++ F Sbjct: 158 TSLLIDHDNFIIGGSAAAIKDVPAMDVVFVTDFSGSMEGDFHNPDDPEVLSKLDELKRIF 217 Query: 177 IPALLRIEMGERP--IFLIELVVD------LSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 I + I S+H + ++ Sbjct: 218 FKIADDIYTANKDSTISFSPFSWGTKSADNKKCSLHFMPKEKNKIYPIPSNEIERNTEAH 277 Query: 229 AALKN-ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-----TEKVRQYVTR 282 A+ +D + + ++ + +I ++ S K + Sbjct: 278 QEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELCLYSSNAYPISLAKDIDDLNV 337 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP--FQKFIIFLTDGE 340 T + + Q +L + G P+ K I + Sbjct: 338 IKTMNEGGSTLVSSGIMQGADLLMDGVNHNKLMIILSDGHDYPTTAVVHDKTITSAKEVR 397 Query: 341 NNNFKSNVNTI-KICDKAKENFIKIVTISINASPNGQ---RLLKTCVSSPEYHYNVVNAD 396 N S +C K +E +IV I I +P+ + CV + Y +N Sbjct: 398 VNVDISRQLVQHGMCKKIRETVGRIVFIGIGYNPSANHYINWAEDCVDTEN-FYLAMNTK 456 Query: 397 SLIHVFQN 404 L ++ Sbjct: 457 ELEDSIRS 464 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 130 bits (326), Expect = 5e-28, Method: Composition-based stats. Identities = 35/384 (9%), Positives = 94/384 (24%), Gaps = 24/384 (6%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + + +I+ + L + G I + + +++ ++A+L A Sbjct: 2 RISRTAGRFVTRDDGAITILSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVL--A 59 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + D + ++ A + I S + + Sbjct: 60 GAGAPADATAADVKLIVEDYFDAADLSQYLNTIDPETDI---VASLNAKSVSASVELEMD 116 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 M + L ++ L + +R + + + + Sbjct: 117 TFLMRLSGVDTLTTAGGATAAIAAPRMEIVLALDVSGSMAGERLTKMKSAAKQFVTDVMS 176 Query: 186 ----GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ-----DKKRTKMAALKNALL 236 G I ++ ++ S + ++ C D ++T + ++ Sbjct: 177 ASDQGTTTISIVPYSWSVTPSDEMFEALSVDVRHNYSTCIDFLESDFEKTAIDPARSYGQ 236 Query: 237 LFLDSIDLL-------SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 S+ + YT + + S + + Sbjct: 237 TIYTSLTGSFGNIGIGDPTVTNTAYDRTCYTDEYFRILPYSNSVTALHNKIDSLK---AA 293 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + MK A +L + + S + I N + Sbjct: 294 GSTSTHLGMKWAAGLLDPAFAPVVSSLQQTRTKTDSSGNLVTYSIVDPSINNTPALYSTG 353 Query: 350 TIKICDKAKENFIKIVTISINASP 373 + + T ++ Sbjct: 354 QVLKVAIVMGDGANDWTYGLDDPN 377 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/351 (9%), Positives = 85/351 (24%), Gaps = 18/351 (5%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + + + ++ +L + + + + Sbjct: 253 DPTVTNTAYDRTCYTDEYFRILPYSNSVTALHNKIDSLKAAGSTSTHLGMKWAAGLLDPA 312 Query: 136 RLDSSNNTIFYNMDVMTSYDYRL--QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 ++ +S + + N + + + G Sbjct: 313 FAPVVSSLQQTRTKTDSSGNLVTYSIVDPSINNTPALYSTGQVLKVAIVMGDGANDWTY- 371 Query: 194 ELVVDLSGSMHCAMNSDPE--DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + D +G M+ + + D + + K +F S S + + Sbjct: 372 -GLDDPNGLMNPDIVENHTQPDYRGPDSNLYRVQYTDDVFKYRYFVFNPSFIAYSEMNCN 430 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + Y + N +T + A+ S ++ Sbjct: 431 TGYWVCVYESEDITNYYLYSTYWNDYTDITNGVYLTP----AQFDALPTTLPNFESQERL 486 Query: 312 SFFTNFFRQ-----GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S+ + G + IC K I + T Sbjct: 487 SWEEAWGLMTPDFYSRTTFDYAPDNMFSKNGTGSIAPETKDDRMADICGATKSKGIVVYT 546 Query: 367 ISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I+ + L+ C SSP H+N + + F +I+ + + + Sbjct: 547 IAFEMGEFDSAADRLENCASSPSQHFNATTLN-ISQAFGSIAANVQKLRLT 596 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 130 bits (326), Expect = 5e-28, Method: Composition-based stats. Identities = 56/436 (12%), Positives = 114/436 (26%), Gaps = 39/436 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 FK + N +II AL ++ + ++G I +K+ MES +AA LA + Sbjct: 4 LFIKFKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALAAVT 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + + +AL + N + S+ S +Q S Sbjct: 64 PAMVAAGQSSAITTAQNVFNAQALTMTGVTYNANDVTVSI-ATSGDKRTATVQYQAQSQA 122 Query: 127 ISMTHMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + M + + L + I + + Sbjct: 123 MLPDVMGFGSIKIGGQATATTTIAPNIDFYLLLDDSPSMAIAATQSGINTMV-------A 175 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + + N A ++ L+ A + + Sbjct: 176 NTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSLGVTLRIDMLRQATQDLMTTAQTT 235 Query: 246 SHVK-EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD-----STPAMK 299 K M + + + + + + + Sbjct: 236 ETQKGTTYRMAIYTFDIGLNTIGNLTSDLSQAQTEAGNIQLLEVYSNNWLTQNDYNDDED 295 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-----ENNNFKSNVNTIKIC 354 Y + G + Q+ + F+TDG N N + ++ +C Sbjct: 296 TNYDTALNGINAIMPNPGNGTGAAGDTP--QEVLFFVTDGVEDEDVNGNRQQSLLNTDLC 353 Query: 355 DKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYNVVNADSL 398 K I+I T + N L+ C S P ++ V + + Sbjct: 354 TAIKNRGIRIAVLYTEYLPLPTNSWYNTYIAPFQNSIAPTLQQCAS-PGLYFEVKSGGDI 412 Query: 399 IHVFQNISQLMVHRKY 414 + Q V Y Sbjct: 413 SAAMSALFQTAVQSSY 428 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 49/437 (11%), Positives = 109/437 (24%), Gaps = 55/437 (12%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + ++ N ++IFA++++ L IG I K + ++ +AA+LA Sbjct: 7 YLSLLSRFRRNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLAATG 66 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 S + + + + + S V Sbjct: 67 YTAMRGSSSDAKTAATNMFNGQMSAHKLTSNSLS----IDITDSVSARTVTGSATVVVKT 122 Query: 127 ISMTHMANNRLDSSNNTIFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 M + S ++ + + + Sbjct: 123 SFMYMFGYPTMTVSASSSASASFPTYMDFYVLVDNSPSQGLG---------ATTADMTTL 173 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + + ++ +++A D+ Sbjct: 174 QNATTDTCAFACHDTYTSSSKKTLQTNSYYD-KAKKLGVTMRIDVVRSATQSLTDTATSS 232 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR----DMDSLILKPTDSTPAMKQA 301 V M + + + + +D + + + M Sbjct: 233 QIVSNQYRMAVYSMGADCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTD 292 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI---------- 351 + S T S QK++ F++DG + + Sbjct: 293 FDGTMSAMSGVIPTQGDGS-----STNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVE 347 Query: 352 ----KICDKAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYN 391 C K IKI T + + NG ++K+C S P Y+Y Sbjct: 348 PLTTTTCTALKARGIKIAVLYTTYLAITSNGYYNTWVKPWRDSIGTIMKSCAS-PGYYYE 406 Query: 392 VVNADSLIHVFQNISQL 408 V ++ S+ + Q Sbjct: 407 VDSSGSIGSALTALFQQ 423 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 67/529 (12%), Positives = 126/529 (23%), Gaps = 137/529 (25%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R ++ A + N +I A +++ + I + K ++ A +AA L A Sbjct: 9 LRRALRRL-ADDGGNVAITVAFAMVPLAIGTLGAIDLARGASAKVQLQDALDAAALGAAR 67 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ L E + D S S + V Sbjct: 68 SSANSPDTLQAAGERLLRQNLALGGDFEL--------VSSSFTFGPDNKVLARAQVRVEP 119 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR------------------ 168 N +D + T + L + Sbjct: 120 YVAGLAGVNNMDIAAATEVVRAGMQLEIALVLDNTGSMNQNNKLYHLKTAAKAFVTAMET 179 Query: 169 -------YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 N +S +P + + +D +GS P + + Sbjct: 180 AAEGNTVPNSIKISLVPFSHTVRVDSDAYRNAAW-IDQNGSSPINNEIFPTATGTQWANR 238 Query: 222 -------DKKRTKMAALKNALLL------------FLDSIDLLSHVKEDVYMGLIGYTTR 262 + A F + G + Sbjct: 239 FTLFSQLGTSWRGCVESRQAPYDIQDTPPTTGATLFTPYFAPDEPDYPAEWYGTKFANSY 298 Query: 263 VEKNIEPSW---GTEKVRQYVT-------------------------------RDMDSLI 288 V+ + + +YV D + Sbjct: 299 VDDRTSSTNWRVRQGNLTKYVNTKGLGTSKGPNAGCGLRPIIRLTTDFDGLRDAVDDLVA 358 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ + + L F GV + +K ++ +TDGEN + Sbjct: 359 DGSTNIPMGLVWGWHTLAP-------MAPFPDGVPYLTEKHKKIVVLMTDGENTILYKDT 411 Query: 349 ----------------------------------------NTIKICDKAK--ENFIKIVT 366 +K+C K I+I Sbjct: 412 PNGSDYSGVGHARQGRVLDPAGRPITESSSQRERTAALDDRLLKLCANMKAPAKDIEIYA 471 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I + S +L+TC SS +++Y+V NA + FQ+I+ + S Sbjct: 472 IRVEVSSGSSSVLQTCASSADHYYDVQNAADMTMAFQSIAGQIAALHLS 520 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 52/394 (13%), Positives = 109/394 (27%), Gaps = 67/394 (17%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ A+ + FL+++ F I + H + + S+ +AA A A+ + L R Sbjct: 33 RHGAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLA 92 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + S N + Sbjct: 93 IQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNA----------GEAPFNGV 142 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + ++ + I + I LVV Sbjct: 143 RVNGQRTAGSLSGPVPLFF---------GNVTGTSIFEPEAFATATYVER----DITLVV 189 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM ++ L+ A+ +F D + ++ +GL Sbjct: 190 DRSGSMA--------------------GSRFNDLQAAIRIFTDLLATTPVDEQ---IGLA 226 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 Y R ++++ + +V T + M+ Sbjct: 227 SYNDRASEDVQLTENFAEVN---NAMDRLRTGGFTSISRGMQAG---------------Q 268 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 P ++ +I +TDG +N + + + I TI+ A + R Sbjct: 269 EIALRGRPPEFVERTMIVMTDGRHNRG---PEPRVVATDLAADGVTIHTITFGAGADFGR 325 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + H++ N D L +++ I+ + Sbjct: 326 MQDVARIGGGRHFHATNGDQLRDIYREIALTLGT 359 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 53/481 (11%), Positives = 127/481 (26%), Gaps = 94/481 (19%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 S+K ++ FALS++ + F + + + + + A A LA SN + Sbjct: 15 RSQKGIAAVWFALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALAITIDDKSNQADA 74 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + +D E + Y+ + NS S + + Sbjct: 75 LATMYINDYVRDISHVDIQTVRTYEEPTEDNDNTEKIQYSVQAVTTHNSWFASNSIPSFE 134 Query: 136 R------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE-- 187 + FY D + F + N + + + ++ Sbjct: 135 TQEKLAGQAVAAKYPFYLGDKIIDLVLVTDFSGSMNNSWDGEIKIDLLKDAVKQISNRIL 194 Query: 188 ----------RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK----- 232 I +I + + ++ + S + + K + Sbjct: 195 VPREGESEVLNRIAIIPFNLRVQEKINDNLYSTSQLRYKGNYRKSVSSVKYEQVNWDYWS 254 Query: 233 ----NALLLFLDSIDLLSH----------------------------------------- 247 A+ + + Sbjct: 255 PYSEEAVEECANKRTDCPNKKSWERDQAKRVADVVNINNNRLEIPDYVGYSKSVRHMFDD 314 Query: 248 --VKEDVYMGLIGYTTRVEKNI--------EPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ ++ + + + + + T + Sbjct: 315 KVANNNLTFHFRSNNNKLYNSSMTRTGNSGFYTIPLTANKANLDKMQTMSSGGNTAAHQG 374 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKIC 354 M + QI+ + + Q K +I ++DG + + +C Sbjct: 375 MLRGLQIMEAGRPNGGSEEETEQYNDR-----LKMLIVISDGMEYPYTEILPGLVNKGMC 429 Query: 355 DKAKEN------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 DKA+E+ + I I +N S + Q + CV +P+ ++++ + I +L Sbjct: 430 DKAREHFQTENGNLYIGVIGVNFSASSQSGFQDCVLNPDE--DIIDVTETEDFIKKIEEL 487 Query: 409 M 409 + Sbjct: 488 I 488 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 37/317 (11%), Positives = 82/317 (25%), Gaps = 27/317 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + RF + N +IIF L + + L+G + + +M+SA ++A L Sbjct: 6 IKRFASLANGFRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAALM 65 Query: 64 GASKMVS-NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + + S + R N + F + + SG + Sbjct: 66 ISKDAATMSDSEITTRARQYVNSLYTNTETPIQTFSAVYTPNNGSG-----ATILLNAGG 120 Query: 123 NSSRISMTHMANN--RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-- 178 N M + N L + + L + + Sbjct: 121 NMPTYFMKIVGTNFSTLPINTASTTKWGSSRMRVALVLDNTGSMDQNGKMTALKKAAANA 180 Query: 179 --------ALLRIEMGERPIFLIELVVDLSGSMHC--AMNSDPEDVNSAPICQDKKRTKM 228 + G+ I ++ D++ A + + +AP + Sbjct: 181 TTGLIKKLSAFNTNEGDVYISVVPFAKDVNVGTSNVGASWLNWSEWEAAPRILTDNSYPI 240 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-------GTEKVRQYVT 281 N + I + D ++ + S+ Sbjct: 241 KVKYNNITYEWADIGPGAPCPFDTKSNGNPRPSQQSSVSKFSFACMDRPGSLSGATDLSN 300 Query: 282 RDMDSLILKPTDSTPAM 298 D ++ + + M Sbjct: 301 NSTDRYLIPSSGTYEGM 317 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 70/201 (34%), Gaps = 24/201 (11%) Query: 233 NALLLFLDSIDLLSHVKEDVYMG---LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + ++ D + V G + + + + + + ++ Sbjct: 405 STWTGCVNDRDQDADTTNAVMSGSGRIYPEQWKDCLSATITPMSNQWATLNSKVNAMNPS 464 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ + +Q L + + F+ K P+ +Q +I+ L+DG N + Sbjct: 465 GNTNQAIGLFWGWQTLNTA------NDPFKAPSKDPNWVYQDYIVILSDGLNTQNRWYTC 518 Query: 350 T------------IKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYN-VVN 394 +CD K + I I TI +N + ++LK C SS ++ + + Sbjct: 519 PNAGPCPTIDGREKTLCDNIKADKITIFTIQVNINSKDPESQVLKDCASSGSGYFQLITS 578 Query: 395 ADSLIHVFQNISQLMVHRKYS 415 A+ F N+ + + + Sbjct: 579 ANDTATAFDNVLNKIAKLRIA 599 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 48/453 (10%), Positives = 118/453 (26%), Gaps = 73/453 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + + + +I F L++M L+ G + +K+++ A +A +L + Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVLTALKEAREQ 62 Query: 72 LS------RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 L + + A I SLSG + N+ Sbjct: 63 LKQGKPDWQSIAEKQGGKAFTNNASKIGGVSGTGATINLSLSGGVLSGS---LNYAANAP 119 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMT-SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N ++ + + + + I Sbjct: 120 THFLRIAGLNTINLKGSASATMSAAQYRDIHFVIDVSASMG-----------------IG 162 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++ V + + H A +DP N A + ++ ++ A++ L I Sbjct: 163 ATKADQQAMQNSVGCAVACHHAEAADPATDNLAAVRAIGATLRIDVVRKAVMDALAKI-- 220 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTD-------ST 295 + + ++ ++ S + T+ Sbjct: 221 --PNDGSTRVAIHSFSNSLKTVFPLSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLN 278 Query: 296 PAMKQAYQILTSDKK--------------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + A LT+ + FF F++ Sbjct: 279 NLLASAGNGLTASQPRGFVLLATDAVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYFA 338 Query: 342 NNFKSNVNTIKI-CDKAKENFIKIVTI--------SI----------NASPNGQRLLKTC 382 C K ++T+ + + P + + C Sbjct: 339 WGLHYVQAPDAANCSAIKAKGYTMMTLETEYLIPDGVYNPTFDAVRGDMGPAMTKSMTDC 398 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 S+P+Y+++ + + Q + V+ + Sbjct: 399 ASAPDYYFHAESPQEIDRAVQTMVSKTVNLSLT 431 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 43/395 (10%), Positives = 89/395 (22%), Gaps = 67/395 (16%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + E+ +F I M L G + + + + ++ + AILA AS L Sbjct: 21 FLRDERGSFIIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAAS-----LD 75 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN------IVNSSRI 127 + D E + ++ +A + + E+ + Sbjct: 76 QDLDPEEVVLDYFTKAGLGHVISQDDIDVFENQTNGEVADDVAVTTRRVEASVSALMPTT 135 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ-------------------------FIE 162 + L ++ L F+ Sbjct: 136 FLRLAHMYDLGLYTEGGAEEALSLSEISLVLDVSGSMGNSSSSGYSKIYELRRAAKRFVN 195 Query: 163 HLLNQ------------RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 +L ++ +P ++ + + + + S D Sbjct: 196 VMLCNPADADETEDCTLTEGDISINIVPYAEQVLLPSNLLQRFNHTSEHTESRCITFYED 255 Query: 211 PEDVNSAPICQ----DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 D + P + AL + D G Sbjct: 256 EFDTVAVPTFSLDTFVTNGRPLPALYGDPIQLTGYFDPSGGTNSTPNPGSNSPCYNDYSG 315 Query: 267 IEPSW------GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF------- 313 + D T MK +L + + Sbjct: 316 STNDYWREIYPMGFSAEALRDEIDDLGASGNTSIDLGMKWGAALLDPAAQPAISDLVAAN 375 Query: 314 --FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 F + + +K I+ +TDGEN + Sbjct: 376 EVNEAFDGRPFEYTQRGIEKVIVLMTDGENTSQDY 410 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 41/92 (44%), Gaps = 2/92 (2%) Query: 324 IPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + ++ + N + N N +IC AK + + TI S + ++++C Sbjct: 503 KTANWYDQWNFLADAHDYFNYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQHDIMRSC 562 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S+P Y+++V D + F I++ + + Sbjct: 563 ASAPAYYFDVDGLD-ISAAFAAIAREISKLRL 593 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 39/378 (10%), Positives = 93/378 (24%), Gaps = 39/378 (10%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R F+ N +I+ AL+++ ++ +G + + + M+S + A++A Sbjct: 23 RVFKAFRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAV 82 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ ++ + K + D ++N + Sbjct: 83 KEIDTDDTDA----------LKEKVADWFHAQVENSYTLGDIDIDTSNHKITATASGTVP 132 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + +D S + + + +L Sbjct: 133 TTLMKIANIDTVDVSVASAVKGPATSYLNVYIVIDTSPSMLL-------------AATTA 179 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 I + N + + +A+ LD ID Sbjct: 180 GQSAMYSGIGCQFACHTGDAHTVGKTKYANNYEYSAAKTIKLRADVAGDAVRDVLDMIDD 239 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + +GL + + + P+ T+ R + L + + + Sbjct: 240 SDSNHQRIKVGLYSLGDTLTEVLTPTLSTDTARNRLADASYGLTSATSKAATYFDVSLAT 299 Query: 305 LTSDKKRSFFTNFFRQGVK--------------IPSLPFQKFIIFLTDGENNNFKSNVNT 350 L +K + T G N + +N Sbjct: 300 LKQKVGTGGDGTSSGSPLKLVLLLTDGVQSQREWVTDKVTWKNGQATYGAYWNKVAPLNP 359 Query: 351 IKICDKAKENFIKIVTIS 368 C K + + Sbjct: 360 -DWCAYVKNQSATMAVLY 376 >gi|152994336|ref|YP_001339171.1| hypothetical protein Mmwyl1_0295 [Marinomonas sp. MWYL1] gi|150835260|gb|ABR69236.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 528 Score = 126 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 40/504 (7%), Positives = 104/504 (20%), Gaps = 107/504 (21%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA--SKMVSN 71 E L ++ ++ + F + ++ A + A LA N Sbjct: 26 FYQDEAGAVLPFAVLIIVGGVIAVSFAVDTTRMVNSSAQLKRATDVAALAIGNIQLTNGN 85 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + +D + +++ T ++ ++ + Sbjct: 86 DDDVDLQKIASGYVLSNLGMDSGLINQIEAEQVTVTKGEVDGSPTYKVSVSLIAQSDLLK 145 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIE-------------------------HLLN 166 + + T L Sbjct: 146 AGGQEQVIF--STVEVVSRPTEVALILPNSGTEDRGELAALRKVSKEFARNLLGDESAGV 203 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFL-----------------IELVVDLSGSMHCAMNS 209 + +S +P + + + + S+ Sbjct: 204 SPTQKVWLSLVPFSQSVNVYDESDPDRINRWAAFGELNPPELRSLFRTGKARSLADPRFP 263 Query: 210 D----------------PEDVNSAPICQDKKRTKMAALKNA------LLLFLDSIDLLSH 247 D + P Q + + +N + + D Sbjct: 264 DRKANLLCVHRGLSAGQNFFWDQPPSGQFEIYYRHDLPENGSPGAPPISWVGPNPDFPDT 323 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ED + + + ++ + + + + AM A L+ Sbjct: 324 QAEDTRWIVADKGCPDAALLPLTDNLAEIDARLDKMSTRFNV---NYAIAMSWAGAALSP 380 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFL----------------TDGENNNFK---SNV 348 + + + +I + DGE N + Sbjct: 381 NMRGGAGWGNNELPLDFSLDGNNVKVIVMLVNTIGDWFDTDAYNFNDGETTNDSMAFARQ 440 Query: 349 NTIKICDKAKENFIKIVTISI--NASPN---------GQRLLKTCVSSPEYHYNV----- 392 +C IK I + + L+ C S + Sbjct: 441 RFSDLCRDFNSKNIKFFFIGVRPGDPEDWGRTLFADVAGPGLRECASGDGNFHFADASNF 500 Query: 393 -VNADSLIHVFQNISQLMVHRKYS 415 + + I+ + Y+ Sbjct: 501 SEGQSQISSSLEKIADDIRRNYYA 524 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 126 bits (316), Expect = 7e-27, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 95/362 (26%), Gaps = 13/362 (3%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R ++ N + +FA++++ L IG I + +++M+ A ++ L + Sbjct: 22 RQVSRRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSRD 81 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + D S + K + + + S S+ ++ Sbjct: 82 LSQGTITAADVAAKASTYFKALYTSTDAQSVAVTASYTASTSSSASN-IQLNASGQIVTQ 140 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP--------- 178 M + + + DV L + + + + Sbjct: 141 FMKLVGFPTMTFNTKATTTWGDVKMRVALALDNTGSMAYSGKMTALQNAVAGSGGLIDQL 200 Query: 179 ALLRIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + L G+ I LI ++ G+ + A + P A Sbjct: 201 SALAKSPGDVYISLIPFAKVVNVGASNYAQSWIDWTDWQNPPTIQPNNGSYQAAIPNASF 260 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDST 295 D++ + G+ + + + + + + S Sbjct: 261 TQSQWDMVGPGSSCPFTSGNGFPYFSCTSGPATASSSASKVPSSGSYSGYICPGYDSASH 320 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + + + ++ T + S G + K N T Sbjct: 321 SYYNGCWNSVQNSTRVNWCTGSYCSCPTTGSNVPNNTCSCTGSGSSTVCKVNTFTHTWIA 380 Query: 356 KA 357 A Sbjct: 381 NA 382 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 75/245 (30%), Gaps = 36/245 (14%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V ++ H + + + + A+ A H++ +V Sbjct: 368 VCKVNTFTHTWIANATSTWTGCVADR-TQPNDANAVSPASSDVATLFPANQHMENNVQYC 426 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 +T++ + + S+ ++ V T+ M A Q L Sbjct: 427 SSSASTKLGQIVPLSYNWTSLKSAVNAM---EPTGGTNQAIGMAWAVQSLIP-------N 476 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---------------NVNTIKICDKAKEN 360 + + + + II L+DG N + + +C K Sbjct: 477 GVLGAPAEDANTTYNRVIILLSDGLNTEDRWPDYGNGSTQASGNPIDARQALLCSNLKNT 536 Query: 361 F-------IKIVTISINASPNGQ---RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I TI +N S +L+ C SSP+ Y + ++ ++ F +I + Sbjct: 537 KDSKGNAMYTIYTIQVNTSSPADPTSTVLQNCASSPDKFYMLTSSSQIVTTFNSIGTALS 596 Query: 411 HRKYS 415 + + Sbjct: 597 KLRVA 601 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 45/442 (10%), Positives = 113/442 (25%), Gaps = 75/442 (16%) Query: 34 LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + G + + + N +++A +AA LAG+ ++ + + ++ D Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQLPDDPDVTNGKVKAAVTANLALNDPD 60 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 A + S + N + + Sbjct: 61 ATDIQVTSGGATRS--------VCVDAKANVDMTLTKVIGIGDTTVTA--EACAGYNDIE 110 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPAL-----LRIEMGERPIFLIELVVDLSGSMHCAMN 208 L + + + + I L+ + + Sbjct: 111 LVLVLDSTGSMKGSPIDSAKDAARDLVNLIMPASTSSTRSKIGLVPFQGKVRIDGSDPVT 170 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE---- 264 ++ P C++ T + K + + + + G+ +T + Sbjct: 171 AERNPDGVGPGCRNADGT-LNTGKLKV-EYSRTATSTNIFYGYTLSGVSTFTDKTCSGMS 228 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK-------------- 310 S + + + T + +K ++L+ + Sbjct: 229 PIRALSSDKNTILNNIEAINAGAVTSGTLISEGIKWGRKVLSPEAPYVEGSTDKKVRKIM 288 Query: 311 ---RSFFTNFFRQGVK---------------------IPSLPFQKFIIFLTDG------- 339 T R G P + T Sbjct: 289 IVLTDGDTEDGRCGGNFASASKTVNTYWTNAYFGQGLKPDTATSPYATLSTATATLAQIP 348 Query: 340 -ENNNFKSNVNTIKICDKAKEN---FIKIVTISINA-SPNGQRLLKTCVSSP----EYHY 390 + K N + D AK + ++I ++ A + L++ SS +++Y Sbjct: 349 DCKDGGKLNQFVLDEADAAKNDLNYPVEIFSVRFGASDATDKSLMQKIASSKPGTTDHYY 408 Query: 391 NVVNADSLIHVFQNISQLMVHR 412 + ++ + +F+ I Q + R Sbjct: 409 DAPSSTGIQDMFKKIGQQLGQR 430 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 43/436 (9%), Positives = 107/436 (24%), Gaps = 48/436 (11%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R + +K + + + A+ + L L+G I + + +++ Sbjct: 18 ARPMRALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQA--------A 69 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 S + R + + + + Sbjct: 70 VDAAALAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTATATV 129 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 I M + + + + + Q + + L Sbjct: 130 PTIFMNALGIGSSEVHAKATA--GYNKLEVVFVIDNSGSMKGTPIQQT--NSAASQLVEL 185 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + V + + + + + D + Sbjct: 186 IMPEGMMTSVKVGLVPFRGKVHLPAGVDGLPDGCRNADGTLNPSWLHEEYFK------TS 239 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + T + + + E + +++ T + +K + Sbjct: 240 YRYPSGSSLNVPKNTCTSIPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHV 299 Query: 305 LTSDKK-----------------RSFFTNFFRQGVKIPSLPFQKFIIF--------LTDG 339 LT + T + G +T Sbjct: 300 LTPEAPFTEGSSAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSH 359 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVN 394 N K N ++ KAKE I++ I S + L+K+ SS +++Y+ + Sbjct: 360 CENGGKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPS 419 Query: 395 ADSLIHVFQNISQLMV 410 A + VF+ I + + Sbjct: 420 AYDIDDVFKKIGRQLG 435 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 42/432 (9%), Positives = 105/432 (24%), Gaps = 48/432 (11%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + +K + + + A+ + L L+G I + + +++ Sbjct: 2 RALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQA--------AVDAA 53 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S + R + + + + I Sbjct: 54 ALAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTATATVPTIF 113 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 M + + + + + Q + + L + Sbjct: 114 MNALGIGSSEVHAKATA--GYNKLEVVFVIDNSGSMKGTPIQQT--NSAASQLVELIMPE 169 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + V + + + + + D + + Sbjct: 170 GMMTSVKVGLVPFRGKVHLPAGVDGLPDGCRNADGTLNPSWLHEEYFK------TSYRYP 223 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 T + + + E + +++ T + +K +LT + Sbjct: 224 SGSSLNVPKNTCTSIPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPE 283 Query: 309 KK-----------------RSFFTNFFRQGVKIPSLPFQKFIIF--------LTDGENNN 343 T + G +T N Sbjct: 284 APFTEGSSAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENG 343 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVNADSL 398 K N ++ KAKE I++ I S + L+K+ SS +++Y+ +A + Sbjct: 344 GKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDI 403 Query: 399 IHVFQNISQLMV 410 VF+ I + + Sbjct: 404 DDVFKKIGRQLG 415 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 45/467 (9%), Positives = 120/467 (25%), Gaps = 68/467 (14%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + KG +K IIF + ++ F + + +E A A LA + Sbjct: 1 MQGFNMKGFTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIA 60 Query: 67 KMVSNLSRLGDRFE--SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + D + + K + + V + Sbjct: 61 SPGKDNKDDQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVA 120 Query: 125 SRISMTHMANNRLDSSNNTIF-------YNMDVMTSYDYRLQFIEHLLNQR---YNQKIV 174 + + +++N + + + L + + N N+ + Sbjct: 121 TAEHDSWISHNEIGVEPKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQM 180 Query: 175 SFIPALLRIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + + E + + D + + ++ + + Sbjct: 181 QVVKDTITRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEA 240 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPT 292 + +D + S V++ + ++ +I + ++ + + T Sbjct: 241 SFESIVDKMFDKSVVEQKPHYASDYNKSQ---DIPLTDKYQEFIDILNSNKVMPARGGGT 297 Query: 293 DSTPAMKQAYQ-----------------ILTSDKKRSFFTNFFRQGVKIPSLPF---QKF 332 S + A + IL+ + + K+ Sbjct: 298 QSWLGLIAAAKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKY 357 Query: 333 IIFLTDGENNNFKSNVNTIK-------ICDKAKEN--------------------FIKIV 365 + DG ++ +K +C+ K+ + + Sbjct: 358 YVDQYDGRTYYYQVYDKFLKSLVGEHGLCESLKKRISSKENKFQSEHAKLEGEKTKVTMG 417 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I +N + CV + Y+ N V++ I L+ Sbjct: 418 VIGVNYNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEE 460 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 124 bits (310), Expect = 4e-26, Method: Composition-based stats. Identities = 46/439 (10%), Positives = 108/439 (24%), Gaps = 58/439 (13%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN-- 71 + ++ FAL + + + + K +++A + A+LA S+ + Sbjct: 19 FFRARSGQVAVTFALVTLPVMFATAAAVDYGRRNAAKTQLDAALDGAVLAVMSQKTNTIP 78 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + L + A + F + S T +S + + Sbjct: 79 TTTLQNMETQFRTEAAKVPGVTVTSFTPGAPVNTSKTLSLTASYTATVKTSLASMMQIPA 138 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 M + S+ +Y L + M Sbjct: 139 MPVSGTSSATRNTSQY----INYYLLLDNSPSMG---------LAATDADVQNMKIATNG 185 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + D ++ N ++ + ++ L+ A+ +D ++ + + Sbjct: 186 CAFACHQHTFDKKGNITGDDQNDNYHIALRNNIKLRIQVLREAVSALVDQANVSMLLPQQ 245 Query: 252 VYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 M + + V + + ++ + Q Sbjct: 246 FQMEMWTFNDSVTQTKLQAMTPTLNNIKNAAPNIDIAYAYYNQSDN-------QTDFERA 298 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----------------NTIKI 353 T G + +F+ +TDG + S + Sbjct: 299 IARMNTTIPASGDGLTPDKPIRFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFIGPLSPST 358 Query: 354 CDKAKENFIKIVTISINA-SPNGQRL---------------LKTCVSSPEYHYNVVNADS 397 C K +KI I L+ C S ++ V Sbjct: 359 CSALKNKNVKIGIIYTQYLPIYDNDFYNRYVRPYESQIGPSLQACASD-GMYFPVTTNGD 417 Query: 398 LIHVFQNI-SQLMVHRKYS 415 + + S + + S Sbjct: 418 ITAAMLKLFSTTLASVRLS 436 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 46/431 (10%), Positives = 110/431 (25%), Gaps = 53/431 (12%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ F+I F + L + F + + ++ + A A LA + Sbjct: 7 LLASIRRFKQDRSGAFAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQAALALTA 66 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI--------KESLSGYSAVFYNTEI 118 + + ++ + S++ + + D F I ++LS + Sbjct: 67 EDNGDGAQRN--YTLSSDYFRAYMRHDVDVFKPTVIVKSGISPNNQNLSYVEYRVSGQTL 124 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-----LNQRYNQKI 173 Q+ SS + + ++ +N + F + + + + Sbjct: 125 QDSWFSSTFFPSF--DKQVVIGDNGAARKFRSNMDVIFVTDFSGSMNEGFGGSTKLAELK 182 Query: 174 VSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + + + + C + I + Sbjct: 183 RIVLKLSDELFSYNIDNKVGFVPFGWGGKEGIDCDFPFVSHGPVPSDILAGGNYKALEKY 242 Query: 232 KNALLLFLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + + ++ G + + + + + Sbjct: 243 IDISGSVAAIPNPVHDIQIPLSNVNGSTCLRNSHSWKVPLTSSLAE----INQINGMSAE 298 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + L S G D +N N+ Sbjct: 299 GGTLVSSGVLLGVPYLASGTASRKVMVIVSDGTD--------------DPKNVMITPNLI 344 Query: 350 TIKICDKAK-----ENFI-KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH--- 400 +CDK + + + KI I I P K+CV + Y D L Sbjct: 345 NAGMCDKIRQVLSTDESVGKISFIGIAYYPTVD--WKSCV-GDKNFYLPQTIDELEEDLR 401 Query: 401 --VFQNISQLM 409 VF+ + + Sbjct: 402 RAVFEEVGHNI 412 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 45/434 (10%), Positives = 105/434 (24%), Gaps = 62/434 (14%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + +F L ++ ++ +G + ++ + +A +AA+L+ +K +L L Sbjct: 24 RSGSVGFVFGLLLLPMMVAMGVSVDYARVSAARSDLAAAADAAVLSVTNKAAMSLDMLSA 83 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + K + + + + L Sbjct: 84 QARVRDAFLKNIQTMPDISGVSADAVV---IDLLGVRAATLSYTASYRTAFSGILGMRTL 140 Query: 138 DSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 S N + + L + + I + Sbjct: 141 SVSGNAASKSAVPIYMDFYLLLDNSPSMG-----------------VGATSADISTMVSR 183 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED----- 251 + C S ++ ++ A +D+ + V Sbjct: 184 TPDKCAFACHDLSAGNSDYYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTAL 243 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 MG + + S + D + ++ + + Sbjct: 244 YTMGADCASVGLTTVSPLSSDLAAAKTNAQAI-DLMTIQKPGYNNDQCTDFDGVFQSLNG 302 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-------------CDKAK 358 G +L QK + ++DG + + + T K C K Sbjct: 303 KIDV----AGDGSTALTPQKVVFLVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLK 358 Query: 359 ENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYNVVNADSLIHVF 402 IKI T + N ++ C S P ++ V + Sbjct: 359 NRGIKIAVLYTTYLPLPTNDWYNTWIAPFQATLPSAMQGCAS-PGLYFEVSPTQGIADAM 417 Query: 403 QNISQ-LMVHRKYS 415 + Q + + + Sbjct: 418 TTLFQRTVSQPRLT 431 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 46/396 (11%), Positives = 112/396 (28%), Gaps = 63/396 (15%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 + F + V + + +A +A+ AG + +I+ A + + Sbjct: 30 MTMFTVDVAYMQLVRTELRAATDASAKAGMEALRRTQDTEAAIDAAIATAAANKVGGRSL 89 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + I+ L+ + + + + + + + Sbjct: 90 TLTADQIEFGLAFRNVDNSVSFNAGQLPYTAVRVNSAMTESSAAGA-------------- 135 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 F + + S + A + I +D S SM + Sbjct: 136 -VPLFFGSIFGTGQFEPTRSAVSASTEV--------EICFAIDRSHSMCFDLTGVDWSYP 186 Query: 216 SAPICQDKKR--------TKMAALKNALLLFLDSIDLLSH--------------VKEDVY 253 ++ A+L A+ F+ Sbjct: 187 PGTPRNPDPVAFPPHPTLSRWASLSRAMQTFVSITASQEPKPRVAMVTWASKITQSNYEG 246 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + V ++ + + Q + + ++L T+ + +A +IL + Sbjct: 247 KLTKTNSPEVFVDVPLTTNLADLNQAIKGRSEKVMLGATNMAAGIDEARKILNAT----- 301 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K + II +TDG N ++ + A I I ++S+ Sbjct: 302 ---------KSTRPYAHRIIILMTDGLWNQGRNPLL---AAQDAANEGIVIHSVSL-LPR 348 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +G + ++ +Y N+ +L F +I++ + Sbjct: 349 SGDITPQVSSTTGGVNYPATNSAALEAAFADIARTL 384 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 123 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 41/432 (9%), Positives = 104/432 (24%), Gaps = 48/432 (11%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + +K + + + A+ + L L+G I + + +++ Sbjct: 2 RALSALLRRQKGSMATLVAVLLPVVLGLVGLGIDSGMLYLSHSRLQA--------AVDAA 53 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S + R + + + + I Sbjct: 54 ALAGSLQLPYDPQLDKGLVRGAVTQYMDANYPEASLNGVTPGTEERSVTVTATATVPTIF 113 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 M + + + + + Q + + L + Sbjct: 114 MNALGIGSSEVHAKATA--GYNKLEVVFVIDNSGSMKGTPIQQT--NSAASQLVELIMPE 169 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + V + + + + + D + + Sbjct: 170 GMMTSVKVGLVPFRGKVHLPAGVDGLPDGCRNADGTLNPSWLHEEYFK------TSYRYP 223 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 T + + + E + +++ T + +K +LT + Sbjct: 224 SGSSLNVPKNTCTSIPRVQGLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPE 283 Query: 309 KK-----------------RSFFTNFFRQGVKIPSLPFQKFIIF--------LTDGENNN 343 T + G +T N Sbjct: 284 APFTEGSSAKDIRKVIIVLTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENG 343 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSS----PEYHYNVVNADSL 398 K N ++ K KE I++ I S + L+K+ SS +++Y+ +A + Sbjct: 344 GKLNAAMLEEARKVKEAGIEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDI 403 Query: 399 IHVFQNISQLMV 410 VF+ I + + Sbjct: 404 DDVFKKIGRQLG 415 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 45/462 (9%), Positives = 117/462 (25%), Gaps = 68/462 (14%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 KG +K IIF + ++ F + + +E A A LA + + Sbjct: 1 MKGFTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKD 60 Query: 72 LSRLGDRFE--SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 D + + K + + V ++ Sbjct: 61 NKDDQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHD 120 Query: 130 THMANNRLDSSNNTIF-------YNMDVMTSYDYRLQFIEHLLNQR---YNQKIVSFIPA 179 + +++N + + + L + + N N+ + + Sbjct: 121 SWISHNEIGVEPKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQMQVVKD 180 Query: 180 LLRIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + E + + D + + ++ + + + Sbjct: 181 TITRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEASFESI 240 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPA 297 +D + S V++ + ++ +I + ++ + + T S Sbjct: 241 VDKMFDESVVEQKPHYASDYNKSQ---DIPLTDKYQEFIDILNSNKVMPARGGGTQSWLG 297 Query: 298 MKQAYQ-----------------ILTSDKKRSFFTNFFRQGVKIPSLPF---QKFIIFLT 337 + A + IL+ + + K+ + Sbjct: 298 LIAAAKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQY 357 Query: 338 DGENNNFKSNVNTIK-------ICDKAKEN--------------------FIKIVTISIN 370 DG ++ +K +C+ K + + I +N Sbjct: 358 DGRTYYYQVYDKFLKSLVGEHGLCESLKNRISSKENKFQSEHVKLEGEKTKVTMGVIGVN 417 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + CV + Y+ N V++ I L+ Sbjct: 418 YNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEE 455 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 45/462 (9%), Positives = 118/462 (25%), Gaps = 68/462 (14%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 KG +K IIF + ++ F + + +E A A LA + + Sbjct: 1 MKGFTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKD 60 Query: 72 LSRLGDRFE--SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 D + + K + + V ++ Sbjct: 61 NKDDQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHD 120 Query: 130 THMANNRLDSSNNTIF-------YNMDVMTSYDYRLQFIEHLLNQR---YNQKIVSFIPA 179 + +++N + + + L + + N N+ + + Sbjct: 121 SWISHNEIGVEPKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQMQVVKD 180 Query: 180 LLRIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + E + + D + + ++ + + + Sbjct: 181 TITRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEASFESI 240 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPA 297 +D + S V++ + ++ +I + ++ + + T S Sbjct: 241 VDKMFDKSVVEQKPHYASDYNKSQ---DIPLTDKYQEFIDILNSNKVMPARGGGTQSWLG 297 Query: 298 MKQAYQ-----------------ILTSDKKRSFFTNFFRQGVKIPSLPF---QKFIIFLT 337 + A + IL+ + + K+ + Sbjct: 298 LIAAAKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQY 357 Query: 338 DGENNNFKSNVNTIK-------ICDKAKEN--------------------FIKIVTISIN 370 DG ++ +K +C+ K+ + + I +N Sbjct: 358 DGRTYYYQVYDKFLKSLVGEHGLCESLKKRISSKENKFQSEHAKLEGEKTKVTMGVIGVN 417 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + CV + Y+ N V++ I L+ Sbjct: 418 YNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEE 455 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 49/499 (9%), Positives = 128/499 (25%), Gaps = 111/499 (22%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ ++ L ++ + + + + + + + + + AA +A + +L+R Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIEDQPDLAR 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + E + Y + +S S + Sbjct: 76 GLATQYVENYVRDIKSTNLSAQRFHQTEDEGAGILEYIQYTVNAKTTHDSWFASSFIPSF 135 Query: 135 N-RLDSSNNTIF-----YNMDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMG 186 + + D + ++ Y D + F + K + + + Sbjct: 136 DEQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMDDRWGSSRHKKIDDLKTAIDQISS 195 Query: 187 E--------------------------------RPIFLIELVVD----LSGS-------- 202 + + + V +SG Sbjct: 196 KILCTSTDLEYVDGEWKEVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSGGQANATSQL 255 Query: 203 ------------MHCAMNSDPEDVNSA---PICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + + ++ K+ + I + + Sbjct: 256 SYKPNYKPNVSPYSYNDVNWDYWRAYSQNEVLNCANWQSYCPNPKSDNQKYAKRIKDVIY 315 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVR------------------------------ 277 + + Y +R Sbjct: 316 LDNYHVADVYNYVDLSTSVATMFTDKSGLRPNFYGVNGTDLFNAHGSSSSTQFKNIRLSN 375 Query: 278 --QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T + + + QIL S + K K ++ Sbjct: 376 KLSDLNPISSMWADGGTAAFQGILRGSQILKDGDPNSSDDEEQQAYNKK-----IKMLLI 430 Query: 336 LTDGE---NNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHY 390 L+DG+ NN + +CDKA+E + I I I+ + Q + CV P Sbjct: 431 LSDGQESPNNGILKGLVDRGMCDKAREEIPGLYIGVIGIDFRASQQSGFQDCVIDPNE-- 488 Query: 391 NVVNADSLIHVFQNISQLM 409 ++++ +L + + I +L+ Sbjct: 489 DIIDVSNLDELIEKIEELI 507 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 46/426 (10%), Positives = 109/426 (25%), Gaps = 51/426 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I + + +++FA+ + +F + K +E + Sbjct: 4 IRKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLED-----------------AA 46 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 +++A + K + + Y I + + Sbjct: 47 EAAVLAIAAHNADNSGSSSGSAINKKIASDWIGQYMQDMQAISDIKITKLNCNDIAECKE 106 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + Y + T++ + + + + + Sbjct: 107 GLENGESRYFQYEILAKTNHLSWFPGNNSTAGFGE---SFDVVGSATARKFQSESV-DVM 162 Query: 195 LVVDLSGSMHCAMNSDPED--VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 V D SGSM+ + ++ NA + + + Sbjct: 163 FVSDFSGSMNNKWSGGSNSRRYKDLIKIIGDVIKELDKFNNAHTTTTNRVGFT---GFNT 219 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--------TDSTPAMKQAYQI 304 Y + + + S G + + + S TD+ K + Sbjct: 220 YTRKTADNSCYQDQYDRSAGRTVNKIFEVKGCKSRSSGGAKFHDIAMTDNYNEFKNTIKY 279 Query: 305 LTSDKKRSFFTN---FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT--IKICDKA-- 357 + + + P ++ +I L+DG ++ +C K Sbjct: 280 FKPGGGTASYQGIIRGAQMMDAAPEPRPRRIMIILSDGIDSKRSRANKLVEEGMCSKILL 339 Query: 358 ----------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K K+ + + +P L CV + D L + + IS+ Sbjct: 340 KLGNANTSDGKAIKTKMAVVGFDYNPASNPSLAKCVGEHNVYGANNPEDVLNKILELISE 399 Query: 408 LMVHRK 413 + H K Sbjct: 400 EIGHLK 405 >gi|84385675|ref|ZP_00988706.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] gi|84379655|gb|EAP96507.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] Length = 520 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 48/499 (9%), Positives = 133/499 (26%), Gaps = 111/499 (22%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ ++ L ++ + + + + + + + + + AA +A + + +R Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIEDQPDQAR 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + E + Y + +S S + Sbjct: 76 GLATKYVENYVRDIKSTNLSADRFHQAEDEGAGVLEYIQYTVNAKTTHDSWFASSFIPSF 135 Query: 135 N-RLDSSNNTIF-----YNMDVMTSYDYRLQFIEHLLNQRYNQK--IVSFIPALLRIEMG 186 + + D + ++ Y D + F + ++ + + + + + Sbjct: 136 DQQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMNDRWGSSRHIKIDDLKTAIDAISS 195 Query: 187 E--------------------------------RPIFLIELVV---------------DL 199 + + + V L Sbjct: 196 KILCTSIKQDYVDGEWKYVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSGNQARATSQL 255 Query: 200 SGSMHCAMNSDPEDVN------------SAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 S + N P N + I ++++ + K+ + I + + Sbjct: 256 SYKDNYKTNVSPYSYNDVNWDYWRTYSQNDVIRCADRQSRCSYPKSENQKYAKRIKDVIY 315 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVR------------------------------ 277 + + Y ++ Sbjct: 316 QDNYLVADVYNYVDLSTSVSTMFTDKSGLQPDFYSVSGTSLFNAHGSSNSSQFSNIRLSN 375 Query: 278 --QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T + + + Q+L S + K K ++ Sbjct: 376 KLSDLNPINSMWADGGTAAFQGILRGSQVLHDGDPNSSDQEEQQVYNKK-----IKMLLI 430 Query: 336 LTDGE---NNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHY 390 L+DG+ NN + +CDKA+ + I I I+ + Q + CV P Sbjct: 431 LSDGQESPNNGILKGLVDRGMCDKARNEIPGLYIGVIGIDFRASQQSGFQDCVVDPNE-- 488 Query: 391 NVVNADSLIHVFQNISQLM 409 ++++ +L + + I +L+ Sbjct: 489 DIIDVSNLDELIEKIEELI 507 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 54/472 (11%), Positives = 111/472 (23%), Gaps = 98/472 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + +I+FAL+++ L L+G I K +++A +AA+LAG ++ Sbjct: 18 LASDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIA 77 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS---------AVFYNTEIQNIVNS 124 + ++ + K F N K + S + Sbjct: 78 SNAKQGDATAAGLTAGRNQATKAFAINTGKVPFATVSVSRLDVTRSGQTLTATVIYTATI 137 Query: 125 SRISMTHMANNRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + +N + + + + + A Sbjct: 138 QNTFGKILGLSSTTFTNTITASADLASYLDFYLMVDVSGSMGLPT------AAADAEKLA 191 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + + G + + + A+ NA+ L Sbjct: 192 SITKEDQGNCQFACHFPGRKGWNNAAGK------------IQLRSDAVNNAVCELLKRAA 239 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V +G + R+ S T + P T + Sbjct: 240 T-PVVPNQYRIGFYPFINRLATLSPLS-DTTTSMTALRTAAQCDKTWPLAFTNLLDTGST 297 Query: 304 ILTSDKKR----------------SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 L + G + + F+ +TDG N+ + Sbjct: 298 QLFTGNNPTTGTGSGGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLITDGMQNSQSYS 357 Query: 348 VNTIKI-----------------------------CDKAKENFIKIVTISI--------- 369 N C + K+ I + I Sbjct: 358 TNNDARTFPGSPSLFKGYGNAGWDGSQPAQIDPSKCKELKDAGAIISILYIPYNQVKNYT 417 Query: 370 -------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 SP L+ C S + Y +AD + + Sbjct: 418 NDSYIVWENNRVNGFSPTLADPLRKCASQ-GFFYTANSADDITASLGAMFDQ 468 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 48/467 (10%), Positives = 113/467 (24%), Gaps = 97/467 (20%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA----- 59 SR R + A+E+ + ++IF L+++ +++G + ++ + + Sbjct: 13 SRVRKRLRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAA 72 Query: 60 ------------AILAG-ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 A AG + ++ + D R L Sbjct: 73 VKAAAQKESDCVANPAGNNVSNFQGCGQKDIIKAGVAAGVQYMNGDPLMRGADRKPTIEL 132 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM---TSYDYRLQFIEH 163 S + T + + M + + ++ L Sbjct: 133 SSSEGSWSAT-VNYSADIPTNIARLMGVQTIPVNGKVTSNIALGTHMYLNFHLLLDRSMS 191 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + I I ++ + + + H Sbjct: 192 MG-----------------IGATSDDISRLQALTGCAFACHSEGYEAQYYDQPKAQGIRF 234 Query: 224 KRTKMAALKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++ L++A + ++ +E + MG+ + V +E + V V Sbjct: 235 ---RIDDLRDATGALVAQAKMVASANAREHIQMGVYAFNHHVSPLVEMTSDLTNVANAVK 291 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A+ G + S + + +TDG Sbjct: 292 NLDLPTHDDGTQAADAVTWLVA-----------NKIKGNGTGLTSAAPLEIVFLVTDGVE 340 Query: 342 NNFK-------------------------SNVNTIKICDKAKENFIKIVTIS-------- 368 + ++ + CD K + + Sbjct: 341 DGIYTGWNKMVGPTGLPLPWWPSWMTKAPTSAFPVTACDALKSKGAIVAVVYTTYVPFPG 400 Query: 369 -INAS-------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + PN L+ C S Y + + Q++ Sbjct: 401 TVQYDRLIGPFAPNISPNLQGCASQ-GYFFTASEPGDITRGMQSLFN 446 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 57/496 (11%), Positives = 125/496 (25%), Gaps = 109/496 (21%) Query: 5 SRFRFYFK---KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES------ 55 S FR + + + E+AN SIIFA++++ L IG + ++ ++S Sbjct: 16 SAFRGFLQQALRLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAAS 75 Query: 56 --ANNAAILA--GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + + L A +M S+ + E+ A + + + +G Sbjct: 76 VASISQQSLGYNAALQMTSDGTVQVAVEEATKLFNGNAANSLGYTNLSLNAQVMKTGVKL 135 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYN 170 ++ MT + +L + + + + L + + Sbjct: 136 AATVAFSADV---PTTFMTVVGYRKLTVTGTSKSTSSLPPYLDFYLMLDVSGSMGLPSTD 192 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS----DPEDVNSAPICQDKKRT 226 + P + + + H S + N+ C + Sbjct: 193 AE--------QTRLAAINPDNYKQYPNGCTFACHFTSASSCPAANQKYNTNGSCMGYPMS 244 Query: 227 -------------------------------------------KMAALKNALLLFLDSID 243 + A+ A+ L + + Sbjct: 245 RVSYSGVKKLLTNNGGKLPSSLLSSLTAVTSCPTDGSDACIQLRADAVGAAVQQLLVTAN 304 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILK-PTDSTPAMK 299 +GL + + + G+ + +L + T + ++ Sbjct: 305 ATQKTPNQFRIGLYPFVRYLYAYSPLTASINGSPTTPGTINHAAANLASQLDTGANASLG 364 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-----------NFKSNV 348 + G S Q ++ +TDG N + + Sbjct: 365 SGGTHFENAFPT-MNGIITSVGDGSASNKTQPYVFLITDGAQNPQVYWNGSWSGSNSATT 423 Query: 349 NTIKICDKAKENFIKIVTISINAS--------PNGQRL------------LKTCVSSPEY 388 C K I + + I N + L+ C S P Y Sbjct: 424 MDTSKCTTLKSRGIIVSVLYIPYQPIQNPTSFANSEDFYANANIPKIPPSLQACAS-PGY 482 Query: 389 HYNVVNADSLIHVFQN 404 Y + + Sbjct: 483 FYTANSPADITAALNA 498 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 52/390 (13%), Positives = 109/390 (27%), Gaps = 67/390 (17%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 ++ A+ + FL+++ F I + H + + S+ +AA A A+ + L R Sbjct: 1 MLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQRG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + + S N + + Sbjct: 61 QQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNA----------GEAPFNGVRVNG 110 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 ++ + I + I LVVD SG Sbjct: 111 QRTTGSLSGPVPLFF---------GNVTGTSIFEPEAFATATYVER----DITLVVDRSG 157 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM ++ L+ A+ +F D + ++ +GL Y Sbjct: 158 SMA--------------------GSRFNDLQAAIRIFTDLLATTPVDEQ---IGLASYND 194 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 R ++++ + +V T + M+ Sbjct: 195 RASEDVQLTENFAEVN---NAMDRLRTGGFTSISRGMQAG---------------QEIAL 236 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 P ++ +I +TDG +N + + + I TI+ A + R+ Sbjct: 237 RGRPPEFVERTMIVMTDGRHNRG---PEPRVVATDLAADGVTIHTITFGAGADFGRMQDV 293 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 H++ N D L +++ I+ + Sbjct: 294 ARIGGGRHFHATNGDQLRDIYREIALTLGT 323 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 119 bits (299), Expect = 5e-25, Method: Composition-based stats. Identities = 34/322 (10%), Positives = 83/322 (25%), Gaps = 40/322 (12%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES----------ANNAAILA 63 I+ NF I+ AL ++ L G + + + + A + +A Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A M SN + + ++ + + + A+ + I + + + Sbjct: 68 AAMAMNSNGTVSLGKTDARNIFMSQMSGELAEVQVDLGIDVTKTANKLNSQVSF---TAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + + S Y + L + Sbjct: 125 VPTTFMQILGRDSITISGTATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +E + + H D N ++ ++ A D+ Sbjct: 168 VGATPDDVSKLEAKAGCAFACHQM---DKTINNYTIAKSLGVAMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + MG+ + T+ E + KV+ Y +D + + + Sbjct: 225 KTERVSSDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDA-VDLMTIPYQNYNND 283 Query: 298 MKQAYQILTSDKKRSFFTNFFR 319 + + Sbjct: 284 QITNFDSAMTQMNTIIDQAGDG 305 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats. Identities = 42/437 (9%), Positives = 100/437 (22%), Gaps = 80/437 (18%) Query: 42 YVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + +++ +++A ++A LAG+ ++ + + + A Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQLPDDPDVSTGAVTAAATQNLLA--------NDAD 68 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 L + + + + L Sbjct: 69 ATGILVESGGATRSVCVSAEAKVEMTLSQVIGIGDQTVTA--EACAGYNDIELVMVLDAT 126 Query: 162 EHLLNQRY------NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + +V+ I I L+ + + + ++ Sbjct: 127 GSMKGTPIANVKEAATNLVNLIMPSSSSTSTRSKIGLVPFQGKVRIDGNDPVTAEANPDG 186 Query: 216 SAPICQDKKRT------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 P C++ T + K + + V + + Sbjct: 187 VGPGCRNADGTLNNGKLRTEYSKTTTKTNI----FYGYTLSGVSTTSDKTCSGMSPIRAL 242 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK-----------------RS 312 S + +T + T + +K +LT Sbjct: 243 SSDKSAILSNITALNAGQVTSGTIISEGIKWGRHVLTPTAPYVEGSTDTKVRKIMIVLTD 302 Query: 313 FFTNFFRQGVKI---------------------PSLPFQKFIIFLTDG--------ENNN 343 T R G P + T + Sbjct: 303 GDTEDGRCGGSYASASKTINTYWTNAYFGQGLKPDSSASPYSTLSTAALTLAQIPDCKDG 362 Query: 344 FKSNVNTIKICDKAK---ENFIKIVTISINA-SPNGQRLLKTCVSS----PEYHYNVVNA 395 N + AK I+I +I + L+K SS +++++ + Sbjct: 363 GLLNTYVVNEATLAKTDANYPIEIFSIRFGDSDSTDKNLMKQIASSKPGTEDHYFDAPDE 422 Query: 396 DSLIHVFQNISQLMVHR 412 + +F+ I Q + R Sbjct: 423 AGIKEMFKKIGQQLGQR 439 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 29/280 (10%), Positives = 70/280 (25%), Gaps = 14/280 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SR +F++ ++ +I FAL + +L+ LI + +++ M+ A +AA L Sbjct: 1 MSRLTRFFRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLM 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A + L + L + + + T I N Sbjct: 61 AARSTAVTDAELESVGDPAFLAEIAGLNLGLSASNASFKAGAGNHIIGTATATVKPIIAN 120 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + L + R + + Sbjct: 121 LWT-------TDDFNLTATSDVVRSSKNLEVAVVLDITGSMSGSRITDLKTGASDLVDIV 173 Query: 184 EMGERP-----IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ + ++ V ++ + + K A Sbjct: 174 VKDQQAPFYSKVAIVPYSVGVNVGTYADAVRGAVIARTIT--GVSKTNAAVVASAAHGFI 231 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + +S V + Y + Sbjct: 232 VGDKVTISGVSGPTMLNGNTYNITAASADSFTINANTSNA 271 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 35/344 (10%), Positives = 88/344 (25%), Gaps = 42/344 (12%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 S + V + I+ T ++ +++ + ++ + Sbjct: 191 SVGVNVGTYADAVRGAVIARTITGVSKTNAAVVASAAHGFIVGDKVTISGVSGPTMLNGN 250 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-TKM 228 I + I + SA ++ + + Sbjct: 251 TYNITAASADSFTINANTSNAPKYVSGGVATCDTSTNPGCLNFTFTSASNTKETRTLSTC 310 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYM-----GLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + D ++ V + + S ++ + Sbjct: 311 VTERTGTYAYTDIAPSIAPVGRNYPTTKLENSMQPNPCPTATITPLSSDRVTLKGQINAL 370 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN- 342 T + +++ + + ++ S K ++ +TDG N Sbjct: 371 SI---GGSTAGQIGFAWGWYMVSPNFGYLWPNA-TQRPAPYNSKDLVKVVVLMTDGAFNT 426 Query: 343 ---------------------------NFKSNVNTIKICDKAKE--NFIKIVTISI--NA 371 N + T K+CD K+ + I T+ Sbjct: 427 PYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPSLKLTIFTVGFDVGG 486 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 N +LK C + ++ Y L F++I+Q + + + Sbjct: 487 DANAVNMLKYCATDAQHVYFPATGSELKTAFKSIAQEISSLRIA 530 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 59/473 (12%), Positives = 121/473 (25%), Gaps = 98/473 (20%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFE--- 80 + F + F+ LI + K+ + A A LA +++ R + Sbjct: 1 MSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYL 60 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 S + + + GY+ + ++ + + + D + Sbjct: 61 VTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVN 120 Query: 141 NNTIF----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI------------- 183 N ++ + F + + + + I L + Sbjct: 121 ENGAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFS 180 Query: 184 EMGERPIFLIELVVD----------LSGSMHCAMNSDPEDVNSAPICQDKKRT------- 226 G I + S H + D N + + Sbjct: 181 NDGINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGL 240 Query: 227 ----KMAALK---------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-- 271 + L +A+L ++ + + + + K + S Sbjct: 241 DNLSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTIT 300 Query: 272 -------GTEKVRQYVTRDMDSLILKPTDSTPAM----KQAYQILTSDKKRSFFTNFFRQ 320 ++ + R+ +++ + D + + L D N Sbjct: 301 KIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILN 360 Query: 321 GVKIPSL-----------------PFQKFIIFLTDGENNNFKSNVN-------------- 349 K +I L+DG++N K + Sbjct: 361 MKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKL 420 Query: 350 -TIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLI 399 T +C K K+N IK+V I I N K CV + Y NA L Sbjct: 421 ITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 472 >gi|312882153|ref|ZP_07741902.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309370199|gb|EFP97702.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 498 Score = 118 bits (294), Expect = 2e-24, Method: Composition-based stats. Identities = 47/487 (9%), Positives = 121/487 (24%), Gaps = 90/487 (18%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + ++ ++ L ++ + + + + + + + + + AA +A Sbjct: 3 IRQTQNSSLKKQQGVAAVWMGLLLVPIMGVTFWAVEGSRYVQESSRLRDSAEAAAIAVTI 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKR---FIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + S ++ K Sbjct: 63 EDRKGRADSMAEEYVRSYVRDIESTALTTSRDYRARDDAKGIREYIEYTVDARTTHKSWF 122 Query: 124 SSRISMTHMANNRL---DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S + L + + + F + + S Sbjct: 123 VSNFIPSFDRKQELAGRSVAGKEPIQPGEDTIDIVFVSDFSTSMYGSKLRVLKSSIDQIS 182 Query: 181 --------------------LRIEMGERPIFLIELVV----------DLSGSMHCAMNSD 210 + G I + + + S +N D Sbjct: 183 IKILCESPRYDQQISSYICDDKRATGVNRIGFVPFNIRTREMIWSRDGRAVSQLVYLNGD 242 Query: 211 PEDVNSAPICQDKKRTKMAA-------LKNALLLFLDSIDL-------LSHVKEDVYMGL 256 D R +M++ +N + V + + Sbjct: 243 ASVSTYTYNDIDWDRWRMSSKESVEKCAQNYFNCNVPQWQNHQRAKRIFDVVGDYSDPDM 302 Query: 257 IGYTTRVEKNIEPSWGTEKVRQY------------------------------VTRDMDS 286 Y + + + + + Sbjct: 303 FDYIDFDATVSDMFNDKSRSKSNFYRTTKGVRLYSGSLHAQFQNILLTNDLLDLEAIQNM 362 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + + + Q+L + +Q + + K ++ L+DG+ + + Sbjct: 363 TAAGNTAAYQGILRGLQVLKQGQPSIDADEEVQQAYQNKN----KMLLILSDGQETDERI 418 Query: 347 NVNTI--KICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + K+CDKA+E + I I I S + Q K CV + + ++++ +L + Sbjct: 419 INELVKNKMCDKAREEIPGLYIGVIGIKFSASDQNGFKECVENEDE--DIIDVSNLNELI 476 Query: 403 QNISQLM 409 + I +L+ Sbjct: 477 EKIEELI 483 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 53/558 (9%), Positives = 115/558 (20%), Gaps = 165/558 (29%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R K EK ++++ AL L+LI F + K + A + A L Sbjct: 18 TRCFQKLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALLL 77 Query: 65 ASKMVS-----------------------------------NLSRLGDRFESISNHAKRA 89 ++ + N + + +A Sbjct: 78 VAENNAYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENKA 137 Query: 90 LIDDAKRFIKNH-----------------------IKESLSGYSAVFYNTEIQNIVNSSR 126 D+ + + + G + + + + Sbjct: 138 QKDNHLPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVKTH 197 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN---------------- 170 + N S + + F +L N Sbjct: 198 TKNGRLPINSGISYAV-KEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILRS 256 Query: 171 --------------------QKIVSFIPALLRIEMGERPIFLIELVVDLSG--------- 201 + F + SG Sbjct: 257 VVSDIQNILFPTKLSEDASPYNRMGFAAFAGGTRQRGDKNSCVMPYYLKSGVHDFRVAYW 316 Query: 202 ---SMHCAMNSDPEDVNSAPICQDKK---------RTKMAALKNALLL------------ 237 S + + + + AL+ AL Sbjct: 317 QLDSFNYRGSPWDCKDTNVLDDRGNPRPVNACLIKGNPEDALRTALNDRHLSTSMKLIFE 376 Query: 238 -------FLDSIDLLSHVKEDVYMGLIGYTT-----RVEKNIEPSWGTEKVRQYVTRDMD 285 + ++ + + Y +W T+ + Sbjct: 377 DVLDVDKTIKQVENFDGNRVNDYKLTYNNPDHCLGGNEGVETSQAWFTKSKPKVAEALSK 436 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++ +L Q+ ++ L+DGE+N Sbjct: 437 IKPTGSTAASSGFIIGANLLMDK------NTVPEAQPAKLGTNTQRILMVLSDGEDNRPT 490 Query: 346 SNVNT----IKICDKAKEN--------------FIKIVTISINASPNGQRLLKTCVSSPE 387 + T +CD ++ I P + + CV Sbjct: 491 FDTLTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQPPPEQKAAWQKCV-GEN 549 Query: 388 YHYNVVNADSLIHVFQNI 405 +Y + ++L+ F+ I Sbjct: 550 NYYEPSSKEALLDAFKQI 567 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 29/380 (7%), Positives = 82/380 (21%), Gaps = 35/380 (9%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +++ Y +K A + N ++I A SV+ + +G + + + ++ A +A +A Sbjct: 1 MTKVSRYLRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRAL-IDDAKRFIKNHIKESL---------SGYSAVF 113 + ++ + + N + S Sbjct: 61 ATIDPTATPTQTTREAVAKKAFCGNIKQSGGLQNSFCNTTTLDTLGTASATLSTATSNNI 120 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 + + + + +D + L + + + Sbjct: 121 MTVTYSATAHVPTYLLGLVGIDTVDIDAVAKSGVSTSTAEVAFVLDNTGSMSSNNKMTYL 180 Query: 174 VSFI-------PALLRIEMGERPIFLIELVVDL---SGSMHCAMNSDPEDVNSAPICQDK 223 S + + + L+ + + + D V C Sbjct: 181 KSSLDAVLASMLDSTGKNYAKTKVALVPFDTQVSLSNVAGMVDYAGDFSTVTPTYSCSGY 240 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + + S+ Sbjct: 241 SSAQCQVISENASAMCNGNATCLSNNRNYTRSWTSNGNTYFGVFATSYYQSS-------- 292 Query: 284 MDSLILKPTDSTPAMKQ-------AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 T + LT + + + Q + ++ + Sbjct: 293 NTYRSYGNTYYYTYIAWRQVVYRVNSSTLTLNSTNNGGDYYTYQAYYNQPNNYSRYYGAV 352 Query: 337 TDGENNNFKSNVNTIKICDK 356 T N + I Sbjct: 353 TYSTPTAGGYNSTSTTIIKD 372 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 48/369 (13%), Positives = 101/369 (27%), Gaps = 29/369 (7%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 AG + S + + + + + + + + + + N Sbjct: 219 AGMVDYAGDFSTVTPTYSCSGYSSAQCQVISENASAMCNGNATCLSNNRNYTRSWTSNGN 278 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH---LLNQRYNQKIVSFIPA 179 + T + + Y + + L + ++ Sbjct: 279 TYFGVFATSYYQSSNTYRSYGNTYYYTYIAWRQVVYRVNSSTLTLNSTNNGGDYYTYQAY 338 Query: 180 LLRIEMGERPIFLIELVVDLSGSMH------CAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + R + +G + N+ + T + Sbjct: 339 YNQPNNYSRYYGAVTYSTPTAGGYNSTSTTIIKDNTTITANSDLLGVGTSNWTGCVIDRT 398 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 ++ +Y T + + + R Y R T+ Sbjct: 399 QSYDVTSDA-PVAGTPATLYPAAKCATNTLLPVMALTQDIAAARTYAARMA---PAGNTN 454 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-------NNFKS 346 T ++ ++L+ T F +G K++I LTDG N NN + Sbjct: 455 VTIGVQWGMEVLSP-------TAPFSEGGAFTDKAVLKYMIVLTDGINTQNRWTTNNSQI 507 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N C AK I + T+ + + L+ C S Y+YN+ NAD L I Sbjct: 508 NARLALACTNAKNLGITVFTVRVEQGDS--TTLQNCASQTAYYYNLSNADQLPATMSKIM 565 Query: 407 QLMVHRKYS 415 + + + + Sbjct: 566 KSIRKVRLT 574 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 35/325 (10%), Positives = 84/325 (25%), Gaps = 25/325 (7%) Query: 38 GFLIYVLDWHYKKNSMESANNAAILAGASKMV-SNLSRLGDRFESISNHAKRALIDDAKR 96 G I + ++ M +A +A+ L A K+ + LS + D K Sbjct: 22 GSGIDLTSALNARSKMANALDASALKLAGKLSVAKLSDDEIQAGLEKMFTANLSRFDLKA 81 Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH--MANNRLDSSNNTIFYNMDVMTSY 154 + ++ + + V+ + + +LD + Sbjct: 82 SALSELEFEVDWTKGILDVWSD---VSVKTHFIGLGGLGPEKLDVGVTSRVSFASQALEL 138 Query: 155 DYRLQFIEHLLNQRYNQKIVS------FIPALLRIEMGERPIFLIELVVDLS-GSMHCAM 207 L + + K S +P + ++ ++ G+ + Sbjct: 139 ALVLDVTGSMDGDISSLKEASQLLFEALVPENAGRHDQRIRVSIVPYSQGVNLGAKAWKV 198 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI------DLLSHVKEDVYMGLIGYTT 261 + D ++ + A N + D + Sbjct: 199 TNRQSDSSNCVATRGGPNAFTDAYYNYRGARSNFFVAPGALDYFVIRRGSNVSWYPPRNN 258 Query: 262 RVE-KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E + + + + + V T + ++ L+ + + Sbjct: 259 CPESEILPLTNSRKTLLAAVDAL---EAQGGTAGQAGIAWGWKALSWTWHPFWPSG--SD 313 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFK 345 K S K + +TDG+ N Sbjct: 314 PAKSFSSQVGKAAVIMTDGDFNVHY 338 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 97/389 (24%), Gaps = 62/389 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + +E I+ + L+ G I ++ ++ +++ + A+LA AS Sbjct: 21 LHRLLRNEDGALIILSLQVFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAAS---- 76 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 L++ D + ++ +A + K S + + ++ + I Sbjct: 77 -LTQKRDPTLVVKDYLTKAGLGYIASDSSFTPKVEGSI-ALGWRRVSVEVDDDMPTIFGP 134 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-------------------------- 164 + + L ++ +T L + Sbjct: 135 LLGVSSLAATGDTTAMQAVGNVEISLVLDLSGSMTEYVKDNPSCTKNCTSSKTRFQYLQV 194 Query: 165 --------------LNQRYNQKIVSFIPALLRIEMGER-----PIFLIELVVDLSGSMHC 205 + VS +P + +G + V S +M Sbjct: 195 AAKSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGSEMQEGYTLSSDFSVTGSSFAMPQ 254 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAA--LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + D N+ I T+ + S T Sbjct: 255 CADFVANDYNTMVIDGTGPLTRTMYGSSYKYSDSLSALVSDGSTSNNPGQDWHNCMNTPQ 314 Query: 264 EKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + I S + T +D T K +L + Sbjct: 315 NRVIPLSSDPTFLAADKTGFIDKLTAGGWTSIDVGAKWGLALLDPSARDEVAKMTSVSSA 374 Query: 323 KIPSLPFQ--------KFIIFLTDGENNN 343 + P K ++ +TDG N Sbjct: 375 FRETKPRPINYDGDTMKVLVLMTDGANTT 403 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 61/206 (29%), Gaps = 4/206 (1%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D + K T ++A+ + + + Sbjct: 443 YDATRSSTPYYKYETGTWVARSAITQTTQEQQVQTATCTQYKNSWGQWKWNSCSVGTSDC 502 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPA--MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + +T A +Y L K + T G Sbjct: 503 TLISSTTSLKTYTCKKTTTVNVTTEAPLYDVSYDHLYKTKNWNLNTVAGLLGKPYGRSAG 562 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + + + + + T K+CD AK I I +++ +A G+ LLK C S Y+ Sbjct: 563 TQ-YELMANAVYDTSVKDARTKKLCDLAKSKGIYIFSVAADAPSGGKTLLKYCSSGTSYY 621 Query: 390 YNVVNADSLIHVFQNISQLMVHRKYS 415 Y V +L F +I+ + + + Sbjct: 622 YEVQ-GSNLSTAFASIAASISSLRLT 646 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 82/236 (34%), Gaps = 55/236 (23%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LVVDLSGSM + +++A+KN + F++ + Sbjct: 85 YRDLMLVVDLSGSMQQED----------ILQDGDYIDRLSAVKNVVTQFIEQ-------R 127 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +GL+ + + + V + + + LI + T + A + + Sbjct: 128 QGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSE 187 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 Q+ +I L+DG N + I+ + AK+ +KI TI I Sbjct: 188 A------------------PQRVVILLSDGSNTAGTLD--PIEAANIAKKYGVKIYTIGI 227 Query: 370 N------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A + + L K + ++ +A L ++Q I+Q Sbjct: 228 GAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQAIYQAINQ 283 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 82/236 (34%), Gaps = 55/236 (23%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LVVDLSGSM + +++++KN + F++ + Sbjct: 85 YRDLMLVVDLSGSMQQED----------ILQDGDYIDRLSSVKNVVTQFIEQ-------R 127 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +GL+ + + + V + + + LI + T + A + + Sbjct: 128 QGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSE 187 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 Q+ +I L+DG N + I+ + AK+ +KI TI I Sbjct: 188 A------------------PQRVVILLSDGSNTAGTLD--PIEAANIAKKYGVKIYTIGI 227 Query: 370 N------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A + + L K + ++ +A L ++Q I+Q Sbjct: 228 GAGEMEVKQFFMTRKVNTSADLDEKTLTKIATMTGGQYFRARDAQELQTIYQAINQ 283 >gi|27379052|ref|NP_770581.1| hypothetical protein blr3941 [Bradyrhizobium japonicum USDA 110] gi|27352202|dbj|BAC49206.1| blr3941 [Bradyrhizobium japonicum USDA 110] Length = 472 Score = 116 bits (289), Expect = 1e-23, Method: Composition-based stats. Identities = 56/486 (11%), Positives = 116/486 (23%), Gaps = 84/486 (17%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M L+ R + KAN +IIFAL ++ + L+G + K+ + +A +AA Sbjct: 1 MKLM--LRSAMLRFARDRKANVAIIFALMMVPTIFLLGMALDYTLALRKREQLNAAADAA 58 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +A + S + + S + + Sbjct: 59 AIAAVRPAMLTQSDTTVVKATAEAVFAAKANLPGLSAVP--TPTVTIVDSGLARTITVSY 116 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 S+ + + + ++ + + I + I Sbjct: 117 TAQSTNNFPGVLGKQTWQVAGSATARASSAPNMNFYLLMDDSPSMGIGATTTDISNLIKY 176 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + N A Q ++ + +A+ L Sbjct: 177 TAPAYQSAGGSQ----NCGFACHETNIAHDGGTKDNLAIARQRNITLRIDLVTSAVNQLL 232 Query: 240 DSIDLLSH--------------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +S L + + + T Q + Sbjct: 233 NSWSNCPQSGVSGGVMQCMSALNNTTYKAALYTFDLGLNALATLTTPTSAGTQVSNIALM 292 Query: 286 SLILKP---------TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + TD+ + A + L G Q+ + + Sbjct: 293 PVAYQNCVVVTTNCKTDNGTDIAGALKSLNDVMPT------PGLGSNASGDTPQEVVFLV 346 Query: 337 TDGEN-----------------NNFKSNVNTIKICDKAKENFIKIVTISINA-------- 371 TDG NN IC K IKI + Sbjct: 347 TDGVEDKIVSGASTCPNASLASNNRCQQPLDTTICTTIKNRGIKIAVLYTEYLQLKTPNI 406 Query: 372 --------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L++C S P ++ +V + N+ + Sbjct: 407 PVTNSWYMSWVDPYNEPTSLTGTIAQKLQSCAS-PGFYASVQTGGDISDALTNLFIKVAS 465 Query: 412 RKYSVI 417 S++ Sbjct: 466 STASLV 471 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 53/484 (10%), Positives = 127/484 (26%), Gaps = 99/484 (20%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I +E + F + F+ LI + + KK + A A LA + + S Sbjct: 17 FIKNENGTILMSFIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATLALTVENNNIPS 76 Query: 74 RLGDRFES--ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + IS+ A L ++ I S S T ++ Sbjct: 77 SEQEVKNNILISSFAHAYLPEETFSEPVITINSSASHMDYHADITMSYPAKFLNKAFNLI 136 Query: 132 MANN---RLDSSNNTIFYNMDVMTSYDYRLQFIEHLL----NQRYNQKIVSFIPALL--- 181 ++ + + T + + + + +S + + Sbjct: 137 SISDIKLDESAIAKKNTSITAIPTDVVFVTDYSGSMNRDFDGTDIDSTDISKVRIVALRR 196 Query: 182 ---------RIEMGERPIFLIELVVDLSGS-----------MHCAMNSDPEDVNSAPICQ 221 + + + + H + + + Sbjct: 197 IFKNLHNEIQQNENINLVGFVPFTWGTKRTIDNNNTTPTLLCHFPFVPKKHSPDGNYLTK 256 Query: 222 DKKR-----------------TKMAALKNALLLFLDSIDLLSHVKEDV----------YM 254 + K + ++ + ++ + Sbjct: 257 YNISGLKELPGMDKIVSLNMINDINYGKLSDKEYISISEEITDKANYTDREKALRYLSRI 316 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD---------STPAMKQAYQIL 305 + + IE + EK + + ++ + +D ++ A + Sbjct: 317 YYVKLENVISPVIEENIDYEKTIKSINGSDHTIDIPLSDVRVNDFCLRTSMAHTFDSNNM 376 Query: 306 TSDKKRSFFTNFFRQGV-------------KIPSLPFQKFIIFLTDGENNNFKSNVNTI- 351 + + G K + +K ++ L+DG++N+ + Sbjct: 377 DDSQFSKILNSRANGGTLISSGILSGNNLFKETNNNNRKIMVILSDGDDNDNTHAGDNRI 436 Query: 352 ----------------KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +C++ K+N I++V I+I +P+ K CV Y NA Sbjct: 437 NKDAPYLNITKKLIDNGMCERIKDNDIRMVFIAIGYTPDENIDWKKCV-GEGNFYLASNA 495 Query: 396 DSLI 399 L Sbjct: 496 QELE 499 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 43/236 (18%), Positives = 83/236 (35%), Gaps = 55/236 (23%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LVVDLSGSM A + +++A+KN + F++ + Sbjct: 85 YRDLMLVVDLSGSMQQAD----------ILQDGDYIDRLSAVKNVVTQFIEQ-------R 127 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +GL+ + + + V + + + LI + T + A + + Sbjct: 128 QGDRLGLVLFADHAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSE 187 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 Q+ +I L+DG N + I+ + AK+ +KI TI I Sbjct: 188 A------------------PQRVVILLSDGSNTAGTLD--PIEAANIAKKYGVKIYTIGI 227 Query: 370 N------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A + + L K + ++ +A L ++Q I+Q Sbjct: 228 GAGEMEVKQFFMTRKVNTSADLDEKTLTKVATMTGGQYFRARDAQELQTIYQAINQ 283 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 22/295 (7%), Positives = 68/295 (23%), Gaps = 14/295 (4%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + N + A ++ L + G + + + ++ ++ A ++ +LA + + Sbjct: 1 MRRLASDRSGNTLALIAAGLLPLLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGT 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 ++ L D + + + + + Y+ V+ MT Sbjct: 61 EIATLTDIPTDAGTRGQEFFNSNFQDGNYGTQNRTFNMVLENDYSVSGTATVDVPTSVMT 120 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL-NQRYNQKIVSFIPALLRIEMGERP 189 ++ L + + ++ Sbjct: 121 VFGFTKIPVKVECQARISFSDVDVMMVLDVTGSMKHTNSGDTLSKIDSLKATVRNFYDQM 180 Query: 190 -----------IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + +++ T A K Sbjct: 181 EGAKSAGTRIRYGFVPYASNVNVGHLLKDEWVVNSWAYQSRAISGTTTVEAGTKTRENWA 240 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 S + ++E Y + + S + + + T Sbjct: 241 YKSGSRSAWIEESTYAATMNPGSGDTN--PFSCDQSLPKSTYKWHSEKNGIPSTS 293 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 51/192 (26%), Gaps = 28/192 (14%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 KE + G ++ + T ++ Y+ T M ++ Sbjct: 437 DTKKEFIQTGNWWFSGCPAPAQKLKAMTSGELDSYLDSLT---PHGATYHDGGMIWGGRL 493 Query: 305 LTSDKKRSFFTNFFR---------------------QGVKIPSLPFQKFIIFLTDGENNN 343 L+ + + P + T Sbjct: 494 LSQYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLA 553 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + C++ K+ + ++ + N + +KTC S ++ NA L F Sbjct: 554 QTVEERFLFACNEVKKLGATVWVVAFGTAANDK--MKTCAGS-GRYFEAANASQLNDAFS 610 Query: 404 NISQLMVHRKYS 415 I++ + + Sbjct: 611 TIAKSTGDLRIA 622 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 20/202 (9%), Positives = 54/202 (26%), Gaps = 18/202 (8%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + NF+++F + + ++ G + + +K+ +++A ++A L A Sbjct: 2 NHTSSFLKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARAG 61 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + ++ K A + N K ++S + Sbjct: 62 DMSETQA----------MKLAKTTITTNYGINVAKVAVSMVDGDATVKASMDQALV---F 108 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-----ALLRI 183 M S L +L + + I L + Sbjct: 109 GGFMGRKNAAVSAEATATYAYTKYEIALVLDTTGSMLGGKLTSLQNAVIGLVDGMEALGL 168 Query: 184 EMGERPIFLIELVVDLSGSMHC 205 + ++ ++ Sbjct: 169 NKEQLKFAVVPYAGFVNVGPEY 190 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 69/208 (33%), Gaps = 45/208 (21%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + G+ +E + + K+R V T+ + + Sbjct: 343 SPSNFYSNKKDPKGPGFGCEMEPLVPLTTDFSKIRTTVKAL---EANGSTNMLEGVMWGW 399 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-------------- 348 ++L+ F QG +K +IFLTDG+N+ N Sbjct: 400 RVLSDR-------EPFAQGAPKSDASVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVD 452 Query: 349 --------------------NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSPE 387 T C+ AKE+ + I TI + A ++L+ C +S Sbjct: 453 GRLDGMTAANIGQTNNALDKKTKAACENAKEDGVTIYTIRLEEADVGTGKMLEECATSSA 512 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++++ + L +F I + +V + + Sbjct: 513 HYFDAPSRQQLTPIFDAIKKGVVKLRLT 540 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 46/418 (11%), Positives = 113/418 (27%), Gaps = 39/418 (9%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ + SI+FA+ + + K +E A+ AA LA +++ + Sbjct: 3 HKQQGHASILFAMLIPLLFGVFALGSDGARAIQSKARIEDASEAAALALSARDDEHAMSD 62 Query: 76 GDRFESISNHAKRALIDDAKRFI-------KNHIKESLSGY---SAVFYNTEIQNIVNSS 125 ++ + + ++D+ I + + E G A + ++ + + Sbjct: 63 ENKTIVQAYIEEYLPVEDSDVTILGIERLECDDMPECRQGSGRGEARYTQYSVRVSADQT 122 Query: 126 RISMTHMANNRLDSSNNTIF-----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + + F + + Sbjct: 123 PWFGGGSPEVEVPEVWRSQGGAKARKYQSNAVDIVFAADFSGSMASPWTG---------- 172 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----KNALL 236 I ++E V + V + R ++ A+ K LL Sbjct: 173 GSQPKYRDLIDILEKVTVELAPYNFDSQRYNSSVGVSGFNALTYRNELCAVNNLEKQGLL 232 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRV-EKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D ++ + E + ++ + + V R T S Sbjct: 233 GVVDYSRTVARMWETKSCRPPSISNSAGFHDVPLTDDYSTFNRTVDRFT---ARGGTASY 289 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L G + G + S + + Sbjct: 290 QAVMSGARLLDHGSNNRQILIVISDG---QDNNLNHTNGLVNAGMCRDIISRLEGRPSAN 346 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-IHVFQNISQLMVHR 412 ++ ++ I + P+ + CV + + N D L + I + + H Sbjct: 347 G-RDVSARLAFIGFDFEPSMNPAMVRCV-GEDNVFKAENTDELFEQIMFLIREEVGHL 402 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 32/364 (8%), Positives = 84/364 (23%), Gaps = 14/364 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + ++ N + L+++ + L+G + + + +N A +A LAG Sbjct: 3 LRMLLGRLWSDQRGNTLALMTLALIPLVALMGSGLDMTRAYVAQNRFRQACDAGSLAGRR 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + R E+ + +L+ +I + Sbjct: 63 MLAGLTLPQAARDEATKYFMFDFPQGYLQS-----APYTLTMSVPTAGTLQISSQTTVPT 117 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 M + L S V T + + + Sbjct: 118 TLMGLFGFDTLPISTTCSATQDFVNTDIMFVFDLSGSMNCAPGVTGYCGDVEQSGSRMGA 177 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + A N+ ++ KN ++ Sbjct: 178 LRSAATSFYDTLETAQSQLAANNLRLRYGFVNYNSTVNVGRILYEKN--PDWMVQSWSYQ 235 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK--PTDSTPAMKQ-AYQ 303 D + + N ++ ++ + T Sbjct: 236 SRTPDWIDATAYFNNKSACNSAYTYDSQASQADANYTGSRTPDWIDATAYFNGKSACNVA 295 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKE 359 + N+ + +I ++G + + ++ + K Sbjct: 296 YTYDSQVSQANPNYTGIMGGWWLTSNRCSVIVQSNGNPDGYTYGRRSVDVRPFLASNLKA 355 Query: 360 NFIK 363 ++ Sbjct: 356 TNVQ 359 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 44/405 (10%), Positives = 95/405 (23%), Gaps = 52/405 (12%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIV 122 + + + A ++ +G SA T + Sbjct: 244 ATAYFNNKSACNSAYTYDSQASQADANYTGSRTPDWIDATAYFNGKSACNVAYTYDSQVS 303 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY----RLQFIEHLLNQRYNQKIVSFIP 178 ++ M L S+ ++ + + L + + S P Sbjct: 304 QANPNYTGIMGGWWLTSNRCSVIVQSNGNPDGYTYGRRSVDVRPFLASNLKATNVQSPTP 363 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-- 236 P + + + + Sbjct: 364 IWQITGTN-DPSDDRPYEFKSVWNGCIEERKTNSAAINGGSSTTAPSDAYDLDVDLVPYN 422 Query: 237 ------LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + + +G + +V+ + Sbjct: 423 DDTRWRPMWNDVSYYPDWSWSYGVGRQPVAYCPTEAKRLQNYHNNRSGFVSYLNGLVARG 482 Query: 291 PTDSTPAMKQAYQILT------SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T M + L+ S + N KI +K++IF+TDG+ + Sbjct: 483 GTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGFSVKKYMIFMTDGDMSPT 542 Query: 345 KSN-----------------------------VNTIKICDKAKENFIKIVTISINASPNG 375 S+ C+ AK I I I+ + + Sbjct: 543 WSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAKAKGIDIWVIAFSTTLTA 602 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH-RKYSVILK 419 + C S PE + + +LI F+ I + ++++IL Sbjct: 603 D--MTNCASKPEQAAGLSSNAALIAKFKEIGSKIATVLEFALILP 645 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 78/234 (33%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ++ A+K + F++ ++ Sbjct: 84 DMMLVVDLSGSMSEEDMKTDSGFVD----------RLTAVKRVVSDFIEK-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V++ + R + L+ + T + A + Sbjct: 127 DRLGLVLFGDHAYLQTPLTFDRNTVQEQLNRTVLGLVGQRTAIGEGLGLATKTFIESNA- 185 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + I+ AK+N KI T+ I A Sbjct: 186 -----------------PQRTIILLSDGANTAGVLD--PIEAAQLAKDNNAKIYTVGIGA 226 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + ++ NAD L ++Q I + Sbjct: 227 GEMQVRGFFGNQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTIDK 280 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 80/234 (34%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K+ L F++ ++ Sbjct: 84 DMMLAVDLSGSMAIKDM---------QTQSGQSIDRLTAIKHVLSNFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V Q + R + L+ + T + A + Sbjct: 128 DRLGLVLFGDHAYLQTPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N + ++ AKE+ + I T+ I A Sbjct: 182 ------------IKSKAPQRVIILLSDGANTAGVID--PLEAAKLAKESGVTIYTVGIGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ N L ++Q I+Q Sbjct: 228 DEMLQRSIFGVQKVNPSQDLDEKTLTKIAQMTGGKYFRARNPQELDKIYQIINQ 281 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 39/451 (8%), Positives = 112/451 (24%), Gaps = 71/451 (15%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNL 72 I + I+F + ++ + F + + + A A LA + K + Sbjct: 4 IRKNRGVAGILFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDD 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + R D + + + + ++ +V++ + + Sbjct: 64 NVKYARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQASGEVAPFSDFVVSAKAEHKSWI 123 Query: 133 ANNRLDS-------SNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-NQKIVSFIPALLRIE 184 A + + + + F + + + + + Sbjct: 124 AYEEAELKPEFEVAGKSVTRKYLPQPVDVYFIGDFSGSMTGHWKGGKTKLDVVKQTIERV 183 Query: 185 MGERPIFLI------------ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA-- 230 + + F L V SG+++ + T Sbjct: 184 VEDIADFNTEEKSRVALLGYNPLHVKQSGTVYLNAYGYRRSWPKKIAYDYARGTSAQTVA 243 Query: 231 -------LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + ++ + L V + + + + Sbjct: 244 KMFDPPSVYSRVQEYVRGMSRLDVENLVVNNDRFVDYYKFYDI-PLTEDYTHFKARLASA 302 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T S + A Q + + ++ ++ I L+DG +N+ Sbjct: 303 W-LGAEGGTSSWNGIIAAAQ----------------EANRATNINPEQVFIVLSDGADND 345 Query: 344 FKSNVNTIK--ICDKAK------------------ENFIKIVTISINASPNGQRLLKTCV 383 + +C K + + + + I ++ C Sbjct: 346 TNYLQRLVNQGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGVDYRVKESDGFGDC- 404 Query: 384 SSPEYHYNVVNADSLIH-VFQNISQLMVHRK 413 + Y+ + D + + I++ K Sbjct: 405 FGRKNIYHAKDGDDVYKYILNLINEETGRLK 435 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 113 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 78/232 (33%), Gaps = 54/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ++ A+K + F+D ++ Sbjct: 84 DMMLVVDLSGSMAEEDMKTSN---------GDFVDRLTAVKQVVSDFIDQ-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ VR+ + R + +L+ + T + A + Sbjct: 128 DRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLNLVGQRTAIGEGLGLATKTFIESNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + ++ AK+N KI T+ I A Sbjct: 187 -----------------PQRTIILLSDGANTAGV--LEPLEAAQLAKDNHAKIYTVGIGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L K + ++ NAD L ++Q I Sbjct: 228 GEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 113 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 80/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM +D + ++ A+K L F++ Sbjct: 95 KYRDMMLVVDLSGSMQKEDMNDN----------GEYIDRLTAVKRVLSDFVEK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RQGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAAEIAKKYNATIYTVG 237 Query: 369 INASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A L ++ I+Q Sbjct: 238 VGAGEMMVKDFFMTRKVNTASELDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 294 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 79/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM +D + ++ +K L F++ Sbjct: 114 KYRDMMLVVDLSGSMQKEDMNDN----------GEYIDRLTTVKRVLSDFVEK------- 156 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 157 RQGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 213 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 214 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAAEIAKKYNATIYTVG 256 Query: 369 INA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A L ++ I+Q Sbjct: 257 VGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 313 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 15/207 (7%), Positives = 53/207 (25%), Gaps = 17/207 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R R + NF++I ALS + G + + ++ +++ Sbjct: 4 IRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQN-------- 55 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + + R + IS+ R++ + +++ + ++ Sbjct: 56 ---ALDAAVLAVAQRGDKISDAEARSIAASFLTGNLSSAYKNM-AVERNGTSVKLSAEAT 111 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + ++ L + + + + + Sbjct: 112 MPLSFGGLIGRKEATVGASSTADMAFAYYEIALVLDTTGSMRGGKLQAMKEAVNGLIDDL 171 Query: 184 EMGERP-----IFLIELVVDLSGSMHC 205 L+ ++ Sbjct: 172 SSRVTDKERLKFALVPFASFVNVGPQF 198 Score = 88.0 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 69/239 (28%), Gaps = 62/239 (25%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--------VEKNIEPSWGTEKVRQY 279 + A L ++ L E Y+ S +++ Sbjct: 321 LDDAAQAALTGATNLIGLDIRPERWRKVEHEYSDGRGPAYGCLSRPITPLSNDYAALKRE 380 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V+R T+ + ++L+ + + +K +I LTDG Sbjct: 381 VSRFT---ADGNTNIMEGVAWGMRVLSPREPFTEGKEPASD--------VEKIMIVLTDG 429 Query: 340 ENNNFKSNVN-----------------------------------TIKICDKAK------ 358 NN SN T+ C+ AK Sbjct: 430 ANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRTLAACENAKREYTPS 489 Query: 359 -ENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 E+ + I TI + LL+ C + P Y+++ + L +F+ I + + S Sbjct: 490 KEDDVTIYTIRLEEPDVATGTLLQECATGPGYYFDSPSRTQLNAIFKEIRDGITKLRLS 548 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 62/526 (11%), Positives = 127/526 (24%), Gaps = 129/526 (24%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K SE NF+++FAL+ + L +GF V K +++++ +AA L+ + Sbjct: 3 INKFWRSESGNFALLFALAAPAILAAVGFAADVSSVMRAKVNLQNSLDAATLSSSHLSDD 62 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +R K + ++ + Sbjct: 63 EAARRLAFD---GYFQANVANHPELTNAKLTLSVDKGFNYVKTKAIASADV----NLYFA 115 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-----YNQKIVSFIPALLRIEM 185 + + + L + + K++ + Sbjct: 116 FLFGDNQHIEVDAGGVEATNNLEVVLVLDNTGSMAGAKIKALRDATKVLLDNLDGAKSPD 175 Query: 186 GERPIFLIELVVDLS-----GSMHCAMNSDPEDVNSAP---ICQDKKRTKMAALK----- 232 + ++ V ++ + N A + K+ M + Sbjct: 176 RKVTAAIVPFVTAVNINGDKFDPSWIDMAGKSPNNGANFPLLPDGKRVNHMDLFRDLAQG 235 Query: 233 -----------------------------NALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 +A LF+ G +V Sbjct: 236 TGWQGTGWKGCVEARPGSAAISDVAPDQADANTLFVPYFAPDDPGDATGPSASYGNDAKV 295 Query: 264 EKNIEPSWGTEKVRQY--------------VTRDMDSLIL--------------KPTDST 295 N S + + D T Sbjct: 296 YNNSYLSDDVQDGTDTKGKDKKIAKYENPKAKKINDKYAPLTVGPNRACPTPVVPLTADL 355 Query: 296 PAMKQAYQILTS--------------DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 ++ A + + + G + K ++ LTDGEN Sbjct: 356 DKLRTAAAQMQEWNGSGTNVSEGLSWGMRVLSPAPPYTDGAPWKTPNTSKIVVLLTDGEN 415 Query: 342 NNF--------------------------------KSNVNTIKICDKAKENFIKIVTISI 369 + + T+ +CDK K ++I TI++ Sbjct: 416 VVYGASAEPEKSDYTSYGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQQVQIYTITL 475 Query: 370 NASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + R L C ++P +Y V + L +VFQ I+ + Sbjct: 476 QSDTAANRTLYGKCATNPADYYAVNDPSKLPNVFQTIAGKFTTLQL 521 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 44/232 (18%), Positives = 77/232 (33%), Gaps = 54/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM ++ A+K + F+D ++ Sbjct: 84 DMMLVVDLSGSMAEEDMKTSN---------GDFVDRLTAVKQVVSDFIDQ-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ VR+ + R + L+ + T + A + Sbjct: 128 DRLGLVLFGDHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ II L+DG N + ++ AK+N KI T+ I A Sbjct: 187 -----------------PQRTIILLSDGANTAGV--LEPLEAAQLAKDNHAKIYTVGIGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L K + ++ NAD L ++Q I Sbjct: 228 GEMQVRGFFGKQTVNTARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 80/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM +D + ++ A+K L F++ Sbjct: 95 KYRDMMLVVDLSGSMQKEDMNDN----------GEYIDRLTAVKRVLSDFVEK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RQGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAAEIAKKYNATIYTVG 237 Query: 369 INA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A L ++ I+Q Sbjct: 238 VGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 294 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 56/411 (13%), Positives = 128/411 (31%), Gaps = 57/411 (13%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + +I+ A +++ L L GF + + + + +A +AA + ++V S Sbjct: 18 RRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQSTTSG 77 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 R + ++ + A N I+ S + + + N Sbjct: 78 RAAARDIASRFEVGGKALSLNDNDIQFGRSTRQSNGSYSFA----------INGTPTNAA 127 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + L F + Y+ ++ + + R + I +V+ Sbjct: 128 RVFGRKTKTSAAGPVE----LPFGGFVGAPEYSTELNA---------VAMRLDYDIVIVL 174 Query: 198 DLSGSMHCAMNSDPEDVN----------SAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 D SGSM ++ + + D ++ A L ++ FL ++ Sbjct: 175 DRSGSMGWDLSGVEFEYPEAVRQRPLVENYFSPPDPTGSRWAILSASVNDFLTILNQRQV 234 Query: 248 VKE---DVYMGLIGYTTRV----EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 Y G + + + + +T ++ TD + Sbjct: 235 AARVGLVTYAGDYTFGKYSSVKLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITA 294 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +LT+ + T Q II +DG N V+ ++ Sbjct: 295 AQTMLTTSSQARLKTG-------------QPIIIVFSDGMFNQGTEPVSL--AASAYSQS 339 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 I +++ A+ G+ + + ++ + A L F++I+ + Sbjct: 340 STIIHSVTFGATAQGRATMNSVTATAGKGLSLHANTAAELAESFRSIANAI 390 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLSEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 G +K I LTDG + S V I AK +++ I++ Sbjct: 208 -----DKSSGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMETIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLTEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +G +K I LTDG + S V I AK +++ I++ Sbjct: 208 -----DKSRGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLTEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +G +K I LTDG + S V I AK +++ I++ Sbjct: 208 -----DKSRGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 51/166 (30%), Gaps = 8/166 (4%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + F++ E I + ++ L G + ++ + + ++ + + A LA A Sbjct: 8 KGIFQRFRNDEDGALIIFSLMMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA-- 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKR-FIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +L ++ + + ++ +A + + + SA + R Sbjct: 66 ---DLDQIMPPADVVRDYLDKAGMLHFLQGEPTVSQGINYRVVSAQASAPMALFFYDLPR 122 Query: 127 ISMTHM--ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 I + ++ S + L + + N Sbjct: 123 IFTSPFSPGMTAINVSGASTAEERVTDVEVSLVLDVSSSMNSLDSN 168 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLTEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +G +K I LTDG + S V I AK +++ I++ Sbjct: 208 -----DKSRGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 78/235 (33%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLSEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ + Sbjct: 151 DRLGLVLFGDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDQS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +G +K I LTDG + S V I AK +++ I++ Sbjct: 211 --------RGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 33/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +K +++ A K L F+ ++ Sbjct: 107 DVMVVVDLSGSMAEPDF---------TSRTGEKISRLDAAKEVLTEFVQ-------SRKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLVLFGDAAFVQTPFTVDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +G +K I LTDG + S V I AK +++ I++ Sbjct: 208 -----DKSRGALEQDQNREKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRVHVIAMGD 260 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 261 PETIGETALDMDTIHRIAKESGGEAFEALNRDELSAAYDEIGKLEPQLYESTTYR 315 >gi|326797316|ref|YP_004315136.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] gi|326548080|gb|ADZ93300.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] Length = 528 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 44/515 (8%), Positives = 108/515 (20%), Gaps = 113/515 (21%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA-- 65 R + S+ L ++ L+ F + ++ A +AA LA Sbjct: 14 RNALNTFLRSDSGAALPFIVLILVGALVGFSFALDTTRMVNTAGQLKRATDAAALAIGQI 73 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +N + +D + ++ + + T + ++ Sbjct: 74 QLRNNNDDETDLDSIAQGYVLNNLGMDSGLIDQIDTTSIFVTQGTNDGHPTFTVTVTLNT 133 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI------------------------ 161 + + + S + + T L Sbjct: 134 QSDLLNAQTEDQVIS--STVEVVSTPTEVALLLPNTLTEDEPELVALRKLGKSFARNLLG 191 Query: 162 -EHLLNQRYNQKIVSFIPALLRIEMGERPIFL-----------------IELVVDLSGSM 203 + + + +S +P + + + S+ Sbjct: 192 EDTDASNSAQKVWLSLVPFSQAVNVYDADDPERISRWAAAGALNPPELRSLFKTGKVRSL 251 Query: 204 HCAMNSD----------------PEDVNSAPICQDKKRTKMAALKNALLLFLDSID---- 243 D + + P Q + +N Sbjct: 252 ADPRFPDRVAKLLCMYRGLGAGENFNWDQQPDSQFGVYYRHDLPQNGSPGATPISWVGPN 311 Query: 244 ---LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S V EDV + + + + + + + + AM Sbjct: 312 PSLWPSSVAEDVRWIVADKGCPNAPLLPLTNDLDAIDARLDEMSTRFNV---NYAIAMGW 368 Query: 301 AYQILTSDKKRSFFTNFFRQGVK-----------------IPSLPFQKFIIFLTDGENNN 343 A L+ + + S + F D ++ Sbjct: 369 AGHALSPNMRGSSGWGDSELPLDFSKSKSNVKVMVMLANTTGDWFDTDAYNFNRDQALDS 428 Query: 344 FKS-------NVNTIKICDKAKENFIKIVTISI--NASPN---------GQRLLKTCVSS 385 +C ++ IK I + + L+ C Sbjct: 429 TGPGSAKAFATQRFHDVCRSFRDKNIKFFFIGVRPGDPADFGRTLFTDIAGPGLRECAGG 488 Query: 386 PEYHYNV------VNADSLIHVFQNISQLMVHRKY 414 Y + + + I++ + Y Sbjct: 489 GGGLYFADASSFVEGKSQIDSLLEEIAEEIRQNYY 523 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 84/242 (34%), Gaps = 31/242 (12%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + E+ + +VVDLSGSM Q KK +++ A K L F + Sbjct: 95 VREQLGRDVMVVVDLSGSMAEQDF---------TSKQGKKISRLQATKEVLADFAKT--- 142 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GLI + + + + + + ++ + T A+ A ++ Sbjct: 143 ----RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKV 198 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 K + +K +I LTDG + S V I AK ++I Sbjct: 199 FEQSGK------QMSAEQAQNDIEREKVVIVLTDGNDTG--SFVEPIDAAKVAKAKGVRI 250 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I++ + + + + S + +N D L ++ I QL S Sbjct: 251 HVIAMGDPQTVGEVALDMETIKRIASESGGEAFEALNRDELSTAYEQIGQLEPQLYESTT 310 Query: 418 LK 419 + Sbjct: 311 YR 312 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 25/274 (9%), Positives = 74/274 (27%), Gaps = 29/274 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++ N ++IF LS + +L++G + K++++ A ++A LA A +V+ + Sbjct: 16 FSSCDRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATTT 75 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ A + + SA + + M Sbjct: 76 NQQAQSQAQVYLLTNVRNAVAV--------VTKAEISADRLTLCLDSTAQIPTTIMKIAH 127 Query: 134 NNRLDSSNNTIFYN---MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE--- 187 + + T M+ L + + ++ + + Sbjct: 128 IETITTKATTCAQTPGGMNGTYEIALVLDNSGSMSKSAGGKSKIAALRDAATSFVNNIYS 187 Query: 188 ----RPIFLIELVVDL---------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + ++ + + ++ + + A Sbjct: 188 KTTDVKMSIVPFSAGVRVLDPSVSSNRTLSWIDVNGNNSQHWLVFGDGSLVAATAKAAAK 247 Query: 235 LLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKN 266 F D+ + +K + + Sbjct: 248 TAGFTSRFDIFTKLKSLNSSWDWGGCFEGPKYPL 281 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 78/306 (25%), Gaps = 84/306 (27%) Query: 179 ALLRIEMGERPIFLIELVVDLSGS----MHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + ++ + + S + ++ ++ + C L A Sbjct: 284 SDTAVDTSNAETLFVPFLAPDEPSTKDKYNNSLYTNNYLAETGGSCSGTVTGDWKLLTRA 343 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 T+ + ++ + + ++ + T+ Sbjct: 344 CK-----YGKPKKDGSGAGPNSSCPTSSSQTVLQLTATQSTITTKISGLTE---NGYTNL 395 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT----------------- 337 ++ ++ T F G + K I+F+T Sbjct: 396 HEGFMWGWRTISP-------TGPFAAGRAYATKDNHKIIVFMTDGFNNWQSATSTVTGSA 448 Query: 338 ------------------DGENNNFKS-------------------------NVNTIKIC 354 DG N + T++ C Sbjct: 449 YQAAGYYSYNGTANQRFPDGTATNGNGVNYQTTLEAAAGSSTDYHDTSRNMQDELTLEAC 508 Query: 355 DKAKENFIKIVTISINASPN-----GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 AK ++I TI + + G ++++ C + +++ + DSL F +I + Sbjct: 509 TNAKTAGVEIYTIGFSVPVDPIDAQGLKMMQDCATDANHYFAATDVDSLNAAFASIGSGV 568 Query: 410 VHRKYS 415 + S Sbjct: 569 GKLRLS 574 >gi|238897540|ref|YP_002923219.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229465297|gb|ACQ67071.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 450 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 46/445 (10%), Positives = 111/445 (24%), Gaps = 49/445 (11%) Query: 1 MHLLSRFRFY---------FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKN 51 + ++ + R + K ++K + F + FL+LI +K Sbjct: 2 LSIIKKLRLFYLLNKGAQPVKYFFKNQKGAIFVFFLIMFSVFLILIYATFEYCRLINEKI 61 Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 E L ++ + + + N E +S Sbjct: 62 KTEQVLEQTSLILTAE-DNGDTSGTPGDRNRFLALSHIESFIPSTTATNESIEIISPNQN 120 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI-----EHLLN 166 + N+ + + + + F L Sbjct: 121 DLKRKYTITLRNAYSSIFNLFSTMSVSGKAAAQKAYQEEPMDVVFVFSFYRHTIENPLRM 180 Query: 167 QRYNQKIVSFIPALLRIEMGE-------RPIFLIELVVDLS------------GSMHCAM 207 ++ ++ + +L ++ + + + + Sbjct: 181 SWETRQEINELKEVLIEKINTILEMNPLNKVGFVPFSWGVKEGRYCLQPFVFLFKPNYDN 240 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALL---LFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + AP + + +K DS + ++ GL Y Sbjct: 241 MNFGPAQFKAPRQTEVISKSIDFVKTVNNIPFAMTDSNVISLASLDNRLCGLSLYRDP-- 298 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR----SFFTNFFRQ 320 I + ++ D LI + + ++L+ ++ Sbjct: 299 --IPLTREKNEI---FMIIKDELIPGYNLISSGILSGVKLLSRGTNSKKLLIIISDTEEA 353 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + ++ + + N ++ +C+K K + IK+ IS S K Sbjct: 354 PPMAINDDNKRMLRIRFNNNTLNISESLIKAGMCEKIKNSGIKMYFISTGGSIESHNYWK 413 Query: 381 -TCVSSPEYHYNVVNADSLIHVFQN 404 C+ H D L + Q Sbjct: 414 TQCIGEKNMHNLNNVDDELKQILQA 438 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 81/234 (34%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSMSIPDM---------VTKNGQSIDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V Q + R + LI + T + A + Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRNTVEQQLDRTVLGLIGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N + + ++ AKE+ +KI T+ + A Sbjct: 182 ------------INSKAPQRVIILLSDGANTSGVID--PLEAAKLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ N L ++ I++ Sbjct: 228 DQMVQKGFFGDRLVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 92/371 (24%), Gaps = 55/371 (14%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + SE+ + + + + L+L GF + V+ + K+ ++ A + + + L Sbjct: 28 RFARSEEGSILVFGLMLFILMLMLGGFAVDVMSFEAKRTDLQQAVDR-----CALTAAAL 82 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 ++ D E + + +A D I + + + + Sbjct: 83 AQTRDPEEVVEDCMLKAGKADYVTLIDHDEGLNYREVVVTAQQPT-------KPLFAHML 135 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL------LNQRYNQKIVSFIPALLRIEMG 186 + L + T L + + + + + + Sbjct: 136 GIDSLTAPAATKAEQKVTNVEIVMVLDVSGSMVRDSYSRPTDKLKNLKAAAKEFVDTMLA 195 Query: 187 ER-----PIFLIELVVDLSGSMHCA-----MNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + I ++ ++ +++ + + Sbjct: 196 KDLNHRISIAIVPYNGQVNLGKSLRQKFNIYDNNGVTYMDCVDMPASVYASTGLSRTLKM 255 Query: 237 LFLDSIDLLSHVKEDVYMGLI---GYTTRVEKNIEPSWG---------------TEKVRQ 278 + D S + G Y+ +P+ G T + Sbjct: 256 PMTANADTFSAAFSNARTGGTPPDTYSGPKTSEAQPTPGNRWCLPTAANVVRLPTGSISS 315 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPF 329 T MK +L + + F + Sbjct: 316 LQASIDGLEGNGATSINAGMKVGLSLLDPSARPMFSEFVGSGEIQSYFHGRPFDYTDEEV 375 Query: 330 QKFIIFLTDGE 340 K +I +TDGE Sbjct: 376 MKVMIVMTDGE 386 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 50/173 (28%), Gaps = 12/173 (6%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + Y+ L + + + + + + + T + Sbjct: 402 SPIYRSSDGEYLSLKLSNGKFYWPYDNTTTNSAAKGKSPTQLTWQQVWASYRTSYIAWQL 461 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + S + + + +C+ AK N + Sbjct: 462 YSRRPGFRSSDRLAAYTAQMNAFRTLTP------------ISTMDAQLQALCNLAKSNNV 509 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I+ A NG+ ++ C +S HY + + F+ I+ + + + S Sbjct: 510 TIFGIAFEAPANGKTQIQNCSTSRSSHYFDASGLEIQTAFRAIASQISYLRLS 562 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 40/443 (9%), Positives = 112/443 (25%), Gaps = 69/443 (15%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRF 79 I+F + ++L+ F + + + A A LA + K + + R Sbjct: 2 AGILFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDENVKYARQ 61 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 D + + + + + +V++ + +A ++D Sbjct: 62 LVDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAPFSDFVVSAKAEHKSWIAYEKVDL 121 Query: 140 -------SNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-NQKIVSFIPALLRIEMGERPIF 191 + + + F + + + + + + + F Sbjct: 122 KPEFEVAGKSVTRKYLPQPVDVYFIGDFSGSMNGHWKGGKTKLDVVKQTIERVVEDIENF 181 Query: 192 LIELVVDLS----GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 E ++ +H + K+ + + + Sbjct: 182 NTEEKSRVALLGYNPLHVKQTGTVYLNSYGYRRSWPKKVAYDYARGTTAQTVAKMFDPPS 241 Query: 248 VKEDVYMGLIGYT----------------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 V V + G + +I + + + Sbjct: 242 VYSRVQEYVRGMSRHDVKNLVVNNDRFVDYYKFYDIPLTEDYTNFKSRLANAS-LGAEGG 300 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T S + A Q + + ++ ++ I L+DG +N+ + Sbjct: 301 TSSWNGIIAAAQ----------------EANRATNINPEQVFIVLSDGADNDTNYLQRLV 344 Query: 352 K--ICDKAK------------------ENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 +C K + + + + I ++ C + Y+ Sbjct: 345 NQGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGVDYRVKESDGFGDC-FGRKNIYH 403 Query: 392 VVNADSLIH-VFQNISQLMVHRK 413 + D + + I++ K Sbjct: 404 AKDGDDVYKYILNLINEETGRLK 426 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 16/228 (7%), Positives = 54/228 (23%), Gaps = 27/228 (11%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + K+ ++ N +++F L++ +G + + + K +++ + + A LA + Sbjct: 11 FRKRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYS 70 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 +N + + A D ++ I Sbjct: 71 NNQQDSHLSKVAEGYFFENAGAD---ARANTDFSYDGIFNEDGSTVLQVSAARRHPTIFG 127 Query: 130 THMANNR--------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLN---QRYNQKIVSFIP 178 ++ + + + L + ++ + I Sbjct: 128 DLLSFVTAGELDWRAFPLAARSQIVVQNQSIELVMVLDNSGSMTGRPKSGGGKRKIDTIK 187 Query: 179 ALLRIEMG-------------ERPIFLIELVVDLSGSMHCAMNSDPED 213 G ++ ++ + Sbjct: 188 EAAIGLTGQFLKGAASSTLKLPVQFGVVPFAAAVNIGPDNHDAKWMDT 235 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 31/275 (11%), Positives = 80/275 (29%), Gaps = 76/275 (27%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 D ++ + + +R+ A ++ + S ++ + G Sbjct: 329 PDEYDGSKGDNDYLEDEGRRSMPAETAMSVQAKVAKYFDGSDLQRGNHPG-PNRGCLSTP 387 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + T+ + +++L++ F QG Sbjct: 388 VTPLTDNQATINAAINAMD---ADGETNIPEGIAWGWRLLSAR-------EPFTQGRAND 437 Query: 326 SLPFQKFIIFLTDGENNNFKS--------------------------------------- 346 + K ++ +TDG+NN Sbjct: 438 AKDNLKVLVLMTDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTSKR 497 Query: 347 ----------NVNTIKICDKAKENF--------IKIVTISIN--ASPNGQRLLKTCVS-- 384 N IC K++ I I TI+ + + ++L++ C S Sbjct: 498 ANRSSFVSAMNEKVAAICQNIKDDGRKATGEDGIVIFTIAFDLNDGSSVKKLMEQCASYG 557 Query: 385 ----SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + +Y+ ++ L+ F +I++ + + + Sbjct: 558 ITDPTKKLYYDAKSSSDLMAAFDSITEQVSSLRIA 592 >gi|258620794|ref|ZP_05715829.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258586992|gb|EEW11706.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 419 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 50/431 (11%), Positives = 110/431 (25%), Gaps = 42/431 (9%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMV 69 K I ++ II+ + + ++++ F + + + A+ A LA + K Sbjct: 1 MKSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKED 60 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + R K + S+ V ++ Sbjct: 61 DENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHK 120 Query: 130 THMANNRLDSSNNTIFYNMDV-------MTSYDYRLQFIEHLLNQRYNQKI-VSFIPALL 181 + ++ + V + + + N K + + ++ Sbjct: 121 SWISYENISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVI 180 Query: 182 RIEMGERPIFLIELVVDLS--GSMHCAMNSDP-----EDVNSAPICQDKKRTKMAALKNA 234 + + F E+ ++ G + + D Q K Sbjct: 181 TRVVNDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYALYNTPQQTVSNMFFPPKRV 240 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 Y + S TE + + + TD Sbjct: 241 NPSDSGLFSHRDIDLTQNYSSFLQIMNDRNFYPPRSACTESWQGIIAAAQAADKA--TDI 298 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE------------NN 342 P + I+ SD + R G + + K ++ DG N Sbjct: 299 NP---EQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLV---DGGLCKNLKQRIRQKPN 352 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 F+S+ T + + + I +N N CV ++ D ++ Sbjct: 353 RFQSSTPTENE-----KTKVTMGVIGVNYQVNPNDGFGDCVGRENIYHATQGEDVYKYIL 407 Query: 403 QNISQLMVHRK 413 I++ K Sbjct: 408 NLINEETGRLK 418 >gi|258624772|ref|ZP_05719703.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] gi|258582934|gb|EEW07752.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] Length = 419 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 50/431 (11%), Positives = 110/431 (25%), Gaps = 42/431 (9%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMV 69 K I ++ II+ + + ++++ F + + + A+ A LA + K Sbjct: 1 MKSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKED 60 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + R K + S+ V ++ Sbjct: 61 DENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHK 120 Query: 130 THMANNRLDSSNNTIFYNMDV-------MTSYDYRLQFIEHLLNQRYNQKI-VSFIPALL 181 + ++ + V + + + N K + + ++ Sbjct: 121 SWISYENISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVI 180 Query: 182 RIEMGERPIFLIELVVDLS--GSMHCAMNSDP-----EDVNSAPICQDKKRTKMAALKNA 234 + + F E+ ++ G + + D Q K Sbjct: 181 TRVVNDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYALYNTPQQTVSNMFFPPKRV 240 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 Y + S TE + + + TD Sbjct: 241 NPGDSALFSHRDIDLTQNYSSFLQIMNDRNFYPPRSACTESWQGIIAAAQAADKA--TDI 298 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE------------NN 342 P + I+ SD + R G + + K ++ DG N Sbjct: 299 NP---EQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLV---DGGLCKNLKQRIRQKPN 352 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 F+S+ T + + + I +N N CV ++ D ++ Sbjct: 353 RFQSSTPTENE-----KTKVTMGVIGVNYQVNPNDGFGDCVGRENIYHATQGEDVYKYIL 407 Query: 403 QNISQLMVHRK 413 I++ K Sbjct: 408 NLINEETGRLK 418 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 46/485 (9%), Positives = 108/485 (22%), Gaps = 96/485 (19%) Query: 4 LSRFRFYFKKG----IASEK--ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 +SR ++ + + + +I+FALS++ L L+G + K +++ A Sbjct: 1 MSRLLSALRRLGRWAARARRPDGSVNILFALSLIPVLGLVGLAVDYGLAAADKTTLDHAA 60 Query: 58 NAAILA---------GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + A LA A++ +NL+ A+ + F + ++ Sbjct: 61 DTAALAAVVTAKSYIAANQGQANLTANAIAAGLAQAANVFAVNAGSVPFAQVTLQPPQLV 120 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQ 167 S + + N+ + + + + Sbjct: 121 RSGQTLTATVSYGATIQNSFGKLLGTPTTLLGNSVTASADLPSYLDFYLLVDVSGSMGLP 180 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 P + + + H + + K + + Sbjct: 181 --------ATPGGMTQLASVNKDMWSDYQQGCQFACHFPGFTGW------GLAAGKIQLR 226 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---- 283 A+ A+ + V +G+ + ++ + + + Sbjct: 227 SDAVNAAVCSLIQRA-STPAVPNQYRVGIYPFINQMATLVGITGSVASLNAAAQCALSWP 285 Query: 284 --------------------MDSLILKPTDS---TPAMKQAYQILTSDKKRSFFTNF--- 317 T P ++ A + + F Sbjct: 286 LAFTNLLDTGTTQLFAYGDPTTGTASGGTHFEVVMPQLQAAIKAFGDGSSSTSPKPFVFL 345 Query: 318 ------------FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 D + + + +C K I Sbjct: 346 ITDGMQNGQHYGAPANGTYAYPGNPSSFWGYADAWWDGSQPSQIDPTVCAGLKSAGATIS 405 Query: 366 TISI----------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +SI SP LK C S P + + Sbjct: 406 ILSIPYNLITFVNNGGGVAWENNRVSGFSPTLATPLKACAS-PGFFATANTPADITASLN 464 Query: 404 NISQL 408 + Sbjct: 465 AMFDQ 469 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 67/236 (28%), Gaps = 19/236 (8%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 K S+ NF+++ L + + L + F + V K+++++A +AA LA + Sbjct: 2 LLNKFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLASSHLGD 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++SR + + F T+ + + Sbjct: 62 LDISRTDAFDR---YFQANIAGHGELANAQATLTVDRGVN---FIKTKAVASADVNLNFG 115 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVSFIPALLRIE 184 +NR + + D L + R K + + Sbjct: 116 FLFGHNR-HIAVDASAVESDNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEATKSP 174 Query: 185 MGERPIFLIELVVDLS-----GSMHCAMNSDPEDVN--SAPICQDKKRTKMAALKN 233 + L+ V ++ N + P+ K+ MA K Sbjct: 175 TRQIRASLVPFVTAVNVNGDEFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQ 230 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 61/187 (32%), Gaps = 41/187 (21%) Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + +K+R+ + + T+ + + ++L+ + G Sbjct: 369 CPTPVVSLTDDFDKLRKAASEMTE-WNGSGTNVSEGLSWGMRVLSP-------AAPYTDG 420 Query: 322 VKIPSLPFQKFIIFLTDGENNNF--------------------------------KSNVN 349 + K ++ LTDGEN + + Sbjct: 421 APWKTPGISKIVLLLTDGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGW 480 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 T +C + K ++I T+ + + R L C S P +Y V + L VFQ+I+ Sbjct: 481 TKSVCTQLKNQGVQIYTMVLQSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQHIANK 540 Query: 409 MVHRKYS 415 + + Sbjct: 541 FSRLQLT 547 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 80/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + + +++A+K L F+ Sbjct: 95 KYRDLMLVVDLSGSMQKEDMN----------LDGEYIDRLSAVKKVLSDFVAK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + I+ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PIEAAEIAKKYNATIYTVG 237 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A+ L ++ I++ Sbjct: 238 VGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYNTINK 294 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 39/444 (8%), Positives = 108/444 (24%), Gaps = 49/444 (11%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA-- 63 R R I ++ IIF + ++++ F + + H + A A LA Sbjct: 11 RLRGSSMS-IKKQQGVAGIIFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAEVASLALI 69 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + K ++ + A+ F + + + +V+ Sbjct: 70 ASPKEGDEKNQEYAQKIVDHYIPDNKGEVVARVFNRRCEYKDGCVQRSGELAPFTDFVVS 129 Query: 124 SSRISMTHMANNRLDSS--------NNTIFYN-MDVMTSYDYRLQFIEHL--LNQRYNQK 172 + + ++ N + + + + + + + Sbjct: 130 AKTKHDSWISYNDGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKT 189 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + + F + ++ RT A + Sbjct: 190 KYDVVADTINRIVDDLREFKTDRKSRVAVIGFHHTAVKKVGRQRTAFDYSSYRTPSATVN 249 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKP 291 N I + + ++I + + Sbjct: 250 NMFTA--PKIHSRNDSSNI----------KTFEDIPLTEDYDAFLTKFNSSNYYASRYGL 297 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD--GENNNFKSNVN 349 T+S + A Q+ + F + ++ N + + + Sbjct: 298 TESWQGIIGAAQMAEQATDLNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRLV 357 Query: 350 TIKICDKAK--------------------ENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 +C+K K + + + I +N + + C + Sbjct: 358 KAGLCEKLKTRISQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDCFGHDNIY 417 Query: 390 YNVVNADSLIHVFQNISQLMVHRK 413 + D ++ I++ K Sbjct: 418 HAKEGNDVYKYILNLINEETGRLK 441 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 29/240 (12%), Positives = 77/240 (32%), Gaps = 31/240 (12%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 R + V+D+SGSM K+A LK A+ +D Sbjct: 174 AAAGDAPRAPLDLVTVLDVSGSM--------------------IGAKLALLKQAMGFVID 213 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMK 299 ++ + ++ ++ R + + + ++SL T+ ++ Sbjct: 214 NLGPHD------RLSVVSFSDRARRVTRLLRMSGDGKAAAKSAVESLVARGGTNIAEGLR 267 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A ++L + R+ ++ + ++ G N+++ V + ++ Sbjct: 268 TAARVLEERRHRNTVSSVILLSDGQDTYTAPRWSRGPGAGATPNYEALVPPSFMATSTRD 327 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + T + + S+ + N + F +Q + +V+++ Sbjct: 328 WSAPVHTFGFGNDHDAAAMHVIAESTAGTFSYIGNEAVIQDAF---AQCIGGL-LTVVVQ 383 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 82/234 (35%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSMSIPDM---------VTKNGQSIDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ + V Q + R + LI + T + A + Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRKTVEQQLDRTVLGLIGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N + + ++ AKE+ +KI T+ + A Sbjct: 182 ------------INSKAPQRVIILLSDGANTSGVID--PLEAAKLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ N L ++ I++ Sbjct: 228 DQMVQQGFFGDRIVNPSQDLDEKTLTEIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 23/287 (8%), Positives = 63/287 (21%), Gaps = 17/287 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SR K I+ ++ N ++ A +++ + +G + + + + ++ A +A +LA Sbjct: 6 ISRLCAGTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLA 65 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G M + E+ S + + E Q Sbjct: 66 GRRAMGGASYDEAAQAEANKMFNFNFPEAKY-----GATGILFSSRALNASDVEGQASAV 120 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP----- 178 M S + L + + Sbjct: 121 LPTELMFMFGKEEFRLSADCTAKLEISNVDVMLVLDVTGSMAQTNAGDSVNRITALKDAT 180 Query: 179 -------ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + G ++ + + ++ + + Sbjct: 181 MDFFDTLTNADVGDGRLRFGVVPYSSTANVGQILLAKNPAWLADTVTLPSRTPIFREVYT 240 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + D D + + + + Sbjct: 241 ETGTETSDDYTDSPTTYSSNWTNDGTVPASNSAACAALTPPANTTPS 287 Score = 81.1 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 37/269 (13%), Positives = 65/269 (24%), Gaps = 40/269 (14%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + I+LV D + A Q + N Sbjct: 405 DADDTAPSDALDMDIDLVPDADADTQWRILIPEIAFPRARHPQYAPSSSDPMTVNVFNSD 464 Query: 239 LDSIDLLSHVKE-----DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPT 292 G+ ++ + T R L T Sbjct: 465 GTRNITSDTSSNGRWQRYSKYWGSGWGVCPAAAMKLTTMTASDRATFNTYVQSLQPLGGT 524 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG------------- 339 M ++L+ + P + I+F+TDG Sbjct: 525 YHDAGMVWGARLLSPTGLFADENA-----TAPNDRPISRHIVFMTDGAMAPNMGNLTFQG 579 Query: 340 -------------ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + + N ++C A++ I I +S N L C SS Sbjct: 580 YEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVGSNDS--LNNCASS- 636 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + NA L FQ I++ + + + Sbjct: 637 GQAFEADNAAELNEQFQAIARQISKLRLA 665 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 80/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + + +++A+K L F+ Sbjct: 95 KYRDLMLVVDLSGSMQKEDMN----------LDGEYIDRLSAVKKVLSDFVAK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + I+ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PIEAAEIAKKYNATIYTVG 237 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A+ L ++ I++ Sbjct: 238 VGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 294 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 80/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + + +++A+K L F+ Sbjct: 89 KYRDLMLVVDLSGSMQKEDMN----------LDGEYIDRLSAVKKVLSDFVAK------- 131 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 132 RKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTF--- 188 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + I+ + AK+ I T+ Sbjct: 189 ---------------VDSDAPQRVMILLSDGSNTAGVLD--PIEAAEIAKKYNATIYTVG 231 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A+ L ++ I++ Sbjct: 232 VGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 288 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 79/235 (33%), Gaps = 36/235 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + +++ A K L F+ + ++ Sbjct: 107 DVMVVVDLSGSMAEQDF---------TSKAGENISRLNAAKEVLSDFVKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T AM A ++ K R Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAMGLAIKVFEQSKSR 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K I LTDG + S V I+ AK ++I I++ Sbjct: 211 IGVEENKE-----------KVAIVLTDGNDTG--SFVEPIEAAKVAKAKGVRIHVIAMGD 257 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + + S + +N D L + +I +L S + Sbjct: 258 PQTLGEAALDMKTIRRIAKESGGKAFEAMNRDELAKAYDDIGRLEPQLYESTTYR 312 >gi|253687077|ref|YP_003016267.1| hypothetical protein PC1_0676 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753655|gb|ACT11731.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 543 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 56/519 (10%), Positives = 128/519 (24%), Gaps = 119/519 (22%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG--- 64 R + I EK + + L M+ L+ F++ ++ A +AA LA Sbjct: 22 REHLSAFIQQEKGAGTGFYVLGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQ 81 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-GYSAVFYNTEIQNIVN 123 A+ S+ + ++ A S++ G ++ T + Sbjct: 82 ATINGEEYSQEETNKLAYEYVKNNLGMNKALSEKLVASDVSVAEGRNSATRKTYTVTVAF 141 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL------------------- 164 ++ S+ + + + + ++ T + + Sbjct: 142 ETKPSLMSLGARKQEVY--STSEVINRPTEIALVMPVTGDMSDADIRSLKSVSRSFVERM 199 Query: 165 ---LNQRYNQKIVSFIPALLRIEMGER-----------------PIFLIELVVDLSGSMH 204 + + + +S +P + + + P + S+ Sbjct: 200 LSSADGKRDNLWLSLVPYSQSVNVYDAEDANRIRRWSTPSALNPPELRSLFASGVVSSLA 259 Query: 205 CAMNSD----------------PEDVNSAPICQDKKRTKMAALKNA------LLLFLDSI 242 D + P+ Q + + +N + + Sbjct: 260 DRRFPDRRANLLCMYRGLGREENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPITWRGPNP 319 Query: 243 DLLSHVKE----DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 DL D + + + K+ Q + T+ AM Sbjct: 320 DLYPWDNNSDAVDTRFIVADRGCPNAALMPLTNEESKLNQRIAEFT---PRFNTNYAIAM 376 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVN------ 349 A L+ + + S + + QK I+ + + N F ++ Sbjct: 377 SWAGAALSPNMRGSDGWGDTTLPLDFNLDGNGDGQKVIVMMANTIGNWFDTDSYNFNRNE 436 Query: 350 -----------------TIKICDKAKENFIKIVTISI--NASPN---------GQRLLKT 381 +C + IK + I + L Sbjct: 437 FRGSTGTDPARSFAAQRFQDLCSSFRARNIKFYFVGIRPGDPEDFGRNLFDREATPGLLV 496 Query: 382 CVSSPEYHYNVVNA--------DSLIHVFQNISQLMVHR 412 C + + A D LI I+ + Sbjct: 497 CTEGEKRMSFIDGAGFGAEGVEDQLIQRLDRIAGQIETE 535 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 86/242 (35%), Gaps = 29/242 (11%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + E+ + +VVDLSGSM + K +++ A K L F + Sbjct: 95 VREQLGRDVMVVVDLSGSMAEQDF---------TSKRGDKISRLDATKEVLADFATT--- 142 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GLI + + + + + + ++ + T A+ A ++ Sbjct: 143 ----RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKV 198 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +K + Q + + +K +I LTDG + S V I AK ++I Sbjct: 199 FEQSEK----QSAAVQDSSVDANEKEKVVIVLTDGNDTG--SFVEPIDAAKVAKAKGVRI 252 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I++ + + + + S + +N D L + I +L S Sbjct: 253 HVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGELEPQLYESTT 312 Query: 418 LK 419 + Sbjct: 313 YR 314 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 36/277 (12%), Positives = 81/277 (29%), Gaps = 22/277 (7%) Query: 153 SYDYRLQFIEHLLN--QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + + I + +P + + + L GS H Sbjct: 94 EIVLVMDTSTSMKCLVEPESYDIDNCVPTKEGHIVYIKDKSYLVNTTFLRGSRHKLFYIT 153 Query: 211 PEDVNSAP----ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 N + + LK+A + + + K + +GL+ + TR + Sbjct: 154 MGTTNYYIQGNKCYRQSSYNEKNRLKHAQESAIKFVKKFENDKN-ISIGLVSFDTRAIEQ 212 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL--------------TSDKKRS 312 E + +V+ + + T+ +K A +IL + + Sbjct: 213 KELTSSLSEVKSSINNLKVAY-NGATNIEAGLKSAQKILKKGNEDADKYVILMSDGFPTA 271 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F + Q + + ++I D K+ I I + Sbjct: 272 FDYAGEKFEENFNEHEVQDNTFINFGYNDYRGYAMKHSINQADSLKKVGINSFIIGFSDG 331 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N ++L K ++ + N D+L + I + Sbjct: 332 ANSEKLNKIAKAAGGEYEEARNTDALNGAYNKIETKV 368 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 30/268 (11%), Positives = 75/268 (27%), Gaps = 61/268 (22%) Query: 202 SMHCAMNSDPE--DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S + +N N ++ R+++ ++K F+D + + ++ Y Sbjct: 698 SGNFTINGKKYYVKDNKVYEFNEQDRSRIDSVKKVANDFVDKFKDDE----NTEIAIVRY 753 Query: 260 TTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS-DKKRSFF 314 +++ + ++ S ++ R T+ ++++Y IL DK + Sbjct: 754 SSKADVVLDNSNKVFLSSKDNETIKKRINSLKADVATNIGDGIRKSYSILDKCDKDSEKY 813 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTD------------------------------------ 338 GV + I L + Sbjct: 814 MILMTDGVPTAYTCYANTIKTLNNCKYGEEELDLGYCPEGYIECYNRKYYYSEVKGDFKL 873 Query: 339 -------------GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 G+ + + + K+K I V + + + ++ LK Sbjct: 874 ENDNRDEGYVIKFGDEYDKNALEYAKQAMQKSKSKNINNVIVGFSDGIDTKK-LKEIAGD 932 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + I + + K Sbjct: 933 NAQYKEAKDLGELSKQYDEIQKDINTIK 960 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 41/246 (16%), Positives = 80/246 (32%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ ALK F+D ++ Sbjct: 91 DLMLAVDLSGSMRTDDF----------QLSGRSVDRLTALKAVAGAFIDQ-------RQG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ V + + + L T A+ A + L Sbjct: 134 DRIGLILFGEQPYIQAPLTFDHNTVTRLLHEAVVGLAGNKTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG +N+ + +K A + +K+ TI I A Sbjct: 188 ------------RKDPQAKNVLILLTDGASNSGSLD--PLKAAKLAAQRGLKVYTIGIGA 233 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L ++ ++ + + L ++Q + QL K Sbjct: 234 EAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRYFRARDTEELAQIYQQLDQLEPVEK 293 Query: 414 YSVILK 419 + + Sbjct: 294 DKQVYR 299 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 45/409 (11%), Positives = 108/409 (26%), Gaps = 32/409 (7%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + F +K + ++ F ++ L + + + N A+ Sbjct: 2 VKSMFSSLARQQKGSITLSFLALLLPALFIAVGTLMISAQVMVSNR------------AA 49 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN----HIKESLSGYSAVFYNTEIQNIV 122 + + + + + RA D +K + S + T + Sbjct: 50 QAADSAALACAFADEATPRMMRAYQDYYHPTLKGVSGLTPENQGCRISLGYSVTPLLPHF 109 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV-SFIPALL 181 N R A + + T L + + K + S + Sbjct: 110 NYERYDQNVTATG-GGFKGVVESKHSAIPTELVLVLDVSGSMSPNIQSLKSILSNALNTI 168 Query: 182 RIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-LF 238 + + + + + + + C D + +L Sbjct: 169 QSQSNNANDLDSVSISIVPFDSGVATHRPPWLSEETAGIYCIDGLSYRNGDFSASLTVDN 228 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 L ++ VK + + + + +V+ + T S + Sbjct: 229 LATLHSERPVKFTPPSKWLSDCNQESPMLPLTNVFSRVQNSINSLT---ANGGTRSYQGL 285 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-IC-DK 356 + L ++++ + ++ TDG + N C Sbjct: 286 VWGVRQLIPSWQQAWGMKVSSVPETR------RKLVLFTDGADEGDAFNQLVNAGFCTTA 339 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K+ I++ I SP+ + C +P ++ N L F +I Sbjct: 340 IKQYGIEMNFIGYGVSPSRITQFENCAGNPLRVFSATNTTQLNEYFSDI 388 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 79/235 (33%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + +K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMAEQDFTSKT---------GEKISRLDAAKEVLSDFAKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T A+ A ++ +K Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ K I LTDG + S V I AK ++I I++ Sbjct: 211 RTDVEESKE----------KVAIVLTDGNDTG--SFVEPIDAAKVAKAKDVRIHVIAMGD 258 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 259 PQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGKLEPQLYESTTYR 313 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 80/233 (34%), Gaps = 55/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM QD ++ A+K+ + F+D ++ Sbjct: 86 DMMLVIDLSYSMSQQDM----------AYQDDYIDRLTAVKHVVSDFVDR-------RKG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ E V+ + + + LI T + A + Sbjct: 129 DRVGLVYFADHAYLQTPLTFDRETVKTQLNQTVLKLIGTQTAIGDGIGLATKTFVDSNA- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG NN + ++ + AK+ I TI + A Sbjct: 188 -----------------PQRVMILLSDGSNNAGVLD--PVQAAEIAKKYGTTIYTIGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + L+K + ++ NAD L ++ I+ Sbjct: 229 GEMQVKDFFMTRTVNTAEDLDEKTLIKIANITGGQYFRARNADELATIYDTIN 281 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 75/237 (31%), Gaps = 58/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD S SM + ++ ++++ LK + F+D + Sbjct: 97 DLMLAVDTSRSMTAEDFTV----------HGREVSRLSVLKGIMGKFVD-------GRVG 139 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ + Q + + +L T + + L Sbjct: 140 DRIGLIIFGDTSYVLSPLTFDRNAIHQLLDGIVPTLAGGGTAIGDGIGLGIKKL------ 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG+N +K AK+ I+I TI + + Sbjct: 194 ------------RERPEGSRVLILVTDGKNETGTIP--PLKAAQLAKQEGIRIYTIGVGS 239 Query: 372 S---------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V+Q I + Sbjct: 240 TKNRVRLLSPDLRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRIDE 296 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 78/240 (32%), Gaps = 47/240 (19%) Query: 211 PEDVNSAPICQDKKRTKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 P P + + + + L ++D + + + R Sbjct: 292 PYFAPDEPDNGSYSNSYLSDGVSGGVSERLQALDKYDNGRPNREG-----PNRSCTTTPV 346 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + T R + D T+ + ++++ F +G + Sbjct: 347 TPLTSTERTVLNAIGDMGASGTTNIPNGVGWGIRLISPG-------APFTEGSAWDDDEY 399 Query: 330 QKFIIFLTDGEN---------------------------------NNFKSNVNTIKICDK 356 K ++ LTDG+N + + + T C Sbjct: 400 IKAMVILTDGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSNVLSNELDDRTEAACAY 459 Query: 357 AKENFIKIVTISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 A+ I++ TI+ + + + L++ C S+P +++ ++++L F+ I+ + + + S Sbjct: 460 ARSLGIRVYTITFQVNSSSTRSLMQNCASNPSLYFDSPSSEALEDAFEMIAGDLTNLRLS 519 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 17/232 (7%), Positives = 50/232 (21%), Gaps = 28/232 (12%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + N + IFAL+++ LL G + + ++ Sbjct: 6 LFNLGARLCRETRGNVATIFALTLVPVALLSGGAVDLSQSMNARSR-------------- 51 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH--------IKESLSGYSAVFYNTEI 118 + L +N + A FI + ++ + Sbjct: 52 -LAQALDAAALAVGVNTNLSSSEATGIANDFIAANYPGRELGVVQNVNVYIDDETDTVTV 110 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + + + + L + + S Sbjct: 111 SGEARVRTTMLGMIGLDYITVHWESEVQRARQRLELVMVLDNTGSMGGSKIRNLRESAEL 170 Query: 179 ALLRIEMGERPIFL-----IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + ++ + A + +P+ + Sbjct: 171 LTGILFDAADDPSDVKIGLVPFAATVNVGTNHARAWWMDPDALSPVHAEWAG 222 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 39/445 (8%), Positives = 114/445 (25%), Gaps = 51/445 (11%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA-- 63 R R I ++ IIF + ++++ F + + H + A A LA Sbjct: 11 RLRGSSMS-IKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALI 69 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + K ++ + A+ F + + + +V+ Sbjct: 70 ASPKEGDEKNQEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVS 129 Query: 124 SSRISMTHMANNRLDSS--------NNTIFYN-MDVMTSYDYRLQFIEHL--LNQRYNQK 172 + + ++ N + + + + + + + Sbjct: 130 AKTKHDSWISYNEGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKT 189 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + + F + ++ +A ++RT Sbjct: 190 KYDVVADTINRIVDDLREFKTDRKSRVAVIGFHH---------TAVKQVGRQRTAFDYSS 240 Query: 233 -NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LK 290 ++++ + G I + ++I + + Sbjct: 241 YRTPSATVNNMFTAPKIHSRNDSGNI----KTFEDIPLTEDYDAFLTKFNSSNYYASRYG 296 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD--GENNNFKSNV 348 T+S + A Q+ + F + ++ N + + + Sbjct: 297 LTESWQGIIGAAQMAEQATDLNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRL 356 Query: 349 NTIKICDKAK--------------------ENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +C+K K + + + I +N + + C Sbjct: 357 VKAGLCEKLKTRISQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDCFGHDNI 416 Query: 389 HYNVVNADSLIHVFQNISQLMVHRK 413 ++ D ++ I++ K Sbjct: 417 YHAKEGNDVYKYILNLINEETGRLK 441 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 78/283 (27%), Gaps = 55/283 (19%) Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 MT + + + L + V + + Sbjct: 475 MTFVPLDEQGVDPINKVVWAKTDHFTTFVL----FYIPTWKAIWEVPINKGEREVNQQIK 530 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + V+D SGSM + K A F+D++ Sbjct: 531 -YIDLVFVLDSSGSMSWNDPNGYR-------------------KIAAKSFVDALIQGDRA 570 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ + + + V+ + R T+ ++ A L S Sbjct: 571 ------AVVDFDDYGYLLQPLTTDFQTVKNAIDRIDSW---GGTNIAEGIRIANHQLISQ 621 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K II LTDGE + +AK N I I TI Sbjct: 622 SSDDRI----------------KVIILLTDGEGYYDNNLT------TEAKNNGITIYTIG 659 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + S + L + ++ V +A L VF+ I++++ Sbjct: 660 LGTSVDENLLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTE 702 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 64/263 (24%), Gaps = 34/263 (12%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I+ NF I+ AL ++ + G + + + +A A + S Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSS 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLS--GYSAVFYNTEIQN--------IVN 123 + N DA+ + + L+ T+ N Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANKLNSQVSFTAT 124 Query: 124 SSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S Y + L + Sbjct: 125 VPTTFMQIFGRDSITISGTATAEYQTAAFMDFYILLDNTPSMG----------------- 167 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +E + + H D N ++ ++ A D+ Sbjct: 168 VGATPSDVSKLEAKTGCAFACHQM---DKSTNNYTIAKSLGVAMRIDVVRQATQALTDTA 224 Query: 243 DLLSHVKEDVYMGLIGYTTRVEK 265 + MG+ + T+ E Sbjct: 225 KTERISSDQFRMGVYTFGTKAED 247 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 78/235 (33%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMAEQDF---------TSASGANISRLDATKEVLAEFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ + Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEPS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S P QK I LTDG + S V I AK ++I I++ Sbjct: 211 Q----------AASSKPRQKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + Q + + S + +N D L + +I +L S+ + Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGKLEPQLYQSITYR 313 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 85/240 (35%), Gaps = 48/240 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ T++AA+K + F+D ++ Sbjct: 95 DLLLAVDISGSMQQEDMQIN----------NRPATRLAAVKKVVSDFIDQ-------RQG 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ T+ V Q++ K T A+ + + L + Sbjct: 138 DRIGLILFGTQAYLQTPLTFDTQSVNQFLQEAQLGFAGKDTAIGDAIGLSVKRLKN---- 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K II LTDGEN + ++ A++ KI T+ I A Sbjct: 194 -------QSSASSAKPSNSKVIILLTDGENTAGEV--EPLQAAKLAEKIGAKIYTVGIGA 244 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L ++ ++ N L +++ + +L K Sbjct: 245 DEMIVRGFFGNRRVNPSASLDEETLTAIANTTGGLYFRARNTQELNNIYSELDKLEPTEK 304 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 36/471 (7%), Positives = 109/471 (23%), Gaps = 87/471 (18%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 N +IIFAL+ + L+ G + + + + ++A LA + + + + Sbjct: 19 GNVAIIFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSATLAATTPAMMQQTTATAKT 78 Query: 80 ESISNHAKRALIDDAKRFIKNH----IKESLSGYSAVFYNTEIQNIVNSSRISMTH--MA 133 + S A + + + + + + S + + + + Sbjct: 79 VATSMFAAQVAQINRLTYNSANLTVTVNDDTSASPVKTRTVTVSYLAQVGNAFGSFYHVP 138 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + ++ + + L + + S A + + Sbjct: 139 TSIFTVKASSTA-STARNIDFYLVLDNSPSMELPATTAGLASMTAATGCVFACHENTYSD 197 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-LSHVKEDV 252 + ++ ++ A + +S Sbjct: 198 PENTVQYPGY-------GTIDSYTYAKNAGIALRIDNVREAAKRLASTSQAMMSANGATY 250 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + + T ++ ++++ + + + Sbjct: 251 RLAAYAFNYDTTQLQALTSTTSANVSAISTSINAMTPPLMEKNNYLPTGASYTYPTSAST 310 Query: 313 FFTNFFRQGVKIPSLPF--------------------------------QKFIIFLTDGE 340 + T + Q+ ++ +TDG Sbjct: 311 WTTVTLGSDPTKTNYNVRDAMTDIEMTLTKVNAAMPNPGNGTTASGDKPQEVVMLVTDGM 370 Query: 341 NNNFKSNVNT-----------------------IKICDKAKENFIKIVTI---------- 367 + + +C K I+I + Sbjct: 371 VDGSFYTNTSCTNYASSYSNSYGTFYRCLRPLDTTLCTTIKNRGIRIAVLNLIYYPTPGY 430 Query: 368 SINASP------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 LK+C S+ + ++ V + + Q +V Sbjct: 431 GFYDGAVAPFISTVSPALKSCAST-DLYFEVDTGSDISEAMTYLFQKVVTT 480 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 77/237 (32%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + ++ A+K L F+ Sbjct: 95 KYRDLMLVVDLSGSMQQEDMELN----------GEYIDRLTAVKKVLSDFVAK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RKGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVG 237 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ + D L ++ I+Q Sbjct: 238 VGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEMTGGQYFRARDTDQLEKIYDTINQ 294 >gi|297727663|ref|NP_001176195.1| Os10g0464900 [Oryza sativa Japonica Group] gi|22758314|gb|AAN05518.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432565|gb|AAP54180.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|255679473|dbj|BAH94923.1| Os10g0464900 [Oryza sativa Japonica Group] Length = 646 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 60/204 (29%), Gaps = 12/204 (5%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A LK A+ +D++ + +I +++ + + S T+ + + Sbjct: 183 SGSMVGNKLALLKQAMGFVIDNLGPGD------RLCVISFSSGASRLMRLSRMTDAGKAH 236 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R + SL T+ A+++A ++L R+ + + + D Sbjct: 237 AKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRGGYDRD 296 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + T + + + + N ++ Sbjct: 297 ANYDALVPPSLVRADAGGGGGRAPPVHTFGFGKDHDAAAMHTIAEVTGGTFSFIENEAAI 356 Query: 399 IHVFQN-ISQLMV----HRKYSVI 417 F I L+ + V Sbjct: 357 QDGFAQCIGGLLSVAVQELRLDVA 380 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 77/237 (32%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + ++ A+K L F+ Sbjct: 95 KYRDLMLVVDLSGSMQQEDMELN----------GEYIDRLTAVKQVLSDFVAK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RKGDRLGVVLFGDHAYLQTPLTADRKSVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVG 237 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ + + L ++ I+Q Sbjct: 238 VGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQ 294 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 81/234 (34%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSMSIPDM---------VTKNGQSVDRLTAVKHVLSDFIEK-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ + V + + R + LI + T + A + Sbjct: 128 DRLGLVLFADHAYLQTPLTFDRKTVEKQLDRTVLGLIGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N + + ++ AKE+ +KI T+ + A Sbjct: 182 ------------INSKAPQRVIILLSDGANTSGVID--PLEAAKLAKESGVKIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ N L ++ I++ Sbjct: 228 DQMVQQGFFGDRIVNPSQDLDEKTLTDIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 79/220 (35%), Gaps = 41/220 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +DLSGSM + T+++ +K F+ ++ Sbjct: 92 DIMMAIDLSGSMEERDF----------AVGGRPATRLSIVKETADDFI-------SRRDG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ R ++ E VR+ + + L + T A+ + + L Sbjct: 135 DRLGLVLFSDRAYLQAPLTFDREAVRKLLDQAQVGLTGQKTAIGDAIAVSVKRL------ 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN + K D A + I+I TI + + Sbjct: 189 ------------KDRPEDGRVLVLLTDGANNEGVMS--PDKAADLAAKLGIRIYTIGVGS 234 Query: 372 ----SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ + L +++ I + Sbjct: 235 ARSRDLDERTLRQIADATGGAYFRATDVQGLAQIYRAIDR 274 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 86/242 (35%), Gaps = 29/242 (11%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + E+ + +VVDLSGSM Q K +++ A K L F + Sbjct: 95 VREQLGRDVMVVVDLSGSMAEQDF---------TSKQGDKISRLDATKEVLADFAKT--- 142 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GLI + + + + + + ++ + T A+ A ++ Sbjct: 143 ----RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLATKV 198 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +K + Q I + +K +I LTDG + S V I AK ++I Sbjct: 199 FEQSEK----QSAAVQDSSIDANVKEKVVIVLTDGNDTG--SFVEPIDAAKVAKAKGVRI 252 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I++ + + + + S + +N D L + I +L S Sbjct: 253 HVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGELEPQLYESTT 312 Query: 418 LK 419 + Sbjct: 313 YR 314 >gi|261820223|ref|YP_003258329.1| hypothetical protein Pecwa_0904 [Pectobacterium wasabiae WPP163] gi|261604236|gb|ACX86722.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 539 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 50/513 (9%), Positives = 127/513 (24%), Gaps = 117/513 (22%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG---ASK 67 I EK + +AL M+ L+ F++ ++ A +AA LA A+ Sbjct: 25 LSAFIHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATI 84 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKR--FIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 S+ + ++ A + + + G ++ T + ++ Sbjct: 85 NGEEYSQEDINTLAYEYVKSNLGMNKALSEKLVAGDVSVT-EGRNSATRKTYTVTVAFAT 143 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL--------------------- 164 + S+ + + + ++ T + + Sbjct: 144 KPSLLSLGARAQEVY--STSEVINRPTEVALVMPVTGDMSAGDIRSLKSVSHSFVERLLG 201 Query: 165 -LNQRYNQKIVSFIPALLRIEMGER-----------------PIFLIELVVDLSGSMHCA 206 + + + +S +P + + + P + S+ Sbjct: 202 SADTKRDNLWLSLVPYSQSVNVYDAEDANRIRRWAAPGALNPPELRSLFASGVVSSLADR 261 Query: 207 MNSD----------------PEDVNSAPICQDKKRTKMAALKNA------LLLFLDSIDL 244 D + P+ Q + + +N + + D Sbjct: 262 RFPDRRANLLCMYRGLGREENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPISWRGPNPDF 321 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + D + + + K+ Q + + + + M A Sbjct: 322 DDNDAVDTRWIVADKGCPNAALMPLTNEASKLNQRIDQFSARFNV---NYAIGMSWAGAA 378 Query: 305 LTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNT----------- 350 L+ + + S + + QK I+ + + + F ++ Sbjct: 379 LSPNMRGSDGWGDTTLPLDFNLDGNGDGQKVIVMMANTIGDWFDTDSYNFNRNEFRGGTG 438 Query: 351 ------------IKICDKAKENFIKIVTISI--NASPN---------GQRLLKTCVSSPE 387 +C + IK + I + L C + Sbjct: 439 NDLARTFAAQRFRDLCSSFRARNIKFYFVGIRPGDPEDFGRNLFDREATPGLLVCTEGEK 498 Query: 388 YHYNVVNA--------DSLIHVFQNISQLMVHR 412 + A + LI I+ + Sbjct: 499 RMSFIDGAGFGAEGVENQLIRRLDRIASQIETE 531 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 54/569 (9%), Positives = 122/569 (21%), Gaps = 181/569 (31%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 +K +E ++++ AL L+LIGF + K + + A LA ++ Sbjct: 12 DSIRKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVAEN 71 Query: 69 VSNLSR------------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 D+ + N + I+ K L +A + Sbjct: 72 NDYRENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENANASSD 131 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYN----------------------------- 147 I S + + Sbjct: 132 APITIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLVSADKLKQDR 191 Query: 148 ---------------MDVMTSYDYRLQFIEHLLNQRYNQ--------------------- 171 + + F + + ++ Sbjct: 192 IRMESDTSYAIKEKGIVIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVVSEIS 251 Query: 172 ---------------KIVSFIPALLRIEMGERPIFLI-ELVVDLS-----GSMHCAMNSD 210 + F + + + +S ++ + + Sbjct: 252 KILLPEDVSEGVSPFNRIGFTTFSGGVRQRDVTEGCVFPYEGKVSTTAQYFTIPYWITGN 311 Query: 211 PEDVNSAPICQDKKRT------------KMA---------ALKNALLLFLDSIDLLSHVK 249 + ++ T + K + + I + Sbjct: 312 NTPWRRGDNGRWERSTVRFEDHYKGYYERFDRRSCRGSGNNTKCTIYAYPKKIMDYALKI 371 Query: 250 EDVYMGLIGYTTRVEKNIEPS----------------------------------WGTEK 275 D + + + W +K Sbjct: 372 NDWTTVREIFNNYMNTEGTLAQISEFNGKNRNYDMVFTDEDRCLGGNIGRRTTRAWFDQK 431 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T ++ + I+ + K Q+ I+ Sbjct: 432 NKDITRELNIVRPSGWTSASSGLLVGANIMMDENKS------PDAKPSKLGTNIQRVILV 485 Query: 336 LTDGENNNFKSNVNTI----KICDKAKEN-------------FIKIVTISINA--SPNGQ 376 L+DGE+N + T +CDK +E +I ++ N Sbjct: 486 LSDGEDNWPTYSTLTTLLNNGMCDKIREQLGKLQDPNLRELPG-RIAFVAFGYSPPANQV 544 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K CV + +Y + + L+ F+ I Sbjct: 545 AAWKKCV--GDQYYTAYSKEELLESFKQI 571 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 77/237 (32%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LVVDLSGSM + ++ A+K L F+ Sbjct: 95 KYRDLMLVVDLSGSMQQEDMELN----------GEYIDRLTAVKKVLSDFVAK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + + V Q + + + L+ + T + + Sbjct: 138 RKGDRLGVVLFGDHAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + AK+ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGV--LEPLEAAEIAKKYNATIYTVG 237 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ + + L ++ I+Q Sbjct: 238 VGAGEMMVKEFFMTRKVNTAADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQ 294 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 76/239 (31%), Gaps = 55/239 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P + LVVDLS SM+ +D ++ A+K L F+ Sbjct: 81 SQPHRDMMLVVDLSYSMNQKDMQIGDDY----------IDRLTAVKQVLSDFIAQ----- 125 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + + V Q + + + LI T + A + Sbjct: 126 --RQGDRLGLIFFADHAYLQTPLTLDRTTVAQQLNQAVLRLIGTQTAIGDGIGLATKTFI 183 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 Q+ +I L+DG NN + I+ AK+ I T Sbjct: 184 ES------------------DAPQRVMILLSDGSNNAGVLD--PIEAAHIAKQYHTTIYT 223 Query: 367 ISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + A + + L K + ++ + L ++Q I + Sbjct: 224 VGVGAGEMMVRDFFMTRRINTAEDLDEETLQKIADLTGGQYFRARDKHDLQTIYQTIDK 282 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 79/235 (33%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + +K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMAEQDFTSKN---------GEKISRLDAAKEVLSDFAKT-------RKG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + ++ + T A+ A ++ +K Sbjct: 151 DRLGLILFGDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ K I LTDG + S V I AK ++I I++ Sbjct: 211 RTDVEESKE----------KVAIVLTDGNDTG--SFVEPIDAAKVAKAKDVRIHVIAMGD 258 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 259 PQTVGETALDMNTIKRIAKESGGEAFEALNRDELAKAYDEIGKLEPQLYESTTYR 313 >gi|125532271|gb|EAY78836.1| hypothetical protein OsI_33941 [Oryza sativa Indica Group] Length = 645 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 60/204 (29%), Gaps = 12/204 (5%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A LK A+ +D++ + +I +++ + + S T+ + + Sbjct: 183 SGSMVGNKLALLKQAMGFVIDNLGPAD------RLCVISFSSGASRLMRLSRMTDAGKAH 236 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R + SL T+ A+++A ++L R+ + + + D Sbjct: 237 AKRAVGSLSARGGTNIGAALRKAAKVLDDRLYRNAVESVILLSDGQDTYTVPPRGGYDRD 296 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + T + + + + N ++ Sbjct: 297 ANYDALVPPSLVRSDAGGGGGRAPPVHTFGFGKDHDAAAMHTIAEVTGGTFSFIENEAAI 356 Query: 399 IHVFQN-ISQLMV----HRKYSVI 417 F I L+ + V Sbjct: 357 QDGFAQCIGGLLSVAVQELRLDVA 380 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 82/240 (34%), Gaps = 28/240 (11%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS--------APICQDKKRTKMAALK 232 R+ + + + LV+D SGSM A + P+ + ++ L Sbjct: 42 QRVAVSTQSTMDVALVIDRSGSMAYASDETPDPYVNPASAPPGWTYGDPVPPNSRWLDLV 101 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 ++ F + + + L Y++ ++ + + ++ + T Sbjct: 102 ASVNAFNGFL---VDSPQYEKLCLATYSSTASRDCDLTHTYAEISNELDAISYQFDGGGT 158 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ +LT + ++ +TDG +N KS + + Sbjct: 159 SVGYGLEHGLAVLTD--------------ATHARKFAVRVMVLMTDGHHNTGKSPESMMY 204 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + + TI+ + + R+ + +++ +A L + FQ I++ + Sbjct: 205 ---HLQNHGVTLFTITFSDDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKLPSL 261 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 77/235 (32%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMAEQDF---------TSASGANISRLDATKEVLAEFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAIKVFEQS--- 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S P QK I LTDG + S V I AK ++I I++ Sbjct: 208 -------ESNQAASSKPRQKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + Q + + S + +N D L + +I +L S+ + Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELARAYDDIGKLEPQLYQSITYR 313 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 106 bits (263), Expect = 8e-21, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 79/235 (33%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM K +++ A K L F + ++ Sbjct: 107 DVMVVVDLSGSMAEQDF---------TSASGAKISRLDATKEVLADFAKT-------RQG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ + Sbjct: 151 DRLGLILFGDAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEPS 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S P QK I LTDG + S V I AK ++I I++ Sbjct: 211 QAAF----------SKPRQKVAIVLTDGNDTG--SFVEPIDAAKVAKAKGVRIHVIAMGD 258 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + Q + + S + +N D L + +I +L S+ + Sbjct: 259 PSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGKLEPQLYQSITYR 313 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 21/252 (8%), Positives = 63/252 (25%), Gaps = 11/252 (4%) Query: 34 LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + +G + + ++SM++A ++A+L + +N + + + +L +D Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLMVSKDAAANPTMTSQQITNAVQRYFTSLYND 60 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 F S+ M +L ++ + Sbjct: 61 KSAFGVTVSATYTPSSSSAAAKILASGQGAIQTDFMKIAGFPQLSFGTSSTSTWGNSRMR 120 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIP------ALLRIEMGERPIFLIELVVDLSGSMHCAM 207 L + + + + + G+ I +I D++ Sbjct: 121 VALVLDNTGSMSSNGKMSALQRAAKDMIDSLSAFAKKTGDVYISIIPFSKDVNVDTSNYN 180 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 + + + + D + + T ++ Sbjct: 181 AAWINWAEWLGEPPV-----LDPASSYGGSKPSNWDDIVEDSNCPFKKNSHGFTCADRPA 235 Query: 268 EPSWGTEKVRQY 279 S ++ Sbjct: 236 TLSGAKSDTKRI 247 Score = 88.0 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 57/191 (29%), Gaps = 21/191 (10%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + + + + + + T+ + + Sbjct: 358 ISNTAPSTGSDGTPSTKFYAEQWKDCLPATITAMSSQWSTLKNQIDAMTPSGNTNQSIGL 417 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK------ 352 +Q L++ K +Q +I+ L+DG N + Sbjct: 418 AWGWQSLSTT------NGPIAAPGKESGYVYQDYIVLLSDGLNTQNRWYSCPPSGPCPTI 471 Query: 353 ------ICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +C K K++ + I TI +N ++L+ C S + +A FQN Sbjct: 472 DARQALLCQKVKDSGVTIFTIQVNVGSKDPLSQVLQNCASD-GNFQMITSATETADAFQN 530 Query: 405 ISQLMVHRKYS 415 I + + + Sbjct: 531 ILTQISQLRLA 541 >gi|146340337|ref|YP_001205385.1| hypothetical protein BRADO3364 [Bradyrhizobium sp. ORS278] gi|146193143|emb|CAL77155.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 470 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 47/457 (10%), Positives = 123/457 (26%), Gaps = 71/457 (15%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + AN ++IFAL ++ + L+G + K+ +++A +AA +A + ++ Sbjct: 8 RFARDRCANVAVIFALMMVPTIYLLGMALDYTQALRKQGQLDAAADAAAIAAVRPAMLSV 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + + + + S + ++ + S + + Sbjct: 68 TDTSVVKNTAAAVFATKAAMNGLTAVP--TPDITVTDSGLQRTIQVSYVARSINNFPSVL 125 Query: 133 ANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + ++ L + I + I A + G + Sbjct: 126 GSPSWQVKGSATAQASSAPNMNFYLLLDDSPSMAIAATQTDIDNLIAATSKQPGGSKNCG 185 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSH--- 247 V + + V++ I + K T ++ + NA+ L Sbjct: 186 FACHEV----HPNLDSGASSSTVDNLSIARSKSITLRIDLVVNAVKQLLVGPYTCPQAGI 241 Query: 248 -----------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP---TD 293 + + + + + + + ++ + T Sbjct: 242 SGGVMQCMSAINNTTYKAAIYTFDYNLNTIQTLTSPSSAGTKISNIQLLTVDHQNCVTTA 301 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----- 348 I + + G Q+ + +TDG + ++ Sbjct: 302 ICNT-DFGTDISGALGGVNAIMPDPGTGTNQSGDTPQEVVFLVTDGVEDKLIASSSGCDP 360 Query: 349 ------------------NTIKICDKAKENFIKI---VTISINASPNG------------ 375 +CD K+ I+I T + + + Sbjct: 361 KATYPLPAAGSQVRCQQPLNTAVCDTIKKRNIRIAILYTEYLQLTTDNWYNSRIAQFNSP 420 Query: 376 -------QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L+ C SSP++ V + + Sbjct: 421 SSLSGTIAQRLQACASSPDFFATVQTGGDISDALTKL 457 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 43/494 (8%), Positives = 120/494 (24%), Gaps = 111/494 (22%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 +I L ++ + + + + + + + + AA +A + +R Sbjct: 16 GVAAIWMGLLLVPIMGFTFWAVEGTRYVQESSRLRDSAEAAAMAVTIEDQPGAARALATK 75 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RLD 138 + + + + + + Y + +S S + + + + Sbjct: 76 YVENYVRDIKSTNLSAQRFYQAEDKGTGALEYIQYTVNARTTHDSWFASSFIPSFDKQQE 135 Query: 139 SSNNTI-----FYNMDVMTSYDYRLQFIEHLLNQRYNQ--KIVSFIPALLRIEMGE---- 187 + ++ Y D + F + ++ + K + + + Sbjct: 136 LAGRSLARKYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKIDDLKTAIDQISNNILCK 195 Query: 188 ----------------------------RPIFLIELVVD------------------LSG 201 + + V +G Sbjct: 196 STRQEYVDGEWKDVCDEPGDDTTSDKLLNRVGFVPFNVRTREIVAGNRANATSQLSYKNG 255 Query: 202 SMHC---------AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 N +S ++ + + I + Sbjct: 256 YKAYLSPYSYNDVDWNYWRTYTSSEVNNCAYWQSYCQNPRAENHNYAKRIRDVLEADRYA 315 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVR--------------------------------QYV 280 + Y ++ + Sbjct: 316 VADVYNYVDLPTSVSTMFTDKSGLKANFYGVSGTRLFNAHGSSYSSQFHNIQLSNKLSDL 375 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + + + Q+L S + K K ++ L+DG+ Sbjct: 376 DSIKSMWADGGTAAFQGILRGSQVLHEGDPNSSDQEEQQAYNKK-----IKMLLILSDGQ 430 Query: 341 ---NNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +N + +CDKA++ + I I I+ + Q + CV P ++++ Sbjct: 431 ESPDNGILKGLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQDCVVDPRE--DIIDV 488 Query: 396 DSLIHVFQNISQLM 409 +L + + I +L+ Sbjct: 489 SNLDELIEKIEELI 502 >gi|222612967|gb|EEE51099.1| hypothetical protein OsJ_31818 [Oryza sativa Japonica Group] Length = 600 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 71/238 (29%), Gaps = 32/238 (13%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R + V+D+SGSM K+A LK A+ +D++ Sbjct: 153 AQRAPLDLVTVLDVSGSM--------------------VGNKLALLKQAMGFVIDNLGPG 192 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQI 304 + +I +++ + + S T+ + + R + SL T+ A+++A ++ Sbjct: 193 D------RLCVISFSSGASRLMRLSRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKV 246 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L R+ + + + D + + Sbjct: 247 LDDRLYRNAVESVILLSDGQDTYTVPPRGGYDRDANYDALVPPSLVRADAGGGGGRAPPV 306 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 T + + + + N ++ F I L+ + V Sbjct: 307 HTFGFGKDHDAAAMHTIAEVTGGTFSFIENEAAIQDGFAQCIGGLLSVAVQELRLDVA 364 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 78/246 (31%), Gaps = 39/246 (15%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + +I R + V+D+SGSM TK+A LK A Sbjct: 236 NANSLNNQISRYPRAPVDLVTVLDISGSMA--------------------GTKLALLKRA 275 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTD 293 + + ++ + +I +++ + + ++ RQ + + T+ Sbjct: 276 MGFVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTN 329 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 +++ +++ + ++ + + + + N ++ Sbjct: 330 IAEGLRKGVKVMEDRRDKNPVASIILLSDGRDTY-------TMNQADPNYKLLLPLSMHG 382 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 C+ K I + + + + + +S + + + ++Q + Sbjct: 383 CES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA---LAQCIGGL- 437 Query: 414 YSVILK 419 SV ++ Sbjct: 438 LSVAVQ 443 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 78/246 (31%), Gaps = 39/246 (15%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + +I R + V+D+SGSM TK+A LK A Sbjct: 233 NANSLNNQISRYPRAPVDLVTVLDISGSMA--------------------GTKLALLKRA 272 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTD 293 + + ++ + +I +++ + + ++ RQ + + T+ Sbjct: 273 MGFVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTN 326 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 +++ +++ + ++ + + + + N ++ Sbjct: 327 IAEGLRKGVKVMEDRRDKNPVASIILLSDGRDTY-------TMNQADPNYKLLLPLSMHG 379 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 C+ K I + + + + + +S + + + ++Q + Sbjct: 380 CES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA---LAQCIGGL- 434 Query: 414 YSVILK 419 SV ++ Sbjct: 435 LSVAVQ 440 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 83/242 (34%), Gaps = 33/242 (13%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + E+ + +VVDLSGSM Q K +++ A K L F + Sbjct: 95 VREQLGRDVMVVVDLSGSMAEQDF---------TSKQGDKISRLDATKEVLADFAKT--- 142 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GLI + + + + + + ++ + T A+ A ++ Sbjct: 143 ----RKGDRLGLILFGDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKV 198 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q + + +K +I LTDG + S V I AK ++I Sbjct: 199 FEQSA--------AVQDSSVDANVKEKVVIVLTDGNDTG--SFVEPIDAAKVAKAKGVRI 248 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I++ + + + + S + +N D L + I +L S Sbjct: 249 HVIAMGDPQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGELEPQLYESTT 308 Query: 418 LK 419 + Sbjct: 309 YR 310 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 78/235 (33%), Gaps = 25/235 (10%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM P +K T++ A+K L F ++ Sbjct: 95 DVMVVVDLSGSMAEKDFLTPY---------GEKHTRLDAVKQVLAQFSQH-------RDG 138 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + E + + + T A+ A ++ Sbjct: 139 DRLGLILFGDAAYLQAPFTADHETWLALLDETQVGMAGQSTHLGDAIGLAIKVFNDQADA 198 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + + +K +I LTDG + + I A I+I I++ Sbjct: 199 AKQAATQQGSAVVTRPVKEKVVIVLTDGNDTDSLVP--PIDAAKVAASRDIRIHMIAMGD 256 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + + + + ++ L V++ I +L + S + Sbjct: 257 PRTVGEQALDMEVIEQVASLTGGQSFQALSPAELTRVYKKIGELEPQQYDSTTYR 311 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 64/201 (31%), Gaps = 20/201 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ- 278 TK+A LK A+ + ++ + +I +++ +N TE +Q Sbjct: 260 SGSMAGTKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQE 313 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ +K+ ++L + ++ ++ + T Sbjct: 314 ALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLLSDGQDTY---------TM 364 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 N + + + N I + A + + +S + + + Sbjct: 365 TSPNGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVI 424 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+++ Sbjct: 425 QDAF---AQCIGGL-LSVVVQ 441 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 64/201 (31%), Gaps = 20/201 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ- 278 TK+A LK A+ + ++ + +I +++ +N TE +Q Sbjct: 260 SGSMAGTKLALLKRAMGFVIQNLGPFD------RLSVISFSSTARRNFPLRLMTETGKQE 313 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ +K+ ++L + ++ ++ + T Sbjct: 314 ALQAVNSLVSNGGTNIAEGLKKGARVLIDRRFKNPVSSIVLLSDGQDTY---------TM 364 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 N + + + N I + A + + +S + + + Sbjct: 365 TSPNGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVI 424 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+++ Sbjct: 425 QDAF---AQCIGGL-LSVVVQ 441 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 77/245 (31%), Gaps = 58/245 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + ++ A+K+ + F+ ++ Sbjct: 104 DLMLAVDLSGSMEKTDMP----------LRGVEVDRLTAVKSVVKNFIQK-------RQG 146 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 MGL+ + ++ ++ V + + T A+ A + L Sbjct: 147 DRMGLVVFGSQAFLQSPLTYDLNTVETLLNETEIGMAGNNTAIGDAIGIALKHL------ 200 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG N + +A+E +KI TI I Sbjct: 201 ------------HQNSEKKAVLILLTDGSNTAGAV--QPLDAAKQAQEMGLKIYTIGIGQ 246 Query: 372 SP---------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ------LMV 410 + + L K + + + + L ++Q I Q + Sbjct: 247 NQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFMAKDTNQLNEIYQLIDQLEASQHDIN 306 Query: 411 HRKYS 415 + + Sbjct: 307 NYRLR 311 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 75/226 (33%), Gaps = 53/226 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM A D+ +++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMDYADMQWD----------DEPISRLELVKRLLGDFIE-------GRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRTWLDEALIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RQRPAQSRVLVLITDGANNGGEID--PMVAAQLAADEGVRIYTIGIGA 233 Query: 372 SP----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L ++ ++ N L + Sbjct: 234 DPQQSGAFGSFGFSALDLDETSLRAISDTTGGEYFRARNQAELEQI 279 >gi|22758319|gb|AAN05523.1| unknown protein [Oryza sativa Japonica Group] Length = 731 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 87/293 (29%), Gaps = 32/293 (10%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + +DS V + +E + ++ + R Sbjct: 200 VVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAPSSPATVTSRAP 259 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+S SM TK+A LK A+ + ++ Sbjct: 260 IDLVTVLDVSWSMA--------------------GTKLALLKRAMSFVIQALGPGD---- 295 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ +++ + TE RQ + R + T+ A+++A +++ + Sbjct: 296 --RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRR 353 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R+ + + G+ ++ V + + +++ Sbjct: 354 ERNPVCSIVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGF 413 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 A + + S + A S+ F I L+ + SV Sbjct: 414 GADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLSVVAQELRLSVE 466 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 29/277 (10%), Positives = 70/277 (25%), Gaps = 20/277 (7%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 K S+ NF+++ L + L + F V K+++++A ++A LA + Sbjct: 2 LLNKFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASSHLGD 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 +++R + + + ++ ++ Sbjct: 62 LDITRNDAF---NRYFQANIVGHGELDNAQATLTVDKGVNFVKTKAVASADV----HLNF 114 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-----RYNQKIVSFIPALLRIE 184 + + + + L + R K + + Sbjct: 115 AFLFGDSKHIVVDASAVESNNQLEVVLVLDNTGSMAGARMTALRTATKSLLDTLEAAKSP 174 Query: 185 MGERPIFLIELVVDLS-----GSMHCAMNSDPEDVN--SAPICQDKKRTKMAALKNALLL 237 + + V ++ N + P+ K+ MA K Sbjct: 175 TRKIRASPVPFVTAVNVNGDGFDPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKKT 234 Query: 238 -FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + D+ + K V Y T Sbjct: 235 GWADAGWNGTGWKGCVEARPGAYNISDTPPDPAKPDT 271 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 61/188 (32%), Gaps = 42/188 (22%) Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + +K+R+ ++ + T+ + + ++L+ + G Sbjct: 373 CPTPVVPLTDDFDKLRKAASQMTE-WNGSGTNVSEGLSWGMRVLSP-------AAPYTDG 424 Query: 322 VKIPSLPFQKFIIFLTDGENNNF---------------------------------KSNV 348 + K ++ LTDGEN + + Sbjct: 425 APWKTPGISKIVLLLTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAARNVDG 484 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T +C + K ++I T+ + + R L C S P +Y V + L +VF I+ Sbjct: 485 WTKNVCTQLKNEGVQIYTMVLQSDTAANRALYSACASDPSNYYAVNDPTKLPNVFLQIAN 544 Query: 408 LMVHRKYS 415 + + Sbjct: 545 NFTKLQLT 552 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 64/220 (29%), Gaps = 38/220 (17%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 S + K ++ A+K L FL ++ +GLI + Sbjct: 102 SGSMMTKDFVNKEGKAIDRLEAVKMVLRDFLKE-------RKGEKIGLILFGNAAFVQAP 154 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + + + + T ++ A ++ Sbjct: 155 FTQDLDALEHLLDSLRVGMAGPQTAMGDSIGLAVKMFRESNVTD---------------- 198 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-------QRLLKT 381 + +I ++DG++ K + A +N + + TI I N L + Sbjct: 199 --RMLIVMSDGDDTGSKVP--PKTSAELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLKEI 254 Query: 382 CVSSPEYHYNVVNADSLIHVFQNIS----QLMVHRKYSVI 417 + Y N D L +++ I + + Y + Sbjct: 255 AAITGGKFYYAWNLDDLQDIYKQIDKLKPKEIKVLSYRPV 294 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 49/422 (11%), Positives = 124/422 (29%), Gaps = 45/422 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM----V 69 + K + +I FAL L I I + + K++ + + L + Sbjct: 12 LKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVA 71 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNH---IKESLSGYSAVFYNTEIQNIVNSSR 126 +S+ R + ++ K IK ++ A + Sbjct: 72 DGMSKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHES 131 Query: 127 ISMTHMANNRLDSSNNTIFYNM--DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + +D + ++ + + + + I Sbjct: 132 YMTHAMGFDNIDYTADSESTISFGQGKYEFIFLVDVSPSMG-----------------IG 174 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R +++ + + H S + + R ++ +K+AL + ++ Sbjct: 175 ASNRDRQIMQRAIGCQFACHEPWYS-----SVSRAKSAGARLRIDVVKDALKSLVTQLEE 229 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS---LILKPTDST---PAM 298 + V D+ GL ++ + + G K ++ + L T+ Sbjct: 230 ATEV--DLRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTNFHGVFSDF 287 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L GV +L N + + CD+ K Sbjct: 288 NGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFK 347 Query: 359 ENFI-KIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + T+ + ++ C +S ++ Y+ +A + F+++ + ++ Sbjct: 348 KGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKAFKDLFEALLKS 407 Query: 413 KY 414 Y Sbjct: 408 VY 409 >gi|115482404|ref|NP_001064795.1| Os10g0464500 [Oryza sativa Japonica Group] gi|110289213|gb|AAP54178.2| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113639404|dbj|BAF26709.1| Os10g0464500 [Oryza sativa Japonica Group] Length = 719 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 87/293 (29%), Gaps = 32/293 (10%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + +DS V + +E + ++ + R Sbjct: 200 VVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAPSSPATVTSRAP 259 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+S SM TK+A LK A+ + ++ Sbjct: 260 IDLVTVLDVSWSMA--------------------GTKLALLKRAMSFVIQALGPGD---- 295 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ +++ + TE RQ + R + T+ A+++A +++ + Sbjct: 296 --RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRR 353 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R+ + + G+ ++ V + + +++ Sbjct: 354 ERNPVCSIVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGF 413 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 A + + S + A S+ F I L+ + SV Sbjct: 414 GADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLSVVAQELRLSVE 466 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 70/226 (30%), Gaps = 50/226 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + V+D SGSM + K A F+D++ Sbjct: 493 QQVNYIDLVFVLDSSGSMSWNDPNGYR-------------------KIAAKSFVDALIQG 533 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + + + V+ + R T+ ++ A Q L Sbjct: 534 DRA------AVVDFDNFGYLLQPLTTDFQAVKNAIDRIDSW---GGTNIAEGIRIANQQL 584 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S K II LTDGE + +AK N I I Sbjct: 585 ISRSSEDRI----------------KVIILLTDGEGYYDNNLT------TEAKNNGITIY 622 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI + S + L + ++ V +A L VF+ I++++ Sbjct: 623 TIGLGTSVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKNDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRRTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A + +KI T+ I + Sbjct: 188 ------------RMRPANSRVLVLVTDGANNGGQID--PITAARLAADEGVKIYTVGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + L + Sbjct: 234 DPDKNALQGVLGLNPSLDLDEPTLKDIASLSGGQYFRARDGAELDKI 280 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 78/231 (33%), Gaps = 54/231 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM +D+ ++++ +K+ L FL +E Sbjct: 91 DLLVAVDVSGSMDFPDMHW----------RDEDVSRLSLVKHLLGDFLQQ-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + + + A E +KI I I A Sbjct: 188 ------------RLRPAQSRVLILITDGANNAGQID--PLTAARLAAEEGVKIYPIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 P + L ++ ++ + + L + Q + Sbjct: 234 DPEQTGSLGILGVNPSLDLDEPALKAIAAATGGQYFRARDGEELQAIKQTL 284 >gi|125532269|gb|EAY78834.1| hypothetical protein OsI_33939 [Oryza sativa Indica Group] gi|125575070|gb|EAZ16354.1| hypothetical protein OsJ_31816 [Oryza sativa Japonica Group] Length = 654 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 37/295 (12%), Positives = 88/295 (29%), Gaps = 36/295 (12%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + +DS V + +E + ++ + R Sbjct: 123 VVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAPSSPATVTSRAP 182 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + V+D+S SM TK+A LK A+ + ++ Sbjct: 183 IDLVTVLDVSWSMA--------------------GTKLALLKRAMSFVIQALGPGD---- 218 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ +++ + TE RQ + R + T+ A+++A +++ + Sbjct: 219 --RLSVVTFSSSARRLFPLRKMTESGRQRALQRVSSLVADGGTNIADALRKAARVMEDRR 276 Query: 310 KRSFFTNFFRQGVKIPSL--PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +R+ + + P + D + + + A+ +++ Sbjct: 277 ERNPVCSIVLLSDGRDTYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSAR--HVQVHAF 334 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 A + + S + A S+ F I L+ + SV Sbjct: 335 GFGADHDSPAMHSIAEMSGGTFSFIDAAGSIQDAFAQCIGGLLSVVAQELRLSVE 389 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 79/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM ++ + FLD+ Sbjct: 88 PQSGRDLIMAIDLSGSMAIQDMKKSNGQME---------SRFDLVMRVANQFLDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 134 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 192 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 232 IGLGGGQMIVETTFGQRLINTSEDLDTTVLEKIAEMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|153833207|ref|ZP_01985874.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148870478|gb|EDL69393.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 515 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 45/501 (8%), Positives = 116/501 (23%), Gaps = 111/501 (22%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + +K ++ L ++ + + + + + + + + AA +A + + Sbjct: 9 RSVHRQKGVAAVWMGLLLVPIMGFTFWAVEGTRYVQETSRLRDSAEAAAMAVTIEDQAGS 68 Query: 73 SRLGDRFESISNHAKRALIDDAKR---FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + S ++ + + + L + +S Sbjct: 69 ASTLAAKYVESYVRDIKSMNVSAQRYYRAADDRAGVLEYIQYTVNAKTTHDSWFASSFIP 128 Query: 130 THMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHLL--NQRYNQKIVSFIPALLRIE 184 + L + Y D + F + + + + Sbjct: 129 SFDEQQDLAGRSLARKYPAYLADNNIDIVFVSDFSGSMREQWGFNRHIKIDDLKTAISQI 188 Query: 185 MGE--------------------------------RPIFLIELVVDLSGSMHCAMNSDPE 212 + + V + + Sbjct: 189 SNNILCTSTRQEYVDGEWKDVCDEPGEDTTSDKLLNRVGFVPYNVRTREIIGWNQANTTS 248 Query: 213 --DVNSAPICQDKKRTKMAALKNALLLFLDS-------------------------IDLL 245 + + T + F + I + Sbjct: 249 QLNYTNGYNTHLSPYTYNDIDWDYWSRFSQTQVFNCANNQIFCPRPGWNDQQYARRIADV 308 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ--------------------------Y 279 + + + Y R Sbjct: 309 IRLDQYQVADVYNYVDLPSSVSTMFTDKSARRSNYYGVNQSQLFNGHGRNDSRQFYNLRL 368 Query: 280 VTRDMDSLI------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + D T + + + QIL S + K K + Sbjct: 369 TNKLSDLNPINAMWADGGTAAFQGILRGAQILHEGDPNSSDQEEQQAYNKK-----IKML 423 Query: 334 IFLTDGE---NNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEY 388 + L+DG+ NN + +CDKA++ + I I I+ + Q + CV P Sbjct: 424 LILSDGQESPNNGILKGLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQDCVVDPRE 483 Query: 389 HYNVVNADSLIHVFQNISQLM 409 ++++ +L + + I +L+ Sbjct: 484 --DIIDVSNLDELIEKIEELI 502 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 36/281 (12%), Positives = 72/281 (25%), Gaps = 60/281 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + S + D E + T + + Sbjct: 296 GTSGGASFPVGNQRSTAYSYNSYIDDETGPDCRNS-NTYNTALGQACKYVN-----PKDP 349 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + + + + T ++ + T+ ++ L Sbjct: 350 ASGSPLGIPNGPNFGCTTQPLQRLTNNTNALKTLINNMA---PSGSTNIHEGFMWGWRTL 406 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS------------------- 346 + + + + + K II +TDG N+ + Sbjct: 407 SPNSVFADGQPYASSANSSNATNINKIIILMTDGTNSWGTNSSAPTGSLYFAAGYFRNAN 466 Query: 347 ---------------------------NVNTIKICDKAKENFIKIVTISINASPN----- 374 + T + C K I I TI + + Sbjct: 467 GTTPNPRLTTAYQNTNIADGNTARKALDALTAEACANTKAVNISIYTIGFSVPTDPIDSA 526 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 GQ LL+ C SSP+ Y ++D LI F++I + + + Sbjct: 527 GQTLLRNCASSPDQFYLANSSDDLIKAFKSIQASIGALRLT 567 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 30/324 (9%), Positives = 84/324 (25%), Gaps = 28/324 (8%) Query: 21 NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-AILAGASKMVSNLSRLGDRF 79 N +++FA + +++ G I + +++A +A A+L + + ++ + L Sbjct: 26 NVALLFAFLSVPMVMIGGAAIDYGFATRLETKLQTATDATALLLCQTPLTTSEAELNTLA 85 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 ++ A A ++ ++ + S+ R++ Sbjct: 86 QTTMTGAMGA----------ANLVVDRLAITSSPRKITLTAHKQSTTFFGGLTGTQRINP 135 Query: 140 SNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR----------IEMGER 188 + + L + Q + + Sbjct: 136 GAVSQCATPLPKTFEIALVLDNTGSMAASSGGQSKLRAVQTAATDFVNYVYTSPAFSSAT 195 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--RTKMAALKNALLLFLDSIDLLS 246 + ++ ++ + D N ++ A KN +F Sbjct: 196 KVAIVPFAAAVAVDPSAYRYASWIDQNGLSSYHWTNIVQSNPAPFKNRFEVFAKLQAANR 255 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + + Y V+ + + YV T + Q T Sbjct: 256 NWGWAGCLESLPYPFNVQDGAPTTKDSF----YVPLLAPDEPGDGTSGGASFPVGNQRST 311 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQ 330 + S+ + + + Sbjct: 312 AYSYNSYIDDETGPDCRNSNTYNT 335 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 37/435 (8%), Positives = 106/435 (24%), Gaps = 48/435 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNL 72 I ++ IIF + ++++ F + + H + A A LA + K Sbjct: 3 IKKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLALIASPKEGDEK 62 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 ++ + A+ F + + + +V++ + + Sbjct: 63 NQEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWI 122 Query: 133 ANNRLDSS--------NNTIFYN-MDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALL 181 + N + + + + + + + + + Sbjct: 123 SYNDGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKTKYDVVADTI 182 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + F + ++ RT A + N Sbjct: 183 NRIVDDLREFKTDRKSRVAVIGFHHTAVKQVGRQRTAFDYSSYRTPSATVNNMFTA--PK 240 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQ 300 I + + ++I + + T+S + Sbjct: 241 IHSRNDSSNI----------KTFEDIPLTEDYDAFLTKFNSSNYYASRYGLTESWQGIIG 290 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD--GENNNFKSNVNTIKICDKAK 358 A Q+ + F + ++ N + + + +C+K K Sbjct: 291 AAQMAEQATDLNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRLVKAGLCEKLK 350 Query: 359 --------------------ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + I +N + + C ++ D Sbjct: 351 TRISQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDCFGHDNIYHAKEGNDVY 410 Query: 399 IHVFQNISQLMVHRK 413 ++ I++ K Sbjct: 411 KYILNLINEETGRLK 425 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 38/340 (11%), Positives = 87/340 (25%), Gaps = 26/340 (7%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL 90 M LL +G + + + SM+S +AA++A + + + K+ + Sbjct: 1 MPMLLAVGASFDYIRAYNVRQSMQSDLDAALIAAVKNVDAGDTDA----------LKQKV 50 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 D ++ + M N + S + Sbjct: 51 SDWFHAQTESSYSLGDIEIDTTNHRITATASGTVPTTLMKLANINTVPVSVASAVKGPAS 110 Query: 151 M-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + + +L + I + + Sbjct: 111 SYLNVYIVIDKSPSMLL-------------AATTAGQQAMYNGIGCQFACHTGDSHTIGT 157 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 N A + K + + +A+ +D I E + +GL + + + P Sbjct: 158 ATYSNNYAYSTEKKIKLRADVAVDAVHEVIDMISASDTNHERIKVGLYSLGDTITEVLAP 217 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTP-AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + T + V D+ S ++ + + S + + + Sbjct: 218 TLDTTAAGKRVDSDLTSATSTTYTYFDVSLAALKNKVGTGGDGSSSATPLKLVLLLTDGV 277 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + T G K CD K+ + + Sbjct: 278 QSQREWV-TSGAKYQPKVAPLNPAWCDYIKKQSATMAVLY 316 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 79/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM K ++ + FLD+ Sbjct: 88 PQSGRDLMMAIDLSGSMAIQDMQKSN---------GKMESRFDLVMRVANEFLDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + T ++ V++ + +L T A+ A + L Sbjct: 134 --RQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKL- 190 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K ++ LTDGENN+ + ++ + AK+ IKI T Sbjct: 191 -----------------KRYPGDSKALVLLTDGENNSGA--LQPLQAAELAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + + L K + + N+ L V+++I Q Sbjct: 232 IGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSADLKQVYESIDQ 290 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 73/226 (32%), Gaps = 53/226 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ +++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMAYEDMHWD----------EQPISRLELVKRLLGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ M + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRTWLDEAMIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A E ++I I I A Sbjct: 188 ------------RQRPAQSRVLVLITDGANNGGEID--PMVAAQLAAEEGVRIYAIGIGA 233 Query: 372 SP----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L ++ ++ N L + Sbjct: 234 DPRQSGVLGAFGFSALDLDETSLRAIAEATGGEYFRARNQAELTQI 279 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 75/246 (30%), Gaps = 55/246 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + L VDLSGSM + ++ A+K F++ Sbjct: 85 QTVSGRDLMLAVDLSGSMDIEDFVVD----------GEVSNRLEAVKRVASAFIER---- 130 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +GLI + + + ++ + V + + L T A+ A + L Sbjct: 131 ---RSGDRIGLILFGEQAYLQVPLTFDRKTVEKLLDEAAIGLAGDKTAIGDAIGLAIKRL 187 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + Q+ +I L+DG N + ++ + A +KI Sbjct: 188 ------------------RDNPADQRVLILLSDGANTAGQV--QPLQAAELAAREGLKIY 227 Query: 366 TISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI + A + + + ++ N + L ++ + + Sbjct: 228 TIGVGADEMIVRDFFGTRRVNPSEDLDEAAMTAIAEKTGGRYFRARNTEELDRIYALLDR 287 Query: 408 LMVHRK 413 L + Sbjct: 288 LEPTAR 293 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 26/246 (10%), Positives = 78/246 (31%), Gaps = 39/246 (15%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + +I R + V+D+SGSM TK+A LK A Sbjct: 236 NANSLNNQISRYPRAPVDLVTVLDISGSMA--------------------GTKLALLKRA 275 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTD 293 + + ++ + +I +++ + + ++ RQ + + T+ Sbjct: 276 MGFVIQNLGSND------RLSVIAFSSTARRLFPLTKMSDAGRQRALQAVNSVVANGGTN 329 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 +++ +++ + ++ + + + + N ++ Sbjct: 330 IAEGLRKGVKVMEDQRDKNPVASIILLSDGRATY-------TMNQADPNYKLLLPLSMHG 382 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 C+ K I + + + + + +S + + + ++Q + Sbjct: 383 CES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA---LAQCIGGL- 437 Query: 414 YSVILK 419 SV ++ Sbjct: 438 LSVAVQ 443 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 35/302 (11%), Positives = 93/302 (30%), Gaps = 23/302 (7%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ E ++ L ++S L++ G + + + ++ ++S + A+LA A Sbjct: 57 FIRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAA---- 112 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 NL++ + + I++ F + + + +++IV+ + Sbjct: 113 -NLNQEREAADVITDFFTAEG------FGGSIVGTPSIQKNRNGSTIRLESIVDVDTFYL 165 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV------SFIPALLRI 183 + + L + N L + ++ + + Sbjct: 166 RLVGIDTLSAPANATAIEGTGNVEVSLVLDISGSMGSRMTGDAYLYDSDGEIRLDPDGNP 225 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KM-AALKNALLLFLDS 241 R + ++ + + D + + ++ L AL DS Sbjct: 226 LTERRTEDRMFFLIQEANKFIGDLLKDEYRDRVSINLVAYSQHVRLGDDLYTALNTTPDS 285 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID ++ T +W + + D+ ++ P DS + A Sbjct: 286 IDEDDNLGSSYG----SITDGYTAPFTYTWVNAEGDEVEEGTPDATLVGPFDSPIDLTWA 341 Query: 302 YQ 303 Sbjct: 342 GD 343 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 18/203 (8%), Positives = 50/203 (24%), Gaps = 18/203 (8%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSL 287 + L + + + + + + + + G + V Sbjct: 433 SLLDPTMRDLMAGLPTIDEAFRGTRPADYSDASTTKFVVIMTDGQTVSSERLVREKKVDD 492 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 ++ + +SF + + + + Sbjct: 493 PSYNGLDMYDTYAEWEGFRDNGGQSFVDDDGDPTTTWEYQQVNGRWVPSGNAHETIGTVD 552 Query: 348 VN---TIKICDKAKENF------IKIVTISINASPNGQRLLKTCVSSPEYHYN---VVNA 395 +C+ A+ + TIS+ + TC + + + Sbjct: 553 ELNAKLTSLCNLARTKNGDTNDRYTVFTISMGL---VNDTMTTCATDTGDAFTSTITNDP 609 Query: 396 DS--LIHVFQNISQLMVHRKYSV 416 D L +F+ IS + + S+ Sbjct: 610 DEPGLQEIFKTISDQITALRLSL 632 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 84/231 (36%), Gaps = 38/231 (16%) Query: 192 LIELVVDLSGSMHCAMNS--------DPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +VVDLSGSM + D +S D+ +++ A+K L F + Sbjct: 95 DLMVVVDLSGSMDTKDFTLHVKQQTADGIANSSGTEISDEYISRLDAVKRVLHEFAEQ-- 152 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ +GLI + + + + ++ + T A+ A + Sbjct: 153 -----RQGDRLGLILFGDAAYLQAPFTADLASWLRLLDESRVAMAGQSTHVGDALGLAIK 207 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +++SD+ +S K ++ LTDG + + ++ A + I+ Sbjct: 208 VMSSDEIKSSQK--------------NKVVLLLTDGNDTDSSVP--PLEAAKIAAKKGIR 251 Query: 364 IVTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I+I + + + + + ++ L V+Q IS+ Sbjct: 252 VHVIAIGDPQTVGEQAMDMEVIEGVAALTGGKAFKAISTQELNKVYQTISK 302 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 79/221 (35%), Gaps = 42/221 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + L +D SGSM ++ +++K +K+ F Sbjct: 73 DRKGRDLVLALDASGSMEESLY--------------DEKSKFEVVKSMAQNFF------- 111 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 H + D +G++ + + ++ T+ + + S+ T + Q + L Sbjct: 112 HKRFDDNIGIVIFGSFAYIAAPLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQGIKALQ 171 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 QK +I +TDG +N+ + + +KAK+ IKI T Sbjct: 172 ------------------ADTAKQKVLILITDGHHNSGSIS--PRQAVEKAKKLGIKIYT 211 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L + S + + + L +F +++ Sbjct: 212 IGLG-DADKHLLEQIAKESGGKFFYAKSEEDLQSIFSELNK 251 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 29/293 (9%), Positives = 84/293 (28%), Gaps = 42/293 (14%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-----P 189 + ++ +T ++ Y + + + + + G Sbjct: 151 DSTPAAASTGAVVVNTHAEYPAVARAS-STDSFAVLVHVKAP-GMADTVAAGSDKPQPRA 208 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM K+A LK A+ +D++ Sbjct: 209 PLDLVTVLDVSGSM--------------------SGHKLALLKQAMRFVIDNLGPDD--- 245 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + ++ +++ + + ++ + V + T+ ++ A ++L Sbjct: 246 ---RLSVVSFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIAEGLRTAAKVLDER 302 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFII--FLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + R+ ++ + + + G D + I T Sbjct: 303 QHRNAVSSVVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADG--DWSAPIHT 360 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + ++ + N + F +Q + SV+++ Sbjct: 361 FGFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAF---AQCIGGL-LSVVVQ 409 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 79/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM K ++ + FLD+ Sbjct: 88 PQSGRDLMMAIDLSGSMAIQDMQKSN---------GKMESRFDLVMRVANEFLDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + T ++ V++ + +L T A+ A + L Sbjct: 134 --RQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKL- 190 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K ++ LTDGENN+ + ++ + AK+ IKI T Sbjct: 191 -----------------KRYPGDSKALVLLTDGENNSGA--LQPLQAAELAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + + L K + + N+ L V+++I Q Sbjct: 232 IGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESIDQ 290 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 79/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM K ++ + FLD+ Sbjct: 94 PQSGRDLMMAIDLSGSMAIQDMQKSN---------GKMESRFDLVMRVANEFLDT----- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + T ++ V++ + +L T A+ A + L Sbjct: 140 --RQGDRVGLILFGTWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKL- 196 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K ++ LTDGENN+ + ++ + AK+ IKI T Sbjct: 197 -----------------KRYPGDSKALVLLTDGENNSGA--LQPLQAAELAKQYHIKIYT 237 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + + L K + + N+ L V+++I Q Sbjct: 238 IGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESIDQ 296 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 30/298 (10%), Positives = 85/298 (28%), Gaps = 18/298 (6%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + R + +N + + + Y+ V Sbjct: 180 DDESLDLQPAFAERSSGNKIAAGHNAGKIVEIKTYPEVSAASRSNSYDNFTVLVHLKAGA 239 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 E P + + LS + ++ S + TK+A LK A+ + ++ Sbjct: 240 TVARENPRGNLASLPQLSQTPRAPVDLVTVLDISGSM----AGTKLALLKRAMGFVIQNL 295 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQA 301 + +I +++ + ++ RQ + + T+ +++ Sbjct: 296 GSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQAVNSLVANGGTNIAEGLRKG 349 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +++ ++++ + + + N +I D A Sbjct: 350 AKVMEERREKNPVASIILLSDGQDTYTVSGSSG--NQPQPNYRLLLPLSIHGGDNA-GFQ 406 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + A + + S + + F +Q + SV+++ Sbjct: 407 IPVHAFGFGADHDASSMHSISEISGGTFSFIETEAVIQDAF---AQCIGGL-LSVVVQ 460 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 78/226 (34%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P + L +DLS SM SDP Q + ++ A+K + F+D Sbjct: 90 TEPQRDLMLALDLSQSMDTRDFSDP---------QGNLQARVDAVKTVVADFVDR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+ + + + VR +T + + T A+ + ++ Sbjct: 136 --RPYDRLGLVAFGDAPYPLVPFTMDHATVRSMLTGALPGMAGPKTALGDALGLSIKLFQ 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + K ++ LTDG + K K + A +N I+I T Sbjct: 194 QSQAPD------------------KVLVVLTDGNDTASKMP--PDKAAEIASQNHIRIHT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I + + L K ++ ++ + +L ++ + Sbjct: 234 VGIGNPDAQGEEKLDTETLQKIATATGGRYFFGQDQQALAEIYTLL 279 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 29/291 (9%), Positives = 81/291 (27%), Gaps = 18/291 (6%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + R S+ N N + ++ Y+ V G+ Sbjct: 184 QPVPAERNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNI 243 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + S + ++ S + TK+A LK A+ + ++ Sbjct: 244 QRNMSNSPLNSHNPRAPVDLVTVLDISGSM----AGTKLALLKRAMGFVIQNLGSSD--- 296 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +I +++ + T+ RQ + + T+ +++ +++ Sbjct: 297 ---RLSVIAFSSTARRLFPLRRMTDAGRQQALQAVNSLVANGGTNIAEGLRKGAKVMEDR 353 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K+R+ ++ + + N ++ I + + Sbjct: 354 KERNPVSSIILLSDGQDTYTVNGSSG--NQPQPNYQLLLPLSMHGSQNT-GFQIPVHSFG 410 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + S + + F +Q + SV+++ Sbjct: 411 FGTDHDASSMHTISEISGGTFSFIETESVIQDAF---AQCIGGL-LSVVVQ 457 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 36/324 (11%), Positives = 79/324 (24%), Gaps = 35/324 (10%) Query: 1 MHLLSRF-RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 M + R + A E+ ++ FAL ++ +L LI + +K ++ A +A Sbjct: 3 MTMWKRLVTRLVTRLGADERGAIAVQFALLLIPIAVLTFGLIDISRASVQKRQLQDALDA 62 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A L A + + L ++ + F + L Sbjct: 63 ATLMAARSTATTNADLDTIGDA---ALATEMAGLGVTFGPGNSSFVLGDN---NTVVGTI 116 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ------------ 167 V I ++ S L + + Sbjct: 117 QNVVIKPIISNLWSSTNTPVSATATVMRSINHLEVALVLDNTGSMASSLGSGGSKITALI 176 Query: 168 --RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC----AMNSDPEDVNSAPICQ 221 + V A E I ++ + ++ + + + Sbjct: 177 TASKSLVDVLSAAAARATEADAVKISVVPFSMTVNIGSTYQTQTSWLTGTQPAAYGVDNF 236 Query: 222 DKKRTKMAALKNALLLF----------LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + L N L + D D + M + + + + Sbjct: 237 ATSQNRFTLLSNLGLTWGGCVESRPAPFDVTDDAPNPAIGASMFVPFFAPDEPDDNTVNI 296 Query: 272 GTEKVRQYVTRDMDSLILKPTDST 295 +Y S T++ Sbjct: 297 SASSSTKYRDARRYSTSYPITNTY 320 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 36/278 (12%), Positives = 72/278 (25%), Gaps = 45/278 (16%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + S S + V + T+ + Sbjct: 289 PDDNTVNISASSSTKYRDARRYSTSYPITNTYLTDTVTPTGTATNAWSTRSTVVAKYATS 348 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ L+ T + + + + T+ Sbjct: 349 NKATLLSLAKTGTAYGPNAGCGMTSLMRLTNVKA-KADRDTVKGKLDQMIASGNTNVAMG 407 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----- 352 + + L+ ++ K I+ LTDG+N N N Sbjct: 408 LIWGWHTLS----KNAPFADGVDPATTVGKRTTKVIVLLTDGDNTNDTYNNPNASIYTGY 463 Query: 353 -----------------------------------ICDKAKENFIKIVTISINASPNGQR 377 C AK ++I I + S + + Sbjct: 464 GYITQGRLLNASNSPLGATSTATNRRDAIDSREARACTNAKAAGVQIYAIGVGVSSHSRG 523 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L+ C S PE +Y+V +A L VF I+ + + + + Sbjct: 524 ILQDCASKPEMYYDVTDAAQLASVFNTIAGSIQNLRIT 561 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 80/242 (33%), Gaps = 58/242 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + VD+SGSM + Q+++ ++A +K+ + F++ Sbjct: 87 PTTGRDLLVAVDISGSMDTKDM----------VVQNQQIPRIAVVKHIVGDFIER----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+ + T ++ V+Q + T A+ + + L Sbjct: 132 --RVGDRLGLVLFGTSAYLQSPLTFDRTTVKQLLVESQIGFAGPNTAIGDAIGLSIKRL- 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +I LTDG+N + + + D AK++ +K+ T Sbjct: 189 -----------------RDRPAENRVVILLTDGQNTAGEVS--PRQAADLAKQSGVKVYT 229 Query: 367 ISINA---------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + A + L ++ ++ + L ++Q + Sbjct: 230 IGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTGGRYFRAHSPQELNQIYQLL 289 Query: 406 SQ 407 + Sbjct: 290 DE 291 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 77/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM + Q + ++++ +K+ L FL+ +E Sbjct: 91 DLLVAVDVSGSMDFPDMNW----------QGEDVSRLSLVKHLLGDFLE-------GREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + + + A E +KI I I A Sbjct: 188 ------------RQRPAQSRVLILVTDGANNAGQID--PLTAARLAAEEGVKIYPIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L ++ ++ + + L + Sbjct: 234 DPEQTGSLGILGVNPSLDLDEPALKAIAEATGGQYFRARDGEELQAI 280 >gi|50119743|ref|YP_048910.1| hypothetical protein ECA0798 [Pectobacterium atrosepticum SCRI1043] gi|49610269|emb|CAG73712.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 543 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 52/525 (9%), Positives = 125/525 (23%), Gaps = 119/525 (22%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + R + EK + +AL M+ L+ F++ ++ A +AA Sbjct: 16 GITLFLRERLSAFVHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAA 75 Query: 62 LAG---ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-GYSAVFYNTE 117 LA A+ S+ + ++ A S++ G ++ T Sbjct: 76 LAVGHQATINGEEYSQEDTNKLAYEYVKNNLGMNKALSEKLVASDVSVTEGRNSATRKTY 135 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL------------- 164 ++ S+ + + + + ++ T + + Sbjct: 136 TVTAAFETKPSLLSLGAKKQEVY--STSEVINRPTEIALVMPVTGDMSEGDIRSLKSVSR 193 Query: 165 ---------LNQRYNQKIVSFIPALLRIEMGER-----------------PIFLIELVVD 198 + + + +S +P + + + P Sbjct: 194 SFVERMLSSADGKRDNLWLSLVPYSQSVNVYDAEDANRIRRWSTPSALNPPELRSLFASG 253 Query: 199 LSGSMHCAMNSD----------------PEDVNSAPICQDKKRTKMAALKNA------LL 236 + S+ D + P Q + +N + Sbjct: 254 VVSSLADRRFPDRRANLLCMYRGLGREEDFFWDEPPAGQFSIYYRHDLPQNGSPGAPPIS 313 Query: 237 LFLDSIDLLSHVKE----DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + DL D + + + ++ Q + + + Sbjct: 314 WRGPNPDLYPWDNNSNAVDTRFIVADRGCPNAALMPLTNEESRLNQRIEQFSARFNV--- 370 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSN-- 347 + M A L+ + + S + + + QK I+ + + N F ++ Sbjct: 371 NYAIGMSWAGAALSPNMRGSDGWGDAKLPLDFNLDGNGDGQKVIVMMANTIGNWFDTDGY 430 Query: 348 ---------------------VNTIKICDKAKENFIKIVTISI--NASPN---------G 375 +C IK + I + Sbjct: 431 NFKRNQFSGSTGSDVAREFAQQRFRDLCSSFHARNIKFYFVGIRPGDPEDFGRNLFDTEA 490 Query: 376 QRLLKTCVSSPEYHYNVVN--------ADSLIHVFQNISQLMVHR 412 L C + + D LI I+ + Sbjct: 491 TPGLLVCTEGEKRMSFIDGSGFGAEGVEDQLIQRLDRIAGQIETE 535 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 61/201 (30%), Gaps = 14/201 (6%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ- 278 TK+A LK A+ + ++ + +I +++ + ++ RQ Sbjct: 274 SGSMAGTKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQH 327 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ +++ +++ ++++ + + Sbjct: 328 ALQAVNALVANGGTNIAEGLRKGAKVMEDRREKNPVASIILLSDGQDTYTVSGNGG--NQ 385 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + N +I D A I + A + + S + + Sbjct: 386 PQPNYQLLLPVSIHGGDNA-GFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAVI 444 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+++ Sbjct: 445 QDAF---AQCIGGL-LSVVVQ 461 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 72/226 (31%), Gaps = 50/226 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + V+D SGSM + K A F+D++ Sbjct: 493 QQINYIDLVFVLDSSGSMSWNDPNGYR-------------------KIAAKSFVDALIQG 533 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + + + V+ + R T+ ++ A Q L Sbjct: 534 DRA------AVVDFDDFGYLLQPLTTDFQAVKNAIDRIDSW---GGTNIAEGIRIANQQL 584 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + S K II LTDGE + +AK N I I Sbjct: 585 ----------------ISLSSEDRIKVIILLTDGEGYYDNNLT------TEAKNNGITIY 622 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI + S + L + ++ V +A L VF+ I++++ Sbjct: 623 TIGLGTSVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 79/229 (34%), Gaps = 55/229 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + ++ A K F++ + Sbjct: 90 DLMLAVDLSGSMEEQDF----------VINKRSVDRLTAAKMVAADFINR-------RVG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ + V + + L T A+ A + L S++ Sbjct: 133 DRVGLILFGTQAYLQTPLTFDRKTVMTLLNEAVIGLAGDNTAIGDAIGLAVKRLKSEQVN 192 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + + +K + A N +KI TI I A Sbjct: 193 ------------------SRVLVLMTDGANTAGEVS--PLKAAELAAANHLKIYTIGIGA 232 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + L+K S+ +Y N D L +++ Sbjct: 233 DEMIVRSFFGNRKINPSVDLDEKTLIKIAESTGGQYYRARNTDELNNIY 281 >gi|238796988|ref|ZP_04640492.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] gi|238719248|gb|EEQ11060.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] Length = 536 Score = 103 bits (256), Expect = 5e-20, Method: Composition-based stats. Identities = 48/513 (9%), Positives = 124/513 (24%), Gaps = 112/513 (21%) Query: 3 LLSRFRFYFKK------GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 + + R +K I ++K + F + + F+ L+ + + KK + A Sbjct: 8 FMLKNRSILRKYGLLSLFIKNDKGAILLPFVIFLPIFIGLLFLSFEISQFLQKKAKLSDA 67 Query: 57 NNAAILAGASKMVS--NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A LA + + ++ + ++++A L + I + Sbjct: 68 IEQATLALTVENNDIPDANQSQKNRDLVTHYATAYLPSEKFSTPIIDISNNKGHLLYKAE 127 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLL----NQR 168 T + + + + DS +T + + + ++ Sbjct: 128 TTMSYPAQFLANSPLANTKISIADSGAARKDVAVGPSELTDVVFVVDYSSSMINPFHGTY 187 Query: 169 YNQKIVSFIPAL-------LRIEMGERPIFLIELVVDL----------SGSMHCAMNSDP 211 ++ + + + + I I + H Sbjct: 188 GSRSKIDELRDIFYKLNGNILKNDNINTIGFIPFSWGIKKIVGTGQQSKTYCHFPYAPIK 247 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL--------------- 256 + Q L ++D+I+ + + Sbjct: 248 HKSTGDYLRQYTASNLKQFLAPENFNYVDNIEYGELSNRNNRVNYLKIKDEISHSKSNLA 307 Query: 257 ------IGYTTRVEKNIE-PSWGTEKVRQY------------------------------ 279 Y + + + + Sbjct: 308 NEFMIKTHYINKYYIISNILTSNIDYDKTINLMSKKYKSIDIPIDDVLDGNICLSSSTTY 367 Query: 280 -----------VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS----FFTNFFRQGVKI 324 +T + + + T + + A + ++ + Sbjct: 368 SLEFNKVSDESITESLATEPVGLTLVSSGILAANNLFKEANDKNKKLMIVLSDGEDSDNT 427 Query: 325 PSLPFQKFIIFLTDGENNN-------FKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 + +I + D N+ N+ +C+ N I++V I+I +P Sbjct: 428 TTFDKDGNLIAVDDYIKNDEDRKPFRITKNLIDKGMCEAIAANKIRMVFIAIGYTPVNDA 487 Query: 378 L------LKTCVSSPEYHYNVVNADSLIHVFQN 404 + CV + Y +A L Q Sbjct: 488 YSPTYIDWEKCV-GKDNFYLAKDAHELEADLQQ 519 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 103 bits (256), Expect = 5e-20, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 81/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM + +++A+K+ L F++ ++ Sbjct: 83 DLMLVVDLSGSMSQEDMQFN----------GEYIDRLSAVKHVLSDFIER-------RKG 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 126 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTF------ 179 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + ++ D AK+ I T+ + A Sbjct: 180 ------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAADIAKKYNATIYTVGVGA 225 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ ++ L ++ I+Q Sbjct: 226 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQ 279 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 79/237 (33%), Gaps = 55/237 (23%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L+VDLSGSM +D + ++ A+K L F++ Sbjct: 95 KYRDMMLLVDLSGSMQKEDMNDN----------GEYIDRLTAVKRVLSDFVEK------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +G++ + + V Q + + + L+ + T + + Sbjct: 138 RQGDRLGVVLFGDHAYLQTPLTADRRTVMQQINQAVIGLVGERTAIGDGIGLGTKTF--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + S Q+ +I L+DG N + ++ + A++ I T+ Sbjct: 195 ---------------VDSDAPQRVMILLSDGSNTAGV--LEPLEATEIAQKYNATIYTVG 237 Query: 369 INA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + Q L K + ++ +A L ++ I+Q Sbjct: 238 VGAGEMMVKDFFMTRKVNTASDLDEQTLTKIAEMTGGKYFRARDAKELEAIYDTINQ 294 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 29/291 (9%), Positives = 81/291 (27%), Gaps = 18/291 (6%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + R S+ N N + ++ Y+ V G+ Sbjct: 189 QPVPAERNSSNGNAAENNPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAVANTGQNI 248 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + S + ++ S + TK+A LK A+ + ++ Sbjct: 249 QRNMSNSPLNSHNPRAPVDLVTVLDISGSM----AGTKLALLKRAMGFVIQNLGSSD--- 301 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +I +++ + T+ RQ + + T+ +++ +++ Sbjct: 302 ---RLSVIAFSSTARRLFPLRRMTDAGRQQALQAVNSLVANGGTNIAEGLRKGAKVMEDR 358 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K+R+ ++ + + N ++ I + + Sbjct: 359 KERNPVSSIILLSDGQDTYTVNGSSG--NXPQPNYQLLLPLSMHGSQNT-GFQIPVHSFG 415 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + S + + F +Q + SV+++ Sbjct: 416 FGTDHDASSMHTISEISGGTFSFIETESVIQDAF---AQCIGGL-LSVVVQ 462 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 103 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM Q + +++ +K + FL ++ Sbjct: 91 DLLVAVDVSGSMDFPDMQW----------QGDEISRLDLVKALMGDFLQ-------DRQG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RQRPAQSRVLILITDGANNGGQI--HPLTAARLAAQEGVRIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 234 NPEASGTPGLLGLNPSLDLDEASLKEIAGITHGTYFRAHDGAELNAI 280 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 103 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + FLD+ Sbjct: 94 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFLDT----- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 140 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKL- 196 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 197 -----------------KKFPGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 237 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 238 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 102 bits (254), Expect = 9e-20, Method: Composition-based stats. Identities = 27/249 (10%), Positives = 74/249 (29%), Gaps = 45/249 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 199 DRAPIDLVAVLDVSGSMA--------------------GSKLSLLKRAVCFLIQNLGPSD 238 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ T+ +K+ ++L Sbjct: 239 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLXSSGGTNIVEGLKKGVRVL 292 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF------KSNVNTIKICDKAKE 359 +++ + + T ++ N+ IC + +E Sbjct: 293 EERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRE 352 Query: 360 NF-------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV- 410 + I + T + + + S + + + F I L+ Sbjct: 353 SGDEGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVAXVQDAFAMCIGGLLSV 412 Query: 411 ---HRKYSV 416 + +V Sbjct: 413 VAQELRLTV 421 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 15/218 (6%), Positives = 60/218 (27%), Gaps = 14/218 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + K+ + F++I L+++ + G + + + +N++++A + + LA S Sbjct: 24 IKEFLKQYRDDRRGQFAVITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSALAAVS 83 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + R +++ + + + + Sbjct: 84 YSGEEQTEREKRADTLFLNNTAGIAIEDTDLSEEDGAWVYKAE------------YKIKT 131 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + + + L + ++ + + L Sbjct: 132 NFLRVVGIDEFEMESQGAAALANSPMDVVLVLDSSGSMAQDNRMVELKASVKLFLEEFKS 191 Query: 187 ERP--IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + L+ + + + V + Sbjct: 192 NDLTQVALVPFDTQVKATSSLFGAAGNVSVANPLATGS 229 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 68/176 (38%), Gaps = 23/176 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 K+ +Y T +E ++ + V V + T+ T ++ + LT+ Sbjct: 383 KDTIYPEAHCATGSLEPITGLTFDLQSVETAVNKLT---PSGNTNVTIGVQWGMEALTAA 439 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS---------NVNTIKICDKAKE 359 + +K +I LTDG N + + T+ C+ AK Sbjct: 440 APLTGV---------RTGSEVRKVMIVLTDGLNTQNRWWGSRDRNKIDARTLAACNNAKA 490 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I++ T+ + + LLKTC + + ++ V +A L F ++++ + + + Sbjct: 491 MGIELYTVRLVEGN--EDLLKTCAETEDKYHYVTSASQLKTTFADLARQVKGVRLA 544 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 79/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM ++ + F+D+ Sbjct: 94 PQSGRDLIMAIDLSGSMAIQDMKKANGQME---------SRFDLVMRVANQFIDT----- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 140 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 197 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 198 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 237 Query: 367 ISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 238 IGLGGDQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 80/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + +++A+K+ L F++ ++ Sbjct: 111 DLMLVVDLSYSMSQEDMQFN----------GEYIDRLSAVKHVLSDFIER-------RKG 153 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 154 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTF------ 207 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + ++ D AK+ I T+ + A Sbjct: 208 ------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAADIAKKYNATIYTVGVGA 253 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ ++ L ++ I+Q Sbjct: 254 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQ 307 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 47/483 (9%), Positives = 115/483 (23%), Gaps = 107/483 (22%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNS---------------MESANN 58 +K +II L+V+ +L G K+ ++ Sbjct: 10 FTRDDKGGVAIIMGLAVIPLVLASGLAADYAIVQAAKSRLDASADAAALAAIKTAQTTI- 68 Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A L+ + + ++ + +A A K I + G Sbjct: 69 -AELSATNPNPRPQAIAAAMSQAEKSFYAQAGKRAADLLGKPAIDVQIKGQEVTANVAYS 127 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFI 177 ++ + + L + Sbjct: 128 AA---MPSNFGRIAGVKLMNYNGGAGAQLTMAKFLDFYLLLDVSGSMGLPST-------- 176 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 PA + P L + + H A + + + ++ A+ A+ Sbjct: 177 PAGEAALAAKNPDDLAQYPTGCRFACHFAGSQG-----YNVSRANNIQLRIDAVGAAVAQ 231 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTP 296 ++ + + + +G+ + T ++ + + + + TD Sbjct: 232 LMEKAKDTATLPKQYRVGVYPFVTHANAFVDLTDNLRGDQYSVESAINYDPATRTTDFGR 291 Query: 297 AMKQA-----YQILTSDKKRS-----------------------FFTNFFRQGVKIPSLP 328 + + L + K + G + Sbjct: 292 LLDAGKDWVFARDLNPNYKANPNIPADVTPMGAGGSHIHNIFQDINAKIPSVGDGSGASS 351 Query: 329 FQKFIIFLTDGENNNFKSNVNT-----------------------IKICDKAKENFIKIV 365 Q F+ F++DG N+ T +C+ K I + Sbjct: 352 PQPFVFFVSDGMQNSQSFVSATGTWPGVTPYPTPPGQTVSIRAMDPTLCNVLKARGITVS 411 Query: 366 TISINAS--------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I PN ++ C S P +++ + + + + Sbjct: 412 VLEIPYPTFTNPKPFAAAQEFKANDAVPNLSGAMRACAS-PNFYFMADTPEGIADAMKKM 470 Query: 406 SQL 408 + Sbjct: 471 FEQ 473 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 76/235 (32%), Gaps = 35/235 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM K +++ A+K L F+ + ++ Sbjct: 102 DVMVVVDLSGSMAEKDF---------TSTNGLKISRLDAVKKVLNDFVKT-------RKG 145 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + + K T A+ + Sbjct: 146 DRLGLILFGDAAFVQTPFTADHKVWLDLLNQTHVEMAGKSTHLGDAIGLTIKRFEDSNNS 205 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +K I LTDG + + + AK ++I I+I Sbjct: 206 QPLS----------TTSREKVAIILTDGNDTDSYVP--PMDAAKVAKVKGVRIHMIAIGD 253 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + +S + +N D LI+ + IS+L S + Sbjct: 254 PQTVGEQALDMDTINTIADASGGQAFQALNQDELINAYAEISKLEPQLYQSTTYR 308 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 29/293 (9%), Positives = 84/293 (28%), Gaps = 42/293 (14%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-----P 189 + ++ +T ++ Y + + + + + G Sbjct: 135 DSTPAAASTGAVVVNTHAEYPAVARAS-STDSFAVLVHVKAP-GMADTVAAGSDKPQPRA 192 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM K+A LK A+ +D++ Sbjct: 193 PLDLVTVLDVSGSM--------------------SGHKLALLKQAMRFVIDNLGPDD--- 229 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + ++ +++ + + ++ + V + T+ ++ A ++L Sbjct: 230 ---RLSVVSFSSEARRLTRLARMSDAGKALSVNAVESLVARGGTNIAEGLRTAAKVLDER 286 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFII--FLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + R+ ++ + + + G D + I T Sbjct: 287 QHRNAVSSVVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADG--DWSAPIHT 344 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + ++ + N + F +Q + SV+++ Sbjct: 345 FGFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAF---AQCIGGL-LSVVVQ 393 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 83/234 (35%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +V+DLSGSM + + +++ +K A F++ + Sbjct: 99 NIMMVLDLSGSMEITDM----------LLHGRPVSRLLVVKRAAEQFVE-------DRVG 141 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + L K T A+ A + L Sbjct: 142 DRIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDV--- 198 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK++ IKI TI + + Sbjct: 199 ---------------PSKGRVIILLTDGANNSGV--LAPLKAAELAKQDGIKIYTIGLGS 241 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ + +SL ++Q I+Q Sbjct: 242 EADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 295 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 83/234 (35%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +V+DLSGSM + + +++ +K A F++ + Sbjct: 91 NIMMVLDLSGSMEITDM----------LLHGRPVSRLLVVKRAAEQFVE-------DRVG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + L K T A+ A + L Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDV--- 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK++ IKI TI + + Sbjct: 191 ---------------PSKGRVIILLTDGANNSGV--LAPLKAAELAKQDGIKIYTIGLGS 233 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ + +SL ++Q I+Q Sbjct: 234 EADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 83/234 (35%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +V+DLSGSM + + +++ +K A F++ + Sbjct: 99 NIMMVLDLSGSMEITDM----------LLHGRPVSRLLVVKRAAEQFVE-------DRVG 141 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + L K T A+ A + L Sbjct: 142 DRIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDV--- 198 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK++ IKI TI + + Sbjct: 199 ---------------PSKGRVIILLTDGANNSGV--LAPLKAAELAKQDGIKIYTIGLGS 241 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ + +SL ++Q I+Q Sbjct: 242 EADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 295 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 83/234 (35%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +V+DLSGSM + + +++ +K A F++ + Sbjct: 91 NIMMVLDLSGSMEITDM----------LLHGRPVSRLLVVKRAAEQFVE-------DRVG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + L K T A+ A + L Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDV--- 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK++ IKI TI + + Sbjct: 191 ---------------PSKGRVIILLTDGANNSGV--LAPLKAAELAKQDGIKIYTIGLGS 233 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ + +SL ++Q I+Q Sbjct: 234 EADPRALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 67/205 (32%), Gaps = 45/205 (21%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 Q + ++ A+K L F+ +E +GL+ + T ++ VR + Sbjct: 109 QGRSINRLQAVKRVLDDFIQR-------REGDRLGLLLFGTEPYIQAPLTFDLATVRTLL 161 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + T A+ A + L Q+ +I LTDG Sbjct: 162 HEAGIGMAGRATAIGDALGLAVKRL------------------RDRPQEQRVVILLTDGA 203 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINA------------------SPNGQRLLKTC 382 N + K + AK ++I TI I A + L + Sbjct: 204 NTAGEI--APDKAAEIAKAAGVRIYTIGIGAETMVQRGLLGSRRVNPSRDLDEGLLTRIA 261 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ + L ++++I+Q Sbjct: 262 QQTGGEYFRARSLPELELIYESINQ 286 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 73/234 (31%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM D ++ A+KN + F ++ Sbjct: 86 DLMLVLDLSYSMSQEDMQDSS---------GNYIDRLTAVKNVVSQFAQQ-------RKG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + V + LI + T + A + Sbjct: 130 DRLGLVLFADHAYLQTPLTLDRNTISEQVNSLVLQLIGQKTAIGEGIGLATKTFIDS--- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + + I+ + AK+ I TI + A Sbjct: 187 ---------------DAPQRVMILLSDGSNTSGVLD--PIEAANIAKKYNATIYTIGVGA 229 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L+ + ++ NA L ++ I+ Sbjct: 230 GEMMVKDFFMTRKVNTAQDLDEKTLMSIAKITGGQYFRARNAQELATIYDTINS 283 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 80/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + +++A+K+ L F++ ++ Sbjct: 94 DLMLVVDLSYSMSQEDMQFN----------GEYIDRLSAVKHVLSDFIER-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 137 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLKLIGTQTAIGDGIGLATKTF------ 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + ++ D AK+ I T+ + A Sbjct: 191 ------------VDSDAPQRVMILLSDGSNTAGVLD--PLEAADIAKKYNATIYTVGVGA 236 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ ++ L ++ I+Q Sbjct: 237 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQ 290 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + FLD+ Sbjct: 94 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFLDT----- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 140 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 197 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 198 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 237 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 238 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM + K A F+D++ Sbjct: 39 LVFVLDSSGSMSWNDPNGYR-------------------KIAAKSFVDALIQGDRA---- 75 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 ++ + + + V+ + R T+ ++ A L S Sbjct: 76 --AVVDFDDYGYLLQPLTTDFQTVKNAIDRIDSW---GGTNIAEGIRIANHQLISQSSDD 130 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K II LTDGE + +AK N I I TI + S Sbjct: 131 RI----------------KVIILLTDGEGYYDNNLT------TEAKNNGITIYTIGLGTS 168 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L + ++ V +A L VF+ I++++ Sbjct: 169 VDENLLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTE 207 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + FLD+ Sbjct: 88 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFLDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 134 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 192 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 232 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + F+D+ Sbjct: 88 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFIDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 134 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 192 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 232 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 81/238 (34%), Gaps = 47/238 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P+ I L +D+S SM D QD++ ++ +K + F+D Sbjct: 88 TEPVRDILLALDISQSMDSQDFRD---------AQDRQVSRWTVVKAVVADFIDK----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + D +GLI + T + + +R + + T A+ ++L Sbjct: 134 --RTDDRLGLIVFGTGAFPQAPLTRDHKSLRLLLDHTAVGMAGPNTAIGDAIGMGIRMLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S ++R K +I LTDG + ++ + A ++ + + T Sbjct: 192 SAQERD------------------KVLILLTDGNDTGSAVP--PLRAANLAAQHHVTVHT 231 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS----QLMVHRK 413 I I + + L +S + + +L V+ + + + + Sbjct: 232 IGIGSPTASGDDQVDFDTLRGISSASGGQFFQAQDGAALHDVYATLDRITPREVKTLR 289 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 27/249 (10%), Positives = 75/249 (30%), Gaps = 45/249 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 281 DRAPIDLVAVLDVSGSMA--------------------GSKLSLLKRAVCFLIQNLGPSD 320 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ T+ +K+ ++L Sbjct: 321 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVL 374 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF------KSNVNTIKICDKAKE 359 +++ + + T ++ N+ IC + +E Sbjct: 375 EERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQVLEYLNLLPASICPRNRE 434 Query: 360 NF-------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV- 410 + I + T + + + S + + ++ F I L+ Sbjct: 435 SGDEGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAFAMCIGGLLSV 494 Query: 411 ---HRKYSV 416 + +V Sbjct: 495 VAQELRLTV 503 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + F+D+ Sbjct: 88 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFIDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 134 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 192 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 232 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + F+D+ Sbjct: 94 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFIDT----- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 140 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLK 197 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 198 KY------------------PGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 237 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 238 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 21/225 (9%), Positives = 49/225 (21%), Gaps = 23/225 (10%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R + N II ALS++ L+ +G + + + M+S +AA++A Sbjct: 3 FGRAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIA 62 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ + + + + I N Sbjct: 63 AVKQINNTGDTDALKLKVTDWFHAQVENSYTLGEIDIDTTNH---------NITATASGT 113 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + + S + + + +L Sbjct: 114 VPTTFMKIANIDTVPVSVASAVKGPATSYLNVYIVIDTSPSMLL-------------AAT 160 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 I + + N + + Sbjct: 161 TAGQATMYSGIGCQFACHTGDSHKVGNKTYANNYEYSTAKSIKLR 205 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 78/242 (32%), Gaps = 58/242 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM + QDK+ ++ +K + F++ Sbjct: 87 PATGRDLLLAVDISGSMKTPDM----------VVQDKQIARILVVKYVVNEFIER----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + ++ ++ + V + + T A+ A + L Sbjct: 132 --RESDRLGLILFGSQAYLQAPLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRL- 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 Q+ +I LTDG N + + D AK+ IKI T Sbjct: 189 -----------------RERPASQRVLILLTDGANTAGEV--APRQAADLAKQAGIKIYT 229 Query: 367 ISINASP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + A + L ++ ++ N L +++ + Sbjct: 230 VGVGADQMEQRMGLFGGFSRTVNPSSDLDEDTLRYMAETTGGLYFRARNPQELQAIYEEL 289 Query: 406 SQ 407 + Sbjct: 290 DK 291 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 24/279 (8%), Positives = 74/279 (26%), Gaps = 24/279 (8%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 K + SI+ A+ + L + + + + + +++A +AA + A +M + + Sbjct: 5 KRKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGT 64 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA------VFYNTEIQNIVNSSRISMT 130 S+ + + N + + + IQ V + Sbjct: 65 CTGSCASDTTAIFWANFSSAHQANGLGPFQAVSTGPVITPQNASTITIQANVQLPLLFTK 124 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-----------NQKIVSFIPA 179 + +++ S + ++ L + L Q + P Sbjct: 125 ILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTVYGVAAPG 184 Query: 180 LLRIEMGERP-----IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 G + ++ +++ P + S P + ++ + Sbjct: 185 SCGGGTGADTVPNLWVSIVPFAGEMNIFGS--TYGGPSNWQSMPSGWLTAGSDISTTRYG 242 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ + + + + Sbjct: 243 SNGWMGCVMARYSGYNNSPAHIYDVNDANPIQAPFTPFY 281 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 56/218 (25%), Gaps = 65/218 (29%) Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + V +++ + T A++ + ++ + + + Sbjct: 356 GCDPSPTLPETASRSVVEAHISSM-PMMSRGGTMLPQALQAGWFTISPNWQGFWPNP--A 412 Query: 320 QGVKIPSLPFQKFIIFLTD----------------------------------------- 338 + + K ++ +TD Sbjct: 413 LPLAYNTPNMTKVLVLMTDGNNQICPCFPVYNYYGPVAPPQSNGDTDMVAYGRLLQDELG 472 Query: 339 ------GENNNFKS----------NVNTIKICDKAKENFIKIVTI-----SINASPNGQR 377 G + N +CD K + I I I A Q Sbjct: 473 VVSSYNGNGYYGSNGFSSNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQA 532 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L+ C S P +Y+ A S+ F ++ + + S Sbjct: 533 MLQNCASKPGNYYDAPTAASMKQAFSDLGGQLSALRIS 570 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 50/318 (15%), Positives = 96/318 (30%), Gaps = 48/318 (15%) Query: 115 NTEIQNIVNSSRISMTH----MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 T I + + ++ + L+ S + + L + Sbjct: 26 VTRIAPSYRTQQSAIKVPFFHIITEALNISASDEASRLTPAKWQRILLSLSWIM------ 79 Query: 171 QKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 +V+ L R F + +VVDLSGSM K +++ Sbjct: 80 -ILVALTKPTLLGPPQTREQFGRDVMVVVDLSGSMAEKDF---------TSIDGIKISRL 129 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A+K L F + ++ +GLI + + E + + + Sbjct: 130 DAVKKVLNDFAKT-------RKGDRLGLILFGDAAFVQTPFTADHEVWLDLLNQTRVEMA 182 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K T A+ + + + + +K I LTDG + + Sbjct: 183 GKSTHLGDAIGLTIKRFEENDNSQPLS----------TTSRKKVAIILTDGNDTDSYVP- 231 Query: 349 NTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + AK I+I I+I + + +S + +N D LI+ Sbjct: 232 -PMDAAKVAKVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQALNQDELINA 290 Query: 402 FQNISQLMVHRKYSVILK 419 + IS+L S + Sbjct: 291 YAEISKLEPQLYQSTTYR 308 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 28/239 (11%), Positives = 70/239 (29%), Gaps = 34/239 (14%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ R + V+D+SGSM TK+A LK A+ + + Sbjct: 271 QMSQNSRAPVDLVTVLDVSGSMA--------------------GTKLALLKRAMGFVIQN 310 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQ 300 + + +I +++ + TE RQ + + T+ +++ Sbjct: 311 LGPSD------RLSVIAFSSTARRLFPLRCMTEAGRQEALLSVNSLVSNGGTNIAEGLRK 364 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +++ K ++ + + + + K Sbjct: 365 GAKVIVDRKWKNPVASIILLSDGQDTYTVTSPSGMNPRADYKSLLPISIHRNGGTGLK-- 422 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + + A + + S + + F +Q + SV+++ Sbjct: 423 -IPVHSFGFGADHDAASMHSISEISGGTFSFIEAEGVIQDAF---AQCIGGL-LSVVVQ 476 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 80/239 (33%), Gaps = 54/239 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + +DLSGSM + ++ + FLD+ Sbjct: 88 PQSGRDLIMAIDLSGSMAIQDMKKAN---------GQMESRFDLVMRVANQFLDT----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + TR ++ V++ + +L T A+ A + L Sbjct: 134 --RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKL- 190 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDGENN+ + ++ + AK+ IKI T Sbjct: 191 -----------------KKFPGDSKALILLTDGENNSGT--LQPLQAAEIAKQYHIKIYT 231 Query: 367 ISIN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L K + ++ N+ L V+++I + Sbjct: 232 IGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 45/234 (19%), Positives = 83/234 (35%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +V+DLSGSM + + +++ +K A F++ + Sbjct: 91 NIMMVLDLSGSMEITDM----------LLHGRPVSRLLVVKRAAEQFVE-------DRVG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + L K T A+ A + L Sbjct: 134 DRIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDV--- 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK++ IKI TI + + Sbjct: 191 ---------------PSKGRVIILLTDGANNSGV--LAPLKAAELAKQDGIKIYTIGLGS 233 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ + +SL ++Q I+Q Sbjct: 234 EADPRALTGDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 24/279 (8%), Positives = 74/279 (26%), Gaps = 24/279 (8%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 K + SI+ A+ + L + + + + + +++A +AA + A +M + + Sbjct: 2 KRKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGT 61 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA------VFYNTEIQNIVNSSRISMT 130 S+ + + N + + + IQ V + Sbjct: 62 CTGSCASDTTAIFWANFSSAHQANGLGPFQAVSTGPVITPQNASTITIQANVQLPLLFTK 121 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY-----------NQKIVSFIPA 179 + +++ S + ++ L + L Q + P Sbjct: 122 ILGVSQIALSEHAQAVRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTVYGVAAPG 181 Query: 180 LLRIEMGERP-----IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 G + ++ +++ P + S P + ++ + Sbjct: 182 SCGGGTGADTVPNLWVSIVPFAGEMNIFGS--TYGGPSNWRSMPSGWLTAGSDISTTRYG 239 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ + + + + Sbjct: 240 SNGWMGCVMARYSGYNNSPAHIYDVNDANPIQAPFTPFY 278 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 33/296 (11%), Positives = 72/296 (24%), Gaps = 72/296 (24%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS------- 241 F + ++ I T +A +++ +S Sbjct: 275 TPFYWPSTYQKYSQSSWFGGTSWVVGDNDWILSGGVVTPSSAARSSYGQMAESPLITSFP 334 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 S V E + + V +++ + T A++ Sbjct: 335 TTSGSLVTESGLQVGPNLGCDPSPTLPETASRSVVEAHISSM-PMMSRGGTMLPQALQAG 393 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD----------------------- 338 + ++ + + + + + K ++ +TD Sbjct: 394 WFTISPNWQGFWPNP--ALPLAYNTPNMTKVLVLMTDGNNQICPCFPVYNYYGPVAPPQS 451 Query: 339 ------------------------GENNNFKS----------NVNTIKICDKAKENFIKI 364 G + N +CD K + I I Sbjct: 452 NGDTDMVAYGRLLQNELGVVSSYNGNGYYGSNGFSSNILPEMNSLVSTVCDNIKNSGITI 511 Query: 365 VTI-----SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I A Q +L+ C S P +Y+ A S+ F ++ + + S Sbjct: 512 YVILYTHEGEEADATTQAMLQNCASKPGNYYDAPTAASMKQAFSDLGGQLSALRIS 567 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 78/246 (31%), Gaps = 49/246 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM ++ +++ A K F+ + Sbjct: 108 DLLLAVDISGSMQIKDFE----------MNGQQVSRLTATKAVARQFI-------SRRVG 150 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ + V + L K T A+ A + L + Sbjct: 151 DRVGLILFGSNAYVQTPLTFDRKTVITLLDEAAVGLAGKATAIGDAIGLAVKRLEQSNRD 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG N + + + + A E+ + I TI I A Sbjct: 211 KRIASKE------------QVLILLTDGVNTAGQLS--APQAAELAAEHGLTIYTIGIGA 256 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + ++ + L ++ I +L + Sbjct: 257 DAMTVQSFFGTQRVNPSADLDEKMLTDIATKTGGRYFRAHDTQELQKIYAMIDKLEPVAR 316 Query: 414 YSVILK 419 V+ + Sbjct: 317 DRVVFR 322 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 81/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + Q + ++ K+ L F++ ++ Sbjct: 94 DLFVALDLSGSMQISDM----------YYQSRPVNRLVISKHVLSDFIEK-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+I + T+ S+ T+ VRQ + + T A+ + L+ Sbjct: 137 DRIGVIVFGTKAYLQAPLSFDTKTVRQLIQETQIGFAGEKTAIGDAIGLGIKQLSEL--- 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K +I +TDG N + + ++ + A E + I TI I A Sbjct: 194 ---------------PSDKKVLILMTDGANTAGRVS--PLQAANFAAEQGVTIHTIGIGA 236 Query: 372 SPNG-----------------QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 + LL+ S + +Y + L ++ +I+ Sbjct: 237 DEMEVQGFFGPQTVNPSEDLDEALLENVASLTGGKYYRAKSTSDLEEIYGDINN 290 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 28/239 (11%), Positives = 65/239 (27%), Gaps = 29/239 (12%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 V A R + V+D+SGSM K+A +K Sbjct: 149 VIVNEAAAGDRDAPRAPLDLVTVLDVSGSMRWD--------------------KLALVKQ 188 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPT 292 A+ + S+ + ++ +++ + + + T ++SL T Sbjct: 189 AMGFVIGSLGPHD------RLSVVSFSSGARRVTRLLRMSHTGKSLATEAVESLRAGGGT 242 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + ++ A ++L + R+ ++ + + Sbjct: 243 NIAEGLRTAAKVLGERRHRNAVSSVILLSDGHDNYSMPRRARGGVPPNYEVLVPPSFVPG 302 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 E I T + + ++ + N + F I L+ Sbjct: 303 TASTG-EGSAPIHTFGFGNDHDAAAMHVVAEATGGTFSFIENEAVIQDAFAQCIGGLLT 360 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 69/200 (34%), Gaps = 45/200 (22%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ + FLD+ ++ +GLI + TR ++ V++ + Sbjct: 118 SRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASI 170 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 +L T A+ A + L K +I LTDGENN+ Sbjct: 171 ALPGPQTAIGDAIGLAVKKLKKY------------------PGDSKALILLTDGENNSGT 212 Query: 346 SNVNTIKICDKAKENFIKIVTISIN------------------ASPNGQRLLKTCVSSPE 387 + ++ + AK+ IKI TI + + L K + Sbjct: 213 --LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGG 270 Query: 388 YHYNVVNADSLIHVFQNISQ 407 ++ N+ L V+++I + Sbjct: 271 KYFRAQNSSDLKKVYESIDK 290 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 57/201 (28%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+A LK A+ + ++ + +I +++ + TE RQ Sbjct: 326 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQ 379 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 380 SLQAVYSLTSNGGTNIAEGLRKGSKVIEDRQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 438 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T C + + + A + L +S + ++ Sbjct: 439 AAPEYCSLLPYTSNGCQQ-----VPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 493 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+ + Sbjct: 494 QDAF---AQCIGGL-LSVVAQ 510 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 69/200 (34%), Gaps = 45/200 (22%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ + FLD+ ++ +GLI + TR ++ V++ + Sbjct: 124 SRFDLVMRVANQFLDT-------RKGDRVGLILFGTRAYLQTPLTFDIATVKKMLDDASI 176 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 +L T A+ A + L K +I LTDGENN+ Sbjct: 177 ALPGPQTAIGDAIGLAVKKLKKY------------------PGDSKALILLTDGENNSGT 218 Query: 346 SNVNTIKICDKAKENFIKIVTISIN------------------ASPNGQRLLKTCVSSPE 387 + ++ + AK+ IKI TI + + L K + Sbjct: 219 --LQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMTGG 276 Query: 388 YHYNVVNADSLIHVFQNISQ 407 ++ N+ L V+++I + Sbjct: 277 KYFRAQNSSDLKKVYESIDK 296 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 57/201 (28%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+A LK A+ + ++ + +I +++ + TE RQ Sbjct: 290 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQ 343 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 344 SLQAVYSLTSNGGTNIAEGLRKGSKVIEDRQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 402 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T C + + + A + L +S + ++ Sbjct: 403 AAPEYCSLLPYTSNGCQQ-----VPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 457 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+ + Sbjct: 458 QDAF---AQCIGGL-LSVVAQ 474 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 80/234 (34%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++AA+K+ L F++ ++ Sbjct: 84 DMLLAVDLSGSMSIEDM---------ITQSGESIDRLAAVKDVLAEFIEQ-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V+Q + R + LI + T + A + Sbjct: 128 DRLGLVLFAQHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N ++ A E+ + I T+ + A Sbjct: 182 ------------INSEAPQRVIILLSDGANTAGVI--EPLEAAKLAAESNVTIYTVGVGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ N L H++Q I Q Sbjct: 228 EEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 82/234 (35%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM I + ++AA+K+ L F++ ++ Sbjct: 84 DMLLVVDLSGSMSIEDM---------IIKNGESIDRLAAVKDVLAEFIEQ-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + ++ V+Q + R + LI + T + A + Sbjct: 128 DRLGLVLFAQHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTF------ 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ II L+DG N ++ A E+ + I T+ + A Sbjct: 182 ------------INSEAPQRVIILLSDGANTAGVI--EPLEAAKLAAESNVTIYTVGVGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ N L H++Q I Q Sbjct: 228 EEMIQKSFFGNRKVNPSQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 80/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + +++A+K+ L F++ ++ Sbjct: 94 DLMLVVDLSYSMSQEDMQFN----------GEYIDRLSAVKHVLSDFIER-------RKG 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + Q + + + LI T + A + Sbjct: 137 DRVGLVLFADHAYLQTPLTLDRDTLSQQLNQAVLRLIGNQTAIGDGIGLATKTF------ 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ ++ L+DG N + ++ D AK+ I T+ + A Sbjct: 191 ------------VDSDAPQRVMVLLSDGSNTAGVLD--PLEAADIAKKYNATIYTVGVGA 236 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ ++ L ++ I+Q Sbjct: 237 GEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQ 290 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 79/247 (31%), Gaps = 55/247 (22%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + ++ + L VD+S SM Q + ++ +K + F Sbjct: 76 FVGEPKALQQTDRNMMLAVDISKSMLEEDM----------QYQGRLVNRLQTVKAVVTDF 125 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ ++ +GLI + + ++ V++ + + L T A+ Sbjct: 126 VEE-------RKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVGLAGNKTAIGDAI 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L + +I LTDG+N + +K + A+ Sbjct: 179 GLGVKRLQDL------------------PESNRVLILLTDGQNTAGEI--EPLKAAELAE 218 Query: 359 ENFIKIVTISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIH 400 + +KI I I A + L ++ +Y N + L Sbjct: 219 KAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDEDTLTAIAENTGGQYYRARNVNELEQ 278 Query: 401 VFQNISQ 407 ++ ++Q Sbjct: 279 IYDVLNQ 285 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 57/201 (28%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+A LK A+ + ++ + +I +++ + TE RQ Sbjct: 281 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQ 334 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 335 SLQAVYSLTSNGGTNIAEGLRKGSKVIEDRQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 393 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T C + + + A + L +S + ++ Sbjct: 394 AAPEYCSLLPYTSNGCQQ-----VPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 448 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+ + Sbjct: 449 QDAF---AQCIGGL-LSVVAQ 465 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 74/234 (31%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + QD ++ A+K + F +E Sbjct: 86 DLMLVVDLSYSMSQKDM----------LSQDDYIDRLTAVKKVVSDFAQQ-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + V + LI T + A + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRNTIAKQVESLVLRLIGDKTAIGEGIGLATKTF------ 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ +I L+DG N + + I+ AK+ I TI + A Sbjct: 183 ------------IDSDAPQRVMILLSDGSNTSGVLD--PIEAAKIAKKYNATIYTIGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ NAD L ++ I+ Sbjct: 229 GEMMVKEFFMTRKVNTAKDLDENTLQEIADLTGGQYFRARNADELATIYDTINS 282 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 20/201 (9%), Positives = 60/201 (29%), Gaps = 14/201 (6%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ- 278 TK+A LK A+ + ++ + +I +++ + ++ RQ Sbjct: 279 SGSMAGTKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFPLRRMSDTGRQQ 332 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ +++ +++ ++++ + + Sbjct: 333 ALQAVNSLVAHGGTNIAEGLRKGAKVMEDRREKNPVASIILLSDGQDTYTVSSSG--ANQ 390 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + N +I D I + A + + S + + Sbjct: 391 PQPNYHLLLPLSIHGGDT-SGFQIPVHAFGFGADHDASSMHSISEVSGGTFSFIETEAVI 449 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+++ Sbjct: 450 QDAF---AQCIGGL-LSVVVQ 466 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 64/204 (31%), Gaps = 45/204 (22%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + T++ +K+ + F+ V++ +GLI + + ++ V + + Sbjct: 115 GNQATRLDVVKSVVTDFIQ-------VRQGDRLGLILFGAQPYIQAPLTYDLVTVGELLN 167 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T A+ + L + ++ LTDG N Sbjct: 168 EATLGIAGNATAIGDAIGLGIKRL------------------RERPADSRVLVLLTDGAN 209 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQRLLKTCV 383 + + + A + IKI T+ + A + L Sbjct: 210 TGGEVS--PEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIAD 267 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 + ++ N L ++++I+Q Sbjct: 268 ETDGQYFRARNTGELELIYESINQ 291 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 44/301 (14%), Positives = 81/301 (26%), Gaps = 65/301 (21%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 T + S+ Y R + P + + Sbjct: 26 TLQGGASIQISSVAGVVRAPRTVRYYLRHLPFALRAAAFALLVVALARPQDVEQNVRTNT 85 Query: 190 IF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I L +D+SGSM + +D K ++ A K F+ Sbjct: 86 EGIDIMLAIDVSGSM---------------LARDFKPDRITAAKEVAGSFIA-------D 123 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + ++ + R LI T + A L Sbjct: 124 RYGDRIGLVAFAGEAFTQSPLTTDQSTLQTLLARIRSGLIEDGTAIGNGLATAINRLRES 183 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K II LTDG NN + + + AK I++ TI Sbjct: 184 DAK------------------SKVIILLTDGVNNQGQI--APMTAAEIAKAQGIRVYTIG 223 Query: 369 IN----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + + L + ++ + + L V+ I+ Sbjct: 224 VGTEGMAPYPAIDMFGNLTFVNQKVEIDEKVLKAISDMTGGRYFRATDKEKLKAVYDEIN 283 Query: 407 Q 407 Q Sbjct: 284 Q 284 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 79/245 (32%), Gaps = 57/245 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 86 DLMLSVDLSGSMQIEDM----------VIDGKVTDRFTLIQHVISQFIER-------RKG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ + T A+ + + Sbjct: 129 DRIGLILFADHAYLQSPLTQDRRTVAQYLNEAEIGLVGRQTAIGEAIALGVKRFDQVENS 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN + + D A + I I T+ + A Sbjct: 189 N------------------RVLILLTDGSNNAGSIS--PEQATDIAAKRGITIYTVGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL--MVH 411 + +L + + ++ N + L ++Q I +L + Sbjct: 229 EVMERRTLFGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEELERIYQEIDKLEPISR 288 Query: 412 RKYSV 416 + S Sbjct: 289 EQLSY 293 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 52/473 (10%), Positives = 122/473 (25%), Gaps = 77/473 (16%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + KAN ++I AL ++ + L+G + KK +++A +AA +A + Sbjct: 5 LLSRFVRDRKANIAVIAALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM 64 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + A + +++ N + + Sbjct: 65 LMQTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDVGLQRTVKVSYNAASLNNFPQ 124 Query: 130 THMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 M N S + ++ + + I + I + Sbjct: 125 LLMNNVSWAISGASTAQASSAPNMNFYLLMDDSPSMGIGATATDISNLIASTAPKYQKAS 184 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH- 247 + + N A + ++ + +A+ L++ Sbjct: 185 SSQ----NCGFACHETNIAHDGGTKDNLAIARANNITLRIDLVTSAVNQLLNTWSNCPQS 240 Query: 248 -------------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--- 291 L + + + T Q + + + Sbjct: 241 GVSGGVMQCMSALNNTTYRAALYTFDLGLNTLASLTTPTTAGAQVSNIALMPVAYQNCVV 300 Query: 292 ------TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 TD+ + A L + G Q+ + +TDG + Sbjct: 301 PTTNCKTDNGTDIAGALTSL------NGIMPSPGLGSNASGDTPQEVVFLVTDGVEDKIS 354 Query: 346 SN--------------VNTIKICDKAKENFIKI---VTISI------------------- 369 S+ IC K+ IKI T + Sbjct: 355 SSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIAILYTEYLQLKTPNVPVTDTWYMSWID 414 Query: 370 ------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +++ + L+ C S P + NV ++ ++ + S+ Sbjct: 415 AYDEPTSSTGAIAKNLQACAS-PGFFSNVQTGGNITQALTDLFLKVASSTASL 466 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 57/201 (28%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+A LK A+ + ++ + +I +++ + TE RQ Sbjct: 290 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTETGRQQ 343 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 344 SLQAVYSLTSNGGTNIAEGLRKGSKVIEDRQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 402 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T C + + + A + L +S + ++ Sbjct: 403 AAPEYCSLLPYTSNGCQQ-----VPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 457 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV+ + Sbjct: 458 QDAF---AQCIGGL-LSVVAQ 474 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 86/245 (35%), Gaps = 57/245 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +DLSGSM I + +++ +K+A F+ + Sbjct: 91 NIMMALDLSGSMEIPDM----------ILHGRPTSRLNIVKSAAEQFVRE-------RSG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ + + L K T A+ A + L S Sbjct: 134 DKIGLILFGTRAYLQTPLTYDRHSILLRLEDATAGLAGKTTSIGDAVGLAVKRLDSA--- 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + II LTDG NN+ + +K + AKE IKI TI + Sbjct: 191 ---------------PKKGRVIILLTDGANNSGV--LAPLKAAELAKEEGIKIYTIGLGS 233 Query: 371 -----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ--LMVH 411 A + + L K + ++ + +SL +++ I+Q + Sbjct: 234 EGDSRALVGDFLMQSPAADLDEETLKKMSDMTGGRYFRATDTESLHLIYKTINQLETINQ 293 Query: 412 RKYSV 416 + +V Sbjct: 294 EQATV 298 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 21/205 (10%), Positives = 66/205 (32%), Gaps = 15/205 (7%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+ LK A+ +D++ + ++ +++ + + + ++ + Sbjct: 185 SGSMVGTKLELLKQAMGFVIDNLGPRD------RLCVVSFSSGANRLMRLARMSDAGKSL 238 Query: 280 VTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R + SL T+ A+++A +++ R+ + + + + Sbjct: 239 ARRAVQSLAAGGGTNIGEALRRAAKVIDERMHRNAVASVVLLSDGQDTYTVPRRGGY--G 296 Query: 339 GENNNFKSNVNTIKICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G + N+ + V A + T + + ++ + + + Sbjct: 297 GRDANYDALVPPSFAFTGAGGRPAAPVHTFGFGTDHDAAAMHTIAEATGGTFSFIEDEAA 356 Query: 398 LIHVFQN-ISQLMV----HRKYSVI 417 + F I L+ + + Sbjct: 357 IQDAFAQCIGGLLSVTVQELRLDIA 381 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 51/473 (10%), Positives = 119/473 (25%), Gaps = 77/473 (16%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + KAN ++I AL ++ + L+G + KK +++A +AA +A + Sbjct: 5 LLSRFVRDRKANIAVITALVMIPIIFLLGMTLDFTQALRKKQQLDAAADAAAIAAVRPAM 64 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + A + +++ N + + Sbjct: 65 LMQTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDVGLQRTVKVSYNAASLNNFPQ 124 Query: 130 THMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 M N S + ++ + + I + I + Sbjct: 125 LLMNNVSWAISGASTAQASSAPNMNFYLLMDDSPSMGIGATATDISNLIASTAPKYQKAS 184 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH- 247 + + N A + ++ + +A+ L++ Sbjct: 185 SSQ----NCGFACHETNIAHDGGTKDNLAIARANNITLRIDLVTSAVNQLLNTWSNCPQS 240 Query: 248 -------------VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--- 291 L + + + T Q + + + Sbjct: 241 GVSGGVMQCMSALNNTTYRAALYTFDLSLNTLASLTTPTTAGAQVSNIALMPVAYQNCVV 300 Query: 292 ------TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 TD+ + A L + G Q+ + +TDG + Sbjct: 301 PTTNCKTDNGTDIAGALTSL------NGIMPSPGLGSNASGDTPQEVVFLVTDGVEDKIA 354 Query: 346 SN--------------VNTIKICDKAKENFIKIVTISINA-------------------- 371 S+ IC K+ IKI + Sbjct: 355 SSCPNGSYASYSRCQQPLDTAICTTIKKRGIKIAILYTEYLQLKTPNVPVTDTWYMSWID 414 Query: 372 --------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L+ C S P + NV ++ ++ + S+ Sbjct: 415 AYDEPTSSTGAIAKNLQACAS-PGFFSNVQTGGNITQALTDLFLKVASSTASL 466 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 77/233 (33%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM QD+ +++ +K+ L FL+ +E Sbjct: 91 DLLVAVDVSGSMDFPDMQW----------QDEDVSRLNLVKHLLGDFLEH-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSKAYLQAPLTFDRHTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + I A E +KI I I A Sbjct: 188 ------------RQRPAQSRVLILVTDGANNGGEI--APITAARLAAEEGVKIYPIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 P + L + + ++ + L + Q + Q Sbjct: 234 DPEQSATLGVLGINPSLDLDEPALKELAQVTGGRYFRARDGQELQAIKQTLDQ 286 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 75/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM D+ +++A+K L F+ ++ Sbjct: 86 DMMLVVDLSYSMSQQDM----------KSGDQFIDRLSAVKQVLSDFIAK-------RQG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + Q + + + LI T + A + Sbjct: 129 DRLGLIFFADHAYLQTPLTLDRQTIAQQLNQAVLRLIGTQTAIGEGIGLATKTFIDS--- 185 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + + ++ AK+ I T+ + A Sbjct: 186 ---------------DAPQRVMILLSDGSNTSGVLD--PMEAAKIAKKYHTTIYTVGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ N L H++ I+Q Sbjct: 229 GEMMVKEFFMTRKINTAEDLDEKTLQAIADETGGQYFRARNQQDLQHIYDTINQ 282 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 77/232 (33%), Gaps = 55/232 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + + +++ + F++ ++ Sbjct: 85 DLMLSVDLSGSMQIEDM----------VLDGKVVDRFSLIQHVISDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ K T A+ A + ++ Sbjct: 128 DRIGLILFADHAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVKRFDKVEQS 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN + + A + I I TI + A Sbjct: 188 N------------------RVLILLTDGSNNAGAIS--PEQATQIAAKRGITIYTIGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + +L + ++ ++ N + L ++Q I Sbjct: 228 DVMERRTLFGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQVI 279 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 33/263 (12%), Positives = 75/263 (28%), Gaps = 53/263 (20%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 L+N + I V+D+SGSM Sbjct: 241 DATGSLVNGSSLRLSRRV-------------PIDIVTVLDVSGSMA-------------- 273 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVR 277 TKMA LK A+ + + + +I +++ + + + Sbjct: 274 ------GTKMALLKQAMGFVIQHLRPSD------RLSVIAFSSTARRLFPLQRMSHHGRQ 321 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q + T+ A+K+A +++ ++ + + + Sbjct: 322 QALQAINSLGAGGGTNIADALKKAVKVIADRSYKNSVCSIILLSDGQDTYNISSNFQGTS 381 Query: 338 DGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 G + +N N + + + + T A + L +S + + Sbjct: 382 AGRRSLVPSANPNELHM--------VPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEG 433 Query: 397 SLIHVFQNISQLMVHRKYSVILK 419 + F +Q + SV ++ Sbjct: 434 VMQDAF---AQCIGGL-LSVAVR 452 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 79/250 (31%), Gaps = 65/250 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+SGSM K++ ++ +K + F++ + Sbjct: 87 DIVLAVDISGSMLAEDFE----------MDGKRQNRLYVVKQVVKDFINK-------RST 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ + + Q++ + L+ T A+ + L Sbjct: 130 DPIGLVVFSANAYTQCPLTLDYGILLQFLEKTEIGLLEDGTAIGSAIASSVDRL------ 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 + K I+ LTDG NN+ + + + + A+ IKI TI Sbjct: 184 ------------RNTKAQSKVIVLLTDGRNNSGQID--PLTAAELAQAFNIKIYTIGAGS 229 Query: 369 -----------INA--------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS--- 406 + + L + + +Y + SL ++Q I Sbjct: 230 KGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANITGGRYYRATDTGSLKEIYQQIDALE 289 Query: 407 ---QLMVHRK 413 + H K Sbjct: 290 KTETEITHYK 299 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 70/240 (29%), Gaps = 66/240 (27%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM + +D + ++ A K F+ + Sbjct: 89 DIMLAIDVSGSM---------------LARDFRPDRITAAKEVAGSFIA-------DRYG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + ++ + R LI T + A L + + Sbjct: 127 DRIGLVAFAGEAFTQSPLTTDQGTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESEAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 K II LTDG NN + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGVNNRGEI--APQTAAEIAKAQGIRVYTIGVGT 226 Query: 371 -----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ + L ++ I+Q Sbjct: 227 EGMAPYPAVDIYGTPTGGTVMAKVEIDEKTLRSIAEQTGGQYFRATDKAKLKAIYDQINQ 286 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 56/532 (10%), Positives = 124/532 (23%), Gaps = 136/532 (25%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ +E +++I + + L+ + K A LA ++ Sbjct: 10 LKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLALMAENND 69 Query: 71 NLSRLGDRFES-ISNHAKRALIDDAKRFIKNHIKESLSGYSAVF-----------YNTEI 118 + + K L + F K + S + N +I Sbjct: 70 FRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTYFKDNLKI 129 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMD----------------------------- 149 + + ++ +L S I + Sbjct: 130 SDKYDYQCNNLQGPNGEQLKSIACEISGKFERPSWLYLGKNNGLSFAETTTINANKIYIQ 189 Query: 150 ------VMTSYDYRLQFIEHLLNQRYNQKI------------------------------ 173 + + + K Sbjct: 190 KNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNTEEGKV 249 Query: 174 ------VSFIPALLRIEMGERPIFLI------ELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + F + + + + + + + Sbjct: 250 ISQYNRIGFTSFAFGAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTMQYSELLSYV 309 Query: 222 DKKRTKMAALKNA--------LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN------I 267 + + + A ++SI+ M + + Sbjct: 310 GSNQQRYSYATLAQYFDAFVDYDKTIESINSFDGKDLSSLMYFSKNSWCLGSANTRINST 369 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 G + V+R L T S+ + +L + Sbjct: 370 YIWAGKNESADLVSRFNRVPALGATLSSSGLLIGANLLMNT------NPDENAQPSKLGA 423 Query: 328 PFQKFIIFLTDGENN--------NFKSNVNTIKICDKAK------------ENFIKIVTI 367 Q+ I+ L+DGE+ N S + +C+K K E +I + Sbjct: 424 NTQRIILVLSDGEDQINNASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFV 483 Query: 368 SINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNIS---QLMVHRKY 414 + P+G + + CV +Y+Y N + L+ F+ I + + H Y Sbjct: 484 AFGYGPSGTQKAAWEKCV--GKYYYVANNKEELLESFRKIIGLVEEVGHSTY 533 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 25/240 (10%), Positives = 72/240 (30%), Gaps = 18/240 (7%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + ++ + + SM + + V + TK+A LK A+ + Sbjct: 234 IHVKAPTSSSNDVTGSLVNESSMRSSRRVPIDLVTVLDVSGSMAGTKLALLKQAMGFVIQ 293 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMK 299 + + +I +++ + + +Q + T+ A+K Sbjct: 294 HLRPSD------RLSVIAFSSTARRLFPLQRMSHHGRQQALQAISSLGAGGGTNIADALK 347 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A +++ ++ + + + G ++ I ++ Sbjct: 348 KAVKVIEDRNYKNSVCSIILLSDGQDTFNISSNFQGTSAGR-----RSLVPPSILNELHM 402 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + T A + L +S + + + F +Q + SV ++ Sbjct: 403 --VPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEGVMQDAF---AQCIGGL-LSVAVQ 456 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 76/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGNVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 231 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A E +KI I I + Sbjct: 188 ------------RMRPATSRALVLVTDGANNAGQID--PVTAARLAAEEGVKIYAIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A E +KI I I + Sbjct: 188 ------------RMRPATSRALVLVTDGANNAGQID--PVTAARLAAEEGVKIYAIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 49/483 (10%), Positives = 109/483 (22%), Gaps = 104/483 (21%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES---------- 55 R + ++ + + +++FAL+V+ + L+G + K +++ Sbjct: 10 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 69 Query: 56 --ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A V ++++ A + ++ SG + Sbjct: 70 VTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSGQTLDA 129 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRYNQK 172 + +N + + + + + Sbjct: 130 TV---SYTATVQSTFGRLFGLSVTTLTNRVNASADIAGYLDFYLMVDVSGSMGLPTTDSD 186 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +E + + H + K + + A+ Sbjct: 187 AALLASKS------------VEDQGNCQFACHFPNRKGWN------LAAGKIQLRSDAVN 228 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 NA+ LD V +G+ + R+ S T + D P Sbjct: 229 NAVCALLDRASK-PIVPNQYRIGIYPFINRLATLAPLS-DTTTSLASLKTTADCGKAWPL 286 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQG----------------VKIPSLPFQKFIIFL 336 T + L ++ + T S + F+ + Sbjct: 287 AFTNLLDTGSTQLFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLI 346 Query: 337 TDGENNNFKSN-----------------------------VNTIKICDKAKENFIKIVTI 367 TDG N+ + C K+ I + Sbjct: 347 TDGMQNSQTYSSWKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISIL 406 Query: 368 SI----------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I SP L+ C S P + +A + + Sbjct: 407 YIPYNYVKSYTNEGTIVWENNRVNGFSPTLADPLRQCAS-PGLFFTANSAKDITASLGAM 465 Query: 406 SQL 408 Sbjct: 466 FDQ 468 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 79/244 (32%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMNEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N + +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLD--PLKAAELAKAE 231 Query: 361 FIKIVTISI-----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I TI+ N + L K + + + + L ++ Sbjct: 232 GVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM ++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMDFPDMQWK----------NEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAKIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RQRPAQSRVLVLITDGANNGGQI--HPLTAARLAAQEGVRIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 234 NPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGAELDAI 280 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 79/243 (32%), Gaps = 59/243 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM ++ ++ A+K F+ Sbjct: 80 PNSGRDLMLAVDISGSMRVEDM----------QVGNRMARRIDAVKQLGSDFM------- 122 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + +R S+ + V++++ + T A+ A + L Sbjct: 123 SRRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLEAQIGFAGQETAIGDAIGLAVKRLQ 182 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + ++ LTDG++ + ++ + A + ++I T Sbjct: 183 E------------------RPASSRVLVLLTDGQDTASTVD--PLEAANLAADLGVRIYT 222 Query: 367 ISINASP----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I I A + L++ S+ ++ + + L V++ Sbjct: 223 IGIGADSLTLPGLLGSPLGARTVNPSADLDESTLIEIARSTGGQYFRARDPEELATVYRL 282 Query: 405 ISQ 407 + Q Sbjct: 283 LDQ 285 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 202 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 251 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR ++ + L + T A+ Sbjct: 252 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIAL 304 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 305 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 344 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + L K + + + + L ++ Sbjct: 345 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 404 Query: 404 NISQ 407 + + Sbjct: 405 ELDR 408 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 79/237 (33%), Gaps = 55/237 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + L VD+SGSM + ++ ++ A+++ F++ Sbjct: 87 PQSGRDLLLAVDISGSMRVEDM----------VIANQAVRRIDAVRDIGAEFIER----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + +R S+ + V+Q+++ T A+ A + L Sbjct: 132 --REGDRVGLILFGSRAYMQSPLSFDRDTVKQFLSEAQIGFAGSETAIGDALGLAVKRLR 189 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + +I LTDG++ + + A +K+ T Sbjct: 190 DKEDGD------------------RVVILLTDGQDTASSVD--PLDATALAANYGVKVYT 229 Query: 367 ISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I I A + + L S+ ++ + D L ++ + Sbjct: 230 IGIGADEMLVPSLFGNRRVNPSAELDEETLSAMAESTGGRYFRARSPDELAKIYDLL 286 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 19/202 (9%), Positives = 56/202 (27%), Gaps = 16/202 (7%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 TK+A LK A+ + ++ + +I +++ + T +Q Sbjct: 192 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQ 245 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 246 SLLAVNSLTANGGTNIAEGLRKGSKVIEERQSKNPVCSIILLSDGQDTYTVSPTAGVHKG 305 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 T + + + A + L +S + ++ Sbjct: 306 ATEYCALLPSTTTNGSQQ-----VPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEATI 360 Query: 399 IHVFQNISQLMVHRKYSVILKG 420 F +Q + SV+ +G Sbjct: 361 QDAF---AQCIGGL-LSVVAQG 378 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 76/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM SD D ++ A+K + F + +E Sbjct: 86 DLMLVLDLSYSMSQEDMSDGSDYVD----------RLTAVKKVVSDF-------AIKREG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + V V + + LI T + A + Sbjct: 129 DRLGVVLFADHAYLQTPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIGLATKTF------ 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I S Q+ +I L+DG N + + I+ AK+ I TI + A Sbjct: 183 ------------IDSDAPQRVMILLSDGSNTSGVID--PIEAAKIAKKYDATIYTIGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ +A L ++ I+ Sbjct: 229 GEMMVKEFFMTRKVNTAQDLDEKALMQIAQITGGQYFRARDAKELATIYDTINS 282 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 27/296 (9%), Positives = 75/296 (25%), Gaps = 20/296 (6%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ ++ +I FAL + +L+ L+ V ++ M+ A +AA L A Sbjct: 18 FARRLRRDDRGAIAIQFALLALPLSILLFGLLDVGRLSLQRRQMQDALDAATLMAARSTA 77 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ + L ++ + S + + I Sbjct: 78 TSSADLDTTGDAAFLAEIAGMNLGLTA-------SSSTFSAGTNNRVIGTATATLRPIIA 130 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER- 188 + + ++ L + N + L+ + + + Sbjct: 131 NLWQSGNFTVTASSEVVRASKNLEIALVLDITGSMGNGTRIADLKVAAADLVDVLVRDTQ 190 Query: 189 -----PIFLIELVVDLSGSMHCAM---NSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + L+ ++ A P + + + A + Sbjct: 191 TPFYSKMALVPYSAGVNVGATYADAVRGPVPVKTITGAAWASGSARSITGITRANPAVVT 250 Query: 241 ----SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + +V G+ ++ + + T + T Sbjct: 251 ASGHGLSTGDYVYITGVRGMTSVNDKIYRVTRSDPDKVSLNSTNTSSASNYTNGGT 306 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/373 (9%), Positives = 80/373 (21%), Gaps = 63/373 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 D + N + + + + + +T Sbjct: 283 SDPDKVSLNSTNTSSASNYTNGGTIQKCLTSTCQVVITTSTSHGFTTGDEIRFAGMSGLT 342 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 L+ + TI + + S +P Sbjct: 343 SLNGTTRTITVLTNTTFDSSLTGPGSSTYSSGGTATCEESTVP----------------- 385 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + + ++ + + A + Sbjct: 386 GCEKIRFTNVDGYERVNSQSTCATERIGSQAYTDAAPSTAY---------VGSHYPTAGS 436 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 S ++ + + T + + ++ + + Sbjct: 437 SSSTVCPTATITPLSTDKTALKAQINGLT---VGGATAGQIGLAWGWYMVAPNFGYLWPN 493 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENN-----------------------------NFKS 346 ++ + K +I +TDG N N S Sbjct: 494 A-SQRPAAYKARDLMKVVILMTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSS 552 Query: 347 NVNTIKICDKAKE--NFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++CD K N I + T+ + L C SS + Y L F Sbjct: 553 FDQAAELCDSIKASANDITLYTVGFTVGNDQTARNFLTNCASSTDKAYFPATGSELKASF 612 Query: 403 QNISQLMVHRKYS 415 Q I+Q + + + + Sbjct: 613 QAIAQEISNLRIA 625 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 72/236 (30%), Gaps = 39/236 (16%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + V+D+SGSM TK+A LK A+ + + Sbjct: 260 TSRAPVDLVTVLDVSGSMA--------------------GTKLALLKRAMGFVIQHLGPS 299 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +I +++ + + +Q + T+ A+K+A ++ Sbjct: 300 D------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAKV 353 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFII-FLTDGENNNFKSNVNTIKICDKAKENFIK 363 + ++ + + + D + S +N IC + Sbjct: 354 IEDRNYKNPVCSIILLSDGQDTYNISSNVRGTRPDYRSLVPSSILNH-TICT------VP 406 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + A + L SS + + + F +Q + SV+++ Sbjct: 407 VHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAF---AQCIGGL-LSVVVQ 458 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 72/236 (30%), Gaps = 39/236 (16%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + V+D+SGSM TK+A LK A+ + + Sbjct: 260 TSRAPVDLVTVLDVSGSMA--------------------GTKLALLKRAMGFVIQHLGPS 299 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +I +++ + + +Q + T+ A+K+A ++ Sbjct: 300 D------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGAGGGTNIADALKKAAKV 353 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFII-FLTDGENNNFKSNVNTIKICDKAKENFIK 363 + ++ + + + D + S +N IC + Sbjct: 354 IEDRNYKNPVCSIILLSDGQDTYNISSNVRGTRPDYRSLVPSSILNH-TICT------VP 406 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + A + L SS + + + F +Q + SV+++ Sbjct: 407 VHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAF---AQCIGGL-LSVVVQ 458 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 76/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + ++ A K L FLD Sbjct: 89 GVVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGSVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +E +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 231 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 79/244 (32%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMNEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N + +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLD--PLKAAELAKAE 231 Query: 361 FIKIVTISI-----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I TI+ N + L K + + + + L ++ Sbjct: 232 GVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 79/244 (32%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMNEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N + +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLD--PLKAAELAKAE 231 Query: 361 FIKIVTISI-----------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I TI+ N + L K + + + + L ++ Sbjct: 232 GVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 77/243 (31%), Gaps = 59/243 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM D+ +++ A+K F++ Sbjct: 87 PNSGRDLMLAVDISGSMKIEDMEVS----------DELVSRIRAVKQVGSRFIEQ----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + S+ T V++++ + T A+ A + L Sbjct: 132 --REGDRLGLILFGSNAYVQSPLSFDTATVKRFLLEAQIGFAGQDTAIGDAIGLAVKRL- 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +I L+DG++ + A + I+I T Sbjct: 189 -----------------KERPAENRVLILLSDGKDTASSV--QPLNAAKLAADLGIRIYT 229 Query: 367 ISINASP----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I I A + L + + ++ N + L +++Q Sbjct: 230 IGIGADSLTMPGLFGSSFGARQVNPSAELDEAGLQQIAKITDGKYFRARNPEELANIYQL 289 Query: 405 ISQ 407 + Q Sbjct: 290 LDQ 292 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 237 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 283 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 284 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 339 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 340 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 379 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 380 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 439 Query: 407 Q 407 Q Sbjct: 440 Q 440 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 45/315 (14%), Positives = 83/315 (26%), Gaps = 53/315 (16%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI---- 190 + + + + Q + I R++ Sbjct: 699 DASSLATSQSTVEKIQTVDAMMVFDLSGSMNEIMSGQNQLKDIGEFSRVKNQMDINKVYY 758 Query: 191 -------FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-------------------- 223 D S M A S D Sbjct: 759 WNKYEKSGWWPWTYDKSVGMGTAAVSGNVYAKYPVKYIDGQWKKYVDGSYQSISDSDVMA 818 Query: 224 -KRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-- 278 +K++ALK+A F+ I S IG + + G KV + Sbjct: 819 VWTSKISALKDAASGFVTGISDTSPDSLVGIATFYGIGNGWNSSTEGKLNHGLSKVNKNE 878 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T ++ AY L + + +K++I +D Sbjct: 879 MLKSVNALFADGGTSPQKGLEHAYSELQKAEDGN-----------------KKYVILFSD 921 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE ++ + T K KE ++T+ + + L V+S + A+ L Sbjct: 922 GEPSDSNDKMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFTADTAEEL 981 Query: 399 IHVFQNISQLMVHRK 413 +FQNI + + Sbjct: 982 NKIFQNIQSTITQSR 996 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 76/218 (34%), Gaps = 39/218 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D SGSM D ++TK +++ + F+ + + Sbjct: 84 DIVLAIDASGSMQEKGF----------DPTDPQKTKFDVVRSLVKAFISK-------RRN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + ++ E V++ + + T A+ ++ ++L + + Sbjct: 127 DNIGVVIFGSFAYIASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLLKESQAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K +I LTDG + K AK+ +KI TI I Sbjct: 187 ------------------SKIVILLTDGIDTASK--TPPDVAVKMAKKYGVKIYTIGIGD 226 Query: 372 --SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L Y++ +A L ++ I++ Sbjct: 227 KRGIDEAFLRWLAQQGHGYYFYAKDASMLRKIYDEINR 264 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 55/448 (12%), Positives = 120/448 (26%), Gaps = 72/448 (16%) Query: 7 FRFYFK--KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R + + + +E + F + F+ L ++ V + KK + A A LA Sbjct: 7 IRKFNRLTQFKKNEHGAILVSFIIIFPFFIALTFIILEVSIFLQKKAKLSDAIEQATLAL 66 Query: 65 ASKMV---SNLSRLGDRFESISNHAKRALIDDAKR--FIKNHIKESLSGYSAVFYNTEIQ 119 + + + +R +S + + L +AV ++ Sbjct: 67 TVENDGIPNAAQQTKNRELVLSYANAYLPSEGFSDPIINIDDNTNYLGYNAAVTMTYPVE 126 Query: 120 NIVNSSRISMTHMANNRLDSSN-NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV---- 174 + S + + + T + + +L + + Sbjct: 127 FLGRSPLTNSISNIQTTDNGEAIKNKTIEVSEPTDVVFVADYSGSMLLSFSDDVSIKNGE 186 Query: 175 ---------SFIPALLRIEMGERPIFLIELVVDLSG----------SMHCAMNSDPEDVN 215 + ++ I I H + N Sbjct: 187 RINALRSAFRILHNTIKNNSNVNTIGFIPFGSGTKRKVSENGENKEYCHLPFSPKIYKPN 246 Query: 216 SAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT------RVEKNIE 268 + ++ + TK + + +D + + E+V I + Sbjct: 247 GDYLSENAEATKNAWTFLDVIGDHIDYKKTIMSITENVQPIDIPMRDIKHKEICLSGTNS 306 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 S E+ + ++ L T + + A I Sbjct: 307 YSLEREQFDYSIENIIEMAPLGGTLISSGILSANNIFKE----------------TADNG 350 Query: 329 FQKFIIFLTDGENNN-----------FKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 +K +I L+DG ++ + +C+ +N I++ I+I SP Sbjct: 351 HKKLMIILSDGMDSYNSTMLPNKGFFISKTLIDEGMCEMIIKNGIQMAFIAIAYSPENNV 410 Query: 378 ------LLKTCVSSPEYHYNVVNADSLI 399 K CV + +Y NA L Sbjct: 411 NAPEYINWKQCV-GEDNYYEAHNAHELE 437 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 75/237 (31%), Gaps = 47/237 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P+ I LVVD+S SM D Q + ++ A++ + F+D Sbjct: 89 EPMRDILLVVDISQSMDSEDFRD---------AQGRPASRWQAVQAVVGDFIDK------ 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + D +GLI + + +R + R + T A+ ++L Sbjct: 134 -RPDDRLGLIVFGAGAYPQAPLTRDHAALRLLLQRTAVGMAGPNTALGDAIGLGIRMLDH 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +R K +I LTDG + + + A ++ + + TI Sbjct: 193 AGERD------------------KILILLTDGNDTASAVP--PARAAELAAQHRVVVHTI 232 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRK 413 I + L + + + SL V+ + + + + Sbjct: 233 GIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATLDRITPHEVRTLR 289 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L S+ ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L S+ ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L S+ ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDYEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L S+ ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 99 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 78/231 (33%), Gaps = 52/231 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM Q + +++ +K L F+ ++ Sbjct: 88 DIMLAVDLSGSMVEQDM----------AYQGRYVDRLSMVKAVLKNFIAQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V + + L+ + T A+ A + + + Sbjct: 131 DRLGLILFGDTAFLQTPLTRDLNTVSKMLEEAQIGLVGRATAIGDALGLAVKRFSQKQDS 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + + ++ LTDGEN N+ + A+E IK+ T+ + Sbjct: 191 N------------------RILVLLTDGENTAG--NLAPEEALLLAREEGIKVYTVGVGS 230 Query: 371 --------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +S + L K + ++ + SL ++Q + + Sbjct: 231 QGGNRFNLFSMSGSSSLDESLLQKIATETGGLYFRATDVASLQQIYQELDK 281 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR ++ + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGALN--PLKAAELAKAE 231 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +D+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAIDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A + +KI I I + Sbjct: 188 ------------RLRPANSRVLVLVTDGANNAGQID--PITAARLAADEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSALGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 64 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 113 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 114 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 166 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 167 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 206 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + + L K + + + + L ++ Sbjct: 207 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYA 266 Query: 404 NISQ 407 + + Sbjct: 267 ELDR 270 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 231 Query: 361 FIKIVTISIN-----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ + + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 21/244 (8%), Positives = 60/244 (24%), Gaps = 10/244 (4%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 K I + N IIF L + + G + ++ + + ++ A +AA+L Sbjct: 4 ISQSLTKFIGDRQGNTLIIFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLKATQ 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 K+ +S + + + + S + + Sbjct: 64 KI--EVSEAAAKTAAAMAFEMNLSDHPELQTASHTFAIE---TSDNAKVVHYTSEITQRP 118 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + + L + + + + + ++L + Sbjct: 119 YFLQLLGLGEQTIRVASSAQSESDPFELLFVLDTTGSMASNNKMTYLKTSVSSVLSSLIS 178 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPE-----DVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ V + D + + A D Sbjct: 179 TYGDGNEDVKVGVVAFNTQVRLPASTSYSFVDYTQCYLRTSVNYYACRTVWRAYEALCDQ 238 Query: 242 IDLL 245 + Sbjct: 239 VGST 242 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 44/357 (12%), Positives = 100/357 (28%), Gaps = 48/357 (13%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 AS S + + N+ + A + + + ++ + + Sbjct: 202 ASTSYSFVDYTQCYLRTSVNYYACRTVWRAYEALCDQVGSTICATATSKAFYRVYTSGGV 261 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + M + S YN+ + + + S + Sbjct: 262 TYYAASLMGYAKSGSKYTLYTYNLTTTVN-----------ASTLVMSTVSSGLTTSTGTS 310 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + P S+ + + + Sbjct: 311 VAYSPSGYTAFSSSYFSSLATSSWGGCLTDRNQSFDVSVAPYATDVV------------- 357 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + Y+ TT + K ++ + V Y++ T+ T ++ ++ Sbjct: 358 -----DSNYIAASCSTTALAKVLDLTSDFTSVNTYLSSLS---PGGNTNITLGVQFGMEM 409 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-------NFKSNVNTIKICDKA 357 L+ + + +K++I +TDG N N N T C A Sbjct: 410 LSP-------AEPYTKATAFGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAA 462 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 K I + + + + LL+ C S Y+Y++ A L Q+I + + Sbjct: 463 KAQGITLFVVRVEDGDSS--LLEACASQSSYYYDLSQASDLTKTMQDIFATINKLRL 517 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 74/223 (33%), Gaps = 33/223 (14%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +V+DLSGSM K +++ A K L F + +E Sbjct: 99 DVMIVLDLSGSMAETDF---------SPDPGKSLSRLDAAKEVLKQFAAT-------REG 142 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + E + + ++ + T A+ A ++ + + Sbjct: 143 DRLGLILFGDAAFLQAPFTADLETWQTLLQETDVAMAGQSTHLGDAIGLAIKVFNNSDR- 201 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S +K I LTDG + S V+ A N +++ TI++ Sbjct: 202 -------HGQQDQNSAKREKVAIILTDGNDTG--SFVSPRDAARVAAVNGVRLHTIAMGD 252 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + + ++ L +Q I + Sbjct: 253 PATVGEQALDLDTLQQLATLTGGQLFQALDEAQLTRAYQVIGE 295 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 81/246 (32%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 84 DLMVAVDLSGSMQIEDM----------VLDGKAVNRFIMVQSVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALSVKRFD----- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN+ + + D A + IKI +I + A Sbjct: 182 -------------LVDESNRILVLLTDGSNNSGSIS--PEQAADIAAKRGIKIYSIGVGA 226 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++L ++ ++ NA L ++Q I +L + Sbjct: 227 DVMERRTLFGKERVNPSMDLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEIDKLEPISR 286 Query: 414 YSVILK 419 V + Sbjct: 287 DQVSYR 292 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 80/239 (33%), Gaps = 55/239 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + LVVD+S SM + Q + ++ A+K L F+D Sbjct: 85 PVTGRDLMLVVDISPSMDEQDM----------VLQGRSINRLQAVKRVLDDFIDQ----- 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + T ++ E VR + + + T A+ + + L Sbjct: 130 --REGDRLGLILFGTEPYVQAPLTFDRETVRTLLFEAGLGMAGRATAIGDAIGLSVKRL- 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 Q+ +I LTDG N + + K + A+ +++ T Sbjct: 187 -----------------RERPQEQRVVILLTDGANTAGQVS--PDKATEIAQAAGVRLYT 227 Query: 367 ISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I I A + + L + + ++ + L ++ +I+Q Sbjct: 228 IGIGADTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGRYFRARSLPELEMIYDSINQ 286 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 80/264 (30%), Gaps = 69/264 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + I L +D+S SM + +D K ++ ALK Sbjct: 78 SSSEITKTKTTEGIDIILAIDMSSSM---------------LAKDLKPNRIEALKRVASQ 122 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F++ ++ +G++ Y+ + + V Q + I T Sbjct: 123 FIEE-------RKSDRIGIVVYSGESYTKVPATTDKSIVLQSLKDIKQGEIEDGTAIGMG 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L S K II +TDG NN + + + A Sbjct: 176 LGTAINRL------------------KDSKTKSKVIILMTDGVNNTGVID--PLSAAELA 215 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 KE I++ TI I + + + L + S+ ++ + Sbjct: 216 KEYGIRVYTIGIGTNGKALSPVAYNPDGSLQYDMVPVEIDEKLLGEIAQSTGGKYFRATD 275 Query: 395 ADSLIHVFQNISQ----LMVHRKY 414 L ++ I + + KY Sbjct: 276 NKKLAQIYTEIDKLEKSKIEELKY 299 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 78/243 (32%), Gaps = 59/243 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM ++ ++ A+K F+ Sbjct: 88 PNSGRDLMLAVDISGSMRVEDM----------QVGNRMARRIDAVKQLGSDFM------- 130 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + +R S+ + V++++ + T A+ A + L Sbjct: 131 SRRSGDRLGLILFGSRAYLQSPLSFDIQTVQRFLLESQIGFAGQETAIGDAIGLAVKRLQ 190 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +I LTDG++ + ++ + A + ++I T Sbjct: 191 E------------------RPATSRVLILLTDGQDTASTVD--PLEAANLAADLGVRIYT 230 Query: 367 ISINASP----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I I A + L+ S+ ++ + + L V++ Sbjct: 231 IGIGADSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTGGQYFRARDPEELATVYRL 290 Query: 405 ISQ 407 + + Sbjct: 291 LEK 293 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 78/228 (34%), Gaps = 43/228 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + + +D+SGSM + P + ++ +++ ++ + F+ Sbjct: 90 ENAARDVVMAIDISGSMDARDFATP---------EGERIQRLSGVREVVRAFV------- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E M LI + T + E + + R + T A+ + + Sbjct: 134 SGREGDRMALIVFGTSAYLQAPLTDDLETIIALLDRTEVGMAGPHTALGDAIGLSIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + Q+ +I L+DG + + + + + A + ++I T Sbjct: 194 TSEID------------------QRLLILLSDGSDTASRMS--PVNAAEIAADRGVEIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L + + ++ +A SL V+ I + Sbjct: 234 IGVGDPDATGENRVDLTTLKEVAQRTGGQYFFAEDAASLEAVYDRIDE 281 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 18/162 (11%), Positives = 45/162 (27%), Gaps = 2/162 (1%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R F + ++ N I A +++ ++G + + + K ++ A +A +LA Sbjct: 5 IKRALFILMRLYRNQAGNTLAIVAAAMLPLAGMVGGALDISRGYLAKTRLQQACDAGVLA 64 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G M S+ + + + + + + Sbjct: 65 GRKVMGSSGVLSDSVRDEVRKYVSFNYPSGYLGSTLATTDINPTL--GSNDQIALSLTTA 122 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 M N + + + N L + Sbjct: 123 IPTAVMRLFGRNNMSITASCTARNDYSNIDIVLVLDTTGSMA 164 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 65/207 (31%), Gaps = 37/207 (17%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 DS + G + + + + Y T + + T + Sbjct: 340 TDSNTRWTVADPTRASGQYACPKAMRELQQMTA--TDFNNYFTFNNGFIPNGGTWLDVGL 397 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN----------- 347 A ++L+ D S + + P +++IF+TDG + SN Sbjct: 398 LWAARLLSRDGLWSTENDEL-----YHTYPVSRYVIFMTDGYMSIGSSNYAAYAQEDYWR 452 Query: 348 --------------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-----PEY 388 + C K KI TIS A L C SS PE+ Sbjct: 453 RVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTISFGAGSTLDSNLINCSSSTNTTNPEF 512 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYS 415 Y ++ L VF++I + + + S Sbjct: 513 AYKADSSSDLNRVFRDIGENIGSLRLS 539 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 75/237 (31%), Gaps = 47/237 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P+ I L VD+S SM D Q + ++ A++ + F+D Sbjct: 89 EPMRDILLAVDISQSMDSEDFRD---------AQGRPASRWQAVQAVVGDFIDK------ 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + D +GLI + + +R + R + T A+ ++L Sbjct: 134 -RPDDRLGLIVFGAGAYPQAPLTRDHAALRLLLQRTAVGMAGPNTALGDAIGLGIRMLDH 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++R K +I LTDG + + + A ++ + + TI Sbjct: 193 ARERD------------------KILILLTDGNDTASAVP--PARAAELAAQHRVVVHTI 232 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRK 413 I + L + + + SL V+ + + + + Sbjct: 233 GIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATLDRITPHEVRTLR 289 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 22/238 (9%), Positives = 71/238 (29%), Gaps = 9/238 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + N++++ ++++ + + + + + +K + +A +AA A A ++ Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLTEG 82 Query: 72 LSRLGDRFESISNHAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++ D A + + + SG + ++ Sbjct: 83 ATDDQLKAYALDFFNANLNDIDPASATLNVTLPSNTSGGGLLTMTAQLAYKPYFYPAFAQ 142 Query: 131 HMANNRLDSS-----NNTIFYNMDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALLRI 183 + + D++ + T + L + L QK + + + Sbjct: 143 LVGKSATDANQKINFSVTSQVRLKNTLEVALVLDNSGSMTTLGTGSGQKRIDLLKTASKQ 202 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + + + + + VN P + L D Sbjct: 203 LVDTLAQQAVMIK-QVDKPVQFGLVPFAASVNVGPANGNASWMDTEGLSPVSNENFDW 259 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/325 (9%), Positives = 74/325 (22%), Gaps = 77/325 (23%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + + + + P +L D + + N P Sbjct: 360 DASPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWKLTQDPDEAAPVTYGAVNSWWNDDP 419 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-IGYTTRVEKNIEPSWGT--EK 275 + L + G Y+ + + + Sbjct: 420 TSGTGQSR--------LRNMAKYFQPRPIDAPALPAGNGPNYSCTTNPITPLTDVSVADG 471 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILT----------------------------- 306 T+ M +++++ Sbjct: 472 ATSIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRRETEKGNDKVVIVLTDGANT 531 Query: 307 -----------SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT------DGENNNFKS--N 347 +S + ++ + + + D N N+ + N Sbjct: 532 YYTPSSLGYSDPANSKSTYASYGYLNPGYNGTSVGRMFMGTSSAIGQLDYSNGNYTNALN 591 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ------RLLKTCVSS------------PEYH 389 +C+ AK I ++T++++ S LK+C S P Sbjct: 592 EQMATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSCSSDSRFRKDPTDPSKPAKL 651 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 + SL + F+ I + + + Sbjct: 652 FWNATGASLSNDFKEIGNELSNLRV 676 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 76/230 (33%), Gaps = 51/230 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ +K F+ ++ Sbjct: 92 DLMLAVDLSGSMSRQD----------LSYDNIPVDRLTIIKGVADDFIAK-------RKG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI ++TR ++ VR + + + T A+ A + L Sbjct: 135 DRIGLILFSTRAYVQAPLTFDRNVVRDLLRTSSIGMTGQETAIGDAIALAVKTL------ 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ ++ LTDG NN+ + + AK N +KI TI + A Sbjct: 189 ------------RTRPQEQRVLVLLTDGANNSGMLSPIP--AAEIAKANGVKIYTIGVGA 234 Query: 372 SP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ +A L ++ +I + Sbjct: 235 DAFAVGQRMVNPSFDLDEGALEQIAQMTGGRYFRARDAAGLAAIYNDIDR 284 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRIWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 49/483 (10%), Positives = 109/483 (22%), Gaps = 104/483 (21%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES---------- 55 R + ++ + + +++FAL+V+ + L+G + K +++ Sbjct: 6 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 65 Query: 56 --ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A V ++++ A + ++ SG + Sbjct: 66 VTAKEFIAANAQQSDVMTAGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRSGQTLDA 125 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN-MDVMTSYDYRLQFIEHLLNQRYNQK 172 + +N + + + + + Sbjct: 126 TV---SYTATVQSTFGRLFGLSATTLTNRVNASADIAGYLDFYLMVDVSGSMGLPTTDSD 182 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +E + + H + K + + A+ Sbjct: 183 AALLASKS------------VEDQGNCQFACHFPNRKGWN------LAAGKIQLRSDAVN 224 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 NA+ LD V +G+ + R+ S T + D P Sbjct: 225 NAVCALLDRASK-PIVPNQYRIGIYPFINRLATLAPLS-DTTTSLASLKTTADCGKAWPL 282 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQG----------------VKIPSLPFQKFIIFL 336 T + L ++ + T S + F+ + Sbjct: 283 AFTNLLDTGSTQLFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLI 342 Query: 337 TDGENNNFKSN-----------------------------VNTIKICDKAKENFIKIVTI 367 TDG N+ + C K+ I + Sbjct: 343 TDGMQNSQTYSSWKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISIL 402 Query: 368 SI----------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I SP L+ C S P + +A + + Sbjct: 403 YIPYNYVKSYTNEGTIVWENNRVNGFSPTLADPLRQCAS-PGLFFTANSAKDITASLGAM 461 Query: 406 SQL 408 Sbjct: 462 FDQ 464 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 73/226 (32%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L +DLS SM P+ K ++ A++ + F+ + Sbjct: 90 TETARDVMLAIDLSASMDYRDFPGPD---------GKPVSRFDAVQRVVDQFVAN----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + + + R V + T ++ A + Sbjct: 136 --REGDRVGLIVFGAKAYLQLPFTRDLNTARALVDLMQVGMAGPQTALGDSIGLAIRAFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + + +I LTDG + K I + A+ N I+I T Sbjct: 194 SSEVDD------------------RVLILLTDGNDTASKMT--PINAAEIAQLNGIEIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I I + + + L S ++ + +L V+ I Sbjct: 234 IGIGDAEATGEDRIDFETLASIAERSGGQFFDAQDETALRQVYDRI 279 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 81/223 (36%), Gaps = 38/223 (17%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L +D+SGSM S + ++++ K F+ + Sbjct: 76 STKKGRDLILTIDVSGSMAQKGFSKE----------ESEKSRYEVAKEIAKRFIKN---- 121 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQI 304 + +G++ + + ++ + + + D + T A+ +A + Sbjct: 122 ---RFSDNIGIVIFGSFSFSASPLTYDLKALLEMFDLMSDVGIAGNNTAIGDAIFEAIKN 178 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L S + + K II LTDG++N K + + +AK+ IKI Sbjct: 179 LESGEAK------------------SKVIILLTDGKHNFGKKS--PKEGVVEAKKRGIKI 218 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T+ I + + L K + + N+ L VF+ I + Sbjct: 219 YTVGIGTDYDKKLLEKMAKETNAKSFFAKNSKELEEVFKEIEE 261 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 72/237 (30%), Gaps = 47/237 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 +P+ + L +D+S SM P+ ++ +++ +K + F+D Sbjct: 89 QPVRDLLLAIDISQSMETEDFVAPD---------GRREDRLSGVKAVVADFIDR------ 133 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++D +GLI + T + ++ + + + T A+ A + Sbjct: 134 -RQDDRLGLIVFGTAAYPQAPLTQDHATLKLLLGQVSTRMAGPNTAIGDAIGVAIKQFEH 192 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +I LTDG + + A I + T+ Sbjct: 193 AG------------------EHDQVLILLTDGNDTGSAVP--PDRAASMAAARHIVVHTV 232 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRK 413 I + L + + + SL V+ + + + + Sbjct: 233 GIGDPQAEGEEKVDFDALRAIAAKTGGRFFPAQDQASLRQVYAELDRITPHEVRELR 289 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I A Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGA 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I A Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGA 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 4 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 46 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 47 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 100 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I A Sbjct: 101 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGA 146 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 147 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 193 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I A Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGA 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 78/239 (32%), Gaps = 47/239 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +RP+ + L +DLS SM +D ++ +++A+K + F+ Sbjct: 87 QQRPVRDLMLAIDLSQSMQTQDFNDAN---------GQRIDRLSAVKEVVQGFIQR---- 133 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++D +GLI + + + + + + T A+ A ++L Sbjct: 134 ---RKDDRLGLIVFGSGAFAQAPLTLDHASLSLLLEDSGIGMAGPNTAIGDAIGLALKLL 190 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +K +I LTDG + + A+ + I Sbjct: 191 EQA------------------HEPEKVLILLTDGNDTSSAITPQHAAAMAAAR--GVVIH 230 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRK 413 TI I + L + ++ ++ + +L V+ + + + + Sbjct: 231 TIGIGDPSAEGEAKVDLSALEQIARTTGGRYFRAEDRSALDQVYATLDRLTPHQVQTLR 289 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q+ + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QEDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L S+ ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAESTGGEYFRARSSAELESI 280 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 43/483 (8%), Positives = 106/483 (21%), Gaps = 104/483 (21%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES---------- 55 R + ++ + + +++FAL+V+ + L+G + K +++ Sbjct: 6 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 65 Query: 56 --ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A V+ +++ A + ++ +G + Sbjct: 66 VTAKEFIAANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRTGQTLDA 125 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + +N ++ + + + + Sbjct: 126 TV---SYTATVQSTFGRIFGLSATTLTNRVNASVDLASYLDFYLMVDVSGSMGLPTKDSD 182 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + H + K + + A+ Sbjct: 183 AEVLAMQSKEKQGN------------CQFACHFPDSVGWTK------AAGKIQLRSDAVN 224 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 NA+ L V +G+ + ++ + T + + P Sbjct: 225 NAVCELLKRA-STPVVPNQYRIGIYPFINQLATLAPLT-DTTTSLAALRTAAQCDKVWPL 282 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQG----------------VKIPSLPFQKFIIFL 336 T + L ++ T S + F+ + Sbjct: 283 AFTNLLDTGSTQLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLI 342 Query: 337 TDGENNNFKSN-----------------------------VNTIKICDKAKENFIKIVTI 367 TDG N+ + C K I + Sbjct: 343 TDGMQNSQSYSAWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVL 402 Query: 368 SINAS----------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + P L+ C S P + Y D + + Sbjct: 403 YIPYNIVKNYNNDSYIVWENGRVNQFSPTLADPLRKCAS-PGFFYTANTQDDITASLGAM 461 Query: 406 SQL 408 + Sbjct: 462 FKQ 464 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q ++ T++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMDYPDMQW----------QGEELTRLELVKVLLGDFIEQ-------RHG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + T A+ A + L Sbjct: 134 DRVGLILFGSKAYLQAPLTFDRRTVRVWLDEARVGIAGSNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + A E ++I TI I A Sbjct: 188 ------------RERPTNSRVLVLITDGANNGGELDPLL--AATLAAEESVRIHTIGIGA 233 Query: 372 -----------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + L + ++ ++ L + Sbjct: 234 VPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASSAELKAI 280 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 81/274 (29%), Gaps = 64/274 (23%) Query: 175 SFIPALLRIEMGERPI----FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 S A R+ +GE I I +VVD SGSM +D +T++ Sbjct: 71 SVAAAGPRVAVGENTIRREGIAIMMVVDTSGSMRALDLADG----------GLDQTRLEV 120 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLI 288 +K+ F+ D L D +GL+ + + + + + + Sbjct: 121 VKDVFRAFVAGEDGLDGRSNDT-IGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERA 179 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A + L + + II LTDG NN Sbjct: 180 EDGTAIGDGLGLAVERLRESEAS------------------SRVIILLTDGVNNAG--IE 219 Query: 349 NTIKICDKAKENFIKIVTISINASP-----------------------NGQRLLKTCVSS 385 ++ + A IK+ TI + L + Sbjct: 220 TPLEAAELASRLGIKVYTIGAGTDGVAPVRVTNPLTGAEELRPMPVEIDEATLEAIAEHT 279 Query: 386 PEYHYNVVNADSLIHVFQNISQ----LMVHRKYS 415 ++ + D L V++ I + + R+ Sbjct: 280 GGRYFRATDGDGLRQVYEQIDRLERTEISERRLR 313 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMLSVDLSGSMQIEDM----------VIDGKVVDRFTLIQHVISDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V QY+ L+ K T A+ + ++ Sbjct: 128 DRIGLILFADHAYLQSPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGVKRFDKVEQS 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN + A + I I TI + A Sbjct: 188 N------------------RVLILLTDGSNNAGAIT--PEQASQIAAQRGITIYTIGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + + ++ N + L ++Q I L + Sbjct: 228 DVMERRTLFGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQVIDTLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RMRPATSRALVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 81/246 (32%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMLAVDLSGSMQIEDM----------VIDGKVVDRFTLIQHVVSEFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLVLLTDGSNNAG--NIEPQQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ ++ N++ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLQQIADATHGRYFRARNSEELEQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 44/483 (9%), Positives = 107/483 (22%), Gaps = 104/483 (21%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES---------- 55 R + ++ + + +++FAL+V+ + L+G + K +++ Sbjct: 6 RLKGRATALASNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGV 65 Query: 56 --ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A V+ +++ A + ++ SG + Sbjct: 66 VTAKEFIAANAQQSDVTASGIKAGESQALKAFNANASKVPFATVSLSQLEIVRSGQTLDA 125 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + +N ++ + + + + Sbjct: 126 TV---SYTATVQSTFGRTFGLSATTLTNRVNASVDLASYLDFYLMVDVSGSMGLPTKDSD 182 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + H + K + + A+ Sbjct: 183 AEALAMQSKEKQGN------------CQFACHFPDSVGWTK------AAGKIQLRSDAVN 224 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 NA+ L V +G+ + ++ + T + + P Sbjct: 225 NAVCELLKRA-STPVVPNQYRIGIYPFINQLATLAPLT-DTTTSLAALRTAAQCDKIWPL 282 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQG----------------VKIPSLPFQKFIIFL 336 T + L ++ T S + F+ + Sbjct: 283 AFTNLLDTGSTQLFTNNDPKTGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLI 342 Query: 337 TDGENNNFKSN-----------------------------VNTIKICDKAKENFIKIVTI 367 TDG N+ + C K I + Sbjct: 343 TDGMQNSQSYSAWKDTKTFSGNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVL 402 Query: 368 SINAS----------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + P L+ C S P + Y D + + Sbjct: 403 YIPYNIVKNYNNDSYIVWENGRVNQFSPTLADPLRKCAS-PGFFYTANTQDDITASLGAM 461 Query: 406 SQL 408 + Sbjct: 462 FKQ 464 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 36/252 (14%), Positives = 77/252 (30%), Gaps = 62/252 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + + ++ A K+ L F+ + Sbjct: 89 DLLIALDLSGSMQVTDMALN----------GQPANRLEAAKSVLSDFIQE-------RRG 131 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+I + ++ S+ T+ + Q V + T A+ + L Sbjct: 132 DRIGIIVFGSKAYLQAPLSFDTKTINQLVQEAQIGFAGEQTAIGDAIGLGIKRLEDK--- 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K +I +TDG N + + A +KI TI I A Sbjct: 189 ---------------PSDKKVLILMTDGANTAGRV--QPQQAATFAASQNVKIHTIGIGA 231 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI-------S 406 + L + ++ + + L ++Q + + Sbjct: 232 DSMIVQSFFGPKAINPSSDLDETLLKNIAAQTGGEYFRAKSTEDLQAIYQTLDALEPTPA 291 Query: 407 QLMVHRKYSVIL 418 + + R + + Sbjct: 292 EDIWQRPLTSLF 303 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RLRPANSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 67 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 109 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 110 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL------ 163 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 164 ------------RLRPANSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 209 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 210 DPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 256 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RLRPANSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSALGLSPSLDLDEPTLKEIASISGGQYFRARDGDQLEKI 280 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 69/201 (34%), Gaps = 31/201 (15%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + ++ +K+ + F+ + + MGL+ + + ++ + + V+ Sbjct: 105 NPAASRFDVVKSIVKDFISQ-------RTNDNMGLVVFGSYSFIASPLTYDKHILSRIVS 157 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + K T A+ Q +L S K I LTDG + Sbjct: 158 QLEVGMAGKYTALYEALAQGVNLL------------------KMSKAKSKVAILLTDGYS 199 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLI 399 + + D AK+ +K+ I I N LLK + + NA L Sbjct: 200 TAGADKIPLDVVLDMAKKEGVKVYPIGIGGPDEYNRAVLLKIAKETGGVAFGASNASQLK 259 Query: 400 HVFQNISQ----LMVHRKYSV 416 V++ I + + + +S Sbjct: 260 EVYKKIDELEKSEIKNETFSY 280 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 88/245 (35%), Gaps = 57/245 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +DLSGSM I D+ +++ +KNA F+ + Sbjct: 91 NIMMALDLSGSMEIPDM----------ILHDRPASRLTVVKNAAEQFVR-------DRLG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + +R ++ + V + L K T A+ A + L + Sbjct: 134 DKIGLILFGSRAYLQTPLTYDRQTVLLRIEDATVGLAGKTTSIGDAVGLAVKRLDAV--- 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + II LTDG NN+ + +K + AK+ IKI TI + A Sbjct: 191 ---------------PQKGRVIILLTDGANNSG--ILEPLKAAELAKDEGIKIYTIGLGA 233 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL--MVH 411 + + L + + ++ + +L +++ I+QL + Sbjct: 234 ATDPRALTNGFLMQAAAADLDEETLKEMSAMTGGRYFRATDTATLNSIYKTINQLERVSQ 293 Query: 412 RKYSV 416 + SV Sbjct: 294 EQASV 298 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM ++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMDFPDMQWK----------NEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RERPAQSRVLVLITDGANNGGQI--HPLTAARLAAQEGVRIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 234 NPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGAELDAI 280 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM Q++ +++ +K + FL +E Sbjct: 91 DLLVAVDVSGSMDFPDMQW----------QNEDISRLDLVKALMGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RERPAQSRVLVLITDGANNGGQI--HPLTAARLAAQEGVRIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 234 NPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHDGAELDAI 280 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 98.8 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 72/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDDEISRLELVKKLFGDFIE-------GRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + + A E +KI TI I A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQIS--PQTAARLAAEERVKIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L ++ ++ ++ L + Sbjct: 234 DPQQGGVIGLFGLNPGLDLDEPVLRGIAETTGGEYFRARSSAELESI 280 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 23/236 (9%), Positives = 70/236 (29%), Gaps = 41/236 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + V+D+SGSM +K++ LK A+ + ++ Sbjct: 201 DRAPIDLVAVLDVSGSMA--------------------GSKLSLLKRAVCFLIQNLGPSD 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQIL 305 + ++ +++ + ++ R+ T+ +K+ ++L Sbjct: 241 ------RLSIVSFSSTARRIFPLRRMSDNGREAAGLAINSLTSSGGTNIVEGLKKGVRVL 294 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +++ + + T ++ + ++ I + Sbjct: 295 EERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNP---------RQGRQAIIPVH 345 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSV 416 T + + + S + + ++ F I L+ + +V Sbjct: 346 TFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAFAMCIGGLLSVVAQELRLTV 401 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 77/237 (32%), Gaps = 53/237 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + LVVD+S SM + Q ++ ++ A+K L F+D Sbjct: 84 PTTGRDLMLVVDISPSMDEPDM----------VRQGRRINRLQAVKQVLAEFIDQ----- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + ++ ++ V + + T A+ A + L Sbjct: 129 --RQGDRLGLILFGSQAYVQAPLTFDRTTVNILLQEAGLGMAGNATAIGDAVGLAVKRL- 185 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 Q+ I LTDG N + K + A+ + +++ T Sbjct: 186 -----------------RERPLEQRVAIVLTDGANTAGEIT--PDKASELAQASAVRLYT 226 Query: 367 ISINASPNG----------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I I A + L + + ++ N L ++ +I+Q Sbjct: 227 IGIGAGADSAITGLLQRNPSRDLDEALLTRMAQQTGGQYFRARNLAELGGIYTSINQ 283 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKNDE----------VSRLVLVQQLLGDFLE-------SRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ + VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFLQAPLTYDRQTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + +KI TI I + Sbjct: 188 ------------RMRPANSRVLVLVTDGANNAGQID--PLTAARLAADEGVKIYTIGIGS 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L + S ++ + D L + Sbjct: 234 DPEKNALQSALGLSASLDLDEPTLKEIARLSGGQYFRTRDGDQLEKI 280 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 18/199 (9%), Positives = 56/199 (28%), Gaps = 9/199 (4%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R Y + + +K +I AL ++ + + + + + + + + + AA LA Sbjct: 2 RTHSYQNRSLHKQKGVAAIWMALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALAVT 61 Query: 66 SKMVSNLSRLGDRFESISNHAK-RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + S +++ A+R + + + + + Sbjct: 62 IEDKPGAASVMAENYVRSYVRDIKSINVQAERREPGNSRNEEAADFIQYTVNATTTHDSW 121 Query: 125 SRISMTHMANNRLDSSNNTIF-----YNMDVMTSYDYRLQFIEHLL---NQRYNQKIVSF 176 S + D + ++ + F + K + Sbjct: 122 FANSFIPSFDETQDIAGRSLARKYLSSVGGKNIDIVFVSDFSGSMNFDWMDPNGNKKIDD 181 Query: 177 IPALLRIEMGERPIFLIEL 195 + +R + + Sbjct: 182 LKTAIRAISNKFICQDVRN 200 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 65/190 (34%), Gaps = 14/190 (7%) Query: 229 AALKNALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ +F+D ++ + + L G ++ + + + Sbjct: 331 DVDESVNTMFIDKSKEKSNYYRVAPDIKLFGVPEDFDELQFNNISLTNRSEEIKAIDLMW 390 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE----NNN 343 T + + + QIL K + K + L+DGE +N Sbjct: 391 ANGATAAFQGILRGSQILYDGKPDGSDQQELQAYNDK-----LKIMFILSDGEETPHDNG 445 Query: 344 FKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ +CDKA++ + I I I+ Q K CV E ++++ + + Sbjct: 446 ILKSLVKAGMCDKARDKIPGLYIGFIGIDFHATQQDAFKQCVLDSEQ--DIIDVHDVDEL 503 Query: 402 FQNISQLMVH 411 + I +L+ Sbjct: 504 IEKIEELIGK 513 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 71/227 (31%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM Q ++ T++ +K L F++ + Sbjct: 91 DLLLAVDVSGSMDYPDMQW----------QGEELTRLELVKVLLGDFIEQ-------RHG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + T A+ A + L Sbjct: 134 DRVGLILFGSKAYLQSPLTFDRRTVRVWLDEASVGIAGSNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + A E ++I TI I A Sbjct: 188 ------------RERPANSRVLVLVTDGANNGGEIEPLL--AATLAAEENVRIHTIGIGA 233 Query: 372 -----------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + L + ++ +++ L + Sbjct: 234 VPEEGGVLSRFGFNPGLDLDEPTLRAIAEQTGGEYFRAASSEQLQAI 280 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM ++ +++ +K L FL +E Sbjct: 44 DLLVAVDVSGSMDFPDMQWK----------NEDISRLDLVKALLGDFLQ-------DREG 86 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 87 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL------ 140 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 141 ------------RQRPAQSRVLVLITDGANNGGQI--HPLTAARLAAQEGVRIYTIGIGA 186 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 187 NPEASGTPGLLGLNPSLDLDEAALREIGEITHGAYFRAHDGAELDAI 233 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 74/236 (31%), Gaps = 57/236 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD S SM + ++ +K+ + F+ +++ Sbjct: 94 DLMLAVDTSQSMEIQDM----------RLHGEPVDRLTVIKSVVDDFISH-------RKN 136 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ + VR + + T A+ A + L Sbjct: 137 DRIGLILFGTQAYLQTPLTFDHKTVRTLLNESRIGIAGGQTAIGDAIGLALKRL------ 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K +I LTDG N + ++ + A +KI T+ + A Sbjct: 191 ------------KNHKTGSKVLILLTDGANTAGSVS--PVQAAELAARQGMKIYTVGVGA 236 Query: 372 SP--------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + K + ++ N D L ++Q+I + Sbjct: 237 DEMRIPGVLGFGSQIVNPSADLDEVTMKKIASLTGAQYFRARNTDELRRIYQHIDK 292 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 51/393 (12%), Positives = 101/393 (25%), Gaps = 54/393 (13%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 AS ++ + + + I N E + S V T + Sbjct: 1004 VASNEKWTVTVTKENNNVTTTLKNSSGTAVKDNKILNETPEEIINSSMVTSKTAKVKDWD 1063 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN-QKIVSFIPALLR 182 +T A + S ++I + L + + Sbjct: 1064 QRTYDITINATST---STSSIIETKTSVADIMLVLDVSGSMGEDITSYSYTFVANNTSEA 1120 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPED------------------VNSA--PICQD 222 + + + +++ GS + N+ Sbjct: 1121 RDDKKLLNRNVTYYIEVDGSYKEMWYYSSYNKGWRVGPRGSSDDAAKDKYNNCKIYTRTS 1180 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------------- 269 T++ ALKNA+ F+D S +G+ +++ + N Sbjct: 1181 TTETRLDALKNAVNQFIDDTAKKSPNS---KIGITVFSSTDDYNRPYGNHGTSVSLGEVG 1237 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + KV + D TD ++ A L + + K Sbjct: 1238 TADSAKVTELKNFVKDLKANGGTDPAVGLEDAKNKLDAMVDTNPKYVVLFTDGKPTGGGN 1297 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLK------ 380 + + E + D AK + TI + + L Sbjct: 1298 KWNSNAQKNAETQAGELKTGLRNNVDNAKNP-YTVYTIGFALNDEGDRAKTFLSGGTYDG 1356 Query: 381 ----TCVSSPEYHYNVVNADSLIHVFQNISQLM 409 SS + +A SL +FQ+IS + Sbjct: 1357 KKDPGIASSSDCAKTADDAASLTQIFQSISSTI 1389 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 57/202 (28%), Gaps = 18/202 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 TK+A LK A+ + ++ + +I +++ + T +Q Sbjct: 302 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQ 355 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 356 SLLAVNSLTSNGGTNIAEGLRKGSKVIEERQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 414 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G +T I + A + L +S + ++ Sbjct: 415 GAPEYCALLPSTNG------NQQIPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 468 Query: 399 IHVFQNISQLMVHRKYSVILKG 420 F +Q + SV+ +G Sbjct: 469 QDAF---AQCIGGL-LSVVAQG 486 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 77/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+SGSM + ++ A+K + F+ +E Sbjct: 91 DILLAVDISGSMEREDM----------QLSGQTVNRLMAVKAVVGNFVTE-------REG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ + ++ + T A+ + + L Sbjct: 134 DRLGLILFGEKAYLQTPLTFDRKTMQTLLYEAQLGFAGNGTAIGDAIGLSVKRLQQ---- 189 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + +I LTDG NN + + +K + A +KI TI + Sbjct: 190 --------------RPENHRVVILLTDGANNAGELD--PLKAAELASSAKVKIYTIGVGA 233 Query: 371 -----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A + Q L ++ ++ N + L+ ++Q +++ Sbjct: 234 ETQEAWGLFGKRVTNPSADLDEQTLTAIAEATGGQYFRARNPEELMAIYQELNR 287 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 74/233 (31%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM A Q + ++ +K+ L F++ + Sbjct: 91 DLLLAVDVSGSMEYADMHW----------QGESIGRLELVKHLLGQFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTWLEEAAIGIAGKDTAIGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG N + A ++I TI I A Sbjct: 188 ------------RQRPAQSRVLILVTDGANTAGEI--APSVAARLAAAEGVRIHTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 P + L + ++ +++ L + + +++ Sbjct: 234 DPRQDGPPGLLGLTPGLDLDEPTLRAIAEETGGSYFRARSSEELRAIEETLAR 286 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 75/227 (33%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D++ +++ +K L FL +E Sbjct: 91 DLLVAVDVSGSMDFPDMQWK----------DEEVSRLDLVKALLGDFLQ-------DREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + + A + ++I TI I A Sbjct: 188 ------------RQRPAQSRVLVLITDGANNGGRI--HPLTAARLAAQEDVRIYTIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +P + L + + ++ + L + Sbjct: 234 NPEASGTPGLLGLNPSLDLDEASLKEIADLTHGAYFRAHDGAELDAI 280 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 29/237 (12%), Positives = 71/237 (29%), Gaps = 39/237 (16%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R + V+D+SGSM TK+A LKNA+ + ++ Sbjct: 233 SASSRAPLDLVTVLDVSGSMA--------------------GTKIALLKNAMSFVIQTLG 272 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +I +++ + T +Q + + T+ +K+ Sbjct: 273 PND------RLSVIAFSSTARRLFPLRRMTLAGRQQALQAVSSLVASGGTNIADGLKKGA 326 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +++ + ++ + + L + + + + Sbjct: 327 KVIEDRRLKNPVCSIILLSDGQDTYTLPSDRNLL---DYSALVPPSILPGT-----GHHV 378 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +I T + + + S + S+ F +Q + SV++K Sbjct: 379 QIHTFGFGSDHDSAAMHAIAEISSGTFSFIDAEGSIQDGF---AQCIGGL-LSVVVK 431 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 98.0 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN+ N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNSG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 98.0 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 41/227 (18%), Positives = 74/227 (32%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRFWLDEAKIGIAGKNTALGDAIGLGLKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I A Sbjct: 188 ------------RLRPATSRVLVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGA 233 Query: 372 SPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 P+ L LK S S ++ + D L + Sbjct: 234 DPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|148974876|ref|ZP_01811856.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] gi|145965385|gb|EDK30634.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] Length = 418 Score = 98.0 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 40/419 (9%), Positives = 103/419 (24%), Gaps = 39/419 (9%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRFESISNHAKRA 89 ++ + F + + + A A LA + + + R+ Sbjct: 1 MMVIFMAFSMQMSQQMLAHTRLLEAAEVASLALIASPREDEENNVKYARYLVDRYVVDNT 60 Query: 90 LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD------SSNNT 143 D + + ++ +V ++ + +A ++ S Sbjct: 61 DDVDVAVYTSICEYKDGCVQASGELAPFSDFVVRATAKYTSWIAYEDVNLKPEFSVSGRA 120 Query: 144 -IFYNMDVMTSYDYRLQFIEHLLNQRYNQKI-VSFIPALLRIEMGE---------RPIFL 192 + + F + N N K+ + + ++ + + + L Sbjct: 121 VTRKYLPQPVDVYFIGDFSGSMGNPWKNGKMKLDVVKETIKRVVDDIEEFNSEEKSRVAL 180 Query: 193 IELV----------VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + V L+ + A + A +M ++ Sbjct: 181 LGYNPLHVKQSNEIVRLNAYGYRASWRKKHAYDYARNSPATTVRRMFDEPTLYNEIIEPS 240 Query: 243 DLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +S + + K +I + + R + T S + Sbjct: 241 HGMSRYEVERLYKRNNDFDDYFKFYDIPLTEDYDNFRAQLMSA-QLKAGGGTSSWNGIIA 299 Query: 301 AYQILTSDKKRSFFTNFF--RQGVKIPSLPFQKFI--IFLTDGENNNFKSNVNTIKIC-D 355 A Q + F G QK + + Sbjct: 300 AAQEANKATSLNPEQVFIVLSDGSDSDKTYLQKLVDQGLCKKLRSTISAKRNRFQSNAPT 359 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH-VFQNISQLMVHRK 413 +A++ + + I IN C + Y+ + + + + I++ K Sbjct: 360 EAEKTKVTMGVIGINYRVQASDGFGDC-FGKKNIYHAKDGEDVYKYILNLINEETGKLK 417 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 98.0 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 73/222 (32%), Gaps = 54/222 (24%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 VD+SGSM D +++ ++ L FL+ ++ +GL Sbjct: 2 VDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKGDRVGL 44 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 I + T+ ++ VR ++ + K T A+ A + L Sbjct: 45 ILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------- 93 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + ++ +TDG NN + + I A E +KI I I + P+ Sbjct: 94 -------RMRPATSRALVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGSDPDKD 144 Query: 377 RL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 L LK S S ++ + D L + Sbjct: 145 ALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 186 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 31/292 (10%), Positives = 79/292 (27%), Gaps = 28/292 (9%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + SS I N + + +N V + Sbjct: 204 STERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQ 263 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + S S ++ S + TK+A LK A+ + S+ Sbjct: 264 TNMQPTSQSCRAPVDLVTVLDVSGSM----AGTKLALLKRAMGFVIQSLGPCD------R 313 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +I +++ + T+ RQ + + T+ +++ +++ K ++ Sbjct: 314 LSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLISNGGTNIAEGLRKGAKVMLDRKWKN 373 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-----IKIVTI 367 ++ Q + G ++ + N I + Sbjct: 374 PVSSIILLSDG------QDTYTVCSPGGAHSRTDYSLLLPF--SIHRNGGTGFQIPVHAF 425 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + +S + + F +Q + SV+++ Sbjct: 426 GFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAF---AQCIGGL-LSVVVQ 473 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 53/231 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMGEGDM----------VLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L Sbjct: 144 RIGLLVFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRL------- 196 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 197 -----------RSQPEGQRVLILLTDGVSNAGV--LEPLRAAEVARAEGVRIHTVAFGGD 243 Query: 371 --------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + + + L ++ + + Sbjct: 244 GSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAGIYAELDR 294 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L +++ I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYKEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM D D ++ A+K + F +E Sbjct: 86 DLMLVLDLSYSMSKEDMLDDGDYVD----------RLTAVKKVVSDFASK-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + + V + + LI + T + A + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRKTIAEQVNQLVLRLIGEKTAIGEGIGLATKTF------ 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ ++ L+DG N + + ++ AK+ I TI + A Sbjct: 183 ------------VDSDAPQRVMVLLSDGSNTSGVLD--PLEAAKIAKKYNATIYTIGVGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L+ + ++ +A L ++ I+ Sbjct: 229 GEMVVKEFFMTRKVNTAQDLDERTLMDIAQVTGGQYFRARDAKELATIYDTINS 282 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 48/159 (30%), Gaps = 9/159 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R +F + I + +II L + LL +G I + + + ++SA ++A+LA Sbjct: 1 MTRRLSFFSRLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A+K + + + K + + S + Sbjct: 61 AAAKYKHGMPEASISGTVNAFLSAN---GTLKSAVIGKPEVSSDEAELCLDVGDA----- 112 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFI 161 M + S + V L Sbjct: 113 VPTTFMQVANIQSVPISVRSCAALPGVKQLEIALVLDVS 151 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 78/232 (33%), Gaps = 43/232 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+PI + L +D+S SM +D K +++A+K+ + F+ Sbjct: 86 EQPIRDLMLAIDISQSMEATDYTDAN---------GAKSDRLSAVKSVVRDFIAR----- 131 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++D +GLI + T + + + + T A+ + L Sbjct: 132 --RKDDRIGLIVFGTGAYPQAPLTLDHASLLLLLDEVGIGMAGPNTALGDAIGLTIKALE 189 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +K +I LTDG N+ S + A+ N I + T Sbjct: 190 K------------------TPEQEKVLILLTDG--NDTSSAITPDHAAHLAQANGIVVHT 229 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I + L ++ + + +L V+ + +L H Sbjct: 230 IGIGDPQATGDAKVDLTTLQAIARTTGGQFFRADDRQALQQVYATLDRLTPH 281 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 67/216 (31%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D K ++ A KN F+ + MGL+ ++ + + Sbjct: 99 DVSGSMLARDLKPDRLTAAKNVASDFVK-------GRPGDRMGLVIFSGETFTQVPLTTD 151 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + LI T + A L S K Sbjct: 152 HGVMLNMLAEMKNGLIDDGTAIGDGLATAISRL------------------KDSEAISKV 193 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 +I LTDG NN + T + AK I++ TI + Sbjct: 194 VILLTDGMNNAGSVDPYT--AAEIAKLYGIRVYTIGVGSYGTAPYPVQTPFGTQIQQMKV 251 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ + L +++ I + Sbjct: 252 EIDEKLLASVASMTGGKYFRATSNQKLDEIYEEIDK 287 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 75/233 (32%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D+ ++++ +++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDFPDMQWN----------DEDVSRLSLVQHLLGDFLE-------SRDG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRVWLDEARIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + + + A +KI I I A Sbjct: 188 ------------RMRPAQSRVLILVTDGANNGGEID--PLTAAKLAASEGVKIYPIGIGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 P + L + ++ + L + + Q Sbjct: 234 DPEESGATALLGGNPTLDLDEPALKAIAEVTGGRYFRARDGKQLQAIKDTLDQ 286 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVASQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 31/292 (10%), Positives = 79/292 (27%), Gaps = 28/292 (9%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + SS I N + + +N V + Sbjct: 204 STERSSSTRDIDNNSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPLTSGRQNSGTNQ 263 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + S S ++ S + TK+A LK A+ + S+ Sbjct: 264 TNMQPTSQSCRAPVDLVTVLDVSGSM----AGTKLALLKRAMGFVIQSLGPCD------R 313 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + +I +++ + T+ RQ + + T+ +++ +++ K ++ Sbjct: 314 LSVISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNGGTNIAEGLRKGAKVMLDRKWKN 373 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-----IKIVTI 367 ++ Q + G ++ + N I + Sbjct: 374 PVSSIILLSDG------QDTYTVCSPGGAHSRTDYSLLLPF--SIHRNGGTGFQIPVHAF 425 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + +S + + F +Q + SV+++ Sbjct: 426 GFGTDHDATSMHAISETSGGTFSFIEAEGVIQDAF---AQCIGGL-LSVVVQ 473 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 58/202 (28%), Gaps = 18/202 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 TK+A LK A+ + ++ + +I +++ + T +Q Sbjct: 296 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRQQ 349 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ A+++ +++ + ++ + + Sbjct: 350 SLLAVNSLTSNGGTNIAEALRKGSKVIEERQAKNPVCSIILLSDGQDTYTVSPTAGV-HK 408 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G +T + + A + L +S + ++ Sbjct: 409 GAPEYCALLPSTNG------NQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAI 462 Query: 399 IHVFQNISQLMVHRKYSVILKG 420 F +Q + SV+ +G Sbjct: 463 QDAF---AQCIGGL-LSVVAQG 480 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 72/230 (31%), Gaps = 44/230 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P I L +DLS SM + A ++ A++ + F+ Sbjct: 91 EPQRDILLALDLSQSMDARDFPGADGKPLA---------RVEAVRQVVADFVGK------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GL+ + + E VR + + + T A+ A ++ Sbjct: 136 -RPGDRIGLVAFGDAPYPLAPFTMDHELVRTMIADTVPGMAGPRTSLGDALGLAIKMFGK 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K +I LTDG + + +K + AK + T+ Sbjct: 195 ------------------TTAPEKVLIVLTDGNDTASRMP--PLKAAEIAKSKGVIFHTV 234 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I + L K S+ ++ + L +++ + + Sbjct: 235 GIGDPAATGEDKLDTATLQKIAASTGGRYFFGGDQSQLAAIYEVL-DQIT 283 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 78/264 (29%), Gaps = 69/264 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + I L +D+S SM + +D K ++ ALK Sbjct: 78 SSSEITKTKTTEGIDIILSIDMSSSM---------------LAKDLKPNRIEALKRVAAQ 122 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + +G++ Y+ + + V Q + I T Sbjct: 123 FIQQ-------RASDRIGIVVYSGESYTKVPATTDKSIVLQALKEIRQGEIEDGTAIGMG 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L S K II +TDG NN + + + A Sbjct: 176 LGTAINRL------------------KDSKTKSKVIILMTDGVNNTGVID--PLSAAELA 215 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 KE I++ TI I + + + L + + ++ + Sbjct: 216 KEYGIRVYTIGIGTNGKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISKITGGKYFRATD 275 Query: 395 ADSLIHVFQNISQ----LMVHRKY 414 + L ++ I + + KY Sbjct: 276 NNKLAQIYTEIDKLEKSKIEELKY 299 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 76/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+DLS SM D+ +++A+KN + F+ +E Sbjct: 86 DLMLVLDLSYSMSQEDM----------QEGDQYIDRLSAVKNVVSDFVKQ-------REG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + E + V + LI T + A + + Sbjct: 129 DRLGLVLFADHAYLQTPLTLDRETISDQVNSLVLRLIGDKTAIGEGIGLATKTFVDSEA- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ ++ L+DG N + + ++ AK+ I TI I A Sbjct: 188 -----------------PQRVMVLLSDGSNTSGV--LEPLEAARIAKKYNATIYTIGIGA 228 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ NAD L ++ I+ Sbjct: 229 GEMMVKEFFMTRKVNTAKDLDEKTLKQIADLTGGQYFRARNADELATIYDTINN 282 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDSNA------------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKLNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|301133566|gb|ADK63405.1| C3HC4 type zinc finger protein [Brassica rapa] Length = 677 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 30/291 (10%), Positives = 78/291 (26%), Gaps = 44/291 (15%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + + + + ++ I P + R Sbjct: 190 PAESTKPGGVSGKLEVKTYPEISEVV---RSVSFKDFSVLINLKAPPVSSSSSSSRAPVD 246 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM TK+A LK A+ + ++ Sbjct: 247 LVTVLDVSGSMA--------------------GTKLALLKRAMGFVIQNLGPFD------ 280 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + +I +++ ++ TE +Q + + T+ + + ++L + + Sbjct: 281 RLSVISFSSTSRRSFPLRLMTETGKQEALQAVNSFVSNGGTNIAEGLTKGAKVLIDRRFK 340 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ + T G + N I + A Sbjct: 341 NSVSSIVLLSDGQDTYTMTSPTGSNTKGADYKTLLPKEV---------NRIPVHAFGFGA 391 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 + + +S + + + F I L+ + + Sbjct: 392 DHDASLMHSIAENSGGTFSFIESETVIQDAFAQCIGGLLSVMVQELRVKIE 442 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ D AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAADIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 79/230 (34%), Gaps = 34/230 (14%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + R++ R I L +D SGSM+ +K ++ K F+ Sbjct: 71 VDRVDPLNRNGKDIVLAIDASGSMNSTGF-----DFEGEAALPQKLSRFEIAKIVASEFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +G++ Y ++ + + ++ + + T A+ Sbjct: 126 QK-------RLSDNVGIVLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIA 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + K + K ++ LTDGE+N+ +++ AKE Sbjct: 179 MSLRAFKHSKAK------------------SKIVVLLTDGEHNSG--DISPKDALVLAKE 218 Query: 360 NFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 IKI TI + + L K S + NA L ++++I + Sbjct: 219 ENIKIYTIGMGNRGEADEALLKKIADESGGEFFYATNAKELKEIYEHIDE 268 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 70/227 (30%), Gaps = 54/227 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VD+SGSM QD + +++ +K F++ + Sbjct: 91 DLLLAVDVSGSMDYRDMRW----------QDDEISRLELIKKLFGDFIE-------DRRG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N + A E +KI TI + A Sbjct: 188 ------------RQRPAESRVLVLITDGANTGGQI--APQIAAQLAAEQQVKIYTIGVGA 233 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 P + L + ++ ++ L + Sbjct: 234 DPQQGGVPGLFGFNPGLDLDEPTLRGIAEITGGEYFRARSSAELESI 280 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDSNA------------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 53/231 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMGEGDM----------VLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRL------- 196 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 197 -----------RSQPEGQRVLILLTDGVSNAGV--LEPLRAAEVARAEGVRIHTVAFGGD 243 Query: 371 --------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + + + L ++ + + Sbjct: 244 GSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAGIYAELDR 294 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 47/409 (11%), Positives = 116/409 (28%), Gaps = 45/409 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM----V 69 + K + +I FAL L I I + + K++ + + L + Sbjct: 12 LKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVA 71 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNH---IKESLSGYSAVFYNTEIQNIVNSSR 126 +S+ R + ++ K IK ++ A + Sbjct: 72 DGMSKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHES 131 Query: 127 ISMTHMANNRLDSSNNTIFYNM--DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + +D + ++ + + + + I Sbjct: 132 YMTHAMGFDNIDYTADSESTISFGQGKYEFIFLVDVSPSMG-----------------IG 174 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R +++ + + H S + + R ++ +K+AL + ++ Sbjct: 175 ASNRDRQIMQRAIGCQFACHEPWYS-----SVSRAKSAGARLRIDVVKDALKSLVTQLEE 229 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS---LILKPTDST---PAM 298 + V D+ GL ++ + + G K ++ + L T+ Sbjct: 230 ATEV--DLRTGLYSFSNYLHIQTGLNKGISKFKREANKIAIHREYLRGGGTNFHGVFSDF 287 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L GV +L N + + CD+ K Sbjct: 288 NGVLRSLKPKADVKQHIIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFK 347 Query: 359 ENFI-KIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + T+ + ++ C +S ++ Y+ +A + Sbjct: 348 KGEVRTVHTMLVEPDRAHYVRASTSSMRACATSADFFYSANSAAEIDKA 396 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 53/522 (10%), Positives = 114/522 (21%), Gaps = 126/522 (24%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKMVSN--------------LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 L ++ + DR ++AK + L Sbjct: 66 LLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLY 125 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDV------------ 150 S + + V + + + + +I + Sbjct: 126 LRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQ 185 Query: 151 ------------------------------MTSYDYRLQFIEHLLN-------------- 166 + Sbjct: 186 TLVSSSQLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPN 245 Query: 167 ------------------------QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + F+ + + Sbjct: 246 RRIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNK 305 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTT 261 N S + + + N + F + + Sbjct: 306 QAEISNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYPFSLSSLSSRNFCLEN 365 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 K +W ++ + L T T + ++T Sbjct: 366 NKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDT------NKDPEAA 419 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN------------FIKIV 365 + ++ ++ L+DGE+N N +C+K KE +I Sbjct: 420 PNKLNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIA 479 Query: 366 TI--SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N + K CV + +Y V + L+ F+ I Sbjct: 480 FVALGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQI 519 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 72/232 (31%), Gaps = 54/232 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLS SM + ++ +K F+ + Sbjct: 93 MMLAVDLSASMSSPD----------LVQSGVPANRLQVVKRVADDFIAR-------RTGD 135 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI ++TR + VRQ + + + T A+ A + L Sbjct: 136 RIGLILFSTRAYVQAPLTLDRNVVRQLLAEASIGMTGRNTSIGDAIGLAVKTL------- 188 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + +I LTDG N + + ++ A + ++I TI + A Sbjct: 189 -----------RDRPAKDRVLILLTDGANTSGVLD--PMEAAAIAAKENVRIHTIGVGAD 235 Query: 373 P-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ N L ++ +I + Sbjct: 236 SNFTDIQPGMLMNPSGDLDEEALKKIAGLTGGQYFRARNDKGLAAIYADIDR 287 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTSQDLDEKTLQSIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 76/241 (31%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQTVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 I S Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 F------------------IDSDAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYNTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 53/522 (10%), Positives = 114/522 (21%), Gaps = 126/522 (24%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKMVSN--------------LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 L ++ + DR ++AK + L Sbjct: 66 LLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLY 125 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDV------------ 150 S + + V + + + + +I + Sbjct: 126 LRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKFWLPWGQ 185 Query: 151 ------------------------------MTSYDYRLQFIEHLLN-------------- 166 + Sbjct: 186 TLVSSSQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPN 245 Query: 167 ------------------------QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + F+ + + Sbjct: 246 RRIDALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNK 305 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTT 261 N S + + + N + F + + Sbjct: 306 QAEISNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYPFSLSSLSSRNFCLEN 365 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 K +W ++ + L T T + ++T Sbjct: 366 NKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDT------NKDPEAA 419 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN------------FIKIV 365 + ++ ++ L+DGE+N N +C+K KE +I Sbjct: 420 PNKLNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIA 479 Query: 366 TI--SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N + K CV + +Y V + L+ F+ I Sbjct: 480 FVALGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQI 519 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDSNA------------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDM----------VINGKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 70/220 (31%), Gaps = 47/220 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I V+D SGSM + + F+D + Sbjct: 25 DGINIAFVIDSSGSMFYNDPNGLR-------------------REVTHKFIDRLTDND-- 63 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +IG+ + + + EK+ V + T+ A+ AY + + Sbjct: 64 ----MAAVIGFDYKATVLEQFTSNKEKLHDAVDKI---RSDGGTNIGRAVSIAYDLFNNL 116 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + KF+I LTDG+ + + AK+ IKI TI Sbjct: 117 DNNRK-------------EKYPKFLILLTDGDGDYSEEYTIL------AKKAGIKIYTIG 157 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + L + +++ +A L +F+ I+ Sbjct: 158 LGNGVSEKLLKDIAKGTDGEYFHAKDASKLNKIFEKIADK 197 >gi|125575071|gb|EAZ16355.1| hypothetical protein OsJ_31817 [Oryza sativa Japonica Group] Length = 579 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 66/201 (32%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A +K A+ +D++ + ++ ++T + +E + Sbjct: 152 SGSMAGRKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRLLRMSEVGKAT 205 Query: 280 VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R ++SL+ T+ ++ A ++L + ++ ++ S + Sbjct: 206 AKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVILLSDGKDSYVVPRR------ 259 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G ++ V + I T A + + S+ V N ++ Sbjct: 260 GNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAI 319 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV ++ Sbjct: 320 QDSF---AQCIGGL-LSVAVQ 336 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 74/234 (31%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM + ++ A+K L F+ +E Sbjct: 85 DLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK-------REG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + + LI T + A + Sbjct: 128 DRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA- 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ +I L+DG N + ++ + AK+ I T+ + A Sbjct: 187 -----------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTIYTVGVGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L ++ +++ N L +++ I+Q Sbjct: 228 GEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 76/235 (32%), Gaps = 40/235 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + +++ A+K+ L F+ + +E Sbjct: 99 DVMVAVDLSGSMSEMDF---------SSSDGQAVSRLDAVKSVLHEFVAT-------REG 142 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ + Sbjct: 143 DRLGLILFGDAAYLQTPFTADHDVWLALLDQTEVAMAGQSTHLGDAIGLAIKVFEQSESS 202 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K ++ LTDG + S V A ++I I++ Sbjct: 203 ---------------KDKEKVVVVLTDGNDTG--SFVEPKDAAIVAAAKGVRIHVIAMGD 245 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + S + ++ ++L ++ I +L S + Sbjct: 246 PATIGEQALDMATIDNIASQSGGQAFQALDQEALQQAYRTIGELEPKLYESTQYR 300 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 35/306 (11%), Positives = 83/306 (27%), Gaps = 61/306 (19%) Query: 135 NRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + + S Y + L + + + + Sbjct: 4 DSITISGTATAEYQTAAFMDFYILLDNTPSMG-----------------VGATPDDVSKL 46 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 E + + H D N ++ ++ A D+ + Sbjct: 47 EAKAGCAFACHQM---DKTINNYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFR 103 Query: 254 MGLIGYTTRVE-----KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 MG+ + T+ E + KV+ Y +D + + + + + Sbjct: 104 MGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDA-VDLMTIPYQNYNNDQITNFDSAMTQ 162 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-------------CD 355 G ++ +K + F++DG +++K + T K C Sbjct: 163 MNTIIDQ----AGDGTSNISAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCK 218 Query: 356 KAKENFIKI---VTISINASPNG-------------QRLLKTCVSSPEYHYNVVNADSLI 399 K+ +KI T + N ++ C S P +++ V + + Sbjct: 219 PLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQGEIPTKMQACAS-PGFYFEVSPTEGIT 277 Query: 400 HVFQNI 405 + + Sbjct: 278 DAMKAL 283 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 26/269 (9%), Positives = 69/269 (25%), Gaps = 40/269 (14%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 ++ + P L R I V+D+SGSM Sbjct: 124 VSASESFSKFGVLVRVLAPPLDNTLPHHRAPIDIVNVLDVSGSMA--------------- 168 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQ 278 K+ LK A+ + ++ + ++ +++ + + + Sbjct: 169 ------GKLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRILPLRTMSGSGRED 216 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ T+ +++ ++L ++ + + + Sbjct: 217 AISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVASIILLSDGCDTQSHSTHNRL--- 273 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + +++++ I T + + S + + D L Sbjct: 274 --EYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHAISDVSGGTFSFIESIDIL 331 Query: 399 IHVF-------QNISQLMVHRKYSVILKG 420 F +I V K G Sbjct: 332 QDAFARCIGGLTSIVARDVQLKVRSASPG 360 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 69/204 (33%), Gaps = 45/204 (22%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + +++A+K L F+ ++ +G++ + + + V Q + Sbjct: 10 GEYIDRLSAVKKVLSDFVAK-------RKGDRLGVVLFGDHAYLQTPLTADRQTVIQQIK 62 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + L+ + T + + + S Q+ +I L+DG N Sbjct: 63 QTVIGLVGQRTAIGDGIGLGTKTF------------------VDSDAPQRVMILLSDGSN 104 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISIN------------------ASPNGQRLLKTCV 383 + I+ + AK+ I T+ + A + Q L K Sbjct: 105 TAGVLD--PIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAE 162 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 + ++ +A+ L ++ I++ Sbjct: 163 MTGGQYFRARDAEQLEKIYDTINK 186 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 82/246 (33%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + + ++N L F++ + Sbjct: 85 DLMLAVDLSGSMQIEDM----------VINGKTVDRFSLIQNVLGEFIER-------RNG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + ++ L+ K T A+ A + + Sbjct: 128 DRLGLILFADHAYLQAPLTQDRRSIATFLADAQIGLVGKQTAIGEAIALAVKRFDQVSES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG NN N+ + A + + I T+ + A Sbjct: 188 N------------------RVLVLLTDGSNNAG--NIEPDVAAEIAAKRNVTIYTVGVGA 227 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + + Y++ N++ L ++Q I QL + Sbjct: 228 ELMERRTIFGKERVNPSMDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQKIDQLEPVSR 287 Query: 414 YSVILK 419 ++ + Sbjct: 288 DNLSYR 293 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 45/200 (22%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ A+K L F+ +E +GLI + + + V + + + Sbjct: 109 DRLTAVKQVLSEFIAK-------REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVL 161 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 LI T + A + Q+ +I L+DG N Sbjct: 162 KLIGTQTAIGEGIGLATKTFIDSNA------------------PQRVMILLSDGSNTAGV 203 Query: 346 SNVNTIKICDKAKENFIKIVTISINA------------------SPNGQRLLKTCVSSPE 387 + ++ D AK+ I T+ + A + + L ++ Sbjct: 204 LD--PLEAADIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGG 261 Query: 388 YHYNVVNADSLIHVFQNISQ 407 +++ N L +++ I+Q Sbjct: 262 HYFRARNQQDLQNIYDTINQ 281 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 79/233 (33%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D++ +++A +++ L FL+ +E Sbjct: 91 DLLVAVDVSGSMDFPDMQWK----------DEEVSRLALVQHMLGDFLE-------GREG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAYLQAPLTFDRRTVRHWLDEARIGIAGKNTAIGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I +TDG NN + + + A + +KI I I A Sbjct: 188 ------------RQRPAHSRVLILVTDGANNGGEID--PLTAARLAADEGVKIYPIGIGA 233 Query: 372 -----------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ + + L+ + + Q Sbjct: 234 APEQSGTTGSLGVNPSLDLDEPTLKEIAEVTGGQYFRAQDGEQLLGIKTTLDQ 286 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 81/249 (32%), Gaps = 50/249 (20%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + IE+V+D SGSM + + ++ A+KN F+ + Sbjct: 67 TVSQTEGIAIEMVIDRSGSMQAMDFN----------IDGEPVDRLTAVKNVASKFITGGE 116 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQA 301 L + +GLI + + P+ V + + T A+ + Sbjct: 117 DLE-GRFSDLVGLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALS 175 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L + + K +I LTDGEN + + I+ + A+ Sbjct: 176 VEKLNA------------LDARQERKVQSKILILLTDGENTAGELD--PIQAAELAETLG 221 Query: 362 IKIVTISIN-----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSL 398 IKI I + + + L K + ++ + DSL Sbjct: 222 IKIYAIGVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSL 281 Query: 399 IHVFQNISQ 407 +++ I Q Sbjct: 282 DAIYREIDQ 290 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 92/291 (31%), Gaps = 64/291 (21%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMG---------ERPIFLIELVVDLSGSMHCA 206 + R K + + +L + + + VDLSGSM Sbjct: 39 LVADVSKSTPKARDMLKWLMWTALVLAVARPQWLGDPIELPSKGRDLMVAVDLSGSMQIE 98 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + +K + +++ + F++ + +GLI + Sbjct: 99 DM----------VLDNKTVDRFTLVQHVVSDFIER-------RVGDRIGLILFGDHAYLQ 141 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + V QY+ L+ K T ++ A + + ++ + Sbjct: 142 SPMTQDRRSVAQYLREAQIGLVGKQTAIGESIALAVKRFENLEESN-------------- 187 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--------------- 371 + ++ LTDG NN + K A E + I TI + A Sbjct: 188 ----RVLVLLTDGTNNAGSIS--PDKAAAIAAERKVTIYTIGVGAEMMERRSFFGRDRVN 241 Query: 372 ---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + ++L + ++ ++ +++ L ++Q I +L + + + Sbjct: 242 PSMDLDEEQLQRIANATQGKYFRARSSEDLAAIYQEIDKLEPVSRDTQSFR 292 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 79/246 (32%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + + + +++ L F++ ++ Sbjct: 84 DLMIAVDLSGSMQIEDM----------VINGQTVNRFTLIQHVLSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V ++ L+ K T A+ A + + Sbjct: 127 DRLGLILFADHAYLQAPLTLDRRSVATFLDDAQIGLVGKQTAIGEAIALAVKRFDKVDES 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ A + I I TI + A Sbjct: 187 N------------------RVLILLTDGSNNAG--NIEPEVAAQIAAKRNITIYTIGVGA 226 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L K + ++ N++ L ++Q I +L + Sbjct: 227 EILERRTIFGKERINPSMDLDEDQLKKLAAMTKGRYFRARNSEELASIYQEIDKLEPVSR 286 Query: 414 YSVILK 419 + + Sbjct: 287 DQLSYR 292 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 47/238 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P+ I L VD+S SM +D Q + ++ A+K + F+ Sbjct: 88 TEPVRDILLAVDISQSMDSVDFND---------AQGQPLSRWDAVKAVVADFIAQ----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + D +GLI + T + ++ + + T A+ ++L Sbjct: 134 --RADDRLGLIVFGTGAYPQAPLTRDHASLKLLLDEAAVGMAGPNTAVGDAIGLGIRMLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +++ K +I LTDG + A ++ + + T Sbjct: 192 AAEEQD------------------KVLILLTDGNDTGSAVPPARAAA--LAAQHHVTVHT 231 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS----QLMVHRK 413 I I + L + ++ + + +L V+ + + + + Sbjct: 232 IGIGDPAATGEDRVDFDILREVARTAGGQFFPARDLATLREVYATLDRITPREIKTLR 289 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 76/239 (31%), Gaps = 60/239 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + ++ A+K L F+ ++ Sbjct: 91 DLLLAVDLSGSMKIEDMEVN----------GDRVPRIVAVKTVLNEFIQR-------RKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKK 310 +GLI + ++ ++ V++++ + T A+ + + L Sbjct: 134 DRLGLILFGSQAYVQAPLTFDQTTVQRFMREAQIGFAGEENTAIGDAIGLSVKRL----- 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + +I LTDG+NN K N A N I I TI + Sbjct: 189 -------------RDRPGDRHVMILLTDGQNNGGKINPIP--ASKIAANNGIIIYTIGVG 233 Query: 371 ASP----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A + + L + ++ ++ N L +++ + Q Sbjct: 234 ADEMVMPGVLGSSFGSRRVNPSADLDEKTLQQVATATGGQYFRARNPQELEKIYRLLDQ 292 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 69/230 (30%), Gaps = 44/230 (19%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P I L +DLS SM + A ++ A+K + F+ Sbjct: 91 EPQRDILLALDLSQSMDTKDFPGADGKPLA---------RVDAVKQVVADFVGR------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GL+ + + V + + + T A+ A ++ Sbjct: 136 -RPGDRIGLVAFGDAPYPLAPFTMDHALVETMIADAVPGMAGPRTSLGDALGLAVKMFEK 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K +I LTDG + + +K + AK + + + Sbjct: 195 ------------------TTVPEKVLIVLTDGNDTASRMP--PLKAAEIAKSKGVVVHAV 234 Query: 368 SINAS-------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I + L K + ++ + L ++Q + + Sbjct: 235 GIGDPLATGEDKLDTATLQKIAEKTGGRYFFGGDQAQLASIYQVL-DQIT 283 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 78/234 (33%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + A ++ + F++ ++ Sbjct: 91 DLMLAVDLSGSMQIEDM----------VLNGKAVDRFAMVQQVMSEFIER-------RKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++T L+ K T A+ A + K+ Sbjct: 134 DKLGLILFADHAYLQAPLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIALAVKRFDKAKQS 193 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN+ + D A + + I TI + A Sbjct: 194 N------------------RVLILLTDGSNNSGSIT--PEQAADIAAKRGVTIYTIGVGA 233 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + +L + ++ N+D L ++Q I + Sbjct: 234 EVMERRTLFGKERVNPSMDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQEIDK 287 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMLAVDLSGSMQIEDM----------VINGKVVDRFTLIQHVVSEFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN + + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAGNID--PDQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIAEVTHGRYFRARNSQELEQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFITK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRVGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 I S Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 F------------------IDSDAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYQTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ ++ N L ++ I+ Sbjct: 221 YTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 30/322 (9%), Positives = 79/322 (24%), Gaps = 9/322 (2%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 II A ++ IG + K + + + A LA A + S R + Sbjct: 2 IITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEFSKLGSEQDARKAAKK 61 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + +K+ + + M L+ + + Sbjct: 62 AFDAQVSQ---LNLHGAKLKKLNIVTDDETMKVSVDAVFELPTTLMQIAGFKTLEVATRS 118 Query: 144 IFYNM--DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 +V+ + + + + + + ++++ Sbjct: 119 DAVGGGQEVILDIVMCIDATGSMGATLRSVQRNALSFEANLKNRLKELGRQVDIIRVRPI 178 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + + + K+ + F+DS + + GL + Sbjct: 179 YYWDYDYDG---WSRSYGLKKSTFLKLPDQRTQFKNFVDS-ESAYGGGDWPEAGLECFNE 234 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + + T + V ++ + + S+ RS+ Sbjct: 235 GLRSSWFKTTNTRQSVFPVVALWTDAPADSPNNWRNINSSGTPYPSNMPRSWGGLEAAWN 294 Query: 322 VKIPSLPFQKFIIFLTDGENNN 343 K +F + N Sbjct: 295 NGSLISQDNKMFVFFGNQSQNG 316 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 75/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLS SM D +++A+K L F+ ++ Sbjct: 35 DMMLVVDLSYSMSKEDMQFNGDY----------IDRLSAVKQVLSDFISK-------RQG 77 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + V + + + + LI T + A + Sbjct: 78 DRLGLVLFADHAYLQTPLTLDRHTVAEQLNQTVLRLIGTKTAIGEGIGLATKTF------ 131 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S Q+ +I L+DG N + I+ AK+ I T+ + A Sbjct: 132 ------------VDSDAPQRVMILLSDGSNTAGVLD--PIEAAKIAKKYNATIYTVGVGA 177 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L++ + ++ ++ L ++ I+ Sbjct: 178 GEMMVKEFFMTRKVNTAQDLDEKSLMEIAKLTGGQYFRARDSKELATIYDTINN 231 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 96.5 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|125532270|gb|EAY78835.1| hypothetical protein OsI_33940 [Oryza sativa Indica Group] Length = 606 Score = 96.5 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 66/201 (32%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A +K A+ +D++ + ++ ++T + +E + Sbjct: 152 SGSMAGRKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRLLRMSEVGKAT 205 Query: 280 VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R ++SL+ T+ ++ A ++L + ++ ++ S + Sbjct: 206 AKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVILLSDGKDSYVVPRR------ 259 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G ++ V + I T A + + S+ V N ++ Sbjct: 260 GNGMSYMDLVPPSFASSGGRGRLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAI 319 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV ++ Sbjct: 320 QDSF---AQCIGGL-LSVAVQ 336 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 96.5 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 62/216 (28%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ A K F+ + +GL+ + + Sbjct: 98 DVSTSMLARDFTPDRLEAAKEVATKFI-------LERPQDKIGLVVFAGESFTQCPLTTD 150 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + +I T + A L S K Sbjct: 151 QAVLVNLLREVKSGMIQDGTAIGLGLANAVNRL------------------KDSPGKSKV 192 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 +I LTDG NN + + AK I++ TI + Sbjct: 193 VILLTDGINNQGAI--APVTAAELAKAFGIRVYTIGVGTYGEAPYPVPTPFGVQLQNMPV 250 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ + D L ++ I Q Sbjct: 251 EIDEGVLQQIANVTGGKYFRATDNDKLQQIYSEIDQ 286 >gi|22758317|gb|AAN05521.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432564|gb|AAP54179.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 606 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 66/201 (32%), Gaps = 17/201 (8%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A +K A+ +D++ + ++ ++T + +E + Sbjct: 152 SGSMAGRKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEASRRTRLLRMSEVGKAT 205 Query: 280 VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R ++SL+ T+ ++ A ++L + ++ ++ S + Sbjct: 206 AKRAVESLVDDSATNIGDGLRVAGRVLGDRRHKNAVSSVILLSDGKDSYVVPRR------ 259 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G ++ V + I T A + + S+ V N ++ Sbjct: 260 GNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAI 319 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F +Q + SV ++ Sbjct: 320 QDSF---AQCIGGL-LSVAVQ 336 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 35/242 (14%), Positives = 77/242 (31%), Gaps = 45/242 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + P + L VD+S SM + + +++ +K+ L F+ Sbjct: 81 VQSFPSRDLLLAVDISQSMQIKDMTIN----------GEAVDRLSMVKSYLQSFIKQ--- 127 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +G+I + + + + + L K T A+ A + Sbjct: 128 ----RQGDRIGIILFADHAYLMVPFTQDWQAAGLLLDEVNIGLAGKFTAIGEAITLAVKK 183 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + P K +I L+DG+ + + + AK + +KI Sbjct: 184 TLHE----------------PKPIQNKTLILLSDGK--DSINTIQPTDAAALAKASGLKI 225 Query: 365 VTISINASP---------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 TI I + + L + + ++ + L ++Q I L+ Sbjct: 226 YTIGIGSDSTDAEAESDLDETTLEEIANMTGGQYFRARSEQDLSEIYQQI-NLIEPLSLK 284 Query: 416 VI 417 + Sbjct: 285 TV 286 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + K + ++ + F++ ++ Sbjct: 84 DLMMAVDLSGSMQIEDM----------VLDGKTVDRFTMIQAVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFD----- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN+ + + A + +KI +I + A Sbjct: 182 -------------RVDESNRILVLLTDGSNNSGSIS--PEQAAAIAAKRGVKIYSIGVGA 226 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L ++ ++ NA L ++Q I +L + Sbjct: 227 EVMERRTLFGKERVNPSMDLDETQLTALAQTTGGLYFRARNAQELESIYQEIDKLEPISR 286 Query: 414 YSVILK 419 + + Sbjct: 287 DQLSYR 292 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 61/182 (33%), Gaps = 27/182 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ +K + F+ +++ MGL+ + ++ + + +++ Sbjct: 109 SRFDVVKEIVSDFISQ-------RKNDNMGLVVFGAYSFIASPLTYDVNILNKILSQLQI 161 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + K T ++ Q +L S K I LTDG + Sbjct: 162 GMAGKYTALNTSLAQGANLL------------------KQSKSKTKIAILLTDGYSTPQV 203 Query: 346 SNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + D K+ IK+ I I N + LLK S + +A L V++ Sbjct: 204 DTITLDIALDMIKKEGIKVYPIGIGMPHEYNTEALLKIANESGGVAFGASSAAELQEVYK 263 Query: 404 NI 405 I Sbjct: 264 KI 265 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 79/256 (30%), Gaps = 37/256 (14%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + +Q R + V+D+SGSM Sbjct: 248 PSANPDQGTGKLANESSAGSSRNRAPVDLVTVLDVSGSMA-------------------- 287 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 TK+A LK A+ + + + +I +++ + +Q + Sbjct: 288 GTKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQRISHHGRQQALQAI 341 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ A+K+A +++ ++ + + + G + + Sbjct: 342 NSLGASGGTNIADALKKAMKVIEDRSYKNSVCSIILLSDGQDTYNISSSV----QGASPD 397 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +KS + I + A+ + + + A + L +S + + + F Sbjct: 398 YKS-LVPSSIINDAR-HTVPLHAFGFGADHDSDSLHSIAQASGGTFSFIEDEGVMQDAF- 454 Query: 404 NISQLMVHRKYSVILK 419 +Q + S++++ Sbjct: 455 --AQCIGGL-LSIVIQ 467 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 79/256 (30%), Gaps = 37/256 (14%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + +Q R + V+D+SGSM Sbjct: 248 PSANPDQGTGKLANESSAGSSRNRAPVDLVTVLDVSGSMA-------------------- 287 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 TK+A LK A+ + + + +I +++ + +Q + Sbjct: 288 GTKLALLKRAMGFVVQHLGPSD------RLSVIAFSSSARRLFHLQRISHHGRQQALQAI 341 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ A+K+A +++ ++ + + + G + + Sbjct: 342 NSLGASGGTNIADALKKAMKVIEDRSYKNSVCSIILLSDGQDTYNISSSV----QGASPD 397 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +KS + I + A+ + + + A + L +S + + + F Sbjct: 398 YKS-LVPSSIINDAR-HTVPLHAFGFGADHDSDSLHSIAQASGGTFSFIEDEGVMQDAF- 454 Query: 404 NISQLMVHRKYSVILK 419 +Q + S++++ Sbjct: 455 --AQCIGGL-LSIVIQ 467 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 81/249 (32%), Gaps = 50/249 (20%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + IE+V+D SGSM + + ++ A+KN F+ + Sbjct: 79 TVSQTEGIAIEMVIDRSGSMQALDFN----------IDGEPVDRLTAVKNVASKFITGGE 128 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQA 301 L + +GLI + + P+ V + + T A+ + Sbjct: 129 DLE-GRFSDLVGLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALS 187 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L + + K +I LTDGEN + + ++ + A+ Sbjct: 188 VEKLNA------------LDARQERKVQSKILILLTDGENTAGELD--PVQAAELAETLG 233 Query: 362 IKIVTISIN-----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSL 398 IKI I + + + L K + ++ + DSL Sbjct: 234 IKIYAIGVGTTGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSL 293 Query: 399 IHVFQNISQ 407 +++ I Q Sbjct: 294 DAIYREIDQ 302 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 75/231 (32%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ + F+ + Sbjct: 90 MMLAVDLSGSMKIDDMQLN----------GRQVNRLTMTKSVVYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + ++ + Sbjct: 133 RIGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDEREESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N N+ + + A +K+ TI + A Sbjct: 193 ------------------NVLILLTDGQNTAG--NITPEQAKELAISKGVKVYTIGVGAD 232 Query: 373 P------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L S+ ++ NA L ++Q + Sbjct: 233 KMLIQSFFGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQELQAIYQQL 283 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 77/229 (33%), Gaps = 39/229 (17%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + +R + +D SGSM + + + + K ALK L F+ Sbjct: 71 DQKTSSKRKGRDLVFALDTSGSMAESGFNPE----------NVQNRKFDALKELLRSFIT 120 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + +G+ + T I S+ V + + T + Sbjct: 121 K-------RYNDNVGVSIFGTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAM 173 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +IL + + K II +TDG N+ + + KAK+ Sbjct: 174 ALKILKKGEAKE------------------KVIILITDGYQNSGAVS--VKEAVQKAKKQ 213 Query: 361 FIKIVTISINA-SPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 +KI TI I S LL+ ++ + N L +++ I + Sbjct: 214 HVKIYTIGIGDRSAFDANLLQLIAKNTDAKMFEAKNVKMLQDIYKEIDK 262 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 38/251 (15%), Positives = 78/251 (31%), Gaps = 66/251 (26%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I + +D+S SM + +D + ++ ALK F++ Sbjct: 82 VSTKTKTTKGIDIVIAIDVSASM---------------LAKDLRPNRLEALKKVAASFIN 126 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMK 299 + + +GLI Y + V + +++I T + Sbjct: 127 -------GRPNDRIGLIEYAGESFTKTPITSDKSIVLSALKSIQYNNIIEGGTAIGMGLA 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L S K II +TDGENN + + + A+E Sbjct: 180 TGVNRL------------------KDSKALSKVIILMTDGENNAGQID--PRIAAELAQE 219 Query: 360 NFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNAD 396 IK+ TI + + + + L + ++ ++ N + Sbjct: 220 FGIKVYTIGMGTNGMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQYFRATNNE 279 Query: 397 SLIHVFQNISQ 407 L ++ I + Sbjct: 280 KLQEIYDEIDK 290 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 80/271 (29%), Gaps = 69/271 (25%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D +G+ H ++ + + + K F D V +G+ Sbjct: 437 DKNGADHVKSEANTGYASGNQYKRQEWINKY---------FTDDGGNRPSVGNSNPLGMG 487 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + ++ S T+ + T+ + ++ L+ Sbjct: 488 AGPNSMCSSVSVSDLTDNKNTTQAKLTSMQASGATNVQMGVAWGWRTLSPG-------EP 540 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFK-------------------------------- 345 F +G + +K +I +TDG N + Sbjct: 541 FTEGRPYDAEDNKKIMIIMTDGNNTYYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYD 600 Query: 346 ---------------SNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSS--- 385 + + + C AK I I +I+ + + + L+ C SS Sbjct: 601 GEANPGHNSQTFTKAMDEHLTETCTNAKNAGITIYSIAFDVPNGSSVKATLEDCASSDVG 660 Query: 386 -PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +++ N +LI F+ I++ + + S Sbjct: 661 GGKLYFDANNNAALIDTFEKIAERLADLRIS 691 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 30/335 (8%), Positives = 73/335 (21%), Gaps = 52/335 (15%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 K+ E+ N + ALS+ L+ IG + + + + +SA +A +A A Sbjct: 19 LSNTVKRFRDDERGNVFVFTALSLPVMLMAIGAGADYAELYRARVNFQSAVDAGAIAAAK 78 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + K + I + + R Sbjct: 79 NLAATGQVQTSKDIGEEVFRSNLSHLGEKAVREGQINFDMGDGDCAVQGVITTATLPHDR 138 Query: 127 ISMTHM----------------ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 S ++ + L + Sbjct: 139 FFSLSFVDQSQQKGFGANKIVKGQEEFILSASSTVECGNDTIEIALVLDNSGSMRWNGKI 198 Query: 171 QKI-----------------------VSFIPALLRIEMGERPIFL-IELVVDLSGSMHCA 206 + S +P + +G S + Sbjct: 199 GTLRQASNSLVETLHTTMGSANKAIQFSVVPFAATVNVGTNNRNEPWMDTQGRSSTHWEM 258 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV--- 263 ++ S + + A + L L + V++ + T Sbjct: 259 IDPSTSTDFSFSGGRYLQSNGQALSRFTLYDSLPNTSWQGCVEQRPHPYHTQDDTPSISN 318 Query: 264 ---------EKNIEPSWGTEKVRQYVTRDMDSLIL 289 + +W + ++ + + Sbjct: 319 PDTLIVPSFAPDTPDNWDNDYNKRLSNIVVGADPH 353 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 76/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + ++ A K L FLD Sbjct: 89 GDVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGNVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +E +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L + Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQRHG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 231 Query: 361 FIKIVTISINASPN-----------------GQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ S + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 96.1 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 74/231 (32%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ + F+ + Sbjct: 90 MMLAVDLSGSMKIDDMQLN----------GRQVNRLTMTKSVVYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + + + Sbjct: 133 RLGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDERDESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N N+ + + A +K+ TI + A Sbjct: 193 ------------------NVLILLTDGQNTAG--NITPEQAKELAINKGVKVYTIGVGAD 232 Query: 373 P------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L S+ ++ NA L ++Q + Sbjct: 233 KMLIQSFFGSREINPSQELDEGMLTDIATSTGGQYFRARNAQELEAIYQQL 283 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 30/285 (10%), Positives = 83/285 (29%), Gaps = 21/285 (7%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + +++ +F + ++ V + L+ Sbjct: 191 QQTEATDSVNSRSKKTAEISTYPEFSSIPQSSTHDGFSVLIHLKAPSASPDQATCRLVNE 250 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 S S ++ S + TK+A LK A+ + + + Sbjct: 251 SSVRSPSGRAPVDLVTVIDISGSM----AGTKLALLKRAMGFVIQHLGPSD------RLS 300 Query: 256 LIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 +I +++ + + +Q + T+ A+K+A +++ ++ Sbjct: 301 VIAFSSSARRLFHLQRMSHYGRQQALQAINSLGAGGGTNIADALKKATKVIEDRSYKNSV 360 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + + G + ++ S V + D + I A + Sbjct: 361 CSIILLSDGQDTYNICSNVR----GGSKDYSSLVPPSILSDT--RRMLPIHAFGFGADHD 414 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 L +S + + + F +Q + SV+++ Sbjct: 415 SDSLHSIAEASGGTFSFIEDEGVMQDAF---AQCIGGL-LSVVVQ 455 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 30/285 (10%), Positives = 83/285 (29%), Gaps = 21/285 (7%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + +++ +F + ++ V + L+ Sbjct: 191 QQTEATDSVNSRSKKTAEISTYPEFSSIPQSSTHDGFSVLIHLKAPSASPDQATCRLVNE 250 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 S S ++ S + TK+A LK A+ + + + Sbjct: 251 SSVRSPSGRAPVDLVTVIDISGSM----AGTKLALLKRAMGFVIQHLGPSD------RLS 300 Query: 256 LIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 +I +++ + + +Q + T+ A+K+A +++ ++ Sbjct: 301 VIAFSSSARRLFHLQRMSHYGRQQALQAINSLGAGGGTNIADALKKATKVIEDRSYKNSV 360 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + + G + ++ S V + D + I A + Sbjct: 361 CSIILLSDGQDTYNICSNVR----GGSKDYSSLVPPSILSDT--RRMLPIHAFGFGADHD 414 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 L +S + + + F +Q + SV+++ Sbjct: 415 SDSLHSIAEASGGTFSFIEDEGVMQDAF---AQCIGGL-LSVVVQ 455 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 81/236 (34%), Gaps = 57/236 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM ++ + +++ L F++ ++ Sbjct: 85 DLMLAVDLSGSMQIEDME----------LGNRVVDRFTMVRHVLSDFIER-------RDG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ V +++ + L+ + T A+ + ++ Sbjct: 128 DRLGLILFADQAYLQAPLTFDRFAVARFLDEAVLGLVGQQTAIGDAIALGVKRFNDLEQS 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDGENN + + A+++ +K+ TI I + Sbjct: 188 ------------------SRVLVLLTDGENNAGRFT--PAQAVSLARQSGVKLYTIGIGS 227 Query: 372 --------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ S+ ++ + + L ++Q + Q Sbjct: 228 AEIRRRGLLGTRTVNPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQELDQ 283 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 62/201 (30%), Gaps = 42/201 (20%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ K L F+D + + + LI + + ++ +++ Sbjct: 98 DFTPNRLEKAKEVLDEFIDK-------RGNDRLSLIVFGGDAYTKVPLTFDHNVIKEMTR 150 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + I T + M + S K II LTDGEN Sbjct: 151 KLTVDDITSNTRTAIGMGIGVAL----------------NRLKDSEAKSKVIILLTDGEN 194 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-----------------NGQRLLKTCVS 384 N+ + + D AKE IKI TI I A + L + Sbjct: 195 NSGEMS--PSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYKTVKNTELDENMLKSIAET 252 Query: 385 SPEYHYNVVNADSLIHVFQNI 405 + ++ ++ +F I Sbjct: 253 TGGEYFRASDSKEFKEIFNKI 273 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 74/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRIGLILFADHAYLQTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDSNA------------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYHTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ +++ N L +++ I+ Sbjct: 221 YTLGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 72/241 (29%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFITK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRVGLILFADHAYLQTPLTMDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 181 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYQTTI 220 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ ++ N L ++ I+ Sbjct: 221 YTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTIN 280 Query: 407 Q 407 Q Sbjct: 281 Q 281 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 77/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM K + +++ + F++ ++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVN----------GKTVDRFTLIQHVVSDFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 71/228 (31%), Gaps = 43/228 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + L +DLS SM DP ++ A++ + F+ Sbjct: 90 TQPVRDLLLALDLSQSMDTRDFRDPS---------GALIPRVQAVREVVSSFVAR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + V+ + + + T A+ ++ Sbjct: 136 --RPGDRIGLIVFGDAPYPLAPFTLDHALVQTMIRDLLPGMAGPSTALGDAVGLGIKMFD 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +K +I LTDG + K + D AK+ + + T Sbjct: 194 QS------------------PAPEKVLIVLTDGNDTASKMP--PERAADIAKQRHVTVHT 233 Query: 367 ISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I I L + + ++ + +SL ++ + + Sbjct: 234 IGIGDPSAEGEQRVDLGVLQRMAAQTGGRYFFGADQNSLESIYATLDR 281 >gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana] Length = 641 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 55/193 (28%), Gaps = 23/193 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMD 285 KM +KNA+ + ++ + +I +++ + +E +Q Sbjct: 219 KMELMKNAMSFVIQNLGETD------RLSVISFSSMARRLFPLRLMSETGKQAAMQAVNS 272 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+ +K +++ + ++ + + F G Sbjct: 273 LVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDGQDN------FTFSHAGVRLRTD 326 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 C I I T + + + + S + + F Sbjct: 327 YESLLPSSCR------IPIHTFGFGSDHDAELMHTISEVSSGTFSFIETETVIQDAF--- 377 Query: 406 SQLMVHRKYSVIL 418 +Q + SV++ Sbjct: 378 AQCIGGL-LSVVI 389 >gi|15223093|ref|NP_172283.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana] gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 641 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 55/193 (28%), Gaps = 23/193 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMD 285 KM +KNA+ + ++ + +I +++ + +E +Q Sbjct: 219 KMELMKNAMSFVIQNLGETD------RLSVISFSSMARRLFPLRLMSETGKQAAMQAVNS 272 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+ +K +++ + ++ + + F G Sbjct: 273 LVADGGTNIAEGLKIGARVIEGRRWKNPVSGMMLLSDGQDN------FTFSHAGVRLRTD 326 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 C I I T + + + + S + + F Sbjct: 327 YESLLPSSCR------IPIHTFGFGSDHDAELMHTISEVSSGTFSFIETETVIQDAF--- 377 Query: 406 SQLMVHRKYSVIL 418 +Q + SV++ Sbjct: 378 AQCIGGL-LSVVI 389 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 53/231 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L +D+SGSM + + ++ A K L FLD + Sbjct: 101 MMLAMDVSGSMGEGDM----------VLGGQAVDRLTAAKAVLADFLDR-------RAGD 143 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR + + L + T A+ A + L Sbjct: 144 RIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRL------- 196 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 Q+ +I LTDG +N + ++ + A+ ++I T++ Sbjct: 197 -----------RSQPEGQRVLILLTDGVSNAGV--LEPLRAAEVAQAEGVRIHTVAFGGD 243 Query: 371 --------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + + + L ++ + + Sbjct: 244 GSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAGIYAELDR 294 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 59/522 (11%), Positives = 128/522 (24%), Gaps = 126/522 (24%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN-AA 60 +L ++ K+ +E ++II AL LL + F + K + A + AA Sbjct: 6 NLTTKLLSTVKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 61 ILAGASKMV-------------SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 +L A + + +R ++AK + L Sbjct: 66 LLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLY 125 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDV------------ 150 S + + V + + + + ++ + Sbjct: 126 LRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNQNGTAKSVACVVQGSVQRKFWLPWGQ 185 Query: 151 ------------------------------MTSYDYRLQFIEHLLN-----------QRY 169 + R Sbjct: 186 TLVSSNQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYIDRKGDAHKPNRR 245 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN--SDPEDVNSAPICQDKKRTK 227 +V + + I + R I + +G ++ + D + + +++K Sbjct: 246 IDALVEVVGEVQNILVTPRKIKEKCESLQSNGLLYLLQQVLAKKGDTSGCVLPYYVQQSK 305 Query: 228 --------------------------MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 M K + + G Sbjct: 306 TEYISELLRGRRGRTIREGLKEIERHMDIAKTVNQIKNFNNGEKQSYSFSFNNGDFCLGG 365 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 K +W +K L T T M ++TS + Sbjct: 366 NEGKETTQAWFDQKKPNVSEALGKIEPLGGTAVTSGMLIGINLMTSK------NSEPEAA 419 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKEN------------FIKIV 365 + ++ ++ L+DGE+N +C + K+ ++ Sbjct: 420 PSKLNTNTRRVLLILSDGEDNQPSEKTLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRVA 479 Query: 366 TISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I+ N K CV +++Y+V + L+ F+ I Sbjct: 480 FIAFGTNLPDNQLNAWKQCV--GKHYYSVFSKQGLLDAFKQI 519 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 76/237 (32%), Gaps = 39/237 (16%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + R + V+D+SGSM K++ LK A+ + ++ Sbjct: 239 SVSSRAPLDLVTVLDVSGSM--------------------SGIKLSLLKRAMSFVIQTLG 278 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ +++ ++ T +Q + + T+ A+K+ Sbjct: 279 PND------RLSVVAFSSTAQRLFPLRRMTLTGRQQALQAISSLVASGGTNIADALKKGA 332 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +++ ++++ ++ + + + N +I + + Sbjct: 333 KVVKDRRRKNPVSSIILLSDGQDTHS-----FLSGEADINYSILVPPSILPGT---SHHV 384 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +I T + + +S + S+ F +Q M SV++K Sbjct: 385 QIHTFGFGTDHDSAAMHAIAETSNGTFSFIDAEGSIQDAF---AQCMGGL-LSVVVK 437 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 57/181 (31%), Gaps = 13/181 (7%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K A E N ++F L ++ + ++G + ++++ A + A+L ASK+ + Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFMLGATADYTRYATTRSALRQATDVAVLTVASKLTAT 73 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + ++ + + A + + + + V M Sbjct: 74 TTDAQAKAQAQVILNAQPRMSTA--------SITTASIATTKQTFCATSEVTIQNSFMQM 125 Query: 132 MANNRLDSSNNTIF-----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 L S + N + + +L+ + +S + + + Sbjct: 126 ARVTSLTPSVTSCADLAWGANPNATYEVALVVDNSGSMLSSDGSVTKISALKTAAKSFVD 185 Query: 187 E 187 Sbjct: 186 T 186 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 29/275 (10%), Positives = 76/275 (27%), Gaps = 61/275 (22%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D +G + + +K+ + + + + + G Sbjct: 318 IDDNGGVTTRTYYGSTTTYTCSDTASGSWSKLTH----VCKYKPTAAKSGNYGPTSFFGP 373 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK------ 310 + T + + + T+ ++ ++ + Sbjct: 374 NAFCPDHTTQRLLQLTTSQTTIK-NKIDQLVANGNTNLQEGFMWGWRTISPNGPFAAGRP 432 Query: 311 -------------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--------- 348 F ++ + ++ + +GE N + Sbjct: 433 YATSNNRKVMVFMTDGFNHWGAYPNTVVGSDYEALGYYTYNGEKNLRLPDGSRGDRVDYQ 492 Query: 349 -----------------------NTIKICDKAKENFIKIVTISINASPN-----GQRLLK 380 T++ C AK +++ TI + S + G LLK Sbjct: 493 NALKAARNSNSSYLATARDAQDELTLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLK 552 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +C ++ ++++ V NA+ L F +I + + S Sbjct: 553 SCATNVDHYFAVENANQLNAAFSSIGIGLGKLRLS 587 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 23/261 (8%), Positives = 74/261 (28%), Gaps = 11/261 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + N++++ A++++ + + I + + +K + +A +AA A A ++ Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLTEG 65 Query: 72 LSRLGDRFESISNHAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + R ++ + A + + + +G + + Sbjct: 66 ATDDQLRAYALDFFNANLNDLNPANATLNLTLPSNTAGGGLLKMTARLNYKPYFYPAFAQ 125 Query: 131 HMANNRLDSSNN-----TIFYNMDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALLRI 183 + + D++ + T + L + L QK + + + Sbjct: 126 LVGKSATDANQSISFDVTSQVRLKNTLEVALVLDNSGSMTTLGTGSGQKRIDLLKTASKQ 185 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-- 241 + + + + + + VN + L D Sbjct: 186 LVDTLAQQAVMIK-QVDRPVQFGLVPFAASVNVGSGNGNASWMDTEGLSPVSNENFDWST 244 Query: 242 IDLLSHVKEDVYMGLIGYTTR 262 ++ + T Sbjct: 245 LNAADKYAQKTNGIWYKRGTG 265 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/252 (11%), Positives = 56/252 (22%), Gaps = 69/252 (27%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGL-IGYTTRVEKNIEPSWG--TEKVRQYVTRDMDSLI 288 ++ L + G Y+ + T+ Sbjct: 408 QSRLRNMAKYFQPRPINAPALPTGNGPNYSCTTNAITPLTDVSVTDGATAIKAAIDLMQP 467 Query: 289 LKPTDSTPAMKQAYQI------------------------LTSDKKRSFFTNFFRQGVKI 324 T+ M +++ LT + + Sbjct: 468 NGGTNVPEGMAWGWRVVSSGEPFTQGRLETEKGNDKVVIVLTDGANTYYTPSSLSYSDPA 527 Query: 325 PSLPFQKFIIFLTDGENNNFKS------------------------NVNTIKICDKAKEN 360 S +L G N N +C+ AK Sbjct: 528 DSKSTYASYGYLNPGYNGTSVGRMFMGTSTAIGQFDYSNGNYTNALNEQMATLCNNAKAA 587 Query: 361 FIKIVTISINASPNGQ------RLLKTCV------------SSPEYHYNVVNADSLIHVF 402 I ++T++++ S LK+C S P + SL + F Sbjct: 588 NIMVMTVALDLSTTKASDKLAIDALKSCSSESRFRKDPTDPSKPAKLFWNATGASLSNDF 647 Query: 403 QNISQLMVHRKY 414 + I + + + Sbjct: 648 KEIGNELSNLRV 659 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 38/241 (15%), Positives = 72/241 (29%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 68 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFITK--- 114 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E MGLI + + + V + + + LI T + A + Sbjct: 115 ----REGDRMGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKT 170 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 171 FIDS------------------DAPQRVMILLSDGSNTAGVLD--PLEAANIAKQYQTTI 210 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ ++ N L ++ I+ Sbjct: 211 YTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTIN 270 Query: 407 Q 407 Q Sbjct: 271 Q 271 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM+ + K ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMNEPDM----------VLGGKVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N + +K + AK Sbjct: 192 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLD--PLKAAELAKAE 231 Query: 361 FIKIVTI-----------------SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I TI N + L K + + + + L ++ Sbjct: 232 GVRIHTIAFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 77/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + ++ A K L FLD Sbjct: 89 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGNVVDRLTAAKAVLSDFLD 138 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +E +GL+ + R + VR + + L + T A+ Sbjct: 139 R-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIAL 191 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L ++ Q+ ++ LTDG N N +K + AK Sbjct: 192 SVKRLREQRQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 231 Query: 361 FIKIVTISINASPN-----------------GQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ S + L K + + + + L ++ Sbjct: 232 GVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRARDTEELAGIYA 291 Query: 404 NISQ 407 + + Sbjct: 292 ELDR 295 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 27/277 (9%), Positives = 63/277 (22%), Gaps = 23/277 (8%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + ++K N + A +++ LIG + + + + M+ A +AA LAG M Sbjct: 10 GFMSRLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAM 69 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 ++ ++ E+ F + ++ Sbjct: 70 TTSSMTQANKDEAKKFFDFNFPQG---TFQAATFTPVIRSKPGETTTVQVTASTTMPTTV 126 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL-----NQRYNQKIVSFIPA---- 179 M L S T L + + + ++ + Sbjct: 127 MKIFRYETLPLSVTCEARFDIGNTDVMLVLDTTGSMAYAISDGKGGSTTRLAALKQAVKD 186 Query: 180 ------LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--MAAL 231 G + ++ V +T+ + Sbjct: 187 FYDTLGAGSNATGRIRYGFMPYSSTVNVGYQL---PTNYLVGGISGETWDYQTRRLLTTY 243 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 A I V + + Sbjct: 244 GTATNETGSWIYTSGSVSAPTTYSSTSGGSASQCVAP 280 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 66/213 (30%), Gaps = 38/213 (17%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 ++ +L++ T+ + P+ + Y+ + + T Sbjct: 473 SSSGTWLNTQSRKDGGWNACPSQSRRLTSYTNRTSTPTGQSSSFNSYIDNLI---AVGGT 529 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------- 345 M + L+ + N G + I+F+TDG+ + ++ Sbjct: 530 YHDIGMLWGARFLSPKGIFASDNNSAPNGFN-----ISRHIVFMTDGDMSAYQQVYGAYG 584 Query: 346 ---------------------SNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTC 382 N +C+ K I I I + N Q L+ C Sbjct: 585 YQQLDARVAPGNTSDTDLTAIHNTRLQMLCNAIKAKGITIWVIGFRNQSEGNIQTPLQNC 644 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +S + +A SL F++I++ + + S Sbjct: 645 ATSSNHWTMAYDATSLSQKFKDIAKNIGGLRVS 677 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 61/198 (30%), Gaps = 45/198 (22%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ A K L F+ ++ +G+I + ++ S+ + + Q V Sbjct: 116 NRLDAAKQVLNRFITE-------RQGDRIGIIVFGSKAYLQAPLSYDLDTIAQLVNETQI 168 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A+ + L + ++ +I +TDG N + Sbjct: 169 GFAGENTAIGDAIGLGIKRLANIDAD------------------KRVMILMTDGANTAGR 210 Query: 346 SNVNTIKICDKAKENFIKIVTISINA------------------SPNGQRLLKTCVSSPE 387 + A + +KI TI I A + + L K + Sbjct: 211 V--KPDQAAQFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVADLTQG 268 Query: 388 YHYNVVNADSLIHVFQNI 405 ++ + L ++ + Sbjct: 269 QYFRAKSTQELASIYATL 286 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 83/245 (33%), Gaps = 49/245 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E IE+VVD SGSM D+ ++ A+K F+ D L Sbjct: 81 ENDGIAIEMVVDRSGSMQAMDF----------QLGDEHVDRLTAIKKVAGDFVTGGDNLD 130 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQI 304 + +GLI + + P+ + + + + + T A+ A + Sbjct: 131 -GRLSDLVGLITFAGYADGVTPPTLDHAFLVSQLNHSQIVTNRSEDGTAIGDAISLAVEK 189 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + + K II LTDGENN ++ I+ + A+ IK+ Sbjct: 190 LNA------------LDARRKEKIQSKIIILLTDGENNAG--DLEPIQAAELAQTMGIKV 235 Query: 365 VTISIN----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 TI + + + + L K + ++ + DSL ++ Sbjct: 236 YTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVASITGGKYFRATDTDSLAKIY 295 Query: 403 QNISQ 407 I Q Sbjct: 296 GEIDQ 300 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 79/248 (31%), Gaps = 57/248 (22%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + + +D+SGSM + +DK+ ++ A+K L F+ Sbjct: 75 VGDTMDLPATGRDLMISIDISGSMEMPDM----------VIEDKEVDRLVAVKALLTDFI 124 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAM 298 ++ +G+I + + ++ + V+ + L T + Sbjct: 125 AR-------RKGDRVGMILFGEQAYLQTPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGI 177 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + L + +I LTDG+NN N ++ + A+ Sbjct: 178 GLAVKRL------------------RERDANNRVLILLTDGQNNTGALN--PLQAAELAE 217 Query: 359 ENFIKIVTISINASP-------------------NGQRLLKTCVSSPEYHYNVVNADSLI 399 I I TI + A + + L+ + ++ + + Sbjct: 218 HAGITIYTIGVGADEMIVKNRFFGNRRINPSLELDEESLIAVAEKTGGRYFRARDTKEME 277 Query: 400 HVFQNISQ 407 ++Q I + Sbjct: 278 EIYQIIDE 285 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 67/208 (32%), Gaps = 36/208 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + D K +++ A K A F++ + V +GL+ + + Sbjct: 93 DVSGSMMADDLKPSRLDAAKAAARSFVERMPAG------VKVGLVSFAAGAVLESGLTAD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + V + R T + ++ + + Sbjct: 147 HQGV---IERIDLLERRANTAIGEGLLESLKAFPTGANHQV--------------AVPAT 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------NGQRLLKTC 382 +I L+DG N + + +AK +++ TI + + + L Sbjct: 190 VILLSDGRNRIGIA---PQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIA 246 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ +AD L +++ + + Sbjct: 247 EVTGGRYFAADSADRLQEIYRELGSQIG 274 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 65/208 (31%), Gaps = 49/208 (23%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D K ++ A KN + F+ + + D +GL+ ++ + + + Sbjct: 110 DFKPNRLEAGKNIAIDFIKN-------RPDDRIGLVIFSGESFTQCPLTIDHDVLINLYH 162 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + +I T + A L S K +I LTDG N Sbjct: 163 DIKNGMIEDGTAIGMGLATAVNRL------------------RGSEAKSKVVILLTDGVN 204 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISIN----------------------ASPNGQRLL 379 N I + AK+ I++ T+ I + L Sbjct: 205 NAGSIP--PITAAEIAKQFGIRVYTVGIGTQGYAPYPVPSPYGGVVYQRMEVQIDEPTLT 262 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K + ++ N D+L +++ I Q Sbjct: 263 KIAAITGGKYFRATNNDALTRIYKQIDQ 290 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 82/230 (35%), Gaps = 34/230 (14%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + ++ R I L +D SGSM+ + ++V+ K+ ++ K F+ Sbjct: 43 VDKLSPNNRHGKDIVLAIDASGSMNSSGFDFEDEVSDG-----KRLSRFEITKIIASEFI 97 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +G++ Y ++ E V Q + + + T A+ Sbjct: 98 QK-------RISDNVGVVLYGDFAFIASPITYEKEIVTQMLGYLTQGMAGQNTAIGEAIA 150 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + K + K I+ L+DGE+N+ + + + AKE Sbjct: 151 MGVRSFKHSKAK------------------TKVIVLLSDGEHNSGSVS--PKEATELAKE 190 Query: 360 NFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 IKI TI++ + L S ++ +A L +++ I + Sbjct: 191 QGIKIYTIAMGNKGEADEALLETIAKDSNGEFFSASSAKELKNIYDEIDK 240 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 75/233 (32%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + + ++ +K L F++ ++ Sbjct: 88 DIMLAVDLSGSMTEQDMAYN----------GQYVDRLTMVKAVLSDFIEQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + ++ L+ + T A+ + + + + Sbjct: 131 DRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFANKDES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG+N N A+E IK+ TI + + Sbjct: 191 N------------------RIVVLLTDGQNTAGNLN--PEDALLLAREEGIKVYTIGVGS 230 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + ++ + L ++ + + Sbjct: 231 DNPRGFSLFNVGGSSGSNLDESLLKKIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 51/183 (27%), Gaps = 9/183 (4%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + ++ + N ++F LSV+ +++ G + ++ + A LA AS Sbjct: 1 MKRPVNEFVSDHRGNVVMLFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVAS 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + S ++ + + A I I V+ Sbjct: 61 RITAATSTADAIKQAQNVLRSASQRLAAATISNATISADR-------KTFCIDAQVSIPT 113 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMT--SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + M + + + + T L + + + Sbjct: 114 MIMKIARIDSMAPAVMSCAEIGGGSTNYEIALALDNSGSMNESAGGATKIQSLKTAATNF 173 Query: 185 MGE 187 + Sbjct: 174 VNS 176 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 66/211 (31%), Gaps = 12/211 (5%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTE 274 P + ++ ++ + + + E V + + ++ T Sbjct: 366 CPNAATQTALQLTPTQSTITAKIAQLTAAGDTNLHEGVMWAWRSISPNPPFSAGSAYNTA 425 Query: 275 KVRQYVTRDMDSLILK--PTDSTPAM---KQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 VR+ + D T++ Y K R GV S Sbjct: 426 GVRKILVLMTDGYNNWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLD 485 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-----SPNGQRLLKTCVS 384 + + + T + C+ AK I+I +I+ + G LLK+C + Sbjct: 486 GAANSWTDYKSVSRQAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCAT 545 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +++ ++ + F I+ + + S Sbjct: 546 NADHYLLATDSTQIDRAFSQIAMNLSKLRLS 576 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 71/240 (29%), Gaps = 58/240 (24%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I L +D SGSM + K T++ A+K + F+ Sbjct: 85 PGVDIMLCLDTSGSMQALDF----------KVEGKSVTRLEAVKKVVADFIGK------- 127 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +E +GL+ + + + + V R + T A+ + L Sbjct: 128 RETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNRMKIGMAGDRTAIGSAIAIGGKRLKDL 187 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K + K +I LTDG NN + + +E IK+ TI Sbjct: 188 KSK------------------SKILILLTDGRNNAGEIS--PQAAARAVREFGIKLYTIG 227 Query: 369 IN---------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ N+ L ++ I + Sbjct: 228 VGGKGPAPFRMKTLFGTRLVPQHVDLDEVTLRNVAKTGGGKYFRAANSQELQEIYDIIDR 287 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 95.3 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 47/355 (13%), Positives = 99/355 (27%), Gaps = 44/355 (12%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 L+ + +A F + + + V T+ V + Sbjct: 143 NFMELTSQAHNKRVVFKTTVKADGSSKGGFSRYSSSNVAASSADVVVVTQDGLTVTKTAK 202 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + ++ + T L + ++ + + Sbjct: 203 ELAAKNVWEIEVKVEGKNVVLQEATDVVLVLDRSGSMGQGVVDKNNPNAQKCTVLTCTNS 262 Query: 188 RPIFLIELVVDLS---GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 D + + + I ++ +K+A FLD++ Sbjct: 263 NRW--HRHNADCYDEEYYILKCTQNHTHTLPGDFIANSCYVSRADKVKDASYTFLDTLQE 320 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V + ++ Y K ++ + + L T++ ++ A QI Sbjct: 321 KEDVN----ISVVTYAGTASKVTN-----SNLKSGIESAYNVLGTDGTNTGRGIEIASQI 371 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ S K I+ L+DGE+N S + AK + Sbjct: 372 LS------------------NSTAPNKMIVVLSDGESNAGNS----RTAANSAKNKGCIV 409 Query: 365 VTI--SINASPNGQRLLKTCVS-----SPEYHYNVVNA-DSLIHVFQNISQLMVH 411 TI I + NG + L C S + Y + ++L +F I+ + Sbjct: 410 YTIGAGIASGSNGAKELFDCASVDQSTNKAKFYLADDTGNALNEIFAEIAGEIQE 464 >gi|125559999|gb|EAZ05447.1| hypothetical protein OsI_27661 [Oryza sativa Indica Group] Length = 704 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 59/186 (31%), Gaps = 13/186 (6%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ LK A+ +D + + ++ ++ ++ I + ++ + ++S Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAVES 300 Query: 287 LI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T+ + +A ++ + R+ + + G +N+ Sbjct: 301 LAAGGGTNILKGLVEAAKVFDGRRYRNAVASVILLSDGQDTYNVNGG-----WGASNSKN 355 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN- 404 +V ++ + + + T + + + + N + F Sbjct: 356 YSVLVPPSFKRSGDRRLSVHTFGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQC 415 Query: 405 ISQLMV 410 I L+ Sbjct: 416 IGGLLS 421 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 24/243 (9%), Positives = 71/243 (29%), Gaps = 34/243 (13%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + R + V+D+SGSM TK+A LK A+ Sbjct: 251 AESPQSSQDSRAPVDLVTVLDVSGSM--------------------SGTKLALLKRAMGF 290 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTP 296 + ++ + +I +++ ++ TE + + + + T+ Sbjct: 291 VIQNLGPSD------RLSVIAFSSTARRHFPLRRMTETGKLEALQAVNSLVSSGGTNIAE 344 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 +++ ++++ K ++ + + + + + Sbjct: 345 GLRKGFKVVVDRKWKNPVCSIILLSDGQDTYTISGTSMTRPQADYKSLLPTSIHRNGSSG 404 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + I + + + + S + + F +Q + SV Sbjct: 405 FR---IPVHAFGFGSDHDAASMHSISEISGGTFSFIEAEGVIQDAF---AQCIGGL-LSV 457 Query: 417 ILK 419 +++ Sbjct: 458 VVQ 460 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 47/395 (11%), Positives = 113/395 (28%), Gaps = 41/395 (10%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ + +E ++II AL LL + F + K + A + A Sbjct: 6 NLTAKLFSTVKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--------------ESLS 107 L ++ N +++ + + + F ++ L Sbjct: 66 LLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLY 125 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDVMTSYDYRLQFIE 162 S + + V + + + + +I + + L + + Sbjct: 126 LRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQ 185 Query: 163 HLLNQR---YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 L++ + ++ E + +V DLS SM A+N+ + P Sbjct: 186 TLVSSSQLYDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINATG----NNPP 241 Query: 220 CQDKKRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGYTTRVEKNIEPSWG---- 272 + ++ AL+ A+ + V MG + + + E + Sbjct: 242 EVNYPNRRIDALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTNCVLPY 301 Query: 273 ---TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 +E ++ ++ + + L K + F Q Sbjct: 302 YVKSEDKKREISAKFKKGYNGGNHIAKGFELLDRDLDIPKTIDQISQFDGQKRTYDFTLD 361 Query: 330 QKFIIFLT-----DGENNNFKSNVNTIKICDKAKE 359 K + + + N + K+ Sbjct: 362 SKTSRNYCLEDNVNKKTTQAWFDKNNRAVASALKQ 396 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 79/350 (22%), Gaps = 73/350 (20%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ-------- 171 +V+SS++ + N + + + ++ Sbjct: 186 TLVSSSQLYDGRVGINSGKTYAV-KEKQITIPIDLMMVTDLSRSMMWAINATGNNPPEVN 244 Query: 172 --------------------------------KIVSFIPALLRIEMGERPIFLI----EL 195 + F+ + + Sbjct: 245 YPNRRIDALREAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTNCVLPYYVK 304 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK--NALLLFLDSIDLLSHVKEDVY 253 D + N + + K + + F + Sbjct: 305 SEDKKREISAKFKKGYNGGNHIAKGFELLDRDLDIPKTIDQISQFDGQKRTYDFTLDSKT 364 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 V K +W + R + + T T + ++ Sbjct: 365 SRNYCLEDNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEK----- 419 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKAKEN--------- 360 F ++ ++ L+DGE+N + +C + KE Sbjct: 420 -NKDFEAMPNKIGTNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNY 478 Query: 361 -----FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I V N Q + K CV + +Y+V + +L F+ I Sbjct: 479 PKVETRIAFVAFGFNPPQKQQEVWKKCV--GDQYYSVSSKQALFDAFKQI 526 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 79/234 (33%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM ++ ++ +K+ L F+ + Sbjct: 88 DLMIAVDLSGSMKIDDMQVN----------GRQVDRLQMIKSVLHDFIQR-------RIG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ E V Q + + L+ + T A+ A + S ++ Sbjct: 131 DRLGLIFFADTAYLQAPLTYDRETVSQLLNESLIGLVGEQTAIGDAIGLAIKRFKSKEES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K +I LTDG+N N+ + + A N + + TI + A Sbjct: 191 N------------------KVLILLTDGQNTAG--NITPEQANELAINNGVTLYTIGVGA 230 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L S+ ++ +A SL ++ + + Sbjct: 231 DQMLVQSIFGSRQVNPSQELDEGMLTTLAESTGGRYFRARDAQSLTEIYSKLDE 284 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 30/245 (12%), Positives = 70/245 (28%), Gaps = 34/245 (13%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 Q + A R + V+D+SGSM TK Sbjct: 238 NPEQVTGRSVSATSVGYPTSRAPVDLVTVLDVSGSMA--------------------GTK 277 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS 286 +A LK A+ + + + +I +++ + + +Q + Sbjct: 278 LALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNSL 331 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+K+A +++ ++ + + I G ++ S Sbjct: 332 GASGGTNIADALKKAAKVIEDRSHQNPVCSIILLSDGQDTYNIPSNIR----GARPDYSS 387 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-I 405 V + + + + + + L +S + + + F I Sbjct: 388 LVPSSILNHTFRL--VPVHGFGFGVDHDSDALHSIAEASGGTFSFIEDEGVIQDAFAQCI 445 Query: 406 SQLMV 410 L+ Sbjct: 446 GGLLS 450 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 72/216 (33%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F++ + + +G+ + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKDVAAEFIN-------GRPNDNIGITLFAGESFTQCPLTVD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + LI T + A L K + K Sbjct: 148 HAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG NN +++ + + AK I++ TI + + Sbjct: 190 IILLTDGVNNKG--DISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPV 247 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ + L V++ I + Sbjct: 248 EIDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 80/240 (33%), Gaps = 50/240 (20%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +I R + V+D+SGSM TK+A LK A+ + Sbjct: 232 DQISQYRRAPVDLVTVLDVSGSM--------------------GGTKLALLKRAMGFVIQ 271 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMK 299 ++ + +I +++ + + ++ RQ + + T+ ++ Sbjct: 272 NLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQQALQAVNSLVANGGTNIFDGLR 325 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + +++ ++R+ + II L+DG + ++ + KA Sbjct: 326 KGAKVMEDRRERNSVAS----------------IILLSDGRDTYTTNHPDPSY---KAML 366 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + + + + + S + + + ++Q + SV ++ Sbjct: 367 PQIPVHSFGFGSDHDASVMHSVSEFSGGTFSFIESESVIQDA---LAQCIGGL-LSVAVQ 422 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 32/279 (11%), Positives = 76/279 (27%), Gaps = 15/279 (5%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R R + + N ++IF LS+ ++L + + ++ + A +AA LA A Sbjct: 12 RLRKLVSRLRDDRRGNVAMIFGLSLPVIVMLALGGVDLHRITTARSQFQDALDAATLAAA 71 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK-ESLSGYSAVFYNTEIQNIVNS 124 + + L + + + + + S A + N Sbjct: 72 RSSETTPAGLKSVALATLHGNIQGTEVEPINDADVEVAMNDKSVVIATAQGRVKTLVANI 131 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN-------QRYNQKIVSFI 177 + ++ L S + L + + R + S Sbjct: 132 VLPPYGQLLDDTLPISARSEVNRSSRDVEVALVLDITGSMNDCADSCSSGRKIDNLKSAA 191 Query: 178 PALLRIEMGER------PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 L+ I + + L + ++ A + ++ ++ Sbjct: 192 KELIDIVVQTNQSPFYSKVALAPYSMGVNVGDTYASRARGSLDSNTQSITAATWL-TGSV 250 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 K + +++ K G I E + Sbjct: 251 KTITSISRAYTAVVTASKHGFKTGDIVTIWSAETMAPLN 289 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 38/440 (8%), Positives = 107/440 (24%), Gaps = 74/440 (16%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 + V D + + + ++ + + S S + + F Sbjct: 217 MGVNVGDTYASRAR--GSLDSNTQSITAATWLTGSVKTITSISRAY--TAVVTASKHGFK 272 Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT----IFYNMDVMTSY 154 I S + N + + + + + + + S + + + Sbjct: 273 TGDIVTIWSAETMAPLNGVALTVGSVTTNTFSLVGEDSRYYSAFSGQAYVAKCARTDCNI 332 Query: 155 DYRLQ-----------FIEHLLNQRYNQKI-----------VSFIPALLRIEMGERPIFL 192 L + ++ I + + + Sbjct: 333 VITLARHGLSSEGDAAVLGNMGGLSQLNNIGFRVASVTPTTATLALDASQANLATTAKGG 392 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 S+ + VN+ T + + + D + Sbjct: 393 AAYTSGGQLICGVDGCSNRDFVNAIGAWTRFPGTPCVSERAGSQAYTD-AAPSASSWVGR 451 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G + + + + + V + T + + +++ + Sbjct: 452 SYASGGNACPASQIVPLTNVKKTLTDAVDGMT---AVGSTAGHIGLAWGWYLVSPNF--- 505 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN----------------------------NF 344 + S K ++ +TDGE N N Sbjct: 506 GLWSGLGAPAAYDSSKTLKAVVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNG 565 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQ---------RLLKTCVSSPEYHYNVVNA 395 S ++C+ K + + T+ + L+ C ++P+ + ++ Sbjct: 566 SSFEQAYRLCENMKAADVIVYTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSS 625 Query: 396 DSLIHVFQNISQLMVHRKYS 415 L F++I + + + S Sbjct: 626 TDLSDAFRDIGRDITRLRIS 645 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 30/280 (10%), Positives = 82/280 (29%), Gaps = 28/280 (10%) Query: 153 SYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + L + + I + +P + + + L GS H Sbjct: 93 EIVLVMDTSTSMECLVEPESYDIDNCVPTKEGHIVYIKNKSYLVNTAFLQGSRHKLFYIT 152 Query: 211 PEDVNSAPIC-------QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 N ++ ++ K + + F+ + +++ +GL+ + T Sbjct: 153 IGTTNYYIQGNKCYRQSSYNEKNRLQHAKESAIKFVQKFEND----KNISIGLVSFDTTA 208 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL--------------TSDK 309 + + +V + + T+ +K A Q+L + Sbjct: 209 NSQKDITSKLNEVEDSINSLKVAD-NGATNIEAGLKSAQQLLKKGNKDADKYVILMSDGF 267 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +F + Q + + + ++I + K++ I I Sbjct: 268 PTAFDYAGEKVEKNFNYHEIQDNTFINFGYYDYSGYAMKHSINQANSLKKDGINSFIIGF 327 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + N ++L ++ + N D+L + + + Sbjct: 328 SEGANSEKLNNIAKAAGGEYEEAKNTDTLNGAYDKLETKV 367 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/269 (10%), Positives = 72/269 (26%), Gaps = 61/269 (22%) Query: 201 GSMHCAMNSDPE--DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S + +N N +K R+++ ++K F+D + + ++ Sbjct: 696 SSGNFIINGKKYYVKDNKVYEFNEKDRSRLDSVKKVANDFVDKFKNDE----NTEIAIVR 751 Query: 259 YTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT-------- 306 Y+++ ++ S + R T+ ++++Y IL Sbjct: 752 YSSKANIVLDGSNKIFLNGKDNEIIKKRINSLKADGGTNIGDGIRKSYSILDKCDKDSEK 811 Query: 307 ------------------------SDKKRSFFTNFFRQGVKIPSLPFQKFI--------- 333 + K +F +K+ Sbjct: 812 YMILMTDGVPTAYTCYANTIKASNNCKYSKDNLDFGYCPEGYIDCYNRKYYYSEVKGNFK 871 Query: 334 ---------IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + G+ + + + K+K I V + + + ++ LK Sbjct: 872 LENNNRDEGYVIKFGDEYDKNALEYAKQAMQKSKSKNINNVIVGFSDGIDTEK-LKEIAG 930 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + I + + K Sbjct: 931 DNAQYKEAKDLGELSKQYDEIQKDINTIK 959 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 42/201 (20%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ K L F+D + D + LI + + ++ +++ Sbjct: 98 DFTPNRLEKAKEVLSEFIDK-------RTDDRLALIVFGGDAYTKVPLTFDHNVIKEMTG 150 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + I T + M + S K II LTDGEN Sbjct: 151 KLTVDDITSNTRTAIGMGIGVAL----------------NRLKDSEAKSKVIILLTDGEN 194 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-----------------NGQRLLKTCVS 384 N+ + + D AKE IKI TI I A + L + Sbjct: 195 NSGEMS--PSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYTTVKNTELDENMLKSIAET 252 Query: 385 SPEYHYNVVNADSLIHVFQNI 405 + ++ ++ +F I Sbjct: 253 TGGEYFRASDSKEFKEIFNKI 273 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 67/228 (29%), Gaps = 54/228 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D +MA K F+ + + Sbjct: 89 DIMLTMDVSASM---------------LTDDVYPNRMAVAKEVASEFI-------SSRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +GL + + + + L+ T + A L Sbjct: 127 DNIGLTIFAGEAFTQCPMTLDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQ 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K +I LTDG NN + + AK+ I++ T Sbjct: 187 DSKAK------------------SKIVILLTDGSNNVGSIS--PMTAAAIAKKFGIRVYT 226 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L VS+ Y + L ++Q+I + Sbjct: 227 IGFGRETGEEIGAIDYRALQNIAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 80/231 (34%), Gaps = 44/231 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + P + L +DLSGSM DP ++ +++A+K + F+ Sbjct: 88 VRDEPARDLMLAIDLSGSMATRDFVDP---------AGERMDRLSAVKRVVADFVAK--- 135 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ + + + VR + + + T A+ ++ Sbjct: 136 ----RKGDRIGLVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKL 191 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +K +I LTDG N+ S + + + AK++ + + Sbjct: 192 MADS------------------HAQEKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVV 231 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ 407 TI I + + L + + + + L V+ + + Sbjct: 232 HTIGIGDPGTTGEDRVDLEALARIASITGGRAFRALGQQRDLAEVYATLDK 282 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 38/240 (15%), Positives = 76/240 (31%), Gaps = 43/240 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L VD+SGSM K+ +++A K L F+ Sbjct: 90 EKTARDLMLAVDISGSMEATDF---------VDATGKQTDRLSAAKQVLKQFVA------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + + E + + ++ T A+ + Sbjct: 135 -GREGDRLGLIVFGSAAYLQAPFTDDRETWLALLDESIVNMAGPSTALGDAIGLSIAHF- 192 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + +I LTDG + K + AK + I T Sbjct: 193 -----------------RESKTKNRVLIVLTDGNDTGSKVP--PLDAAQVAKAEGVTIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +++ + + L ++ +N + +L +Q I +L S+ + Sbjct: 234 VAVGDPETVGEEALDLEVLDSIAQTTGGVSFNAADLKALQETYQRIDELEPASYDSLSYR 293 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 72/216 (33%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F++ + + +G+ + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKDVAAEFIN-------GRPNDNIGITLFAGESFTQCPLTVD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + LI T + A L K + K Sbjct: 148 HAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG NN +++ + + AK I++ TI + + Sbjct: 190 IILLTDGVNNKG--DISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPV 247 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ + L V++ I + Sbjct: 248 EIDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 77 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 126 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 127 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 179 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 180 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 219 Query: 361 FIKIVTISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ S + L K + + + D L ++ Sbjct: 220 GVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTDELAGIYA 279 Query: 404 NISQ 407 + + Sbjct: 280 ELDR 283 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 72/216 (33%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F++ + + +G+ + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKDVAAEFIN-------GRPNDNIGITLFAGESFTQCPLTVD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + LI T + A L K + K Sbjct: 148 HAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG NN +++ + + AK I+I TI + + Sbjct: 190 IILLTDGVNNKG--DISPLTAAEIAKSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPV 247 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ + L V++ I + Sbjct: 248 EIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 73/232 (31%), Gaps = 44/232 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + L+VDLS SM D Q K ++ A+K + F++ Sbjct: 88 TQVQSARDLMLLVDLSQSMEAKDFQD---------QQGNKIDRLEAVKLVVDDFIER--- 135 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + T+ + + E R + + T A+ A Q Sbjct: 136 ----REGDRIGLILFGTKAYLQVPFTQDLETARFLLDEAQIGMAGAQTMLGDAIGLAIQT 191 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +I LTDG + + + A + I I Sbjct: 192 FEDS------------------KTENRVLILLTDGNDTGSQVP--PDQAAKFAAQRNIVI 231 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 TI+I + + L + + + LI ++ +I + Sbjct: 232 YTIAIGNPETEGTEKIDEETLQLIADQTGGQFFRGSDRQGLIQIY-DILDQL 282 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMLAVDLSGSMQIEDM----------VINGKVVDRFTLIQHVVSEFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 37/246 (15%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + K + +++ + F++ ++ Sbjct: 85 DLMLAVDLSGSMQIEDM----------VINGKVVDRFTLIQHVVSEFIER-------RKG 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T ++ A + + Sbjct: 128 DRIGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDES 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDG NN N+ + A + I T+ + A Sbjct: 188 N------------------RVLILLTDGSNNAG--NIEPEQAAQIAANRKVTIYTVGVGA 227 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ N+ L ++Q I +L + Sbjct: 228 DVMERRTLFGRERVNPSMDLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDKLEPVSR 287 Query: 414 YSVILK 419 + + Sbjct: 288 DQLSYR 293 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 49/217 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + ++ A K F+ + +GL+ + + Sbjct: 95 DISTSMLAQDLQPDRIQAAKQVAGNFIT-------DRPGDRIGLVAFAGEAFTQSPLTTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ + R ++ T + A L + K Sbjct: 148 QGTLQTLLGRLRSGVVEDGTAIGNGLATAINRLRESNAK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------- 370 II LTDGENN + + + A++ I++ TI + Sbjct: 190 IILLTDGENNRGEI--APLTAAEIARDQGIRVYTIGVGTRGTAPYPTVDFFGNPTVVQAK 247 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ + L ++ I+Q Sbjct: 248 VQIDEKILGEIADLTGGRYFRATDNAKLQSIYDEINQ 284 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 78/228 (34%), Gaps = 43/228 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P + L+VDLSGSM ++P + ++ +++A+K L FL Sbjct: 90 TVPTRDLLLIVDLSGSMDARDFTNP---------EGERIDRLSAVKGVLDEFLTR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + + + R + + T A+ + Sbjct: 136 --REGDRVGLIVFGSAAFVQVPFTQDLNACRILLEETTVRMAGPRTVFGDALGLGITLFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S ++ II LTDG + + + A +N +K+ Sbjct: 194 ------------------RSEVEERVIIALTDGNDTGSRVP--PAEAAKIANDNTVKVHV 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + + L ++ +++ N + L ++ + + Sbjct: 234 VGVGDPTTTGEDVLDEEALNAVASTTGGRYFHANNREELEDIYTELDR 281 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 80/234 (34%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM ++ ++ +K+ L F+ + Sbjct: 88 DLMIAVDLSGSMKIDDMQVN----------GRQVDRLQMIKSVLHDFIQR-------RVG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ E V Q + + L+ + T A+ A + S K+ Sbjct: 131 DRLGLIFFADTAYLQAPLTYDRETVSQLLGESLIGLVGEQTAIGDAIGLAIKRFQSKKES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K +I LTDG+N + + + A N + + TI + A Sbjct: 191 N------------------KVLILLTDGQNTAGNIS--PQQANELAINNGVTLYTIGVGA 230 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + S+ ++ +A+SL ++ + + Sbjct: 231 DQMMVQSIFGSRQVNPSQELDESMLTQLAESTGGRYFRARDAESLKAIYDKLDE 284 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 64/205 (31%), Gaps = 35/205 (17%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + + ++ +A++ + F+ +GLI + T + + Sbjct: 1 SATREGRPVSRWSAVQAVVADFITRRGD-------DRLGLIVFGTAAYPQAPLTLDHAAL 53 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + + T A+ A ++L + K +I L Sbjct: 54 QLLLRHTAVGMAGPNTAIGDAIGLAIRMLDAV------------------DEPDKVLILL 95 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYH 389 TDG + + A ++ I+I TI + + L ++ Sbjct: 96 TDGNDTGSAVP--PQRAATLAAQHHIRIHTIGMGDPQARGDDKVDFDLLEHIAQATGGRF 153 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 + + +SL V+ + + R+ Sbjct: 154 FQANDRESLQQVYATL-DQITPRRV 177 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 49/309 (15%), Positives = 93/309 (30%), Gaps = 42/309 (13%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + L+ S + Y L F ++ + + Sbjct: 35 TQQSAIKVPFFAIL-IDALNISTTQGASQLVATRWQKYLLCFTWIMILFALTKPTLL--- 90 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ E+ I +VVDLSGSM Q K +++ A+K L F Sbjct: 91 --GPPQVREQIGRDIMVVVDLSGSMQKKDF---------VNQQGHKISRLDAVKEVLTDF 139 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + ++ +GLI + + + + + ++ K T A+ Sbjct: 140 IKT-------RQGDRLGLILFGDAAFVQTPFTADHDVWLDLLMQTRVNMAGKSTHLGDAI 192 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + K +K I L+DG + I AK Sbjct: 193 GLTIKRFNEATKNQ-----------TSEKTREKVAIILSDGNDTGSYVP--PIDAAMVAK 239 Query: 359 ENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N ++I I+I + Q + K S Y +N L++ + I +L Sbjct: 240 VNAVRIYMIAIGDPKSVGEQSLDMQTINKIASVSGGQAYQALNQQELLNAYAKIDKLEPQ 299 Query: 412 RKYSVILKG 420 S + Sbjct: 300 LYQSTQYRA 308 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPMTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 78/240 (32%), Gaps = 43/240 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L +DLSGSM D + ++ ++ A K+ L F Sbjct: 90 EKSARDLMLALDLSGSMDARDFRD---------AEGHEQNRLTAAKDVLEGFAAQ----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E +GLI + + E + + ++ + T A+ A I Sbjct: 136 --REGDRLGLIVFGNAAYLQAPFTDDRETWQTLLEESEVAMAGQSTALGDAIGLAISIFQ 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + +I LTDG + + A N + I T Sbjct: 194 AS------------------DTTNRVLIVLTDGNDTGSRVP--PRDAATIAAANDVTIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +++ + + L ++ + ++ +L + I++L + S+ + Sbjct: 234 VAVGDPATIGEEALDLETLNAVAETTGGASFQALDTQALEKAYDEINRLEPAKYDSLSYR 293 >gi|262173885|ref|ZP_06041562.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] gi|261891243|gb|EEY37230.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] Length = 403 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 46/415 (11%), Positives = 103/415 (24%), Gaps = 42/415 (10%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRFESISNH 85 + + ++++ F + + + A+ A LA + K + R Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKLVDRYV 60 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 K + S+ V ++ + ++ + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYENISLKPEFTV 120 Query: 146 YNMDV-------MTSYDYRLQFIEHLLNQRYNQKI-VSFIPALLRIEMGERPIFLIELVV 197 V + + + N K + + ++ + + F E+ Sbjct: 121 NGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVITRVVNDLKDFDTEVKS 180 Query: 198 DLS--GSMHCAMNSDP-----EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 ++ G + + D Q K Sbjct: 181 RVALLGYHNFNIKQGGRSLEAYDYALYNTPQQTVSNMFFPPKRVNPSDSGLFSHRDIDLT 240 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 Y + S TE + + + TD P + I+ SD Sbjct: 241 QNYSSFLQIMNDRNFYPPRSACTESWQGIIAAAQAADKA--TDINP---EQVFIILSDGA 295 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGE------------NNNFKSNVNTIKICDKAK 358 + R G + + K ++ DG N F+S+ T Sbjct: 296 DCPWQRQDRWGRLRTTEYYLKKLV---DGGLCKNLKQRIRQKPNRFQSSTPTENE----- 347 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + I +N N CV ++ D ++ I++ K Sbjct: 348 KTKVTMGVIGVNYQVNPNDGFGDCVGRENIYHATQGEDVYKYILNLINEETGRLK 402 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPMTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 56/513 (10%), Positives = 141/513 (27%), Gaps = 121/513 (23%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ +E ++II AL LLL+ F + K + A + A Sbjct: 6 NLTAKLFNSIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK----------ESLSGYSA 111 L ++ S + +++ + + ++ K L S Sbjct: 66 LLLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSD 125 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 + + + + + + +I + + L + + L++ Sbjct: 126 DKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVS 185 Query: 167 QRY---NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + ++ + + +V DLSGSM ++ Sbjct: 186 SSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPID----------KRIPS 235 Query: 224 KRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGYT-------------------- 260 ++ AL++ + + D MG + + Sbjct: 236 SSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQ 295 Query: 261 ------TRVEKNIEPSWGTEKVRQYVT---RDMDSLILKPTDS----------------- 294 + + +N + ++ + QY+ ++ Sbjct: 296 SKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEG 355 Query: 295 -----------TPAMKQAYQILTSDKKRSFFTNFF-------------RQGVKIPSLPFQ 330 + A + D + + F + + Sbjct: 356 KETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTR 415 Query: 331 KFIIFLTDGENNNFKSNVNTI----KICDKAKEN------------FIKIVTISINA--S 372 + ++ L+DGE+N +C+K K ++ +++ Sbjct: 416 RILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPP 475 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + K CV + +Y V + L+ F+ I Sbjct: 476 QDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 56/513 (10%), Positives = 141/513 (27%), Gaps = 121/513 (23%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +L ++ K+ +E ++II AL LLL+ F + K + A + A Sbjct: 6 NLTAKLFNSIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAA 65 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK----------ESLSGYSA 111 L ++ S + +++ + + ++ K L S Sbjct: 66 LLLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSD 125 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 + + + + + + +I + + L + + L++ Sbjct: 126 DKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVS 185 Query: 167 QRY---NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + ++ + + +V DLSGSM ++ Sbjct: 186 SSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPID----------KRIPS 235 Query: 224 KRTKMAALKNALLLFLDSI---DLLSHVKEDVYMGLIGYT-------------------- 260 ++ AL++ + + D MG + + Sbjct: 236 SSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQ 295 Query: 261 ------TRVEKNIEPSWGTEKVRQYVT---RDMDSLILKPTDS----------------- 294 + + +N + ++ + QY+ ++ Sbjct: 296 SKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEG 355 Query: 295 -----------TPAMKQAYQILTSDKKRSFFTNFF-------------RQGVKIPSLPFQ 330 + A + D + + F + + Sbjct: 356 KETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTR 415 Query: 331 KFIIFLTDGENNNFKSNVNTI----KICDKAKEN------------FIKIVTISINA--S 372 + ++ L+DGE+N +C+K K ++ +++ Sbjct: 416 RILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPP 475 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + K CV + +Y V + L+ F+ I Sbjct: 476 QDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 84/265 (31%), Gaps = 43/265 (16%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + + + + E+ R F + ++VDLSGSM D Sbjct: 67 MLILSWLLIVTALAKPSILGEVQTREAFGRDVLMLVDLSGSM---------DEADFTTAD 117 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T++ A KN L F+ + GLI + + + + Sbjct: 118 GSTLTRLNAAKNVLKTFIAK-------RSGDRFGLILFGDAAFIQTPFTADQQVWLSLLE 170 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T A+ ++ + P Q+ +I LTDG + Sbjct: 171 EAQTGMAGQSTHLGDAIGLGIKVFEQN----------------PQPSEQQVMIVLTDGND 214 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVN 394 S V + A IKI TI++ + + + + + + ++ Sbjct: 215 TG--SFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAID 272 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 L +Q I +L + S + Sbjct: 273 QAELDKAYQLIDKLEPQQYSSASFR 297 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 66/250 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+S SM + +D K ++ ALK + F+ Sbjct: 81 AVSKKTKTNSGIDIIMAIDVSASM---------------LARDLKPNRLEALKKVAIDFV 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D + + +G++ Y + V++ ++ + T + Sbjct: 126 DR-------RPNDRIGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGTAIGMGLG 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L + K II LTDG NN + + A+E Sbjct: 179 SGVNRLKESTAK------------------SKVIILLTDGVNNAGNID--PRTATELARE 218 Query: 360 NFIKIVTISINASP------------------------NGQRLLKTCVSSPEYHYNVVNA 395 IK+ TI I + + + L + ++ ++ + Sbjct: 219 LEIKVYTIGIGTNGMADFPWSKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKYFRATDN 278 Query: 396 DSLIHVFQNI 405 SL ++ I Sbjct: 279 QSLKEIYDEI 288 >gi|326504154|dbj|BAK02863.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 651 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 28/245 (11%), Positives = 74/245 (30%), Gaps = 34/245 (13%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + +R + V+D+S SMH +K+A LK A+ Sbjct: 155 GTEAAGDDPAQRAPVDLVTVLDVSSSMH--------------------GSKLALLKQAMR 194 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDST 295 +D + + ++ +++R + + ++ + V T+ Sbjct: 195 FVIDILGPDD------RLSVVSFSSRARRVTRLTRMSDAGKALCVRAVESLTARTGTNIA 248 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKIC 354 ++ A ++L + R+ + + + G N + + Sbjct: 249 EGLRTAAKVLDERRHRNGVSCVVLLSDGQDNYTPMRQ--AFGRGLPNYAALLPPSFARTG 306 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 A + + T + + ++ + N + F +Q + Sbjct: 307 TGAGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENEAVIQDAF---AQCVGGL-L 362 Query: 415 SVILK 419 SV+++ Sbjct: 363 SVVVQ 367 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 77/251 (30%), Gaps = 66/251 (26%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I + +D+S SM + +D + ++ ALK F++ Sbjct: 82 VSTKTKTTKGIDIVIAIDVSASM---------------LAKDLRPNRLEALKKVASSFIN 126 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMK 299 + + +GL+ Y + V + +S+I T + Sbjct: 127 -------GRPNDRIGLVEYAGESFTKTPITSDKSIVLSALKGIQYNSIIEGGTAIGMGLA 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S K II +TDGENN + + + A+E Sbjct: 180 TGVNRI------------------KDSKALSKVIILMTDGENNAGQID--PRIAAELAQE 219 Query: 360 NFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNAD 396 IK+ TI + + + + L + ++ ++ N Sbjct: 220 FGIKVYTIGMGTNGTALSPYARNPNGTFVYENIQVTIDEELLEEIAETTGGQYFRATNNK 279 Query: 397 SLIHVFQNISQ 407 L ++ I + Sbjct: 280 KLQEIYDEIDK 290 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 71/221 (32%), Gaps = 47/221 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + +++ A+K F + Sbjct: 92 DLMLAIDLSGSMQIEDM----------QVGARLVSRIEAVKAIASDFTSQ-------RVG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ T V +++ + T A+ A + L Sbjct: 135 DRVGLILFGTRAYVQAPLTFDTATVTRFIREAQLGFAGEDTAIGDALGLAIKRL------ 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG++ + ++ A E+ IK+ TI I+ Sbjct: 189 ------------RERPAESRVLILLTDGQDTASTVD--PMEATALAAESGIKVYTIGISR 234 Query: 372 ----------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L ++ ++ N L ++ Sbjct: 235 RIGARAGGSGEVDEALLNAIAEATGGEYFRARNPAELQSIY 275 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 29/259 (11%), Positives = 69/259 (26%), Gaps = 25/259 (9%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 +IF LS+M +L++G + + +++ A ++A+L ASKM + + + ++ Sbjct: 1 MIFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMTESTTNAQAKDQAQV 60 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + + A + + S V M L + + Sbjct: 61 VLNAQPRMTTAI--------VTGATVSEDKRTVCATAKVTIQNSFMQMAQLATLTPTVKS 112 Query: 144 IFYNM-----DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM-------GERPIF 191 L + + Q +S + + + Sbjct: 113 CANLAGGADPGTTYEIALVLDNSGSMNSSSDGQSKISILKSAANSFVDTMFSKSNNVKFS 172 Query: 192 LIELVVDLSGSM----HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 ++ ++ + + + K AA D D L Sbjct: 173 VVPFSSGVAAVDPSEPSSRNAAWVDKNGANSQHWIAFGGKTAANAAGFTSRFDIFDKL-K 231 Query: 248 VKEDVYMGLIGYTTRVEKN 266 + + +V Sbjct: 232 ARNSALDWRGCFEPQVYPL 250 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 67/230 (29%), Gaps = 19/230 (8%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 D GS + + + + + K+ L + ++ Sbjct: 327 SDFKGSSSFCPDPGTQRILQLTQKKSDVQNKINQL---VANGATNLHEGFMWGWRTLSPN 383 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 ++ K R+ + D + A Y L Sbjct: 384 APFSGGRAYQAP------KNRKIMVFMTDGFNSWNSRVNTATGSTYDTLGYYSYNGAENE 437 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFIKIVTISI-- 369 F G + + ++ + + ++ + T + C AK I++ TI Sbjct: 438 RFPDGSQGNGVNYRSLLAAAANNSSSYQTISRAMQDELTRQACTNAKTAGIEVFTIGFSV 497 Query: 370 -NASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + Q L+K C ++ ++++ +A L F I + + S+ Sbjct: 498 SGDPIDAQGLALMKECATNEDHYFKAEDASQLNAAFSQIGIGLGKLRLSL 547 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 37/330 (11%), Positives = 87/330 (26%), Gaps = 61/330 (18%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF---------IPALLRIEMGERPIFLIELV 196 Y L + + I ++P +V Sbjct: 85 YEDSKSREIVLVLDTSGSMDQEINQLCEDCAYYCKDCDKWIYETRENHKNQKPYINHTIV 144 Query: 197 VDLSGSMHC-----------AMNSDPEDVNSAPICQD--------KKRTKMAALKNALLL 237 G +N P +S + TK+ LK A Sbjct: 145 RRFGGYYCYDCNKYIENEETHINYRPYANHSFSDKKYCNNHKAYESYTTKIHELKKAAKN 204 Query: 238 FLDSIDL-----LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-------YVTRDMD 285 F+DS+ + +++ +G++ Y N T+ R+ + Sbjct: 205 FIDSLTSTKTDGQTPNVKNLKIGIVSYNNSGYINEGLVQVTDSDRKNNGNINELKDTIEN 264 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-----------VKIPSLPFQKFII 334 T++ +++A +L + + + F G +L + Sbjct: 265 LRADGGTNTGDGLRKAAYLLNEENEANKTVIFMGDGEPTYYSSDRWGNDYTNLDDTNQYV 324 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----NGQRLLKT------CVS 384 T + + K I + K + ++ + ++ + S Sbjct: 325 GGTGYSDADGKCLSYAKTIGEIIKGEQYNVFSVGYGLGDENSASNNKMKQIHESMGGISS 384 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + ++ VFQ I+ ++ + Sbjct: 385 GENSTFFASDEGAIDKVFQQIADTIIKTSF 414 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 77/254 (30%), Gaps = 37/254 (14%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q+ ++ + R + V+D+SGSM K Sbjct: 228 QKVFSILIHLKAPKSLESVSSRAPLDLVTVLDVSGSM--------------------KGA 267 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 K+A LK A+ + ++ + +I +++ + Q + Sbjct: 268 KLALLKKAMCFVIQTLGPND------RLSVIAFSSTARRLFPLRQMNVNGRMQAIQAVNS 321 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+ + +K+ +++ + ++ + + D N Sbjct: 322 LVDGGGTNISDGLKKGAKVIEHRRLKNPVCSIILLSDGQDTYSVP----TFDDELQTNHS 377 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + V + N ++I T + + +S + S+ + F Sbjct: 378 ALVPPSILPGT--GNHVQIHTFGFGMDHDSAAMHAIAETSSGTFSFIDAEGSIQNGF--- 432 Query: 406 SQLMVHRKYSVILK 419 +Q + SV++K Sbjct: 433 AQCIGGL-LSVVVK 445 >gi|326530406|dbj|BAJ97629.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 657 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 28/245 (11%), Positives = 74/245 (30%), Gaps = 34/245 (13%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + +R + V+D+S SMH +K+A LK A+ Sbjct: 161 GTEAAGDDPAQRAPVDLVTVLDVSSSMH--------------------GSKLALLKQAMR 200 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDST 295 +D + + ++ +++R + + ++ + V T+ Sbjct: 201 FVIDILGPDD------RLSVVSFSSRARRVTRLTRMSDAGKALCVRAVESLTARTGTNIA 254 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKIC 354 ++ A ++L + R+ + + + G N + + Sbjct: 255 EGLRTAAKVLDERRHRNGVSCVVLLSDGQDNYTPMRQ--AFGRGLPNYAALLPPSFARTG 312 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 A + + T + + ++ + N + F +Q + Sbjct: 313 TGAGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENEAVIQDAF---AQCVGGL-L 368 Query: 415 SVILK 419 SV+++ Sbjct: 369 SVVVQ 373 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 81/231 (35%), Gaps = 44/231 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + P + L +DLSGSM DP ++ +++A+K + F+ Sbjct: 80 VRDEPARDLMLAIDLSGSMATRDFVDP---------AGERMDRLSAVKRVVANFVAK--- 127 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ + + + VR + + + T A+ ++ Sbjct: 128 ----RKGDRIGLVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKL 183 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +K +I LTDG N+ S + + + AK++ + + Sbjct: 184 MADS------------------HAQEKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVV 223 Query: 365 VTISINASP-------NGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ 407 T+ I + + L + + + + + L V+ + + Sbjct: 224 HTVGIGDPGTTGEDRVDLEALARIASITGGRAFRALGQEKDLTEVYATLDK 274 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 202 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 251 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 252 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 304 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 305 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 344 Query: 361 FIKIVTISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ S + L K + + + + L ++ Sbjct: 345 GVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 404 Query: 404 NISQ 407 + + Sbjct: 405 ELDR 408 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 71/223 (31%), Gaps = 43/223 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + N T++ ++K L FL +E Sbjct: 94 DLLLAVDLSGSMETKDFKNKSGEN---------VTRLDSVKEVLSEFLAE-------REG 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + + E ++ + + T A+ + I + Sbjct: 138 DRVGLVFFGSAAFIQMPFTEDLEICQELMDEAQVRMAGPQTMLGDAIGLSISIFDQSELE 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K +I LTDG + K A++ I I T+++ Sbjct: 198 D------------------KVLILLTDGNDTGSLV--APEKAAQIARDKGIVIHTVAVGD 237 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + +Y N + L ++ I + Sbjct: 238 PAAAGEQALDEATLRSISSLTKGKYYWAGNREELAGIYDEIDK 280 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 78/244 (31%), Gaps = 54/244 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R + L VDLSGSM + K ++ A K L FLD Sbjct: 202 GEVIQPPREARQMMLAVDLSGSMSEPDM----------VLGGKVVDRLTAAKAVLSDFLD 251 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ +GL+ + R + VR + + L + T A+ Sbjct: 252 R-------RDGDRVGLLVFGQRAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIAL 304 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + L K+ Q+ ++ LTDG N N +K + AK Sbjct: 305 SVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN--PLKAAELAKAE 344 Query: 361 FIKIVTISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ TI+ S + L K + + + + L ++ Sbjct: 345 GVRVHTIAFGGSGGYSLFGVPIPAGGDDDIDEDGLRKIAQQTGGRFFRARDTEELAGIYA 404 Query: 404 NISQ 407 + + Sbjct: 405 ELDR 408 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 72/239 (30%), Gaps = 54/239 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + ++ A+K + F++ + Sbjct: 112 DLMLCIDLSGSMLSEDYERG----------GDRINRLQAIKPVIQAFIER-------RPS 154 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ ++ R ++ + + R LI T + L ++ Sbjct: 155 DRIGIVLFSGRAYTMAPLTFDHRWLGSQLERIKVGLIEDGTAIGDGLGVGLTRLEQAQRE 214 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 S F++ LTDG NN + + + AK I + TI Sbjct: 215 S------------GGKRQGAFVVLLTDGANNRGS--LTPQQAAELAKARGIPVYTIGAGQ 260 Query: 369 --------------------INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + L + + + + ++ F+ I + Sbjct: 261 DGIVPFPVFDDKGRKLGYRRIMSDLDEGALRDIAEMTGGHFFRAADVGTVESAFRAIDR 319 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 72/234 (30%), Gaps = 56/234 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM + ++ A K L FLD ++ Sbjct: 107 LMLAVDLSGSMSDPDM----------RLGGRVVDRLTAAKAVLADFLDR-------RDGD 149 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + + VR + + L + T A+ A + L Sbjct: 150 RIGLLVFGQQAYALTPLTADLATVRDQLRDSVVGLAGRETALGDAIALAVKRL------- 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 ++ +I LTDG N + +K + AK +++ TI++ Sbjct: 203 -----------REQPQGERVLILLTDGVNTAGVLD--PLKAAELAKAEHVRVYTIALGGD 249 Query: 373 P-------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + + + L ++ + + Sbjct: 250 GGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFRARDTAQLASIYAELDR 303 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 84/272 (30%), Gaps = 54/272 (19%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEM--GERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + P L R + V+D+SGSM Sbjct: 124 VSASESFSNFGVLVRILAPPLDNTLPHHRARAPIDVVTVLDVSGSMA------------- 170 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 +K+ LK A+ + ++ + ++ +++ + + + R Sbjct: 171 --------SKLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARRMLPLRRMSGSGR 216 Query: 278 QYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T +DS+ + T+ +K+ Q+L ++ + II L Sbjct: 217 EDATSVVDSISAIGGTNIVAGLKKGVQVLEERRQHNSVA----------------TIILL 260 Query: 337 TDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +DG + + + + +++K+ + T + + + +S + + Sbjct: 261 SDGCDTQSHNAQNRLDYLKEESKQPTFPVYTFGFGSDHDSAAMHAISDASRGTFSFIESI 320 Query: 396 DSLIHVFQN-ISQLMV------HRKYSVILKG 420 + L F I L+ K G Sbjct: 321 NILQDAFARCIGGLISIVARDVQLKVRSASPG 352 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 93.4 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 84/265 (31%), Gaps = 43/265 (16%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + + + + E+ R F + ++VDLSGSM D Sbjct: 67 MLILSWMLIVSALAKPSILGEVQTREDFGRDVLMLVDLSGSM---------DEADFTTAD 117 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T++ A KN L F+ + GLI + + + + Sbjct: 118 GSTLTRLNAAKNVLKTFIAK-------RSGDRFGLILFGDAAFIQTPFTADQQVWLSLLE 170 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T A+ ++ + P Q+ +I LTDG + Sbjct: 171 EAQTGMAGQSTHLGDAIGLGIKVFEQN----------------PQPSEQQVMIVLTDGND 214 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVN 394 S V + A IKI TI++ + + + + + + ++ Sbjct: 215 TG--SFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAID 272 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 L +Q I +L + S + Sbjct: 273 QAELDKAYQLIDKLEPQQYSSASFR 297 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 77/233 (33%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + + ++ +K L F++ ++ Sbjct: 88 DIMLAVDLSGSMTEQDMAYN----------GQYVDRLTMVKAVLTDFIEQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + ++ L+ + T A+ + + + K Sbjct: 131 DRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFANKDKS 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG+N N+ + A++ IK+ TI + + Sbjct: 191 N------------------RIVVLLTDGQNTAG--NLKPEEALLLARDAGIKVYTIGVGS 230 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ + L ++ + + Sbjct: 231 DNPRGFSLFNMGGMSGDTIDEGLLKRIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 74/244 (30%), Gaps = 59/244 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+SGSM + K+ ++ +K+ + F+ + + Sbjct: 88 DILLAVDVSGSMQAMDFTLN----------GKRTNRLEVVKDVMAKFISQ-------RPN 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R P+ + + +I T A+ L Sbjct: 131 DSIGLVAFAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDGTAIGSAIGTGVNRLREK--- 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--- 368 + II LTDG NN K + + AK +K+ TI Sbjct: 188 ---------------KSPSQIIILLTDGINNAGKVP--PLIAAEAAKSFKVKVYTIGAGT 230 Query: 369 -----------IN--------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + L K + ++ + +SL V+ I+ + Sbjct: 231 RGEAPIPITDAFGRRQLVRARVDIDDKTLSKVAQITGARYFRATDTESLEKVYAEINSME 290 Query: 410 VHRK 413 + Sbjct: 291 TTSR 294 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 74/226 (32%), Gaps = 47/226 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM + +++ A+K F + + Sbjct: 91 DLLLAIDLSGSMQIEDM----------QIGNSLVSRITAVKAIAADF-------ASRRTG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ + V+Q++ + T A+ A + L Sbjct: 134 DRVGLILFGTRAYVQAPLTFDVKTVKQFIEEAQLGFAGEDTAIGDALGLAVKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +I LTDG++ + ++ A E +KI TI I+ Sbjct: 188 ------------RERPADSRVLILLTDGQDTASTVD--PMEAAALASEMNVKIYTIGISR 233 Query: 372 ----------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L ++ ++ L ++Q + + Sbjct: 234 RLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDIYQVLDE 279 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 78/253 (30%), Gaps = 64/253 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + I + VD+S SM + +D + ++ ALKN Sbjct: 79 TFTISENNDDTKGIDIMMSVDVSLSM---------------LARDLEPDRLTALKNIAKK 123 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+D + +GL+ Y+ + + + + + + T Sbjct: 124 FVDK-------RPGDRIGLVTYSGEAFTKVPVTSDHAVLLEELENLNPLELQPGTAIGEG 176 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L K + K II +TDG N + + A Sbjct: 177 LSVAVSHLRHSKAK------------------SKIIILMTDGVNTIENAMPAQVG-AQLA 217 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 K N I++ +I I + + L + ++ ++ + Sbjct: 218 KSNDIRVYSIGIGTNGYALMPTQTDIFGDLVFTEVEVKIDEPVLREIAQTTGGKYFRATS 277 Query: 395 ADSLIHVFQNISQ 407 SL V++ I+Q Sbjct: 278 NQSLEEVYEEINQ 290 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 71/234 (30%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + ++ +K L F+ + Sbjct: 98 DLMVAVDLSGSMQTQDMVVN----------GNEVDRLVMVKTVLGDFIQR-------RVG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ V Q ++ + L+ T A+ A + + Sbjct: 141 DRIGLILFADTAYLQAPLTFDRTTVEQLLSETVIGLVGDSTAIGDAIGLAAKRFSDK--- 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K ++ LTDG+N N+ + A + IKI I + A Sbjct: 198 ---------------PNVNKVLVLLTDGQNTAG--NITPDQALSLAVDQNIKIYPIGVGA 240 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ + L +++ + + Sbjct: 241 DAMTVNSLFGQRQVNPSADLDEGLLTRLAKDTGGQYFRARDTQELEQIYRLLDR 294 >gi|219363061|ref|NP_001136879.1| hypothetical protein LOC100217033 [Zea mays] gi|194697454|gb|ACF82811.1| unknown [Zea mays] Length = 459 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 62/197 (31%), Gaps = 18/197 (9%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTR 282 K+A LK A+ ++++D + ++ +++ + T +Q + Sbjct: 2 SGHKLALLKRAMRFVIENLDPSD------RLSVVAFSSSAWRLFPLQRMTASGQQQSLQA 55 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ +++A +++ + R+ + + S D Sbjct: 56 VNSLAADGGTNIAEGLRKAARVVEDRQARNPVCSIMLLSDGVDSHNLPPRDGSAPDYGPL 115 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +S + E+ + I + + + S + A S+ F Sbjct: 116 VPRSILP-------GSEHHVPIHAFGFGMDHDSRAMHAVAQMSSGTFSFIDMAGSIQDAF 168 Query: 403 QNISQLMVHRKYSVILK 419 +Q + SV+ + Sbjct: 169 ---AQCIGGL-LSVVAQ 181 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 93.4 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 76/246 (30%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VDLSGSM + + + +++ + F++ ++ Sbjct: 84 DLMIAVDLSGSMQIEDM----------VLNGQAVDRFTMIQDVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALGVKRFD----- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN+ + + A + +KI I + A Sbjct: 182 -------------MVDKSNRILVLLTDGSNNSGSIS--PEQAAAIAAKRGVKIYAIGVGA 226 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L + ++ ++ L ++Q I +L + Sbjct: 227 DVMERRSIFGTERVNPSMDLDEAQLSSLAKITGGLYFRARSSQDLQQIYQEIDKLEPISR 286 Query: 414 YSVILK 419 + + Sbjct: 287 DQLSYR 292 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 93.4 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAVDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGESFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPMTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYVSMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 93.4 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 67/226 (29%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L VD+SGSM + P+ + ++ A+K+ + F+ Sbjct: 91 TSAARDLVLAVDISGSMDDRDMTAPD---------GTRLQRLQAVKDVVGAFVAE----- 136 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E + LI + + + + V + + + + T A+ A + Sbjct: 137 --REGDRISLIVFGAKPFIQAPFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFE 194 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + +I L+DG + I A + I I T Sbjct: 195 DSEIEE------------------RLLILLSDGADTASTMT--PINAAQIAAQEGITIYT 234 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + L Y + + L ++ I Sbjct: 235 IGVGNPDGSGEERLDPATLEDIATRGGGAFYFADDVEGLSEIYAEI 280 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 74/226 (32%), Gaps = 36/226 (15%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + T++AA + F+ + + Sbjct: 114 DIMLAMDVSVSMSESD---------------ILPTRLAAARRVAQAFVR-------GRRN 151 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + + QY+ D +I T A+ + + Sbjct: 152 DRIGLVIFAGEAFSLCPLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALARCINRMRDRPA 211 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S T + + K II L+DG+N + I AK IKI TI++ Sbjct: 212 ASSDT--TQAKTEQWKSERSKVIILLSDGDNTAGNLD--PITAASLAKAFNIKIYTIAVG 267 Query: 371 AS---------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + V++ L VF ISQ Sbjct: 268 QPVASASEASTVDEGILKKIATIGKGSFFRAVDSGRLKTVFAQISQ 313 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 78/246 (31%), Gaps = 55/246 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLSGSM + + + +++ + F++ ++ Sbjct: 84 DLMLAVDLSGSMQIEDM----------VLNGQTVDRFTMIQDVVSDFIER-------RKG 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + V Q++ L+ K T A+ + Sbjct: 127 DKLGLILFADHAYLQAPLTQDRRSVAQFLQEAQIGLVGKQTAIGEAIALGVKRFD----- 181 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ LTDG NN+ + + A + +KI I + A Sbjct: 182 -------------MVDKSNRILVLLTDGSNNSGSIS--PEQAAAIAAKRGVKIYAIGVGA 226 Query: 372 SP------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +L+ ++ ++ ++ L ++Q I +L + Sbjct: 227 DVMERRSIFGTERVNPSMDLDEAQLISLAKTTGGLYFRARSSQDLQLIYQEIDKLEPISR 286 Query: 414 YSVILK 419 + + Sbjct: 287 DQLSYR 292 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 79/246 (32%), Gaps = 64/246 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + + L + ++ ++ + L V++ I + + Sbjct: 227 NGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK-LE 285 Query: 411 HRKYSV 416 K SV Sbjct: 286 KTKLSV 291 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 38/352 (10%), Positives = 106/352 (30%), Gaps = 39/352 (11%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + + D+ F + + S + T N + ++++ Sbjct: 1219 STKIGEKDSATFTITNTYTPIDINSVIEYNKTATLLDWNQRTYKIDL-------TASSKT 1271 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS----FIPALLRIEMGERPIFLIELVVDLS 200 +M L + + ++ F I+ + D + Sbjct: 1272 TQSMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKTYYIKTQNGIYQQLSWSWDNT 1331 Query: 201 GSMHCAMNSDPEDVNSA----PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 S + + V+ + Q +TK+ ALK+A F++++ + D +G+ Sbjct: 1332 WSYTDSYSGKTVTVDPNTTDVYVAQKSNQTKIDALKSAATTFVNNVA---NKNSDCRVGI 1388 Query: 257 IGYTTRVEKNIEPSWGTEKVRQ-------YVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ + + + + + T + +A +I + + Sbjct: 1389 VTFSNDGY-IKPITNNSYTLAKVGTSKGDIINTIDGLKTGGDTYPAKGLDKANEIFSENS 1447 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS- 368 S+ T G K + + + N + + + + Sbjct: 1448 SNSWETVEQTDGRKKMVVFLTDGVPAPANTNNFDENLAGAGTNSAKILHDQGVATYALGI 1507 Query: 369 INASPNGQRL-----------LKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 A+ + + +++ SS E + + D+L +F++I+ + Sbjct: 1508 FGAANSDGTMDNASVQRIDKYMQSIASSHEKYMTADSVDNLSSLFESITNNI 1559 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 18/206 (8%), Positives = 58/206 (28%), Gaps = 16/206 (7%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 TK+A LK A+ + ++ + +I +++ + T +Q Sbjct: 294 SGSMAGTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLRRMTESGRKQ 347 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ +++ +++ + ++ + + Sbjct: 348 SLLAVNSLTSNGGTNIAEGLRKGSKVIEERQAKNPVCSIILLSDGQDTYTVSPSTGAHKP 407 Query: 339 GENN-NFKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + + + + C + + A + L +S + Sbjct: 408 YTVSPTAGAQKASAEYCALLPSTNGSQQVPVHVFGFGADHDAVSLHSISQTSGGTFSFIE 467 Query: 394 NADSLIHVFQNISQLMVHRKYSVILK 419 ++ F +Q + SV+ + Sbjct: 468 TEATIQDAF---AQCIGGL-LSVVAQ 489 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 35/115 (30%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R + N II ALS++ L+ +G + + + M+S +AA++A Sbjct: 9 FGRAFAALRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIA 68 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 ++ + + + + I +A Sbjct: 69 AVKQINNTGDTDALKLKVTDWFHAQVENSYTLGEIDIDTTNHNITATASGTVPTT 123 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 93.0 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 71/216 (32%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F++ + + +G+ + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKDVATEFIN-------GRPNDNIGITLFAGESFTQCPLTVD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + I T + A L K + K Sbjct: 148 HAVLLNLLKDMKCGFIEDGTAIGMGLANAVTRLKDSKAK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG NN +++ + + AK I++ TI + + Sbjct: 190 IILLTDGVNNKG--DISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPV 247 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ + L V++ I + Sbjct: 248 EIDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 93.0 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 77/233 (33%), Gaps = 55/233 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLS SM ++ +++ +K + F+ ++ Sbjct: 87 MMLAVDLSRSMQAEDMQIN----------NRMVDRLSLVKTVVADFIQQ-------RKGD 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V Y+ + + L+ + T + A + + Sbjct: 130 RVGLIFFADNAYLQAPLTFDLKTVSGYMQQAVLGLVGEQTAIGEGIGLALKRFDAA---- 185 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 QK +I LTDG+N+ + + A+E +KI TI + A Sbjct: 186 --------------DNPQKVLILLTDGQNSAGEV--KPLDAAKFAQEQGVKIYTIGVGAD 229 Query: 373 P------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + ++ +A SL ++ + + Sbjct: 230 AYYKRTLFGNQKVDPSRDLDEVTLKTIAAQTGGQYFRARDASSLAAIYAELDK 282 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 92.6 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ LV+D+S SM ++AA K A F D + Sbjct: 99 NRAVVMLVIDVSESMASNDVP---------------PNRLAAAKEAGKQFADQLTPA--- 140 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ V+ + T + + A Q + + Sbjct: 141 ---INLGLVEFAANATLLVPPTTNRGAVKSGIDSL---QPAPKTATGEGIFTALQAIATV 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIKIV 365 I+ +DG N AK ++I Sbjct: 195 GSVMGGGEGP----------PPARIVLESDGAENVPLDPNAPQGAFTAARAAKGQGVQIS 244 Query: 366 TISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TIS + Q L K C + ++ + DSL +V+ + + + Sbjct: 245 TISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGEAFHADSLDSLKNVYTTLQRQIG 303 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 44/312 (14%), Positives = 83/312 (26%), Gaps = 64/312 (20%) Query: 159 QFIEHLL-NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-GSMHCAMNSDPEDVNS 216 F++ +L ++ IP + G R + G S + + Sbjct: 15 DFLDLMLEGDLKKYTSINLIPFGGTVNTGPRLLSRYFPTRGYMDGQPASEYGSGSQITSK 74 Query: 217 AP------ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 C + K L + L + L + ++ + +R+ N Sbjct: 75 LYRFLGAKHCIEYKPEDFD-LNDIPLNSRAQLPHLYYWRKTNPWCPENFASRMYLN---R 130 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF---FRQGVKIPSL 327 ++ V R S T + + L+ + + + Sbjct: 131 NNLGGLKAAVDRLTLSD---GTGMDIGLLWEAKALSPKLRTAAALDGGLLPGHPTDWSDK 187 Query: 328 PFQKFIIFLTDGENNN-------------------------------------------F 344 QK I+ +TDG Sbjct: 188 QTQKVIVLMTDGGITAQYRPKDPWKGLNPKDMRRGIVNARRNVQYVTTRGNMNSPANSKH 247 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVF 402 S +CD+AK I I T+ N L C +SP ++Y V + L F Sbjct: 248 NSVAYMKTMCDQAKAKGIIIYTVGFQIRRNTLPDLSLSYCATSPSHYYFVES-SDLSAAF 306 Query: 403 QNISQLMVHRKY 414 + I+ + + Sbjct: 307 KAIASSIKSLRI 318 >gi|262164956|ref|ZP_06032694.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] gi|262027336|gb|EEY46003.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] Length = 403 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 44/416 (10%), Positives = 102/416 (24%), Gaps = 44/416 (10%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRFESISNH 85 + + ++++ F + + + A+ A LA + K + R Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKVVDRYV 60 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 K + S+ V ++ + ++ + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYENISLKPEFTV 120 Query: 146 YNMDV-------MTSYDYRLQFIEHLLNQRYNQKI-VSFIPALLRIEMGERPIFLIELVV 197 V + + + N K + + ++ + + F E+ Sbjct: 121 NGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKEVITRVVNDLKGFDTEVKS 180 Query: 198 DLS--GSMHCAMNSDP-----EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +S + + D Q K Sbjct: 181 RVSLLAYHNYNIKQGSRTLEAYDYALYNTPQQTVSNMFLPPKRVTPSDSALFSHRDIDLT 240 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 Y + S TE + + + TD P +Q + IL+ Sbjct: 241 QNYSSFLQIMNDRNFYPPKSACTESWQGIIAAAQAADKA--TDINP--EQVFIILSDGAD 296 Query: 311 RSFFTNFFRQGVKIPSLPFQKFII--FLTDG-----------ENNNFKSNVNTIKICDKA 357 S+ + + K + + G + N F+S+ T Sbjct: 297 CSWQ-----RQDRWGRWLTTKNYLKKLVDSGLCKNLKQRIRQKPNRFQSSTPTENE---- 347 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + I +N N CV ++ D ++ I++ K Sbjct: 348 -KTKVTMGVIGVNYQVNPNDGFGDCVGRENIYHATQGEDVYKYILNLINEETGRLK 402 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 38/423 (8%), Positives = 98/423 (23%), Gaps = 50/423 (11%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + ++ A+ ++ + + + V ++ ++ + ++A LAGA ++ Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATG 78 Query: 78 ----------------------RFESISNHAKRALIDDAKRFI--KNHIKESLSGYSAVF 113 +++ + + + ++ Sbjct: 79 TVHEYLTHNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKNPT 138 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 + N + + D + +I L + + Sbjct: 139 TVSVRMTYENMPFFFGHLLGRDSFDITAESIATYQSR--DIMLVLDLSGSMNDDSEF-NS 195 Query: 174 VSFIPALLRIEMGERPIFLI--ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + ++ + + +L A+ + P +S + + Sbjct: 196 IGKLGFDHIYSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSVTYRGNSVV 255 Query: 232 KNALLLF--LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + + YT K I W T + + S Sbjct: 256 VKSDKTIKQISVKTSNGSTYNYYPGSSLNYTANPNKEIRYVWVTSGKNSNNSDQVQSFDF 315 Query: 290 KPTDSTPAMKQAY---QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +K A + S+ + F N Sbjct: 316 DGQRINT-IKTALGLDNLAYPYPGGSWNDYVNYCLGTGQNNNAGYRYRFGYFNWINYLLE 374 Query: 347 NVNTIKIC-DKAKENFIKIVT--------ISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + D K + I I +G+ + V YN N D Sbjct: 375 RQYSSNSTPDLWKASAQPITAVKNSVDLFIHFMQEGDGRDRIGLAV------YNAPNGDG 428 Query: 398 LIH 400 L+ Sbjct: 429 LLE 431 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/368 (11%), Positives = 104/368 (28%), Gaps = 52/368 (14%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + +S + +++ + + IK ++ +G + +Y N + + ++ Sbjct: 238 QSSGQAKSSVTYRGNSVVVKSDKTIKQISVKTSNGSTYNYYPGSSLNYTANPNKEIRYVW 297 Query: 134 NNRLDSSNNTIFYNM-------DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 +SNN+ L L ++ L Sbjct: 298 VTSGKNSNNSDQVQSFDFDGQRINTIKTALGLDN---LAYPYPGGSWNDYVNYCLGTGQN 354 Query: 187 ERPIFLIELVVDLSGSMHCAMN--SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + G + NS P + A+KN++ LF+ Sbjct: 355 NNAGYRYRF-----GYFNWINYLLERQYSSNSTPDLWKASAQPITAVKNSVDLFI-HFMQ 408 Query: 245 LSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ + + + + + + T+ M + Sbjct: 409 EGDGRDRIGLAVYNAPNGDGLLESTLTENLPFIMTQSRQRQAGHYHNYTNIGGGMTVGRE 468 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-------SNVNTIKICDK 356 L + ++ K ++ LTDG+ N + + Sbjct: 469 ELQTRGRKGAV----------------KMMVLLTDGQANWVNGGVNNNAAKNYVLNEAYL 512 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-------DSLIHVFQNISQLM 409 + I+TIS+ A + + + + H+NV + L +F+ ++ Sbjct: 513 CADQGFTIITISLGAGADKALMDQVAEITGGVHFNVPGGQTVDEYSEDLTEIFRQVA--- 569 Query: 410 VHRKYSVI 417 HR ++ Sbjct: 570 GHRPLKLV 577 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 36/270 (13%), Positives = 80/270 (29%), Gaps = 66/270 (24%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIF---LIELVVDLSGSMHCAMNSDPEDVNSAP 218 +L I+ + + I + +D+S SM Sbjct: 56 PFMLRIIALALIIVVLARPQTTNSWQNSEIEGIDIMMAIDVSTSM--------------- 100 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + +D K ++ A K+ F++ + + +G+ + + + Sbjct: 101 LAEDLKPNRLEAAKDVAAEFIN-------GRPNDNIGITLFAGESFTQCPLTVDHAVLLN 153 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +I T + A L K + K II LTD Sbjct: 154 LFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAK------------------SKVIILLTD 195 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQR 377 G NN +++ + + AK I++ TI + + + + Sbjct: 196 GTNNKG--DISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKT 253 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + ++ ++ + L V++ I + Sbjct: 254 LTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 76/239 (31%), Gaps = 43/239 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 +P + + +DLS SM DP ++ A++ + F+ Sbjct: 91 QPARDLLIALDLSQSMDTRDFGDPS---------GALIPRVQAVRQVVSGFVAR------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GLI + + + V+ +T + + T A+ ++ Sbjct: 136 -RPGDRIGLIVFGDAPYPLAPFTLDHQLVQTLITGLLPGMAGPSTALGDAIGLGIKMFEH 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K +I LTDG + + + AKE + + TI Sbjct: 195 SEA------------------PEKVLIVLTDGNDTASRMP--PERAGGIAKERKVVVHTI 234 Query: 368 SINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I L + + ++ + L ++ + ++ H + ++ + Sbjct: 235 GIGDPNASGEEKVDLGVLQRLAAQTGGRYFFGADQAGLETIYATLDRITPHNQKTLSWR 293 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 29/229 (12%), Positives = 79/229 (34%), Gaps = 43/229 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L +D+SGSM +D + N ++A +++ + F++ Sbjct: 89 TSKSARDLILAIDISGSMDTRDFTDASNENL---------QRLAGVRDVVRAFVE----- 134 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E M LI + ++ + T + + + + + T A+ + + Sbjct: 135 --GREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTF 192 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + Q+ +I L+DG + + + + + A+ ++I Sbjct: 193 EAS------------------EIEQRLLILLSDGADTASRMS--PLNAAEIARGAGVEIF 232 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI++ + L + ++ + +L ++ I + Sbjct: 233 TIAVGDPDGTGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 36/270 (13%), Positives = 80/270 (29%), Gaps = 66/270 (24%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIF---LIELVVDLSGSMHCAMNSDPEDVNSAP 218 +L I+ + + I + +D+S SM Sbjct: 56 PFMLRIIALALIIVVLARPQTTNSWQNSEIEGIDIMMAIDVSTSM--------------- 100 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + +D K ++ A K+ F++ + + +G+ + + + Sbjct: 101 LAEDLKPNRLEAAKDVAAEFIN-------GRPNDNIGITLFAGESFTQCPLTVDHAVLLN 153 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +I T + A L K + K II LTD Sbjct: 154 LFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAK------------------SKVIILLTD 195 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQR 377 G NN +++ + + AK I++ TI + + + + Sbjct: 196 GTNNKG--DISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKT 253 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + ++ ++ + L V++ I + Sbjct: 254 LTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 92.6 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 29/229 (12%), Positives = 79/229 (34%), Gaps = 43/229 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L +D+SGSM +D + N ++A +++ + F++ Sbjct: 89 TSKSARDLILAIDISGSMDTRDFTDASNEN---------VQRLAGVRDVVRAFVE----- 134 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E M LI + ++ + T + + + + + T A+ + + Sbjct: 135 --GREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTF 192 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + Q+ +I L+DG + + + + + A+ ++I Sbjct: 193 EAS------------------EIEQRLLILLSDGADTASRMS--PLNAAEIARGAGVEIF 232 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI++ + L + ++ + +L ++ I + Sbjct: 233 TIAVGDPDATGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|42407700|dbj|BAD08848.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408122|dbj|BAD09262.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|125602049|gb|EAZ41374.1| hypothetical protein OsJ_25891 [Oryza sativa Japonica Group] Length = 704 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 17/186 (9%), Positives = 55/186 (29%), Gaps = 13/186 (6%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMD 285 K+ LK A+ +D + + ++ ++ ++ I + ++ + Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAMES 300 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ + +A ++ + R+ + + G +N+ Sbjct: 301 LAAGGGTNILKGLVEAAKVFDGRRYRNAVASVILLSDGQDTYNVNGG-----WGASNSKN 355 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN- 404 +V ++ + + + T + + + + N + F Sbjct: 356 YSVLVPPSFKRSGDRRLSVHTFGFGTDHDAVAMNAIAEETGGTFSFIENQAVVQDAFAQC 415 Query: 405 ISQLMV 410 I L+ Sbjct: 416 IGGLLS 421 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNVGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNRG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 74/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNVGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN + + + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNRGEIS--PLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 66/216 (30%), Gaps = 49/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + QD ++ K F+D ++ +GL+ Y + Sbjct: 109 DASGSMLAQDFDPNRLEVAKRVAKKFVD-------SRKGDRVGLVVYEGEAYTACPATLD 161 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + +++ ++ + T + A L K Sbjct: 162 YKLLKEQISAIEPGHLEPGTAIGSGLGVAVTRL------------------RSDSLISKV 203 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG +N +++ + AK ++ TI + A Sbjct: 204 IILLTDGSSNTG---PEPLEVAELAKAKKCRVYTIGVGADGMAPTPVNTPFGVVYQNLPV 260 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + ++ ++ + SL ++ I + Sbjct: 261 EIDEGVLKEIASATNGKYFRAQDEKSLEKIYAEIDK 296 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 70/225 (31%), Gaps = 49/225 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + QD K ++ A K+ F++ + + +GL+ ++ + Sbjct: 96 DISGSMLAQDLKPNRLEAAKDVAASFIN-------GRPNDNIGLVVFSAESFTQCPLTTD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + +I T + A + S K Sbjct: 149 HTVLLNLFKDIQSGMIQDGTAIGLGLANAVSRI------------------KDSHAKSKV 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 II LTDG NN + + + AK +++ TI + Sbjct: 191 IILLTDGSNNAGEI--APVTAAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPV 248 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + L + ++ ++ + SL ++ I M K SV Sbjct: 249 EIDEATLKQIASTTGGQYFRATDNASLKEIYSEI-DQMEKTKISV 292 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 21/213 (9%), Positives = 56/213 (26%), Gaps = 11/213 (5%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 I A S++ + ++G + + K+ ++ A +AA LA ++ + G +I + Sbjct: 4 IGAASIIPLVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPANIQD 63 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 A + + + + + + M ++D + N Sbjct: 64 KADNFFDTNFPSGMYGTTNVGYTLSAGTATQMDGAATASVPTTLMKVFNVPQIDIAVNCS 123 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS-----------FIPALLRIEMGERPIFLI 193 L + + + + I ++ Sbjct: 124 AELDLPNIDVVLVLDMSGSMNSNGTTGSKRITALKNAVFSFYDVVMAAKPAGTRVRIGIV 183 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +S S ++ D + Sbjct: 184 PYNGAVSVGDELLTLSTTTGIDYLADSWDYQTR 216 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 54/204 (26%), Gaps = 39/204 (19%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 L + G + G+ + +V+R T M A Sbjct: 428 ATLETTDDKPTRGYNCPNRTHKLQEYLLNGSARNADFVSRINALSPKGGTMHDIGMIWAG 487 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-------------- 348 ++++ D + G + +IF+TDGE SN Sbjct: 488 RLISPDGIFAADNASAPNGDP-----ISRHVIFMTDGEMGASPSNTTAYGNYDMDGRMAG 542 Query: 349 ------------------NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 IC + + I +I+ + + C + Sbjct: 543 FAASGSWTENQLAAIHNLRLEAICKAIRNKNVTIWSIAFGLPHSAYT--QGCATGTSRAL 600 Query: 391 NVVNADSLIHVFQNISQLMVHRKY 414 N+ L F++I+ + + Sbjct: 601 TAANSSELDSRFRDIAGSIAELRL 624 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 77/266 (28%), Gaps = 69/266 (25%) Query: 169 YNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ P I L VD+S SM + +D K + Sbjct: 65 IMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSM---------------LAEDLKPNR 109 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---- 283 + A K F++ + + +GL + + + Sbjct: 110 LEAAKQVASEFIN-------GRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDI 162 Query: 284 -MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI T + A L K + K II LTDG NN Sbjct: 163 AQRGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKVIILLTDGSNN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKT 381 +++ + + AK+ I+I TI + + + + L + Sbjct: 205 RG--DISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEI 262 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ ++ + L V+Q I + Sbjct: 263 AGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 64/207 (30%), Gaps = 39/207 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +M K F+ + + +GL + + Sbjct: 95 DVSASMLTEDVYPNRMVVAKEVASEFI-------SGRPNDNIGLTIFAGEAFTQCPMTLD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + L+ T + A L K + Sbjct: 148 HAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLK 380 K +I LTDG NN + + A++ I+I TI + + L Sbjct: 192 --SKIVILLTDGSNNAGSIS--PMTAAAIARKFGIRIYTIGFGKETGEEIGAIDYKTLQD 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ 407 VS+ Y + L ++Q+I + Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 77/266 (28%), Gaps = 69/266 (25%) Query: 169 YNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ P I L VD+S SM + +D K + Sbjct: 65 IMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSM---------------LAEDLKPNR 109 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---- 283 + A K F++ + + +GL + + + Sbjct: 110 LEAAKQVASEFIN-------GRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDI 162 Query: 284 -MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI T + A L K + K II LTDG NN Sbjct: 163 AQRGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKVIILLTDGSNN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKT 381 +++ + + AK+ I+I TI + + + + L + Sbjct: 205 RG--DISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEI 262 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ ++ + L V+Q I + Sbjct: 263 AGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 20/211 (9%), Positives = 60/211 (28%), Gaps = 23/211 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 KM +K A+ +D++ + ++ ++T + I + ++ + Sbjct: 134 SGSMAGKKMERVKRAMGFLIDNLGSDD------RLSVVAFSTDARRIIRLTRMSDDGKAA 187 Query: 280 VTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS----------LP 328 R ++SL T+ + A +L + ++ + + Sbjct: 188 AKRAVESLAASGSTNIRGGLDVAAMVLDGRRHKNAVASVILLSDGQDNQSMHHEYLPTSW 247 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 K + G + + + + T + + + Sbjct: 248 VPKHSPAFSKGGYDVLVPPSFQRTAGGDHRC--VTVHTFGFGIDHDAAAMHYISEVTGST 305 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + N + F ++ + SV ++ Sbjct: 306 FSFIENHAVIQDAF---ARCIGGL-LSVAVQ 332 >gi|187919333|ref|YP_001888364.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] gi|187717771|gb|ACD18994.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] Length = 451 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 42/446 (9%), Positives = 117/446 (26%), Gaps = 50/446 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ + SI A+S+++ L ++G + + K +++A + A++A + + Sbjct: 11 FRRRQRGSVSIFVAVSLIALLGILGLAVDSGFGYMIKARLDAATDGAVIAAGEAVTRGNN 70 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ--NIVNSSRISMTH 131 + + + F+ + + I + Sbjct: 71 QTEQTNNAQQAATAFFAANYPAGFLGSSVSAGTPSIVFNAGTVTIGMTAQASVPVTFSKV 130 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-YNQKIVSFIPALLRIEMGERPI 190 + N L+ S+++ + + + L + + L ++ + Sbjct: 131 LGFNVLNVSSSSQAIR--KTLDMVFVIDNTKSLNTSGVPAAVRSNAVAFLNNFDVTNDRV 188 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 L+ + N+ D + +K N+ ++ + L+ V Sbjct: 189 ALMHFAYGTVVDVPFKGNTRGFDRATMTTDINKYT--FDGSTNSPEALWNARNQLNTVIT 246 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDK 309 + S + +L S A + L+ Sbjct: 247 QPSSLRVIVFFSDGAPNSFSSFFTTNQSKCNNISTTLASGDGPSGSASGLFSINALSQKL 306 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFII--------------FLTDGENNNFKSNVNTIKICD 355 + N V + + +++G N + + + Sbjct: 307 ASPCYQNDTSGFVTAMPKWYNAHNVNEQTFPIWPVTSPRVVSNGNATYVNVNRVSRNLLE 366 Query: 356 ----KAKENFIKIVTISIN---------ASPNGQRLLKTCVSSP-------------EYH 389 A+ + T+ + GQ +LK ++ + Sbjct: 367 AMAAAARAEGTYVFTLGYGNELTKPAGPDNELGQDVLKCMANTADSLSRCYNPKQPVGVY 426 Query: 390 YNVVNADSLIHVFQNISQLMVHRKYS 415 L F ++ + + S Sbjct: 427 CYAATPADLKPCFSQLASQI--LRIS 450 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 75/227 (33%), Gaps = 43/227 (18%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P + L +DLS SM P+ A ++ A++ + F+ Sbjct: 91 EPQRDLMLGLDLSQSMDTKDFRAPDGNLEA---------RVDAVRKVVGDFVAR------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GLI + + E VR+ ++ + + T A+ A ++ Sbjct: 136 -RPGDRIGLIAFGDAPYPLAPFTMDHELVREIISGTLPGIAGPRTSLGDAVGLAIKMFEK 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K +I LTDG + K +K + AK N + + T+ Sbjct: 195 ------------------TTVPEKVLIVLTDGNDTASKMP--PLKAAEIAKRNGVVVHTV 234 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + L K ++ ++ + L + + + Sbjct: 235 GIGDPQATGEDRLDATALEKIAETTGGRYFFGGDQAQLAAAYDVLDK 281 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 70/225 (31%), Gaps = 49/225 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + QD K ++ A K+ F++ + + +GL+ ++ + Sbjct: 96 DISGSMLAQDLKPNRLEAAKDVAASFIN-------GRPNDNIGLVVFSAESFTQCPLTTD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + +I T + A + S K Sbjct: 149 HTVLLNLFKDIQSGMIQDGTAIGLGLANAVSRI------------------KDSHAKSKV 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 II LTDG NN + + + AK +++ TI + Sbjct: 191 IILLTDGSNNAGEI--APVTAAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPV 248 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + L + ++ ++ + SL ++ I M K SV Sbjct: 249 EIDEATLKQIASTTGGQYFRATDNASLKEIYSEI-DQMEKTKISV 292 >gi|296156498|ref|ZP_06839336.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295893097|gb|EFG72877.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 446 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 39/444 (8%), Positives = 117/444 (26%), Gaps = 49/444 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ + SII A+S+++ L ++G + + K +++A + A++A + ++ Sbjct: 8 RRRQRGSVSIIVAVSLIALLGILGLAVDSGLGYVIKARLDAATDGAVIAAGEAVTRGSNQ 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY--SAVFYNTEIQNIVNSSRISMTHM 132 + + F+ + + A + + + Sbjct: 68 TEQTNNAQQAATAFFTANYPAGFLGSSVAAGTPSIVFDAGTVTIGMTAQASVPVSFSKVL 127 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-YNQKIVSFIPALLRIEMGERPIF 191 L+ S+ + + + L + + L ++ + Sbjct: 128 GFKVLNVSSTSQAIR--KTLDMAFVIDTTGSLNTSGVPAAVRSNAVAFLNNFDVTNDRVA 185 Query: 192 LIELVVDLSGSMHCAMNSDPED--------VNSAPICQDKKRTKMAALKNALLLFLDSID 243 L+ + + N+ D + +N L + Sbjct: 186 LMHFAYGTVVDVPFSGNARGFDRTTMTADINKYTFNGSTNSAEAIWNARNQLNTVISQPS 245 Query: 244 LLSHVKEDVYMGLIGYTT------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L + +++ + + ++ + L T +P Sbjct: 246 SLRVIVFFSDGAPNSFSSFFTTNQSGCNKSAGTIASPDSTGTMSGLYNMNALNQTLGSPC 305 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN----VNTIKI 353 + L + + + + + K + + +G N + Sbjct: 306 FQSNVTKLVTAMPKWYNAHDVNE--KTFPIWPVTAPRAVPNGNVTYVNVNRASRNLLEAM 363 Query: 354 CDKAKENFIKIVTISIN---------ASPNGQRLLKTCVSSP-------------EYHYN 391 A++ + T+ + GQ +LK ++ + Sbjct: 364 ASAARDEGTYVFTLGYGPELVQPAGPDNELGQDVLKCMANTADSLSRCYNPKQPVGVYCY 423 Query: 392 VVNADSLIHVFQNISQLMVHRKYS 415 L F ++ + + S Sbjct: 424 AATPADLKPCFSQLASQI--LRIS 445 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 43/246 (17%), Positives = 80/246 (32%), Gaps = 50/246 (20%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + IE+VVD SGSM + ++ A+KN F++ + Sbjct: 79 VTTSEGIAIEMVVDRSGSMQAMDF----------KIDGEHVDRLTAIKNVAGKFVEGKEE 128 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAY 302 L + + +GL+ + + P+ + + + T A+ A Sbjct: 129 LE-GRFNDLVGLMTFAGYADGITPPTLDHPYLVSQLNNIQIVTNRSEDGTAIGDAISLAV 187 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L + + K II LTDGENN + I+ + A+ I Sbjct: 188 EKLNA------------LDARRDEKVKSKVIILLTDGENNAGEV--EPIQAAELAETLGI 233 Query: 363 KIVTISIN-----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K+ TI + + + L K + ++ + DSL Sbjct: 234 KVYTIGVGTKGEAPVPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHGKYFRATDTDSLE 293 Query: 400 HVFQNI 405 ++ I Sbjct: 294 KIYHEI 299 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 74/250 (29%), Gaps = 66/250 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + +D K ++ ALK + F+ Sbjct: 83 SVSKRTKTNRGIDIVMAIDVSASM---------------LARDLKPNRLEALKKVAVDFV 127 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D + + +G++ Y + V++ + R + T + Sbjct: 128 DR-------RPNDRIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGTAIGMGLG 180 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L K + K II LTDG NN + T + AKE Sbjct: 181 SRVNRLKDSKAK------------------SKVIILLTDGVNNAGNIDPTTAT--ELAKE 220 Query: 360 NFIKIVTISINASP------------------------NGQRLLKTCVSSPEYHYNVVNA 395 IK+ TI I + + L + ++ + Sbjct: 221 LGIKVYTIGIGTNGMADFPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKYFRATDN 280 Query: 396 DSLIHVFQNI 405 SL ++ I Sbjct: 281 TSLKEIYDEI 290 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 43/416 (10%), Positives = 120/416 (28%), Gaps = 42/416 (10%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + F +I A+ + SF+ ++ ++ + K ++ + S Sbjct: 13 RLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDH-----VIYRTSPK 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + R N + + + + + S S V + VNS + Sbjct: 68 NLYDLREAGRDNFIRHQIEKALNTYNSRDLSNTGSIESIVKDAVILTKNVNS-------L 120 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + + +Q + +S ++ + + Sbjct: 121 PLQFTVDIALSTTVQLRGSL---------LQMFSQSKGKVDISRRKKVM--YKQNIGLMI 169 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + D A D P + N + + K Sbjct: 170 MPFAWD---GYWLASRGKVADSKVHPPKYLEYSHYYQQYLNR-NTLVKNFLSQIPYKNFC 225 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST--PAMKQAYQILTSDKK 310 M Y++ + + + + TR+ T + + +TS++ Sbjct: 226 -MAPYHYSSILYWAVG-TLTYSVDNKTTTREYYKDPYYATWDHFPYSFIKNVFDMTSNQF 283 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENF---- 361 K+++ L G + S ++ C + Sbjct: 284 GDGQVLTNTNHCFPHGASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGR 343 Query: 362 --IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 344 DAITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 70/216 (32%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++S+ + QD ++ + + F+ + +GL+ + + + P+ Sbjct: 95 DLSSSMLAQDFSPSRFEVARRTAIQFVQ-------GRRADRIGLVVFAGQAFTQVPPTLD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + R + T A+ A L S K Sbjct: 148 YRFLLTMLQRLQVGRLEDGTAIGTAIATAINRL------------------KNSEARSKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS--------------IN-------A 371 II LTDG+NN + + + + A++ I+I TI Sbjct: 190 IILLTDGQNNRGEID--PLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPV 247 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + + ++ +A +L ++ I + Sbjct: 248 EIDEAMMREVAEKTGGRYFRATDARTLEAIYAEIDR 283 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 70/240 (29%), Gaps = 56/240 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 P + + + VD S SM + ++ +MA +K + F+D Sbjct: 95 STPGYDLMIAVDASHSMEALDFTVE----------GRQVNRMAVVKGVMGRFID------ 138 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + ++ + RQ + + S+ T A+ L Sbjct: 139 -ARQGDRVGLILFGSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAIALGVSKL- 196 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +I + DG+NN + A+ +I Sbjct: 197 -----------------RERPEGSRVMIVIADGDNNAGSF--APKEAARLARATGTRIYV 237 Query: 367 ISINASP-------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + + + L + + ++ + +L + I Q Sbjct: 238 IGVGSKQPSIPILEEGSVRYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEISSRIGQ 297 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 91.5 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 68/216 (31%), Gaps = 40/216 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D ++ K F+ + + +GL + + Sbjct: 95 DISASMLTEDVFPNRIEVAKEVASDFI-------SGRPNDNIGLTIFAGEAFTQCPMTVD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + LI T + + L K + Sbjct: 148 HAALLNLLHNVRTDLVVKGLIQDGTAIGMGLANSVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ-------RLLK 380 K II LTDG NN + + AK+ I+I TI + G L Sbjct: 192 --SKVIILLTDGSNNVGSIS--PMTAASIAKKYGIRIYTIGLGKESEGDLGAIDYKTLQN 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 VS+ Y + L ++Q+I + + K V Sbjct: 248 IAVSTNGEFYRAQSQAELSKIYQDIDK-LEKTKLRV 282 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 91.5 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 64/207 (30%), Gaps = 39/207 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +M K F+ + + +GL + + Sbjct: 95 DVSASMLTEDVYPNRMVVAKEVASEFI-------SGRPNDNIGLTIFAGEAFTQCPMTLD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + L+ T + A L K + Sbjct: 148 HAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLK 380 K +I LTDG NN + + A++ I++ TI + + L Sbjct: 192 --SKIVILLTDGSNNAGSIS--PMTAAAIARKFGIRVYTIGFGKETGEEIGAIDYKTLQD 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ 407 VS+ Y + L ++Q+I + Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 91.5 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 75/233 (32%), Gaps = 55/233 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L VDLSGSM A + + ++ +K+ L F++ +E Sbjct: 88 IMLAVDLSGSMEIADM----------QLEGRSVNRLTMVKHVLSDFIER-------REGD 130 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ V+Q + + L+ + T A+ + + Sbjct: 131 RLGLILFADTAYLQTPMTYDRNTVKQMLNESVLGLVGERTAIGDAIALSVKRF------- 183 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 + ++ LTDG+N + + A+ + I I++ A Sbjct: 184 -----------RDDEKSNRVLVLLTDGQNTAGNLP--PEQALELAQAYDVTIYPIAVGAE 230 Query: 372 -----------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + ++ + + L ++Q + + Sbjct: 231 EVVVDSFFGQRRVNPSRDLDVPLMQSIAKQTGGKYFRARSTNELEEIYQRLDK 283 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 77/235 (32%), Gaps = 26/235 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + +S KK +++ A K L F+ + ++ Sbjct: 99 DVMVVVDLSGSMAEQDFVSKQSSDSGT---VKKISRLEATKEVLADFVKT-------RKG 148 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ + Sbjct: 149 DRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSSED 208 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + K I LTDG + I A ++I I++ Sbjct: 209 KASAEENAKPRE-------KVAIVLTDGNDTGSYV--EPIDAAKVAAAKGVRIHMIAMGD 259 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 260 PRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGELEPQLYESTTYR 314 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 72/232 (31%), Gaps = 56/232 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D +++ K F+++ + D +GL + + Sbjct: 95 DVSTSMLTPDLPPSRIETAKQVAYEFINN-------RPDDNIGLTVFGGEAYTQCPLTTD 147 Query: 273 TEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + +I T + A L K + Sbjct: 148 HSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHLEQSKSK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------- 373 K II LTDGENN + + + + AK I+I TIS+ Sbjct: 192 --SKVIILLTDGENNAGEIS--PLTAAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGET 247 Query: 374 ---------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ-LMVHRKYS 415 + + L S+ Y + L ++QNI + K S Sbjct: 248 YEAAIKQNTDPKTLEAIANSTGGKFYQARSKAKLRDIYQNIDRLEKTKLKVS 299 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 29/246 (11%), Positives = 69/246 (28%), Gaps = 36/246 (14%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 Q I + A R + V+D+SGSM TK Sbjct: 234 NPEQVIGRPVNATSVGYPTARAPVDLVTVLDVSGSMA--------------------GTK 273 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDS 286 +A LK A+ + + + +I +++ + + +Q + Sbjct: 274 LALLKRAMGFVIQHLGPSD------RLSVIAFSSTARRLFHLQRMSHSGRQQALQTVNSL 327 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T+ A+K+A +++ ++ + + I E ++ Sbjct: 328 VASGGTNIADALKKAAKVIEDRSHQNPVCSIILLSDGQDTYNIPSNIRGAR-PEYSSLVP 386 Query: 347 NVNTIKICDKAKENF-IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN- 404 + + + + + L +S + + + F Sbjct: 387 SSILN------RTFGLVPVHGFGFGVDHDSDALHSIAEASGGTFSFIEDEGVIQDAFAQC 440 Query: 405 ISQLMV 410 I L+ Sbjct: 441 IGGLLS 446 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 75/239 (31%), Gaps = 43/239 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 +P + + +DLS SM DP ++ A++ + F+ Sbjct: 91 QPARDLLIALDLSQSMDTRDFRDP---------AGALIPRVQAVRQVVSGFVAR------ 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GLI + + V+ + + + T A+ ++ Sbjct: 136 -RPGDRIGLIVFGDAPYPLAPFTLDHRLVQTLIADLLPGMAGPSTALGDAIGLGIKMFEH 194 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K +I LTDG + + + AKE + + TI Sbjct: 195 SEA------------------PEKVLIVLTDGNDTASRMP--PERAGGIAKERKVVVHTI 234 Query: 368 SINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + L K + ++ + L ++ + Q+ H + ++ + Sbjct: 235 GIGDPNASGEEKVDLDVLQKLAAQTGGRYFFGADQAGLETIYATLDQITPHNQKTLSWR 293 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 32/298 (10%), Positives = 75/298 (25%), Gaps = 22/298 (7%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ ++ +I FAL + +L+ ++ V ++ M+ A +AA L A Sbjct: 18 FARRLRRDDRGAIAIQFALLALPLSILLFGVLDVSRLSLQRRQMQDALDAATLMAARSAA 77 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + L ++ + S + I Sbjct: 78 TASADLDTTGDAAFLAEIAGMNLGLTA-------SSSTFSVGTGNRVIGTATATLKPIIA 130 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + L + R V+ + + + Sbjct: 131 NLWQAGDFTVTATSEVVRSSKNLEVALVLDITGSMSGTRIADLKVAASDLVDIVIRDTQT 190 Query: 190 -----IFLIELVVDLSGSMHCAMNSDPEDVNSAPI-CQDKKRTKMAALKNALLLFLDSID 243 + L+ ++ + M P V + T + + AL L S + Sbjct: 191 PFYSKVALVPYAAGVNVDTYADMARGPIPVRNISNVAWLATGTSIRGVTKALPALLWSDN 250 Query: 244 LLSHVKEDVYMGLI---------GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + V++ I + + TR + T Sbjct: 251 HGLVTGDRVFISGISGGVLTSMASLNGAIYTVTRLDSNVVYLNGIDTRLKSNSPSGGT 308 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 60/239 (25%), Gaps = 39/239 (16%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + + A +RT A +A + + ++ K Sbjct: 397 NWGTNEVRALSTCVSERTGADAYTDAAPS---TAFVGTNYPSTSADSYSPNPCPSAKITP 453 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 S ++ + + T + + ++ + + + ++ S Sbjct: 454 LSSDKTALKAQINNYS---VGGSTAGQIGLAWGWYMVAPNFGYIWPSA-SQRPAAYKSKD 509 Query: 329 FQKFIIFLTDG----------------------------ENNNFKSNVNTIKICDKAKE- 359 K +I +TDG N +C K Sbjct: 510 LMKVVIMMTDGAFNTPYCNGVIAANAGIGSGSDEDHINCNATNGDPFAQARALCTVIKNS 569 Query: 360 -NFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 N I + T+ + + L C S + L F I++ + + + Sbjct: 570 ANDITLYTVGFAVGSDYTAKTFLTDCASDSSKAFFPATGSELKASFTAIAREISSLRIA 628 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 76/247 (30%), Gaps = 65/247 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM + +D + ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSGSM---------------LAEDLQPNRLEAAKDVAAKFIN-------DRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + +I T + A L + + Sbjct: 127 DNIGITLFAGESFTQCPLTIDHTALLNLFGNIQTGVIEDGTAIGMGVSNAVARLKDSQAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 K II LTDG NN +++ + + AK I++ T+ I Sbjct: 187 ------------------SKVIILLTDGSNNAG--DISPLTSAEIAKTYGIRVYTVGIGT 226 Query: 371 ---------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L ++ ++ + SL ++Q I + Sbjct: 227 RGTAPYPIQTMTGAIQRIQVEVDIDEPTLKDIARTTGGVYFRATDNTSLQDIYQEI-DQL 285 Query: 410 VHRKYSV 416 K +V Sbjct: 286 EKTKLNV 292 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 73/231 (31%), Gaps = 55/231 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM ++ ++ K+ L F+ + Sbjct: 90 MMLAVDLSGSMKIDDMELN----------GRQVNRLTMTKSVLYDFIQR-------RVGD 132 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ + V ++ + L+ + T A+ A + ++ + Sbjct: 133 RLGLILFADTAYLQAPLTYDRDTVSTLLSESVIGLVGEQTAIGDAIGLAVKRFDEKEESN 192 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I LTDG+N + + + A +K+ TI + A Sbjct: 193 ------------------NVLILLTDGQNTAG--FITPEQAKELAVNKKVKVYTIGVGAD 232 Query: 373 P------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L S+ ++ + + L ++ + Sbjct: 233 KMLIQSFFGSRQVNPSQELDEDMLSDLASSTGGQYFRARDVNELEAIYAKL 283 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + + LI T + A L K + Sbjct: 127 DNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 39/352 (11%), Positives = 96/352 (27%), Gaps = 42/352 (11%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 +++AL ++ + L+G I + + + M+ A +AA+LAG M + + G R E+ Sbjct: 1 MLWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNGIVDDGVRAEATK 60 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 S+S + I M ++ L S + Sbjct: 61 FFNFNFQQGMFGSKP---FTPSISSATTSKTTVVINAATTVPTSLMRIFGSDELPVSVSC 117 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS---------FIPALLRIEMGERPIFL-- 192 V T + L + ++ + + + + Sbjct: 118 NASQDFVNTDIVFVLDTTGSMRDKATSSDSQTKIEALRSAVLALYDQLAPVQNQLAASGM 177 Query: 193 ------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + ++ + ++ + Q + + + S Sbjct: 178 RLRYGVVPYASAVNIGAAIRAANPNYMLSGSWTYQS---------RQVVTENMTSWSCGY 228 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + I + + ++ T K + D+ L+ T AY +T Sbjct: 229 RSGSYDFGSGIC---TYFRYLPRAFDTSKYVSGASVDVAELV--GTA-------AYNGVT 276 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + + + + + G + ++ K K Sbjct: 277 PTRSSANKVTWAGCVEERQTARVSSNDTAIPPG-ATDLDIDLIPTNDATKWK 327 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/288 (12%), Positives = 73/288 (25%), Gaps = 61/288 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK---KRTKMAALKNAL 235 + + + +G +++ + A A Sbjct: 252 TSKYVSGASVDVAELVGTAAYNGVTPTRSSANKVTWAGCVEERQTARVSSNDTAIPPGAT 311 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-----------------WGTEKVRQ 278 L +D I K Y + Y + + + W + +++ Sbjct: 312 DLDIDLIPTNDATKWKPYWPEVEYASYQVEIYKNDPYKPQFACPSPAASMQGWSRDDLKK 371 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 Y+ T M + S K +P +K+IIF+TD Sbjct: 372 YLNTLT---PDGGTYHDNGMMWGAR-----WASSGGIFGGNNPEKYNMMPVKKYIIFMTD 423 Query: 339 GENNNFKSNVNTIK-------------------------------ICDKAKENFIKIVTI 367 G S + + +C KAK I + Sbjct: 424 GLFETGYSRLYSSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLLCSKAKSMGYSIWVL 483 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + L C S+P N +L+ F I + + + + Sbjct: 484 GFATTLDAS--LTNCASTPSQASTSSNQAALMARFVEIGKNIGALRLT 529 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 72/212 (33%), Gaps = 46/212 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + +D K ++ K L F+D + + + L+ + + ++ Sbjct: 89 DLSQSMLQRDFKPNRLETAKKLLEEFIDK-------RINDRISLVVFGGDAYTKVPLTFD 141 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 V+ ++ I T + + L + + Sbjct: 142 HNVVKDITSKLTTDDITSNNRTAIGMGLGVSLNRLKDSEAK------------------S 183 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------SPN------- 374 K II +TDGENN+ + + + + AKE IKI TI I A Sbjct: 184 KVIILMTDGENNSGEMS--PMGASEIAKELGIKIYTIGIGAREIQIRVPFGHTTVKNTEL 241 Query: 375 GQRLLKTCVSS-PEYHYNVVNADSLIHVFQNI 405 + LLK S+ ++ + +F I Sbjct: 242 DENLLKNIASTTGGEYFRAGSEKEFQEIFNRI 273 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 71/228 (31%), Gaps = 43/228 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + L +DLS SM P V ++ A++ + F+ Sbjct: 90 TQPVRDLLLALDLSQSMDTRDFKTPSGVLEP---------RVDAVRQVVADFVAR----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + VR+ + + + T A+ ++ Sbjct: 136 --RTGDRIGLIVFGDAPYPLAPFTLDHALVRELLADMVPGMAGASTSLGDAIGLGIKMFD 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +K +I LTDG + + + D AK + + T Sbjct: 194 QS------------------HAQEKVMILLTDGNDTASRMP--PAQAADIAKTRGVVVHT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I + L ++ ++ + SL ++ + + Sbjct: 234 VGIGDPATTGEQKVDLDALKHIASTTGGRYFFGADQTSLASIYATLDR 281 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 76/232 (32%), Gaps = 53/232 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + + ++ +K L F++ ++ Sbjct: 88 DIMLAVDLSGSMTEQDMAYN----------GQYVDRLTMVKAVLSDFIEQ-------RQG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + ++ L+ + T A+ + + S K+ Sbjct: 131 DRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFASKKES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG+N N A+E IK+ TI + + Sbjct: 191 N------------------RIVVLLTDGQNTAGNLN--PEDALLLAREEGIKVYTIGVGS 230 Query: 372 SP----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + ++ + L ++ + + Sbjct: 231 DNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGGLYFRAKDVAGLQQIYAELDK 282 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 33/241 (13%), Positives = 76/241 (31%), Gaps = 43/241 (17%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ + + VDLSGSM + P V ++ A+K L + Sbjct: 88 QQKSGRDLMIAVDLSGSMEARDFTLPSGVT---------VDRLDAVKQVLKELAAN---- 134 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E +GLI + + + +Q + + T A+ + ++ Sbjct: 135 ---RESDRLGLIVFGAAAYLQTPFTDDHQVWQQLLDETEIGMAGPSTVFGDAIGLSIKLF 191 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + + +I LTDG + + A N ++I Sbjct: 192 SDSDSDN------------------RVLIMLTDGNDTGSTVP--PVDAAKVAAANGVRIY 231 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 TI+I + + + + + ++ D + + I +L ++ Sbjct: 232 TIAIGDPATVGEDALDMDTITRVSKIADGRTFRALDQDEMRQAYITIGELEPELYETISF 291 Query: 419 K 419 + Sbjct: 292 R 292 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 75/245 (30%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + LV+D SGSM ++ A K+A ++ +D Sbjct: 83 QTKEGVNVVLVIDDSGSMQATDY---------------SPNRLEATKSAAEELINDLDPK 127 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +G++ + + S + V + + T + + Sbjct: 128 DY------VGIVVFESGASTASYLSPDKDSVIENLENI--MEKDGATAIGDGLSLGINM- 178 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +K +I L+DG NN + + AK++ I++ Sbjct: 179 -----------------ADSIPNRKKVVILLSDGVNNAGVIS--PDEAIQFAKDSDIQVF 219 Query: 366 TISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI + + + L + + ++ V+ +L ++ NI+ Sbjct: 220 TIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEIYSNINSE 279 Query: 409 MVHRK 413 + K Sbjct: 280 IKREK 284 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 61/177 (34%), Gaps = 38/177 (21%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GLI + ++ + ++Q V L T ++ A + Sbjct: 127 RQGDRLGLILFADHAYLQTPLTFDLKTIQQMVDESEIGLAGTRTAIGESIAMAIKRF--- 183 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + Q+ +I ++DG NN+ I+ +A +N I I TI Sbjct: 184 ---------------VENKNEQRVLILVSDGANNSGSI--EPIQAAKQAAKNNITIYTIG 226 Query: 369 IN------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + + L + + ++ N L +++Q +++ Sbjct: 227 MGAEQMIKRGLFGNQRINPSADLDEKTLTEIANLTGGKYFRARNQTELQNIYQTLNK 283 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 76/234 (32%), Gaps = 55/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVD+S SM + Q + ++ A+K L F+ ++ Sbjct: 90 DLMLVVDISPSMDEQDM----------VLQGRSINRLQAVKRVLDDFI-------SRRQG 132 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T ++ E VR + + + T A+ A + L Sbjct: 133 DRLGLILFGTEPYVQAPLTFDLETVRTLMREAGLGMAGRATAIGDAVGLATKRL------ 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 Q+ ++ LTDG N + K + A I++ TI I A Sbjct: 187 ------------RNRPQDQRVVVLLTDGANTAGEIT--PDKATEIAAAASIRLYTIGIGA 232 Query: 372 ------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ + L ++++I + Sbjct: 233 ESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRARSLPELELIYESIDR 286 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 79/227 (34%), Gaps = 38/227 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFS----------LDGETVSRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAA 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 P K ++ L+DG NN + + AK+ +++ TI++ Sbjct: 203 ADGEG-------PPPARDKVVVLLSDGANNAGQ--TAPKDVAALAKDLGVRVYTIALGPI 253 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L +S + V D L +V I + Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 78/226 (34%), Gaps = 43/226 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E + L +D+SGSM + P+ ++ ++A +++ + F++ Sbjct: 90 ETAARDVVLAIDISGSMDSRDFTAPD---------GTRKQRLAGVRDVVRGFVE------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ M LI + + + + + + R + T ++ A + Sbjct: 135 -GRDGDRMALIVFGSAAYLQAPLTADLDAILALLDRTQVGMAGPHTALGDSIGLAIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + Q+ +I L+DG + + + + + A ++I T Sbjct: 194 SSEID------------------QRLLILLSDGSDTASRMD--PVNAAEIAAGRDVEIFT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + L + ++ + +L V+++I Sbjct: 234 VGVGDPDATGENRVDLDTLRAIADRTGGAYFFAADEAALTAVYESI 279 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 16/172 (9%), Positives = 50/172 (29%), Gaps = 11/172 (6%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 +K + N++++ L+++ + + I + ++ + +A +AA +A A ++V+ Sbjct: 2 IRKFANDRRGNYALMTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIVA 61 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + D + +L + ++ + + Sbjct: 62 GATDAEAIAYAKNFFEANLAHIDPAN---TTLAVTLPNNNTGGGTLKLCGTLTYKPYFLP 118 Query: 131 HMANNRLDSSNNTIF--------YNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 +S N + L + V Sbjct: 119 TAKILAGGTSGNATTMAFNTCSEVRLKNTLEVSLVLDNSGSMKELGKGSNKV 170 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 42/177 (23%), Gaps = 16/177 (9%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 ++ PA DL+ + ++ + R + Sbjct: 329 INDTPAATGTPATLFVPMFAPDETDLT-DSNSRPANNNWRTDVTSNSSSAIRQRF----- 382 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILKP 291 + + + + + + T Sbjct: 383 -MPKYFADPGSTTVTPSYGMDAGPNTSCSTTPIKPLTDVSTTAGASAVKTAIDAMAADGA 441 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ M ++ L+S T F +G K +I LTDG N + + Sbjct: 442 TNVPEGMAWGWRTLSS-------TAPFTEGRPETERGNDKVLIVLTDGANTYYTPDS 491 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 37/313 (11%), Positives = 77/313 (24%), Gaps = 34/313 (10%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + T+F M D N + A+ + M + Sbjct: 332 TPAATGTPATLFVPMFAPDETDLTDSNSRPANNNWRTDVTSNSSSAIRQRFMPKYFADPG 391 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V S M N+ P+ +A+K A+ + + + Sbjct: 392 STTVTPSYGMDAGPNTSCSTTPIKPLTDVSTTAGASAVKTAIDAMAAD-GATNVPEGMAW 450 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +T P + + + DS A + Sbjct: 451 GWRTLSSTAPFTEGRPETERGNDKVLIVLTDGANTYYTPDSVIAQTYSGTNYNYGANDLA 510 Query: 314 FTNFFRQGVKIPSLPFQKFII-----------FLTDGENNNFKS--NVNTIKICDKAKEN 360 + + + +D N N+ N + +C+ AK Sbjct: 511 GNKAIYSALGYVTPYSNGYSYGRMFLGTSSSVIKSDYSNANYTKAMNEHFTTLCNNAKAA 570 Query: 361 FIKIVTISINASPNGQ------RLLKTCVSSPEYH-------------YNVVNADSLIHV 401 + ++TI+++ LK C SS + +L + Sbjct: 571 NVMVMTIALDLDATNTAEKTQMDALKAC-SSDSRFSKDPTDPSKPMKLFWNSTGATLSND 629 Query: 402 FQNISQLMVHRKY 414 F+ I + + + Sbjct: 630 FKAIGNELSNLRI 642 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 79/212 (37%), Gaps = 42/212 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + +D + +++ A K A F+ ++ V +GL+ +++ P+ Sbjct: 100 DTSRSMRAEDLEPSRLEAAKAAAREFIRAMPPG------VEVGLVAFSSYATLLQPPTTD 153 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E++ Q V + T + A ++L ++ P Sbjct: 154 RERLEQAVDLLDLAHR---TAIGDGLVAALRVL---------------PLEDSDAPGGMS 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------ASPNGQR 377 ++ L+DG NN ++ +A+ +++ T+ + A + + Sbjct: 196 VVLLSDGRNNYGID---PLEAARQAEAQGVRVYTVGVGLSENTYVFANGYYIRAGLDEET 252 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + + +Y +AD L V+Q +++ + Sbjct: 253 LQEIAALTGGAYYRASSADELRAVYQTLARAV 284 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 90.7 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 79/267 (29%), Gaps = 45/267 (16%) Query: 162 EHLLNQRYNQKIVSFIPA--LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 ++ + +V+ + R + + VDLSGSM Sbjct: 73 GVMVWIAWGLLVVAAARPEWVGEPIEINRSARDLMVAVDLSGSMEAQDF---------TT 123 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 Q +K ++ A+K L F ++ +GLI + + + + Sbjct: 124 EQGEKIDRLTAVKQVLTEFSQR-------RDGDRLGLIVFGSAAYLQAPFTADKDTWLTL 176 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + ++ T A+ + + +I LTDG Sbjct: 177 LQETEIAMAGASTSIGDAIGLSISTFEHS------------------DTDNRVLIVLTDG 218 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNV 392 + + + A +KI TI+I + L + + ++ Sbjct: 219 NDTGSRVP--PVDAARVANARDVKIYTIAIGDPETIGEDAMDVDTLKQVSDITGGAYFEA 276 Query: 393 VNADSLIHVFQNISQLMVHRKYSVILK 419 ++ +L + +I L S+ + Sbjct: 277 LDRQALERAYLDIESLEPALYQSLSYR 303 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 67/209 (32%), Gaps = 35/209 (16%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + +++A+K L FL +E +GL+ + S + Sbjct: 108 DFTNQAGVQVDRLSAVKEVLGEFLQR-------REGDRVGLVVFGDAAYLQAPFSTDLQL 160 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R+ + + T A+ + + + + K II Sbjct: 161 SRRLLDECEVGMAGPRTAFGDAIGLGVNLFSESEAPA------------------KTIIA 202 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEY 388 LTDG + + ++ A I+I T++I + Q L + + Sbjct: 203 LTDGNDTKSQVPS--VEAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGS 260 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ + SL ++ + + RK ++ Sbjct: 261 YFFAADRASLAGIYDQL-DEIETRKIKMV 288 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 77/240 (32%), Gaps = 43/240 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L +DLSGSM D +++ +++A K L F Sbjct: 90 EKSARDLMLALDLSGSMDARDFRDQN---------GQQQNRLSAAKQVLKDFAAQ----- 135 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GLI + + + ++ T A+ A + Sbjct: 136 --RDGDRLGLIVFGNAAYLQAPFTDDIATWLTLLEESEVAMAGPSTALGDAIGLAISLFQ 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + ++ LTDG + + + A N + I T Sbjct: 194 AS------------------ETRNRVLVVLTDGNDTGSRVP--PVDAASIAAANDVTIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 +++ + + L S+ + ++ +L ++ I++L + S+ + Sbjct: 234 VAVGDPSTIGEEALDLETLDAIASSTRGASFLALDTLALKDAYEQINRLEPAQYDSLSYR 293 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 67/200 (33%), Gaps = 42/200 (21%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ A++ + F++ + ++GL+ + + + + ++ + Sbjct: 106 RPVSRYEAMQAVISEFVE-------DRRGDFIGLVVFGSFADVQAPLTPDLNAIQSLLAD 158 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + A + L S + ++ L+DGENN Sbjct: 159 LRPGMADSRTAIGDGLALAVRQL------------------RESTTEDRVVVLLSDGENN 200 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISIN---------------ASPNGQRLLKTCVSSPE 387 + + + A I++ TI +S + Q L + + Sbjct: 201 SGEI--RPDEATAVAAAENIRVYTIGFGSAGRDSLLQSFGLRSSSLDEQTLREIAEQTQG 258 Query: 388 YHYNVVNADSLIHVFQNISQ 407 +Y ++ L VF++I + Sbjct: 259 RYYRATSSAELAEVFRDIER 278 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 79/227 (34%), Gaps = 38/227 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFS----------LDGETVSRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAA 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 P K ++ L+DG NN + + AK+ +++ TI++ Sbjct: 203 ADGAG-------PPPSRDKVVVLLSDGANNAGQ--TAPKDVAALAKDLGVRVYTIALGPI 253 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L +S + V D L +V I + Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 18/196 (9%), Positives = 63/196 (32%), Gaps = 30/196 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRD 283 TK+A LK A+ + ++ + +I +++ + + ++ RQ + Sbjct: 257 GTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAV 310 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T+ +++ +++ +R+ + + T+ + + Sbjct: 311 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYT--------TNHPDPS 362 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +K + I + + + + + S + + + Sbjct: 363 YKVMLPQ-----------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDA-- 409 Query: 404 NISQLMVHRKYSVILK 419 ++Q + SV ++ Sbjct: 410 -LAQCIGGL-LSVAVQ 423 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 18/196 (9%), Positives = 63/196 (32%), Gaps = 30/196 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRD 283 TK+A LK A+ + ++ + +I +++ + + ++ RQ + Sbjct: 231 GTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAV 284 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T+ +++ +++ +R+ + + T+ + + Sbjct: 285 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYT--------TNHPDPS 336 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +K + I + + + + + S + + + Sbjct: 337 YKVMLPQ-----------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDA-- 383 Query: 404 NISQLMVHRKYSVILK 419 ++Q + SV ++ Sbjct: 384 -LAQCIGGL-LSVAVQ 397 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 18/196 (9%), Positives = 63/196 (32%), Gaps = 30/196 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRD 283 TK+A LK A+ + ++ + +I +++ + + ++ RQ + Sbjct: 256 GTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAV 309 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T+ +++ +++ +R+ + + T+ + + Sbjct: 310 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYT--------TNHPDPS 361 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +K + I + + + + + S + + + Sbjct: 362 YKVMLPQ-----------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDA-- 408 Query: 404 NISQLMVHRKYSVILK 419 ++Q + SV ++ Sbjct: 409 -LAQCIGGL-LSVAVQ 422 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 90.3 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 79/259 (30%), Gaps = 52/259 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR------------------TK 227 E+ + + VDLSGSM + P N + Sbjct: 89 QEKSARDLMIAVDLSGSMAVEDFTLPIATNELTNRAKNDTDSSATKSSTNDTGKGEKVNR 148 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A+K+ L F+ +E +GLI + + + + + Sbjct: 149 LVAVKHVLNAFVK-------SREHDRLGLILFGDAPYLQAPFTDDIATWQALLNESDIGM 201 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T A+ A + + +I LTDG + K Sbjct: 202 AGQSTAFGDAIGLAISVFQQS------------------DTQNRVLIVLTDGNDTASKVP 243 Query: 348 VNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ A IKI TI+I + + L + + +N++ L+ Sbjct: 244 --PVEAAKVAAARDIKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQGKSFQALNSEELLK 301 Query: 401 VFQNISQLMVHRKYSVILK 419 V+ I +L + S+ + Sbjct: 302 VYAEIDRLEPQQFDSLSFR 320 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 77/266 (28%), Gaps = 69/266 (25%) Query: 169 YNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ P I L VD+S SM + +D K + Sbjct: 65 IMVILILARPQTTDNWQNTEIEGIDIMLAVDVSTSM---------------LAEDLKPNR 109 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---- 283 + A K F++ + + +GL + + + Sbjct: 110 LEAAKQVASEFIN-------GRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDI 162 Query: 284 -MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI T + A L K + K II LTDG NN Sbjct: 163 AQRGLIEDGTAIGMGIANAVTRLKDSKAK------------------SKVIILLTDGSNN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKT 381 +++ + + AK+ I+I TI + + + + L + Sbjct: 205 RG--DISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEI 262 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ ++ + L V+Q I + Sbjct: 263 AGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|170744425|ref|YP_001773080.1| hypothetical protein M446_6382 [Methylobacterium sp. 4-46] gi|168198699|gb|ACA20646.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 482 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 44/481 (9%), Positives = 116/481 (24%), Gaps = 105/481 (21%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI----------LAGASKM 68 N S+I A S++ + +IG + K ++ A +AA +A ++ Sbjct: 20 SGNVSLIVAFSLIPLIGMIGLGVDYGMAVGAKTKLDHAADAAALAAVVTAKAYVAANARN 79 Query: 69 VS--NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + +++ + + + A A F + I+ + SG + T S Sbjct: 80 WNVWDIAVAEGQARAANAFAVNAGSVPFTHFALDPIQLTRSGQTFEATVT---YTATVSN 136 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 S + + + + ++ Sbjct: 137 NFGPLFGIRTTAVSGRAVATTSVPSYLDFYLLIDVSGSMGLPSTTD-------GQAQLAA 189 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R F + H + + + A+ A+ + Sbjct: 190 LNRIDFFKVYQQGCQFACHFPGFVG-----YDLAVFNNIQLRSGAVNVAVCNLIKRAA-Q 243 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---------------------QYVTRDM 284 V +GL + T++ + + T + + + Sbjct: 244 PEVANQYRVGLYPFITQMGTLQDLTADTSALNLKAGCAASNPMVFTQLLDTGATQLDSNG 303 Query: 285 DSLIL---KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG---VKIPSLPFQKFIIFLTD 338 D T ++ + ++ T + + + ++ + Sbjct: 304 DPSTGVGSGGTHFETSLTSMLATIKANGYGDGSTQIKPKPFVFLITDGMQNNQWYSIQIN 363 Query: 339 GENNNFKSNVNTIKI-------------------CDKAKENFIKIVTI------------ 367 G+ S CD ++ + + + Sbjct: 364 GKRYYSGSPSKFSAYPDANWSPGGSDPVPMDWGYCDTLRQAGVTVSVLLIPYIKIDFTYV 423 Query: 368 ------------SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI---SQLMVHR 412 ++ + + C S P Y +++ + + + H Sbjct: 424 KSDIADENNKVNGFSSGL--PDVARQCAS-PGYFQMADTPEAIDRALDAMFMKATQVAHL 480 Query: 413 K 413 K Sbjct: 481 K 481 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 43/416 (10%), Positives = 119/416 (28%), Gaps = 42/416 (10%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + F +I A+ + SF+ ++ ++ + K ++ + S Sbjct: 13 RLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDH-----VIYRTSPK 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + R N + + + + S S V + VNS + Sbjct: 68 NLYDLREAGRDNFIRHQIEKALNTYNSRDLSNIGSIESIVKDAVILTKNVNS-------L 120 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + + +Q + +S ++ + + Sbjct: 121 PLQFTVDIALSTTVQLRGSL---------LQMFSQSKGKVDISRRKKVM--YKQNIGLMI 169 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + D A D P + N + + K Sbjct: 170 MPFAWD---GYWLASRGKVADSKVHPPKYLEYSHYYQQYLNR-NTLVKNFLSQIPYKNFC 225 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST--PAMKQAYQILTSDKK 310 M Y++ + + + + TR+ T + + +TS++ Sbjct: 226 -MAPYHYSSILYWAVG-TLTYSVDNKTTTREYYKDPYYATWDHFPYSFIKNVFDMTSNQF 283 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENF---- 361 K+++ L G + S ++ C + Sbjct: 284 GDGQVLTNTNHCFPHGASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGR 343 Query: 362 --IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 344 DAITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM K ++ A K+ + F+++ + + Sbjct: 89 DIMLAMDVSGSMQAMDF---------------KPNRLEAAKDVAISFINN-------RPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V ++ T + A L K + Sbjct: 127 DNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + +I LTDG NN ++ D A+ I++ T+ + Sbjct: 187 ------------------SRVVILLTDGSNNMG--DITPRMAADIARTFGIRVYTVGVGT 226 Query: 371 --------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L S ++ V+ ++L +++ I + Sbjct: 227 RGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDK 283 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 77/249 (30%), Gaps = 59/249 (23%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I L VD+SGSM + ++ ++ +K + F+ Sbjct: 83 DSSGIDIVLAVDVSGSMEALDFTIN----------NEPANRVDVVKKVVFRFIGE----- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + D +GL+ + R + + + + + ++ T A+ + L Sbjct: 128 --RPDDRIGLVAFAGRPYMVSPLTLDHDWLGRRLQTIHPGMVEDGTAIGSAIGSSINRL- 184 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDG NN K + + A+ IKI T Sbjct: 185 -----------------RDQKAKSKVVILLTDGMNNAGKI--LPVTAAEAAETLGIKIYT 225 Query: 367 IS--------------IN--------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + L K + +Y + DSL ++ Sbjct: 226 IGAGSRGEVPVPITDKFGNQKIVRAKVDIDEATLEKVAQMTGAKYYRATDTDSLKKIYSE 285 Query: 405 ISQLMVHRK 413 I++L ++ Sbjct: 286 INKLETTKR 294 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 79/227 (34%), Gaps = 38/227 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFS----------LDGETVSRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L + + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQVAA 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 P K ++ L+DG NN + + AK+ +++ TI++ Sbjct: 203 ADGEG-------PPPARDKVVVLLSDGANNAGQ--TAPKDVAALAKDLGVRVYTIALGPI 253 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L +S + V D L +V I + Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 90.3 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 22/244 (9%), Positives = 67/244 (27%), Gaps = 17/244 (6%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 +V ++ E + ++ S + TK+ Sbjct: 320 NFTVLVHVKAPSASEASKKQNYEDCEGNMVKDPGCRAPIDLVTVLDVSGSM----SGTKL 375 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SL 287 A LK A+ + ++ + ++ +++ ++ T ++ R ++ L Sbjct: 376 ALLKRAMAFVISNLSPED------RLSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLL 429 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ +++ ++L ++R+ + + + L + + Sbjct: 430 CTGGTNIAEGLRKGAKVLEDRRQRNPVASIMLLSDGQDTYSLSSRGVVLFPSDEQRRSAR 489 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-IS 406 +T I + + + S + + F I Sbjct: 490 QSTRYG-----HVQIPVHAFGFGVDHDAATMHAISEVSGGTFSFIQAESLVQDAFAQCIG 544 Query: 407 QLMV 410 L+ Sbjct: 545 GLLS 548 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 89.9 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 39/392 (9%), Positives = 99/392 (25%), Gaps = 92/392 (23%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + + + + S + E NI M N+ ++ Sbjct: 35 NNLTSQFELKQNINSNNYTVDESFNINYDISSLMGIDKNDYDAVNSKDK--------EIV 86 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS------------- 202 + + ++ K I E V +G Sbjct: 87 LVVDTSGSMNDRNVEGKYK--INYSAEPIFSEEEYRWYRRVSLGNGQKYIIHELIDTPER 144 Query: 203 --------------MHCAMNSDPEDVNSAP-ICQDKKRTKMAALKNALLLFLDSIDLLSH 247 ++ D + K+ L+ A F++ + Sbjct: 145 EIKIGKETYNEYIEINKKRYYLKYDYSDGNWYAIAYSEPKIDELQKAAKNFVNKFE---- 200 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +K + +GL+ Y ++ + +++ + I T+ ++ A IL + Sbjct: 201 IKANTKIGLVSYGNNGKEVHSLTNELDRINSSID--SGLSIGGGTNVGDGIRMANGILNN 258 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFI---------------------------------- 333 + G+ + + Sbjct: 259 GSDADKYIVLMTDGMPTAATCYNDIYYKNNRFYSKYGEILNGNGNPLSYFNYIGNYKYKF 318 Query: 334 -------------IFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 I + G+N+ + + + +A E+ + I + N ++L Sbjct: 319 EYNPNDYTHEDEKIIMNYGDNDYGNIALNYSKEALKRASESGVNNFVIGFSNGINREKLN 378 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 Y+ ++ D L V++ I+ + + Sbjct: 379 GIATEGNGYYREAMHGDELTDVYKRIADEINN 410 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 65/200 (32%), Gaps = 45/200 (22%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ A+K L F+ +E +GLI + + E V Q + + + Sbjct: 23 DRLTAVKQVLSDFIAK-------REGDRIGLILFADHAYLQTPLTLDRETVTQQLNQAVL 75 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 LI T + A +I Q+ II L+DG N Sbjct: 76 KLIGTQTAIGEGIGLATKIFIDS------------------DAPQRVIILLSDGSNTAGV 117 Query: 346 SNVNTIKICDKAKENFIKIVTISINA------------------SPNGQRLLKTCVSSPE 387 + ++ + AK+ I T+ + A + + L ++ Sbjct: 118 LD--PLEAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEKTLQTIASTTGG 175 Query: 388 YHYNVVNADSLIHVFQNISQ 407 ++ N L +++ I+Q Sbjct: 176 QYFRARNQQDLQNIYDTINQ 195 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 78/223 (34%), Gaps = 35/223 (15%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L++DLS SM M S RT+M A+K+A+ F+ + Sbjct: 85 LDIVLLLDLSSSMQEEMGSGQSLKTGTTAAG---RTRMDAVKDAVRTFVR-------GRR 134 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSD 308 D +GL+ ++ ++ + + Y+ L T + A +L Sbjct: 135 DDRIGLVVFSDNAYVISPLTFDHQYLLDYLGFVDGEILLGEGQTAIGDGLALASAVL--- 191 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + ++ TDGE+N + I++ +AK I++ I Sbjct: 192 -----------ARQAGRDARGHQVVVLFTDGESNRGRD---PIEVVGEAKSAGIRVHVIG 237 Query: 369 INASPNGQR------LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ + L + V++ ++ + L+ + I Sbjct: 238 VDLDAEVKTRPGVQLLRRGVVAAGGRYFAADSERDLLTASRTI 280 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 83/265 (31%), Gaps = 43/265 (16%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + + + + E+ R F + ++VDLSGSM D Sbjct: 67 MLILSWLLIVTALAKPSILGEVQTREAFGRDVLMLVDLSGSM---------DEADFTTAD 117 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T++ A KN L F+ + GLI + + + + Sbjct: 118 GSTLTRLNAAKNVLKTFIAK-------RSGDRFGLILFGDAAFIQTPFTADQQVWLSLLE 170 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T A+ ++ + P Q+ +I LTDG + Sbjct: 171 EAQTGMAGQSTHLGDAIGLGIKVFEQN----------------PQPSEQQVMIVLTDGND 214 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYNVVN 394 S V + A IKI I++ + + + + + + ++ Sbjct: 215 TG--SFVEPVDAAKIAAARGIKIYIIAMGDPTHVGEQPMDMEVVQRVSQLTQARAFIAID 272 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 L +Q I +L + S + Sbjct: 273 QAELDKAYQLIDKLEPQQYSSASFR 297 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 81/264 (30%), Gaps = 70/264 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + +D K ++ ALK F+ Sbjct: 81 DESTRVKSTKGIDIVMAIDVSASM---------------LARDLKPDRLQALKQVAARFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAM 298 + + + +GL+ Y + V + +S+I T + Sbjct: 126 N-------GRPNDRIGLVEYAGESYTKTPLTSDKTVVLSSLNSIEYNSIIEGGTAIGMGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + K II LTDGENN+ + + A Sbjct: 179 ATAVNRLKESTAK------------------SKVIILLTDGENNSGFID--PKIASELAV 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK+ TI + + + L + ++ ++ + Sbjct: 219 EFGIKVYTIGLGTNGMASSPIGILPNGRFQYGNQPVKIDETLLKEIAKTTGGQYFRATSN 278 Query: 396 DSLIHVFQNISQ----LMVHRKYS 415 L +++ I++ + KY+ Sbjct: 279 TKLNEIYEEINKLEKTDIEEFKYT 302 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM K ++ A K+ + F+++ + + Sbjct: 89 DIMLAMDVSGSMQAMDF---------------KPNRLEAAKDVAISFINN-------RPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + + V ++ T + A L K + Sbjct: 127 DNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + +I LTDG NN ++ D A+ I++ T+ + Sbjct: 187 ------------------SRVVILLTDGSNNMG--DITPRMAADIARTFGIRVYTVGVGT 226 Query: 371 --------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L S ++ V+ ++L +++ I + Sbjct: 227 RGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDK 283 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++ A + Sbjct: 80 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APNRLTAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNRESTKTAIDKLQLADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P ++ ++DG+ + N Sbjct: 175 ATGEGIFTALQAI----------ATVGAVIGGGDEPPPARVVLMSDGKETVPSNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I T+S + + L K S + + Sbjct: 225 AYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASSL 284 Query: 396 DSLIHVFQNISQLMV 410 + L VF N+ + + Sbjct: 285 EQLKQVFTNLQEQIG 299 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 77/235 (32%), Gaps = 26/235 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + + + KK +++ A K L F+ + ++ Sbjct: 99 DVMVVVDLSGSMAEQDFVSKQSSD---LGAVKKISRLEATKEVLADFVKT-------RKG 148 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + ++ + T A+ A ++ + Sbjct: 149 DRLGLILFGDAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSSED 208 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + K I LTDG + I A ++I I++ Sbjct: 209 KASAEENAKPRE-------KVAIVLTDGNDTGSYV--EPIDAAKVAAAKDVRIHMIAMGD 259 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + S + +N D L + I +L S + Sbjct: 260 PRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGELEPQLYESTTYR 314 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 74/229 (32%), Gaps = 43/229 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ I + +DLSGSM +D + ++ K L F Sbjct: 89 QKKSAREIMVALDLSGSMSEEDFADKKGNKH---------DRLTIAKQVLREFAAQ---- 135 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E +GLI + + + + T A+ + + Sbjct: 136 ---REHDRLGLILFADSAYVQAPFTEDINVWQSLLEDVELGYAGFKTAFGDAIGLSIAVF 192 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 ++ R Q+ +I LTDG++ + K +K + A + +KI Sbjct: 193 EQEQSR------------------QRVMILLTDGDDTSSKMP--PVKAAEIAAKYGVKIY 232 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI+I + L K ++ ++ ++ L + I Q Sbjct: 233 TIAIGDPSTKGRYKMDLPTLEKVSAATGGQMFHAMDRKQLDQAYATIDQ 281 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 71/209 (33%), Gaps = 45/209 (21%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D + T++ + K + + L S+D + G+I + + S ++V + + Sbjct: 104 DYQPTRLESSKRSAEILLKSLDPKDYA------GIITFESGATSAAYLSPDKDRVIRKLQ 157 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + + S +K +I L+DG N Sbjct: 158 AIE--PKEGATAIGDGLALGIDMAESM------------------PNRKKVVILLSDGVN 197 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-----------------NGQRLLKTCVS 384 N + + A+E I++ TI + + + L + S Sbjct: 198 NAGVI--HPEQAAGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAAS 255 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++ V+ +L ++ N+++ +V K Sbjct: 256 TNGQYFKSVDDRTLSEIYSNLNKEIVREK 284 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 74/237 (31%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + +I T + A L K + Sbjct: 127 DNIGITLFAGESFTQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + ++ ++ + L V++ I + Sbjct: 227 NGMAPYPVRVGGTTQYINTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 73/218 (33%), Gaps = 50/218 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ D + ++ + + F+D + + LI + + + Sbjct: 89 DVSGTMQSVDFEPNRLEVARKTIQDFVDQ-------RPSDRIALIAFAGTAYTRVPLTLD 141 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 VR+ + S+ + T A+ L S K Sbjct: 142 HNVVRESLQDISFKSVNEEGTAIGMAISVGLNRL------------------KKSTSPSK 183 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I LTDG+NN + NT AK++ IKI TI + + Sbjct: 184 IMILLTDGDNNAGSIDPNT--ASTLAKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYE 241 Query: 374 ----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 N L K ++ +Y ++++L VF NI++ Sbjct: 242 SGLLNEDLLKKIAETTNGQYYRAKDSNALSQVFANINK 279 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 75/252 (29%), Gaps = 66/252 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + +D K ++ ALK F+ Sbjct: 81 DVSTRTNTTRGIDIVIAIDVSASM---------------LARDLKPNRLEALKEVASQFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAM 298 + +GL+ Y + V + +S+I T + Sbjct: 126 A-------DRPSDRIGLVEYAGESYTRTPITSDKSIVLSSLNDIQYNSIIEGGTAIGMGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L + + K II +TDG NN + + A+ Sbjct: 179 ATSVNRLKDSRAK------------------SKVIILMTDGVNNAG--FIEPSTASELAQ 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK+ TI + + + L + + ++ + Sbjct: 219 EFGIKVYTIGLGTNGTALSPVALRPDGSFQYGSIPVEIDEALLQEIADKTGGLYFRATDN 278 Query: 396 DSLIHVFQNISQ 407 +SL ++ I++ Sbjct: 279 ESLEEIYAEINK 290 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 38/222 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + P I L +D S SM ++ ++ A+K + F+ Sbjct: 82 KTPGVDIILCLDASESMAQPDF----------AIDGQRVNRLTAVKKVVHDFVKR----- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + + + + + T A+ A + + Sbjct: 127 --RDTDRIGLVVFGDYAFTQAPLTLDKGLLLNLIENLRIGMAGRKTAIGDALGVAGKRI- 183 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I L+DGEN ++ + IKI T Sbjct: 184 -----------------KDIPAMSKVVILLSDGENTAG--DMTPQGAAEALAALGIKIYT 224 Query: 367 ISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + G + L + +Y+ N + L +++ I + Sbjct: 225 IGMGTEQAGSKELAQIAAIGQGKYYHASNTEQLDSIYKEIDK 266 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 79/237 (33%), Gaps = 54/237 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLS SM + ++ A+K L F++ +E+ Sbjct: 87 DIVLAVDLSRSMEAEDYFVDRK----------RSNRLQAVKPVLSAFINR-------REN 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + R ++ + + + R LI T ++ A L K Sbjct: 130 DRIGLIAFAGRAYTVAPLTFDHKWLARQTERLQIGLIEDGTAIGDSLAVATSRLLEGAK- 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + FI+ LTDGEN + ++ AK+ I++ TI+ Sbjct: 189 -----------ERAGEREGAFIVLLTDGENTAGMMD--PMEGATLAKDAGIRVYTIAAGK 235 Query: 372 SP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + L+K + + N+D++ F+ I Sbjct: 236 NGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETNGEFFRAENSDTIDQAFEKI 292 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 72/233 (30%), Gaps = 54/233 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VDLSGSM + + ++ +K L F++ + Sbjct: 88 DIMLAVDLSGSMTEQDMAYN----------GQYVDRLTMVKAVLSDFIEQ-------RTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + V + + L+ + T A+ + + S Sbjct: 131 DRLGLILFGDTAFLQTPLTRDLKTVTKMLNEAQIGLVGRATAIGDALGLSVKRFASKDDS 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + ++ LTDG+N N A+E IK+ TI + + Sbjct: 191 N------------------RIVVLLTDGQNTAGNLN--PDDALLLAREEGIKVYTIGVGS 230 Query: 372 SP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ + L ++ + + Sbjct: 231 DNPRGFSLFNMGGSGGSNLDERLLKNIADDTGGLYFRAKDVAGLKQIYAELDK 283 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 67/224 (29%), Gaps = 47/224 (20%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VD+SGSM + ++ ++ F+ + +G Sbjct: 93 AVDVSGSMSTQDMAGNA-------------NRLQVVQKVAGDFIRR-------RHGDQVG 132 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T+ + V Q++ M + T A+ A + L S Sbjct: 133 LILFGTQPYLQAPLTTDLNTVGQFLDEAMIGVAGTQTAIGDAIGLAIKRLRDATNVSGRK 192 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-- 373 +I LTDG N+ + A ++I TI + + Sbjct: 193 GET-------------VLILLTDGSNDAGAMP--PDEAAKMAAAAGLRIYTIGVGSDQTD 237 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L ++ ++ + ++L V+ I + Sbjct: 238 PFGMGGANDLDEDTLKLIAKTTGGEYFRATDVENLQQVYTRIDR 281 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 73/229 (31%), Gaps = 43/229 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+ + L VD+SGSM + ++ A+K+ L F+ Sbjct: 90 EKSARDVVLAVDISGSMDQRDFKAADGTPK---------QRLEAVKDVLRAFIA------ 134 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ M LI + TR + + + ++ + + T A+ + Sbjct: 135 -ARDGDRMALIIFGTRAFVQAPFTEDLQSLNGFLEQTAVGMAGPNTALGDAIGLGIRTFE 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + Q+ +I L+DG + + + + A + + I T Sbjct: 194 SSEVD------------------QRMMIVLSDGADTSSRMT--PVIAASIAADKGVVIYT 233 Query: 367 ISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + + L + ++ + +L V++ I Sbjct: 234 IGVGDPDATGEDRVDLDALKDIANKTQGQYFFADDEAALTEVYRQIDAQ 282 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 73/253 (28%), Gaps = 65/253 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I + +D+S SM + +D + +++ALK Sbjct: 79 TQDISTRTKTTKGIDIVMAIDVSSSM---------------LARDLRPNRLSALKEVAAE 123 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + +GL+ Y + V + + T Sbjct: 124 FIRK-------RPNDRIGLVAYAGESYTKTPITSDKSIVLGALREITYGQLNDGTAIGMG 176 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + L S K II LTDG NN + D A Sbjct: 177 LATSVNRL------------------KESTAISKVIILLTDGVNNAG--FIEPQTAADLA 216 Query: 358 KENFIKIVTISINASPNG----------------------QRLLKTCV-SSPEYHYNVVN 394 E IK TI + + N + LL ++ ++ + Sbjct: 217 LEYGIKTYTIGLGTNGNALSPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEYFRATD 276 Query: 395 ADSLIHVFQNISQ 407 + L +++ I++ Sbjct: 277 NEKLEAIYEEINK 289 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++ A + Sbjct: 80 TIAMAGPTHDVRI-SRNRAVVMLVIDVSQSMRATDV---------------APNRLTAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNRESTKTAIDKLQLADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P ++ ++DG+ + N Sbjct: 175 ATGEGIFTALQAI----------ATVGAVIGGGDEPPPARVVLMSDGKETVPSNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I T+S + + L K S + + Sbjct: 225 AYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTASSL 284 Query: 396 DSLIHVFQNISQLMV 410 + L VF N+ + + Sbjct: 285 EQLKQVFTNLQEQIG 299 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 87/276 (31%), Gaps = 21/276 (7%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS- 70 K+ I ++ + I AL + + L+ + V K + A A+L+ ++ S Sbjct: 12 KRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQAVLSLTAENNSG 71 Query: 71 ----NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + A + +K +K L + + I + Sbjct: 72 RKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDEN--TMKFEPICTTQN 129 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 ++T + S++ + + ++ Q +S +I Sbjct: 130 NAITPKNGKQYAYSSSHVTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSGSMAQKIN-N 188 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +V DLSGSM + N+ + + + +K+ LK L D L Sbjct: 189 VNLPLDLMVVADLSGSMDYNI-------NNHKVYSNTEASKLTLLKQVLEELTDKYLLSE 241 Query: 247 HVKEDVYMGLIGYTTRVEK------NIEPSWGTEKV 276 + + +I + + + W + Sbjct: 242 EANPNNRISMIPFAMGAQHPIRNSCVLPFEWNQSHI 277 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 63/183 (34%), Gaps = 16/183 (8%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A+ L+ +K+ ++ + + + S ++ + S Sbjct: 381 AVNNMLNEPARSDVLKQQTRRTILILSDGSDSIGDDSGENNWYQKEIPFMNFS------R 434 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF--QKFIIFLTDGENNNFKSNVNTI 351 T + Q L + +S G + + T G + + +NT Sbjct: 435 ITENLILGKQELFNKSPQSKNLENHIYGYRYNYPIYLTNNTEKIQTKGLCDVIRDKLNTK 494 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQ--L 408 K+N KI+ + + + + + CV + +Y+ + +SL+ F+ IS+ Sbjct: 495 N-----KDNNTKIIFVELGYNSSSKDTWLHCVGGTQNYYSATSKESLLEAFKQAISKSDD 549 Query: 409 MVH 411 + H Sbjct: 550 IGH 552 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 83 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 125 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 126 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 178 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 225 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 226 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 285 Query: 413 KY 414 K Sbjct: 286 KV 287 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTPDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 36/248 (14%), Positives = 78/248 (31%), Gaps = 60/248 (24%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 ++ + I +I L +DLS SM ++ A+KN + F+ Sbjct: 77 TRKMNVKTEGINII-LALDLSKSMAALDF----------KLDGAIVNRLDAVKNVVKDFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + +G++ + + + + + + ++R T AM Sbjct: 126 MK-------RSGDRIGMVVFGSEAFTQMPLTRDYDTIAFVLSRLKIGAAGPSTAIGDAMG 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + L K + +I LTDG++N+ + D A+E Sbjct: 179 ISLKRLEDVKSK------------------SNIVILLTDGKSNSGEIT--PGAAADIARE 218 Query: 360 NFIKIVTISIN----------------------ASPNGQRLLKTCVSSPEYHYNVVNADS 397 +K+ TI + + + L + + + + DS Sbjct: 219 RGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVDMDHEALKEIADKTGGAFFAAADTDS 278 Query: 398 LIHVFQNI 405 L ++ I Sbjct: 279 LKKIYDMI 286 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 73/241 (30%), Gaps = 55/241 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 8 STTTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFIAQ--- 54 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E +GLI + + + V + + + + LI T + A + Sbjct: 55 ----REGDRIGLILFADHAYLQTPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKT 110 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 Q+ +I L+DG N + ++ + AK+ I Sbjct: 111 FIDSAA------------------PQRVMILLSDGSNTAGVLD--PLEAANIAKQYQTTI 150 Query: 365 VTISINA------------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + A + + L ++ ++ N L ++ I+ Sbjct: 151 YTVGVGAGEMIVKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTIN 210 Query: 407 Q 407 Q Sbjct: 211 Q 211 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 61/226 (26%), Gaps = 18/226 (7%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS--KMVS 70 + E+ NF++I AL ++ LL + + +N+++++ +AA LA Sbjct: 36 RFGKDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRFSTGE 95 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + D I AL DA F ++ + + S Sbjct: 96 SHTVVQDYGARIFYANVTALSADAINFQIAFPQDKTTDQQVQATAAFTYKSLFGVVASRL 155 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI--VSFIPALLRIEMGER 188 N T + L + + R + + + Sbjct: 156 TGDNWDKHQYTLTASVRLKNTIEVALVLDNSKSMDETRSGSSKKRIDLLKDAASQLVETM 215 Query: 189 P--------------IFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 L+ ++ + + + + Sbjct: 216 ASQSALITYVEKPVQFSLVPFAGSVNVGPQYLNAAWMDPEGKSSVN 261 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/316 (12%), Positives = 83/316 (26%), Gaps = 89/316 (28%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-------A 229 P L + V + + ++S +++ + + Sbjct: 331 GPFALDVTPPSDNNPDTLFVPMFGPAEYYNVDSRGNVISTVLNSWWQDDMSLAYSPRQSD 390 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSL 287 K L LD I + K G Y+ + T + ++ T + Sbjct: 391 LKKYYLRDSLDKI----YRKGRSEGGGPNYSCTTLPLTPLTDVTTEQGMKTVQTAIKAMV 446 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ AM ++ + + +G K +I LTDG N +K + Sbjct: 447 PNGGTNVPEAMAWGWRTIVQGAPFTEARASTERGND-------KVVIVLTDGANTYYKYD 499 Query: 348 VNTIKICDKA-------------------------KENFI-------------------- 362 D+A +E+ + Sbjct: 500 GLAGSGPDRAGNLSYYSTHGYTARITKKYSQSRLFQESGVSVSQNNTTYTKALNARFAKL 559 Query: 363 ---------KIVTISINASPNGQ------RLLKTCVSS---------PEYHYNVVNADSL 398 ++T++++ + LL++C S+ P + L Sbjct: 560 CDNAKAANIIVMTVALDLNEANSTEKAQIDLLRSCSSNSRVRMEGGKPAKLFWNSTGGEL 619 Query: 399 IHVFQNISQLMVHRKY 414 F+ I + + + Sbjct: 620 SETFRQIGDELSNLRL 635 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 83 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 125 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 126 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 178 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 225 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 226 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 285 Query: 413 KY 414 K Sbjct: 286 KV 287 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 72/237 (30%), Gaps = 58/237 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D SGSM K T++ +K + F+ +E Sbjct: 87 DIMLCLDTSGSMQALDFE----------LDGKPVTRLTVVKKVVADFIKE-------RET 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + + V + + T A+ A + L K + Sbjct: 130 DRIGLVVFGQEAFTQSPLTMDKGLLLSLVDKMEIGMAGDSTAIGNAIAVAGKRLKDLKAK 189 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 K +I LTDG +N ++ + A IKI TI + Sbjct: 190 ------------------SKIMIILTDGRSNTG--DITPEEAAGAAAALGIKIYTIGVGG 229 Query: 371 --------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + ++ ++ L ++++ I++ Sbjct: 230 TGPAPFKVNTFFGPRIVNQSVDLDEKTLKEIAAIGKGKYFRATDSKELANIYEIINK 286 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 71/221 (32%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K F++ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKQVAAEFIN-------GRPNDNIGLTVFAGEAFTQCPLTVD 147 Query: 273 TEKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + +I T + A L K + Sbjct: 148 HGVLLNLFNSIKGDIAQRGMIEDGTAIGMGLANAISRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------- 373 K II LTDG NN +++ + + AK+ I++ TI + + Sbjct: 192 --SKVIILLTDGSNNRG--DISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQY 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + Q L + ++ ++ + L V++ I + Sbjct: 248 VNVPVEIDEQTLTQIAGTTNGNYFRATSNSKLEEVYREIDK 288 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 35/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM T+++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PTRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 49/225 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++S + D K T++ A K+ F+ + + +GL+ + + Sbjct: 95 DISSTMLAGDIKPTRLEAAKSVATEFI-------LSRPNDNIGLVIFARESFTQCPLTTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + +I T + A + K + K Sbjct: 148 HAVLVNLFNGVNNGMIEDGTAIGLGLANAVNRIKDGKSK------------------SKV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDG NN+ ++ I + AK I+I TI + Sbjct: 190 IILLTDGSNNSG--DIAPITAAEIAKTFGIRIYTIGVGTHGVINIPVSTPMGIQYQRVQS 247 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L + ++ + L +++Q I + + + + Sbjct: 248 EFDAKSLENIANLTGGKYFGATDNSKLRNIYQEIDK-LEKTRIKI 291 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 73/216 (33%), Gaps = 22/216 (10%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V D +G+ H ++++ + K LD + D Sbjct: 66 PFVYDAAGNQHKKGRDLVLAIDASGSMAQSGFDEKDRFKTKYETTLDLSADFIKHRFDDN 125 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 MG++ + T ++ E + + + + T A+ QA + L+ + Sbjct: 126 MGVVIFGTFAYTASPLTYDLEAMESMLKMTTVGIAGESTAIGDALMQAMRTLSYGEA--- 182 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--A 371 K II LTDG +N +S+ KAKE IKI TI + + Sbjct: 183 ---------------QSKAIILLTDGYHNAGRSSPKAAVA--KAKEKGIKIYTIGVGKSS 225 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L S Y +A L V++ I + Sbjct: 226 DYDAALLDTIAKESGGKSYAAASAAQLKEVYKEIDK 261 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 39/123 (31%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 FR + ++ + NF+++ A++ + + I +++ ++SA +AA L Sbjct: 6 FRLHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVR 65 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + + S + T N ++ Sbjct: 66 AYGEGATEDDAYDAAQKFFWSNYALPQESVVDALAVATEPSTQALSVKFTRNVNEDTATA 125 Query: 127 ISM 129 + Sbjct: 126 EFV 128 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 79/218 (36%), Gaps = 39/218 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D S SM+ D ++K +K + F+D +++ Sbjct: 84 DIMLVIDSSDSMNQWGF----------DPGDPNKSKFDVVKEVVGDFIDK-------RKN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + ++ + +R+ + + K T A+ Q Y IL+ + Sbjct: 127 DRIGLINFASVAFVASPLTFEKDFLRKILQMQEPGIAGKRTAINDALLQTYNILSKSDAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K I LTDG +N + + +I ++ IK+ TI I + Sbjct: 187 ------------------SKIAILLTDGIDNASRIS--FDEIRRLISDSDIKLYTIGIGS 226 Query: 372 --SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + + SL +++ I + Sbjct: 227 YRDFDAPYLKALAQAGHGRFFAASDRRSLQKIYEAIDR 264 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 77/258 (29%), Gaps = 52/258 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 V+ + + R ++ LV+D+S SM ++ Sbjct: 77 MSFTVAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APNRLV 120 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A + A F D + + +GLI Y + P+ E + + + + Sbjct: 121 AAQEAAKQFADQLTPG------INLGLIAYAGTATVLVSPTTNREATKAAIDKLQLADR- 173 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A Q + + P I+ ++DG+ + N Sbjct: 174 --TATGEGIFTALQAV----------ATVGAVIGGGDEPPPARIVLMSDGKETVPSNPDN 221 Query: 350 TIKI---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNV 392 AK+ + I T+S + + L K S + Sbjct: 222 PKGAYTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLKKIADLSGGDAFTA 281 Query: 393 VNADSLIHVFQNISQLMV 410 + + L VF N+ + + Sbjct: 282 SSLEQLKQVFTNLQEQIG 299 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 78/229 (34%), Gaps = 40/229 (17%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + L VD+SGSM + +++ T++ A+K + F+ Sbjct: 78 VGEATSMPNSGRDLLLAVDISGSMREPDM----------VYNNRRITRLMAVKKVVGDFV 127 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ +GL+ + T+ ++ + V++ + + T A+ Sbjct: 128 AR-------RQSDRLGLVLFGTQAFLQAPLTFDVKTVQEMLIEAESGYAGEATAIGDAIA 180 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + L ++ II LTDGEN + + T D A + Sbjct: 181 LSIKRL------------------REQPNAKRVIILLTDGENTAGELGIATAT--DLAVK 220 Query: 360 NFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 KI TI+ + + + + + + N L + + + Sbjct: 221 ANTKIYTIAFSPYDREVDSHSMQQIAEQTGGEFFRARNTRDLEEIHRQL 269 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 88.8 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 77/227 (33%), Gaps = 38/227 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 100 IVLALDLSGSMERKDFS----------LDGETVSRLAAVKRVGAEFIRR-------RAGD 142 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + P++ T V + + L+ + T + A + L + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTAAVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQLAD 202 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 P K ++ L+DG NN + + AK+ +++ TI++ Sbjct: 203 AEGGG-------PPPSRDKVVVLLSDGANNAGQ--TAPKDVAALAKDLGVRVYTIALGPI 253 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L +S + V D L V I + Sbjct: 254 DMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLESVAAAIDE 300 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 75/239 (31%), Gaps = 47/239 (19%) Query: 189 PIFLIELVVDLSGSMHCAMNSDP--------EDVNSAPICQDKKRTKMAALKNALLLFLD 240 L+ +V+ S N D D++++ + D ++ K F+ Sbjct: 63 TFSLVVIVLARPQSYSAWDNKDSEGIDIMLAMDISASMLTNDVIPNRLEVAKEVASDFI- 121 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-----SLILKPTDST 295 + + +GL + + + + LI T Sbjct: 122 ------SGRPNDNIGLTIFAGEAFTQCPLTTDHASLINLLNSVRTDLVVKGLIQDGTAIG 175 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + A L S K + K +I LTDG NN + + + Sbjct: 176 MGLINAVGRLKSSKAK------------------SKVVILLTDGSNNVGSIS--PMTAAE 215 Query: 356 KAKENFIKIVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 AK+ I++ TI + + + L + + Y+ + L ++++I++ Sbjct: 216 IAKKFNIRVYTIGLGTEQNNGYSDIDYTTLRQIANVTNGKFYSAQSQTELSQIYKDINK 274 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 70/194 (36%), Gaps = 31/194 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K A F +I + L+ Y + I+ + ++ V D Sbjct: 397 LNETKTATREFASTIFKSDAD-----VCLVSYDSSARNVIDSTDNEYALKAAV---RDLS 448 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A++ +Y+ L ++ I+ ++DGE N Sbjct: 449 AGGGTNIEDALRVSYERLEGSGSD------------------KRIIVLMSDGEANEGLVG 490 Query: 348 VNTIKICDKAKENFIKIVTISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + I ++ K++ + I T+ + QR+++ S P HY V +A L + F Sbjct: 491 DDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIAS-PGCHYEVDDASQLRYFFG 549 Query: 404 NISQLMVHRKYSVI 417 +I + ++ + Sbjct: 550 DIGDDINGTRFIYV 563 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 43/311 (13%), Positives = 89/311 (28%), Gaps = 74/311 (23%) Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI---------- 177 + + D S+ T + + +L K + Sbjct: 15 FLGFIPLVFRDLSHKKRKGVAVSSTKNMQVISYFGAVLWLLKISKYIILSALILAIARPR 74 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + E I L +D+S SM + +D + ++ ALK Sbjct: 75 TFSISEDRDETKGMDIVLSIDVSLSM---------------LAKDLEPDRLTALKEIART 119 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + +GL+ Y+ + + V + + + T+ Sbjct: 120 FIKQ-------RTTDRIGLVEYSGEALMRVPLTSDHRVVEEELMSFNPMDLEGGTNIGDG 172 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L K + K II +TDG N + + + A Sbjct: 173 LAVAVSHLRKSKAK------------------SKIIILMTDGVNTIDNAMS-PLTAAELA 213 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 + N IK+ TI I ++ + L + ++ + Sbjct: 214 RNNDIKVYTIGIGSNGLALMPTQQDIFGNLVFTEEQVKIDEYLLRDVAQITGGKYFRATS 273 Query: 395 ADSLIHVFQNI 405 +SL +++ I Sbjct: 274 NESLKQIYEEI 284 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 77/266 (28%), Gaps = 71/266 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + ++ I L VD S SM + +D + ++ ALK + F Sbjct: 80 DVSTKIKSDKGVDIMLTVDTSLSM---------------LARDLEPDRLTALKAVAVKFS 124 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ +GL+ Y+ + + E + + + + T + Sbjct: 125 KE-------RQADRLGLVSYSGEALTRVPLTTDREVLIREINALESGELEDGTAIGIGLA 177 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFKSNVNTIKICD 355 A + K + K II +TDG N ++ + Sbjct: 178 TAINHIKDSKAK------------------SKVIILMTDGVESINPTNDLMYISPQTAAE 219 Query: 356 KAKENFIKIVTISIN-----------------------ASPNGQRLLKTCVSSPEYHYNV 392 A IK+ TI I + + L + ++ Sbjct: 220 MATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIADLTGGLYFRA 279 Query: 393 VNADSLIHVFQNISQ----LMVHRKY 414 + SL ++Q I + + KY Sbjct: 280 TDNQSLQKIYQEIDRLEKSEINEVKY 305 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 75/265 (28%), Gaps = 69/265 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 I + +D+S SM + +D K + ALK F+ Sbjct: 81 SENAHTKITDGIDIVMAIDVSASM---------------LSKDLKPNRFEALKKVASQFV 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + +GL+ Y + + ++ I T + Sbjct: 126 K-------DRPNDRIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLA 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A L K + + II LTDG NN + + A E Sbjct: 179 TAVNRLKESKAK------------------SRVIILLTDGVNNTGFID--PQTAAELAAE 218 Query: 360 NFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNAD 396 I++ TI I ++ + + K + ++ + Sbjct: 219 YGIRVYTIGIGSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQ 278 Query: 397 SLIHVFQNISQ----LMVHRKYSVI 417 L ++ I++ + KY+ + Sbjct: 279 KLQQIYDEINKMETTKIEEFKYTEV 303 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 64/190 (33%), Gaps = 28/190 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K+ A K L F +++ +GL+ + ++ + Sbjct: 84 TSGSMAEFNKIDAAKAVSLDFAKK-------RKNDALGLVVFGNIAYIASPLTFDKKTFE 136 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + R S+ T A+ + + + +K II LT Sbjct: 137 DILKRIYVSIAGGKTAIYDALFLSSNLFKNANG-------------------EKIIILLT 177 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG +N S + K K+ IK+ +I+I + L K + Y + + Sbjct: 178 DGMDN--MSITPLDVVIKKLKKEHIKVYSIAIGGDADLSVLKKISKETNGKFYIASSLED 235 Query: 398 LIHVFQNISQ 407 L ++ +I++ Sbjct: 236 LKKIYSDINK 245 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 75/242 (30%), Gaps = 68/242 (28%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM + +D K ++ A K F++ + + Sbjct: 89 DIMLAVDVSTSM---------------LAEDLKPNRLEAAKQVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILT 306 +GL + + + LI+ T + A L Sbjct: 127 DNIGLTIFAGEAFTQCPLTVDHGVLLNLFQSIKCDIAQKGLIMDGTALGMGLANAVSRLK 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K II LTDG NN +++ + + AK+ I++ T Sbjct: 187 DSKAK------------------SKVIILLTDGVNNRG--DISPLTAAEIAKQFGIRVYT 226 Query: 367 ISINASP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + + Q L + ++ ++ + L V+Q I Sbjct: 227 IGVGTNGTAPYPMQTYAGVQYVQVPVEIDEQTLTQIAGTTNGNYFRATSNSKLKEVYQEI 286 Query: 406 SQ 407 + Sbjct: 287 DK 288 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 71/217 (32%), Gaps = 48/217 (22%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D++ + + +D K ++ A KN + F+ + + +GL+ ++ + Sbjct: 98 TDISGSMLAEDLKPNRLEAGKNIAIDFIK-------GRPEDRIGLVIFSGESFTQCPLTI 150 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + + + ++ T + A L + + K Sbjct: 151 DHDVLINLFSDISNGMVEDGTAIGMGLATAVNRLKDSEAK------------------SK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------------- 370 +I LTDG N + + AK+ +++ TI + Sbjct: 193 VVILLTDGSNTTGSIP--PLTAAEIAKQMKVRVYTIGVGTKGYAPYPVKTPFGTQYQQVP 250 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L K + ++ N + L ++Q I + Sbjct: 251 VTIDEGVLSKIAGITGGKYFRATNNEKLKEIYQQIDK 287 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 75/265 (28%), Gaps = 69/265 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + QD K + ALK F+ Sbjct: 81 SENAQTKITDGIDIVMAIDVSSSM---------------LSQDLKPNRFEALKKVASQFV 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + +GL+ Y + + + + T + Sbjct: 126 K-------DRPNDRIGLVVYAGESYTKTPVTTDKGIILSSLAELTYGQVEDGTAIGMGLA 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A L K + + II LTDG NN + + + A E Sbjct: 179 TAVNRLKESKAK------------------SRVIILLTDGVNNTGVID--PLIAAELAAE 218 Query: 360 NFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNAD 396 IK+ T+ I + + + K + ++ N Sbjct: 219 YGIKVYTVGIGTNGMALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQ 278 Query: 397 SLIHVFQNISQ----LMVHRKYSVI 417 L ++ I++ + KY+ + Sbjct: 279 KLQQIYDEINKLETSKIEEFKYTEV 303 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 78/239 (32%), Gaps = 61/239 (25%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + +D+SGSM D ++R+++A K+ + F+ +E Sbjct: 88 LVVALDISGSMEEQDMDDN----------GQRRSRIAVTKDVAMDFVKQ-------REGD 130 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQILT 306 + L+ + T ++ V Q++ ++ T A+ A + L Sbjct: 131 RIALVLFGTHPYLQTPLTFDHPTVMQHIYEAQLTMADDLQRGIHATAIGDAIGLAVKRL- 189 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K +I LTDG +N + +K A +KI T Sbjct: 190 -----------------RDIDAPDKTLILLTDGSDNASQV--APLKAAQIAAREGLKIYT 230 Query: 367 ISINASPNGQRLLK------------------TCVSSPEYHYNVVNADSLIHVFQNISQ 407 I + A LL ++ ++ N + L ++Q+I + Sbjct: 231 IGLGAEQRQASLLGFDFGFGKNREIDEKTLKDIAKATDGRYFRARNPEELREIYQHIDR 289 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 83/257 (32%), Gaps = 70/257 (27%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I + +DLS SM + +D K +M ALK F++ Sbjct: 86 NTTNGIDIVMAIDLSSSM---------------LAKDFKPNRMEALKEVAASFVE----- 125 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQI 304 ++ +G++ YT + V + D+++ T + A Sbjct: 126 --ARQSDRIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDNVLQDGTGIGMGLATAVNR 183 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L K + K II +TDG NN + + + AKE IK+ Sbjct: 184 LKDSKAK------------------SKVIILMTDGVNNAG--FIEPVTAAEFAKEFGIKV 223 Query: 365 VTISINASPNG----------------------QRLLKTCV-SSPEYHYNVVNADSLIHV 401 TI I + N ++L+K + ++ + SL + Sbjct: 224 YTIGIGTNGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYFRAQSNSSLESI 283 Query: 402 FQNISQ----LMVHRKY 414 + I++ + K+ Sbjct: 284 YSEINKLEKTEIEDLKF 300 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/311 (10%), Positives = 77/311 (24%), Gaps = 37/311 (11%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLL-----------NQRYNQKIVSFIPALLRIE 184 L + + + + + + + R Sbjct: 56 SLSVTGTASSTSESSKADVVIVFDISNSMDEETNTYVEYATGRYGSVSSDAPTGSSTRRR 115 Query: 185 MGERPIFLIEL-----VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + R + + + S D + + ++T++ K+A + Sbjct: 116 LYRRSTNNWGYYQYTEITNDTTSGTVYYLGDNYQYHEYTGKRYSQKTRLDVAKSATNTMI 175 Query: 240 DSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL------KPT 292 D + + V + L+ + T S +E + V T Sbjct: 176 DQLLANNATNPGSVRISLVSFDTFASDATAWSTSSENLHSIVNGYKTPQSSHLGGHRGGT 235 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI---------PSLPFQKFIIFLTDGENNN 343 + A+++A F G P + D ++ Sbjct: 236 NWEDALQKA-DGTQPRADAQKHVIFVSDGNPTFRISSINGNPDDQYNDVHGHGDDDYYHS 294 Query: 344 FKSNVN--TIKICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + K + T+ + Q L +S + +Y + +L Sbjct: 295 HPNYNYDAAKDDAKKIVDGGAAFYTVGTFGDAARMQNLATEAGAS-DNYYKADDEAALKA 353 Query: 401 VFQNISQLMVH 411 F+NI + H Sbjct: 354 AFKNIVASITH 364 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVPPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 78/256 (30%), Gaps = 52/256 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V+ + + R ++ LV+D+S SM +++AA Sbjct: 79 LTVAMAGPTRDVRV-PRNRAVVMLVIDVSQSMRATDV---------------SPSRLAAA 122 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F D + + +GLI Y + P+ E + + + + Sbjct: 123 QEASKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKTAIDKLQLADR--- 173 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A Q + + P I+ +DG+ + N Sbjct: 174 TATGEGIFTALQAI----------ATVGAVIGGGDEPPPARIVLFSDGKETVPSNPDNPK 223 Query: 352 KI---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 AK+ + I TIS + Q L K S + + Sbjct: 224 GAFTAARTAKDQGVPISTISFGTPYGYVEINEQRQPVPVDDQMLKKIADLSEGEAFTASS 283 Query: 395 ADSLIHVFQNISQLMV 410 + L V+ N+ Q + Sbjct: 284 LEQLREVYANLQQQIG 299 >gi|134100328|ref|YP_001105989.1| hypothetical protein SACE_3793 [Saccharopolyspora erythraea NRRL 2338] gi|133912951|emb|CAM03064.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 327 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 72/241 (29%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + L VD+S SM ++ A K A F D + Sbjct: 86 PRNRATVMLTVDVSLSMKATDVE---------------PNRLEAAKVAAKEFADQLTPG- 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+ + + P+ V+Q + S T + + A + Sbjct: 130 -----INLGLVSFAGTATVLVMPTTDRASVKQAIDNLKLSEA---TATGDGINAAMSAID 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 S F + V PS I+ + DG + +AK+ I Sbjct: 182 S----------FGKMVGGPSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIP 231 Query: 364 IVTISINASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + + + + S + +A+ L V+ + + + Sbjct: 232 ISTISFGTKHGSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQI 291 Query: 410 V 410 Sbjct: 292 G 292 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 75/253 (29%), Gaps = 65/253 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I + +D+S SM + +D K +++ALK Sbjct: 78 TEDISTRTKTTKGIDIVMAIDVSSSM---------------LARDLKPNRLSALKEVAAD 122 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + +GL+ Y + V + + T Sbjct: 123 FIRQ-------RPNDRIGLVAYAGEAFTKTPITSDKSIVLNSLREITYGQLNDGTAIGMG 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + L S K II LTDG NN+ + D A Sbjct: 176 LATSVNRL------------------KESKAISKIIILLTDGVNNSG--FIEPQTAADLA 215 Query: 358 KENFIKIVTISINASPNG----------------------QRLLK-TCVSSPEYHYNVVN 394 E IK TI + + N ++LL+ ++ ++ + Sbjct: 216 VEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLEGIAETTGGKYFRATD 275 Query: 395 ADSLIHVFQNISQ 407 + L ++ I++ Sbjct: 276 NEKLEAIYDEINK 288 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 75/265 (28%), Gaps = 69/265 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 I + +D+S SM + +D K + ALK F+ Sbjct: 88 SENAHTKITDGIDIVMAIDVSASM---------------LSKDLKPNRFEALKKVASQFV 132 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + +GL+ Y + + ++ I T + Sbjct: 133 K-------DRPNDRIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLA 185 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A L K + + II LTDG NN + + A E Sbjct: 186 TAVNRLKESKAK------------------SRVIILLTDGVNNTGFID--PQTAAELAAE 225 Query: 360 NFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNAD 396 I++ TI I ++ + + K + ++ + Sbjct: 226 YGIRVYTIGIGSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQ 285 Query: 397 SLIHVFQNISQ----LMVHRKYSVI 417 L ++ I++ + KY+ + Sbjct: 286 KLQQIYNEINKMETTKIEEFKYTEV 310 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 68/213 (31%), Gaps = 47/213 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +++ +D + T+ A + A F++ + +GLI + + Sbjct: 58 DASTSMQAEDFQPTRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLD 110 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +++ + + T A+ A L + K Sbjct: 111 YSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAE------------------SKV 152 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--------------- 377 I LTDG NN + + + A+ +++ I + +S + Sbjct: 153 AILLTDGRNNRGQID--PRTAAEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAG 210 Query: 378 -----LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 L VS+ +++ N D+L ++ I Sbjct: 211 VDAEMLRSVSVSTGGQYFSATNRDALERIYAEI 243 >gi|291008772|ref|ZP_06566745.1| hypothetical protein SeryN2_29978 [Saccharopolyspora erythraea NRRL 2338] Length = 324 Score = 88.0 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 72/241 (29%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + L VD+S SM ++ A K A F D + Sbjct: 83 PRNRATVMLTVDVSLSMKATDVE---------------PNRLEAAKVAAKEFADQLTPG- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+ + + P+ V+Q + S T + + A + Sbjct: 127 -----INLGLVSFAGTATVLVMPTTDRASVKQAIDNLKLSEA---TATGDGINAAMSAID 178 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 S F + V PS I+ + DG + +AK+ I Sbjct: 179 S----------FGKMVGGPSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIP 228 Query: 364 IVTISINASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + + + + S + +A+ L V+ + + + Sbjct: 229 ISTISFGTKHGSIEIEGEQEFVEVDDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQI 288 Query: 410 V 410 Sbjct: 289 G 289 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 88.0 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 43/368 (11%), Positives = 100/368 (27%), Gaps = 41/368 (11%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +LA + + ++ + + + + N Sbjct: 460 VLANNLSGNHDSTVTLIALDAWGHSLQNINVTLNNTAPSLGTLSLDGSNESNLINFTTDQ 519 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + N + + + + Y + Sbjct: 520 YGRVITEFTSSNFVGNCTIIGLSDAINDSLSIEIRNQPFISASIDAEPYVVTSGDIVNIT 579 Query: 181 LRIEMG-----ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT--------K 227 I + R L++D SGSM + + + Sbjct: 580 TVITVEGELPVSRSAATSMLILDRSGSMDPDYYAGTALDIVLVLDRSGSMKFLGNAPEQP 639 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQYV-TRDMD 285 + K+A +F++++ + +G++ +++ + S + + Sbjct: 640 LTDAKSAAKIFMENLLS------NTEVGVVSFSSTSTVDRQPVSLNISGNKDLLHNAIDS 693 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T AM A +L + +K +I LTDG Sbjct: 694 MVADGGTAIGDAMADANNLLING-----------------RPDAKKIMIVLTDGVATAGS 736 Query: 346 SNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + I A N I+I +I + +S + L + + +YN + L V+ Sbjct: 737 DRDGSDAIST-ANLNNIRIYSIGLGSSEYIDEPMLKRIASETGGSYYNAPSGSELQTVYN 795 Query: 404 NISQLMVH 411 IS+ + Sbjct: 796 TISKEISD 803 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 71/240 (29%), Gaps = 55/240 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D+SGSM + +++A ++ F++ + Sbjct: 92 DLMLAIDVSGSMAQEDYE----------LDGRPVSRLAVVRTVASAFVER-------RAG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + TR ++ V + + L + T A+ A + L Sbjct: 135 DRLGLILFGTRAYLQTPLTFDGATVAAMLRDSVVGLAGRETAIGDAIGLAVKRL------ 188 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 Q+ +I LTDG+N + +++ TI I Sbjct: 189 ------------REQPEGQRVLILLTDGDNTAGALDPLEAAELAAQ--AGVRVYTIGIGG 234 Query: 371 -----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + + + + L V+ + +L + Sbjct: 235 GELGVRSLFGMRLLRQASDFDPATLERIAEITGGRAFTADSRQQLEAVYDELDRLEPSER 294 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 75/242 (30%), Gaps = 68/242 (28%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM + +D K ++ A K F++ + + Sbjct: 89 DIMLAVDVSTSM---------------LAEDLKPNRLEAAKQVAAQFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +GL + + + LI T + + L Sbjct: 127 DNIGLTIFAGEAFTQCPLTIDHGVLLNLFGSIKGDIAQRGLIEDGTAIGMGLANSISRLK 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K II LTDG NN +++ + + AK+ I++ T Sbjct: 187 DSKAK------------------SKVIILLTDGSNNRG--DISPLTAAEIAKQFGIRVYT 226 Query: 367 ISINASP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + + Q L++ ++ ++ + L V++ I Sbjct: 227 IGVGTNGTAPYPMPTYAGVQYVNVPVEIDEQTLIQIASTTNGNYFRATSNSKLKEVYEEI 286 Query: 406 SQ 407 + Sbjct: 287 DK 288 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 73/237 (30%), Gaps = 63/237 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + +D K ++ A K+ F++ + + Sbjct: 89 DIMLAIDVSTSM---------------LAEDLKPNRLEAAKDVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ + + + +I T + A L K + Sbjct: 127 DNIGITLFAGESFTQCPLTVDHAVLLNLFQGIKCDIIEDGTAVGMGIANAVTRLKDSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K II LTDG NN +++ + + AK I++ TI + Sbjct: 187 ------------------SKVIILLTDGTNNKG--DISPLTAAEIAKSFGIRVYTIGVGT 226 Query: 372 SP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L + + ++ + L V++ I + Sbjct: 227 NGMAPYPVPVGGTVQYINTPVEIDEKTLTQIAGITDGNYFRATSNSKLKEVYEEIDK 283 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 72/225 (32%), Gaps = 49/225 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD K ++ A K+ F++ + + +GL+ + + Sbjct: 96 DISTSMMAQDLKPNRLEASKDVASAFIN-------GRPNDNIGLVVFAAESFTQCPLTTD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + +I T + A + + + K Sbjct: 149 HTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRIKDSQAK------------------SKV 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 II LTDG NN + + + AK +++ TI + Sbjct: 191 IILLTDGVNNQGEI--APVTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDV 248 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + L + ++ ++ + SL ++ I + M K SV Sbjct: 249 EIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK-MEKTKISV 292 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 44/364 (12%), Positives = 87/364 (23%), Gaps = 53/364 (14%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA +A + + ++ + + + Sbjct: 252 AADVAACRANARIYAYTTSGGTYRVFVTSNYVQNNGYCWPNYYTYTYCGYTVVSRTDYYS 311 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + + + + Sbjct: 312 TTTWTRTNNASNSTPWPSASYYGTPSYSYAQYNGTITATPTSAGGYGSGSTTTIKD---- 367 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + V S + P DV+ T A K A Sbjct: 368 -----NSTITANSDLLGVGTDSWNGCVIDRKQPYDVSGQSPIASNTDTLYPAAKCATNNL 422 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 L + + VR + + T+ T + Sbjct: 423 L-------------------------PVMGLTTDIAAVRAHAQKLT---PAGNTNITIGV 454 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-------NFKSNVNTI 351 + ++L+ + F K++I +TDGEN N T+ Sbjct: 455 QWGMELLSPE-------LPFNTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTINARTL 507 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 C AK+ I + TI + + +LK+C S PEY Y+V + L + + Sbjct: 508 LACQAAKDLGITVYTIRVMEGNS--DMLKSCASRPEYFYDVTASSQLTSTLAKVFYSIQS 565 Query: 412 RKYS 415 + + Sbjct: 566 TRLT 569 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 31/354 (8%), Positives = 77/354 (21%), Gaps = 23/354 (6%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 N I F L + + G + + N ++ A +AA L GA Sbjct: 21 FWRDVSGNTMIAFGLIAATLVAAAGAGVEFSQAQEQTNRLQDAADAAALRGAL---MAKD 77 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + I K S + Sbjct: 78 ETAAKAAADTVFTLNLTD----SGIAPTSKGLKFEISGSHKTAVYTATAQIKTTFLKLAG 133 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA------LLRIEMGE 187 L + + L + + + + + + + + Sbjct: 134 IETLTVGATSTAEAEMRKSEIALVLDSTGSMSRDSRMTNLKAAVDSVLASLLVSGENVWD 193 Query: 188 RPIFLIELVVDLS--GSMHCAMNSDPEDVNSAPICQDKKRT--KMAALKNALLLFLDSID 243 + ++ +S G + + + C T + +L L Sbjct: 194 ARVGIVPFDTQVSIRGVPAAGIAGEFGETPYTYSCTGSSMTSDRCTSLGTTRDLMCSGAA 253 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-----RQYVTRDMDSLILKPTDSTPAM 298 ++ + + + + + S + Y ++ TD Sbjct: 254 DVAACRANARIYAYTTSGGTYRVFVTSNYVQNNGYCWPNYYTYTYCGYTVVSRTDYYSTT 313 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + + + I + TIK Sbjct: 314 TWTRTN-NASNSTPWPSASYYGTPSYSYAQYNGTITATPTSAGGYGSGSTTTIK 366 >gi|332185631|ref|ZP_08387379.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] gi|332014609|gb|EGI56666.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] Length = 420 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/428 (9%), Positives = 105/428 (24%), Gaps = 45/428 (10%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 ++FAL++ IG + ++ + + +AA L SK + + S Sbjct: 1 MMFALALPVLTCSIGMGVDYARAAKAQSKLNAIADAAALLAVSKNAMRADDATAAYFARS 60 Query: 84 NHA-KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + + A + + +++ + S + + + L S Sbjct: 61 FFSLQSAALVKSDGITLSNVTVQAPTDGNGRRTAVVNYRATSENVFARILGMSTLTISGK 120 Query: 143 T-IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 + + + L + + + + Sbjct: 121 SETANAIAPDIDFYMLLDVSASMALPTTSSGLNKV--------AQSNTSRCVFACHTGEK 172 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + ++ A +A+ + ++ Y I Sbjct: 173 RFRGYDAHGKQTDLYGVALSYGLPLRIDAEGDAVNQLTATARSMASKNGSDYRIAITTFR 232 Query: 262 RV---EKNIEPSWGTEKVRQYVTRD---MDSLILKPTDSTPAMKQAYQ---ILTSDKKRS 312 + + I PT + + + + +SD Sbjct: 233 GARGFSVRQPLTNDLTAAGHKAANLKPPYYASIGCPTSACKSSEVGWNDRDTGSSDAMDQ 292 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI------CDKAKENFIKIVT 366 + G + Q + +TDG N ++ CD K I+I Sbjct: 293 INAMIPQPGSGVNGQDPQAVVFMVTDGMRNEKSPKGARPEVAFDTAKCDMIKHRGIRIAV 352 Query: 367 ISINASPNG------------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + L++C S P + V + + Q Sbjct: 353 LYTEYLRDAVKGTTNLERSVEPYLYQVEPALQSCAS-PGLYTKVTTDGDISAALNTLFQN 411 Query: 409 -MVHRKYS 415 + + + Sbjct: 412 AVATTRIT 419 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 72/225 (32%), Gaps = 49/225 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD K ++ A K+ F++ + + +GL+ + + Sbjct: 96 DISTSMMAQDLKPNRLEASKDVASAFIN-------GRPNDNIGLVVFAAESFTQCPLTTD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + +I T + A + + + K Sbjct: 149 HTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRIKDSQAK------------------SKV 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 II LTDG NN + + + AK +++ TI + Sbjct: 191 IILLTDGVNNQGEI--APVTAAEIAKTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDV 248 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + L + ++ ++ + SL ++ I + M K SV Sbjct: 249 EIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK-MEKTKISV 292 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 87.6 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 38/362 (10%), Positives = 77/362 (21%), Gaps = 71/362 (19%) Query: 16 ASEKANFSIIFALSVM-SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++F L V + ++G I + + + SA + LA A + S + Sbjct: 9 RRQGGQAIVMFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAARLLGSGSNA 68 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNH--------------------IKESLSGYSAVFY 114 + + A+ + +F + + Sbjct: 69 AQQLSMAKATAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGSGISTYKVR 128 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++ + M + + S + + + Sbjct: 129 TVAVKATATMPTLFMRIIGMPTVTVSGSGTA--SRRDVRVILVMDRSSSMG--------- 177 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 I I + L S S E ++ Sbjct: 178 ---TYYSGINQTPPSINDMALKFVNSFSGAGEFGGRDEVGLVVYGGSGIVAYP---PRDI 231 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D G I D T + Sbjct: 232 TKDYTDYTKFTPPDNNFKASGNIP----------------------KYIADITSGSNTGT 269 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ AY L +D + I+ TDG N + N I Sbjct: 270 AEALYLAYMTLRADAATNPDLATKL-----------NVIVLFTDGIPNGVTAMANDKTIA 318 Query: 355 DK 356 ++ Sbjct: 319 NQ 320 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 84/359 (23%), Gaps = 42/359 (11%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 ++ N K + L+ + ++ +++++ + + + + Sbjct: 249 KASGNIPKYIADITSGSNTGTAEALYLAYMTLRADAATNPDLA--TKLNVIVLFTDGIPN 306 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI---------EMGERPI 190 + + + + + +L ++ + I Sbjct: 307 GVTAMANDKTIANQHYLMIPNCTNLGLGDTSRTPMLSGSPNPNIAGWFAQWGGNSYTDNS 366 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +D++S K+ + + L+ + + Sbjct: 367 GPHGFHKPMMAYADTGYTGKGDDIDSYMKNPGHDGGKIDQMTGTGCTADPMVGELNKLPD 426 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G + + T + + + Sbjct: 427 HDIYGNYLNLSAAPAVSGLTPPVGSAGALYKL--------GTLYSTSTQCNNSSYNPSAP 478 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + ++ + + D + T K I TI Sbjct: 479 DNACQMGLGSWQAAAHQAWKIWNQIIWDKATQTNIPDPATNKASP-------VIFTIGFE 531 Query: 371 ASPNG---QRLLKTCVSSP-----------EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ + +LL+ + P YN + +++ FQ I + + S Sbjct: 532 STASDLPDMKLLQLIANDPSSPAPFSTRVQGKAYNAKDPNAVDAAFQQIRSEI--LRLS 588 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 87.6 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 74/265 (27%), Gaps = 69/265 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + QD K + ALK F+ Sbjct: 81 SENAKTKITDGIDIVMAIDVSASM---------------LSQDLKPNRFEALKKVASQFV 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + +GL+ Y + + ++ I T + Sbjct: 126 K-------DRPNDRIGLVIYAGESYTKTPVTTDKLIILNALSEITYGQIEDGTAIGMGLA 178 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A L K + + II LTDG NN + + A E Sbjct: 179 TAVNRLKESKAK------------------SRVIILLTDGVNNTGFID--PQTAAELAAE 218 Query: 360 NFIKIVTISIN-----------------------ASPNGQRLLKTCVSSPEYHYNVVNAD 396 IK+ T+ I + + K + ++ N Sbjct: 219 YGIKVYTVGIGTNGMALSPYALNADGSIIYRMQQVDIDEPLMKKIAQVTKGRYFRATNNQ 278 Query: 397 SLIHVFQNISQ----LMVHRKYSVI 417 L ++ I+Q + KY+ + Sbjct: 279 KLQQIYDEINQMETTKIEEFKYTEV 303 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 87.6 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 82/256 (32%), Gaps = 70/256 (27%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + +D+SGSM + +D K +M ALK F++ Sbjct: 88 TTKGIDIVMAIDVSGSM---------------LAKDLKPNRMEALKRVAADFVEE----- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQIL 305 + + +GL+ Y + + + + + D+++ T + A L Sbjct: 128 --RPNDRIGLVLYASEAYTKTPVTSDKPIILEAIKGIRYDTVLQDGTGIGMGLATAVNRL 185 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K + + II LTDG NN + D AK+ IK+ Sbjct: 186 KDSKAK------------------SRVIILLTDGVNNAG--FIEPETAADIAKQYGIKVY 225 Query: 366 TISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 TI + + + + + + ++ + D L ++ Sbjct: 226 TIGLGTNGMAESPYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGTYFRATSNDKLAEIY 285 Query: 403 QNISQ----LMVHRKY 414 +I++ + K+ Sbjct: 286 NSINKLETTEIQELKF 301 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 87.6 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 40/296 (13%), Positives = 82/296 (27%), Gaps = 65/296 (21%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LI 193 N ++ + + L + L IV P I Sbjct: 29 NDASLQVSSTKAFEGMAKTKKIWLYYFPSTLIVVALLVIVLARPQTSDSYSNSTTEGINI 88 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + +D+SGSM + +D ++ A K+ + F+ + + Sbjct: 89 VIAMDISGSM---------------LARDLSPNRLEAAKDVGIEFI-------LSRPNDN 126 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL+ + + + + +I T + A + + + Sbjct: 127 FGLVVFAGESFTQCPITSNHASLVNLFKQVDFGIIQDGTAIGLGLATAINRIKDAEGK-- 184 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K II LTDG NN ++ I A+ I++ TI + Sbjct: 185 ----------------SKVIILLTDGTNNTG--DIAPISAAQIAQSYGIRVYTIGVGTQG 226 Query: 374 ----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + ++ ++ N SL ++ I + Sbjct: 227 IAEVPMLDQFGNIHYTEAEVVIDETTLQQIASTTGGKYFRATNVSSLKQIYSEIDK 282 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 87.6 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 81/244 (33%), Gaps = 50/244 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + L+VDLSGSM + KK +++ A+K L FL Sbjct: 113 PTRDLLLLVDLSGSMAQEDFKND---------AGKKVSRLDAVKEVLDGFLAK------- 156 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GL+ + + + ++ + + T A+ + Sbjct: 157 RKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVNLFDE- 215 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K II LTDG N+ KS V ++ A + IKI T++ Sbjct: 216 -----------------DTERAKTIIALTDG--NDTKSKVPPVEAARVATQRDIKIYTVA 256 Query: 369 INASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ-------LMVHRKY 414 I + Q L + ++ + + L ++ + + + HR Sbjct: 257 IGDPTTVGEDKLDEQSLKDVASETGGKYFFAADREHLAGIYDELDKIETQTIQTISHRPR 316 Query: 415 SVIL 418 + I Sbjct: 317 TDIY 320 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 30/323 (9%), Positives = 72/323 (22%), Gaps = 9/323 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F K+ A N ++ A+ G + + ++ K ++ A + LAGA Sbjct: 3 FTKLMKRLQADTSGNALMLVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAGAW 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + L + + + ++ G Sbjct: 63 SRGNGDMGLEYKTRAKQEFYINLSETKDYLLTHSIELQTFDGTPDSAVYMHATVSAQLPF 122 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + T+ Y L V + Sbjct: 123 TKVMINEGMTIAVQARATWETQKQFTACLYSLDPSSTRTMWFNGGPTVDA---ACGVGAR 179 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I SG+ + +N ++ + ++ + + Sbjct: 180 SNADNAIVTNGG-SGAQNINWVVAGGTINDGA--GAFVNAEVVENYDNMVDPWEGLTPPD 236 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + T K + ++ + S P + AY + Sbjct: 237 NATPRTVTCGSADANWQADEAQLDAITFKYYRGKNKNDAKSAGAISYSGPGSESAYDV-- 294 Query: 307 SDKKRSFFTNFFRQGVKIPSLPF 329 + F + P Sbjct: 295 -TYATNVGAMFTSEPNDYNQAPV 316 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 79/233 (33%), Gaps = 55/233 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L VDLSGSM A + + ++ +K L F++ ++ Sbjct: 88 IMLAVDLSGSMEIADMT----------LDGRNVDRLEMVKAVLGDFIER-------RKGD 130 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + ++ V+Q + + L+ + T A+ A + ++ + Sbjct: 131 RLGLILFADTAFLQTPITYDRNTVQQMLDESVLGLVGERTAIGDAIALAVKRFKGKQQTN 190 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 + ++ LTDG+N + + + AK ++I I++ A Sbjct: 191 ------------------RVLVLLTDGQNTAGNLS--PEQALELAKAYDVRIYPIAVGAE 230 Query: 372 -----------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + ++ + + L ++Q + + Sbjct: 231 EVVVDSVFGRRKVNPSRDLDVPLMQNLADETGGEYFRARSTEELERIYQLLDE 283 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 40/288 (13%), Positives = 84/288 (29%), Gaps = 43/288 (14%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + +Q + LL ++ + E+ + L VD Sbjct: 42 VALSGETPRTGASVRQRLTVQALGSLLGWCLLVGALARPEWVGEPVQLEKTARDLMLAVD 101 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 LSGSM A ++ +++A K L F+ +E +GLI Sbjct: 102 LSGSMDAADF---------VDASGEQIDRLSAAKGVLNEFVA-------GREGDRLGLIV 145 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + E + + ++ T ++ A Sbjct: 146 FGNAAYLQAPFTDDHETWLALLDESIVNMAGPSTALGDSIGLAIAHF------------- 192 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----- 373 S + +I LTDG + + + + AK + I T+++ Sbjct: 193 -----RQSKTENRVLIVLTDGNDTGSRVP--PLDAAEVAKVEGVTIYTVAVGDPTTVGEE 245 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + L + + + +L +Q I L S+ + Sbjct: 246 ALDMETLETVARLTGGDSFVASDLVALRETYQRIDALEPASYDSLSYR 293 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 71/230 (30%), Gaps = 36/230 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D + A DV+ + D +++ A ++A F D + + +GLI Sbjct: 80 DRQVPRNKATVILVMDVSRSMNATDVAPSRIRAAQSAAKKFADDLTEG------INLGLI 133 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + P+ ++ V + + + T + + A + + Sbjct: 134 SFAGTPSTLVSPTPDHTATKKAVDKLVLADK---TATGEGIFAALDQIRT---------- 180 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTISIN---- 370 + P I+ L+DG+ + KAKE I + TIS Sbjct: 181 LNAVLGGPEAAPPAHIVLLSDGKQTVPDEPTDPRGAFTAARKAKEEGIPVSTISFGTAYG 240 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + S + + D L V++ + + Sbjct: 241 TVELDGDRVPVPVDDPSLKQIANLSGGNFFTASSLDELNEVYEKLQSEIG 290 >gi|238793630|ref|ZP_04637253.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] gi|238727045|gb|EEQ18576.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] Length = 480 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 49/208 (23%), Gaps = 18/208 (8%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 I +E+ F + + + LI F + + KK + A A LA + Sbjct: 12 RLLVSLIKNEQGAILFPFIIFLPLIIGLIFFSFELAHFLQKKTKLSDAMEQATLALTVEN 71 Query: 69 VSNLSRLGDRFE----SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 ++ + S + N I + N S Sbjct: 72 NNSTPSAAQITKNAEIVSSYAQAYLPAETFSTPTINIIYNNGRIEYGAEINMSYSAKFLS 131 Query: 125 S---RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR-----------YN 170 + T + ++ I T + + + Sbjct: 132 NIQVTNLSTIINATDRGAARKNIISAPIEKTDVVFVADYSNSMDEYFYHDENEPKKIVAL 191 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVD 198 ++I + + + I I Sbjct: 192 REIFNRLNDNVLKNKNIHTIGFIPFSWG 219 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 70/215 (32%), Gaps = 35/215 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D +++AA K F+D + + +GL+ ++ + P+ Sbjct: 94 DVSNSMAATDISPSRLAAAKQGAQAFVDQLPPK------INLGLVSFSGTAAVLVPPTTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + V+ + T + + Q V Sbjct: 148 RDAVKAGINGLQLGPA---TAIGEGIYAGLSAI---------NTVSSQFVNSGQAVPPAA 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--------------NGQRL 378 I+ L+DGE + N + AK+ I + TI+ N L Sbjct: 196 IVLLSDGETTRGRPNN---QAAQAAKDAHIPVSTIAYGTPNGTLDVGGQLIPVPVNEPAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + H+ + D L +++ + + +RK Sbjct: 253 SQIAEQTGGSHHRATSGDELTSIYKGLGSSIGYRK 287 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 78/253 (30%), Gaps = 64/253 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + + I L +D+S SM + +D ++ ALK+ + Sbjct: 75 TFTISQDRDDTKGVDIMLSIDVSLSM---------------LAKDLNPDRITALKDIAVK 119 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + +G++ Y + + + V + + + T Sbjct: 120 FVQK-------RPNDRIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNSAGLEPGTAIGEG 172 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L K + K +I +TDG +N + + A Sbjct: 173 LSVAVNHLVKSKAK------------------SKVVILMTDGVSNIQNAIP-PQVAAELA 213 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 K N IK+ I I + + L + ++ ++ + Sbjct: 214 KNNNIKVYAIGIGTNGYALMPTSQDIFGDLVFTETEVTIDENTLREIAQTTGGKYFRATS 273 Query: 395 ADSLIHVFQNISQ 407 SL V+ I+Q Sbjct: 274 NSSLEEVYDEINQ 286 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 70/216 (32%), Gaps = 36/216 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + D ++ A K + F++ + + L+ + + P+ Sbjct: 93 DVSLSMMATDVDPNRLEAAKKSAKNFVNQL------PSKFNVALVNFAGTASIIVPPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V++ + + T + + + Q LT + P+ P Sbjct: 147 RATVQRSIDGLELAE---STATGEGIFTSLQALTQ----------VPPDPEHPNDPAPAR 193 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------PNGQRL 378 I+ L+DG+ + + AKE I TI+ P+ L Sbjct: 194 IVLLSDGKRTVGR---TAQEGAQAAKEKNTPIYTITFGTDSGFIEMDGIRQRVPPDRAEL 250 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + Y +A L V+++I + + K Sbjct: 251 RSVAEITGGEAYTAESAGELEDVYKDIGSSVGYDKV 286 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 40/297 (13%), Positives = 83/297 (27%), Gaps = 46/297 (15%) Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 + ++ + N + S + + Sbjct: 21 YSYFTEALAANSGNTVRVSIREVSTTSQPKIVMTLSANNS-------KGLPVTDLSADDF 73 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-KRTKMAALKNALLLFLD 240 ++ + I + N DP DV A A ++A ++ Sbjct: 74 IVKENGKEQSDIAV-------YPFYQNPDPIDVVLALDTSASMNDDAFTAAQDAAYGLIN 126 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +GLI + + +V++ + + S+ T + Sbjct: 127 GLSPED------KVGLITFDKTARVIEPLAQDHARVQESIQKLSRSV---GTALYQGLSL 177 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A Q + + K I+ +TDG N + N + KA+E Sbjct: 178 AAQEVAKGQNT-------------------KAIVLMTDGFNTSR--NTTLEEAVAKAQEV 216 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR-KYSV 416 + T+ + Q L K + +++ L VF +ISQ + + S Sbjct: 217 GASVFTVGFGKKVDTQGLQKIANETGGEYFSAPTNAQLRRVFADISQKLHQEYRLSY 273 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 35/164 (21%), Gaps = 1/164 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + + A ++ + +G I + K+ +++ +AA LAGA Sbjct: 26 LAGDRRGSTLALMAAGLIPVIAALGAGIDAGRLYLVKSQLQAGVDAAALAGARAFAVTDG 85 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 R + S + D + T I M Sbjct: 86 SPAARDKQASAYFYGNFASDYMGVSNLQLTPDFKTV-GGINVTTITARAIVPMTFMRIFG 144 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 L + + Sbjct: 145 FQPRTMQAVAKAELQPRPLEVMVVLDDTGSMKANLSGGRTRMVA 188 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 31/264 (11%), Positives = 66/264 (25%), Gaps = 33/264 (12%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + +D S A N D K A K+ Sbjct: 370 YVGLNNGDATKADDVVRRLDGSYYTPSAANRAGNDWYIDWKRIPKYYDG-AVWKDPASAT 428 Query: 239 LDSIDLLSHVKEDVYMGLIGYTT-RVEKNIEPSWGTEK---VRQYVTRDMDSLILKPTDS 294 ++ + + E+ + P +G K + + T Sbjct: 429 VNPLGGSVDSGSKNTTPMPSPNWQCPEEAMPPRYGRMKSEYLSHIANQHGAIYPANGTIH 488 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGV---------------------KIPSLPFQKFI 333 + Y++L D + + + ++ Sbjct: 489 HAGLLWGYRLLVRDDVFTRANPTTERPRRALVFMTDGVNEIGESQNGYLDRTFTWYGRWS 548 Query: 334 --IFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNG-QRLLKTCVSSPEY 388 + N+ + K C + N +I I++ A+ + C P Sbjct: 549 DSRIAANQSNSETQMLRRFEKTCANIQREANPPEIYIIALVANSSAIDTAFNNCA--PGR 606 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 Y + D L FQ+++ +V Sbjct: 607 VYRTSSTDELRRAFQDVAAELVDL 630 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 70/217 (32%), Gaps = 49/217 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + QD + ++ A K + F+D + +GL+ ++ + Sbjct: 103 DISGSMLAQDLQPDRLEAAKRVAMNFVD-------SRISDRIGLVIFSGESFTQCPITTD 155 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ + + ++ T + + + L S K Sbjct: 156 HGVLKNQIAQVKSGMLQDGTAIGMGLATSVERL------------------RTSKAKSKV 197 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------- 370 II LTDG NN + + + AK I++ TI + Sbjct: 198 IILLTDGVNNTGLID--PLTALEIAKAFKIRVYTIGVGTIGKAPFPMTMPDGSIQMQMQD 255 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + K V + ++ + L +++ I + Sbjct: 256 VQLDEPLMKKISVETGGKYFRATSNKELENIYGEIDK 292 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 75/195 (38%), Gaps = 32/195 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K A F+DSI + ++ +GL+ Y+ ++ +T + Sbjct: 242 LDETKKAAAKFVDSI-----LNKNSNIGLVSYSDEATSLSGICSNDVFLKNTITSLSSAE 296 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + +AY +L + + +K I+ ++DG K Sbjct: 297 ---NTNIEDGLSRAYSMLQLGQSK------------------KKLIVLMSDGLPTLGKDG 335 Query: 348 VNTIKICDKAKENFIKIVTISI-----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 IK +K K+ + I T+ GQ L++ S HY V +++ L+ F Sbjct: 336 EELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYLMEKIAS-EGCHYEVSSSEDLVFFF 394 Query: 403 QNISQLMVHRKYSVI 417 ++++ + +KY + Sbjct: 395 EDVAGQIGGQKYIYV 409 Score = 36.8 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 10/121 (8%), Positives = 26/121 (21%), Gaps = 5/121 (4%) Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 T N + + + + N L + ++ + Sbjct: 191 TFSGGETNFNNVFLCEWG--DYSVTGNDGLKVTSKKRDIVLTLDISASMDGIPLDETKKA 248 Query: 176 FIPALLRIEMGERPIFLIELV---VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + I I L+ LSG + + + ++ Sbjct: 249 AAKFVDSILNKNSNIGLVSYSDEATSLSGICSNDVFLKNTITSLSSAENTNIEDGLSRAY 308 Query: 233 N 233 + Sbjct: 309 S 309 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 72/226 (31%), Gaps = 50/226 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + L +D SGSM + K+A F+ +DL Sbjct: 61 ASPVDVVLSIDSSGSMTTSDPG-------------------DLRKSAAKEFVTGLDLSMD 101 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +G++ + T + T+ + + T +K A +L+ Sbjct: 102 -----RVGVVSWNTSAIS-WPLTNNTKDIESAIDST---GADGNTCLDTGLKSAIDLLSE 152 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN---FKSNVNTIKICDKAKENFIKI 364 K I+ LTDG + + + D+A+ I + Sbjct: 153 CSGS-------------------KVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILV 193 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TI + + + L + S+ Y+ +A++L +++ I + Sbjct: 194 FTIGLGPDADARNLTEIAHSTGGEFYSAPDANALAGIYKRIRSSIT 239 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 76/226 (33%), Gaps = 35/226 (15%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I + +DLSGSM + ++ A+K F+ + Sbjct: 105 IMIAMDLSGSMERRDF----------ALDGETVNRLTAVKRVGTDFIRR-------RAGD 147 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + PS+ T V + + + + T + A + L Sbjct: 148 RIGLVIFADQAYVAAAPSFDTAAVARALDEATIGISGRSTGIGDGLGLALRRLDPR---- 203 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 P K +I L+DG NN + + + A+E IK+ TI++ Sbjct: 204 -DAGGEAASGSKPGEKPAKAVILLSDGANNAGQ--TAPKDVAELARELGIKVYTIALGPR 260 Query: 371 --ASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A +G + L +S + V + L+ V I + Sbjct: 261 DMADADGEQDVVDTETLRDMARASGGEAFRVRTTEDLVRVADAIDR 306 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 64/207 (30%), Gaps = 39/207 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +M K F+ + +GL + + Sbjct: 95 DISASMLTEDVFPNRMEVAKEVASEFI-------SSRPSDNIGLTIFAGEAFTQCPMTLD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + L+ T + + L K + Sbjct: 148 HAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLK 380 K +I LTDG NN + + AK+ I++ TI + + + L Sbjct: 192 --SKVVILLTDGSNNVGSIS--PMTAATIAKKFGIRVYTIGLGRETGEDIGAIDYKTLQD 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ 407 V + Y + L ++Q+I + Sbjct: 248 IAVLTNGEFYRAQSQAELSKIYQDIDK 274 >gi|242078369|ref|XP_002443953.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] gi|241940303|gb|EES13448.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] Length = 567 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 18/178 (10%), Positives = 57/178 (32%), Gaps = 10/178 (5%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+A LK A+ +D + + ++ ++ + + ++ + ++S Sbjct: 96 KLALLKQAMCFVIDQLGPAD------RLSVVTFSNDASRLTRLARMSDAGKASAKIAVES 149 Query: 287 LILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L ++ T+ + A ++L ++++ + DG + Sbjct: 150 LAVQGFTNIKQGIHVAAEVLAGRREKNVVAGMILLSDGHDNCGGTS---VRPDGTKSYVN 206 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ + + I T S + + ++ V + ++ F Sbjct: 207 LVPPSLTVAAGSSRPAAPIHTFGFGTSHDAGAMHAVAEATGGTFSFVGDEAAIQDSFA 264 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 55/158 (34%), Gaps = 27/158 (17%) Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + V+ + + + + + T A+ A + L Sbjct: 2 FADHAYLQAPLTLDNLAVQSLLQKAVIGMAGRDTAIGDAIGLAVKKL------------- 48 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-------A 371 + +I LTDGENN + ++ + AK+ I+I TI + Sbjct: 49 -----RERPEGSRILILLTDGENNAGA--LKPLQAAELAKQYDIRIYTIGVGGKGGMFSR 101 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N L K + ++ N +L +V+++I + + Sbjct: 102 GLNETELKKIAQLTNGAYFPATNLGALNNVYEHIDKTL 139 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 33/313 (10%), Positives = 79/313 (25%), Gaps = 44/313 (14%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV---VDLS 200 + RL + L V+ + + R + L V S Sbjct: 340 TASGHGLAADDYVRLTNMGGLAQLNDGGYRVASVSGSQVVLDTTRSLGLAGASKGGVTYS 399 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 D + P ++ + + ++ V + + Sbjct: 400 SGGKLMNGRDGSEWRVFPTADGYVNVHASSTCVSERVGVERYTDARPSTAYVGRSYLS-S 458 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + E S + ++ T + A+ L+ D F + Q Sbjct: 459 SNSCPSAELSALSTSASSLKSKIDQMSAGGSTAGQIGIAWAWYALSPDFASLF--SGEGQ 516 Query: 321 GVKIPSLPFQKFIIFLTDGENN----------------------------NFKSNVNTIK 352 K I +TDGE N N ++ Sbjct: 517 PGAYAPSDTLKVAILMTDGEFNTPFRDGVIALDAGTGSGGLDSHIDLNSSNGDPFAQSVA 576 Query: 353 ICDKAKENFIKIVTISI---------NASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 +C + + + T+ +++ C ++ + + + + L F Sbjct: 577 LCQAMQAKGVVVYTVGFDLGSATGREGVVDTALDVMRECATNEQTHFFQADDGTDLKEAF 636 Query: 403 QNISQLMVHRKYS 415 + I + + + + Sbjct: 637 RAIGRDITRLRIA 649 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 26/234 (11%), Positives = 69/234 (29%), Gaps = 16/234 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + + N +++FA+++ L++ I + K ++ A +AA LA A Sbjct: 13 LKSLASRFLRQTQGNVAMMFAMALPVLLMITLGAIDIHQASKVKAQLQDALDAAALAAAR 72 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + + + +A + + + + + N Sbjct: 73 STFTDDVNINKVGLAALKANMPSYFGEASGDTASFVLLNNR-VTGEATVNVKVLVANVVL 131 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN--------------QK 172 + ++ L S+ + L + N N ++ Sbjct: 132 PPYGKLLDDFLPVSSRSEVLRASRNVEVAMALDITGSMDNCSRNCPPTSKLEDLQAAAKE 191 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V + + + + LI ++ P D + I T Sbjct: 192 LVDIVVQDQQTPFYSK-VALIPYAAGVNVGSSAISARGPLDTTTHNISGATWLT 244 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 71/230 (30%), Gaps = 54/230 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + ++ A K F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLRPNRIEAAKQVASEFI-------IGRPNDNIGLAIFAGESFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HASLLNLLQNVRTDIAARGLIEDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNRG--DISPSTAAEIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L + + Y N + L +++ I + K +V Sbjct: 248 VNVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKEI-DQLEKSKLNV 296 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 25/226 (11%), Positives = 63/226 (27%), Gaps = 18/226 (7%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + E+ NF++I AL ++ LL + + +N+++++ +AA LA + + Sbjct: 4 RFAKDERGNFAMIMALVLVPLLLAGMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGE 63 Query: 73 SRL--GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 S+ + AL DA F K+ + + S Sbjct: 64 SQTVVQVYGAQVFTANLTALSADAVNFDVAFPKDKTTDQQIQATAGFTYKSLFGVIASRL 123 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN--QRYNQKIVSFIPALLRIEMGER 188 + + + + L + +K + + + Sbjct: 124 TGDDWDQNQYTLSSSVRLKNTIEVALVLDNSRSMDETRSGSTKKRIDLLKEAASQLVETM 183 Query: 189 P--------------IFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 L+ ++ + + +P+ Sbjct: 184 AAQSTLITHVENPVQFSLVPFAGSVNVGPQYLNAAWMDPEGRSPVN 229 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 41/316 (12%), Positives = 80/316 (25%), Gaps = 89/316 (28%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-------A 229 L + V + + +S ++ + T + Sbjct: 299 GTYALDVTPPSDNNPDTLFVPMFGPAEYYNTDSRGNVTSTVLNSWWQDDTSLAYSPRQSD 358 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSL 287 K L LD I + G Y+ + T + ++ T + Sbjct: 359 LKKYYLRDSLDKIYRGGRSND----GGPNYSCTSSPLTPLTDVTTEQGMKTIQTAIKAMV 414 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ AM ++ + + +G K +I LTDG N +K + Sbjct: 415 PSGGTNVPEAMAWGWRTIVRGAPFTEARPSTERGND-------KVVIVLTDGANTYYKYD 467 Query: 348 -----------------------------------------------VNTIKI------- 353 T + Sbjct: 468 GLAGSGPDRAANFSYYSAHGYTARITKHYSQARLFQESGVSVSQNNSTYTKAMNARFAKL 527 Query: 354 CDKAKENFIKIVTISINASPNGQ------RLLKTCVSS---------PEYHYNVVNADSL 398 CD AK I ++T++++ S LL++C S+ P + L Sbjct: 528 CDNAKSANIIVMTVALDLSETNSTEKAQIDLLRSCSSNSRVRTESGRPAKLFWNSTGGEL 587 Query: 399 IHVFQNISQLMVHRKY 414 F+ I + + + Sbjct: 588 SETFRQIGDELSNLRI 603 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 38/264 (14%), Positives = 78/264 (29%), Gaps = 71/264 (26%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I + +D+S SM + +D + ++ ALK F++ Sbjct: 82 VSTRIKTNKGIDIVMAIDVSASM---------------LSKDLRPNRLTALKKVAAEFIE 126 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAM 298 + +GL+ Y + ++ + ++ T + Sbjct: 127 -------GRPSDRIGLVVYAGESFTKTPITTDKSIIQNALKDIKYKHGELIGGTAIGMGL 179 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L K + K II LTDG NN + + A Sbjct: 180 ATAVNRLKDSKAK------------------SKVIILLTDGVNNAG--FIEPQIASELAV 219 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK TI I + + + L + ++ ++ N Sbjct: 220 EYGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQVEIDEKLLQQIAKTTGGKYFRATNT 279 Query: 396 DSLIHVFQNISQ----LMVHRKYS 415 L ++ I++ + +Y+ Sbjct: 280 KKLAEIYDEINKLEKTEIEEFRYT 303 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 79/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM +++ A + Sbjct: 72 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APSRLVAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y ++P+ E + + + + T Sbjct: 116 EAAKQFADQLTPG------INLGLIAYAGTATVLVQPTTNREATKNGLDKLQLADR---T 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P I+ ++DG+ + N Sbjct: 167 ATGEGIFTALQAI----------ATVGAVIGGGDEPPPARIVLMSDGKETVPSNPDNPKG 216 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I T+S + + L K S + + Sbjct: 217 AFTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLEKIAQLSGGDAFTASSL 276 Query: 396 DSLIHVFQNISQLMV 410 + L VF ++ Q + Sbjct: 277 EQLKAVFTSLQQQIG 291 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 82/245 (33%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + LV+D+SGSM + +++ A K++ + ++S+ Sbjct: 84 QTKEGVNVVLVMDVSGSMQAQDYT---------------PSRLEAAKSSAEILINSLKSK 128 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + G++ + + S EKV + + T + + Sbjct: 129 DYA------GIVTFESGATTAAYLSPYKEKVIEKLRNVA--PKEGSTAIGDGLSLGIDMA 180 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +S +K II L+DG NN + + AK N I++ Sbjct: 181 SSI------------------PNKKKVIILLSDGVNNAGYIS--PDEAIQYAKANNIQVY 220 Query: 366 TISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI + ++ + L + ++ ++ +L +++NIS+ Sbjct: 221 TIGMGSNGNVLLGYDWFGNPQYAELDEATLQAIANDTGGKYFKSIDDKTLDEIYKNISEN 280 Query: 409 MVHRK 413 + K Sbjct: 281 IKREK 285 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 76/234 (32%), Gaps = 43/234 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + VDLSGSM DP Q + ++ +K L FL ++ Sbjct: 95 MMIAVDLSGSMEARDFVDP---------QGEILRRVDGVKALLQSFL-------LKRDSD 138 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + + + + Q + + + T A+ A + + Sbjct: 139 RIGLIAFGENAYLQAPFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAVNHFEQSEVEN 198 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K ++ LTDG + + + + A E + I I+I Sbjct: 199 ------------------KVLLLLTDGNDTSSEFP--PLDAAHYAGEQGVVIYPIAIGDP 238 Query: 373 P-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + L + + + + SLI V++ + QL S ++ Sbjct: 239 KNVGEDSLDIATLERIADLTQGRVFEADDGQSLIEVYKVLEQLQPQLFDSYTIR 292 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 78/256 (30%), Gaps = 52/256 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V+ + R + LV+D+S SM +++AA Sbjct: 71 LTVALAGPTADKRV-PRNRATVVLVIDVSLSMKATDVE---------------PSRLAAA 114 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F D + + +GL+ + + P+ ++ + + S Sbjct: 115 QEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTPNRDETKAAIDNLTLSER--- 165 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + Q + + + I+ L+DG+ +S + Sbjct: 166 TATGEAIFTSLQSIDTLAAVLGGSEQA----------PPARIVLLSDGKQTVPESPDDPR 215 Query: 352 K---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 +AK+ + I TIS + L + S + + Sbjct: 216 GGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTASS 275 Query: 395 ADSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 276 LEELRDVYDTLEEQIG 291 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 31/245 (12%), Positives = 85/245 (34%), Gaps = 29/245 (11%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + ++S +P R I VVD+SGSM + ++ + Sbjct: 13 KDDFLMISLVPP---QNYSSRTNSNICCVVDVSGSMSSEAKIINQSSQK--SDENYSLSI 67 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDS 286 + +K+++ + ++++ + + ++ ++ + + + + + + Sbjct: 68 LDVVKHSIKMIVNTLGSEDY------LSIVTFSDSANVLFDLLPMNDSNKTMAIEKIENL 121 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + A IL ++K + + I LTDG+ + Sbjct: 122 STEGGTELWKGLNSALNILLNNK----------------TPNTNQSIFLLTDGQPTDSGI 165 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNI 405 + N +K + I T ++S N + + K + + +A + F + Sbjct: 166 DTNLVKFKQAYPKLNCTINTFGFSSSSNSELMNKIAMEYNGMFSFIPDASFIATAFANAL 225 Query: 406 SQLMV 410 + + Sbjct: 226 ANTLT 230 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 78/256 (30%), Gaps = 52/256 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V+ + R + LV+D+S SM +++AA Sbjct: 71 LTVALAGPTADKRV-PRNRATVVLVIDVSLSMKATDVE---------------PSRLAAA 114 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F D + + +GL+ + + P+ ++ + + S Sbjct: 115 QEAAKSFADGLTPG------INLGLVAFAGTASVLVSPTPNRDETKAAIDNLTLSER--- 165 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + Q + + + I+ L+DG+ +S + Sbjct: 166 TATGEAIFTSLQSIDTLAAVLGGSEQA----------PPARIVLLSDGKQTVPESPDDPR 215 Query: 352 K---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 +AK+ + I TIS + L + S + + Sbjct: 216 GGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTASS 275 Query: 395 ADSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 276 LEELRDVYDTLEEQIG 291 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 64/207 (30%), Gaps = 39/207 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +M K F+ + +GL + + Sbjct: 95 DISASMLTEDVFPNRMEVAKEVASEFI-------SSRPSDNIGLTIFAGEAFTQCPMTLD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + L+ T + + L K + Sbjct: 148 HAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLK 380 K +I LTDG NN + + AK+ I++ TI + + + L Sbjct: 192 --SKVVILLTDGSNNVGSIS--PMTAATIAKKFGIRVYTIGLGRETGEDIGAIDYKTLQD 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ 407 V + Y + L ++Q+I + Sbjct: 248 IAVLTNGEFYRAQSQAELSKIYQDIDK 274 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 71/221 (32%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A KN F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKNVASEFI-------SDRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HASLINMLRSVRTDIAARGLISDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNMG--DISPLTSAQIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L ++ +Y + + L ++++I + Sbjct: 248 VNIPVEIDSKTLKNIAETTDGNYYRATSNNQLKQIYKDIDK 288 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 73/230 (31%), Gaps = 54/230 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +M A K+ F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLTPNRMEAAKDVAAEFIA-------DRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNMG--DLSPMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L S+ Y N L ++++I + + K SV Sbjct: 248 VNMPVEIDTKVLKDIAASTDGNFYRATNNQELKQIYKDIDK-LEKSKMSV 296 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 86.1 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 61/182 (33%), Gaps = 27/182 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+ +K+ + F+ +++ +GL+ + ++ + + V++ Sbjct: 109 TRFDVVKDIVSNFIKE-------RQNDNIGLVVFGAYSFIASPLTYDENILNKIVSQLYI 161 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + K T ++ Q +L S K I LTDG + Sbjct: 162 GMAGKYTALFTSLAQGVNLL------------------KMSESKSKVGILLTDGFSTPEV 203 Query: 346 SNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + D A + IKI I I N + L K + + +A L V++ Sbjct: 204 DKIPFDVALDMAIKEKIKIYPIGIGMPHEYNIEVLRKIAEKTGGKAFGAASATELKEVYK 263 Query: 404 NI 405 I Sbjct: 264 EI 265 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 86.1 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 37/451 (8%), Positives = 109/451 (24%), Gaps = 50/451 (11%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + +++ S++ +S+ + L ++G + ++ + +A +AA LA A + + Sbjct: 3 RSFPHAQRGGISLMVIVSLTTLLAVVGLAFSAGLSYLVRSKLNAATDAAGLAAARAISNG 62 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ + + + + ++ ++ + + I ++S + Sbjct: 63 TTQADQIANAKAAGQRFFHANFPSNYLMSNATLNDISVTFSGSEVTIGVSASASLPAALF 122 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + L N + S I L + + Sbjct: 123 GGFGTSALAPAVVTETKRKDLDMIVVMDTSGSLSPSAANVRS-SAITFLNQFNATRDRVG 181 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--------------------RTKMAAL 231 L+ + D S A L Sbjct: 182 LVHFAFGAIVDDAIRQTARGFDRASMTNHIKAYAFSGSTASAEGMYTARQQINSVPTANL 241 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR--QYVTRDMDSLIL 289 + + + + Y+ + + D L Sbjct: 242 NRSNMRVIVFFSDGAPNSFGAYLNWKPGVACADPGTIYTDDDGAGTPAGLYKLDQQYDDL 301 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKS 346 T + + L +T+ N + Sbjct: 302 GGTCTPADLPSKAASLPDWYNAHNPAKTPNDPALREFPVVTTSPRVVTNAITYANVNRAA 361 Query: 347 NVNTIKICDKAKENFIKIVTISINA------------SPNGQRLLKTCVSSP-------- 386 + K+++ I + T+ + + + + + V +P Sbjct: 362 RNLVEAMAAKSRDEGIYVFTLGLGSSLKTGTGVDGEKGEDTLKCMANAVDAPARCYNPAK 421 Query: 387 --EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + L F ++ + + S Sbjct: 422 PVGVYCFAATQNDLTPCFSKLASAI--LRIS 450 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 80/243 (32%), Gaps = 67/243 (27%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D+S SM + D + ++ K+ + F+ ++ Sbjct: 84 VNDIFFVIDVSRSM---------------LADDFRPNRLEVAKDKISDFVA-------LR 121 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSD 308 +GLI ++ R + S + ++Q V +L T+ A+ A Sbjct: 122 PTDRIGLIMFSERAFTLLPLSTDLKLIKQMVGEINVGGMLGSGTNIGDALGLAVARGAQS 181 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 ++ K II LTDG +N + I+ ++AK+ IK+ TI Sbjct: 182 LAKN------------------KVIILLTDGVSNVG--FLTPIQAAEEAKKQGIKVYTIG 221 Query: 369 I------------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I S + + L + + + + +L V Sbjct: 222 IGGRGDAKIPYGKNIFGRQRYQNIPGGSIDFKTLKEIADKTNGQTFEAQDEKALAEVLSE 281 Query: 405 ISQ 407 I + Sbjct: 282 IEK 284 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 74/242 (30%), Gaps = 68/242 (28%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L VD+S SM + +D K ++ A K F++ + + Sbjct: 89 DIMLAVDVSTSM---------------LAEDLKPNRIEAAKQVAAEFIN-------GRPN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +GL + + + +I T + A L Sbjct: 127 DNIGLTIFAGEAFTQCPLTVDHGVLLNLFQSVSCDMVQRGMIEDGTALGMGLANAVSRLK 186 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K +I LTDG NN +++ + + AK+ I++ T Sbjct: 187 DSKAK------------------SKVVILLTDGVNNRG--DISPLTAAEIAKQFGIRVYT 226 Query: 367 ISINASP---------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I + + + Q + + ++ ++ + L V++ I Sbjct: 227 IGVGTNGTAPYPMQTYAGVQYVQMPVEIDEQTMSQIAGTTNGNYFRATSNTKLKEVYREI 286 Query: 406 SQ 407 + Sbjct: 287 DK 288 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/518 (6%), Positives = 116/518 (22%), Gaps = 116/518 (22%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R F + ++ I L + +IG ++ V + + ++++++AA LA Sbjct: 1 MNRLNKLFCLFLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL---------------SG 108 GA + + + ++ + + + K + Sbjct: 61 GAEIIPTATTAAAVYAKATAYSSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQGISCV 120 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT---SYDYRLQFI---E 162 + + ++ + + + S + + + L Sbjct: 121 GPLSYNSIQVMQQAVVPLYFARIIGRSSMTISATSTAAKGGASSRPYNVALVLDTTYSEI 180 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + N +++ + + + P ++ + Sbjct: 181 SYDSDCGNSQMLCTLQGVQILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFPQMVTS 240 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT---EKVRQY 279 + N + + G ++ + ++ + Sbjct: 241 TVSSDYDCSSNTPANTV-------YTFPIPGAGTYAPSSSTYRVLDFQSDYRVGDTSTSL 293 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK------------IPSL 327 + + + A S+ + + Sbjct: 294 NQASNLTKAVGGFSGCTGIYPATSASQSNFQATSGQYGTYYPSTIYAAQSSLIHEQTLFP 353 Query: 328 PFQKFIIFLTDGENNNFKSNV------------------------------------NTI 351 Q +I + DG ++N I Sbjct: 354 DSQNVMIIIGDGNATAPQTNNGYPVMSTTASVSATPGASTLAGTSSGLYPSWNGECGQAI 413 Query: 352 KICDKAKENFIKIVTISINASPNG---------------------QRLLKTCVSSPEYHY 390 + A ++ T++ + G + + +S Y++ Sbjct: 414 TAANFATSQGTRVYTVAYGSPSAGCASDQSGQGKIPGLYPNVLPCNEMAQMASAS--YYF 471 Query: 391 N--------------VVNADSLIHVFQNISQLMVHRKY 414 L +F I+ + + Sbjct: 472 FSDFKQSGSGSVCTAAQVMVELSDIFLQIAGDLTVARL 509 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 35/281 (12%), Positives = 73/281 (25%), Gaps = 68/281 (24%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMN---------SDPEDVNSAPICQDKKRTKMAA 230 + +E I L +D+SGSM + K +++ Sbjct: 82 AMEVEYTSTDGVDIMLALDVSGSMGTLDMLTRTEQAKLGVMNAEKILKRGEYWKYSRLGY 141 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS---- 286 ++ + F+ + +GL + R + + + + D Sbjct: 142 AQDVIAEFIGK-------RHSDRIGLSAFGARSFTQCPLTMDYGSLLEILKASDDLARDT 194 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 L+ T + A L S + +I LTDG +N Sbjct: 195 LVNNRTAIGDGLMNALARL------------------KMSDAKSRVVILLTDGRDNASVV 236 Query: 347 NVNTIKICDKAKENFIKIVTISIN----------------------------ASPNGQRL 378 ++ + AK +K+ T+ + + L Sbjct: 237 P--PVRAAEVAKSLGVKVYTVGVGKKSGKILAFQQNPWTGEISWGERDITPEEGIDEDVL 294 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + Y N L ++ I +L ++ Sbjct: 295 KAIASKTGGRFYRAENKAELEKIYSEIDELEKTEIETIAYA 335 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 72/221 (32%), Gaps = 44/221 (19%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 DV+ + + D + +M A K A F+D + V +G+I + P+ Sbjct: 91 VTDVSGSMLATDVQPNRMIAAKRAARRFVDEV------PRTVNLGVISFNNTATVLQSPT 144 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 V + R S T + A+ A ++L R Sbjct: 145 RNRSDVLTAIDRLAVS---GGTATGEAIATATEML-------------RNQPGENGRRPP 188 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS------------------ 372 I+ ++DG + N + I+ +A+ I I T++ Sbjct: 189 SAIVLISDGTSTNGRD---PIEAAAEARRLRIPIYTVAFGTDQGTITVPGRDGVERTERV 245 Query: 373 -PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 P+ L + + + +AD L VF+ + + R Sbjct: 246 PPDPTALAQIAEMTGGETFTADSADRLDTVFERLGSQLGTR 286 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 73/222 (32%), Gaps = 40/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LVVDLSGSM D K++ ++ A+K+ L F +E Sbjct: 95 DLMLVVDLSGSMSERDFLDNS---------GKEQDRLTAVKSVLETFAAK-------REG 138 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + E + + + + T A+ + + Sbjct: 139 DRLGLILFGDSAYLQSPFTADHEAWLALLDQAQVGMAGESTHLGDAVGLTIKTYIDN--- 195 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +K I LTDG + + I A+ I++ +++ + Sbjct: 196 ------------PENQTVEKVAIILTDGNDTDSLVP--PIDAAKVAQAYGIRLYIVAMGS 241 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + + + ++ + L V+Q IS Sbjct: 242 PNTTGDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQTIS 283 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/325 (11%), Positives = 99/325 (30%), Gaps = 52/325 (16%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 + + N+ +L A + + + + + +++ + Sbjct: 209 KSFSLVGNYNNPNLLTVDAEVKSENVPITGLNKDNFTIEIGSKKVNDVTVSDVGEGKYKL 268 Query: 153 SYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEM-----GERPIFLIELVVDLSGSMHCA 206 S++ + Y + + + + LV+D SGSM Sbjct: 269 SFNPPKQDSNGNYDLNVYVKYKKVTLSDSELNAVRYGEDNANANANVMLVIDRSGSM--- 325 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + +++ KN+ LF+D ++ G++ +++ + Sbjct: 326 -----------------SGSPISSAKNSANLFIDYMEAED------MAGVVSFSSSARYD 362 Query: 267 IEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + T +V+ + + T M+ L + Sbjct: 363 YHLATLTPEVKNSIKQKINSIYASGVTAIGSGMRYGLNDL----------------LNYG 406 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 I+ L+DG N+ ++ N I K + I++ T+ + + + + L + Sbjct: 407 DPNNPWAIVLLSDGYQNSGENPNNVIP---SIKASNIQVYTVGLGPAVDQKLLGNIADQT 463 Query: 386 PEYHYNVVNADSLIHVFQNISQLMV 410 +Y L ++ +I ++ Sbjct: 464 GGKYYYSPTDSQLQEIYNDIVGKII 488 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 80/259 (30%), Gaps = 68/259 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+SGSM + +D K ++ ALK F+ Sbjct: 18 DVTAKSRTTKGIDIVMAIDVSGSM---------------LAKDFKPNRLDALKRVASTFI 62 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPA 297 + + + +GL+ Y + + Q + DS+I T Sbjct: 63 E-------DRINDRIGLVVYAGESYTRTPITSDKTVILQSLKTVEYDDSIIADGTGIGVG 115 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A + K + + II LTDG NN + D A Sbjct: 116 LATAINRIKDSKAK------------------SRVIILLTDGVNNAGTIDPRM--AADIA 155 Query: 358 KENFIKIVTISINASP------------------------NGQRLLKTCVSSPEYHYNVV 393 K+ IK+ TI I + + + + + + ++ Sbjct: 156 KQYGIKVYTIGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTDGKYFRAT 215 Query: 394 NADSLIHVFQNISQLMVHR 412 + L ++ I++L Sbjct: 216 DDKKLKAIYAEINKLETTE 234 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 67/216 (31%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + QD T++ A K F++ + + +GL+ + + Sbjct: 98 DISGTMMAQDFSPTRLEAAKKVAAEFIN-------DRPNDRIGLVIFGGESFTQCPLTTD 150 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + +T +I T + + L S + Sbjct: 151 HKVLLNLLTEVKFGMIEDGTAIGLGLANSVNRL------------------KDSKSKSRV 192 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--------------------- 371 +I LTDG NN + + + A I++ TI I + Sbjct: 193 VILLTDGSNNAGQI--APLTAAELAASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVSG 250 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ + SL ++ I Q Sbjct: 251 DFDERTLTEIAAITKGQYFRATDNTSLSAIYDEIDQ 286 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 70/244 (28%), Gaps = 65/244 (26%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I L +DLS SM + QD + ++ K + F+ Sbjct: 83 KANGIDIMLSMDLSSSM---------------LAQDFEPNRLEVSKKMAIEFVK------ 121 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+ + + + + ++ + + T + A L Sbjct: 122 -GRPHDRIGLVVFAGEAFTQCPLTTDHKILETFLEQLECGNLEDGTAIGMGLAGAVNRL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S K II LTDG NN + + AKE IK+ + Sbjct: 180 -----------------KKSPAKSKVIILLTDGVNNVGYF--KPLTAGELAKELGIKVYS 220 Query: 367 ISIN-----------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + + + L + + ++ N L ++ Sbjct: 221 IGVGTIGEALTPVSRLSDGSFFLDYAQVEIDEELLREIARMTGGQYFRAKNNQDLRQIYN 280 Query: 404 NISQ 407 I + Sbjct: 281 TIDR 284 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 60/211 (28%), Gaps = 40/211 (18%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + + + K + T + + T+ + Sbjct: 345 RLCKYRNQPKSYGTSSSQGPNFLCKTQPITDLTNDKQALLDAVAAMRADGYTNIHQGVVW 404 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-------------- 346 +++LT + S R + ++ +I +TDG N Sbjct: 405 GWRVLTPQEPFSRG----RSPDQKREKDHRRIMIVMTDGANTYQDKSSSHNRTEYNAYGY 460 Query: 347 --------------------NVNTIKICDKAKENFIK-IVTISINASPNG-QRLLKTCVS 384 + T C A + TI+ ++LL+ C S Sbjct: 461 GTEQRLGSGIDTAGEIAAKMDERTALACRNAATYEATQVYTIAFQVGDYATRKLLRDCAS 520 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 SPE ++ + L+ F+ I + + + + Sbjct: 521 SPEMAFDAGSNSELVTAFERIGKEISRLRLA 551 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 17/225 (7%), Positives = 49/225 (21%), Gaps = 17/225 (7%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + ++ + N +I F G I + +K+ ++SA + + Sbjct: 5 RFLSDRRGNIAIAFGSFAFLLTAGSGVGIDMSRVVTEKSRLQSAA-----DATALAANYK 59 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S + + + ++ V + Sbjct: 60 SGTYTAEQIRQHAEAYFDGLY----TAPERGSVSRNVTVGDGTISVEAGVTMPTFFAPLL 115 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL--------LRIE 184 + + L + R + + + E Sbjct: 116 GVEEISFAVMAESKVGTASFDVVLVLDNSGSMAGSRMTTLKQAASDLIRTLMSINEISTE 175 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + L+ ++ A + +P+ +T Sbjct: 176 DDRVMVGLVPFTAFVNIGADKATQPWMDREGRSPVHWTNFQTGSD 220 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/239 (15%), Positives = 69/239 (28%), Gaps = 64/239 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM ++ A K+ F+ H + Sbjct: 99 DIVLAIDLSGSMLAEDFEPK--------------NRIEAAKSVATDFI-------HQRLS 137 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKK 310 +GL+ ++ + + + +++ + T A+ A L Sbjct: 138 DRIGLVVFSGKSFTQCPLTLDYRLLTNFISELKAGTIEEDGTAIGTAIATATNRLRESTA 197 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-- 368 + K II LTDG+NN + + + A IKI T+ Sbjct: 198 K------------------SKVIILLTDGQNNAGEI--EPVTAAELAAALGIKIYTVGAG 237 Query: 369 -------------IN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + S ++ + +SL + I + Sbjct: 238 TRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRYFRATDLESLKKTYHEIDE 296 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 39/264 (14%), Positives = 78/264 (29%), Gaps = 70/264 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + +D + ++ A K F+ Sbjct: 81 DVSTRTNSTQGIDIVMAIDVSASM---------------LARDLQPNRLEATKAVGEEFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAM 298 + +GL+ Y+ + V + + ++++ T + Sbjct: 126 K-------GRPSDRIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAIGSGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L S K II LTDG NN+ + + AK Sbjct: 179 ATSVNRL------------------KDSKAESKVIILLTDGVNNSGFID--PKVASELAK 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK+ TI + + + L + + ++ N Sbjct: 219 EFGIKVYTIGVGTNGMALTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNN 278 Query: 396 DSLIHVFQNISQ----LMVHRKYS 415 + L ++ I Q + KY+ Sbjct: 279 EKLEDIYDEIDQLEKTEIEEFKYT 302 >gi|227114885|ref|ZP_03828541.1| hypothetical protein PcarbP_18070 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 539 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 47/470 (10%), Positives = 119/470 (25%), Gaps = 98/470 (20%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME---SANNAAILAGASK 67 I EK + +AL M+ L+ F++ ++ A A+ A+ Sbjct: 25 LSAFIQEEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVAHQATI 84 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-GYSAVFYNTEIQNIVNSSR 126 S+ + ++ A +++ G ++ T + ++ Sbjct: 85 NGEEYSQEETNKLAYDYVKNNLGMNKALSDKLEVGDVAVTEGRNSETRKTYTVTVAFETK 144 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---------------------- 164 S+ ++ + + + ++ T + + + Sbjct: 145 PSLLNLGARQQEVF--STAEVINRPTEVAFVMPVTGDMSEGDIRSLKSVSRSFVERMLSS 202 Query: 165 LNQRYNQKIVSFIPALLRIEMGER-----------------PIFLIELVVDLSGSMHCAM 207 + + + +S +P + + + P + S+ Sbjct: 203 ADGKRDNLWLSLVPYSQSVSVYDAEDANRIRRWSAPGALNPPELRSLFASGVVSSLADRR 262 Query: 208 NSD----------------PEDVNSAPICQDKKRTKMAALKNA------LLLFLDSIDLL 245 D + P+ Q K + +N + + D Sbjct: 263 FPDRRANLLCMYRGLGREENFFWDEPPVGQFKVYYRHDLPQNGSPGAPPISWRGPNPDFD 322 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 D + + + K+ Q + + + + M A L Sbjct: 323 DTDAVDTRWIVADRGCPNAALMPLTNEESKLNQRIDQFSARFNV---NYAIGMSWAGAAL 379 Query: 306 TSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNT------------ 350 + + + S + + QK I+ L + N F ++ Sbjct: 380 SPNMRGSDGWGDATLPLDFNLDGNGDGQKVIVMLANTIGNWFDTDSYNFNRNAFSGQTGS 439 Query: 351 -----------IKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPE 387 +C + IK + I + R L ++P Sbjct: 440 DPARTFAAQRFRDLCSSFRARNIKFYFVGIRPGDPEDFGRNLFDAEATPG 489 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 71/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 83 PRDRATVVVAIDVSRSMVATDV---------------DPSRLSAAKTAAKDFLGDLPP-- 125 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V + T + + L Sbjct: 126 ----RFNVSLVKFAASSQVVVPPTTDRAVVSTAIANLQVL---PSTAIGEGIYSSLNAL- 177 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 178 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 225 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 226 IAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 285 Query: 413 KY 414 K Sbjct: 286 KV 287 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 72/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDVE---------------PSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V +T T + + L Sbjct: 128 ----RFNVSLVKFAASAQVVVAPTTDRAAVSTAITNLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 67/213 (31%), Gaps = 47/213 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +++ +D + T+ A + A F++ + +GLI + + Sbjct: 86 DASTSMQAEDFQPTRFEAAREAAGAFVE-------GRVSDRVGLIVFAAEAYTQAPLTLD 138 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +++ + + T A+ A L + K Sbjct: 139 YSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAE------------------SKV 180 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--------------- 377 I LTDG NN + + + A+ +++ I + +S + Sbjct: 181 AILLTDGRNNRGQID--PRTAAEVAQTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAG 238 Query: 378 -----LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 L S+ +++ N D+L ++ I Sbjct: 239 VDAEMLRSVSTSTGGQYFSATNRDALERIYAEI 271 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 77/256 (30%), Gaps = 52/256 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++ + + R ++ LV+D+S SM + +MAA Sbjct: 79 LTIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDV---------------QPNRMAAA 122 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F D + + +GLI Y + P+ + ++ + + + Sbjct: 123 QEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNRDSTKRALDKLQFADR--- 173 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A Q + + P I+ +DG+ + N Sbjct: 174 TATGEGIFTALQAI----------ATVGAVIGGGDAPPPARIVLFSDGKETMPTNPDNPK 223 Query: 352 KI---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 AK+ + I TIS + + L K S YN Sbjct: 224 GAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAAT 283 Query: 395 ADSLIHVFQNISQLMV 410 L V+ + Q + Sbjct: 284 LQELKSVYATLQQQIG 299 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 75/230 (32%), Gaps = 54/230 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + +M A K+ F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLRPNRMEAAKDVATEFI-------SGRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + LI T + A L S Sbjct: 148 HASLLRLLQATRTDIAARGLIDDGTAVGMGLANAVSRL------------------KDSK 189 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN + + + + AK I++ TI + + Sbjct: 190 SKSKVVILLTDGSNNMGEIS--PMTAAEIAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L ++ Y N + L ++++I + + K++V Sbjct: 248 VNIPVEIDTKTLSDIAQTTDGNFYRATNNNELKKIYRDIDK-LEKTKFNV 296 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 80/244 (32%), Gaps = 51/244 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + LV+D+S SM +C D + T++ A + A + F+D ++ Sbjct: 103 PKNRATVVLVIDISLSM---------------MCDDVRPTRVDAARQAAIKFVDEMEPT- 146 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 + +GL+ + + I PS E V++ + + L T + + A Q + Sbjct: 147 -----LQLGLVTFAGTAQTLIAPSSDHEVVKRALDDAIRPDKLAARTATGEGIYTALQQI 201 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFI 362 + K S I+ +DG+ +AK + Sbjct: 202 ETLKGILGGA----------SKAPPARIVLESDGKETVPDDLNAPRGAFTAAKEAKAKEV 251 Query: 363 KIVTISIN----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I +IS + L K S + + D L V+ +++ Sbjct: 252 PIYSISFGTASPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTASSLDQLTDVYSSLN 311 Query: 407 QLMV 410 + Sbjct: 312 AEIG 315 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM + +MAA + Sbjct: 80 TIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDV---------------QPNRMAAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ + + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNRDSTKAALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P I+ +DG+ + N Sbjct: 175 ATGEGIFTALQAI----------ATVGAVIGGGDKPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + L K S YN + Sbjct: 225 AFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ + Q + Sbjct: 285 QELKAVYATLQQQIG 299 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 39/353 (11%), Positives = 91/353 (25%), Gaps = 14/353 (3%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + + ++ + + + T Q++V S + N + Sbjct: 2 KQMARFSYLFTFLLLLNFIFSTNLASASNNVTVNLSVTPSQSVVILPTTSNAKASLNLM- 60 Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + + + + V I + F + D Sbjct: 61 LTPTGNPQTERDPIDLVFVFDKSGSMDFKVASNSSVKRIDSAKSAMTNALMFFDGQNTSD 120 Query: 199 ----LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + S + + +S +K+ + N + S DV Sbjct: 121 RFGFVPFSSNANTDVVSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVAS 180 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP--------AMKQAYQILT 306 L +++ + I + GT + + + + T Sbjct: 181 KLFDSSSKDKNIIFLTDGTPTFSFSDEKVYYKSYWGSSSGNDRVEYQSYTDLTYQFTETT 240 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + + T N N K+ + IK+ T Sbjct: 241 KYWYKMGDYFYKNNTKYNMNTTYNPNGNATTYKNNFEKLVKQNDKKVVQSLSSSGIKLYT 300 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + S +G L + + + N + + V NIS+ + K V +K Sbjct: 301 LGFGDSIDGDYLAELANLTGGSYKNAI-GQDINEVLMNISEEVAAPKVDVEVK 352 >gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] Length = 584 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 69/205 (33%), Gaps = 31/205 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +K+A LK A+ +D++ + ++ ++ + I + + + Sbjct: 157 SGSMQGSKLALLKQAMGFVIDNLGPAD------RLSIVSFSNDASREIRLTRMSGDGKAS 210 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++SL T+ + + A ++L + R+ T+ +I L+D Sbjct: 211 AKEAVESLVADGSTNISRGLLVASEVLADRRYRNAVTS----------------VILLSD 254 Query: 339 GENNNFKSNVNTIKIC----DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 G++N N + A I T + + + + V N Sbjct: 255 GQDNQSGVGRNHQNLVPPLFRDADSRPGSIHTFGFGSDHDAAAMHAIAEVARGTFSFVEN 314 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 + F +Q + SV+ + Sbjct: 315 LAVIQDSF---AQCIGGL-LSVVAQ 335 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 79/240 (32%), Gaps = 50/240 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR---TKMAALKNALLLFLDSIDLLSHV 248 I L +D+SGSM + I K + ++ K + F++ Sbjct: 85 DIVLALDMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRFIEQ------- 137 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + +GLIG+ + P+ + ++ + +I + T + L Sbjct: 138 RPNDRIGLIGFADQAYSFAPPTLDHAWLLAHLEQLEPGMIGQQTGIAAPLASGVNRL--- 194 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S ++ ++ TDG NN + + KE + I T+ Sbjct: 195 ---------------KKSDAPRRVLVLFTDGRNNVDN-RLTPEQAAALGKEFDVVIHTVG 238 Query: 369 INA--------------------SPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQ 407 I + ++LL++ + +++ +AD + V I+Q Sbjct: 239 IGSRNAFVLVTDPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGMKQVMDEINQ 298 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 73/230 (31%), Gaps = 54/230 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +M A K+ F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLTPNRMEAAKDVAAEFIA-------DRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNMG--DLSPMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + L ++ Y N L ++++I + + K SV Sbjct: 248 VNMPVEIDTKVLKDIAATTDGNFYRATNNQELKQIYKDIDK-LEKSKMSV 296 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 71/242 (29%), Gaps = 51/242 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + + +D+S SM ++++A K A FL + Sbjct: 85 PRDRATVVVAIDVSRSMVATDV---------------DPSRLSAAKTAAKDFLGDLPP-- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + + + P+ V + T + + L Sbjct: 128 ----RFNVSLVKFAASSQVVVPPTTDRAVVSTAIANLQVL---PSTAIGEGIYSSLNAL- 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K P I+ L+DG N + + ++ +A + + T Sbjct: 180 ---------KLVPDDPKHPGQKPPAAIVLLSDGATNVGRPS---LEAAKEAGRQHVPVYT 227 Query: 367 ISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I+ N L +S ++ + L V+++I+Q + + Sbjct: 228 IAYGTAGGYVVEGGQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYE 287 Query: 413 KY 414 K Sbjct: 288 KV 289 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 75/232 (32%), Gaps = 36/232 (15%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVN--SAPICQDKKRTKMAALKNALLLFLDSI 242 + ++ + + VDLS SM P + + +++ ALK+ L F Sbjct: 87 VTQKAGRDMMIAVDLSQSMEQKDYLLPSEEPSQDGQLKAASNVSRLVALKSLLSSFSQQ- 145 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 ++ +GLI + + + + + + + + T A+ + Sbjct: 146 ------RDGDRLGLIVFGSGAYLQVPFTEDVRLWQTLLEQMDTQMAGPATAIGDAVGLSI 199 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + S Q+ ++ +TDG + + + + + A I Sbjct: 200 RAFE------------------RSNTSQRILLLVTDGSDTSSRLD--PVDAARVAAAEGI 239 Query: 363 KIVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +I T+ + + L K + + ++ ++ + I + Sbjct: 240 EIFTLGMGSVDTVGDDQVDFNTLNKIAKITNGRAFEGNSSTAIAEILAQIDK 291 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/437 (9%), Positives = 107/437 (24%), Gaps = 69/437 (15%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + + + IF L + + ++ + ++ + A+ LA ++ Sbjct: 4 FDMNRLKNSCGAAAFIFILILPVLICVMALSLQASQILLAQSKITEASEVTSLALSALSE 63 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + S++A R L + ++ + F T++ Sbjct: 64 ERAQQKL------SSYATRVLKHYLVGTDDVKGQATMQSSTFQF-QTDLVGEATHEFWFK 116 Query: 130 THMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHLLNQRYNQKIV--SFIPALLRIEMG 186 + S + + + E + ++ + I ++ Sbjct: 117 HKPQADTFKVSGASTSRKHKPQPMDVYFITDLSESMNRSEPSRLTIVKDAIRQVVSKLPK 176 Query: 187 ERPIFLIELVVDLS-----------------------GSMHCAMNSDPEDVNSAPICQDK 223 I + + D + Sbjct: 177 GSRAAFIGYNTENVKLTGRYFDKRTGREITSKKPTELQGPNIWAEKSIYDYLTGRHPDFV 236 Query: 224 KRTKMA-ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 AL +D + + T +I + ++ + Sbjct: 237 IENLFDIALSKKKKSSIDKLLEGYGPPDLDQKIRDFETKYPFYDISLTTDLGSFKETIKS 296 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T S + A + RQ + + Q+ + LTDGE++ Sbjct: 297 NAIN-ANGNTHSWNGIIAAAR------------EAHRQPSSVFNP--QQVFVLLTDGEDS 341 Query: 343 NFKSNVNTIKICDKAKEN-------------------FIKIVTISINASPNGQRLLKTCV 383 +C+K + + + + I + +P + C Sbjct: 342 KKFPKGYYAPLCEKIRNDISDKQNRSQIQNASVEEKTKVTMSVIGVEFNPYKNEGVTEC- 400 Query: 384 SSPEYHYNVVNADSLIH 400 E + D L+ Sbjct: 401 FGRENIFEAKREDELVK 417 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 32/421 (7%), Positives = 82/421 (19%), Gaps = 66/421 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + N +++F L++ L + + + S++ A + A LA + Sbjct: 23 IMRFRREKAGNVAVVFGLTLPVLALCFATAVDLSGIYGANRSLQQAADVAALAAGREYGR 82 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-- 128 S + A + ++ Sbjct: 83 TQDADYLSSVSEAFFFHNAGDE---TRGTTQFSYDGVFREDGLTILKVTARRQLPTFFGD 139 Query: 129 ------MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL------------------ 164 + + + + L + Sbjct: 140 ALMWVTGGKLDWRQFPLYAKSEIVVQNRSIELALVLDNSGSMQDRPRSGGSKSKIDIIKD 199 Query: 165 --------------LNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNS 209 + + +P + +G + S H ++ Sbjct: 200 AAEDLAKQFLSSDKGSTEEFPVQFAVVPFSSSVNVGPQYKNADWMDTQGRSPIHHENLDW 259 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + R + + + Y TR+ Sbjct: 260 GGWLSGATSGGWEWIRDR------------GWVYTAPSSGAPMARYNGSYWTRITTGEPL 307 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + A K + + Sbjct: 308 TRFYVYDNARYKSQFGTWRGCVEARPNGLA-ATDTAPVSSKPETLFVPTFAPDEYDDSDY 366 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNGQRL--LKTCVSS 385 + G + + + AK ++ I T S N TC ++ Sbjct: 367 GWNDYLDS-GSGSPGSAKEAMAEQAKVAKYFDSGYSITT----PSSNRSDWGPNSTCATT 421 Query: 386 P 386 P Sbjct: 422 P 422 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 30/286 (10%), Positives = 74/286 (25%), Gaps = 84/286 (29%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + ++S + MA F + + G Sbjct: 361 YDDSDYGWNDYLDSGSGSPGSAKEAMAEQAKVAKYFDSGYSITTPSSNRSDWG----PNS 416 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + T+ ++ T+ + +++LT+ F +G Sbjct: 417 TCATTPITPLTKTLKTVTDAIDVMGAQGATNIPHGLAWGWRLLTAR-------PPFTEGR 469 Query: 323 KIPSLPFQKFIIFLTD-------------------------------------------- 338 K ++ +TD Sbjct: 470 SHDEPDNLKVLVLMTDGNNTYNLNSGGRPLEIRDYNRSTYGSYGYGAAYSHGSSSRKPGR 529 Query: 339 ------GENNNFKSNVNTIKI-------CDKAKENF--------IKIVTISIN--ASPNG 375 G ++ + + C+ K + I I TI+ + Sbjct: 530 IYDGTTGNAKDYSVDSYVAAMDQNVAKVCENVKADGRKPGGTDGILIFTIAFDLRDGEPV 589 Query: 376 QRLLKTCVS------SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++L++ C S S + +Y+ + + L FQ+I++ + + + Sbjct: 590 KKLMEDCASNGLIDASEKLYYDAQSQEELAAAFQSITEQISSLRIA 635 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 42/265 (15%), Positives = 79/265 (29%), Gaps = 69/265 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I + +D+S SM + +D K ++ ALK Sbjct: 78 TKDVSTRTKTTKGIDIVMAIDVSSSM---------------LARDLKPDRLTALKKVAAD 122 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + +GL+ Y + V + + + T Sbjct: 123 FIKK-------RPNDRIGLVAYAAESYTKTPITSDKSIVLSSLRQITHGQLEDGTAIGMG 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L S K II LTDG NN+ + D A Sbjct: 176 LATAVNRL------------------KDSKSKSKVIILLTDGVNNSG--FIEPKTAADLA 215 Query: 358 KENFIKIVTISINASPNG----------------------QRLLKTCVS-SPEYHYNVVN 394 E IK TI + + N ++LL+ + + ++ + Sbjct: 216 VEYKIKTYTIGLGTNGNALTPIAFNPDRTYRYGMRQVEIDEKLLEEIATVTGGQYFRATD 275 Query: 395 ADSLIHVFQNISQ----LMVHRKYS 415 + L ++ I++ + KY+ Sbjct: 276 NEKLSAIYNEINKLEKTDIEEIKYT 300 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 56/513 (10%), Positives = 124/513 (24%), Gaps = 127/513 (24%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 SR + + KAN + FAL+++ L IG K ++SA A Sbjct: 8 SRATALVARFVGDSKANIATTFALALLPILTAIGCGTDYSMAMRLKVKLQSAA---DAAS 64 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE--SLSGYSAVFYNTEIQNIV 122 + + N + S+ + A +++A K + + S T+ ++ + Sbjct: 65 IASISVNSAGYAAAMAMTSDGSVTAGVNEADNIFKGNASTFGGYTLTSETSTVTKTRSTL 124 Query: 123 NSSRISMTHMANNRLDSSN---------NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 +S + L ++ + + + L + + + Sbjct: 125 SSQVQFTAAVPTTFLTVIGYQSITVSGSSSSSVTLPLYLDFYLTLDVSGSMGLPSTSAE- 183 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHC------AMNSDPEDVNSAPICQDKKRTK 227 + P + + + H +S + + C ++ Sbjct: 184 -------AQRMQAISPDNYRQYPTGCTLACHFSPQNSACTDSGTQGYPTNNYCLGYAISR 236 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-----PSWGTE-------- 274 ++ L + + + G + + P+ GT+ Sbjct: 237 --VSQSGYKSLLTTNKNNPKGVQLPSSIVSGLPNSLYNKLPTVANCPTDGTDDCIQLRLD 294 Query: 275 --------------KVRQYVTRDMDSL----------------ILKPTDSTPAMKQAYQI 304 ++ + L I T + + A Sbjct: 295 AVGYAVNQLFTTANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSSTINYAAAN 354 Query: 305 LT------------------SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---- 342 L S + G + ++ +TDG + Sbjct: 355 LATLLDTNTNASLGSGGTHIDTALSSVNSLITSVGDGSATTNTLPYVFLVTDGAQDPQVK 414 Query: 343 ----------NFKSNVNTIKICDKAKENFIKIVTISI-------------NASPNGQRL- 378 N + +N C K I I + I + Sbjct: 415 GVPNGSWSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNASFAGDEDDYANNN 474 Query: 379 -------LKTCVSSPEYHYNVVNADSLIHVFQN 404 L+ C S P + Y + Sbjct: 475 IPNIPPSLQACAS-PGFFYTANTPADITSALNA 506 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 71/230 (30%), Gaps = 54/230 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F+ + + +GL + + Sbjct: 95 DVSASMLAEDLKPNRIEAAKDVAAEFI-------SGRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRDMDSLI-----LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + T + A L S Sbjct: 148 HASLLTLLQDVRTDMATRGLINDGTAIGMGLANAVSRL------------------KDSK 189 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN----------------- 370 + +I LTDG NN +++ + AK I++ TI++ Sbjct: 190 TKSRVVILLTDGANNAG--DISPLTAAQMAKSLGIRVYTIAVGTSKVAPYPIEVGGRVQY 247 Query: 371 ----ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 A + + L + + Y+ N L ++ +I + K SV Sbjct: 248 ISRPADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHDI-DQLEKTKMSV 296 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/293 (13%), Positives = 81/293 (27%), Gaps = 67/293 (22%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHC 205 + RL I P + + I LV+D+S SM Sbjct: 62 VANNNPWTYLRLIPTGFFFLALIMVIIALARPQKSNERVEQFTEGIDIMLVMDISESMDL 121 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 QD K ++ A K + F++ + +G++ + Sbjct: 122 ---------------QDFKPNRLEAAKATAIDFIN-------GRFGDRIGMVVFAGEAYS 159 Query: 266 NIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + + + + K T A+ A + Sbjct: 160 LAPLTNDYKLLTDLIQDISFNMMEAKGTAIGSAIASATNRM------------------K 201 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------- 373 S K +I L+DGE+N + + A IKI TI++ Sbjct: 202 ESESASKVLILLSDGESNAGNVD--PLFAAQLASALDIKIYTIAVGKDGMVPYGTDFFGR 259 Query: 374 --------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRKY 414 + L + + + +L ++F I ++ +Y Sbjct: 260 PQMVESYLDETNLREIAKIGNGEFFRASDGGTLNNIFDRIDTMEKAEILENRY 312 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 67/196 (34%), Gaps = 37/196 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + T+ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG NN + + I A E +KI I I + Sbjct: 188 ------------RMRPATSRALVLVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 233 Query: 372 SPNGQRLLKTCVSSPE 387 P+ L +P Sbjct: 234 DPDKDALQSVLGLNPS 249 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 58/182 (31%), Gaps = 27/182 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + LL F+D + + L + S+ + ++ + Sbjct: 95 GGRFDIARKVLLDFIDR-------RPKDRIALEVFADYAYLAAPMSYEKKGLKTILAALE 147 Query: 285 DSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++ T A+ ++ ++ RS + +I LTDG + Sbjct: 148 PGVVGGRDTALYEALFLGARLFKKEEGRS-----------------NRVMILLTDGIDTV 190 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + K I++ T+ + L K S+ Y+ ++L ++++ Sbjct: 191 GNIP--LEAAIRELKRAHIRVYTVGVGDDFRRGVLEKIARSTGGRFYDARYPEALANIYR 248 Query: 404 NI 405 I Sbjct: 249 RI 250 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 80/240 (33%), Gaps = 43/240 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P + L+VDLSGSM +D + ++ A+K L FL Sbjct: 90 DQPTRDLLLLVDLSGSMDTKDFTDAS---------GRTVDRLTAVKAVLDDFL------- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ G++ + + + + R+ + + + T A+ + Sbjct: 134 SRRKGDRAGVVVFGDAPFALVPFTTDLDLCREMLRDTVVGMAGPRTALGDAIGLGIALFD 193 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 ++ K II LTDG + + + AK+ I I T Sbjct: 194 RSTVKA------------------KTIIALTDGNDTASQVP--PTEAAGVAKDKGIVIHT 233 Query: 367 ISINASPN------GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 ++I + LK S + + ++ D L ++ + R +V + Sbjct: 234 VAIGDPSTVGEDKLDETALKDVASATGGGFFRALDRDELARIYGRLDGTETRRIDTVTFR 293 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 33/290 (11%), Positives = 75/290 (25%), Gaps = 22/290 (7%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + + + + S++ +I+FA+ + L + + H K +++A +AA L GA Sbjct: 4 KHHSSRRLQSQRGGTAILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARS 63 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + + + A + A ++ + ++ Sbjct: 64 LSDPGGQPYNWSAASIKALDVARSNVANGGQIQDAAIETGYWNILNPALGMRPAGTPGVP 123 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + R+ ++ + N + + L I P M Sbjct: 124 ATGDVPAVRVTTAISATQNNGPLQLLFAPIL---GITERSIQASAIAVIAPPSGGTGM-- 178 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 V+ H V + K + +++ L Sbjct: 179 -----FPFVIATPMLDHYWDRDTNSPVLENGVAPTIKLGSIYHFEDSDDGVLSGEWTTFQ 233 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ G W K D I T P Sbjct: 234 TEDGNPSGR------------FLWDLLKDLTTNGNDTALYIGDNTYIQPG 271 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 76/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM +MAA + Sbjct: 80 TIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APNRMAAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P I+ +DG+ + N Sbjct: 175 ATGEGIFTALQAI----------ATVGAVIGGGDKPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + L K S YN + Sbjct: 225 AFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ + Q + Sbjct: 285 QELKSVYATLQQQIG 299 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 33/351 (9%), Positives = 90/351 (25%), Gaps = 33/351 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R ++ + ++ +II LS+ + +G + + + ++ ++++ +A Sbjct: 1 MNSAARHPKVMRRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + + + + + + Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFV---FFQNKSVQMSTNANVTFSDSLTDPFLT 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + S SN ++ L + A+ Sbjct: 118 RSAVTTPSSIKYVQCTATLSNIAHWF-----IEVLNVL------PGTKLANAAEVSASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V +GS + ++ + AAL Sbjct: 167 ATVGGGQTTCAIPVFVCRATGSPSYNVGDWITSLSGSSTTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P G+ ++ R ++ T+ Sbjct: 218 -----DGSTNEPTIASELSGNTCNITSPPDLGSTGLKSASLRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 Y + + F + DG + Sbjct: 273 QPDFTGYAYVGPNYGPPGTAGIKGDAYTQFVTDRTTFKPYQGDGAPPSGSG 323 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 74/244 (30%), Gaps = 59/244 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I L +D+SGSM + ++ +++ +K F++ Sbjct: 90 QASGIDIMLALDVSGSMIAEDFT----------IGGERASRVDVVKQVTQKFIE------ 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + +G+I + R + + Q + R L+ T A+ L Sbjct: 134 -ARPNDRIGMIAFAARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGTAIGSAIASCTTRLI 192 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + ++ LTDG+NN K + + + A +K+ T Sbjct: 193 ERKDSK-----------------SRIVVLLTDGDNNAGKVS--PLTAAEAASALGVKVYT 233 Query: 367 IS---------------IN--------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + L K + Y + SL +++ Sbjct: 234 IGAGTKGFAPMPVGRDVFGRKVYQNVKVDVDEDTLKKIADMTKAKFYRATDTKSLTQIYE 293 Query: 404 NISQ 407 I Q Sbjct: 294 EIDQ 297 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 69/220 (31%), Gaps = 51/220 (23%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ A K F+ H ++ +GL+ ++ + + +++Y+ Sbjct: 127 DFTPNRLEAAKLVAKNFV-------HGRKYDRIGLVIFSGEAYSVSPLTTDYKLLKRYIE 179 Query: 282 RD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D + T A+ + R K +I ++DG+ Sbjct: 180 DIREDMIQENGTAIGSALGMGTIRMQESASR------------------SKVVILISDGD 221 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTI-------------------SINASPNGQRLLKT 381 N + I A + IKI TI +N + + L + Sbjct: 222 NTAGNLD--PITASRLATAHNIKIYTILVGRSGKVPYGRDMFGQPQYVNNTVDESVLREI 279 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRKYSVI 417 Y + +L +VF I++ ++ ++ I Sbjct: 280 AKIGEGKFYRASDNQALKNVFAEINRLEKTEIIENRFKSI 319 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ LV+D+S SM ++AA K A F D + Sbjct: 95 NRAVVMLVIDVSESMASTDVP---------------PNRLAAAKEAGKQFADQLTPA--- 136 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ V+ + T + + A Q + + Sbjct: 137 ---INLGLVEFAANATLLVPPTTNRSAVKAGIDSL---QPAPKTATGEGIFTALQAIATV 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 I+ +DG N A+ ++I Sbjct: 191 GSVMGGGEGP----------PPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQIS 240 Query: 366 TISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TIS + Q L K C + ++ + DSL +V+ + + + Sbjct: 241 TISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIG 299 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 78/249 (31%), Gaps = 56/249 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + LVVD+S SM D +++ A + A F D + Sbjct: 84 PRNRATVMLVVDVSNSM---------------KSTDVAPSRLKAAQAAGKRFADDLTDG- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + P+ + + R + T + + A Q + Sbjct: 128 -----INLGLISFAGTASTLVSPTPDHSATKNALDRLKLADK---TATGEGIFAALQQID 179 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK---ICDKAKENFIK 363 + + PS I+ L+DG+ +S + KAKE + Sbjct: 180 T----------LNAVLGGPSGAPPARIVLLSDGKQTVPESPDDPRGGFTAARKAKEKGVP 229 Query: 364 IVTISIN-------------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + TIS + + L K S + + D L V+ Sbjct: 230 VSTISFGTLTGTVDLETPGGGVERVPVPVDDESLRKIANLSGGDFFTASSLDELNKVYST 289 Query: 405 ISQLMVHRK 413 + + + + + Sbjct: 290 LQKQIGYER 298 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 79/263 (30%), Gaps = 70/263 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+S SM + +D ++ ALK F+ Sbjct: 82 DVSTKTKTTRGIDIVMSIDVSASM---------------LAKDLLPNRLEALKKVAADFI 126 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAM 298 + + + +GL+ Y + +R +++I T + Sbjct: 127 E-------GRPNDRIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKYNTIIEGGTAIGMGL 179 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L + + K II LTDG NN + + A Sbjct: 180 ATSVNRLKDSRAK------------------SKVIILLTDGVNNGGFID--PKIASELAV 219 Query: 359 ENFIKIVTISINASPNG----------------------QRLLKTCVS-SPEYHYNVVNA 395 E IK+ TI + + + LLK + ++ N Sbjct: 220 EYGIKVYTIGLGTNGTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVTGGKYFRATNN 279 Query: 396 DSLIHVFQNISQ----LMVHRKY 414 L ++ I++ + +KY Sbjct: 280 KKLAQIYDEINKLEKTEIEEKKY 302 >gi|2811055|sp|O07395|Y335_MYCAV RecName: Full=UPF0353 protein MAV335 gi|2183263|gb|AAC46199.1| MAV335 [Mycobacterium avium] Length = 335 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 76/255 (29%), Gaps = 51/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM +MAA + Sbjct: 80 TIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APNRMAAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q+ + P I+ +DG+ + N Sbjct: 175 ATGEGIFTALQVQAIATVGAVIAG---------DKPPPARIVLFSDGKETMPTNPDNPKG 225 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + L K S YN + Sbjct: 226 AFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNARSL 285 Query: 396 DSLIHVFQNISQLMV 410 L V+ + Q + Sbjct: 286 QELKSVYATLQQQIG 300 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 59/215 (27%), Gaps = 54/215 (25%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D + + E + E + V + L T + + Sbjct: 436 DGSTRAAIISETSPETYGPNASCPEPLTRLTNNLSTVTAAIDSMNYWL-NGGTVISEGLM 494 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---NNFKS---------- 346 A++ L+ + G +K I+ +TDG N +N + Sbjct: 495 WAWRTLSP-------QKPYADGAAYTDKKTKKVIVLMTDGVNGLADNGNAASANISDYSA 547 Query: 347 -------------------------NVNTIKICDKAKENFIKIVTISINAS--------P 373 + K CD AK I I T+ N + Sbjct: 548 YGYMGASRLSVADGVTTYAGLQTFLDDRLKKACDNAKAKGISIYTVMFNHNGFLSATEQA 607 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 LL C S PEY + ++ +L F I+ Sbjct: 608 RSATLLSYCASKPEYAFLATDSAALNSAFGQIASS 642 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 19/271 (7%), Positives = 63/271 (23%), Gaps = 29/271 (10%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 I F L+++ + G I ++S++ A + Sbjct: 2 IAFGLAMIPVTFMAGAAIDYGRATLLRSSLQK--------AADAGALAAGARTSLTQLAR 53 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + + ++ +++ ++Q + + M + + + Sbjct: 54 EQIAKNAVLANLGAKARNLSLTITETEPSAGVFQVQIQASIATSIMKVARFDTIPVVVTS 113 Query: 144 IFYNMDV---MTSYDYRLQFIEHLLNQ------------------RYNQKIVSFIPALLR 182 + L + + VS +P + Sbjct: 114 EARVVGGSTNPIEIALALDNTGSMRDDMPALKQAAKTLAQNVMSGSGGNVKVSVVPYVAA 173 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + G + ++ + + M + ++ D+ Sbjct: 174 VNPGLTDLTQVDTAAVAPATGNWFMYGWLGYDKNCTPSWGPGGGTGPGSGSSGDTTGDAR 233 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 DL+ + + + + + T Sbjct: 234 DLIQLLDPFRRIARELFGVGAAYAADVTPNT 264 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 84/271 (30%), Gaps = 62/271 (22%) Query: 164 LLNQRYNQKIVSFI--PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + I + + I + +D+SGSM + + Sbjct: 59 MRWFVLALFIFALAQPRFVQSETKVSASGVDIVVALDMSGSM--------LAEDEGFVLN 110 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++ T+ ++ L F+D ++ +GL+ + T+ + P+ E + + + Sbjct: 111 GQQATRFIIARDVLKKFVDK-------RQSDRIGLVVFGTQAYVAVPPTLDHEFLLKNLE 163 Query: 282 RDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R I T A+ + L K + K II +TDG+ Sbjct: 164 RLGIGSINGNQTAIGSALSTSMNRLRELKSK------------------SKIIILMTDGQ 205 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN------------------------ASPNGQ 376 NN K + + A+ IKI TI + + Sbjct: 206 NNAGKVP--PLTAAEAARALGIKIYTIGVGTKGVARMAVGTDPFSGQKIYQQVPVDIDEG 263 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + +Y + +L ++ +I + Sbjct: 264 TLTSISKMTNAKYYRADSTATLEKIYADIDR 294 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 76/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM +MAA + Sbjct: 80 TIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDV---------------APNRMAAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + + P I+ +DG+ + N Sbjct: 175 ATGEGIFTALQAI----------ATVGAVIGGGDKPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + L K S YN + Sbjct: 225 AFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQLSGGNAYNAASL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ + Q + Sbjct: 285 QELKSVYATLQQQIG 299 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 31/327 (9%), Positives = 83/327 (25%), Gaps = 76/327 (23%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L ++ + + + + L+ + + + S Sbjct: 4 VLDVSSSMIEENRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVNFGLANTAWLK 63 Query: 217 APICQDKKRTKMAALKNALLL-------FLDSIDLL-SHVKEDVYMGLIGY--------- 259 + + K+ ++D + + S K +MG I Sbjct: 64 SYNGTAAVPKRWTDPKSVYTSSSYKLSYWIDGVTPVMSTSKNYYWMGCIEPRADVEVRDT 123 Query: 260 ----------------------------TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + ++ V Sbjct: 124 GAIGDGMGDAPPGTAAFVAMDANPKSGTSFCPPPVTPLTGDFAYLKSVVKNLTSE---GS 180 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-- 349 T + + L+ + + S K ++F+TDGE N + Sbjct: 181 TRLDAGVVAGWYTLSPKWQGVWGDETSPA---EVSDSVHKVMVFMTDGEMNTKYDPNDKF 237 Query: 350 -------------------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 C K++ I+I T+S +A + + + C ++ + + Sbjct: 238 DWICSQTQSSACNAFATAAMQTACTAMKKSGIEIYTLSYSADADVVNI-RNCATNTAHFF 296 Query: 391 NVVNADSLIHVFQNISQLMV--HRKYS 415 + ++ V++ I+ + + + Sbjct: 297 TA-SPATIKTVYETIAAAIRGDTLRLT 322 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 60/218 (27%), Gaps = 51/218 (23%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 QD ++ A K F+D + +GL ++ + + ++ + Sbjct: 120 QDFTPNRLEAAKQVANDFID-------GRFQDRIGLTIFSGEAYSLSPLTTDYKMLKNQI 172 Query: 281 TRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T + T A+ + + K +I L+DG Sbjct: 173 TDIDFKMMEASGTAIGSALAVGTNRMRESDSK------------------SKVLILLSDG 214 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISIN-------------------ASPNGQRLLK 380 +NN + A IKI TI+I S + L Sbjct: 215 DNNAGNID--PETSAKLANAYGIKIYTIAIGKEGKVPYGKDFFGRTRYIENSMDVTGLKN 272 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ----LMVHRKY 414 Y + +L VF I Q + +Y Sbjct: 273 IAKIGEGQFYRATDNQALEEVFSIIDQYEKAEIKETRY 310 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 66/216 (30%), Gaps = 48/216 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D T++ A K F++ ++ +GL+ + + Sbjct: 95 DISGTMMANDFSPTRLEAAKKVASEFIN-------DRQSDRIGLVIFAGESFTQCPLTTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ +I T + + L + + + Sbjct: 148 HRVLLNLLSEVKFGMIEDGTAIGLGLANSVNRLKDSQSK------------------SRV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--------------------- 371 +I LTDG NN + + + A I++ TI I + Sbjct: 190 VILLTDGSNNAGQI--APLTAAELAASYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSG 247 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + + ++ + SL ++ I Q Sbjct: 248 DFDERTLTEIASKTGGSYFRATDNTSLSGIYDEIDQ 283 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 43/234 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + VDLSGSM Q ++ +K L FL + Sbjct: 30 MMVAVDLSGSMEAKDF---------VDQQGINVRRIDGVKLLLESFLQQ-------RTGD 73 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + + + + + + + T A+ A + + Sbjct: 74 RVGLIAFGDDAYLQAPVTEDFDTLSLLLEQMDVRMAGAGTALGDAIGVAVNHFEHSESNN 133 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K ++ LTDG++ + + A E + I I+I + Sbjct: 134 ------------------KVLLLLTDGKDTTSQFP--PVDAAHFAGERGVTIYPIAIGDA 173 Query: 373 PNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 N + L + + + ++ D+L V+Q +++L S +++ Sbjct: 174 TNVGEEAIDLEMLARIASYTGGQVFEALDGDALAAVYQTLNELEASLFESYVIR 227 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 68/211 (32%), Gaps = 43/211 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + D ++ K+ F+ + + +GL + + Sbjct: 95 DISASMLSDDVDPNRIEVAKDVAKEFIA-------SRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + ++ LI T + A L K+ Sbjct: 148 HASLLNLLAGIRADLSVNHLIQDGTAIGMGLANAVGRLKDVKEG---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-----------NGQ 376 K +I LTDG NN +++ + A++ +++ TI + + + Sbjct: 192 --SKVVILLTDGSNNVG--DISPLTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYK 247 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + + Y + L ++++I + Sbjct: 248 TLQDIAMQTDGEFYRAQSRAELSQIYKDIDK 278 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 69/221 (31%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A KN F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKNVASEFIA-------DRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + A+ I++ TI I + Sbjct: 192 --SKVVILLTDGSNNMG--DISPMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L ++ Y N L ++++I Q Sbjct: 248 VNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 76/252 (30%), Gaps = 66/252 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+S SM + +D K ++ ALKN F+ Sbjct: 81 DVSTKTKTTRGIDIVMAIDVSASM---------------LAKDLKPNRLEALKNVASEFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAM 298 + + +GL+ Y + V + + +++I T + Sbjct: 126 K-------GRPNDRIGLVEYAGESYTKTPITSDKSIVLRSLQEIRYNNIIEGGTAIGMGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L K + K II LTDG NN+ N + A Sbjct: 179 ATAVNRLKDSKAK------------------SKVIILLTDGVNNSGSIN--PKIASELAV 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK TI + + + L + + ++ N Sbjct: 219 EFGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIAQVTGGKYFRATNN 278 Query: 396 DSLIHVFQNISQ 407 L ++ I++ Sbjct: 279 KKLAEIYDEINK 290 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 69/221 (31%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A KN F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKNVASEFIA-------DRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + A+ I++ TI I + Sbjct: 192 --SKVVILLTDGSNNMG--DISPMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L ++ Y N L ++++I Q Sbjct: 248 VNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 33/259 (12%), Positives = 80/259 (30%), Gaps = 54/259 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 I S + + R I + VD+S SM + Sbjct: 64 TIGVLIASMALPAMPV-QQPRERATIMVAVDVSLSMAAND---------------IDPNR 107 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A K + F++++ + +GL+ +++ P+ + V + + Sbjct: 108 LEAAKKSAQGFVETL------PDRFNVGLVAFSSTATVVSSPTHDHQAV---IGSIENLQ 158 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + + + ++S + + I+ L+DGEN + + Sbjct: 159 LGPGTAIGEGVFASLESISSFDEDADVDPP------------PSAIVLLSDGENTSGRDI 206 Query: 348 VNTIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVV 393 + + E + + TI+ A + + L + + Y Sbjct: 207 SQAVAMAA---EQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGGHFYEAE 263 Query: 394 NADSLIHVFQNISQLMVHR 412 + L V+++I + Sbjct: 264 SETELDEVYEDIGSSLGTE 282 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ LV+D+S SM ++AA K A F D + Sbjct: 95 NRAVVMLVIDVSESMASTDVP---------------PNRLAAAKEAGKQFADQLTPA--- 136 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ V+ + T + + A Q + + Sbjct: 137 ---INLGLVEFAANATLLVPPTTNRAAVKAGIDSL---QPAPKTATGEGIFTALQAIATV 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 I+ +DG N A+ ++I Sbjct: 191 GSVMGGGEGP----------PPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQIS 240 Query: 366 TISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TIS + Q L K C + ++ + DSL +V+ + + + Sbjct: 241 TISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIG 299 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 78/265 (29%), Gaps = 69/265 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + I + +D+S SM + +D K ++A+LK Sbjct: 78 TEDISTKTKTTKGIDIVMAIDVSSSM---------------LARDLKPNRLASLKKVAAD 122 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + +GL+ Y + V + + T Sbjct: 123 FIKK-------RPNDRIGLVVYAGESYTKTPITSDKGIVLNALKEITYGSLEDGTAIGMG 175 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + L S K II LTDG NN+ + + A Sbjct: 176 LATSVNRL------------------KESKALSKVIILLTDGINNSG--FIEPQTAAELA 215 Query: 358 KENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVN 394 E IK TI + + + L + ++ ++ N Sbjct: 216 VEYDIKTYTIGLGTNGNALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYFRATN 275 Query: 395 ADSLIHVFQNISQ----LMVHRKYS 415 +SL ++ I++ + KY+ Sbjct: 276 NESLASIYDEINKLEKTEIEEFKYT 300 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 66/220 (30%), Gaps = 43/220 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + VDLSGSM + K ++ +K+ LL FL + Sbjct: 95 MMVAVDLSGSMEARDF---------VDLKGDKTRRIDGVKSLLLDFLAQ-------RASD 138 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GLI + + + + + T A+ A + Sbjct: 139 RVGLIAFGDAAYLQAPFTEDKGALSLLLKEMDVRMAGAGTALGDAIGVAVNHFSHS---- 194 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K ++ LTDG + + + ++ A + I I I+I Sbjct: 195 --------------DTDNKVLLLLTDGNDTSSEFP--PLEAARYAAQQGIVIYPIAIGDP 238 Query: 373 P-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + L + + + + ++ V+ I Sbjct: 239 ANVGEDSLDIEMLQQIADITYGQVFEAQDGEAFTQVYSII 278 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 64/187 (34%), Gaps = 25/187 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ +KN +L +D ++ + +I + + ++ + ++ Sbjct: 250 NSGSMDGEKIENVKNTILQLIDMLNDHD------RLSIITFNSYAKQLCGLRKVNKDNKE 303 Query: 279 YVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + S T+ T ++ A+ IL + K+R+ ++ + L+ Sbjct: 304 NLQKITKSIQADGGTNITSGLQTAFSILQNRKQRNSVSS----------------VFLLS 347 Query: 338 DGENNNFKSN--VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DG++NN S + +E I + +G + + Y V Sbjct: 348 DGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYYVERN 407 Query: 396 DSLIHVF 402 D + F Sbjct: 408 DQVDEFF 414 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ LV+D+S SM ++AA K A F D + Sbjct: 95 NRAVVMLVIDVSESMASTDVP---------------PNRLAAAKEAGKQFADQLTPA--- 136 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ V+ + T + + A Q + + Sbjct: 137 ---INLGLVEFAANATLLVPPTTNRAAVKAGIDSL---QPAPKTATGEGIFTALQAIATV 190 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 I+ +DG N A+ ++I Sbjct: 191 GSVMGGGEGP----------PPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQIS 240 Query: 366 TISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TIS + Q L K C + ++ + DSL +V+ + + + Sbjct: 241 TISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIG 299 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 22/215 (10%), Positives = 59/215 (27%), Gaps = 4/215 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 +F+ + + ++I A+ + +G + W+ K+ ++ A + A+ A + Sbjct: 11 LFRFFRSLAKDQGGSVAVIAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAASV 70 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + R + + A + + I +G +A ++ SR Sbjct: 71 RYRAGD----ARALMETAALRSARVTGYQPSIGTITTGVQAGSTAGSGTVSVELTETHSR 126 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + S + + L V + Sbjct: 127 LFSSVFTTDPVVLSARAVAELKGGSRACVLALSPTAPGAVTVTGSTDVQLNGCSVVSNSN 186 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 FL+ L + + Sbjct: 187 ASDAFLMRNGSALMSTDCVYTVGGAITTTGLNLTG 221 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 70/221 (31%), Gaps = 39/221 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R I L +D SGSM+ + S + + ++ ++ F+ Sbjct: 86 NRNGIDIVLSLDGSGSMNASGFSKE----------EPRLSRFEVVQKIASDFVMK----- 130 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + +G++ + ++ E V + + + + T + + L Sbjct: 131 --RIEDNVGVVLFGDFAFIATPVTYEKEIVSEMIGYLSHGMAGQNTAIGEGIAMGVRALR 188 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + K II LTDGE+N+ + + + I++ T Sbjct: 189 DSKAK------------------SKVIILLTDGEHNSGSIS--PKEAVAMVGKEHIRLYT 228 Query: 367 ISINA-SPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 I I LLK + N L V+ I Sbjct: 229 IGIGQKGEFDNALLKQLAHDGHGKFFAAANEKELQSVYDEI 269 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 59/212 (27%), Gaps = 49/212 (23%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + A ++ + + + +GL+ + + ++Q + Sbjct: 106 PNRFEAARDVASEMIA-------ARPNDNIGLVVFAGESFTLCPLTVDHNVIQQMLETTE 158 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A L S K II LTDG NN Sbjct: 159 IGQLEDGTAIGLGLATAINTL------------------RGSDNKSKVIILLTDGSNNAG 200 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKTCV 383 ++ + A++ I+I T++ + + L Sbjct: 201 --DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAE 258 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +Y + L +++ I + + + Sbjct: 259 QTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|251789655|ref|YP_003004376.1| hypothetical protein Dd1591_2051 [Dickeya zeae Ech1591] gi|247538276|gb|ACT06897.1| conserved hypothetical protein [Dickeya zeae Ech1591] Length = 526 Score = 83.4 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 43/510 (8%), Positives = 111/510 (21%), Gaps = 108/510 (21%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIY-------------------- 42 L + F ++ E+ S + L +M LL ++I Sbjct: 4 LRAHFGDSLRRFWRQERGAGSAFYVLGMMGLLLTGAYVIDTVQTTGDASQIKRAADAAAL 63 Query: 43 -------------VLDWHYKK-------------NSMESANNAAILAGASKMVSNLSRLG 76 ++ + + +AA ++ SN R Sbjct: 64 AVGREALSKNNSDFSAADRQRLAWEYVRSNLGMNSKLAQQLDAADISVTEGRRSNARRTF 123 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV------------NS 124 S ++ + + I + ++ N Sbjct: 124 TVSVSFASQPVLLKVAQRQHEIYSTVEVIYRPSEVALVLPNTSTETAAELAVLRRLGKNF 183 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRI 183 + + + RL + ++ ++ R+ + + + Sbjct: 184 ATDFLADYPDRRLALVPYSQSVSVYDDANWSSRIRNWSLSGALKPVELSSLFRNNDYGIA 243 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-LFLDSI 242 + R + + D +AP + + NA ++ I Sbjct: 244 NLASRMMPDLR--SKRMCLYRGLNQGDNYFWETAPAGSFRVHYRHDLPINAPDMPAIEWI 301 Query: 243 DLLSHVKEDVYMGLIGY-----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + Y + + + + + T+ A Sbjct: 302 GPNPDFGQANGVNDTRYIIGDKGCPTAALLPLTDDLNAISARLDQMEPQF---NTNYAIA 358 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--------- 348 M A L + S + + G + Sbjct: 359 MGWAAMSLAPSFRGSDGWGDGEYPLDFNDNGSGNIKAIVMMGNTTGDWFDTDSYNAYVGE 418 Query: 349 ------------------NTIKICDKAKENFIKIVTISI--NASPN---------GQRLL 379 +C+ + + I + + L Sbjct: 419 AIDGDGQGGDLGKQAATLRFRSLCNSFRSRDLLFYFIGVRPGDPEDFGRTLFGAVAVPGL 478 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + C ++ + ++ S IS+L+ Sbjct: 479 QACATNDGGYIRFADSSSFSGAEGQISELL 508 >gi|86147474|ref|ZP_01065786.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] gi|85834767|gb|EAQ52913.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] Length = 460 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 42/451 (9%), Positives = 115/451 (25%), Gaps = 62/451 (13%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + +++ +++ LL++G + + K+ + +A + A+ R Sbjct: 13 RGLVALMSIIALPFILLVVGLSVDAGRAYIVKSKLFAA--VDAASIAAARAVANGEDAGR 70 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + + D + + + S + +I I + + + + Sbjct: 71 AAAQKYFSANIPADFYSA-TPSLGAVNFAYDSFGNISIDISATAQVPTIFLPLIGLDTFN 129 Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI----PALLRIEMGERPIFLIE 194 + + L I + G I L++ Sbjct: 130 PGVSAQSIRR--PVDLVLVIDNTTSLRLGSIGDVTQDVIDRSKSFVENFHEGFDRISLVK 187 Query: 195 LVVD--LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + S + T A NA +++ L V + Sbjct: 188 FAFGAEVPVGFNATRGHSRSSIKSEIDSFNFGSTSNAQYTNASEGMYRALNELRTVTDPA 247 Query: 253 YMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK----PTDSTPAMKQAYQILTS 307 + ++ +T ++ E + S + Sbjct: 248 NLKVIVFFTDGAPNTFATTFDFEGGGTHTGAIRSSDGSSGRPRGLWRHDTIATTLPGGYE 307 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIF--------LTDGENNNFKSN-----------V 348 + + + + +F I + N+ +N Sbjct: 308 GRDIDDYISEIPEHYTAHDTNATEFKILNPTHAVRPVAQYAPNSHSANDLYTRVNRVARN 367 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQR----------LLKTCVSS------------- 385 I +KA+ I + T+ + +S LL+ SS Sbjct: 368 LVEDIAEKARTEDIYVFTLGLGSSLTSATGPDSEFGEDLLLRMANSSALLDDPDLSSDYD 427 Query: 386 ----PEYHYNVVNADSLIHVFQNISQLMVHR 412 + + ++ ++L F + +++ Sbjct: 428 PAQLEGVYCHAIDEEALGPCFDKMLDVIIRL 458 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 9/81 (11%), Positives = 29/81 (35%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + ++ + + NF+++ A++ + + I +++ ++SA +AA L Sbjct: 9 HMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVRAYG 68 Query: 70 SNLSRLGDRFESISNHAKRAL 90 + + Sbjct: 69 EGATEEEAYDAAQKFFWSNYA 89 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 18/148 (12%), Positives = 46/148 (31%), Gaps = 13/148 (8%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + + + ++ +I+FAL + L + + H K +++A +AA L GA Sbjct: 4 THHSSRRLHRQRGGTAILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHS 63 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + + A ++ A+ +++++ A N Sbjct: 64 LSDAGGQP---------YNWSAAVNAAQNVVQSNVANGAHIQDATIET----GYWNLQNP 110 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYD 155 S+ + + Sbjct: 111 SLGLRPAGTGSVPAAGDVPAVRTTVAIS 138 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 67/221 (30%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A KN F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKNVAAEFI-------SGRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L S Sbjct: 148 HTSLLNLLRNVRTDIAARGLISDGTAVGMGLANAVSRL------------------KDSK 189 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + AK I++ TI + + Sbjct: 190 TKSKVVILLTDGSNNMG--DISPMTSAQIAKSLDIRVYTIGVGTNKVAPYPMSVGGGTQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y N L ++++I + Sbjct: 248 INIPVEIDSKTLSDIAAVTEGNFYRATNNQQLKQIYKDIDK 288 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 74/252 (29%), Gaps = 66/252 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+S SM + +D + ++ ALK+ F+ Sbjct: 81 DVSTKTKTTKGIDIVMAIDVSASM---------------LARDLRPNRLEALKDVASEFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAM 298 + + +G++ Y + V + S + T + Sbjct: 126 Q-------GRPNDRVGIVLYAGESYTKTPITSDKSIVLGALNDVKFSEVLENGTAIGMGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L S K II LTDG NN+ + + A Sbjct: 179 ATSVNRL------------------KDSKALSKVIILLTDGVNNSGTID--PKLASELAV 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK TI I ++ + L + + ++ N Sbjct: 219 EYGIKTYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVTGGQYFRATNN 278 Query: 396 DSLIHVFQNISQ 407 L +++ I++ Sbjct: 279 QKLEAIYEEINK 290 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 59/212 (27%), Gaps = 49/212 (23%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + A ++ + + + +GL+ + + ++Q + Sbjct: 106 PNRFEAARDVASEMIA-------ARPNDNIGLVVFAGESFTLCPLTVDHNVIQQMLETTE 158 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A L S K II LTDG NN Sbjct: 159 IGQLEDGTAIGLGLATAINTL------------------RGSDNKSKVIILLTDGSNNAG 200 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKTCV 383 ++ + A++ I+I T++ + + L Sbjct: 201 --DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVEADVQIDEGTLRHIAE 258 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +Y + L +++ I + + + Sbjct: 259 QTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 69/275 (25%), Gaps = 66/275 (24%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + + I L +D+SGSM ++ A + Sbjct: 69 SIGAARPQATVPITSNS-TTIMLALDVSGSMCSTDVP---------------PNRITAAE 112 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F+ +GL+ ++ + P+ ++K+ + + + T Sbjct: 113 KAATAFIK------AQPAGSRIGLVTFSGIAGLLVPPTTDSQKL---LDALQNLTTSRGT 163 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + + T G I+ LTDG N Sbjct: 164 AIGQGILTSIDAIADADPSVAPTGSAVSGNGTG-PYAADVIVVLTDGANTQGVD---PQT 219 Query: 353 ICDKAKENFIKIVTISINASP-------------------------------------NG 375 +A +++ TI + + Sbjct: 220 AAKQAAARRLRVYTIGFGTTTPAPMVCGSSQVGGFGGFGGFGGFGGGGRLGDRSPLVIDE 279 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 Q L ++ +Y NA L + + + Sbjct: 280 QALRDVAATTGGTYYRAQNAGQLQDALGTLPRNIT 314 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 78/258 (30%), Gaps = 51/258 (19%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 V+ ++ R + LV+D+S SM + T++ Sbjct: 69 MLLTVALAGPTAEQKV-PRNRATVMLVIDVSLSMKATDV---------------QPTRLE 112 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A + A F + + + +GLI + + P+ V Q + + Sbjct: 113 AAQVAAKSFAEGLTPG------INLGLISFAGSATVLVAPTTDRSAVSQGIDGLKLAQ-- 164 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A+ A + S K + ++ +TDG+ Sbjct: 165 -STATGDAIVAALSAIDSFGKVVGGADGP----------PPARVVLMTDGKETVGT--RK 211 Query: 350 TIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AKE I I TIS + + + + S + +A Sbjct: 212 ATDAAGDAKEAGIPISTISFGTERGSVDINGKAQEVPVDDESMKEIAKISGGEFFKAASA 271 Query: 396 DSLIHVFQNISQLMVHRK 413 + L V+ + + + + K Sbjct: 272 EELRRVYDTLGEQIGYEK 289 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 33/309 (10%), Positives = 80/309 (25%), Gaps = 6/309 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + AN ++IFA++++ L +G + ++ M++A ++A+LAG + Sbjct: 4 RFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAGVLEPTDAA 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 +N A + L G + T+ ++ +++ Sbjct: 64 KIARASAAFTANFQPSWSAATASFTVNG---GELGGSANSTVPTQFLGVLGIKTMTIAAQ 120 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + + + + L + S I G Sbjct: 121 SAAVATAKRKVCILLTASPDAQAFLVNSGAKLTGPTCEIHVASTAGVAGMINSGTTLDVG 180 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + GS P + + + F + +V Sbjct: 181 KVCI---KGSATQNGGPYPVVSTGCAAIANPFVGALPPVSAPGCSFTNQTYSAGNVTLSP 237 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + G ++ + T T + LT + + Sbjct: 238 GTYCGSTNFNGSGTLTLRPGLYVIKSGAMIFNSGWTVSGTGVTFYLADQNATLTFNGNVT 297 Query: 313 FFTNFFRQG 321 + G Sbjct: 298 AKLSAPSSG 306 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 83.0 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 32/359 (8%), Positives = 96/359 (26%), Gaps = 31/359 (8%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 + + N + A+ ++ L +I F + V + + ++ + +AA LAGA +V Sbjct: 21 ARRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQ 80 Query: 77 DRFESI----------SNHAKRALIDDAKRFI----------KNHIKESLSGYSAVFYNT 116 + + + + +F+ S + Sbjct: 81 AKATEYLVRNPVGSSMTFVNEEEVPAKIAQFVAEHGDDFEVEAGEWNASTRSFETTNTLP 140 Query: 117 EIQNIV----NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 ++ + + +++ M L F + + + Sbjct: 141 STLSVSMEYPTMPTFFGKILGKDSFSIRASSVA--MYQPRDIMVVLDFSGSMNDDSTFEA 198 Query: 173 IVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + + + + I L+ A + + + + Sbjct: 199 FGKLGRSWVESNLQQCWADIGN-PTYGSLTFEPKWANCKGAVPTDGSKPQIYVEYRNTSV 257 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + + S + GL T + +++ + + ++L + Sbjct: 258 YVTSTLNLENVVLQFSGGTRQTFSGLSAKTG--TFQGSSTNSGKQITKVWVKSGNNLSGE 315 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ + + + K++F N + + ++ N Sbjct: 316 GTNYGEPFDFSSSNMNNMVKKAFGLNSVSYPYNGSWDAYIDYCEQSSNSNKNAGYRYKY 374 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 42/356 (11%), Positives = 83/356 (23%), Gaps = 54/356 (15%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 N + + + +S+ Sbjct: 252 YRNTSVYVTSTLNLENVVLQFSGGTRQTFSGLSAKTGTFQGSSTNS-----GKQITKVWV 306 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 + T+Y F N +F + I+ S Sbjct: 307 KSGNNLSGEGTNYGEPFDFSSS---NMNNMVKKAFGLNSVSYPYNGSWDAYIDYCEQSSN 363 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMA--------------ALKNALLLFLDSIDLLSH 247 S A + + R A ALK++L +F+D I + Sbjct: 364 SNKNAGYRYKYGYLNLMNYWLESRRSYAQTPVLWKTHAQPVRALKDSLAIFMDFITEVEV 423 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + E+V + + T+ + A L Sbjct: 424 QDRVGLAVYNAPNGEGMVEVPLTLEVEQVATIANQRQAGHYHEYTNIGGGLNAARLHLDQ 483 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-------VNTIKICDKAK-- 358 K I+ +TDG+ N + I + Sbjct: 484 ----------------HGRPNAFKMIVLITDGQANWRNGSYSIANAENYLISEANLCAHD 527 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-------DSLIHVFQNISQ 407 +VT+S+ + + + + + H+NV D L F+ I++ Sbjct: 528 SRKYPVVTLSLGTNADTDIMEQVATITNSTHFNVPGGSTIEQYHDQLSETFRKIAK 583 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 83.0 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 77/244 (31%), Gaps = 51/244 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + LVVD+S SM C + T++ A + A + F+D ++ Sbjct: 96 PKNRATVVLVVDISLSMVCDDV---------------RPTRVDAARQAAIKFVDEMEPT- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL 305 + +GL+ + + I PS E V+ + + L T + + A Q + Sbjct: 140 -----LQLGLVTFAGTAQTLIAPSSDHEIVKHALDEAIRPDKLAARTATGEGIYTALQQI 194 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFI 362 + + S I+ +DG+ +AK + Sbjct: 195 ET----------LSSILGGKSKAPSARIVLESDGKETVPDDLNAPRGAFTAAKEAKAKEV 244 Query: 363 KIVTISIN----------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I +IS + L K S + + D L V+ +++ Sbjct: 245 PIYSISFGTTRPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTAGSLDQLSDVYSSLN 304 Query: 407 QLMV 410 + Sbjct: 305 AEIG 308 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 83.0 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 11/155 (7%), Positives = 46/155 (29%), Gaps = 4/155 (2%) Query: 11 FKKGIAS--EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ E+ + + A+ +++ L + F + V + + +++ +++ + A Sbjct: 1 MRRLKHDDRERGAVAPMTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQAC 60 Query: 69 VSNLSRL--GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 N + + ++ A +D + + + + + Sbjct: 61 AQNAASADCAAPTSAATSLAGLNALDGVSNAPQASVNLGTGTVDVTTQSRNTSGDNHFTL 120 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + + + + + L F Sbjct: 121 VFARALGVETANIQASAQAKFGGFSATDAIPLTFS 155 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 83.0 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 64/187 (34%), Gaps = 25/187 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ +KN +L +D ++ + +I + + ++ + ++ Sbjct: 1612 NSGSMSGQKIENVKNTILQLIDMLNDND------RLSIITFNSHAQQLCGLRKVNKDNKE 1665 Query: 279 YVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + S T+ T ++ A+ IL S K+R+ ++ I L+ Sbjct: 1666 NLQKITKSIYANGGTNITSGLQTAFSILQSRKQRNSVSS----------------IFLLS 1709 Query: 338 DGENNNFKSN--VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DG++NN S + +E I + +G + + Y V Sbjct: 1710 DGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYYVERN 1769 Query: 396 DSLIHVF 402 D + F Sbjct: 1770 DQVDEFF 1776 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 83.0 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 76/240 (31%), Gaps = 54/240 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D+SGSM + + +++ + + FLD + + Sbjct: 88 DIILVLDISGSMLAEDYEIDQK----------RVSRLDIVLEVVKTFLDK-------RTN 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R ++ +++ + + I T A+ A L K+ Sbjct: 131 DRIGLVAFAGRAYTVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSRLEGKKES 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F+I LTDG NN N+ I+ A + + TI Sbjct: 191 G------------ERKKIGSFLILLTDGANNCG--NLTPIEAARLAAHAAVPVFTIGAGI 236 Query: 372 SP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + ++ ++++++ FQ I Sbjct: 237 NGEVTMPVMDEERRKIGSQTVVSEVDEGLLRNIAQLTGGEYFRATDSNAIVSAFQAIDAQ 296 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 72/241 (29%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R ++ LV+D+S SM +M A + A F D + Sbjct: 93 PRNRAVVMLVIDVSQSMRATDVE---------------PNRMVAAQEAAKQFADELTPG- 136 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI Y + P+ + + + + + T + A+ A Q + Sbjct: 137 -----INLGLIAYAGTATVLVSPTTNRDATKNALDKLQFADR---TATGEAIFTALQAI- 187 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 + P I+ +DG+ + N AK+ + Sbjct: 188 ---------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVP 238 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + + L K S YN L V+ ++ Q + Sbjct: 239 ISTISFGTPYGFVEINGQRQPVPVDDETLKKVAQLSGGNAYNAATLAELKSVYASLQQQI 298 Query: 410 V 410 Sbjct: 299 G 299 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 59/212 (27%), Gaps = 49/212 (23%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + A ++ + + + +GL+ + + + + Q + Sbjct: 106 PNRFEAARDVASEMIA-------ARPNDNIGLVVFAGESFTLCPLTVDHDVILQMLDATE 158 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A L S K II LTDG NN Sbjct: 159 IGQLEDGTAIGLGLATAINTL------------------RGSDNKSKVIILLTDGSNNAG 200 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP---------------------NGQRLLKTCV 383 ++ + A++ I+I T++ + + L Sbjct: 201 --DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVEADVQIDEGTLRHIAQ 258 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +Y + L +++ I + + + Sbjct: 259 QTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 80/267 (29%), Gaps = 73/267 (27%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E I L +DLSGSM ++ T++ A K + F+ Sbjct: 84 QEAEGIDIVLALDLSGSMRALD----------LSTRENIVTRLDAAKEVVQEFIGK---- 129 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQI 304 + +GL+ + + + +++ V R I T A+ + Sbjct: 130 ---RPHDRIGLVAFAADAFVVSPLTLNHDWLKKNVQRLELGDINLSGTAIGTALGASVNR 186 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + +I LTDGENN+ + + + AK +K+ Sbjct: 187 L------------------RDHESRSRIVILLTDGENNSGTLS--PLSAAEAAKSLNVKV 226 Query: 365 VTISIN--------------------------------ASPNGQRLLKTCVSSPEYHYNV 392 TI+ + + L + + + Sbjct: 227 YTIATGRKGRVEVAEMSRDGRVIRDRNGNPLYRGRSELSDYDESELREIAKLTGGQFFKA 286 Query: 393 VNADSLIHVFQNISQLMVHRKYSVILK 419 + L +++ I +L K +V L+ Sbjct: 287 SESGDLENIYDEIDEL---EKTTVELR 310 >gi|260778728|ref|ZP_05887620.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] gi|260604892|gb|EEX31187.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] Length = 463 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 37/395 (9%), Positives = 85/395 (21%), Gaps = 37/395 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I K ++I ++ +L G I K + +A A+ Sbjct: 10 IRKSKGLVALISVIAAPFLILATGTAIDSGRAFLVKAKLFAA--VDAAGIAAARAVAEGE 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 R +I + D + S + ++ S + + Sbjct: 68 DAARDAAIKFYNANLPTDYH-DSTTASPTVTFGYDSFGNISIDLSASAEVSTTFLGVFGH 126 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR----IEMGERPI 190 + L+ S + + L + I Sbjct: 127 SSLEISA--TAQTVRRPVDLVLVVDNTTSLRLGSIGDVTDDVVARSKDFITNFNESFDRI 184 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVN----------------SAPICQDKKRTKMAALKNA 234 L++ + + + + + AL+N Sbjct: 185 ALVKYAYGAEVPVAFQSSRGHSRSDITTEIDAFDFGSLSSLQYTNSSEGIYLALDALRNV 244 Query: 235 LLL----FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV---TRDMDSL 287 + + +G +T +I S + + + L Sbjct: 245 TDPANLKVIVFFTDGAPNTFASEFDFVGTSTNYTGSIRSSDESSGTPRGLWYHDAIATQL 304 Query: 288 ILK---PTDSTPAMKQ--AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D + + Y S F + + Sbjct: 305 PGSGYEGSDIDDYIAELPDYYTAHSASATEFNVLNPSHPDRPVTQYNPSTHSANDLYIRV 364 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 N + I A+ I + T+ + +S Sbjct: 365 NRVARNLLEDIATAARGEDIYVFTLGLGSSLTSST 399 >gi|42407699|dbj|BAD08847.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408121|dbj|BAD09261.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] Length = 703 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 57/186 (30%), Gaps = 17/186 (9%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+A LK A+ L + ++ ++ + I + +E + ++S Sbjct: 285 KLALLKRAMGLL----------GPGDRLAVVSFSYSARRVIRLTRMSEGGKASAKSAVES 334 Query: 287 LILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T+ + +A ++ + R+ + + G +N+ Sbjct: 335 LHADGCTNILEGLVEAAKVFDGRRYRNAVASVILLSDGQDNYNVNGG-----WGASNSKN 389 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN- 404 +V ++ + + + T + + + + N + F Sbjct: 390 YSVLVPPSFKRSGDRRLPVHTFGFGTDHDASAMHTIAEETGGTFSFIENQAVVQDAFAQC 449 Query: 405 ISQLMV 410 I L+ Sbjct: 450 IGGLLS 455 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 73/238 (30%), Gaps = 54/238 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P I L +D+S SM ++ A + AL +F+ + Sbjct: 83 DPRTTIVLALDVSRSMRATDVL---------------PSRFEAAREALKVFIREL----- 122 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +GL+ ++ + + P+ +++ V T + + Q L Sbjct: 123 -PQGARIGLVTFSRAATEVVAPTTNRQRLLDSVELIGLEF---GTAIGEGILTSLQALPP 178 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 Q II LTDG + + ++ A E ++I TI Sbjct: 179 ----------LEQRKDAKDPSELATIILLTDGRSISGID---PLEAARIAAEQKVRIHTI 225 Query: 368 SINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + L + + ++ V +A L +Q +SQ Sbjct: 226 GVGRVTEGPVPGLESVYQWAAYFDEDVLKQIAAITGGKYFFVNSAGKLRETYQQLSQS 283 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 70/243 (28%), Gaps = 48/243 (19%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I +DLSGSM + +++ ALK Sbjct: 82 RTIAASPAQPASGRDIVFALDLSGSMAAEDF----------VLDGHAASRIDALKRVGAA 131 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + +GL+ + R S+ + V + + L+ T Sbjct: 132 LIKR-------RTGDRIGLVIFAERAYAAAPLSFDVDAVSRTLAEIPLGLVGHSTAIGEG 184 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A + LT + I+ L+DG N+ ++ + + A Sbjct: 185 LGLALKRLTES------------------KAPSRVIVLLSDGANDAGTTD--PTGVAELA 224 Query: 358 KENFIKIVTI-----------SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +KI TI + + L + + V + L + I Sbjct: 225 NNLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRLAEIGGGEAFRVRTTEDLAYASAAIE 284 Query: 407 QLM 409 +L+ Sbjct: 285 RLV 287 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 52/155 (33%), Gaps = 5/155 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R+R F + + ++IFAL+ + + L+G I + + +++SA +A +LAG Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + +S + + A + + ++ + E + Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVT-----VASDKTSVEARAEQVIK 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + S + M L Sbjct: 130 LTFGAFVGMASIPISARARASVVGRMRLCMLALDP 164 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 77/239 (32%), Gaps = 54/239 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 P + L VD+S SM + ++ A + AL F+ + Sbjct: 80 QADPKAAVVLAVDISLSMQATDV---------------QPSRFEAARAALRTFIREL--- 121 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 E + + L+ + + + ++ + V +L T A+ ++ Q L Sbjct: 122 ---PEGLRLALVTFARDAHLVVPLTTDRGRLLEAVDFLQLNL---GTAIGDAILESIQAL 175 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + P II LTDG + + +A I++ Sbjct: 176 PP----------LSERAEDPDPRRLATIILLTDGRSLGGVD---PVVAAQEAARQQIRVH 222 Query: 366 TISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI I + + + L + ++ V +A+ L ++++++ Sbjct: 223 TIGIGRTTSGPVPGLPEVYAQAALFDEETLKEVARVGDGQYFYVDSAEKLKEAYRDLTR 281 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 52/155 (33%), Gaps = 5/155 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R+R F + + ++IFAL+ + + L+G I + + +++SA +A +LAG Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + +S + + A + + ++ + E + Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVT-----VASDKTSVEARAEQVIK 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + S + M L Sbjct: 130 LTFGAFVGMASIPISARARASVVGRMRLCMLALDP 164 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 52/155 (33%), Gaps = 5/155 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R+R F + + ++IFAL+ + + L+G I + + +++SA +A +LAG Sbjct: 15 RWRQPFARLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGG 74 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + +S + + A + + ++ + E + Sbjct: 75 NALKLVVSSSESIVGLTTQTIQAEAKAGADAPVSIQVT-----VASDKTSVEARAEQVIK 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + S + M L Sbjct: 130 LTFGAFVGMASIPISARARASVVGRMRLCMLALDP 164 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 808 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 844 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 845 NQVRFGALKYADDPEVLFYLEDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHMFTEAR 904 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 905 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 949 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + ++ V L +F +++ + + Sbjct: 950 -AGANPAELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 989 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 49/182 (26%), Gaps = 19/182 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1015 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 1074 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1075 FGNTHIGAALRKVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1117 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 1118 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 1175 Query: 409 MV 410 + Sbjct: 1176 IC 1177 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 56/175 (32%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + V +G + Y + E + + + + + T++ Sbjct: 458 TFLSAVVGMFDIAPHKVRVGAVQYADSWDLEFEINKYSNRQDLGKAIENIRQMGGNTNTG 517 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 518 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 559 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I++ I + Q L+ + Y V + D+L + + Q + Sbjct: 560 RLREEHIRVYAIGVK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEIC 612 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 642 MKTFMKNLVSKSQIGPDQVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 699 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 700 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 740 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 741 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 797 Query: 411 HRK 413 + Sbjct: 798 SPR 800 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/343 (10%), Positives = 82/343 (23%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ + S S E + + + + Sbjct: 110 KALQEAHRTYFSAPADGRDKKQFPPILVVVASSESED-DVVEASKALQKDGVKIISIGVQ 168 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I G +E Sbjct: 169 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVTKYKE----GAVDDIFVEA 222 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 S + + + + LK +S+ L + + +G Sbjct: 223 CQGPSVADVVFL---------LDMSINGSEENFDYLK---GFLEESVSALDIKENCMRVG 270 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 271 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 319 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 320 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 373 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 374 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 415 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 44/353 (12%), Positives = 95/353 (26%), Gaps = 51/353 (14%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A+L S +V + ++ + K + K+ I + Sbjct: 68 AVLQPQSVVVQKQPLAQNPVQAKTIQPKTVVPKSLTSIQKSDISIEKQEKIQKLDTKTML 127 Query: 120 NIVNSSRISMTHMANNR---------LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 + + + L T+ + D + + + Q Sbjct: 128 QEQEIKPDLQQMVKDAKKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVSVKTQDST 187 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 I+ ++ RP + V+D SGSM K+ Sbjct: 188 NDILEEQKEQVKQVEQSRPSIDLVCVIDNSGSMQ--------------------GEKIQN 227 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-IL 289 +K LL LD ++ + LI + + ++ + ++S+ Sbjct: 228 VKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNLRKVDDENTPNIQSIINSITAD 281 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD M A+ IL + + ++ F + +K ++ Sbjct: 282 GGTDINSGMLMAFNILQKRQFFNPVSSIFLLSDGQDNGADEKIKKYINSN---------- 331 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K I + + +G + + C Y V + + F Sbjct: 332 -----QSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFYYVEKINQVDEFF 379 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 44/371 (11%), Positives = 111/371 (29%), Gaps = 63/371 (16%) Query: 53 MESANNAAILA-GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + A ++A+ A ++ +L+ +R++ + +++ + + Sbjct: 34 LHGAPDSALPATAVAEENLSLTENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVR 93 Query: 112 VFY---NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 F ++ V I N L + + + + S + Sbjct: 94 RFLNSGKQPPKDAVRIEEIINYFPYNYPLPTDGRPFAVHTETIDSP------------WQ 141 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 K++ + P + +VD+SGSM K+ Sbjct: 142 PEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEE-------------------NKL 182 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ L + + + LI Y++ E + P+ G++K + Sbjct: 183 PLVQKTLRILTQQLRPQD------KVTLITYSSGEELVLPPTSGSDK-ETILKAIDKLKA 235 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++ AY+ + I+ TDG+ N S+ Sbjct: 236 EGSTSGESALRMAYEEAQKAFVPNGIN----------------RILLATDGDFNVGVSDT 279 Query: 349 NT-IKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNIS 406 T + + ++ I + T+ + +++ + + + N + + Sbjct: 280 ETLKSMVAEKRKTGISLSTLGFGTDNYNEDMMEQIADAGDGNYSYIDNE---KEAKKVLQ 336 Query: 407 QLMVHRKYSVI 417 Q + +V Sbjct: 337 QQLTSTLATVA 347 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 82.2 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 72/249 (28%), Gaps = 76/249 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D K +++ K + ++ + Sbjct: 91 DIVFAIDVSKSM---------------LAEDVKPSRIEKAKRIISELIELL-------HG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 + I Y + + + + ++ D L + T A++ A + Sbjct: 129 DRIAFIPYAAQAYPQLPLTSDYSAAKIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSNQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K +I L+DGE++ + ++ + KE I+I TI + Sbjct: 189 S------------------SKILIILSDGEDHQQGAT----EMIQEVKEKGIRIFTIGLG 226 Query: 371 ASP---------------------------NGQRLLKTCVSSPEYHYNVVNADS----LI 399 + N L + +++ N L Sbjct: 227 TTQGTTIPIKENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGSNTQQVIDNLQ 286 Query: 400 HVFQNISQL 408 V NI + Sbjct: 287 KVLNNIEKN 295 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 48/360 (13%), Positives = 99/360 (27%), Gaps = 51/360 (14%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + A L S V S + ++ + K A+ F K+ I Sbjct: 62 ISQVIKQASLKPKSVAVQKQSLAQNPIKAKATQLKSAVSKQLTSFQKSDISIQKEEKIQK 121 Query: 113 FYNTEIQNIVNSSRISMTHMANNR---------LDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + + + + L T+ + D + + Sbjct: 122 LDTKTMLREQEIKPDLQQMIKDAKKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVS 181 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + Q I+ ++ RP + V+D SGSM Sbjct: 182 VKTQDSTNDILEEQKEQVKQAEQSRPSIDLVCVIDNSGSMQ------------------- 222 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ +K LL LD ++ + LI + + +K + + Sbjct: 223 -GEKIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNLRKVDDKNTPNIQKI 275 Query: 284 MDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 ++S+ + TD M A+ IL + + ++ F + +K ++ Sbjct: 276 INSITAEEYTDINSGMLMAFNILQKRQFFNPVSSIFLLSDGQDNGADEKIKKYINSN--- 332 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K I + + +G + + C Y V + + F Sbjct: 333 ------------QSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFYYVEKINQVDEFF 380 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 65/221 (29%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K+ F+ + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKDVASEFV-------SGRPSDNIGLTIFAGESFTQCPLTID 147 Query: 273 TEKVRQYVTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + T + A L S Sbjct: 148 HGSLLNLLNNVRTDIAARGLIQDGTAVGMGLANAVSRL------------------KGSK 189 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + A+ I++ TI + + Sbjct: 190 AKSKVVILLTDGSNNMG--DISPLTAAQIARSLGIRVYTIGVGTNKVAPYPMPVAGGVQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y N L ++++I + Sbjct: 248 VNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIYRDIDK 288 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 78/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ + R + LV+D+S SM T++AA + Sbjct: 74 SVALAGPTEDKRV-PRNRATVILVIDVSLSMKATDVE---------------PTRLAAAQ 117 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +A F D + + +GL+ + + P+ E + + S T Sbjct: 118 DAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNREATKVAIDNLQLSER---T 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + Q + + + I+ L+DG+ ++ + Sbjct: 169 ATGEAIFTSLQSIDT----------LAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRG 218 Query: 353 ---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 +AK+ + I TIS + L + S + + Sbjct: 219 GFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSL 278 Query: 396 DSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 279 EELRDVYDTLEEQIG 293 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 69/207 (33%), Gaps = 39/207 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + D ++ KN F++ + + +GL + + Sbjct: 95 DISASMLTNDVTPNRLIVAKNVASDFIN-------GRPNDNIGLTIFAGEAFTQCPLTID 147 Query: 273 TEKVRQYVTRD-----MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + LI T + A L + + Sbjct: 148 HATLINLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLKASNAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ-------RLLK 380 K +I LTDG NN + + AK+ I++ TI + +G L + Sbjct: 192 --SKIVILLTDGSNNVGSIS--PMTAATIAKKFNIRVYTIGLGTEQSGNYNDIDYTTLKQ 247 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQ 407 +++ Y + L+ ++ +I++ Sbjct: 248 IALTTNGEFYRAQSQTELLQIYNDINK 274 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 43/354 (12%), Positives = 83/354 (23%), Gaps = 10/354 (2%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R F + ++ N I A +++ + L+G + + K ++SA +A LA Sbjct: 5 IKRALFILMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALA 64 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G M S + + + A L F + S S T ++ Sbjct: 65 GRRIMGSGRWTDNNGRPNTTALATFDLNFAQNSFGAENRTRSYSESDGTVSGTASADV-- 122 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL----NQRYNQKIVSFIPA 179 M + T + L + K ++ + Sbjct: 123 -PMTLMRVLNVPTKRVEVTCEGQMRIPNTDVMFVLDNSGSMNEVIPGDTTGLKKMAGLQL 181 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +R I V D + K +L Sbjct: 182 AIRCFYEALARQNITDVSAAQCGTSADPTGDLSSQVQLRFGFVNYDNMVNVGKLLPNNYL 241 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D ++ Y +WG S Sbjct: 242 --ADSWTYRSRTATGIQTVYAWTAGTQSATTWGPWSSTPN-NFANPSSYSSTYSVVSGGN 298 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + L + + + E+N S+V T ++ Sbjct: 299 TSTTTLADGLAYVKKAAPTSAAACSALNSYGSYDQLVGISESNGTSSDVTTPEV 352 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 38/385 (9%), Positives = 85/385 (22%), Gaps = 57/385 (14%) Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 R S + D + NI + + Sbjct: 396 RAPFNSANNTYLQTRDGTANKPLSWTAYDRITEWTYGPRTDINIAGLKAGQSAWNSTVTI 455 Query: 138 DSSNNTIFYN----MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + + + + ++ Sbjct: 456 PLAGRVRRQLYLSGSTIRSDIETVADLSVTWDGCIEERQ-----TFFNTDGNPSDDWLNY 510 Query: 194 ELVVDLSGSMHCAMNSDPE----DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + M D + P + + + D + + Sbjct: 511 PSSPSDAIDMDIDRVPDSDPATRWKPLLPNAVWGPKGTLVGNTWSGDYTTDPVKAGENTS 570 Query: 250 EDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + V + + + G + + T + ++ Sbjct: 571 SGDTGRNLSNNSCVTASRKLTNYNGVGGNPSAQDLSNYVGTLVPHNNTYHDIGLLWGARL 630 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG------------------------- 339 ++ + G Q+ +IF+TDG Sbjct: 631 MSPTGIFASENATTGGGA-----QIQRHLIFMTDGATATTVNNYASYGLEWWDRRQIAPA 685 Query: 340 --------ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-QRLLKTCVSSPEYHY 390 +N N +N + +C K I + I +S + L C +SP Y Y Sbjct: 686 GPNDANYDDNLNAVNNARSNALCTAIKNKNITLWVIYYGSSDTATKTRLTNCATSPSYFY 745 Query: 391 NVVNADSLIHVFQNISQLMVHRKYS 415 N LI F+ I+ + + + + Sbjct: 746 EARNTTLLIGKFREIADRISNLRLT 770 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 85/248 (34%), Gaps = 53/248 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 R+ GE I L +D+S SM +++ A K L F Sbjct: 85 FSERLSPGEAKGIDIMLALDISKSMLQEDFDGK--------------SRLDAAKTVALQF 130 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPA 297 +++ + +GL+ + + + + + V D++ T + A Sbjct: 131 IEN-------RRRDRIGLVLFKGKSFTQCPLTLDHDVLSMLVRAASVDAVPESGTATGSA 183 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L S ++ +I LTDGE+N + + + A Sbjct: 184 ILIAVNRL------------------RASESPERVLILLTDGEHNAGEVD--PVTAAGIA 223 Query: 358 KENFIKIV--TISINASPNGQRLLKTCVS--------SPEYHYNVVNADSLIHVFQNISQ 407 ++I T+S+ S +G+ +L + + + +A+SL F I Sbjct: 224 AGEGVRIYMATVSVPGSRSGEDMLASARDLSGEVSRITGGRSFRANDANSLNRTFSEI-D 282 Query: 408 LMVHRKYS 415 + +++ Sbjct: 283 QLEKSRFT 290 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 34/332 (10%), Positives = 82/332 (24%), Gaps = 22/332 (6%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + + + + + S L + Sbjct: 702 FQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARPNVRKFLILITDGE 761 Query: 145 FYNMDVMTSYDY----RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++ + + + + Q + + I + Sbjct: 762 AQDIVKEPAIALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQRVEDDLVF 821 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 G +V D + + + K V G + Y Sbjct: 822 GICSPREECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIGLVKKADVGKNQVRFGALKY 881 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 E K+ D + T + A+ + + T + Sbjct: 882 ADDPEVLFYLDDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHMFTEARGSRL------ 935 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 + + +I +TDGE+++ T ++ I ++ + I A N LL Sbjct: 936 ------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI-AGANPVELL 985 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 SS + ++ V L +F +++ + + Sbjct: 986 AMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 1016 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1042 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGAFIGEKEISFQIENIKQI 1101 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1102 FGNTHIGAALRKVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1144 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 1145 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 1202 Query: 409 MVHR 412 + Sbjct: 1203 ICTT 1206 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 485 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 544 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 545 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 586 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I + Q L+ + Y V + D+L + + Q + Sbjct: 587 RLREEHIRVYAIGVK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 641 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 64/183 (34%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G + ++ ++ + + + ++ +D + + Sbjct: 669 MKTFMKNLVSKSQIGPDRVQIGAVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 726 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 727 TLTGSALSFVSQYFSPT--------------KGARPNVRKFLILITDGEAQD-----IVK 767 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L V ++ + Sbjct: 768 EPAIALRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRVEDDLVFGIC 824 Query: 411 HRK 413 + Sbjct: 825 SPR 827 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/343 (10%), Positives = 85/343 (24%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 137 KALQEAHRTYFSAPANGRDKKQFPP-ILVVLASSESEDDVEEASKALQKDGVKIISVGVQ 195 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I G +E Sbjct: 196 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVTKYKE----GAVDDIFVE- 248 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 249 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 297 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 298 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 346 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 347 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 400 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 401 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 442 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 58/178 (32%), Gaps = 15/178 (8%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 F + V + G + ++ + + + D T Sbjct: 511 TFNGFLKSSDQVGVAYFGGDVPGGYTPRYDVSQTLTNDTL-SANNSIDDLWAYGGTPMGG 569 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 +K A Q L ++ P+ +DG + + I+ + Sbjct: 570 GIKVARQELVANTAPGNIPVMIVLSDGNPT--------LTSDGTASETLAIQEAIEEAET 621 Query: 357 AKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K+ I I TI N + LLK +SP+Y+Y ++ L +++ I++ + Sbjct: 622 TKQTTIGGEQILIYTIGFGNDAN-ETLLKQIATSPDYYYFAATSEELSSIYRQIAKEL 678 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 70/180 (38%), Gaps = 31/180 (17%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ F+DS+ L LI + + E S E+++ V Sbjct: 265 KLAIDNFIDSMTLTD------KTALISFEDEAKLVSEFSDNKEELKGLVN----PYFGGG 314 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVN 349 T+ +++ A + L + + II L+DG N +N Sbjct: 315 TNVRASVEMAIEQLN-----------------TVQHWYTRHIILLSDGDVNININLANNT 357 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + KA +N IKI TI + + + Q+ LK C + ++ A+ L +++++S+ Sbjct: 358 VDDLIKKAVDNNIKIHTIGLGSGADNQK-LKDCAEYTGGQYFTAETAEKLDAIYKDLSKN 416 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 65/188 (34%), Gaps = 25/188 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 +K+ +KN +L L+ ++ + LI + T+ ++ + + Sbjct: 219 NSGSMNDFSKIENVKNTILQLLEMLNEND------RLSLITFNTKAKQLCGLKNVNNQNK 272 Query: 278 QYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + S+ TD ++ A+QIL S K+++ ++ I L Sbjct: 273 KSLQTITKSIKADGGTDIIRGIEIAFQILQSRKQKNSVSS----------------IFLL 316 Query: 337 TDGENN--NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +DG++N + + +E I + +G + K Y V Sbjct: 317 SDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEK 376 Query: 395 ADSLIHVF 402 D + F Sbjct: 377 NDQVDEFF 384 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 65/221 (29%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ A K F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLHPNRIEAAKAVAAEFIA-------GRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + T + A L K + Sbjct: 148 HASLLNLLQNVRTDIAARGLIQDGTAVGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNMG--DLSPMTSAQIAKSLGIRVYTIGVGTNKVARYPMPVTGGIQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L ++ Y N L ++ +I + Sbjct: 248 VNIPVEIDTKTLSDIAATTDGNFYRATNNRELKQIYNDIDK 288 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 28/184 (15%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+ +K + F+ +++ MG++ + ++ + ++ V+ Sbjct: 109 TRFDVVKEIVSNFI-------SSRKNDNMGIVVFGAYSFIASPLTYDSNILKGVVSNLYI 161 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + K T ++ Q +L S K I LTDG N Sbjct: 162 GMAGKFTALFESLAQGVNLL------------------KTSKSKTKIAILLTDG-YNTPD 202 Query: 346 SNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 S D A + +K+ I I + Q++L K + + NA L V+ Sbjct: 203 SEFPFDAAIDFANKQGVKVYPIGIGKSDEYNQKMLEKIAEQTGGVAFGASNASELAIVYA 262 Query: 404 NISQ 407 I++ Sbjct: 263 KINE 266 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 78/219 (35%), Gaps = 51/219 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D T++ A K ++ F+ + + L+ + R ++ Sbjct: 54 DVSPSMMAEDMIPTRLEASKKTMIDFIKK-------RNFDKISLVSFALRASVLSPATFD 106 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + + T + A +L S K+ + +K Sbjct: 107 YTSLEEEIKKIEI-DEEGSTSIGLGIATAVDMLRSVKEDN-----------------EKI 148 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------- 373 II LTDGENN+ + + + A IKI TI I + Sbjct: 149 IILLTDGENNSGEID--PKLASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIR 206 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 N + L+ ++ ++N NA +L +V+ I + Sbjct: 207 ADFTLNEESLIDIAATTGGKYFNAKNASALDNVYNTIDR 245 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++M A + Sbjct: 80 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PSRMVAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + P I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAI----------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTPYGFVEIDDQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELRAVYSSLQQQIG 299 >gi|310799477|gb|EFQ34370.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 698 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 81/259 (31%), Gaps = 29/259 (11%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 L + + I P E + I LV+D+SGSM Sbjct: 46 VQEDAPSEDVALSLHPLAS-----KDGLIAKITPPTQPTEPTDHVPCDIVLVIDVSGSMG 100 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 C P N ++ + + +K+A L++++ +G++ + ++ + Sbjct: 101 CN---APVPANPGEKAENYGLSVLDLVKHAARTVLETLNDGD------RLGIVTFASKAK 151 Query: 265 KNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + K + + ++ + T+ + + ++ + + + Sbjct: 152 VLQKLTPMDAKNKALAEKIINGMRPDDATNLWHGLLEGIKLFNTCGEMNMGRVPA----- 206 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 ++ LTDG N+ N + ++ I T S L Sbjct: 207 ---------MMVLTDGMPNHMCPNQGYVPKLRGMEQLSASIHTFGFGYSLRSGLLKSIAE 257 Query: 384 SSPEYHYNVVNADSLIHVF 402 + + +A + VF Sbjct: 258 IGGGNYSFIPDAGMIGTVF 276 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++M A + Sbjct: 80 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PSRMVAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + P I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAI----------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELRAVYSSLQQQIG 299 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 41/153 (26%), Gaps = 6/153 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ +A ++I LS+ + G + H ++ ++ A + A L A Sbjct: 1 MMRAIRRFLADRTGAVALIIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAG 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ S + + A D I + + + Sbjct: 61 ELSVAGSDVSVEAMARRLAFDSARATD------PGITRVSAAVVGRGTSVTVAINETVQS 114 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + + ++ + + L Sbjct: 115 LFGRLLTLPSMEIGASATAESSGSTRVCLLTLD 147 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 64/237 (27%), Gaps = 27/237 (11%) Query: 29 SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKR 88 L G ++V D K ++ A ++A LA A+ + + + + N Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKIQTSEAEAYARNFVAG 60 Query: 89 ALIDDAKRFIK------NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + + + + +++ S SG S + T + S M + S + Sbjct: 61 QMANYLQSGVDIKGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPLMQAVGFKTQHLSTS 120 Query: 143 TIFY----NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 S L + D Sbjct: 121 GTTVGGHSQTQGSISMFLALDKSGSMGESTATV---------------NEDDPTETFTYD 165 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +H ++ + + TK+ ALK A ++ + V G Sbjct: 166 CN--LHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFSQLNSADPNAQYVRTG 220 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 72/216 (33%), Gaps = 42/216 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D ++ A K A F+D + ++ +GL+ + + PS Sbjct: 93 DVSTSMLAGDVDPDRLTAAKEAGRRFVDGL------PDEFNVGLVAFAGSAAVLVPPSTD 146 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLP 328 E + + R + + + T A+ A + L + + Sbjct: 147 REALHDGIGRLAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPP-------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPN 374 II L+DG N + ++ D+A + + TIS + Sbjct: 193 -PARIIILSDGANTSGMD---PMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVD 248 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 GQ L + + L V+++I + Sbjct: 249 GQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSVG 284 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 72/216 (33%), Gaps = 42/216 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D ++ A K A F+D + ++ +GL+ + + PS Sbjct: 93 DVSTSMLAGDVDPDRLTAAKEAGRRFVDGL------PDEFNVGLVAFAGSAAVLVPPSTD 146 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLP 328 E + + R + + + T A+ A + L + + Sbjct: 147 REALHDGIGRLAEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPP-------------- 192 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPN 374 II L+DG N + ++ D+A + + TIS + Sbjct: 193 -PARIIILSDGANTSGMD---PMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVD 248 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 GQ L + + L V+++I + Sbjct: 249 GQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSVG 284 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 69/464 (14%), Positives = 132/464 (28%), Gaps = 97/464 (20%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLL----LIGFLIYVLDWHYKKNSMESANN 58 ++ R R F+ +I+ A+S++ L+ + + K+ +A + Sbjct: 1 MMKRERKKFRDSF------VAILLAISMVVGLVPVNSITSYAKDKS-AGNVKSQPRAAFD 53 Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRAL---IDDAKRFIKNHIKESLSGYSAVFYN 115 + VS+ S L + + N + +D + S G Sbjct: 54 -------EESVSDESSLETWTKVVENSTENIGRIWVDKTVSDKNIKLPASSQGKEIEIDK 106 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-------R 168 +V S +S S + I D + L + + Sbjct: 107 GTSDFLVGLSALS-----------STSNISTVADKPLDIVFVLDTSGSMSDPMEYIYSPT 155 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN---------SAPI 219 YN + +E I I + A N Sbjct: 156 YNVVTDGRVEYYAEVEGNYVKIDRITGLFGFFKHWEVAGKEVTPKKNESDTNGIQFYTRR 215 Query: 220 CQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTE 274 + ++KM ALK A+ F D ++ + M ++ +++ + + Sbjct: 216 EKPNSQSKMGALKIAVNQFAQETAKRNDSITDAAKQHRMSIVTFSSESYIRQSLKAYNSN 275 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V ++ T + M++A + L K QK +I Sbjct: 276 TVSEFERTINGLNANGATYANLGMEKAKESL-----------------KNVREKAQKVVI 318 Query: 335 FLTDGENNNFKSN----VNTIKICDKAKENFIKIVTISINASPNGQ-------------- 376 F TDG + NTI+ K++ KI +I + N Sbjct: 319 FFTDGTPGRSGFDDDTANNTIQAAKSLKDDLTKIYSIGVFDQANPDNTSSSFNAYMHGVS 378 Query: 377 ---------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L + Y+ +AD L +F+ I + M Sbjct: 379 SNYPNATKWTELGERAENSNYYKAAQDADELNKIFEEIFEEMNS 422 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 68/212 (32%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D + T++ A K F+D + + +GL+ + + S Sbjct: 94 DVSNSMAATDIQPTRLEAAKQGAQAFVDQLPP------RINLGLVSFAGSAAVLVPASTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E VR + T + + Q +T+ + P Sbjct: 148 RESVRSGIRGLQLGPA---TAVGEGIFASLQAITTAG---------ERMSDEGQPPPPAA 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DGE + N + A++ + + TI+ N L Sbjct: 196 IVLLSDGETTRGRPNT---QAATAARDAEVPVDTIAYGTSDGTLDVGGQQIPVPVNEDAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + ++ D L V++ + + Sbjct: 253 RELAEQTGGSYHRATTGDELQSVYRGLGSSIG 284 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/277 (9%), Positives = 80/277 (28%), Gaps = 14/277 (5%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 K+ I E +++I L + + L+ + K + + A+L+ +++ Sbjct: 6 FNQIKRFIQDESGVYAVIGGLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSAE 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 S + + A+ + + + ++ S S + T+ I Sbjct: 66 NNSGRKSNDYKLSNTD--AENGHFNPNSKIGERDLEISKSFVTTYLPQTDPNKIKLQPVC 123 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + T N + +++ + + + ++ ++ Sbjct: 124 TTTDKKNRQGHTASTETICTVAGTIEHKSWFPLKVGSTEVIPTEVNIASNSKAIKKNT-I 182 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + DLSGSM + + +K+ LK L + Sbjct: 183 SIPIDLMVAADLSGSMRYDLENR--------YEPKDGTSKIDILKAVLTELSSNSLFSQE 234 Query: 248 VKEDVYMGLIGY---TTRVEKNIEPSWGTEKVRQYVT 281 ++ + + + + + + + Sbjct: 235 SNDNNRIAVSPFALGAEYSTTECTLPFALKNNNRTIN 271 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 11/71 (15%), Positives = 25/71 (35%), Gaps = 9/71 (12%) Query: 342 NNFKSNVNTIKICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 N +++C+ ++ IV + K CV +Y+ Sbjct: 472 NRRTPLSKDLQMCNIIRDKLNKLNDDKNTSIVFVEFGYKSKSADAWKHCVGD-GNYYSAK 530 Query: 394 NADSLIHVFQN 404 + +SL++ F+ Sbjct: 531 DKESLLNSFKQ 541 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 807 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 843 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 844 NQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEAR 903 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 904 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 948 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N LL SS + ++ V L +F +++ + + Sbjct: 949 -DGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 988 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1014 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 1073 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1074 FGNTHIGAALREVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1116 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 1117 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 1174 Query: 409 MVHR 412 + Sbjct: 1175 ICTT 1178 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 457 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 516 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 517 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 558 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I I Q L+ + Y V + D+L + + Q + Sbjct: 559 RLREEHIRVYAIGIK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 613 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 641 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 698 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 699 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 739 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 740 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 796 Query: 411 HRK 413 + Sbjct: 797 SPR 799 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 38/343 (11%), Positives = 86/343 (25%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 109 KALQEAHRTYFSAPANGRDKKQFPP-ILVVLASSESEDNVEEASKALRKDGVKIISVGVQ 167 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I I G +E Sbjct: 168 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVIKYKE----GAVDDIFVE- 220 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 221 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 269 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 270 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 318 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 319 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 372 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 373 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 414 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 67/227 (29%), Gaps = 43/227 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++ + + VDLSGSM K ++ A+K L + Sbjct: 76 QQKAGRDLMIAVDLSGSMETEDF---------SQADGKPADRLTAVKTVLRQLANE---- 122 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +GLI + + + + + T A+ A ++ Sbjct: 123 ---RAGDRLGLIVFGSSAYLQSPFTEDHRTWLLLLNETRIRMAGPSTALGDAVGLAIKLF 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + ++ LTDG + + A I+I Sbjct: 180 ------------------KDAETEHRVLLLLTDGNDTGSLVP--PVDAARVAATEDIRIY 219 Query: 366 TISINASP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I++ + L + + + ++++ LI VF+ + Sbjct: 220 PIAVGDPTAVGEEAIDLDTLARMAEVTGGQAFEALSSEDLIAVFKLL 266 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 67/180 (37%), Gaps = 25/180 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +KN +L L+ ++ + LI + T+ ++ + ++ + Sbjct: 177 SKIENVKNTILQLLEMLNEND------RLSLITFNTKAKQLCGLKKVNNQNKESLQTITK 230 Query: 286 SL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-- 342 S+ TD T ++ A+QIL S K+++ ++ I L+DG+++ Sbjct: 231 SIKADGGTDITSGLEIAFQILQSRKQKNSVSS----------------IFLLSDGQDDGA 274 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + K + +E I + +G + K Y V D + F Sbjct: 275 DIKIKNLLKTTYQQLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 334 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 74/199 (37%), Gaps = 30/199 (15%) Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 ++ + K++++ +K + F++ + +GL+ + T Sbjct: 91 DTSDSMSTYGFNEKKYKQSRLNVVKEVVQNFIN-------SRVKDRIGLVVFGTTAGIAS 143 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S+ E + V ++ K T + + Q+L S Sbjct: 144 PLSFDKEAQKNIVGNINVGVLGKSTALIDGIVSSIQLL------------------KNSK 185 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 K II L+DGE++ K AK+ IKI TI+I + ++K + Sbjct: 186 SKSKIIILLSDGEDSASKIP--LEFALKLAKKYNIKIYTITI--DKSYSDMMKVIANKNG 241 Query: 388 -YHYNVVNADSLIHVFQNI 405 ++ V N + L+ V+++I Sbjct: 242 AKNFEVQNKEDLVKVYKSI 260 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + ++ A KN F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLRPNRLEAAKNVAAEFI-------SGRPNDNIGLTIFAGESFTQCPMTTD 147 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 148 HASLLNLLRNVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I +TDG NN +++ + A+ I++ TI + + Sbjct: 192 --SKVVILITDGSNNMG--DISPMTSAQIAQSLGIRVYTIGVGTNKVAPYPMNVGGTTQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y N L ++ +I + Sbjct: 248 VNIPVEIDSKTLSDIAAVTEGNFYRATNNKELKQIYNDIDK 288 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 41/316 (12%), Positives = 83/316 (26%), Gaps = 60/316 (18%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 F L + + N + R + L Sbjct: 70 FETAIPAKEIVLVLDISGSMDEEIENPCTNKRVRYCTRHSSSDPNHEEWFLSWHR----- 124 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + E S K+ LK A F++ + + ++K +G++ Y++ Sbjct: 125 WINDYCVEHNTSGEHNITANNKKIDELKRAANGFIERMKDVPNLK----IGIVAYSSIAT 180 Query: 265 KNIEPSWGTEKVRQYVTR--------------------------DMDSLILKPTDSTPAM 298 N GT+KV+ + + L T+ M Sbjct: 181 INPNSKSGTKKVKSLDSNSSHDVTNYNSLGANFLQSNDSRLTSVINNLEALGGTNIGEGM 240 Query: 299 KQAYQILTSDKKRSFFTNFF---RQGVKIPSLPFQKFIIFLTD---------GENNNFKS 346 ++A +L S K + T K D G + KS Sbjct: 241 RKAVYMLDSGDKSASKTIVLMTDGLPTFYSVTGSNKNNYMTIDNTEPKIAGIGTGLDTKS 300 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLL---------KTCVSSPEYH----YNVV 393 + + + K +I +G L + P+ + + Sbjct: 301 INYSRAVGEIIKSRGYNSFSIGYGLDTDGNDTLLSIHEAMTGVSIKGKPDLYESSGFFPT 360 Query: 394 NADSLIHVFQNISQLM 409 + +++ VF I+ + Sbjct: 361 STNAIQAVFNRIATQI 376 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 69/241 (28%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + L +D+S SM +D + ++ A K A F+ + Sbjct: 86 PRNRATVVLAIDVSLSMEA---------------RDVEPDRLTAAKEAAKKFVTEL---- 126 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V +G++ + + P+ V + + T + + + Q + Sbjct: 127 --PNGVNLGIVSFAGTASLLVSPTPDRTLALNAVDKLELAQR---TATGEGIYTSIQSIK 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK---ICDKAKENFIK 363 + R + II +DG+ + KAKE I Sbjct: 182 N----------IRDVLGGEDNAPPARIILESDGKQTVPTDLDDPRGGFTAARKAKEEGIP 231 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + L + S + + + L + ++ + Sbjct: 232 ISTISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQFFAASSLNDLNEAYGSLRDEI 291 Query: 410 V 410 Sbjct: 292 G 292 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 35/242 (14%), Positives = 69/242 (28%), Gaps = 56/242 (23%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K +M KNA F+D++ + +G++ + + V + + K+ Sbjct: 209 TSGSMKDFGRMKGAKNAANAFVDNVLD---GSQSTQIGIVRFASNVSIVSDFTSNKAKLH 265 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + +KQA +L + GV + +L Sbjct: 266 SAINALS---AEGGTFTQAGVKQARTMLAGSGADKKYMVVLSDGVPTFNYKISHPNNYLI 322 Query: 338 DG--------------------------------------------------ENNNFKSN 347 DG E + + S Sbjct: 323 DGGPGSHTHEKQTGKQLPENEFLYNESATGSGNSMWKQYERVKIGGGIFPTYEYHYYNSG 382 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I AK +++ T+ + G +L+ S+P V + L VF+ I+ Sbjct: 383 NCAIAESGFAKAANMRVFTVGLQTDATGSGVLRDIASAPGDFTEVTDVSQLTPVFEQIAA 442 Query: 408 LM 409 + Sbjct: 443 QI 444 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 68/219 (31%), Gaps = 51/219 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K F+ + + +G++ Y + Sbjct: 102 DVSASMLAEDLKPNRLEATKKVAADFIK-------GRPNDRIGVVVYAGESYTKTPITTD 154 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + D ++ T + A L S K Sbjct: 155 EMISLRAINEIAFDGVLENGTAIGMGLATAVNRL------------------KDSEALSK 196 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDG NN+ + + A E IK+ TI I + Sbjct: 197 VIILMTDGVNNSGFID--PKIASELALEYDIKVYTIGIGTNGNAPSPVAQIGRNKFRMAM 254 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + + V + ++ N L ++ I + Sbjct: 255 MPVEIDEELMKQIAVDTGGKYFRATNNKKLEEIYGEIDK 293 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 44/136 (32%), Gaps = 9/136 (6%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + ++IFA + + +LLIG + +++++ A +A +L+ A K + Sbjct: 16 LRTLLRDRNGAVAVIFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVKGGN 75 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 S R A ++ + S + + + + Sbjct: 76 PDSGQLAR-----YLNSNMSPGGAAT----NVTLTRSVATGGAVTFVGDADFSVATNFLK 126 Query: 131 HMANNRLDSSNNTIFY 146 + +++ Sbjct: 127 MAGLGAIKLHSHSEAT 142 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 68/222 (30%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D D + K Sbjct: 841 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMTDLVKKADVGK 877 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K D + T + A+ + + T + Sbjct: 878 NQVRFGALKYADDPEVLFYLDTLGTKWEVISVLQNDQPMGGNTYTAEALAFSDHMFTEAR 937 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ T ++ I I+ + I Sbjct: 938 GSRLQRG------------VPQVLIVITDGESHDADKLNAT---AKALRDKGILILAVGI 982 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + ++ V L +F ++S + + Sbjct: 983 -AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 1022 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 52/184 (28%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + T + + + I Sbjct: 1048 DDFKKMKAFLASVVQDFDVTVNRVRIGAAQFSHNYQPEFPLGTFTGEEEISLQIEKIQQI 1107 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ + D + ++ LTDG++ + Sbjct: 1108 FGYTHIGAALRRVGRYFRPDMGSRINAG------------TPQVLLVLTDGQSQD----- 1150 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + + V + D L V + I + Sbjct: 1151 EVAQAAEDLRRKGINIYSVGIGDVDDQQ--LVQITGTADKKLTVHDFDELRKVKKRIVRH 1208 Query: 409 MVHR 412 + Sbjct: 1209 ICTT 1212 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 55/175 (31%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + + V G + Y + E + + K + + T++ Sbjct: 491 TFLSELVGMFNIAPQKVRFGAVQYADSWDLEFEINKYSNKHDLGKAIENIRQMGGNTNTG 550 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + + Sbjct: 551 AALNFTLGLLQKAK-------------KERGNKVPCHLVVLTNGVSKDSIVEP-----AN 592 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I + I + Q L+ + Y V + D+L + + Q + Sbjct: 593 RLREELIHVYAIGVR--EANQTQLREIAGEEKRMYYVHDFDALKDIRNQVVQEIC 645 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 675 MKIFMKNLVSKSQIGADRVQIGVVQFSHVNKEEFQLNRYMSQSEISDAIDRMAHIGETTL 734 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 735 TGHALTFVSQYFSP--------------AKGARPNVRKFLILITDGEAQDIVKDP----- 775 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ I I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 776 AVALRQEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQHIEDDLVFGICSP 832 Query: 413 K 413 + Sbjct: 833 R 833 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 57/197 (28%), Gaps = 27/197 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + LK +S+ L + + +GL+ ++ + S G K Sbjct: 273 NGSQENFDYLKE---FLEESVSALDIKENCMRVGLVAFSNETKVINSLSRGINKSEVLQY 329 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + A+ ++ F Q + + + +T Sbjct: 330 IQNLSPQAGNAYTGAAI-----------RKIRKEVFGAQNGSRKNQGVPQIAVLVT---- 374 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL- 398 + + T + + + I T+ I + + L+ S P Y + + L Sbjct: 375 HRPSEDNVTKAAVN-LRRQGVTIFTMGIEGASD--TQLEKIASHPAEQYVSKLKSFSDLA 431 Query: 399 ---IHVFQNISQLMVHR 412 + + + H Sbjct: 432 AHNQTFLKKLRNQITHT 448 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 81.1 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 69/184 (37%), Gaps = 29/184 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDS 286 + K + + F++S +G++ + T N + Sbjct: 907 SVAKTSAVSFVESRGDGD------QVGVVSFYTSASLNSALKQMNSGTNKTTVKNAINSL 960 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD + +K+A L + K+ + +++II LTDG + + Sbjct: 961 SASGGTDISSGIKKAIAELDAHKRST----------------AKQYIIVLTDGYSQYPEF 1004 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 ++ DKAK I TI + + + LK S PEY+Y V++ + L + +I Sbjct: 1005 DLI---EADKAKAKGYTIFTIGMGMAD--EDTLKKIASKPEYYYRVLSPEQLEAAYYDIG 1059 Query: 407 QLMV 410 Q + Sbjct: 1060 QEIG 1063 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 28/314 (8%), Positives = 64/314 (20%), Gaps = 20/314 (6%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + L + + + F + + + +V Sbjct: 504 ANGEAESTFLLSADAKGKGEVKIKATSGSLPPAYFTIVYTNATFISIDTELTTSGTAGVV 563 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + + + + + Sbjct: 564 ELEKNPYITVDITLKGLGWQNRA----KPSDVMLVMDRSGSMAWNAGMVFPDPASGSPQK 619 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNS--DPEDVNSAPICQDKKRTKMAALKNALLL--- 237 + VD + + + + + T + N+ Sbjct: 620 GVLARNSRSTFNSEVDAYNNANKFWDGIVGSSKWKLLYTYSNSQSTPLTIELNSPSNRYY 679 Query: 238 -------FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 FL ID ++ + + K Y R LI Sbjct: 680 GINGNYYFLMVIDPDGKTYVAGKNYDGVSSSHKTYQNTLTINSPKNGNY--RIYGLLIKG 737 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T S + +I + S + + + G NN + Sbjct: 738 YTGSGSTTTYSLKIRSQTITNSGSQPSSYYSSYQSAYNYADSMYDDNWGIYNNTRW--EH 795 Query: 351 IKICDKAKENFIKI 364 I + E I+I Sbjct: 796 IATYHNSDERNIQI 809 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 76/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++M A + Sbjct: 80 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PSRMVAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAI----------ATVGAVIGGGDTXPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELRAVYSSLQQQIG 299 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/273 (12%), Positives = 76/273 (27%), Gaps = 62/273 (22%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 +L I + P ++ R + LV+D+S SM Sbjct: 64 MLVGLVFLTIAAAGPTSVQKV--PRNRATVVLVMDVSLSMEATDVP-------------- 107 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +++ + A F+D + + + +G + + P+ E V+ + Sbjct: 108 -PSRLEVAQQAGKEFVDGLT------QGINLGFVTFAGTASVMQSPTTNREAVKAAIDNI 160 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + + A Q + + P I+ ++DG+ Sbjct: 161 KLAER---TATGEGILTALQSIETLATVLGGAE----------TPPPARIVLMSDGKQTV 207 Query: 344 FKS-----NVNTIKICDKAKENFIKIVTISIN---------------------ASPNGQR 377 + AK I + TIS + + Sbjct: 208 PDDKDVDNPRHAFTAARLAKSKGIPVSTISFGTEWGSVEIPDQDGQGGSQRVKVPVDNES 267 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 L + S Y + + L V+ + + + Sbjct: 268 LREIAKLSGGEFYTASSLEELTAVYDTLEEQIG 300 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 807 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 843 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 844 NQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEAR 903 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 904 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 948 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N LL SS + ++ V L +F +++ + + Sbjct: 949 -DGANPMELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 988 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1014 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 1073 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1074 FGNTHIGAALREVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1116 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 1117 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 1174 Query: 409 MVHR 412 + Sbjct: 1175 ICTT 1178 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 457 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 516 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 517 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 558 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I I Q L+ + Y V N D+L + + Q + Sbjct: 559 RLREEHIRVYAIGIK--EANQTQLREIAGEEKRVYYVHNFDALKDIRNQVVQEICTE 613 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 641 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 698 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 699 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 739 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 740 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 796 Query: 411 HRK 413 + Sbjct: 797 SPR 799 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/343 (10%), Positives = 85/343 (24%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 109 KALQEAHRTYFSAPTNGRDKKQFPP-ILVVLASSESEDDVEEASKALRKDGVKIISVGVQ 167 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I G +E Sbjct: 168 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVTKYKE----GAVDDIFVE- 220 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 221 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 269 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 270 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 318 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 319 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 372 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 373 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 414 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 1039 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 1075 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 1076 NQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEAR 1135 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 1136 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 1180 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N LL SS + ++ V L +F +++ + + Sbjct: 1181 -DGANPMELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 1220 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1246 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 1305 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1306 FGNTHIGAALREVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1348 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 1349 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 1406 Query: 409 MVHR 412 + Sbjct: 1407 ICTT 1410 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 677 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 736 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 737 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 778 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I I Q L+ + Y V N D+L + + Q + Sbjct: 779 RLREEHIRVYAIGIK--EANQTQLREIAGEEKRVYYVHNFDALKDIRNQVVQEICTE 833 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 873 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 930 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 931 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 971 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 972 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 1028 Query: 411 HRK 413 + Sbjct: 1029 SPR 1031 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/343 (10%), Positives = 85/343 (24%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 329 KALQEAHRTYFSAPTNGRDKKQFPP-ILVVLASSESEDDVEEASKALRKDGVKIISVGVQ 387 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I G +E Sbjct: 388 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVTKYKE----GAVDDIFVE- 440 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 441 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 489 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 490 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 538 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 539 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 592 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 593 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 634 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 71/252 (28%), Gaps = 65/252 (25%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + I + +D+S SM + +D ++ ALK F Sbjct: 77 SSENTKTETTKGIDIVMAIDVSSSM---------------LAKDLSPNRLVALKEVASNF 121 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + +GL+ Y + V + + + T + Sbjct: 122 IK-------DRPNDRIGLVVYAGESYTKTPITSDKRLVLEALKEIKYGALTDGTAIGMGL 174 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L S K II LTDG NN + + A Sbjct: 175 ATSVNRL------------------KDSKALSKVIILLTDGVNNAG--FIEPATAAELAV 214 Query: 359 ENFIKIVTISINASPNG----------------------QRLLKTCVS-SPEYHYNVVNA 395 E IK TI + + N + LL+ + + ++ + Sbjct: 215 EYDIKTYTIGLGTNGNALSPIAYNPDGSFRYGMAEVQIDEALLEQIATLTGGKYFRATDN 274 Query: 396 DSLIHVFQNISQ 407 L ++ I+ Sbjct: 275 KKLEAIYNEINS 286 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 73/212 (34%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D ++AA K+ F+D + + +GL+ ++ + P+ Sbjct: 94 DVSNSMAATDVAPNRLAAAKDGADAFIDQLPP------RINLGLVSFSGSAALLVPPTTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + VR + T + Q +T+ ++ P Sbjct: 148 RQSVRSGIHGLQL---GPSTAIGEGIFAGLQAITTAGEQL---------AADGGTPPPAA 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DGE + N + A++ + + TI+ N L Sbjct: 196 IVLLSDGETQRGRPNAQAAQ---AARDAGVPVDTIAYGTADGSLDVGGQEIPVPVNEDAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ ++ + D L V++N+ + Sbjct: 253 REIARATDGSYHRAASGDELRSVYENLGSSIG 284 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 78/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ + R + LV+D+S SM T++AA + Sbjct: 72 SVALAGPTEDKRV-PRNRATVILVIDVSLSMKATDVE---------------PTRLAAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +A F D + + +GL+ + + P+ E + + S T Sbjct: 116 DAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNREASKVAIDNLQLSER---T 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + Q + + + I+ L+DG+ ++ + Sbjct: 167 ATGEAIFTSLQSIDT----------LAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRG 216 Query: 353 ---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 +AK+ + I TIS + L + S + + Sbjct: 217 GFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSL 276 Query: 396 DSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 277 EELRDVYDTLEEQIG 291 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 70/221 (31%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K +M A K+ F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRMEAAKDVAAEFI-------SGRPNDNIGLTIFAGEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRD-MDSLILK----PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + D T + A L K + Sbjct: 148 HQSLLNLLQNVRTDLSARGLIEDGTAVGMGLANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------- 372 K +I LTDG NN +++ + + AK I++ TI + + Sbjct: 192 --SKVVILLTDGSNNRG--DLSPMTSANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQY 247 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y N + L ++++I + Sbjct: 248 VNIPVEIDTKTLSGIASVTHGNFYRATNNNELKQIYKDIDK 288 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 110/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGHSRSPDISKVVIVVTDGRPHDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + R+D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGFGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F++ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 57/193 (29%), Gaps = 28/193 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMD 285 + LK LL +D + + LI ++T ++ + ++ Sbjct: 1463 LDLLKETLLFLVDLLQTGD------RICLIQFSTNAQRLTPLLSIESKDNIKSIKNEINR 1516 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+ M+ A+ +L + ++ T+ F L DG N + Sbjct: 1517 LVAKGGTNICQGMQLAFDVLKQRRYKNPITSVFLLSDG------------LNDGAENKIR 1564 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH----- 400 + + E I T + + K Y + + + Sbjct: 1565 DLLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLMDGNFYYIGDIHRIDECFIDA 1624 Query: 401 ---VFQNISQLMV 410 +F ISQ + Sbjct: 1625 LGGLFSVISQNVS 1637 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 80.7 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 60/176 (34%), Gaps = 17/176 (9%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D + K V G + Y+ E + + K D T + Sbjct: 829 FMIDLVKKADVGKNQVQFGALKYSDFPEVLFNLNEFSSKSEIISFIQNDHPRGGSTYTAK 888 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + + + + +I +TDGE+++ Sbjct: 889 ALAHSAHLFSESLGSRM------------HRGVPQVLIVITDGESHDAH---LLNATARA 933 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ I ++ + I N + LL S+ + ++ V N + L +F+N+S + + Sbjct: 934 LRDKGILVLAVGIE-GANHEELLSMAGST-DRYFFVENFEGLKGIFENVSASVCNT 987 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 52/183 (28%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + ++ V++GL ++ +K+ I Sbjct: 1012 EDFKKMKDFLVTIVNDFDIRPGKVHVGLAQFSHEYRPEFSLIPFRDKIEVKNQIGRIQQI 1071 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++ D Q+ ++ LTDG++ + Sbjct: 1072 FGNTLIGAALRNVGSYFWPDFGSRINAG------------VQQVLLVLTDGQSQD----- 1114 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ + + Q L S + V N L + + + + Sbjct: 1115 EVAQAAEDLRNKGIDIYSLGVGQVNDQQ--LIQISGSAKKKLTVDNFSELDKIKKRVVRD 1172 Query: 409 MVH 411 + Sbjct: 1173 VCT 1175 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 59/176 (33%), Gaps = 20/176 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + I + + V G + Y+ E E + K + + TD+ Sbjct: 456 FLAEVIGMFNIGPHKVRFGAVQYSHLWEWEFEMDRYSNKNDLVKAVENIRQLGGNTDTGA 515 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ ++L F++ + + + ++ LTDG +++ + + Sbjct: 516 ALD---KML----------PLFQRARQQRARKVPQHLVVLTDGLSHDS-----VREPAGR 557 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I + I + L+ S Y V N DSL + + + + Sbjct: 558 LRGDNINVYAIGVK--EANHTQLEEIAGSDSRVYYVHNFDSLKDIKNRVVRSICSE 611 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 71/229 (31%), Gaps = 44/229 (19%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + ++ E + I +VD SGS+ + + + K Sbjct: 599 DIKNRVVRSICSEEACKEMSADIMFLVDSSGSIGGDNFEKMKTFMKNVVNRTKIG----- 653 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 V +GL+ ++ ++ + + K + D SLI + Sbjct: 654 ------------------ANQVQVGLVQFSDINKEGFQLNQYDTKTKISDAIDGLSLIGR 695 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A+ + K +KF++ LTDG++ + Sbjct: 696 GTLIGGALTFVSDYFSVS--------------KGARPNVKKFLVLLTDGKSQD-----AV 736 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + +++ + I ++ + S Q L+ + + V N D L Sbjct: 737 KEAAVALRQDGVIIYSVGVFGSEYSQ--LEEISGRSDMVFYVENFDILK 783 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 32/260 (12%), Positives = 74/260 (28%), Gaps = 31/260 (11%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM-HCAMNSDPEDVNSAPICQ 221 + + + I + + ++ + + + C S + V Sbjct: 183 SMFSPNMTRIIEDVVKFEEGT------LTIVLFLFLSNFPIEACHDASVVDIVFLVDESV 236 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + LK +++ID + + +GL+ Y+ + GT K Sbjct: 237 NGTDENFEHLK---GFLVETIDSFDVKENCMRIGLVMYSNETKLVSRLGTGTNKSDILQQ 293 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D S + A+ + + S S + + +T Sbjct: 294 IDGLSPKAGRALTGAAINVTRKEIFSRGAG-----------SRKSQGVLQITVLIT---- 338 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIH 400 + S N + + + + + I N +L + E + V + + Sbjct: 339 -HRSSEDNVSEAALSLRREGVTVFAVGIE-GANETQLDQIASYPREQYVSMVKSYSDMGA 396 Query: 401 ---VFQNISQLMVHRKYSVI 417 +FQ + + K SV Sbjct: 397 YYRIFQKKLRNEIQNKVSVA 416 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 80.7 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 69/244 (28%), Gaps = 13/244 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R K+ S N +I AL + + + G+ + + W+ K ++ A + A +AGA Sbjct: 4 IRQTAKRLRQSNTGNAMMILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVAGAY 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + N + + + I + K S + + ++ S Sbjct: 64 SLSYNGTAGDWSARAEQEYDANRSITTGYATANDSTKGVTDYGSFTQNSVTVSATMDVSL 123 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + N+ ++ + N ++ + Sbjct: 124 PFSSILLSTPTTINVNSQAMFEKTDG-------ATGCMIALKPNDTAITING-----NVT 171 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + S S++ K L + + D + Sbjct: 172 INAPCGMAVSSTSSISIN-KSGGSGSINPGWVYTGGGVNDKFQELIDKFNATAPAADQVE 230 Query: 247 HVKE 250 ++ Sbjct: 231 VNED 234 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 80.7 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 76/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ + + R ++ LV+D+S SM +M A + Sbjct: 80 TVAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PNRMVAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + P I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAI----------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELNSVYASLQQQIG 299 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 80.7 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 77/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM ++M A + Sbjct: 72 TIAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PSRMVAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 116 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKNALDKLQFADR---T 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + P I+ +DG+ + N Sbjct: 167 ATGEAIFTALQAI----------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKG 216 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 217 AYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATL 276 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 277 AELRAVYSSLQQQIG 291 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 109/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + +D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F+ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 79/228 (34%), Gaps = 42/228 (18%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + A D++ + QD + T+ A K F+ S+ + + +GL+ Sbjct: 78 VPVPDPTATVIVTIDISLSMRAQDIQPTRFEAAKQEAKNFVRSL------PDGIKVGLVS 131 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + EP+ ++ + + + + T + ++ + + D+ Sbjct: 132 FAGYATLEAEPTTDH---QRVIDQIELLQMARRTAIGDGLLESLRAIPKDENGK------ 182 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN------AS 372 ++ L+DG N+ +++ A++ + + TI + Sbjct: 183 --------PLGPSTVVLLSDGRTNSGVD---PMEVAPFARDMGVVVHTIGLGRRSNPGDP 231 Query: 373 P----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + L ++ +Y +A++L ++N+ +++ Sbjct: 232 DQYWGGYWMQFDEETLRAIAEATGGQYYAAGSAEALRQAYRNLGRMVG 279 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 80/247 (32%), Gaps = 36/247 (14%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + P + + V+D+SGSM+ + V +A + + +++ Sbjct: 51 QVLLRVEAPPAADLN--SHVPLDVVAVLDVSGSMN-------DPVAAASPKSNLQGSRLD 101 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDS 286 LK ++ + + + ++ + K + + Sbjct: 102 VLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRL 155 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T PA+++A +IL + S F I+ LTDG++ Sbjct: 156 QARGGTALMPALEEAVKILDERQGSSRNRVGF--------------ILLLTDGDDTTGFR 201 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 AK + T + AS + + LL S + V + D+L ++ ++ Sbjct: 202 WTRDAIHGAVAK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD-DNLANIAGALA 257 Query: 407 QLMVHRK 413 + K Sbjct: 258 VCLGGLK 264 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 71/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I + V+D SGS+ + +D + + K Sbjct: 806 ILDVVFVIDSSGSIDHDEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T ++ A+ + + T + Sbjct: 843 NRVRFGALKYADDPEVLFYLGDFDTKLEVISVLQNDQPMGGNTYTSEALGFSDHMFTEAQ 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 903 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + ++ V L +F +++ + + Sbjct: 948 -AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 987 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 54/177 (30%), Gaps = 19/177 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + V +G ++ + + + + T Sbjct: 1020 EFLVSVVQDFDVSLNRVRIGAAQFSHNYRQEFPLGTFIGEKEISFQIENIQQLGGNTHIG 1079 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++Q D + + ++ LTDG++ + + + Sbjct: 1080 DALRQVGHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD-----EVAQAAE 1122 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + I I ++ I + Q+L++ ++ + V N D L V + I + + Sbjct: 1123 ALRHRGIDIYSVGIG-DVDDQQLIQITGAAEKK-LTVHNFDELKKVKKRIVRNICTT 1177 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 456 SFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K++ ++ LT+G + + ++ + Sbjct: 516 AALNFTLSLLQKAKQQRGN-------------KVPCHLVVLTNGMSKDSI-----LEPAN 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I++ I + + L+ + Y V + D+L + + Q + Sbjct: 558 RLREEHIRVYAIGVK--EANKTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEIC 610 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 57/170 (33%), Gaps = 26/170 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ V +G++ ++ ++ + + + + + T Sbjct: 640 MKTFMKNLVSKSQIGANRVQIGVVQFSGVNKEEFQLNRFMSQ-SDISNAIDQMVHIGETT 698 Query: 294 -STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ Q + K +KF+I +TDGE + + Sbjct: 699 LTGSALSFVSQYFSPT--------------KGARPNVRKFLILITDGEAQD-----VVKE 739 Query: 353 ICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ I I ++ ++ L+ PE + V N D L H+ Sbjct: 740 PAVALRQEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQHI 786 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 50 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 86 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 87 NQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEAR 146 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 147 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 191 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N LL SS + ++ V L +F +++ + + Sbjct: 192 -DGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 231 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 19/184 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 257 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 316 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 317 FGNTHIGAALREVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 359 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + I I ++ I + Q L + E V N D L V + I + Sbjct: 360 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRIVRN 417 Query: 409 MVHR 412 + Sbjct: 418 ICTT 421 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 62/212 (29%), Gaps = 50/212 (23%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + ++ K + F+ ++D +GL+ ++ + + + ++ Sbjct: 102 AGDFDPKDRITVAKEVIAEFVK-------GRKDDRIGLVVFSGHAFTQVPLTLDYDFLQN 154 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T A+ + L S K +I LTD Sbjct: 155 LLGQVQTVRRPDGTAIGLALAHSVNGL------------------RNSTTKSKVVILLTD 196 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ---------------------- 376 G NN ++ + + A+ +++ TI + NG+ Sbjct: 197 GSNNRG--DIEPAQAAEIARALDVRVYTILVGKPGNGEYPVHDPWRDETYLIPAPTAEDE 254 Query: 377 -RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L + + + L V+ I + Sbjct: 255 VALRDIAEQTGGIFFRAGDEQGLRDVYDTIDK 286 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 74/275 (26%), Gaps = 51/275 (18%) Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 ++ + V R + + ++ V RP Sbjct: 42 SNDDDEKIVTRSRPTPPIVPARVKLRSINND-MAPLEESKLKVMLELTGGDSSSYGRPGL 100 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+S SM + KM +K A+L + + Sbjct: 101 DLVAVLDVSRSM--------------------EGDKMEKMKTAMLFIIKKLGPTD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 + ++ ++ + T K ++ ++ L T+ T ++ A ++L Sbjct: 136 -RLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTALKVLKGRSF 194 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I+ ++DGE N + I T Sbjct: 195 NGERVVG---------------IMLMSDGEQNAGSD-------ATGVSVGNVPIHTFGFG 232 Query: 371 ASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQN 404 + + L S +V N DSL F Sbjct: 233 INHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQ 267 >gi|84385695|ref|ZP_00988726.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] gi|84379675|gb|EAP96527.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] Length = 436 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 43/437 (9%), Positives = 102/437 (23%), Gaps = 60/437 (13%) Query: 35 LLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFE--SISNHAKRALID 92 + F + +E A A LA + N D + Sbjct: 1 MTFAFSAGYTQRLLAHSKIEEATEIASLALIANPGGNNQEDKDYAKNLVDLYVTDNINDV 60 Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV-- 150 + + V++ + + ++ + V Sbjct: 61 EIDVTTTRCEYADGCVQRNHELSPFTDFTVSARTEHKSWITHDEIGVEPEFKVSGRSVTR 120 Query: 151 -----MTSYDYRLQFIEHLL----NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 + L + + ++ N+ + + + + + F Sbjct: 121 KFLPQPVDIYFILDTSQSMSNRWSGEKNNKTQMDVVKETIVKVVEDLKTFKTGDEKKSQI 180 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 S+ + + + + T+ K+ F +D + Y Sbjct: 181 SLVTYNAYNAKFDKKSNQVKLYDYTR--DFKHTTETFEGIVDKMFDQDWPAETTSYAYMY 238 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQ----------------I 304 ++I + + + V D T S + A + I Sbjct: 239 NTSQDIPLTDDYDNFIKLVKSDKLKPATGGGTMSWLGLIAAAKEANKVEDVNRNPEQVFI 298 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF---LTDGEN-----NNFKSNVNTIKICDK 356 SD + ++ K + +DG K+ V K+C+ Sbjct: 299 FLSDGADNKVNYPKDLYLEKTRSYRSKHNVDWTHYSDGSTVLQYQQYLKTLVTKYKLCET 358 Query: 357 AKE--------------------NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +E + + I +N CV ++ D Sbjct: 359 LQETISKKKNKFQSKHSSLEGEKTKVTMGVIGVNYIVKEDDGFGDCVGRKNIYHAKKGKD 418 Query: 397 SLIHVFQNISQLMVHRK 413 ++ I++ K Sbjct: 419 VYKYILNLINEETGRLK 435 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 79/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ ++ R + LV+D+S SM T++AA + Sbjct: 72 TVALAGPTADKKV-PRNRATVILVIDVSLSMQATDVE---------------PTRLAAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GL+ + + P+ + + + S T Sbjct: 116 EAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNRDATKVAIDNLKLSER---T 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + Q + + + I+ L+DG+ +++ + Sbjct: 167 ATGEAIFTSLQSIDT----------LSAVLGGSDQAPPARIVLLSDGKQTVPENSDDPRG 216 Query: 353 ---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 +AK+ + I TIS + L + S + + Sbjct: 217 GFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTASSL 276 Query: 396 DSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 277 EELRQVYDTLEEQIG 291 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 80.3 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 79/255 (30%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ ++ R + LV+D+S SM T++AA + Sbjct: 72 TVALAGPTADKKV-PRNRATVILVIDVSLSMQATDVE---------------PTRLAAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GL+ + + P+ + + + S T Sbjct: 116 EAAKSFADGLTPG------INLGLVAFAGTASVLVSPTTNRDATKVAIDNLKLSER---T 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + Q + + + I+ L+DG+ +++ + Sbjct: 167 ATGEAIFTSLQSIDT----------LSAVLGGSDQAPPARIVLLSDGKQTVPENSDDPRG 216 Query: 353 ---ICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 +AK+ + I TIS + L + S + + Sbjct: 217 GFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTASSL 276 Query: 396 DSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 277 EELRQVYDTLEEQIG 291 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 79.9 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 76/252 (30%), Gaps = 66/252 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + I + +D+S SM + QD K +++ALK F+ Sbjct: 81 DISTRTKTNKGIDIVMAIDVSSSM---------------LAQDLKPDRLSALKRVASAFV 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAM 298 D + +GL+ Y + V+ + LI T + Sbjct: 126 D-------DRLSDRIGLVVYAGESYTLTPITSDKGIVKGSLREISYQGLIEDGTAIGMGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + L + + K II LTDG NN+ + + A Sbjct: 179 ATSVNRLKDSRAK------------------SKVIILLTDGVNNSGFID--PKIATELAV 218 Query: 359 ENFIKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNA 395 E IK TI + ++ + L + ++ ++ + Sbjct: 219 EFGIKTYTIGLGSNGTARAPVGILPNGSFQYAMTKVEIDEALLQEIATATGGIYFRATDN 278 Query: 396 DSLIHVFQNISQ 407 L +++ I++ Sbjct: 279 KKLEEIYEEINK 290 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 80/227 (35%), Gaps = 48/227 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E+ + + +D+SGSM + + PE T++ K L F+D Sbjct: 81 NEKKGVDVMIALDVSGSMSRSRDFLPE-------------TRLGVSKKLLRKFIDK---- 123 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQI 304 ++ +GL+ + + E + + + + ++ + T A+ + Sbjct: 124 ---RKSDRLGLVVFAGAAYLQAPLTGDRESLNEILGTIEEETVAEQGTAIGDAIILSTYR 180 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIK 363 L + + R K I+ +TDG +N K + T E+ +K Sbjct: 181 LRASQAR------------------SKVIVLITDGVSNTGKIDPVT---ATDLAEHIGVK 219 Query: 364 IVTISIN-----ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I ++ I N + L + S+ + + + + V +I Sbjct: 220 IYSVGIGKEDGSYEINFEILRELSASTGGKFFRAEDPEEMKAVLTSI 266 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 33/279 (11%), Positives = 83/279 (29%), Gaps = 31/279 (11%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++ + K+ EK ++++ AL L LI F + + + A A Sbjct: 5 NMTMSLFYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAA 64 Query: 62 LAGASKMVSNLSRLGDRFESI-------------SNHAKRALIDDAKRFIKNHIKESLSG 108 L ++ + + R A + + ++ + Sbjct: 65 LLLTTENNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRNQELVQGMVKL 124 Query: 109 YSAVFYN-TEIQNIVNSSRISMTHMANNR---LDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 Y + + + + + + + ++++ ++ + L + L Sbjct: 125 YLRSYDKEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVEGDVKRKFWLPWSYTL 184 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N ++ + E + LV D+S SM P+D P Sbjct: 185 TSNNRNTVDINSGKSYAVKEKDILIPIDLMLVNDISTSM----FKPPKDDPQGP------ 234 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYT 260 K+ +LK + + + K +G+ + Sbjct: 235 -KKIDSLKTVVKAVANILIPDEPPKNISKYNRIGITSFG 272 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 66/217 (30%), Gaps = 30/217 (13%) Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + D ++ + + A ++ F + + L + Sbjct: 369 IGYDKHVTDNPLRDVFYQYHDINASLASIASFKGNDISYDITFNKSPLCLEKTQKSTATD 428 Query: 267 IEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 I W + K + + T ++ + +L + K + Sbjct: 429 I---WFSSKKSSELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKDATPDKIK------- 478 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTI---KICDKAKEN------------FIKIVTISIN 370 Q+ ++ L+DG + + + +C+K + KI ++ Sbjct: 479 -TNTQRILLVLSDGVDTALPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFG 537 Query: 371 ASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + + CV P ++ N +L+ VF+ I Sbjct: 538 YEQDSELRKEWENCV-GPGNYHQAKNEKALLEVFKQI 573 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 76/258 (29%), Gaps = 52/258 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V+ ++ R + LV+D+S SM T++AA Sbjct: 68 LVVAIAGPQADAKV-PRNRATVVLVIDVSLSMQATDV---------------APTRLAAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F D + + +GL+ + + P+ V+Q V S Sbjct: 112 QAAAKSFADQLTPG------INLGLVSFAGTAAVLVSPTTDRTAVKQAVDGLKLSE---S 162 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ A Q + S + + I+ ++DG+ + Sbjct: 163 TATGEAIFAALQSIDSFSRTVAASGTEGPP--------PARIVLMSDGKQTVPGPDGEND 214 Query: 352 -----KICDKAKENFIKIVTISINAS--------------PNGQRLLKTCVSSPEYHYNV 392 +A I + TIS + + + S + Sbjct: 215 PRGSFTAAKQAAAEKIPVSTISFGTDYGTIDIEGGRTRVAVDDASMQQIASLSGGQFFTA 274 Query: 393 VNADSLIHVFQNISQLMV 410 + L V+ + + + Sbjct: 275 ASESQLRQVYSELGEQIG 292 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 69/212 (32%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D T++ A K F+D + + +GL+ + + S Sbjct: 94 DVSNSMAATDITPTRLEAAKQGAQAFVDQLPP------RINLGLVSFAGSAAVLVPASTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E VR + T + + Q + + +R P Sbjct: 148 RESVRSGIRGLQLGPA---TAVGEGIYASLQAIATAGQRLSDEG---------QPPPPAA 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DGE + N + A++ I + TI+ N + L Sbjct: 196 IVLLSDGETTRGRPNT---QAATAARDAEIPVDTIAYGTSDGTLDVGGQQIPVPVNEEAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + ++ + D L V++ + + Sbjct: 253 RELADQTGGSYHRATSGDELQSVYRGLGSSIG 284 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 59/176 (33%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D + K V G + Y E + K D + T + Sbjct: 828 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLDDLSTKWEVISVLQKDQPMGGNTYTA 887 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ +T Sbjct: 888 EALGFSDHMFTEARGSRL------------HKGVPQVLIVITDGESHDADKLNDT---AK 932 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + ++ V L +F ++S + + Sbjct: 933 ALRDKGILVLAVGI-AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 986 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 49/183 (26%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ K + I Sbjct: 1012 DDFKKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPLGTFIGKKEISFQIENIQQI 1071 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG++ + Sbjct: 1072 FGYTHIGAALREVGDYFRPDMGSRINAG------------TPQVLLVLTDGQSQD----- 1114 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + I I ++ I + Q L + + V N D L V + I + Sbjct: 1115 EVAQAAEELRHKGIDIYSVGIGDVDDQQ--LIQITGTADKKLTVHNFDELTKVKKRIVRN 1172 Query: 409 MVH 411 + Sbjct: 1173 ICT 1175 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + + V +G + Y + E + T K + + T++ Sbjct: 455 TFLSEVVGMFNIAPQKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTG 514 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 515 AALNFTLGLLQKAK-------------KQRGNRVPCHLVVLTNGMSKDSI-----LEPAN 556 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I++ I + Q L+ + Y V + D+L + + Q + Sbjct: 557 RLREELIRVYAIGVK--EANQTQLREIAGEDKRVYYVHDFDALKDIRNQVVQEIC 609 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 64/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 639 MKTFMKNLVSKSQIGADRVQIGVVQFSDINKEEFQLNRYMSQNEISNAIDRMAHIGETTL 698 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K ++F+I +TDGE + + Sbjct: 699 TGSALTFVSQYFSP--------------AKGARPNVRRFLILITDGEAQDIVKDP----- 739 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ I I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 740 AVALRQEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQHIEDDLVFGICSP 796 Query: 413 K 413 + Sbjct: 797 R 797 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 56/195 (28%), Gaps = 27/195 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + LK +S+ L + + +GL+ Y+ + S G K Sbjct: 237 NGSQENFDYLKE---FLEESVSALDIKENCMRVGLVTYSNETKVINSLSRGVNKSEVLQN 293 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + A+ ++ F + + + + +T Sbjct: 294 IQNLSPRAGKAYTGAAI-----------RKIRKEVFSARNGSRKNQGVPQIAVLVT---- 338 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL- 398 + + T + + + I T+ I + + Q L+ S P + + L Sbjct: 339 HRPSEDNVTKAAVN-LRREGVTIFTMGIEGASDSQ--LEKIASHPAEQHVSKLKTFSELA 395 Query: 399 ---IHVFQNISQLMV 410 + + + Sbjct: 396 AHNQTFLKKLRNQIT 410 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 49/176 (27%), Gaps = 3/176 (1%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ L R R + E+ N ++IFALS + +L+ + + + ++ + A Sbjct: 1 MNWLDRCRRWLAAFGRDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LAGA+++ + + + + I L V + Sbjct: 61 ALAGANELALAIDDAAAIERAKVFIDGHVSEWKSAPAVTPEIAVILRDKQRVIQVVLKGH 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + + + + + +S Sbjct: 121 T---PSFFANMLPPGGWKYHAEARAVAVGLTPLCVLITGSSGSKMLNVKDSGRLSA 173 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 79.9 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 79/255 (30%), Gaps = 54/255 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI-CQDKKRTKMAALKNALLLFLDSIDL 244 + I + +DLSGSM +++ D+ P + K+ T++ K L F+ Sbjct: 118 SDDKGIDIVVALDLSGSMRAILDARASDLPGQPKLPRGKRLTRLDTAKLVLQDFI----- 172 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQ 303 + +G++ + P+ + Q V++ ++I T A+ A Sbjct: 173 --SRRRTDRLGVVVFGKAAYVLSPPTLDYHLLTQMVSQMTLNVIDGSATAIGDALGTAVA 230 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFI 362 L S K +I LTDG++N S + Sbjct: 231 RL------------------RRSDAQSKVVILLTDGDSNAGAISPEYATHLATSL---GA 269 Query: 363 KIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLI 399 K+ TI I N L + + Y +A +L Sbjct: 270 KVYTIQIGTDDEVEVEDGIDLFGQPRYVRHRFPVNPALLQEIAQKTGGASYVATDAKALA 329 Query: 400 HVFQNISQLMVHRKY 414 ++ + ++ Sbjct: 330 DSMHDVLDRLEKTRF 344 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 79.9 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 12/92 (13%), Positives = 39/92 (42%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 S++ ++F + +++ + + G + + + K +++A +AA L+GA + Sbjct: 9 HSQRGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGAKVLNDMHDVG 68 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 +++ + A + ++ S + Sbjct: 69 QATAAALTTFNMHLEGELADAGLVPTVEVSET 100 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 79.9 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 70/221 (31%), Gaps = 40/221 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ + + + + K Sbjct: 813 VLDIVFVIDSSGSIDYNEYNIMK-----------------------EFMIKLVKKADVAK 849 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V G + Y+ + + DS T + A+ + + T + Sbjct: 850 DRVRFGALKYSYDPTILFYLDEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHMFTEAR 909 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ ++ I I+ + I Sbjct: 910 GSRI------------NQKVPQVLIVITDGESHDA---NQLNATAKALRDKGILILAVGI 954 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 A N + LL S+ + ++ V L +FQN+S + Sbjct: 955 -AGANTEELLAMAGST-DKYFFVETFGGLKGIFQNVSDSIC 993 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 60/183 (32%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + +D V++GL ++ T + +M + Sbjct: 1020 ENFKKMKDFLVSVVDDFDIGPSRVHIGLAQFSHVYRAEFFLGSFTSEGEVSTQIEMTQQV 1079 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+KQ Q + Q+ ++ LTDG++ + Sbjct: 1080 FGNTHIGAALKQVEQYFRPEMGSRINVGI------------QQVLLVLTDGQSQD----- 1122 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K + + I I ++ I + Q+L++ +S + + N D L + + I + Sbjct: 1123 EVAKAAEDLRRKGIDIYSLGIG-DVDEQQLIQISGTSDKK-LTIDNFDELKKIKKRIVRN 1180 Query: 409 MVH 411 + Sbjct: 1181 ICT 1183 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 23/188 (12%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T +K L ++ + + V +G + Y+ E E + + K D Sbjct: 454 YPTDFQEMKAFLSEVIEMFTIAPYK---VRVGAVQYSHIQELEFEINKYSNKNDLGKAID 510 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T++ A+ +L ++ +I LTDG Sbjct: 511 NIWQLGGNTNTGAALDFTLGLL-------------QRAKTQRGNKVPCHLIVLTDGM--- 554 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 S+ N ++ K K+ I + I + N +LL+ + + Y V N DSL + Sbjct: 555 --SDDNVLEPAKKLKDENINVYAIGV-KEANRTQLLEI-AGTEKRVYYVYNFDSLKDIKN 610 Query: 404 NISQLMVH 411 + Q + Sbjct: 611 QVVQGICS 618 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 68/180 (37%), Gaps = 22/180 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + ++ V +G++ ++ ++ + + + + D S I + T Sbjct: 647 MKTFMKNLVNKSQIGEDQVQVGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRETL 706 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+K K +KF+I +TDGE+ + + Sbjct: 707 TGSALKFVSDYFHPS--------------KGARPGVRKFLILITDGESQDPVKDPAM--- 749 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +++ + I ++ + N + L PE + V D L H+ ++ + + + Sbjct: 750 --ALRQDGVIIYSVGV-YGAN-ETQLVEISGKPEMIFYVETFDILKHIEDDLVFGICNPR 805 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/346 (11%), Positives = 91/346 (26%), Gaps = 35/346 (10%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + + K + L+ + + + + + Sbjct: 108 NALEKVHKTYFSGPMNGRDKNQFP-QVLVVLTSAHSEDDVEGPAKALQRDGVKIITLGMQ 166 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 N + + + S I + + + G Sbjct: 167 NASEENLKTMATAQFHYNLRTVRDVGTF------STNMTSIIKEVAKYKAGADDADE--- 217 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +LS H + + + + L++ +S+ + + +G Sbjct: 218 --ELSFPYHITICHGTSVADVVFLLDEYVNGTQENLEHLKGFLEESVSSFDVKENCMRIG 275 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ YT + S GT K S + AM + + + S +K S Sbjct: 276 LVTYTDETKVIHSLSTGTNKSEVLQEIQKLSPKAGRAYTGAAMTKVRKEVFSVQKGSRRM 335 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + I + + + + D + + + T+ + S + Sbjct: 336 QG-----------VPQIAILV----AHRPSEDNVSEAALD-LRREGVTVFTVGLEGSDDT 379 Query: 376 QRLLKTCVSSPEYH-YNVVNADSL---IHVF-QNISQLMVHRKYSV 416 Q L + PE + + L F + + + H K SV Sbjct: 380 Q-LGQISSHPPEKYLSKLKTFSELAAHNQTFQKKLRGEIQH-KVSV 423 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 79.9 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 109/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 86 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSV 145 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + +D + + DY + R Q+ Sbjct: 146 QDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 205 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 206 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 265 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F+ I D L + +GL+ Y++ V + K Sbjct: 266 GSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 325 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 326 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 372 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 373 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 425 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 426 QIGKKLQKKIC 436 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 19/248 (7%), Positives = 64/248 (25%), Gaps = 11/248 (4%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 + ++++ + + + + +K ++ +A +AA A A ++ + R ++ Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTEGATDDQLRAYALDF 60 Query: 85 HAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN- 142 + A + + + +G + + + + + D++ Sbjct: 61 FNANLNKINPANTTLTVTLPSNTTGGGLLKMTARLDYKPYFYPVFGQLVGKSETDANQRI 120 Query: 143 ----TIFYNMDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 T + L + Q + + + + + Sbjct: 121 SFNITSEVRLKNTLEVALVLDNSGSMTKTGTGSGQTRIDLLKTAAKQLVDTLAQQAAMIK 180 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS--IDLLSHVKEDVYM 254 + + + VN P + L D ++ + Sbjct: 181 -QVDRPVQFGLVPFAASVNVGPGNGNAPWMDTEGLSPVSNENFDWSTLNAADKYAQQTNG 239 Query: 255 GLIGYTTR 262 T Sbjct: 240 IWYKRGTG 247 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/325 (9%), Positives = 71/325 (21%), Gaps = 77/325 (23%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + + + + P L D + + N P Sbjct: 325 DAAPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWRLTQDPDEAAPTTYGAINSWWNDDP 384 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-IGYTTRVEKNIEPSWG--TEK 275 + L + G Y+ + T Sbjct: 385 TSSTGQSR--------LRNMSKYFQPRPINAPALAAGNGPNYSCSTNPITPLTDVSVTTG 436 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI--------PSL 327 + T+ M +++++S + + +G + Sbjct: 437 LTAIKAAIDLMKPDGGTNVPEGMAWGWRVVSSGEPFTQGRPETERGNDKVVIVLTDGANT 496 Query: 328 PFQKFIIFLTD----------------------------------------GENNNFKSN 347 + + +D N N Sbjct: 497 YYTPSSLSHSDPADSKSTYASFGYLNPGYNGTSVGRLFMGTSSAIGQFDYSNGNYTNALN 556 Query: 348 VNTIKICDKAKENFIKIVTISIN------ASPNGQRLLKTC------------VSSPEYH 389 +C+ AK I ++T++++ A LK+C S P Sbjct: 557 EQMATLCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSCSSNSRFRKDPTDASKPAKL 616 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 + SL + F+ I + + + Sbjct: 617 FWNATGASLSNDFKEIGNELSNLRI 641 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 79.5 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 65/242 (26%), Gaps = 54/242 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R +I LV+D+S SM ++ A + A F + Sbjct: 86 PRNRAVIMLVIDMSQSMRATDVE---------------PNRLKAAEQAASQFASQLTPG- 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+G+ + P+ + + + T + A+ A + Sbjct: 130 -----INLGLVGFAGTPYLLVPPTP---QHQATIDALKKLDFADSTATGQAIFTALHAIG 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFI 362 + I+ L+DG N + + AK+ + Sbjct: 182 ATAVTGGDNPP------------PARIVLLSDGRENKPSNPSDPHDGVYTAARLAKDEGV 229 Query: 363 KIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I TIS + ++ S Y N L + I Sbjct: 230 PISTISFGTKGGEIEMDGQRVAVPVSTDQMKTIARLSGGQPYTATNIGELNKSYNAIENE 289 Query: 409 MV 410 + Sbjct: 290 IG 291 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 77/218 (35%), Gaps = 45/218 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D + ++ A K A+ ++ +DL + G+I + + S Sbjct: 96 DTSGSMEAADYQPDRITAAKEAIGTLINQLDLKDYA------GIITFDSGASTAAYLSPD 149 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++V + + S T + A + + Sbjct: 150 KQRVIEKLGMIAASD--DSTAIGDGLALAVDM------------------SKSIPNRKSV 189 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-----------------NG 375 +I L+DGE+N + + AKE+ +++ T+++ +S + Sbjct: 190 VILLSDGESNAGYVS--PETAAEFAKESGVQVFTVAMGSSEKVLVGYDWANNPQYATVDE 247 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + L S+ Y+ V+ +L +++ + +VH K Sbjct: 248 ETLEYIADSTGGGFYSSVDEKTLGNIYSQLDDAIVHEK 285 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 785 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 821 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 822 NQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMFTEAR 881 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I +TDGE+++ T ++ I ++ + I Sbjct: 882 GSRL------------NKGVPQVLIVITDGESHDADKLNAT---AKALRDKGILVLAVGI 926 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N LL SS + ++ V L +F +++ + + Sbjct: 927 -DGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVTASVCN 966 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 47/177 (26%), Gaps = 19/177 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + V +G ++ + + I Sbjct: 1009 NDFKKMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQI 1068 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + + ++ LTDG++ + Sbjct: 1069 FGNTHIGAALREVEHYFRPDMGSRI------------NTGTPQVLLVLTDGQSQD----- 1111 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + I I ++ I + Q L + E V N D L V + I Sbjct: 1112 EVAQAAEALRHRGIDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVNKRI 1166 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 402 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 461 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 462 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 503 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I I Q L+ + Y V + D+L + + Q + Sbjct: 504 RLREEHIRVYAIGIK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 558 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 619 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 676 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 677 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 717 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 718 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 774 Query: 411 HRK 413 + Sbjct: 775 SPR 777 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/343 (11%), Positives = 84/343 (24%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 62 KALQEAHRTYFSAPANGRDKKQFPP-ILVVLASSESEDNVEEASKALRKDGVKIISVGVQ 120 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I I G +E Sbjct: 121 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVIKYKE----GAVDDIFVE- 173 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 174 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 222 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 223 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 271 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I Sbjct: 272 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIE--GTS 324 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 325 DTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 367 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 38/185 (20%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ A+K F++ ++ +GL+ + R ++ V + + Sbjct: 125 STRLEAVKRVARSFVEE-------RQGDRIGLVLFANRAYVAAPLTFDLAAVGRAIEEAS 177 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A + +T + I+ L+DG++N Sbjct: 178 IGITGRSTAIADGLGLALKRVTESGA------------------ASRVIVLLSDGQDNAH 219 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-----------NGQRLLKTCVSSPEYHYNVV 393 + + ++ A + ++I TI++ + L +S Y V Sbjct: 220 QID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVR 277 Query: 394 NADSL 398 + L Sbjct: 278 GMEDL 282 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 109/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + +D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F+ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 47/275 (17%), Positives = 92/275 (33%), Gaps = 59/275 (21%) Query: 163 HLLNQRYNQKIVSFIPALLRIEM------GERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + ++ V IP L I G ++ + D++G DV+ Sbjct: 39 SMSKVLKSRYYVKDIPFALIILALLFSIIGLARPAKVDHLSDINGEGIYISL--VVDVSP 96 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + +D T++ A K ++ F+ + + L+ + R ++ + Sbjct: 97 SMMAEDMMPTRLEASKKTMIDFIKK-------RNFDKISLVAFALRASVLSPSTFDYTLL 149 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T + A +L S K + +K II L Sbjct: 150 EEEIKNIKI-DEEGSTSIGLGIATAVDMLRSVKGDN-----------------EKIIILL 191 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------------------- 373 TDGENN+ + + + A IKI TI I + Sbjct: 192 TDGENNSGEID--PKLASEIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFS 249 Query: 374 -NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 N + L+ ++ ++N NA +L +V+ I + Sbjct: 250 LNEEALIDIASTTGGKYFNAQNASALDNVYNTIDR 284 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 65/243 (26%), Gaps = 63/243 (25%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I L VD+S SM + + +M K+ + F+ Sbjct: 82 SSDGIDIALTVDISSSMLAEDF--------------QPQNRMEVAKDKVKRFV------- 120 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ ++ + + V + Sbjct: 121 MGRKSDRVGLVAFSGEALTQVPLTTDYPVVLAAIDNLQVGQ------------------L 162 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 D S + ++ LTDGENN + A I+I T Sbjct: 163 EDGTAIGTAIATAANRLRNSPGRSRVMVLLTDGENNRGAID--PRTAAQAAGTFGIRIYT 220 Query: 367 ISINASP----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + + L + S+ ++ +A +L +++ Sbjct: 221 IGVGTDGMAAVPVGRGLFGLRYENRPVKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQ 280 Query: 405 ISQ 407 I + Sbjct: 281 IDR 283 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 67/221 (30%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K +M A K F+ + + +GL + + Sbjct: 86 DVSTSMLAEDLKPNRMEAAKKVAAEFI-------SDRANDNIGLTIFAGEAFTQCPMTTD 138 Query: 273 TEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + LI T + A L K + Sbjct: 139 HASLLNLLQGVRTDIASRGLIADGTAVGMGLANAVSRLKESKAK---------------- 182 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------- 373 K II LTDG NN +++ + AK I++ TI + + Sbjct: 183 --SKVIILLTDGSNNMG--DISPLTAAQIAKSLGIRVYTIGVGTNTVAPYPVTVGGTTQY 238 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L S+ Y N L ++ +I + Sbjct: 239 VNVPAEIDTKTLKDIAQSTDGGFYRATNNAELKEIYNDIDR 279 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 45/391 (11%), Positives = 102/391 (26%), Gaps = 57/391 (14%) Query: 41 IYVLDWHYKKNSMESANN-------AAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + + +E+A++ AA L L ++ Sbjct: 261 VEGSGFRRYRVQVEAASDGRVQNNEAAALIRVQGPPRILLVAQTAADARPLMTALTATGI 320 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVN--SSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + + + N M + D + + Sbjct: 321 IAELVSPEAAPRTLADLSTYDALVLVNTPARMLPVGLMQAIPGYVRDLGRGLLMIGGEDS 380 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + RP I V+D SGSM +DP Sbjct: 381 FGV----------GGYGRTAVEEALPVYMDVRNRELRPDLAIVFVIDKSGSMDACHCADP 430 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + K+ K+A++ + +G++ + + Sbjct: 431 ---DRGAPITSSSERKIDIAKDAIVQAAALLGPQD------TVGVVTFDGAASATFPATR 481 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G V Q + PT+ + +A ++L R K Sbjct: 482 G-ATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQVDARI------------------K 522 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 +I LTDG + + + + +E I + ++ + + L + +Y Sbjct: 523 HMILLTDGWGSGGD----QLDLAARLREQGITLTVVAAGSG-SAAYLKQLAAEGGGRYYP 577 Query: 392 VVNADSLIHVF-----QNISQLMVHRKYSVI 417 + + +F I +V + + + Sbjct: 578 AADMAEVPQIFVQETITAIGNYIVEQPFVPV 608 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 76/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ + + R ++ LV+D+S SM +M A + Sbjct: 80 TVAMAGPTHDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PNRMVAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F D + + +GLI Y + P+ E + + + + T Sbjct: 124 EAAKQFADELTPG------INLGLIAYAGTATVLVSPTTNREATKAALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + P I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAI----------ATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELNSVYVSLQQQIG 299 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 109/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + +D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F+ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 25/219 (11%), Positives = 66/219 (30%), Gaps = 50/219 (22%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + +M K L F+ + + + +GL+ + + +++ Sbjct: 106 AGDFRPQNRMHVAKEVLSEFIAN-------RVNDRIGLVVFAGAAYTQAPLTLDYGVLKE 158 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V + ++ T A+ + L + + + ++ +TD Sbjct: 159 VVKQLRTRVLEDGTAIGDALATSLNRLRDSEAK------------------SRVVVLITD 200 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISIN-----------------------ASPNG 375 G+NN+ K + + + A+ + I TI + N Sbjct: 201 GDNNSGKIS--PMDSANMAQALKVPIYTILVGKGGKVPFPQGTDLFGNTVWRDTEIPINP 258 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + + +Y + + L Q + + K Sbjct: 259 ELMQDIADRTGGEYYRATDPEQLREGLQKVLDSLERSKL 297 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 60/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + M T + Sbjct: 834 INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A F + + ++ +I +TDG++++ +T Sbjct: 894 AKALKHA------------NALFTEEHGSRINQNVKQMLIVITDGKSHDHDQLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + ++ I I + + Q+ L+ + V N D L VF + + M Sbjct: 939 SELRDKGITIFAVGVG--KANQKELEGMAGNKNNAIYVDNFDKLKDVFTLVQESMCTE 994 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/282 (12%), Positives = 83/282 (29%), Gaps = 28/282 (9%) Query: 135 NRLDSSNNTIFYNMDVMTS--YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + +I + Y + +L + + S L+ E + Sbjct: 364 EDVTVFALSIQGANNTQLEEIVSYPPEQTISMLKSYADLETYS--TKFLKKLQNEIWSQI 421 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKED 251 + +++ D++ + K +K +L + + Sbjct: 422 STYAEQRNLDKTGCVDTKEADIHFLIDGSSSIQEKQFEQIKRFMLEVTEMFS---IGPDK 478 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ Y+ E + + + I T + A+ QI+ + K Sbjct: 479 VRVGVVQYSDDTEVEFYITDYSNDIDLRKAILNIKQITGGTYTGRALDYILQIIKNGMKD 538 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++I LTDG + + ++ + + I I + I A Sbjct: 539 RM-------------SKVPCYLIVLTDGMSAD-----RVVEPAKRLRAEQITIHAVGIGA 580 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L+ E N D+L + + + + K Sbjct: 581 ANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICTEK 620 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVARDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDGSLVFHVENFDHLKA 792 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 38/185 (20%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ A+K F++ ++ +GL+ + R ++ V + + Sbjct: 112 STRLEAVKRVARSFVEE-------RQGDRIGLVLFANRAYVAAPLTFDLAAVGRAIEEAS 164 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A + +T + I+ L+DG++N Sbjct: 165 IGITGRSTAIADGLGLALKRVTESAA------------------ASRVIVLLSDGQDNAH 206 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-----------NGQRLLKTCVSSPEYHYNVV 393 + + ++ A + ++I TI++ + L +S Y V Sbjct: 207 QID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVR 264 Query: 394 NADSL 398 + L Sbjct: 265 GMEDL 269 >gi|218460583|ref|ZP_03500674.1| hypothetical protein RetlK5_14236 [Rhizobium etli Kim 5] Length = 309 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 61/268 (22%), Gaps = 23/268 (8%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +AA++A ++ + + + + + I+ N Sbjct: 3 LDAALIAAVKQIDNVEDADTLKKKVSDWFHAQVDNSYSLGEIEIDTANH---------NI 53 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM-TSYDYRLQFIEHLLNQRYNQKIVS 175 M +D S + + + +L Sbjct: 54 TATASGTVPTTFMKIANIESVDVSVASAVKGPATSYLNVYIVIDTSPSMLL--------- 104 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I + N + K + + +A+ Sbjct: 105 ----AATTSGQATMYAGIGCQFACHTGDAHTIGKKKYANNYEYSSEKKIKLRADVAGDAV 160 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 LD ID E + +GL + + + P+ T+ R + L + + Sbjct: 161 REVLDMIDESDANHERIKVGLYSLGDTLTEVLTPTLSTDTARTRLADASYGLTSSTSKAA 220 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + L + +K Sbjct: 221 TYFDVSLATLKQKVGAGGDGSTSGSPLK 248 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 63/207 (30%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + V + + K++ +K A+L + + + + ++ + Sbjct: 51 TGGDSSNDRPGLDLVAVLDVSGSMEGEKISKVKTAMLFMIKKLSSID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + +E ++ + ++ L T+ T +K +L + Sbjct: 105 SGDARRLCPLRQISETSQRELENLINGLKAEGATNITAGLKTGLNVLNDRRLSGGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 I+ ++DGE N + + + T + + L Sbjct: 164 --------------IMLMSDGEQNAGGD-------AAQVPVGNVPVHTFGFGINHEPRVL 202 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S +V N D+L F Sbjct: 203 KAIAQNSVGGTFSDVQNTDNLSKAFSQ 229 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 76/227 (33%), Gaps = 31/227 (13%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM+ + + + + +++ LK ++ + +D Sbjct: 69 PIDVVAVLDVSGSMNDPVAAAAAASPESNLQA----SRLDVLKASMKFIIRKLDDGD--- 121 Query: 250 EDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + K + + T PA+++A +IL Sbjct: 122 ---RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAVKILD 178 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + S FI+ LTDG++ AK + T Sbjct: 179 ERQGGSRN--------------HVGFILLLTDGDDTTGFRWTRDAIHGAVAK---YPVHT 221 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + AS + + LL S + V + D+L ++ ++ + K Sbjct: 222 FGLGASHDPEALLHIAQGSRGTYSFVDD-DNLANIAGALAVCLGGLK 267 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 33/397 (8%), Positives = 80/397 (20%), Gaps = 52/397 (13%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 KG ++ I+ +++ + L + + K +++A +AA L+GA Sbjct: 4 TKTIGKGRRRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVA 63 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDA-----------KRFIKNHIKESLSG-------- 108 + L R E + D F + + ++ Sbjct: 64 IQKEYDYLRARQEGLVTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDP 123 Query: 109 -----YSAVFYNTEIQNIVNSSRISMTHMANNR----LDSSNNTIFYNMDVMTSYDYR-- 157 Y + V + M ++ + + + Sbjct: 124 FVPVLTPGAQYVRVTVSDVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMVC 183 Query: 158 ------------LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV-DLSGSMH 204 L + + + L I + S Sbjct: 184 AEDLGEDNFGYPLNKMMAMKISSQQNTPIGPGNFQLIRLGDNTGAADIRRAMAGESAGDQ 243 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 D + + + + L + + + Sbjct: 244 FCFGYDGPNATEQVE--TEPGNTVGPVAQGLNTRMGRWQGPVNSDDHPQDWDTCVGE--- 298 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 ++ ++ D M Y + + Sbjct: 299 ---PLDVDSDGNITDKNGSPITIPDGTADYHAYMDTNYPQYYHNGWYNSVEGHSCPAAAD 355 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S F + G + + C N Sbjct: 356 GSDSFDRNSNSSAMGMEKR-REFSIVVADCGTGNNNG 391 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 69/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYQEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 NQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNAT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL S + +Y V L +F ++S + + Sbjct: 948 -AGANSWELLAM-AGSGDKYYFVETFGGLKGIFSDVSASVCN 987 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKYYFQPDTGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 516 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V + D+L ++ + Q + Sbjct: 558 KLREENIRVHAIGV--KEANQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEIC 610 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDPAL--- 742 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 743 --ALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 237 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 293 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 294 RIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 339 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 340 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 394 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 395 LATHNQTFLKKLRNQITHT 413 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/284 (12%), Positives = 70/284 (24%), Gaps = 19/284 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 + A V +LLIG + + + +N ++SA +A LAG + S + ++ Sbjct: 1 MLPLAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQA 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + DD + + E + + M + + Sbjct: 61 AAFFNANFNEDDL----GATETNFATSSADGGSLVEGIATTDVEMVLMNLFGVISVPINV 116 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLL--NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 T L + K + + ++ Sbjct: 117 ECSATMDIGNTDVTMVLDTTGSMSQTLSGTTTKRIDALRTAMKNFYDTVSAA----TTGS 172 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + + + VN + + L+ L GY Sbjct: 173 NARVRYSFVPYSSSVNVGQLI-------YDLDPDYLVDTWAIQSRTPVFNTVTEQILTGY 225 Query: 260 TTRVEKNIEPSWG--TEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 T V T + Y + +SL T + A Sbjct: 226 DTPVTTTASSYSNETTGNDQSYNSTRYNSLSACNTAKPADVAWA 269 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 47/158 (29%), Gaps = 33/158 (20%) Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE-- 340 T M ++ + D N + + IIF+TDG+ Sbjct: 479 ADALSANGSTYHDLGMLWGLRLSSPDGPWQAMVNETPENGG----EVSRHIIFMTDGQMD 534 Query: 341 ------------------------NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + + + + +CD AK ++ I+ + N Sbjct: 535 TNYKVMSTYGIEWHDRRITDDGVTDQDARHTLRFRALCDAAKAKGFRVWVIAFASDLNDD 594 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 L C S+ + NA L FQ I++ + + Sbjct: 595 --LSYCASASST-FPATNATELNTAFQEIAKNVAELRV 629 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D + K V G + Y E K D + T + Sbjct: 829 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTYTA 888 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ T Sbjct: 889 EALGFSDHMFTEARGSRLQ------------KGVPQVLIVITDGESHDADKLNAT---AK 933 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + ++ V L +F ++S + + Sbjct: 934 ALRDKGILVLAVGI-AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 987 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 47/183 (25%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + V +G ++ K + I Sbjct: 1013 DDFRKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++Q D + ++ LTDG++ + Sbjct: 1073 FGYTHIGAALRQVGHYFRPDMGSRINAG------------TPQVLLVLTDGQSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + + I ++ I + Q L + V N D L V + I + Sbjct: 1116 EVARAAEDLRHKGVDIYSVGIGDVDDQQ--LIQITGTAGKKLTVHNFDELTKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + + V +G + Y + E + T K + + T++ Sbjct: 456 TFLSEVVGMFNIAPQKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 516 AALNFTLGLLQKAK-------------KQRGNRVPCHLVVLTNGMSKDSI-----LEPAN 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I++ I + Q L+ + Y V + D+L + + Q + Sbjct: 558 RLREELIRVYAIGVK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEIC 610 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 64/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKTFMKNLVSKSQIGADRVQIGVVQFSDVNKEEFQLNRYMSQNEISNAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K ++F+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------AKGARPNVRRFLILITDGEAQDIVKDP----- 740 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ I I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 741 AVALRQEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQHIEDDLVFGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 57/187 (30%), Gaps = 25/187 (13%) Query: 233 NALLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 L FL+ S+ L + + +GL+ Y+ + S G K S Sbjct: 245 EYLKEFLEESVSALDIKEHCMRVGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGK 304 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ ++ F + + + + +T + + T Sbjct: 305 AYTGAAI-----------RKIRKEVFSARNGSRKNQGVPQIAVLVT----HRPSEDNVTK 349 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL----IHVFQNI 405 + + + I T+ I + + Q L+ S P + + L + + Sbjct: 350 AAVN-LRREGVTIFTMGIEGASDSQ--LEKIASHPAEQHVSKLKTFSDLAAHNQTFLKKL 406 Query: 406 SQLMVHR 412 ++H Sbjct: 407 RNQIMHT 413 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D + K V G + Y E K D + T + Sbjct: 809 DFMVDLVKKADVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTYTA 868 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ T Sbjct: 869 EALGFSDHMFTEARGSRLQ------------KGVPQVLIVITDGESHDADKLNAT---AK 913 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + ++ V L +F ++S + + Sbjct: 914 ALRDKGILVLAVGI-AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 967 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 47/183 (25%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + V +G ++ K + I Sbjct: 993 DDFRKMKEFLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQI 1052 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++Q D + ++ LTDG++ + Sbjct: 1053 FGYTHIGAALRQVGHYFRPDMGSRINAG------------TPQVLLVLTDGQSQD----- 1095 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + + I ++ I + Q L + V N D L V + I + Sbjct: 1096 EVARAAEDLRHKGVDIYSVGIGDVDDQQ--LIQITGTAGKKLTVHNFDELTKVKKRIVRN 1153 Query: 409 MVH 411 + Sbjct: 1154 ICT 1156 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + + V +G + Y + E + T K + + T++ Sbjct: 436 TFLSEVVGMFNIAPQKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTG 495 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 496 AALNFTLGLLQKAK-------------KQRGNRVPCHLVVLTNGMSKDSI-----LEPAN 537 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +E I++ I + Q L+ + Y V + D+L + + Q + Sbjct: 538 RLREELIRVYAIGVK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEIC 590 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 64/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 620 MKTFMKNLVSKSQIGADRVQIGVVQFSDVNKEEFQLNRYMSQNEISNAIDRMTHIGETTL 679 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K ++F+I +TDGE + + Sbjct: 680 TGSALTFVSQYFSP--------------AKGARPNVRRFLILITDGEAQDIVKDP----- 720 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ I I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 721 AVALRQEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQHIEDDLVFGICSP 777 Query: 413 K 413 + Sbjct: 778 R 778 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 57/187 (30%), Gaps = 25/187 (13%) Query: 233 NALLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 L FL+ S+ L + + +GL+ Y+ + S G K S Sbjct: 225 EYLKEFLEESVSALDIKEHCMRVGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGK 284 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ ++ F + + + + +T + + T Sbjct: 285 AYTGAAI-----------RKIRKEVFSARNGSRKNQGVPQIAVLVT----HRPSEDNVTK 329 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL----IHVFQNI 405 + + + I T+ I + + Q L+ S P + + L + + Sbjct: 330 AAVN-LRREGVTIFTMGIEGASDSQ--LEKIASHPAEQHVSKLKTFSDLAAHNQTFLKKL 386 Query: 406 SQLMVHR 412 ++H Sbjct: 387 RNQIMHT 393 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 38/185 (20%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ A+K F++ ++ +GL+ + R ++ V + + Sbjct: 112 STRLEAVKRVARSFVEE-------RQGDRIGLVLFANRAYVAAPLTFDLAAVGRAIEEAS 164 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A + +T + I+ L+DG++N Sbjct: 165 IGITGRSTAIADGLGLALKSVTESSA------------------ASRVIVLLSDGQDNAH 206 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-----------NGQRLLKTCVSSPEYHYNVV 393 + + ++ A + ++I TI++ + L +S Y V Sbjct: 207 QID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVR 264 Query: 394 NADSL 398 + L Sbjct: 265 GMEDL 269 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 22/364 (6%), Positives = 71/364 (19%), Gaps = 56/364 (15%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS---- 70 + + + ++ + + + + ++ + A L G + + Sbjct: 9 RRGRRGTILPLLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGV 68 Query: 71 -------NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF---------- 113 ++ S + + + + + + + Sbjct: 69 AYNNLPAAVTAAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGG 128 Query: 114 ----------YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + V+ M N + S + L Sbjct: 129 AMSAPGGNSWTAMRVTLGVSQPTYFMRVFGVNSMPSGAVATAVYR--PRDIAFVLDMTGS 186 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + + M P L+ + + +D +S Sbjct: 187 MAFSSTFNSGNAQGSSSDYQSMNPDP--LVPKAGHYTTVQSRIVAADNLANSSNEALPR- 243 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 F + + Y + T + + + Sbjct: 244 ------------NNFTITTPGGPPIVRSYYYDPSNWGTPSTVASPVTTKGDGSANLLPAF 291 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS--LPFQKFIIFLTDGEN 341 + + + + + F + P P +TD Sbjct: 292 HRWSPPESGADSD------NYIAPTYNFAGYNAFHKGNETTPQGPTPAPDTYGTMTDASG 345 Query: 342 NNFK 345 + Sbjct: 346 LTYT 349 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 49/371 (13%), Positives = 111/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + R+D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F++ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 79/245 (32%), Gaps = 29/245 (11%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + LV+D+SGSM ++ N+ + ++ K++ F++ Sbjct: 391 GVTGVPLKQGTDVVLVIDVSGSMDWDVDGKQTTDNT--------KKRITIAKDSAKQFVN 442 Query: 241 SI--DLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRD--MDSLILKPTDS 294 + + K + + ++ +++ N + + + + + TD Sbjct: 443 QLFANNEDGSKSNNRVSVVIFSSSGYTNGILCSLKNVDNKQTVIDAIDGISNNPTGGTDY 502 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 AM A Q+L + K + P + G + + K Sbjct: 503 DNAMTMAEQVLETVKDTTRNKAVLFMSDGAPENGYNGKT-----GYDIYPDAFKAHEKSS 557 Query: 355 DKAKENFIKIVTISINASPNGQRLLK----------TCVSSPEYHYNVVNADSLIHVFQN 404 + I T+S + + L S+ + N + + L + F N Sbjct: 558 EIKNNYGATIYTVSFGLKGSQYKELTEDRCRQILRDYMASNENCYKNANSKEDLENAFTN 617 Query: 405 ISQLM 409 I+ + Sbjct: 618 IATAI 622 >gi|118348690|ref|XP_001007820.1| U-box domain containing protein [Tetrahymena thermophila] gi|89289587|gb|EAR87575.1| U-box domain containing protein [Tetrahymena thermophila SB210] Length = 790 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 39/268 (14%), Positives = 90/268 (33%), Gaps = 29/268 (10%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEM-------GERPIFLIELVVDLSGSMHCAMNSDPEDV 214 +Q + ++++ +R I V+D+SGSM Sbjct: 100 GSNNQSNSSQLKLQINRFNDQVKISIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSKG 159 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + T + +K+++ ++++D + L+ + T K + + E Sbjct: 160 DIESNGL----TILDLVKHSVKTIINNLDERD------RLSLVAFHTNAYKITDLTPMNE 209 Query: 275 KVRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 R + + L T+ + QA +++ + +++S R I Sbjct: 210 NGRNHAIKELEKLIPLDSTNIWDGIYQALEVVKAGQQQSIQKGEQRVAFSQ--------I 261 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYHYN 391 + TDG+ N + + +EN + I T + + + L + + Sbjct: 262 LLFTDGQPNVIPPRGHLPMLKKYKEENDVNCSISTFGFGYNLDSELLDQLAIEGRGSFAF 321 Query: 392 VVNADSLIHVF-QNISQLMVHRKYSVIL 418 + + + VF +S LM +L Sbjct: 322 IPDGQFVGTVFVNALSNLMTTLAVDAVL 349 >gi|254780135|ref|YP_003064548.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] gi|254039812|gb|ACT56608.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 37/414 (8%), Positives = 111/414 (26%), Gaps = 38/414 (9%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + F +I A+ + SF+ ++ ++ + K ++ + S Sbjct: 13 RLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTALLQEVLDH-----VIYRTSPK 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + R N + + + + S S V + VNS + Sbjct: 68 NLYDLREAGRDNFIRHQIEKALNTYNSRDLSNIGSIESIVKDAVILTKNVNS-------L 120 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + + + +Q + +S ++ + + Sbjct: 121 PLQFTVDIALSTTVQLRGSL---------LQMFSQSKGKVDISRRKKVM--YKQNIGLMI 169 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + D A D + N + + K Sbjct: 170 MPFAWD---GYWLASRGKVADSQVHRPKYLEYSHYYQQYLNR-NTLVKNFLSQIPYKNFC 225 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + D + + + +TS++ Sbjct: 226 MAPYHYKSIFYWAFENFQYHQDNNMLIFYSHRDFYRITWDHFPYSFIKNVFDMTSNQFGD 285 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENF------ 361 K+++ G + S ++ C ++ Sbjct: 286 GQVLTNTNHCFPHGASHSKYMLMFAIGNQLSRSSVGKETIDRILQDCYYMHKHHRTGRGA 345 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 I I ++ + + + L+ C S P +Y + + ++++ + +++++ ++ +S Sbjct: 346 ITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVITNWFS 399 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 78/220 (35%), Gaps = 32/220 (14%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +F +++D SGSM V + + K A ++ Sbjct: 92 SNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKLASKNYVK----- 146 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 D G++ +T+ E + + + + T+ A+K++ +IL Sbjct: 147 -AQGNDDRTGIVLFTSEANTVCELTNSEYVLMNAIDKI---YSNGGTNFNNAIKESIRIL 202 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T+ +K I+ ++DGE + + + D A EN IKI Sbjct: 203 TN-----------------TRNDSEKRILLVSDGE------SELSSSVIDLAIENNIKIN 239 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T+ I N + L + ++ V AD LI+++ I Sbjct: 240 TVYIGGQNNNELLKNVAERTGGKYFKAVTADELINIYSEI 279 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 54/183 (29%), Gaps = 12/183 (6%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ ++ +D + + +G++ Y+ V +++ T + R+ + S Sbjct: 990 RIELVRETCHFLIDQLTADDY------LGIVSYSNTVREDVPLLRMTPEARRLAHTMISS 1043 Query: 287 LI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF- 344 L T ++ + + G S TDG+ Sbjct: 1044 LTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCF-LFTDGQATTGP 1102 Query: 345 ---KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + I + T + + L + +Y + AD + Sbjct: 1103 CTVNEIMGQMTSLQSPADQNITVHTFGFGDDHSVELLQGVAEAQSGVYYYISCADDIPSG 1162 Query: 402 FQN 404 F + Sbjct: 1163 FGD 1165 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 73/225 (32%), Gaps = 52/225 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+S+ + D K ++ ALK F+ + + +GL+ Y + Sbjct: 98 DVSSSMLANDLKPNRLEALKKVAATFVQ-------DRINDRIGLVVYAGESYTRTPVTSD 150 Query: 273 TEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + Q + DS+I T + A + K + Sbjct: 151 KTIILQSLKSVEFDDSIIADGTGIGVGLATAINRIKDSKAK------------------S 192 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------------- 373 + II LTDG NN+ + AKE IK+ TI I + Sbjct: 193 RIIILLTDGVNNSGTID--PRTAASIAKEYGIKVYTIGIGTNGKAMFPVAKDANGKLVFK 250 Query: 374 ------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + + ++ ++ + L ++ I++L Sbjct: 251 MMPVEIDEKLMQEIAKNTDAKYFRATSNKKLQAIYDEINKLETTE 295 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 111/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + + + R I + + + Sbjct: 101 AALLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGEGGRARSPDISKVVIVVTDGRPQDSV 160 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S I + + R+D + + DY + + Q+ Sbjct: 161 RDVSARSRASGIELFAIGVGRVDKATLREIASEPQDEHVDYVESYSVIEKLSKKFQEAFC 220 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 221 VVSDLCATGDHDCEQVCVSSPASYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 280 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F++ I D L + +GL+ Y++ V + K Sbjct: 281 GSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 340 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 341 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 387 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 388 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 440 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 441 QIGKKLQKKIC 451 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 53/166 (31%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V++ K + T + Sbjct: 62 FLSQVIESLDVGPNATRVGLVNYASAVKQEFPLRAHGSKAALLQAVRRIQPLSTGTMTGL 121 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A S +G + S K +I +TDG + +V+ Sbjct: 122 AIQFAITRAFS----------EGEGGRARSPDISKVVIVVTDGRPQDSVRDVSARS---- 167 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + + I++ I + + L+ S P E+ V + + Sbjct: 168 -RASGIELFAIGVGRVD--KATLREIASEPQDEHVDYVESYSVIEK 210 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 72/233 (30%), Gaps = 47/233 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + +D+SGSM + + R +++ K+ L F+ Sbjct: 87 STKGVDIMIALDISGSMVNSYD-------------FLPRNRLSVSKDLLREFVKK----- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQIL 305 + +G++ + S + + + I + T A+ + L Sbjct: 129 --RLYDRIGIVVFAGAAYLQSPLSSDRFALDELIAGTSSEDIEEQGTAVGDALVLSSYRL 186 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + + K II LTDG +N K + K IK+ Sbjct: 187 KNSEAK------------------SKVIILLTDGVSNTGKLD--PDTAAYTTKTMGIKVY 226 Query: 366 TISIN-----ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ-LMVHR 412 I I N + L K ++ + + + L V I Q +V Sbjct: 227 CIGIGKEEGQYEINYESLQKISSNTNGKFFRAESPEVLESVLNEIDQLEVVEL 279 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + K V G + Y E K+ D + T + Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTA 888 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ + T Sbjct: 889 EALAFSNHMFTEARGSRL------------HKGVPQVLIVITDGESHDAEKLNAT---AK 933 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL S + +Y V L +F ++S + + Sbjct: 934 ALRDKGILVLAVGI-AGANSWELLAM-AGSGDKYYFVETFGGLKGIFSDVSASVCN 987 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKYYFQPDTGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 59/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 516 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V + D+L ++ + Q + Sbjct: 558 KLREENIRVHAIGV--KEANQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEIC 610 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDPAL--- 742 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 743 --ALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 237 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 293 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 294 LIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 339 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 340 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 394 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 395 LATHNQTFLKKLRNQITHT 413 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 59/428 (13%), Positives = 111/428 (25%), Gaps = 83/428 (19%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + +A + A + + + D D + + +S+S Sbjct: 43 ISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWTDKSVSTQEVT 102 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + ++ + + + + + S+ I N+ L + Sbjct: 103 LKTYDDNHVTVAPKDGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGD 162 Query: 172 -------KIVSFIPALLRIEMGERPIFLIELV------VDLSGSMHCAMNSDPEDVNSAP 218 + I R + D GS H + A Sbjct: 163 DEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAA 222 Query: 219 ICQDKKR-------TKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI 267 Q R T+M ALK A+ F+D +S + +GL+ Y + V Sbjct: 223 HTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRS 282 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + ++ V D T + M+ A +L + Sbjct: 283 GLTDSLSGLKSTVD---DLKASGATRADLGMQTANTVLGN-----------------ARA 322 Query: 328 PFQKFIIFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPN-------- 374 K +IF TDG+ + I K N + ++ I N Sbjct: 323 DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 382 Query: 375 ---------GQRLLKTCVS----------------SPEYHYNVVNADSLIHVFQNISQLM 409 ++ S + Y+ +AD+L VF I + Sbjct: 383 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 442 Query: 410 VHRKYSVI 417 S I Sbjct: 443 SSNPTSPI 450 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 31/249 (12%), Positives = 72/249 (28%), Gaps = 76/249 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D K ++ K+ + ++ + Sbjct: 91 DIVFAIDVSKSM---------------LAEDVKPNRIEKAKHIISQLIEVL-------HG 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 + I Y + + + + ++ ++ + T A++ A + Sbjct: 129 DRVAFIPYAAQAYPQLPLTSDYSSAKIFLEGINTNMLSSQGTAIGEAIQMAINYFEESSQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K +I L+DGE++ + + +AK+ I++ TI + Sbjct: 189 T------------------SKILIILSDGEDHQQGVDT----VIQEAKDKGIRLFTIGLG 226 Query: 371 ASP---------------------------NGQRLLKTCVSSPEYHYNVVNA----DSLI 399 + N L + ++N N D+L Sbjct: 227 TAQGATIPVSENGQIVAKRDNNGQVVITKLNQALLEEIAQEGGGKYFNGANTKEVLDALQ 286 Query: 400 HVFQNISQL 408 I + Sbjct: 287 KALDTIEKN 295 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 73/251 (29%), Gaps = 67/251 (26%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + I + +D+S SM + +D + ++ A KN F+ Sbjct: 81 DVSTQTSSTQGIDIVMAIDVSASM---------------LARDLQPNRLDATKNVAEEFI 125 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAM 298 + +GL+ Y + V + ++++ T + Sbjct: 126 Q-------DRPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENGTAIGSGL 178 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + S K II LTDG NN + + A Sbjct: 179 ATAVNRI------------------KDSDAESKVIILLTDGVNNAGFID--PSTASELAV 218 Query: 359 ENFIKIVTISINASP------------------------NGQRLLKTCVSSPEYHYNVVN 394 E IK+ TI + ++ + L + ++ ++ N Sbjct: 219 EFGIKVYTIGVGSNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKYFRATN 278 Query: 395 ADSLIHVFQNI 405 + L ++ I Sbjct: 279 NEKLEEIYAEI 289 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 81/227 (35%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P+ + L++D+SGSM T++ A+KN++ F+ Sbjct: 92 QKPMRDLVLILDVSGSMAKNDVPGG-------------ITRLQAVKNSVSKFVA------ 132 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + S + ++ +T+ ++ + T A+ A ++L Sbjct: 133 -ARQSDRIGLVIFANQAWPFAPVSEDKQALQTRITQLSPGMVGEQTAIGDALGVAVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + K I LTDG + + A +++ T Sbjct: 192 SS----------------ANQDASKLAILLTDGNDTASQLAPPLAAQLAAAH--HVQVHT 233 Query: 367 ISINASPNGQ-------RLLKTCVSSPEYHYN-VVNADSLIHVFQNI 405 I+ S + +L + + + + SL V+Q I Sbjct: 234 IAFGDSNSAGSDHVDLTQLQEIARITGGKSWTAANSGASLDSVWQEI 280 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 79/247 (31%), Gaps = 36/247 (14%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + P + + V+D+SGSM+ + V +A + + +++ Sbjct: 51 QVLLRVEAPPAADLN--SHVPLDVVAVLDVSGSMN-------DPVAAASPKSNLQGSRLD 101 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDS 286 LK ++ + + + ++ + K + + Sbjct: 102 VLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRL 155 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T PA+++A +IL + S FI+ LTDG++ Sbjct: 156 QARGGTALMPALEEAVKILDERQGSSRN--------------HVGFILLLTDGDDTTGFR 201 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 K + T + AS + + LL S + V + D+L ++ ++ Sbjct: 202 WTRDAIHGAVFK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD-DNLANIAGALA 257 Query: 407 QLMVHRK 413 + K Sbjct: 258 VCLGGLK 264 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 72/195 (36%), Gaps = 21/195 (10%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ +++ +N + + + + + +GL+ + T V S+ Sbjct: 89 DLDTSGSMRERGFNPNDLQQNRWNVVNNVVQDFIEKRVNDNIGLVVFGTSVLTASPLSFD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ + ++ + T ++ + IL S Sbjct: 149 KNSQKEIIKYIDIGIVGEQTAMFDSLATSINIL------------------KNSKAKSNI 190 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 II LTDGE+N K I AK+ IKI TI I S N Q L + + Sbjct: 191 IILLTDGEDNASKIP--PQIILKLAKKYKIKIYTIGIGES-NRQMLSTISQETGAKSFLA 247 Query: 393 VNADSLIHVFQNISQ 407 + D L+ V+ I++ Sbjct: 248 NSKDDLVEVYNTINK 262 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 118/411 (28%), Gaps = 74/411 (18%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKK-----NSMESANNAAILAGASKMVSNLSRLGDRFE 80 FA++++ L G K +++A+ L M + + Sbjct: 26 FAVTMVVALAGCGLS---SSPELAKPAEPVAELQTASPQGALVKRMAMPAPMRMQESAAM 82 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + + + + +S +S N+ + L Sbjct: 83 DYRSEPREQYANLPDNPVHRVAETPVSTFSVDVDTGSYANVRR-------FLNQGSLPPE 135 Query: 141 NNTIFYNMDVMTSYDYRLQFIEH----------LLNQRYNQKIVSFIPALLRIEMGERPI 190 M Y Y L Q + I A + E Sbjct: 136 GAVRLEEMVNYFPYHYALPTDGSPFGVTTEVAATPWNPRTQLLRIGIKASD-RPVAELAP 194 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +VD+SGSM +R + +K+ L L +D + V Sbjct: 195 ANLVFLVDVSGSM-------------------DRREGLPLVKSTLKLLVDQLREQDRVS- 234 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 L+ Y ++P+ G +KV+ T ++ AYQ+ Sbjct: 235 -----LVVYAGESRVVLKPTSGRDKVKIR-NAIDQLTAGGSTAGASGIELAYQMAREGFI 288 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISI 369 + I+ TDG+ N S+ + ++ + +++ + + T+ Sbjct: 289 DNGIN----------------RILLATDGDFNVGISDFDSLKQMAVEQRKSGVSLTTLGF 332 Query: 370 NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + L++ + + + N L + + + +V+ + Sbjct: 333 GVDNYNEHLMEQLADAGDGNYAYIDN---LREAHKVLVDQLSST-LAVVAR 379 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 59/428 (13%), Positives = 111/428 (25%), Gaps = 83/428 (19%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + +A + A + + + D D + + +S+S Sbjct: 29 ISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWTDKSVSTQEVT 88 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + ++ + + + + + S+ I N+ L + Sbjct: 89 LKTYDDNHVTVAPKDGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGD 148 Query: 172 -------KIVSFIPALLRIEMGERPIFLIELV------VDLSGSMHCAMNSDPEDVNSAP 218 + I R + D GS H + A Sbjct: 149 DEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAA 208 Query: 219 ICQDKKR-------TKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI 267 Q R T+M ALK A+ F+D +S + +GL+ Y + V Sbjct: 209 HTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRS 268 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + ++ V D T + M+ A +L + Sbjct: 269 GLTDSLSGLKSTVD---DLKASGATRADLGMQTANTVLGN-----------------ARA 308 Query: 328 PFQKFIIFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPN-------- 374 K +IF TDG+ + I K N + ++ I N Sbjct: 309 DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 368 Query: 375 ---------GQRLLKTCVS----------------SPEYHYNVVNADSLIHVFQNISQLM 409 ++ S + Y+ +AD+L VF I + Sbjct: 369 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 428 Query: 410 VHRKYSVI 417 S I Sbjct: 429 SSNPTSPI 436 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 78/233 (33%), Gaps = 38/233 (16%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 LVVD+S SM + + + A+ +F+D + ++ Sbjct: 37 NNMDAVLVVDVSNSMTQSDKNKVSN-------------------EAMKMFVDMTSIQAN- 76 Query: 249 KEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 +G++ YT ++E+ +E+ + + +DSL TD + +A +IL Sbjct: 77 ----KVGVVAYTDKIEREKALLEINSEEDKNDIKAFIDSLQKGAYTDIAVGVTEAVKILD 132 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + KS+ + +AK+ + T Sbjct: 133 AGRNPNNAPIIVLLADG--------NNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYT 184 Query: 367 ISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I +NA N L + + + AD L + I H K V+ Sbjct: 185 IGLNADGQLNRTTLQQIAAETNGKFFETSTADKLPQILSEIFAN--HLKLKVV 235 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 71/214 (33%), Gaps = 45/214 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D + T++ A KNA+ + + +G++ + + + Sbjct: 95 DGSESMAATDYEPTRLDAAKNAINNLILKMGPQ------HNVGVVLFESGATTVSYLTPD 148 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 EK ++ L T + + +S + Sbjct: 149 KEKSVNAISSIEQGL--GATAIGDGLALGVDMASSI------------------PDKKGV 188 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-----------------ASPNG 375 +I L+DG +N+ + + AK N ++I TI + A + Sbjct: 189 VILLSDGVHNSGLVT--PEEATEYAKINNVQIHTIGLGSIEPVFLRDDIYGEPQYAELDE 246 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L+ + +Y ++ +L +F N+S + Sbjct: 247 ETLVIIAQQTSGNYYKSLDEQTLNEIFVNLSSNL 280 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 29/351 (8%), Positives = 86/351 (24%), Gaps = 33/351 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R+ ++ + ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 1 MNRAARYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + + + + + + Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFV---FFQKTSVQMSTNANVTFSDSLTNPFLT 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + SN ++ L + A+ Sbjct: 118 KNAVTTPANIKYVQCTATLSNIAHWF-----IEVLNVL------PGTKLANAAEVSASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + + + AAL Sbjct: 167 ATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGSSSTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P T ++ R ++ T+ Sbjct: 218 -----DGSTNETTLASELSGNTCNITSPPDLATTGMKSASQRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 Y + + F + DG + Sbjct: 273 QPDFTGYAYVGPNYGPPGTAGIKGDAYTQFVADRASFKPYQGDGAPPSGSG 323 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 59/428 (13%), Positives = 111/428 (25%), Gaps = 83/428 (19%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + +A + A + + + D D + + +S+S Sbjct: 43 ISAAMIVTVALVAIPAQAAGNMIADPTTFTQWEQGIGEPTDPRSTGRVWTDKSVSTQEVT 102 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + ++ + + + + + S+ I N+ L + Sbjct: 103 LKTYDDNHVTVAPKDGSFLVGLSAMSSAQKLIGVSNVTKPLDIVLVLDTSGSMAWGMDGD 162 Query: 172 -------KIVSFIPALLRIEMGERPIFLIELV------VDLSGSMHCAMNSDPEDVNSAP 218 + I R + D GS H + A Sbjct: 163 DEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVTPKTSAADSDAA 222 Query: 219 ICQDKKR-------TKMAALKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI 267 Q R T+M ALK A+ F+D +S + +GL+ Y + V Sbjct: 223 HTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGLVTYASDVNTRS 282 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + ++ V D T + M+ A +L + Sbjct: 283 GLTDSLSGLKSTVD---DLKASGATRADLGMQTANTVLGN-----------------ARA 322 Query: 328 PFQKFIIFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPN-------- 374 K +IF TDG+ + I K N + ++ I N Sbjct: 323 DASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 382 Query: 375 ---------GQRLLKTCVS----------------SPEYHYNVVNADSLIHVFQNISQLM 409 ++ S + Y+ +AD+L VF I + Sbjct: 383 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 442 Query: 410 VHRKYSVI 417 S I Sbjct: 443 SSNPTSPI 450 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 70/239 (29%), Gaps = 51/239 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++ LV+D+S SM ++ A K A F D + Sbjct: 95 NRAVVMLVIDVSESMAATDVP---------------PDRLTAAKEAGKQFADELTPA--- 136 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ V+ + T + + A Q + Sbjct: 137 ---INLGLVEFAANASLLVSPTTNRAAVKAAIDSLK---PAPKTATGEGLFTALQAI--- 187 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 + P I+ +DG N A+ ++I Sbjct: 188 -------ATVGSVMGGGDGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQIS 240 Query: 366 TISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 TIS + Q L K C + ++ + +SL +V+ + + + Sbjct: 241 TISFGTPYGTVEYEGATIPVPVDDQTLQKICEITDGQAFHADSLESLKNVYSTLQRQIG 299 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 30/353 (8%), Positives = 86/353 (24%), Gaps = 33/353 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R ++G+ ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 1 MSSAARHPKLTRRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + ++ + + Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAGHANFA---FFQQNAVQMQTDSNVTFSDSLTNPFLT 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + ++ + L + + A+ Sbjct: 118 KTAVATPANVK--------YVKCTAQLSNIAHWFIEVLNT---IPGLQVANAAQVAASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + S + + + AAL Sbjct: 167 ATVGAGQTTCAIPVFVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P G+ ++ R ++ T+ Sbjct: 218 -----DGSTNEPTIASELSGNTCNITSPPDLGSTGLKSASLRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 Y + L F + DG Sbjct: 273 QPDFTGYAYVGPKYGPPGTAGIVGDAYTQFVLDRASFKAYQGDGTPPTGSGIS 325 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 66/220 (30%), Gaps = 52/220 (23%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ K L F+ + + +GL+ + + VR+ Sbjct: 72 AGDFRPQNRLHVAKEVLAEFI-------SNRVNDRIGLVVFAGAAYTQAPLTLDYGVVRE 124 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ T A+ + L + + + ++ +TD Sbjct: 125 VLKQIRTRVLEDGTAIGDALATSLNRLRDSEAK------------------SRVVVLITD 166 Query: 339 GENNNFK-SNVNTIKICDKAKENFIKIVTISINASPN----------------------G 375 G+NN K S ++ + + K I I TI + Sbjct: 167 GDNNAGKISPLDAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVWRDTEIPIN 223 Query: 376 QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LL+ S + +Y + + L Q + + K Sbjct: 224 PELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKL 263 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 109/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + +D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 222 VVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLID 281 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F+ I D L + +GL+ Y++ V + K Sbjct: 282 GSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 341 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 342 VRNMSYMEKSTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 388 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 389 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 441 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 442 QIGKKLQKKIC 452 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G++ Y + V++ K + T + Sbjct: 63 FLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGL 122 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A +G + S K +I +TDG + +V+ Sbjct: 123 AIQFAIT----------KAFGDAEGGRSRSPDISKVVIVVTDGRPQDSVQDVSAR----- 167 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + +++ I + + + L+ S P E+ V + + Sbjct: 168 ARASGVELFAIGVGSVD--KATLRQIASEPQDEHVDYVESYSVIEK 211 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 58/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + K V G + Y E K+ D + T + Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTA 888 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+N+ + T Sbjct: 889 EALAFSDHMFTEARGSRL------------HKGVPQVLIVITDGESNDAEKLNTT---AK 933 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + +Y V L +F ++S + + Sbjct: 934 ALRDKGILVLAVGI-AGANSWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKYYFQPDMGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 516 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V D+L ++ + Q + Sbjct: 558 KLREENIRVHAIGV--KEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEIC 610 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDP----- 740 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 741 AIALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 237 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 293 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 294 RIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 339 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 340 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 394 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 395 LATHNQTFLKKLRNQITHT 413 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 33/351 (9%), Positives = 88/351 (25%), Gaps = 33/351 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R ++G+ ++ +II LS+ + +G + + + ++ ++++ +A Sbjct: 1 MNSAARHPKVTRRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + +++ S T + Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFV----------FFQKTSVQMSTNANVTFSDS 110 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + N + ++ + L L + A+ Sbjct: 111 LTNPFLTKSAVTKPASIK-YVQCTATLSNIAHWFIEVLNV---LPGTKLANAAEVSASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + + + AAL Sbjct: 167 ATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGSSSTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P T ++ R ++ T+ Sbjct: 218 -----DGSTNETTLASELSGNTCNITSPPDLSTTGLKSAAQRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 Y + + F + DG + Sbjct: 273 QPDFTGYAYVGPNYGPPGTPGIKGDAYTQFVADRASFKPYQGDGAPPSGSG 323 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 68/180 (37%), Gaps = 25/180 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +KN +L L+ ++ + LI + + ++ + ++ + + Sbjct: 172 SKIENVKNTILQLLEMLNEND------RLSLITFNSYAKQLCGLKKVSNLNKETLQAITN 225 Query: 286 SL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-- 342 S+ T+ T ++ A+QIL S KK++ ++ I L+DG+++ Sbjct: 226 SIKAYGGTNITSGLEIAFQILQSRKKKNSVSS----------------IFLLSDGQDDGA 269 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + K + +E I + S + + + K Y V D + F Sbjct: 270 DTKIKNLLKITYQQLQEESFTIHSFSFGSDHDCPLMQKIAQIKDGSFYFVEKNDQVDEFF 329 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 76/227 (33%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + L++D+SGSM D T++ A++ ++ F+ Sbjct: 92 IKPMRNMVLILDVSGSMEKNDAQDG-------------ITRLQAVQRSVRAFVAQ----- 133 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + + S + + + + +I + T A+ A ++L Sbjct: 134 --RKTDRIGLVIFASSAWPFAPISEDKQALLARINQLAPGMIGQQTAIGDALGVAVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + I LTDG + + + A + +++ T Sbjct: 192 SSL----------------DRDASRLAILLTDGNDTASQLSPAL--AAQLAASHHVQVHT 233 Query: 367 ISIN-------ASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNI 405 I+ + L + + + +L V+Q I Sbjct: 234 IAFGDINSSGEDKVDTALLKQIAQLTGGEALQASTSGKALDSVWQQI 280 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 49/371 (13%), Positives = 111/371 (29%), Gaps = 39/371 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 101 AALLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRSRSPDISKVVIVVTDGRPQDSV 160 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + R+D + + DY + R Q+ Sbjct: 161 QDVSARARASGVELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 220 Query: 176 FIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + L + + + +NSD + N T + L + Sbjct: 221 VVSDLCATGDHDCEQVCISSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFLID 280 Query: 234 A-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + F++ I D L + +GL+ Y++ V + K Sbjct: 281 GSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAA 340 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + K T + A+K +F QK I TDG + Sbjct: 341 VRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRS 387 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 388 QDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTIN 440 Query: 400 HVFQNISQLMV 410 + + + + + Sbjct: 441 QIGKKLQKKIC 451 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/351 (8%), Positives = 85/351 (24%), Gaps = 33/351 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+ +R+ ++ + ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 1 MNRAARYPKVTRRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + K + S Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHLNFVFFQKKSVQMSTNANVTFSDSLTNPFLTKNA 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + I ++ + L L + A+ Sbjct: 121 VTTPANIKYVQC-----------TATLSNIAHWFIEVLNV---LPGTKLANAAEVSASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + + + AAL Sbjct: 167 ATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGSSSTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P T ++ R ++ T+ Sbjct: 218 -----DGSTNETTLASELSGNTCNITSPPDLATTGMKSASQRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 Y + + F + DG + Sbjct: 273 QPDFTGYAYVGPNYGPPGTAGIKGDAYTQFVADRASFKPYQGDGAPPSGSG 323 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 32/358 (8%), Positives = 87/358 (24%), Gaps = 33/358 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R ++G+ ++ +II L++ + +G + + + ++ ++++ +A Sbjct: 89 MSSAARRPKAVRRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 148 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + F K + ++ Sbjct: 149 ALSAARDLTSAISLSVAEA--------NGIAAGHMNFAFFQQKPVQMLTDSNVTFSDSLT 200 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S+ ++ AN + ++ L + A+ Sbjct: 201 NPFLSKTAVATPANVKYVKCVTSLSNIAHWFIEVLNVL------PGTKLANAAEVSASAI 254 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + + + AAL Sbjct: 255 ATVGGGQTTCAIPVFVCRGPSDPPYKVGDWISSPSGSSSTYGPGNFGWAAL--------- 305 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P + ++ R ++ T+ Sbjct: 306 -----DGSTNETTIASELSGDTCNITSPPDLASTGLKSASLRAWNTRFGIYTNGANGSSG 360 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 Y + D + K+ + DG + Sbjct: 361 QPDFTGYAYVGRDYGPPGTAGTKGDAYTQFVIDRTKYKPYQGDGAPPSGSGISTNGTA 418 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 62/207 (29%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + V + K+ +K A+L + + + + ++ + Sbjct: 51 TGGDSSNDRPGLDLVAVLDVSGSMAGDKIEKVKTAMLFVIKKLSPID------RLSVVTF 104 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + TE ++ + + ++ L T+ T ++ ++L+ Sbjct: 105 SADANRLCPLRQITENSQKDLEKLINGLNADGATNITAGLQTGLKVLSDRSLSGGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 I+ ++DGE N + + + T + + L Sbjct: 164 --------------IMLMSDGEQNAGGD-------AAQVPVGNVPVYTFGFGINHEPRVL 202 Query: 379 LKTC-VSSPEYHYNVVNADSLIHVFQN 404 S +V + ++L F Sbjct: 203 KAIAHNSIGGTFSDVQDTNNLSKAFSQ 229 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 73/241 (30%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + LV+D+S SM T++ A + A F ++ Sbjct: 85 PRNRATVMLVIDVSLSMEATDVL---------------PTRLQAAQEAATSFARNMTPG- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GLI + + P+ V + + + T + + A Q + Sbjct: 129 -----INLGLISFAGTATVLVNPTTDRNGVIKAIENLKLAQ---STATGEGIFAALQSVE 180 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK---ICDKAKENFIK 363 S + I+ ++DG+ + AK+ + Sbjct: 181 SFSSLVGGADGP----------PPARIVLMSDGKQTVPEDLYAARGGYTAAQAAKQAGVP 230 Query: 364 IVTISINASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I +IS + + + L + S Y +A+ L V+ ++ + + Sbjct: 231 ISSISFGTTHGSVTIDDKPQPVSVDDESLREIARLSGGDFYKAASAEELKKVYADLGEQI 290 Query: 410 V 410 Sbjct: 291 G 291 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 70/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYQEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 NQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNTT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + +Y V L +F ++S + + Sbjct: 948 -AGANSWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKYYFQPDMGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 516 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V D+L ++ + Q + Sbjct: 558 KLREENIRVHAIGVK--EANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEIC 610 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDP----- 740 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 741 AIALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 237 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 293 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 294 RIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 339 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 340 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 394 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 395 LATHNQTFLKKLRNQITHT 413 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 71/228 (31%), Gaps = 30/228 (13%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + ++++++L+G + V + K+A +K A+L Sbjct: 30 TSINNNTAPLEESKLKVMLELTGGDSTNDRPGLDLVAVLDVSGSMAGDKIAKVKTAMLFV 89 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPA 297 + + + + ++ ++ + TE ++ + ++ L T+ T Sbjct: 90 IKKLSPID------RLSVVKFSADASRLCPLRQITEDSQKDLENLINGLNADGATNITAG 143 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ ++L S II ++DGE N + Sbjct: 144 LQTGLKVLNDRSLSSGRVVG---------------IILMSDGEQNAGGD-------AAQV 181 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQN 404 + + T + + L +S +V N D+L F Sbjct: 182 PIGNVPVYTFGFGINHEPRVLKAIANNSMGGTFSDVQNTDNLSLAFSQ 229 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 58/171 (33%), Gaps = 17/171 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ V G + Y+ E + + + M T + A+K A Sbjct: 841 VKKADVGRDRVQFGALKYSDNPEILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA 900 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F + + ++ +I +TDGE+++ +T K ++ Sbjct: 901 ------------NALFTEEHGSRINQNVKQMLIVITDGESDDRVELNDT---AAKLRDKG 945 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I I + + + Q+ L+ + V N D L ++ + + M Sbjct: 946 ITIFAVGVGKAD--QKELEGMAGNKNNTIYVDNFDKLKDIYAPVQESMCTE 994 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/281 (11%), Positives = 83/281 (29%), Gaps = 26/281 (9%) Query: 135 NRLDSSNNTIFYNMDVMTS--YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + +I + Y + +L + + S L+ E + Sbjct: 364 EDVTVFALSIQGANNTQLEEIVSYPPEQTISMLKSYADLETYS--TKFLKKLQSEVWSQI 421 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + M++ D++ + + L+ ++ S + V Sbjct: 422 STYAEERNLDKTGCMDTKEADIHFLIDGSTSI--REEQFEQIKKFMLEVTEMFSIGPDKV 479 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ ++ ++ + + + + T + A+ Sbjct: 480 RVGVVQFSDKMRVEFSITDYSNDIDLRKAILNIQQLTGDTHTGEAL-------------D 526 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 F + G+K ++I LTDG++ ++ + + I + I I Sbjct: 527 FILPRIKNGIKERMSQVPCYLIVLTDGKSQYS-----VVEPAKRVRAEQITVHAIGIG-- 579 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ L+ E N D+L + + + K Sbjct: 580 EANKKELQEIAGKEERVSFGQNFDALKSIKNEVVHGICTEK 620 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 51/159 (32%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ E+ + + + D SLI K T A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDDPEEEFQLNTYFTQQEISDAIDRMSLIDKGTLMGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K +KF+I +TDG + + + Sbjct: 715 DQYFT--------------HSKGARFGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + + L+ + V N D L Sbjct: 756 VTIFSVGV-YGADRSQ-LEEISGDGSLVFYVENFDHLQA 792 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/250 (11%), Positives = 64/250 (25%), Gaps = 31/250 (12%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + N V + I V S + ++ + +C Sbjct: 964 MAGNKNNTIYVD----------NFDKLKDIYAPVQESMCTESPEVCNLQEADVIFLCDGS 1013 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 R + D ID + + +G+ + + + I+ K + Sbjct: 1014 DRVSNSDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSDYQNIIDLKSSLNKTQWKTQIQ 1073 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 S A+K+ + + ++ +T G+ Sbjct: 1074 TVSKSSGFPRIDYALKKVGNMFNIRAGGRRNAG------------VPQTLVVITSGDPRY 1121 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI--HV 401 N K+ I ++ + I Q L + E + D L V Sbjct: 1122 -----NVAGAVKALKDLGICVLVLGIGDVHKEQLL--PITGNSEKIITFQHFDRLKNVDV 1174 Query: 402 FQNISQLMVH 411 + I + + Sbjct: 1175 KKRIVREICQ 1184 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 77/226 (34%), Gaps = 48/226 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + +++AA+K F+ + Sbjct: 102 IMLALDLSGSMERVDFS----------IDGRNVSRLAAVKQVGADFIRR-------RAGD 144 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + + S+ T V + L+ + T + A + L + R Sbjct: 145 RIGLVIFADQADVAASLSFDTAAVVHALEEAQIGLVGRSTGIGDGLGLALKRLDAATARE 204 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 K +I L+DG NN + + A++ I++ TI++ Sbjct: 205 ------------------KVVILLSDGANNAGQ--TTPHDVAGLARDLGIRVHTIALGPR 244 Query: 371 --ASPNGQ-------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + +G L +S + V D L V I + Sbjct: 245 DLSDADGDPDVVDTEALRDVSATSGGRFFRVRTTDDLAAVADAIDE 290 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 66/220 (30%), Gaps = 52/220 (23%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ K L F+ + + +GL+ + + VR+ Sbjct: 54 AGDFRPQNRLHVAKEVLAEFI-------SNRVNDRIGLVVFAGAAYTQAPLTLDYGVVRE 106 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ T A+ + L + + + ++ +TD Sbjct: 107 VLKQIRTRVLEDGTAIGDALATSLNRLRDSEAK------------------SRVVVLITD 148 Query: 339 GENNNFK-SNVNTIKICDKAKENFIKIVTISINASPN----------------------G 375 G+NN K S ++ + + K I I TI + Sbjct: 149 GDNNAGKISPLDAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVWRDTEIPIN 205 Query: 376 QRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LL+ S + +Y + + L Q + + K Sbjct: 206 PELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKL 245 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + K V G + Y E K D + T + Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTA 885 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ T Sbjct: 886 EALGFSDHMFTEARGSRL------------HKGVPQVLIVITDGESHDADKLNAT---AK 930 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + ++ V L +F ++S + + Sbjct: 931 ALRDKGILVLAVGI-AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 984 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 50/185 (27%), Gaps = 19/185 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + I V +G ++ + K + I Sbjct: 1010 DDFRKMKEFLASVIQDFDISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQI 1069 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++Q D + ++ LTDG++ + Sbjct: 1070 FGYTHIGAALRQVGHYFRPDMGSRIHAG------------TPQVLLVLTDGQSQD----- 1112 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + I I ++ I + Q L + V N D L V + I + Sbjct: 1113 EVAQAAEELRHKGIDIYSVGIGDVDDQQ--LVQITGTANKKLTVHNFDELKKVKKRIVRN 1170 Query: 409 MVHRK 413 + + Sbjct: 1171 ICSPR 1175 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 62/178 (34%), Gaps = 28/178 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + V +G++ ++ ++ + + T + D + I + T Sbjct: 637 MKTFMKNLVSKSQIRADRVQIGVVQFSDVNKEEFQLNRYTSQEEISDAIDRMAHIGETTL 696 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ Q + K +KF+I +TDGE + + Sbjct: 697 MGSALTFVSQYFSP--------------AKGARPNVRKFLILITDGEAQDIVKDP----- 737 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I I ++ ++ L+ PE + V N + ++I +V Sbjct: 738 AVALREEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENF----DILKHIEDDLV 788 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 60/202 (29%), Gaps = 28/202 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + LK +SI L + + +GL+ Y+ + S G K Sbjct: 235 NSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVNKSEVLQD 291 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + A+ ++ F Q + + + +T Sbjct: 292 IQSLAPQAGKAYTGAAL-----------RKIRKEVFSAQHGSRKNQGVPQIAVLVT---- 336 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL- 398 ++ + T + + + + TI + + + L+ S P Y + + L Sbjct: 337 HSPSQDNVTKAAVN-LRRQGVIVFTIGVEGASD--TQLEKIASHPAEQYVSQLRSFSDLA 393 Query: 399 ---IHVFQNISQLMVHRKYSVI 417 + + + H SVI Sbjct: 394 AHNQTFLKKLRNQITHT-VSVI 414 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 59/189 (31%), Gaps = 20/189 (10%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T+ + + + + + V +G + Y R + E S T K Sbjct: 439 GSGSTQATDFQEMKTFLSEVAGMFNIAPQKVRVGAVQYADRWDLEFEISKYTNKHDVRKA 498 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A+ +L K++ ++ LT+G + Sbjct: 499 IENIRQMGGNRNTGAALNFTLGLLQRAKQQRGG-------------RVPCHLVVLTNGAS 545 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S ++ E I + I + Q L+ + Y V + D+L + Sbjct: 546 RDSVSGP-----ANRLSEELIHVYAIGVR--EANQTQLREIAGEEKRVYYVHDFDALKDI 598 Query: 402 FQNISQLMV 410 + Q + Sbjct: 599 RNQVVQEIC 607 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + K V G + Y E K D + T + Sbjct: 826 DFMINLVKKADVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTA 885 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ T Sbjct: 886 EALGFSDHMFTEARGSRL------------HKGVPQVLIVITDGESHDADKLNAT---AK 930 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + ++ V L +F ++S + + Sbjct: 931 ALRDKGILVLAVGI-AGANPVELLAMAGSS-DKYFFVETFGGLKGIFSDVSASVCN 984 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 50/185 (27%), Gaps = 19/185 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + I V +G ++ + K + I Sbjct: 1010 DDFRKMKEFLASVIQDFDISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQI 1069 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++Q D + ++ LTDG++ + Sbjct: 1070 FGYTHIGAALRQVGHYFRPDMGSRIHAG------------TPQVLLVLTDGQSQD----- 1112 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + I I ++ I + Q L + V N D L V + I + Sbjct: 1113 EVAQAAEELRHKGIDIYSVGIGDVDDQQ--LVQITGTANKKLTVHNFDELKKVKKRIVRN 1170 Query: 409 MVHRK 413 + + Sbjct: 1171 ICSPR 1175 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 62/178 (34%), Gaps = 28/178 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + V +G++ ++ ++ + + T + D + I + T Sbjct: 637 MKTFMKNLVSKSQIRADRVQIGVVQFSDVNKEEFQLNRYTSQEEISDAIDRMAHIGETTL 696 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ Q + K +KF+I +TDGE + + Sbjct: 697 MGSALTFVSQYFSP--------------AKGARPNVRKFLILITDGEAQDIVKDP----- 737 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I I ++ ++ L+ PE + V N + ++I +V Sbjct: 738 AVALREEGIIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENF----DILKHIEDDLV 788 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 60/202 (29%), Gaps = 28/202 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + LK +SI L + + +GL+ Y+ + S G K Sbjct: 235 NSSWEDFDYLKE---FLEESISALDIKEHCMRVGLVAYSNETKVISTLSRGVNKSEVLQD 291 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + A+ ++ F Q + + + +T Sbjct: 292 IQSLAPQAGKAYTGAAL-----------RKIRKEVFSAQHGSRKNQGVPQIAVLVT---- 336 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL- 398 ++ + T + + + + TI + + + L+ S P Y + + L Sbjct: 337 HSPSQDNVTKAAVN-LRRQGVIVFTIGVEGASD--TQLEKIASHPAEQYVSQLRSFSDLA 393 Query: 399 ---IHVFQNISQLMVHRKYSVI 417 + + + H SVI Sbjct: 394 AHNQTFLKKLRNQITHT-VSVI 414 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 59/189 (31%), Gaps = 20/189 (10%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T+ + + + + + V +G + Y R + E S T K Sbjct: 439 GSGSTQATDFQEMKTFLSEVAGMFNIAPQKVRVGAVQYADRWDLEFEISKYTNKHDVRKA 498 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A+ +L K++ ++ LT+G + Sbjct: 499 IENIRQMGGNRNTGAALNFTLGLLQRAKQQRGG-------------RVPCHLVVLTNGAS 545 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S ++ E I + I + Q L+ + Y V + D+L + Sbjct: 546 RDSVSGP-----ANRLSEELIHVYAIGVR--EANQTQLREIAGEEKRVYYVHDFDALKDI 598 Query: 402 FQNISQLMV 410 + Q + Sbjct: 599 RNQVVQEIC 607 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 63/191 (32%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++ V S K Sbjct: 45 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSCVRTEFPLSMYHSKDEIKKAVMNVEY 101 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 102 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 149 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 150 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 197 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 198 SQIAENLKLNV 208 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/353 (9%), Positives = 93/353 (26%), Gaps = 45/353 (12%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ + ++ RA + R I K E + ++ +++ Sbjct: 454 TDWDATDYSKLANASCQVRAQWNTTVRQIDLEWKNYPYLSVKTNVTPETVAVNDTVTVTI 513 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + +L ++ + ++ E Sbjct: 514 QLIGDGWA---------LQPDPIDVMLTADRSGSMLRDYPDRMV-------SLMDALEDF 557 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 ++ D G + + ++ D ++ Sbjct: 558 GIEMKEGWDRLGLASFGTYGNADIIDYGNRYWAGYDNS-------------YYDDWEYIS 604 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E Y ++ + V + T + + + L + Sbjct: 605 EHYAGNDKNYNDYATIDLNLTEDFSDYNTEVKALV---PDGGTPMRKGLYYSIKHLRDNG 661 Query: 310 K----------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + N++ + S + + + N + + AK+ Sbjct: 662 RDDAVKAVVVLSDGDYNYYGDPLARGSGGTKWDWSDMQEKYYTFSDLNSSEQDMRIFAKD 721 Query: 360 NFIKIVTISI--NASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLM 409 N IKI +I+ S G+ +L+ + +Y + + L ++++I+ + Sbjct: 722 NDIKIFSIAYADGISSEGKAVLQALAEGTGGKYYYAPSGEDLEEIYEDIAGEL 774 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + + V G + Y+ + + + + M T + Sbjct: 834 INLTIHLVKKADVGRNRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + T N ++ +I +TDG++++ +T Sbjct: 894 AKALKHANALFTEKHGSRIKQN------------VKQVLIVITDGKSHDHDQLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + ++ I I + + Q+ L+ + V N D L VF + + M Sbjct: 939 SELRDKGITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 464 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGK 523 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 524 ALDYILQIIKNGTKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 565 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I A+ + L+ E N D+L + + + + K Sbjct: 566 LRAEQITVHAVGIGAANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICTEK 620 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + S + D SLI + T + A+ Sbjct: 655 LTKIQVGADKTQIGVVQFSDKTKEEFQLSRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 SQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDGSLVFHVENFDHLKA 792 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTL 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I ++LL + E D L Sbjct: 1120 RYDVAD-----AVRTLKDLGICVLVLGIG-DVYKEQLLPI-TGNSEKIITFQGFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVKKRIIREICQ 1184 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 64/207 (30%), Gaps = 21/207 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K+A LK A+ +++++ + ++ +++ + T +Q Sbjct: 243 SRSMSGPKLALLKRAMRFVIENLEPSD------RLSVVAFSSSACRLFPLRKMTAFGQQQ 296 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + +DSL T+ +++A +++ + R+ + + S + Sbjct: 297 SQQAVDSLVADGGTNIAEGLRKAARVVEDRQARNPVCSIILLSDGVDSHNLPPRDGSAPE 356 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINA----SPNGQRLLKTCVSSPEYHYNVVN 394 + E+ + I + + + + S + Sbjct: 357 PDYAPLVPRSILPGS-----EHHVPIHAFGLGMDHDHDHDSRAMHAVAQMSSGTFSFIDM 411 Query: 395 -ADSLIHVFQN-ISQLMVHRKYSVILK 419 S+ I L+ SV+ + Sbjct: 412 VGSSIQDALAQCIGGLLS---VSVVAQ 435 >gi|125602048|gb|EAZ41373.1| hypothetical protein OsJ_25890 [Oryza sativa Japonica Group] Length = 757 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 15/161 (9%), Positives = 49/161 (30%), Gaps = 7/161 (4%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 + ++ ++ + I + +E + ++SL T+ + +A ++ + Sbjct: 354 DRLAVVSFSYSARRVIRLTRMSEGGKASAKSAVESLHADGCTNILEGLVEAAKVFDGRRY 413 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 R+ + + G +N+ +V ++ + + + T Sbjct: 414 RNAVASVILLSDGQDNYNVNGG-----WGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFG 468 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + + + N + F I L+ Sbjct: 469 TDHDASAMHTIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 509 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 78/260 (30%), Gaps = 31/260 (11%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 N ++ + +P+ + L+VD SGSM+ M++D K + Sbjct: 155 NAENDYTLTLNVKGMYDSETTKPMIDVLLIVDKSGSMNWKMDTDK----------VGKPS 204 Query: 227 KMAALKNALLLFLDSIDL-LSHVKEDVYMGLIGYT----------TRVEKNIEPSWGTEK 275 +M LK + D + + D M ++ Y+ E E + + Sbjct: 205 RMDVLKQVVTGTGGLTDSIFGNTQIDAQMAVVTYSGSNDFLDQRYDDAEIIQEWTKQKDT 264 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 V V T+ ++ L ++ + F Sbjct: 265 VNNAVNNI---QAKGGTNCEAGLRTGATALEGSRENAKKFVIFLSDGDATFYYGDDGY-- 319 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIK-IVTISINASPNGQ---RLLKTCVSSPEYHYN 391 T G + I K ++ TI + +S + + L +S + Y Sbjct: 320 -TKGPGSGSSPTAREKAIAQVQKITGLEGFYTIGMTSSSSSEFLTNLANNSKASEKRFYP 378 Query: 392 VVNADSLIHVFQNISQLMVH 411 N ++L FQ I Sbjct: 379 ANNTEALEKAFQEIVGETTE 398 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++RV S K Sbjct: 469 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAEMNVEY 525 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 526 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 573 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 574 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 621 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 622 SQIAENLKLNV 632 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 43 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 101 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 102 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 147 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 148 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 187 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 78/220 (35%), Gaps = 44/220 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + I V+D SGSM D + + +++ ++L+ + Sbjct: 111 QKVNDIVFVIDTSGSMA---------------NTDPQNERFSSV----------LNLMDN 145 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQIL 305 + + + + ++ I + +E +++ ++ T+ A+ AY + Sbjct: 146 MNTQNRVSIYKFDDTSKRIIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAIDSAYNEI 205 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + +I L+DGE+N N + K++ I I Sbjct: 206 NS----------------TKRPGRKAAVILLSDGEDNFGL-NKKFDETLKPFKDSNISIY 248 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI ++ N L K + +YNV NA L F I Sbjct: 249 TIGMSNENNFTTLKKIAKDTHGEYYNVKNASDLKGTFSKI 288 >gi|218200417|gb|EEC82844.1| hypothetical protein OsI_27660 [Oryza sativa Indica Group] Length = 423 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 15/161 (9%), Positives = 49/161 (30%), Gaps = 7/161 (4%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 + ++ ++ + I + +E + ++SL T+ + +A ++ + Sbjct: 20 DRLAVVSFSYNARRVIRLTRMSEDGKASAKSAVESLHADGCTNILKGLVEAAKVFDGRRY 79 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 R+ + + G +N+ +V ++ + + + T Sbjct: 80 RNAVASVILLSDGQDNYNVNGG-----WGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFG 134 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + + + N + F I L+ Sbjct: 135 TDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQCIGGLLS 175 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 40/224 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D S SM + + K +K + F+ +++ Sbjct: 84 DIVLVIDSSDSMRQMGFDPKDPY----------KNKFDVVKEVVADFIKK-------RKN 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + ++ + + + K T A+ QAY +++ K + Sbjct: 127 DRIGMVTFADVAFIASPLTFEKDFLTNITEMQKLGMAGKRTAINDALVQAYNLMSKSKAK 186 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 K II LTDG +N K + ++ +K+ TI I Sbjct: 187 ------------------SKIIILLTDGRDNMSKIP--LSDVKHMIEKRDVKLYTIGIGG 226 Query: 371 -ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ-LMVHR 412 + Q L + Y +A L ++ I++ + Sbjct: 227 PRDYDAQYLKTLAKAGKGQAYAARSAAMLSKIYDEINKLEVTKL 270 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 65/220 (29%), Gaps = 57/220 (25%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ A K + F+ + +GL+ + + P+ + + Q V Sbjct: 109 NRLEAAKETIAKFI-------SARTSDRIGLVVFAGESFTMVPPTLDYQMILQRVNEISS 161 Query: 286 SLIL---KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T AM A L + R + +IF+TDGENN Sbjct: 162 ASSAKIKDGTALGVAMANAAGRLKDSQAR------------------SRVMIFMTDGENN 203 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP--------------------------NGQ 376 + + + AK IK+ +I I N Sbjct: 204 SGTID--PETGLEIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNED 261 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 L + + +Y +L VF +I + K V Sbjct: 262 LLGRMASDTGGKYYRATTEGALQKVFSDI-DTLEKTKIDV 300 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 70/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYQEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 NQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNAT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + +Y V L +F ++S + + Sbjct: 948 -AGANTWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 50/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKHYFRPDTGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + Q L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 57/174 (32%), Gaps = 20/174 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + +D+ + V +G + Y + E S T K D + T++ Sbjct: 457 FLSEVVDMFNIAPHKVRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGA 516 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +L ++ K ++ LT+G + + + K Sbjct: 517 ALNFTLTLL-------------QRAKKQRGNKVPCHLVVLTNGMSQDS-----VLGPAHK 558 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I++ I + Q L+ + Y V D+L + + Q + Sbjct: 559 LREENIRVHAIGVK--EANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEIC 610 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 23/173 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V +G++ ++ + + + + D + I + T + A+ Sbjct: 648 VSKSQIGADRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFV 707 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q + + K +KF+I +TDGE + + +++ Sbjct: 708 SQYFSPE--------------KGARPNVRKFLILITDGEAQDIVKDP-----AVALRKDG 748 Query: 362 IKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I ++ ++ L+ PE + V N D L H+ ++ + + Sbjct: 749 VIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 57/198 (28%), Gaps = 30/198 (15%) Query: 228 MAALKNALLLFLDSIDLL--------SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + N LD + + + +GL+ Y+ S G K Sbjct: 233 LDMAINGSQENLDHLKAFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTEVL 292 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 S + + A+++ + + S ++ + + + +T Sbjct: 293 QRIQDLSPHVGQAYTGAALRKTRKEVFSAQRG-----------SRKNQGVPQIAVLVT-- 339 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSL 398 + S+ N K + + + T+ + N ++L K E + N L Sbjct: 340 ---HRASDDNVTKAAVNLRREGVTVFTMGVE-GANPEQLEKIASYPAEQFTSKLSNFSEL 395 Query: 399 ----IHVFQNISQLMVHR 412 + + + H Sbjct: 396 ATHNQTFLKKLRNQITHT 413 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 70/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYQEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 NQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNAT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + +Y V L +F ++S + + Sbjct: 948 -AGANTWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 50/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKHYFRPDTGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + Q L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 57/174 (32%), Gaps = 20/174 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + +D+ + V +G + Y + E S T K D + T++ Sbjct: 457 FLSEVVDMFNIAPHKVRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGA 516 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +L ++ K ++ LT+G + + + K Sbjct: 517 ALNFTLTLL-------------QRAKKQRGNKVPCHLVVLTNGMSQDS-----VLGPAHK 558 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I++ I + Q L+ + Y V D+L + + Q + Sbjct: 559 LREENIRVHAIGVK--EANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEIC 610 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 23/173 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V +G++ ++ + + + + D + I + T + A+ Sbjct: 648 VSKSQIGADRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFV 707 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q + + K +KF+I +TDGE + + +++ Sbjct: 708 SQYFSPE--------------KGARPNVRKFLILITDGEAQDIVKDP-----AVALRKDG 748 Query: 362 IKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I ++ ++ L+ PE + V N D L H+ ++ + + Sbjct: 749 VIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 57/198 (28%), Gaps = 30/198 (15%) Query: 228 MAALKNALLLFLDSIDLL--------SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + N LD + + + +GL+ Y+ S G K Sbjct: 233 LDMAINGSQENLDHLKAFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTEVL 292 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 S + + A+++ + + S ++ + + + +T Sbjct: 293 QRIQDLSPHVGQAYTGAALRKTRKEVFSAQRG-----------SRKNQGVPQIAVLVT-- 339 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSL 398 + S+ N K + + + T+ + N ++L K E + N L Sbjct: 340 ---HRASDDNVTKAAVNLRREGVTVFTMGVE-GANPEQLEKIASYPAEQFTSKLSNFSEL 395 Query: 399 ----IHVFQNISQLMVHR 412 + + + H Sbjct: 396 ATHNQTFLKKLRNQITHT 413 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 72/245 (29%), Gaps = 24/245 (9%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM--HCAMNSDPEDVNSAPICQD 222 L + + + + L ER + +V+D+SGSM N + + Sbjct: 163 LGESTGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEG 222 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 R+KMA K+AL+ + + +G++ + + Sbjct: 223 DSRSKMAVAKDALVALTEQLHPDD------RVGVVLFNNEPTVAKPLRDVETTDMDAIRG 276 Query: 283 DM--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + D T+ M +A +L RQ I +TD Sbjct: 277 HIREDIEAGGGTNIADGMAEAADMLGEYADSDPTEAETRQ-------------IVITDAM 323 Query: 341 NNNFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + + E+ I + + N + + + + +V +A+ Sbjct: 324 PNTGQTDDQALQDRLAGYAEDGIHTSFVGVGVDFNPELVDEITAVRGANYRSVHSAEDFE 383 Query: 400 HVFQN 404 Sbjct: 384 TYLGE 388 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 66/221 (29%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKQVASEFIA-------GRPNDNIGLTIFAAEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + T + A L K + Sbjct: 148 HAVLLNLFHGIKTDMAQRGMIQDGTAVGMGIANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------- 373 K II LTDG NN +++ + + AK I++ TI + + Sbjct: 192 --SKVIILLTDGTNNAG--DISPLTAAEIAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQY 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y + L V+++I + Sbjct: 248 LNVPVEIDTKTLAAIAGKTDGEFYRATDNKKLEDVYKDIDK 288 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 35/365 (9%), Positives = 96/365 (26%), Gaps = 33/365 (9%) Query: 51 NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 ++ ++ + + F+ + D R + + Sbjct: 645 AKIQIGPDSTQIGVV----QFSDINQEEFQLNKYFTQNETSDAIDRMSLINRG------T 694 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY----RLQFIEHLLN 166 + L + + + + F + Sbjct: 695 LTGSALTFVGQYFTPTKGARTKVKKFLILITDGEAQDPVRDPAKALRDKGVVIFSVGVYG 754 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q + L ++ + + + ++ + Sbjct: 755 ANRTQLEEISGDSSLVFQVENFDDLKTVESKLVFRVCALHDCKNIKVLDIVFVLDHSGSI 814 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ + L + + + V +G + Y+ E S V +++ R Sbjct: 815 GTQEQESMMNLTIHLVKKADVDSDRVRVGALKYSDYPEVLFYLSGNKSAVIEHLRR--RR 872 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A++ A + T + N ++ +I +TDG +++ Sbjct: 873 YTSGHTYTARALEHANIMFTEEYGSRIQQN------------VKQMLIIITDGVSHD--- 917 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 N K + I I + + Q L+T + ++V N +L ++ + Sbjct: 918 RDNLSDTASKLRNKGINIYAVGVG--QANQLELETMAGNKSNTFHVDNFSNLKDIYLPLQ 975 Query: 407 QLMVH 411 + M Sbjct: 976 EKMCT 980 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 68/216 (31%), Gaps = 46/216 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLL 245 E I +VD SGS+ + F+ ++ + Sbjct: 612 EDMKADIMFLVDSSGSIGHDNFGK------------------------MKTFMKNLLAKI 647 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +G++ ++ ++ + + + D SLI + T + A+ Q Sbjct: 648 QIGPDSTQIGVVQFSDINQEEFQLNKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYF 707 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T K +KF+I +TDGE + + ++ + I Sbjct: 708 TPT--------------KGARTKVKKFLILITDGEAQDPVRDP-----AKALRDKGVVIF 748 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ + N + L+ + V N D L V Sbjct: 749 SVGV-YGAN-RTQLEEISGDSSLVFQVENFDDLKTV 782 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 58/180 (32%), Gaps = 24/180 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTD 293 L + S + V G + Y+ ++ S +R+ + T Sbjct: 452 QFMLAVTGMFSIGSDKVQAGAVQYSDKIRVEFYINASSNDMDLRKAILNIEQLQ--GNTH 509 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ I+ D+K +I LTDG++ + +K Sbjct: 510 TGKALDFMLSIIKKDRKHRI-------------SEIPCHLIVLTDGKSQD-----EVLKP 551 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ ++ I I + I + Q L+ E N DSL ++ + + K Sbjct: 552 AERLRDEQITIHAVGIGEADKIQ--LQQIAGEEERVNFGQNFDSLRNIKNEVVHRICTEK 609 >gi|41409532|ref|NP_962368.1| hypothetical protein MAP3434 [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570937|sp|Q73UD4|Y3434_MYCPA RecName: Full=UPF0353 protein MAP_3434 gi|41398363|gb|AAS05984.1| hypothetical protein MAP_3434 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 330 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 67/242 (27%), Gaps = 54/242 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R +I LV+D+S SM ++ A + A F + Sbjct: 89 PRNRAVIMLVIDMSQSMRATDVE---------------PNRLKAAEQAASQFASQLTPG- 132 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+G+ + P+ + + + T + A+ A ++ Sbjct: 133 -----INLGLVGFAGTPYLLVPPTP---QHQATIDALKKLDFADSTATGEAIFTALHAIS 184 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFI 362 + T I+ L+DG N + + AK+ + Sbjct: 185 ATAVAGGDTPP------------PARIVLLSDGGENKPSNPSDPHDGVYTAARLAKDEGV 232 Query: 363 KIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I TI+ + ++ S Y N L + I Sbjct: 233 PISTITFGTKGGEIEMDGQKVAVPVSTDQMKMVAKLSGGQSYTATNLGELQKSYNAIENE 292 Query: 409 MV 410 + Sbjct: 293 IG 294 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 76/227 (33%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+P+ I L++D+SGSM + T++ A++ ++ F+ Sbjct: 92 EKPMRNIMLILDVSGSMEKNDVAGGL-------------TRLQAVQQSVKKFVA------ 132 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + S + + +++ + + T A+ ++L Sbjct: 133 -ARKSDRIGLVIFANSAWPFAPVSEDKQALETRISQLTPGMAGQQTAIGDALGVTVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S K I LTDG + + A + +++ T Sbjct: 192 S----------------TGDKEASKLAILLTDGNDTASQLTPRL--AAQLAVSHHVQLHT 233 Query: 367 ISINASPNGQR------LLKTCV-SSPEYHYNVVN-ADSLIHVFQNI 405 I+ + LL+ + + N SL V++ I Sbjct: 234 IAFGDVNSSGDDKVDLNLLQDLARMTGGRSWTAENSGASLDAVWKEI 280 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 36/258 (13%), Positives = 76/258 (29%), Gaps = 38/258 (14%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + RP I V+D SGSM ++P + Sbjct: 384 GGYGRTAVEEALPVYMDVRNRELRPDLAIVFVIDKSGSMDACHCANP---DRGGPITSSS 440 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ K+A+ + +G++ + + G V Q + Sbjct: 441 ERKIDIAKDAVAQATALLSPQD------TVGVVTFDGAAFPTFVATRG-ATVEQVMDAVS 493 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 PT+ + +A ++L R K +I LTDG + Sbjct: 494 GVEPRGPTNIRAGLLRAEEMLQQVDARI------------------KHMILLTDGWGSGG 535 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-- 402 + I + +E I + ++ + + L + +Y + + +F Sbjct: 536 D----QLDIAARLREQGITLTVVAAGSG-SATYLQQLAAEGGGRYYPAADMADVPQIFVQ 590 Query: 403 ---QNISQLMVHRKYSVI 417 I +V + + + Sbjct: 591 ETITAIGNYIVEQPFVPV 608 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 77/252 (30%), Gaps = 25/252 (9%) Query: 158 LQFIEHL----LNQRYNQKIVSFIPALL----RIEMGERPIFLIELVVDLSGSMHCAMNS 209 L + + + + ++ P + + L +D+SGSM Sbjct: 30 LPRSQSMLQIHPLETEDGVLIKIDPPKEPELEDLRERNHVPLDLVLSIDVSGSMGAD--- 86 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 P + + + + +++A L+++D +G++ ++T + E Sbjct: 87 APVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDDHD------RLGIVTFSTSSKVVREL 140 Query: 270 SWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 ++ T + + + L T+ ++ + ++ K G Sbjct: 141 TYMTPANKAKILKQLDALQPLSMTNLWHGIRDGLSLFNNNLKAVNDRRNPGSGRVPA--- 197 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 ++ LTDG N+ N + + I T S L Sbjct: 198 ----LLVLTDGMPNHQCPNQGYVAKLRQWSTLPASIHTFGFGYSLRSGLLKSIAEVGGGN 253 Query: 389 HYNVVNADSLIH 400 + + +A + Sbjct: 254 YSFIPDAGMITT 265 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 78/192 (40%), Gaps = 35/192 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDM 284 ++ A KNA F+D +D +G++GY++ E + + T +Q++ + Sbjct: 85 NRINAAKNAAKGFIDLMD-----MTKHQVGIVGYSSVAETSSLPLTTDTAAAKQFIDPIV 139 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T++ A+ QA +L+S Q I+ +TDGE N+ Sbjct: 140 ---ASGGTETGYAIDQAITLLSS-----------------HRPEAQPVIVIMTDGEANSS 179 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNG------QRLLKTCVSSPEYHYNVVNADSL 398 ++ ++ AK+ I TI++ + LLK ++ +H+ V+ + L Sbjct: 180 QA---ALERAQAAKDAGIVFYTIALLGPNDNPDTSAPNELLKQMATTNSHHHFVLGSTGL 236 Query: 399 IHVFQNISQLMV 410 ++ I + Sbjct: 237 AEIYAAIVAEIG 248 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 50/416 (12%), Positives = 117/416 (28%), Gaps = 76/416 (18%) Query: 27 ALSVMSFLLLIGFLI----YVLDWHYKKNSMESANNA--------AILAGASKMVSNLSR 74 + L + G + + ++++A A A +A + LS Sbjct: 14 GFAAGLLLAVAGCGVSSKPESAAGSSTQGALQAAPQAQYEVQHADATMAKRAVHPMRLSA 73 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT-HMA 133 S + D+ + + + + +T ++ + ++ + Sbjct: 74 PMPAPISSRDSLVAGYRDEPREQYQKLPDNPIHSVAEAPVSTFSADVDTGAYANVRRLLN 133 Query: 134 NNRLDSSNNTIFYNMDV--MTSYDYRLQFIEH--------LLNQRYNQKIVSFIPALLRI 183 L + Y + + + I A Sbjct: 134 QGSLPPEGAVRLEELVNYFPYDYALPTDGSPFGVTTELAPSPWNPHTRLLRIGIKASD-R 192 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + E + +VD+SGSM +R + +K+ L L +D + Sbjct: 193 AVAELAPANLVFLVDVSGSM-------------------DRREGLPLVKSTLKLLVDQLR 233 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V L+ Y +EP+ G +K + T T ++ AYQ Sbjct: 234 DQDRVS------LVVYAGESRVVLEPTSGRDKAKIR-TAIDQLTAGGSTAGASGIQLAYQ 286 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFI 362 + I+ TDG+ N S+ + + + +++ + Sbjct: 287 MAQQGFIDQGIN----------------RILLATDGDFNVGVSDFDSLKAMAAEKRKSGV 330 Query: 363 KIVTISINASPNGQRLLKTCV-SSPEYHYNVVN--------ADSLIHVFQNISQLM 409 + T+ + L++ + + + N D L +++ + Sbjct: 331 SLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLREARKVLVDQLSSTLAVVAKDV 386 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 60/185 (32%), Gaps = 38/185 (20%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ A+K F++ ++ +GL + R ++ V + + Sbjct: 112 STRLEAVKRVARSFVEE-------RQGDRIGLALFANRAYVAAPLTFDLAAVGRAIEEAS 164 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A + +T + I+ L+DG++N Sbjct: 165 IGITGRSTAIADGLGLALKRVTESGA------------------ASRVIVLLSDGQDNAH 206 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-----------NGQRLLKTCVSSPEYHYNVV 393 + + ++ A + ++I TI++ + L +S Y V Sbjct: 207 QID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAIAEASGGRSYRVR 264 Query: 394 NADSL 398 + L Sbjct: 265 GMEDL 269 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 69/210 (32%), Gaps = 43/210 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + L+VDLSGSM + KK +++ A+K L FL Sbjct: 90 PTRDLLLLVDLSGSMAQEDFQND---------AGKKVSRLDAVKEVLDGFLAK------- 133 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ +GL+ + + + ++ + + T A+ + Sbjct: 134 RKGDRVGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVNLFDE- 192 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K II LTDG N+ KS V ++ A + IKI T++ Sbjct: 193 -----------------DTERAKTIIALTDG--NDTKSKVPPVEAARVATQRDIKIYTVA 233 Query: 369 INASP-------NGQRLLKTCVSSPEYHYN 391 I + Q L + ++ Sbjct: 234 IGDPTTVGEDKLDEQSLKDVASETGGKYFF 263 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 76/253 (30%), Gaps = 54/253 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 IV + + R I + VD+S SM +D ++ A Sbjct: 68 LIVGAARPAGEVRV-PRDRATIIIAVDVSLSMEA---------------RDVAPNRLIAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A F+ + E +G++ + I P+ V + Sbjct: 112 KEAAQQFVRDL------PERFNVGVVAFARTAAVVISPTTDHAAVTNAIAGLT---TRPG 162 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ A + S + + + P I+ L+DG+N + + I Sbjct: 163 TSIGEAVFNALDSIRSFDREA------------ATDPPPAAIVLLSDGDNTSGRPVSEAI 210 Query: 352 KICDKAKENFIKIVTISINASP--------------NGQRLLKTCVSSPEYHYNVVNADS 397 + AK + + TI+ N L + + Y +A Sbjct: 211 EAAANAK---VPVSTIAYGTPDGYVMIDNRPVQVPVNKAALQELSEGTGGRAYTAESASE 267 Query: 398 LIHVFQNISQLMV 410 L V+Q I + Sbjct: 268 LREVYQQIGTSLG 280 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 24/221 (10%), Positives = 68/221 (30%), Gaps = 55/221 (24%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + +M A K + F+ ++ +GL+ ++ + + + +++ Sbjct: 108 RSLNRMEAAKKVIRDFMKE-------RKYDRIGLVIFSGLAFTQCPLTTDKDSLAEFINN 160 Query: 283 DMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T A+ + L + + + II +TDG N Sbjct: 161 INIGDTGLDGTAIGSAIMTSVNRLKDSRAK------------------SRIIILVTDGNN 202 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINA----------------------SPNGQRLL 379 N + + + A+ IKI + + + + +L Sbjct: 203 NMGEID--PLTASKIARSYDIKIYAVGVGSLDGAIYEVDDPFLGKREIKYRKDAINESVL 260 Query: 380 KTCV-SSPEYHYNVVNADSLIHVFQNISQ----LMVHRKYS 415 K ++ ++ + S ++ + I + + +++ Sbjct: 261 KEVAYNTSGGYFRAQDVKSFENIMKQIDKLEKDDIEVMRFT 301 >gi|118401451|ref|XP_001033046.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89287392|gb|EAR85383.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 680 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 81/245 (33%), Gaps = 31/245 (12%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + N+ +S IP + R I VVD+SGSM NS+ +S C Sbjct: 20 KDNKLRISLIPPA---SIKRRTNSNICCVVDVSGSMGTEANSNSSVSSSENYCLSI---- 72 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDS 286 + +++AL + ++++ + L+ +++ + + + Sbjct: 73 LDIVQHALKMIVNTLTPDDD------LSLVVFSSMAIEVFDTLRMDDANKILAIDKIEKL 126 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ ++ IL+ K ++ + LTDG+ ++ Sbjct: 127 EASGSTNLQHGIQVGLNILSKSKSQNRNQA----------------MYVLTDGQPDDRNV 170 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNI 405 K + I T +S + + L + + + +A + F + Sbjct: 171 MQFLKKYKKDNPQLRCTISTFGFGSSCDSELLDEIAREYNGMYSFIPDATLIATAFANAL 230 Query: 406 SQLMV 410 + + Sbjct: 231 ANTLT 235 >gi|327286711|ref|XP_003228073.1| PREDICTED: cartilage matrix protein-like [Anolis carolinensis] Length = 565 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 44/370 (11%), Positives = 107/370 (28%), Gaps = 40/370 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + + + R I + + + Sbjct: 173 ASLLQAVRRIEPLSTGTMTGLAIQFAINRAFSEGEGARVRVPEINKVAIIVTDGRPQDAV 232 Query: 119 QNIV-NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 +++ + + + A +T+ + ++E K Sbjct: 233 KDVAARARALGIEIFAIGVGRVDMHTLRLIASPPLEEH--VDYVESYSVIEKLTKKFQEA 290 Query: 178 --PALLRIEMGERPIFLIELVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRTKMAALKNA 234 G+ + + S S C + D + C T + L + Sbjct: 291 FCAVSDLCATGDHDCAQVCVSAGASYSCACREGFTLNSDGKTCNACSGGAPTDLVFLIDG 350 Query: 235 -----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + F++ I D L + ++GL+ Y++ V + +K Sbjct: 351 SKSVRPENFELVKKFINQIVDSLDVSDRNAHVGLVQYSSSVRQEFPLGQYKDKKDIKAAV 410 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 S + K T + A++ + K I TDG + Sbjct: 411 RRMSYMEKGTMTGQALQYLVDSSFAISSG-------------ARPGVPKVAIVFTDGRSQ 457 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 ++ ++ KAKE K+ + + + LK S P E+++ + ++ Sbjct: 458 DYINDS-----AKKAKELGYKMFAVGVG--NAVEDELKEIASEPVAEHYFYTADFRTINQ 510 Query: 401 VFQNISQLMV 410 + + + + Sbjct: 511 IGKKLQNKIC 520 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 58/184 (31%), Gaps = 22/184 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + + +K L ++S+ +G++ Y + V+ K Sbjct: 119 SSRSVRPNEFEQVKVFLSQVIESL---DVGPNATRVGVVNYASAVKSEFSLKTHRTKASL 175 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + A++ A S +G ++ K I +TD Sbjct: 176 LQAVRRIEPLSTGTMTGLAIQFAINRAFS----------EGEGARVRVPEINKVAIIVTD 225 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + + + +A+ I+I I + + L+ S P E+ V + Sbjct: 226 GRPQDAVKD-----VAARARALGIEIFAIGVGR-VDMHT-LRLIASPPLEEHVDYVESYS 278 Query: 397 SLIH 400 + Sbjct: 279 VIEK 282 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/333 (9%), Positives = 82/333 (24%), Gaps = 24/333 (7%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + + + + + S L + Sbjct: 675 FQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGSRPSVRKFLILITDGE 734 Query: 145 FYNMDVMTSYDY----RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++ + + + + Q + + I + DL Sbjct: 735 AQDIVKEPAVALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQ-RIEDDLV 793 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL--LLFLDSIDLLSHVKEDVYMGLIG 258 + + + + + +D + V G + Sbjct: 794 FGICSPREECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIDIVRKADVGMNRVRFGALK 853 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 Y E K D + T + A+ + + T + Sbjct: 854 YADDPEVLFYLDNFGTKPEVISVLQTDQAMGGNTYTAEALGFSDHMFTEARGSRL----- 908 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + +I +TDG++++ T ++ I ++ + I A N L Sbjct: 909 -------NKGVPQVLIVITDGDSHDADKLNAT---AKALRDKGILVLAVGI-ADANPVEL 957 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L SS + ++ V L + +++ + + Sbjct: 958 LAMAGSS-DKYFFVETFGGLKGIISDVTASVCN 989 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 67/219 (30%), Gaps = 21/219 (9%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA--LKNALLLFLDSIDLLSHVKEDVY 253 + D++ S+ + D E + T + K + + + V Sbjct: 980 ISDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQTDFKKMKEFMVSVVQDFDVSNKRVR 1039 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G ++ + + + I T A+++ D Sbjct: 1040 IGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNTHIGAALRKVEHYFRPDMGSRI 1099 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + + ++ LTDG++ + + + + I I ++ I Sbjct: 1100 ------------NTGTPQVLLVLTDGQSQD-----EVAQAAEALRRRGIDIYSVGIG-DV 1141 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + Q+L++ + E V N D L V + I + + Sbjct: 1142 DDQQLMQI-TGTAEKKLTVHNFDELKKVNKRIVRNICTT 1179 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 58/174 (33%), Gaps = 20/174 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + + + V +G + Y + E + + K + + T++ Sbjct: 459 FLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQLGGNTNTGA 518 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +L K K ++ LT+G + + ++ ++ Sbjct: 519 ALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPANR 560 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I++ I I Q L+ + Y V + D+L + + Q + Sbjct: 561 LREEHIRVYAIGIK--EANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEIC 612 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 642 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 699 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 700 TLTGSALSFVSQYFSPTKGS--------------RPSVRKFLILITDGEAQD-----IVK 740 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 741 EPAVALRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 797 Query: 411 HRK 413 + Sbjct: 798 SPR 800 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/343 (10%), Positives = 85/343 (24%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + + + + + + Sbjct: 110 KALQEAHRTYFSATTNGRDKKQFPP-ILVVLASSESEDDVEKASKALQKDGVKIISVGVQ 168 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q Q I G +E Sbjct: 169 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTQIIKDVAKYKE----GAVDDIFVEA 222 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 S + + + + + LK +S+ L + + +G Sbjct: 223 CQGPSMADVVFL---------LDMSINGSDENLDYLK---GFLEESVSALDIKENCMRIG 270 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 271 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 319 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 320 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIKGASD- 373 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 374 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 415 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 82/257 (31%), Gaps = 58/257 (22%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V F + + R + + VD+S SM + D + ++ A Sbjct: 68 LVVGFARPTAEVRV-PRERATVMVAVDVSTSM---------------LAGDVEPDRLTAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 K A F+D + ++ +GL+ + + P E + + + R ++ + Sbjct: 112 KEAARRFVDGL------PDEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRLVEGATGVQ 165 Query: 291 PTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ A + L + + I+ L+DG N + Sbjct: 166 GTAIGEAINTSLGAVKALDGEAAKDPP---------------PARIVLLSDGANTSGMD- 209 Query: 348 VNTIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVV 393 ++ A + + TI+ +GQ L + + + Sbjct: 210 --PMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDEVARETGGQFHEAD 267 Query: 394 NADSLIHVFQNISQLMV 410 +A L V+ +I + Sbjct: 268 SAKELRAVYDDIGSSVG 284 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 108/365 (29%), Gaps = 64/365 (17%) Query: 60 AIL--AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN-- 115 A L A ++ L+ +R++ + +++ + + F Sbjct: 48 AALSTAAVAEENLPLAENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVRRFLTNG 107 Query: 116 -TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++ V I N L + N + + + S + K++ Sbjct: 108 EQPPKDAVRIEEIVNYFPYNYPLPTDNRPFAVHTETIDSP------------WQPEAKLI 155 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + P + +VD+SGSM K+ ++ Sbjct: 156 KIGIQAQDTAKKDLPPANLVFLVDVSGSMDEE-------------------NKLPLVQKT 196 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + + + LI Y + + + P+ G +K ++ TD Sbjct: 197 LRILTQQLRPQD------KVTLITYASGEDLVLPPTSGADK-ETILSAIDKLRAGGATDG 249 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKI 353 A++ AY+ + I+ TDG+ N S+ T + Sbjct: 250 ESALQMAYEQAQKAFVPNGIN----------------RILLATDGDFNVGVSDTETLKSM 293 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +++ + + T+ + +++ + + + N + + Q + Sbjct: 294 VAEKRKSGVSLSTLGFGMGNYNEDMMEQIADAGDGNYSYIDNE---KEAKKVLQQQLTST 350 Query: 413 KYSVI 417 +V Sbjct: 351 LATVA 355 >gi|125540180|gb|EAY86575.1| hypothetical protein OsI_07955 [Oryza sativa Indica Group] Length = 477 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 14/151 (9%), Positives = 45/151 (29%), Gaps = 7/151 (4%) Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + I + ++ + ++SL T+ + +A ++ + R+ + Sbjct: 158 QYRRVIRLTRMSDDGKASAKSAVESLAAGGGTNILKGLVEAAKVFDGRRYRNAVASVILL 217 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + G +N+ +V ++ + + + T + + Sbjct: 218 SDGQDTYNVNSG-----WGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDAAAMHA 272 Query: 381 TCVSSPEYHYNVVNADSLIHVFQN-ISQLMV 410 + + N + F I L+ Sbjct: 273 IAEETGGTFSFIENQAVVQDAFAQCIGGLLS 303 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 67/212 (31%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D ++ A K F+D + + +GL+ + + S Sbjct: 94 DVSNSMAATDIAPNRLEAAKQGAEAFVDQLPP------RINLGLVSFAGSATVLVPASTD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E VR + T + + Q + + KR Sbjct: 148 RESVRAGIRGLQLGPA---TAIGEGIFASLQAINTAGKRFSDAGQSPPPAA--------- 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DGE + N + D A++ + + TI+ N Q L Sbjct: 196 IVLLSDGETTRGRPNT---QATDAARQAHVPVDTIAYGTSDGTLDVGGQEVPVPVNEQAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + ++ D L V++ + + Sbjct: 253 NEIADQTEGSYHRAATGDELRSVYKGLGSSIG 284 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 72/231 (31%), Gaps = 42/231 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + +D K ++ A + A F++S+ +V +G++ + P+ Sbjct: 93 DVSGSMRAEDVKPNRLVAAQEAARAFVESL------PREVRVGVVSFAGTAAVVQAPTTS 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI-------- 324 + V + R T + + + D ++ + Sbjct: 147 RDDVFAAIERFQLQR---GTAIGSGIVLSLATIFPDAGIDIQQITGQRTMPRMLGDPEKK 203 Query: 325 -------PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---- 373 P +I LTDG+ I A + I++ T+ + + Sbjct: 204 AEFTPVPPGSYASAAMILLTDGQRTTGPD---PIDAAKMAADRGIRVYTVGVGTTQGEII 260 Query: 374 -----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + L + + ++ A+ L V+Q + MV + Sbjct: 261 GFEGWSMRVRLDEDTLRQIAQMTTGEYFYAGTAEDLKKVYQRLGSRMVVER 311 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 40/276 (14%), Positives = 79/276 (28%), Gaps = 52/276 (18%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 S I T+ + + + P+ + Sbjct: 29 TGAVSAQAIAEPAVSKTASPALINIAGS--GVNEETTVTIEVTGAGSTSTSAVPM-DVVF 85 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D SGSM S K A F+D +D G Sbjct: 86 AIDSSGSMQSNDPSGLR-------------------KTAAKSFVDKMDSSRDTA-----G 121 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + ++ ++ + V+ + S T+ +++A IL ++ + Sbjct: 122 VVSWDDSIDFSLPLTNDFPLVKTNIDSVDSS---GSTNLNVGLEEAIDILDANPRTENSV 178 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 IIFLTDG+ +A + I +I + N Sbjct: 179 E---------------VIIFLTDGQGTY------LHSTAQEAADKGYVIYSIGLG-GVNP 216 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L ++ +Y+ +A SL +F +I + Sbjct: 217 TPLQDMATTTGGAYYSSPDATSLQAIFDDIFSEVTT 252 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 24/212 (11%), Positives = 66/212 (31%), Gaps = 33/212 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D ++ A K A F+D + V +G++ + + P+ Sbjct: 93 DTSLSMEATDVAPNRLDAAKEAAQSFVDDLTPG------VNLGIVSFAGIATVLVSPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +Q + + T + A+ + Q + K + P Sbjct: 147 RTVAKQAIDGLTL---DERTATGEAIISSLQTIELFSKTLP-------PDGTDTGPPPAR 196 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--------------NGQRL 378 I+ +TDG+ ++ + + + + I+ + + + Sbjct: 197 IVLMTDGKRTVGRTEQDAAQRAAD---AGVPVSVIAFGTDNGSITVNDEVIPVPLDTEAM 253 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + S + +A+ L ++ + + + Sbjct: 254 QQIAQISGGDFHQAASAEELKSIYAQLGEQIG 285 >gi|226290246|gb|EEH45730.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 757 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 65/216 (30%), Gaps = 28/216 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM + + + + K+A +++++ Sbjct: 75 DIVLCIDVSGSMQLSAPLPTTNESGKREETGLSV--LDLTKHAARTIIETLNEND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++ E + S + ++ + L T+ +K +L Sbjct: 128 -RLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLNVL----- 181 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVT 366 G + + LTDG+ N I ++ K+ I T Sbjct: 182 ----------GEVDLRPRNVQALYVLTDGQPNHMCPTQGYVPKLRPILERQKDRLPLIHT 231 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 232 FGFGYDIRSGLLQSIAEVGGGTYSFIPDAGMIGTVF 267 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 74/225 (32%), Gaps = 30/225 (13%) Query: 195 LVVDLSG-SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V+ +G S S+ + + + +K + +D + + Sbjct: 476 HVLQDNGKSCGTCRTSNIDLAPLIDGSKSVRPQNFELVKQFVNQVVDQL---DVSAKGTR 532 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL+ Y++RV S K + K T + A+K + Sbjct: 533 VGLVQYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVE---------- 582 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + +G + + + TDG + + + KAKE I + + + Sbjct: 583 NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-----QEWAKKAKEAGITMYAVGVG--K 635 Query: 374 NGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + L+ S P ++ + + F ISQ+ + K +V Sbjct: 636 AVEDELREIASDPVEKHFFYSAD-------FTAISQIAENLKLNV 673 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 56/166 (33%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G + Y+++V+ K Q V + + + T + Sbjct: 43 KFMIDIIHELDIGLAATRIGAVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 101 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 102 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 147 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 148 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 187 >gi|302796872|ref|XP_002980197.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] gi|300151813|gb|EFJ18457.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] Length = 542 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 67/201 (33%), Gaps = 24/201 (11%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 + K+ +K A+ + ++ ++ + ++ ++ + ++ T + Sbjct: 90 SGSMRGQKLELVKTAMEFVIRNL------RQQDRLAIVSFSDEPKVHLGLKRMTYDGREA 143 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ L T+ P +K + +L+ + R+ ++ + + K L Sbjct: 144 ALSAVEKLRTLGGTEIRPGLKAGFDLLSRRRNRNPVSSIMLLSDGMDNAITFKRCKVLP- 202 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + C + + T + + + +L ++ V ++ Sbjct: 203 -------VDSYLED-C----SERVPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTV 250 Query: 399 IHVFQNISQLMVHRKYSVILK 419 H F +Q + SV+ + Sbjct: 251 QHAF---AQCIGGL-LSVVAQ 267 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 58/207 (28%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + S + V + + K+ +K A+ + + + + ++ + Sbjct: 51 TGGDSTSDRSGLDLVAVLDVSGSMQGEKIDKMKTAMKFVVKKLSSID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + T+ + + + T+ T ++ ++L S Sbjct: 105 MDTATRICPLRQVTDASQPELLGLIDALNPGGNTNITDGLQTGLKVLADRNLSSGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ ++DG+ N + K + T A + L Sbjct: 164 --------------VMLMSDGQQNRGGN-------AADVKIGNAPVYTFGFGADYDPTVL 202 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S V + D L F Sbjct: 203 NAVARNSMGGTFSVVNDVDKLSMAFSQ 229 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 76/243 (31%), Gaps = 44/243 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 R + V+D+SGSM K+ +K A+ Sbjct: 70 KDDSLAAADARAPVDLVTVLDVSGSMR--------------------GQKLELVKTAMEF 109 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTP 296 + ++ ++ + ++ ++ + ++ T R ++ L T+ P Sbjct: 110 VIRNL------RQQDRLAIVSFSDEPKVHLGLKRMTHDGRAAALSAVEKLRSLGGTEIRP 163 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 +K + +L+ K R+ ++ + + K L + C Sbjct: 164 GLKAGFDLLSRRKNRNPVSSIMLLSDGMDNAITFKRCKVLP--------VDSYLED-C-- 212 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + T + + + +L ++ V ++ H F +Q + SV Sbjct: 213 --SERVPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAF---AQCIGGL-LSV 266 Query: 417 ILK 419 + + Sbjct: 267 VAQ 269 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 58/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + K V G + Y E K+ D + T + Sbjct: 829 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTA 888 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ + T Sbjct: 889 EALAFSDHMFTEARGSRL------------HKGVPQVLIVITDGESHDAEKLNTT---AK 933 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + +Y V L +F ++S + + Sbjct: 934 ALRDKGILVLAVGI-AGANSWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKYYFQPDMGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 516 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V D+L ++ + Q + Sbjct: 558 KLREENIRVHAIGV--KEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEIC 610 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 640 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 699 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 700 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDP----- 740 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 741 AIALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 797 Query: 413 K 413 + Sbjct: 798 R 798 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 237 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 293 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 294 RIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 339 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 340 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 394 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 395 LATHNQTFLKKLRNQITHT 413 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 66/221 (29%), Gaps = 53/221 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K ++ A K F+ + + +GL + + Sbjct: 95 DVSTSMLAEDLKPNRLEAAKQVASEFIA-------GRPNDNIGLTIFAAEAFTQCPMTTD 147 Query: 273 TEKVRQYVTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + T + A L K + Sbjct: 148 HAVLLNLFHGIKTDMAQRGMIQDGTAVGMGIANAVSRLKDSKAK---------------- 191 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------- 373 K II LTDG NN +++ + + AK I++ TI + + Sbjct: 192 --SKVIILLTDGTNNAG--DISPLTAAEIAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQY 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + L + Y + L V+++I + Sbjct: 248 LNVPVEIDTKTLAAIAGKTDGEFYRATDNKKLEDVYKDIDK 288 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 24/159 (15%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + G +GY + + + + V + R T++ ++ L + Sbjct: 567 DSERAGRVGYASGANLDQPLTTDHDAVNSSLERLS---ASGGTNTRAGLRVGLNHLEEEG 623 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I L+DG KS + + + + A E ++I T+ + Sbjct: 624 WENRSA----------------VMILLSDG-----KSGSDPLPVAEDAAEAGVEISTVGL 662 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + N L + + Y+V + L F+ +++ Sbjct: 663 GNNINENELREIAAITGGDFYHVEREEDLPDTFERVAEN 701 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 70/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYQEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 NQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNAT---AKALRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + +Y V L +F ++S + + Sbjct: 948 -AGANTWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 987 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 50/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + V +G+ ++ T + + I Sbjct: 1013 DDFQKMKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQI 1072 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1073 FGYTHIGDALRKVKHYFRPDTGSRINAG------------TPQVLLVLTDGRSQD----- 1115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + Q L + E V N D L V + I + Sbjct: 1116 EVAQAAEELRHKGVDIYSVGIGDVDDQQ--LIQITGTAEKKLTVHNFDELKKVKKRIVRN 1173 Query: 409 MVH 411 + Sbjct: 1174 ICT 1176 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 57/174 (32%), Gaps = 20/174 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + +D+ + V +G + Y + E S T K D + T++ Sbjct: 457 FLSEVVDMFNIAPHKVRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGA 516 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +L ++ K ++ LT+G + + + K Sbjct: 517 ALNFTLTLL-------------QRAKKQRGNKVPCHLVVLTNGMSQDS-----VLGPAHK 558 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +E I++ I + Q L+ + Y V D+L + + Q + Sbjct: 559 LREENIRVHAIGV--KEANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEIC 610 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 23/173 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V +G++ ++ + + + + D + I + T + A+ Sbjct: 648 VSKSQIGADRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFV 707 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q + + K +KF+I +TDGE + + +++ Sbjct: 708 SQYFSPE--------------KGARPNVRKFLILITDGEAQDIVKDP-----AVALRKDG 748 Query: 362 IKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I ++ ++ L+ PE + V N D L H+ ++ + + Sbjct: 749 VIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 57/198 (28%), Gaps = 30/198 (15%) Query: 228 MAALKNALLLFLDSIDLL--------SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + N LD + + + +GL+ Y+ S G K Sbjct: 233 LDMAINGSQENLDHLKAFLGESVSALDIKENCMRVGLVAYSNETRVISSLSMGVNKTEVL 292 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 S + + A+++ + + S ++ + + + +T Sbjct: 293 QRIQDLSPHVGQAYTGAALRKTRKEVFSAQRG-----------SRKNQGVPQIAVLVT-- 339 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSL 398 + S+ N K + + + T+ + N ++L K E + N L Sbjct: 340 ---HRASDDNVTKAAVNLRREGVTVFTMGVE-GANPEQLEKIASYPAEQFTSKLSNFSEL 395 Query: 399 ----IHVFQNISQLMVHR 412 + + + H Sbjct: 396 ATHNQTFLKKLRNQITHT 413 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 46/372 (12%), Positives = 108/372 (29%), Gaps = 42/372 (11%) Query: 59 AAILAGAS---KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 AA+L + + +IS A + ++ Sbjct: 100 AALLQAVRRIQPLSTGTMTGLAIQFAISRAFSAAEGGRGSAPNFKKVAIVVTDGRPQDGV 159 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ ++ I + + R+D + + DY + Q+ Sbjct: 160 QDVSARARAAGIEIFAIGVGRVDMGTLRQMASEPLDEHVDYVESYSVIEKLTHKFQEAFC 219 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAM-NSDPEDVNSAPICQDKKRTKMAAL--- 231 + G+ + + + C S D + C + + + Sbjct: 220 VV--SDLCATGDHDCEQVCVSTPGAYRCACRDGFSLNNDGKTCTACNGGLGSALDLVFLI 277 Query: 232 ---KNA-------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ + F++ I D L + +GL+ Y++ V + K Sbjct: 278 DGSKSVRPENFELVKKFINQIVDSLEVSDKQAQVGLVQYSSSVRQEFPLGQFKSKKDIKA 337 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + K T + A+K S K I TDG Sbjct: 338 AVKKMSYMEKGTMTGQALKYLVDSSFSAING-------------ARPGVPKVGIVFTDGR 384 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + ++ S+ KAK++ ++ + + + L+ S P E+++ + ++ Sbjct: 385 SQDYISD-----AAKKAKDSGFRMFAVGVG--NAVEDELREIASEPVAEHYFYTADFRTI 437 Query: 399 IHVFQNISQLMV 410 + + + + Sbjct: 438 SKIGKKLQMKIC 449 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 77.2 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 72/226 (31%), Gaps = 43/226 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM + + K+ K+A +D ++ Sbjct: 399 NKEKERNVAVVLVIDHSGSMGESNLGN--------------INKLEIAKSAAAKMIDHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + + K + I T P + +A Sbjct: 445 SSDS------VGVIAFDHNFYWASKFGKLKSKNE-VIENISGIQIGGGTAIIPPLTEAVN 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K + K I+ LTDG AK N IK Sbjct: 498 TLRKSKAKD------------------KVIVLLTDGYGEEGGYEYP----ASIAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TI + +S N L + Y V +A +LI VF ++++ Sbjct: 536 ITTIGVGSSINAPILSWMAAYTSGRFYYVKDASNLIDVFLKEAKII 581 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 69/240 (28%), Gaps = 59/240 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E + L +D+S SM S +++ A + F+ Sbjct: 94 AEARGIDVMLALDISESMLQKDGSGK--------------SRLDAAREVARKFV------ 133 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQI 304 + +GL+ + + + + + + +I + T A+ A Sbjct: 134 -LRRSSDRIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQVIQDEGTAIGSAILIATNR 192 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 S QK II +TDGENN +V A +N I+I Sbjct: 193 F------------------KGSTSLQKVIILITDGENNTG--DVGPATAATLAAQNGIRI 232 Query: 365 VTI--SINASP---------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + L ++ ++ + L + + I + Sbjct: 233 YVVNAGFKSGGSAGNLSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVLDNTIKTIGR 292 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 66/217 (30%), Gaps = 51/217 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D + ++ ALK + F++ + + +GL+ Y + Sbjct: 99 DISASMLARDLEPNRLEALKKVAINFIE-------GRPNDRIGLVIYAGESYTKTPLTTD 151 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + S + T + + L S K Sbjct: 152 KSIIFNAINDLEYSQNIEGGTAIGMGLATSVNKL------------------KDSKAESK 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II LTDGENN + A E IK TI + ++ Sbjct: 194 VIILLTDGENNAGFID--PKTATQLATEYDIKTYTIGVGSNGMALSPVGIKANGQFEYRN 251 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L S+ ++ + +++ I Sbjct: 252 IEVKIDEALLKTIAESNGGKYFRATDNQKFEAIYEEI 288 >gi|297471458|ref|XP_002685231.1| PREDICTED: collagen type VI alpha 4-like [Bos taurus] gi|296490811|gb|DAA32924.1| collagen type VI alpha 4-like [Bos taurus] Length = 780 Score = 76.8 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 63/198 (31%), Gaps = 26/198 (13%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 D + A+ +F++ I + + V G++ Y+ V S + Sbjct: 169 DIYFLIDGSGSTHQDDFLAMKVFMNEVIKMFHVGPDRVQFGVVQYSDEVSPQFTLSQHSS 228 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V D T A+ Q+ + + ++I Sbjct: 229 VAGLEVAVDSIQQKGGGTKMGEALGSMIQVFADSARSN----------------VPWYLI 272 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +TDG++ + ++ + + + + I + + N L + + + V + Sbjct: 273 VVTDGQSMDPVAD-----AAEALRGHGVTIYAVGV-RDANIAELQEIA---EDRMFFVHD 323 Query: 395 ADSLIHVFQNISQLMVHR 412 +SL + Q + Q + Sbjct: 324 FESLKTIQQEVVQDICSL 341 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 22/177 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + +++ +GL+ +++ ++ + + KV + I T Sbjct: 369 MKEFMKRMVNQSNIGADEIQIGLLQFSSDPQEEFRLNRYSSKVDVHRAISDVKQINGGTY 428 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ S + +++I +TDG + + + Sbjct: 429 TGKALNFTLPFFGSSRGG--------------RPSVHQYLIVVTDGVSRDNVALP----- 469 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ I I I + Q L + Y +SL ++ + I + Sbjct: 470 AKALRDRNIIIFAIGVGEVKFSQLL--EITNDQSKVYYEEKFESLQNLEKEILYQVC 524 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 76/220 (34%), Gaps = 44/220 (20%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + + + +D + ++ A K A FLD + + +G+ ++ + P+ Sbjct: 91 VTDHSRSMLAEDVEPDRITAAKRAASRFLDQLPPG------IRVGVTTFSDVPDGTQTPT 144 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + +R+ + + T + A++ A L + ++ Sbjct: 145 YDHDLIRRTIEAQI---ADGGTATGDALQVALDTL--------------ERLEQNGERTP 187 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN------------------AS 372 ++ L+DG + V + +A+ I I T+++ + Sbjct: 188 AAMVLLSDGATTTGRDPVMVARAAGEAR---IPIYTVALGTRDATVPNPGPTGPPLLPVA 244 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 P+ + L +S + + L +++ + + R Sbjct: 245 PDPETLQAIADASGGRAFQAQDDQELSSIYETLGSRLGTR 284 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 43/148 (29%), Gaps = 5/148 (3%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + ++IF L+ + + L+G I + ++SA +A ++AG + + + Sbjct: 22 RLRHESDGAVAVIFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNALKLVV 81 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S + + A + + + + E + + Sbjct: 82 SSTDSIVGLTTQTIQTEAKAPADVPVTVQVT-----VAPDKTSVEARAEQVIKLTFGAFV 136 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + S + M L Sbjct: 137 GMASMPISARAKASVVGRMRLCMLALDP 164 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 74/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + + R ++ LV+D+S SM +MAA + Sbjct: 80 TIAMAGPTNDVRI-PRNRAVVMLVIDVSQSMRATDVE---------------PNRMAAAQ 123 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F + + +GLI Y + P+ + + + + T Sbjct: 124 EAAKQFAGELTPG------INLGLIAYAGTATVLVSPTTNRYATKNALDKLQFADR---T 174 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ A Q + + I+ +DG+ + N Sbjct: 175 ATGEAIFTALQAIATVGAVIGGGEMP----------PPARIVLFSDGKETMPTNPDNPKG 224 Query: 353 I---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 AK+ + I TIS + + + K S YN Sbjct: 225 AYTAARTAKDQGVPISTISFGTVYGFVEINGQRQPVPVDDETMKKVAQLSGGNSYNAATL 284 Query: 396 DSLIHVFQNISQLMV 410 L V+ ++ Q + Sbjct: 285 AELKAVYASLQQQIG 299 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 41/146 (28%), Gaps = 16/146 (10%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL---- 72 S++ +II ALS++ + G + + K +++ +A LA + ++ + Sbjct: 10 SQRGVVAIITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYELTGSPISPE 69 Query: 73 ----SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG--------YSAVFYNTEIQN 120 + + N A I SL+G Y Sbjct: 70 NFTRAENAGKTVGTENRVDFQGGAIAAADIDVTFSTSLAGSWLPAGGATGNSKYVRCTIT 129 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFY 146 + M M S Sbjct: 130 RNGIAPWFMQVMGFGDQTVSAIATAT 155 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 25/214 (11%), Positives = 60/214 (28%), Gaps = 48/214 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D + + A F+ + + + +GL+ + + Sbjct: 93 DASGSMMAMDLQPNRFVAAVEVAQKFIGN-------RPNDNIGLVMFAGESFTQCPLTTD 145 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T + A L + K Sbjct: 146 HATLLNRLSEVEIGYLEDGTAIGLGIATACNRLKESHAK------------------SKI 187 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------------A 371 I+ LTDG NN A+ I+I T+++ Sbjct: 188 IVLLTDGTNNAGSI--APSMAASLAESLGIRIYTVAVGTRGEAPYPHATAFGTVIDNVKV 245 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + L + ++ ++ + +SL ++ I Sbjct: 246 EIDEASLKEIAQTTGGSYFRATDNESLNQIYDEI 279 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 58/176 (32%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + K V G + Y E K+ D + T + Sbjct: 809 DFMIGLVKKADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTA 868 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + T + + +I +TDGE+++ + T Sbjct: 869 EALAFSDHMFTEARGSRL------------HKGVPQVLIVITDGESHDAEKLNTT---AK 913 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I ++ + I A N LL SS + +Y V L +F ++S + + Sbjct: 914 ALRDKGILVLAVGI-AGANSWELLAMAGSS-DKYYFVETFGGLKGIFSDVSASVCN 967 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 993 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 1052 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 1053 FGYTHIGDALRKVKYYFQPDMGSRINAG------------TPQVLLVLTDGRSQD----- 1095 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 1096 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 1153 Query: 409 MVH 411 + Sbjct: 1154 ICT 1156 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 20/175 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E S + K + + T++ Sbjct: 436 TFLSEVVGMFNIAPHKVRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTG 495 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++L ++ K ++ LT+G S + + Sbjct: 496 AALNFTLKLL-------------QRAKKERGSKVPCHLVVLTNGM-----SRDSVLGPAH 537 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K +E I++ I + Q L+ + Y V D+L ++ + Q + Sbjct: 538 KLREENIRVHAIGV--KEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEIC 590 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + +F+ ++ + V +G++ ++ ++ + + + D + I + T Sbjct: 620 MKMFMKNLVSKSQIGADRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTL 679 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ Q + K +KF+I +TDGE + + Sbjct: 680 TGSALTFVSQYFSP--------------DKGARPNVRKFLILITDGEAQDIVRDP----- 720 Query: 354 CDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I ++ ++ L+ PE + V N D L H+ ++ + Sbjct: 721 AIALRKEGVIIYSVGVFGSNVTQ---LEEISGKPEMVFYVENFDILQHIEDDLVLGICSP 777 Query: 413 K 413 + Sbjct: 778 R 778 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + LK +SI L + + +GL+ Y+ S G K Sbjct: 217 INGSQEDLDHLKA---FLGESISALDIKENCMRVGLVTYSNETRVISSLSTGNNKTEVLQ 273 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + + A+++ + + S ++ + + + +T Sbjct: 274 RIQDLSPQVGQAYTGAALRKTRKEIFSAQRG-----------SRKNQGVPQIAVLVT--- 319 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVV--NADS 397 + + T + + + I T+ I A+P+ L+ S P + N Sbjct: 320 -HRASEDNVTKAAVN-LRREGVTIFTMGIEGANPDE---LEKIASHPAEQFTSKLGNFSE 374 Query: 398 L----IHVFQNISQLMVHR 412 L + + + H Sbjct: 375 LATHNQTFLKKLRNQITHT 393 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 72/258 (27%), Gaps = 52/258 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ + + R ++ LV+D+S SM ++ Sbjct: 78 MVLTIALAGPTHDVRI-PRNRAVVMLVIDVSRSMESTDV---------------APNRLG 121 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K A F ++ + +GLI Y + P+ + V + + Sbjct: 122 AAKEAGKEFARNLTPG------INLGLIAYAGTATVLVSPTTNRDAT---VNALDNLQLA 172 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + + A Q + + P I+ ++DG+ + N Sbjct: 173 DRTATGEGIFTALQAI----------ATVGAVIGGGDKPPPARIVLMSDGKETVPSNPDN 222 Query: 350 TIKI---CDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNV 392 AK+ + I TI+ + K S Y Sbjct: 223 PKGAYTAARTAKDQQVPISTIAFGTKDGYVEINGQRQNVPYAPDMMEKVAKLSGGETYTA 282 Query: 393 VNADSLIHVFQNISQLMV 410 L V+ N+ Q + Sbjct: 283 STLGQLKEVYANLQQQIG 300 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 86/290 (29%), Gaps = 65/290 (22%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + V A+ + LV+D+SGSM P+ + + Sbjct: 165 NDNGSYTVNVDVKGAVNSTTVTTTQPIDFTLVLDVSGSM------------DDPMSKTDR 212 Query: 225 RTKMAALKNALLLFLDSI--DLLSHVKEDVYMGLIGY--------TTRVEKNIEPSWGTE 274 ++ ALK A+ FLD E V++GL+ + + ++ ++ Sbjct: 213 TRRLDALKEAVKAFLDEAANTNTEAGSELVHVGLVKFAGDKTDKIGDDMYRSGGYTYNYS 272 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ----GVKIPSLPFQ 330 ++ + T +K L + F R G + Sbjct: 273 QIVSNL-----------TADMNGLKNKVSKLKAAGATRADNGFNRAVKVMGSASARTDAK 321 Query: 331 KFIIFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPNGQRLLKT---- 381 K +IF DG + ++ + K+ + +I I AS N L Sbjct: 322 KVVIFFADGSPTSSSGFEGKVANKAVEAAKELKDGGAAVYSIGIFASANPSSLSSNENQF 381 Query: 382 --CVSSP-----------------EYHYNVVNADSLIHVFQNISQLMVHR 412 VSS Y+ + NA L +F I + Sbjct: 382 MHAVSSNFPKATKYNQLGEGNIEAGYYKSATNASELNTIFDEIEKSETTT 431 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 57/204 (27%), Gaps = 50/204 (24%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++AA K L F+ ++ +GL+ + R + + ++ V R Sbjct: 112 NRLAAAKRVLSEFIG-------ARKQDRIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDT 164 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T A+ L + +I +TDG NN Sbjct: 165 NSVEDGTALGDAILAGVNRL------------------RQRPAEGRALILITDGRNNAG- 205 Query: 346 SNVNTIKICDKAKENFIKIVTISIN----------------------ASPNGQRLLKTCV 383 AK I++ I I A + L Sbjct: 206 --AEPQLAAQAAKALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDADLDAATLKGVAE 263 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 + ++ +A L VF I + Sbjct: 264 ITGGRYFEAGDATVLSRVFAEIDR 287 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 71/212 (33%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D T++AA K F+D + + +GL+ + + S Sbjct: 94 DVSNSMAATDIAPTRLAAAKQGASAFVDQLPP------RINLGLVSFAGSATVLVPASAD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E VR + T + + Q +T+ KR T Sbjct: 148 RESVRAGIRGLQLGPA---TAVGEGIFASLQAITTAGKRFSDTGQSAPPAA--------- 195 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DGE + N I+ +A+ I + TI+ N Q L Sbjct: 196 IVLLSDGETTRGRPNNQAIEAARQAR---IPVDTIAYGTADGTLDVGGQEVPVPVNEQAL 252 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + D L V++ + + Sbjct: 253 RDIAEQTGGSYHRATSGDELRSVYRGLGSSIG 284 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 42/225 (18%), Positives = 78/225 (34%), Gaps = 46/225 (20%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E P F I LV+D S SMH +M +K++ L L Sbjct: 106 TAEYNVKEAPPFHICLVLDRSTSMH--------------------GARMDMVKSSALNLL 145 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + ++ ++ R E I P+ + + + R + T+ ++ Sbjct: 146 KQFRKQD------LISVVAFSDRAEVVIPPTRVPD-LAKDDHRISMLQVGGGTEIYQGLQ 198 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + L S F + +I LTDG + + + I++ ++A + Sbjct: 199 LGIEQLRS-----------------IDPRFMRQLILLTDG--HTYGDDEACIELAEEAAQ 239 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I+I T+ I N + L K S V + L F+ Sbjct: 240 DGIQINTMGIGHEWNDELLDKIATISGANSIFVTSPKDLNKFFEQ 284 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 15/198 (7%), Positives = 40/198 (20%), Gaps = 23/198 (11%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL 90 M L+ +G + + + M+S +AA++A ++ + + + + Sbjct: 1 MPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTDALKLKVTDWFHAQVE 60 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 I N M + + S + Sbjct: 61 NSYTLGEIDIDTTNH---------NITATASGTVPTTFMKIANIDTVPVSVASAVKGPAT 111 Query: 151 M-TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + + +L I + + Sbjct: 112 SYLNVYIVIDTSPSMLL-------------AATTAGQATMYSGIGCQFACHTGDSHKVGN 158 Query: 210 DPEDVNSAPICQDKKRTK 227 N + + Sbjct: 159 KTYANNYEYSTAKSIKLR 176 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 24/297 (8%), Positives = 76/297 (25%), Gaps = 28/297 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R ++ + ++ +I+ L++ + +G + + + ++ ++++ +A Sbjct: 1 MSSAARHPKLTRRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADAC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + + + + Sbjct: 61 ALSAARDLTSAISLSVAEADGIAAGHVNFA---FFQKSAVQMLTDSNVTFSDALTNPFLT 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S + + + L + + A+ Sbjct: 118 KTAVSTPANVK--------YVKCTATLSGIANWFVGVLNA---MPGVQVANATQVSASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V S + + + + AAL Sbjct: 167 ATVAAGQTTCAIPVFVCRASSAAPYKVGDWISSPSGSSSTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + P G+ ++ R ++ T+ Sbjct: 218 -----DGSTNETTIASELSGNTCNITSPPDLGSTGLKSASLRAWNTRFGIYTNGANG 269 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 64/201 (31%), Gaps = 35/201 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + QD ++ A K A F+ + + +G++ + + + Sbjct: 154 DTSGSMAAQDFDPNRITAAKVAAANFIAN-------RLSDRIGIVVFASDAMLQSPLTLD 206 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + ++ ++ T A+ + L S K Sbjct: 207 YESLLDFLADVRIGMVRTDGTAIGDAIAVSSVHLE------------------RSPARSK 248 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-------NASPNGQRLLKTCVS 384 II LTDGE+N+ + + A IK+ TI+ + L + Sbjct: 249 VIILLTDGESNSGVIS--PLDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIAKL 306 Query: 385 SPEYHYNVVNADSLIHVFQNI 405 + +Y N L ++ I Sbjct: 307 TGGKYYRAYNEAELTKIYAEI 327 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++RV S K Sbjct: 728 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMNVEY 784 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 785 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 832 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 833 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 880 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 881 SQIAENLKLNV 891 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 56 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 115 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 161 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 200 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++RV S K Sbjct: 428 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMNVEY 484 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 485 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 532 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 533 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 580 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 581 SQIAENLKLNV 591 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 43 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 101 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 102 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 147 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 148 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 187 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++RV S K Sbjct: 728 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMNVEY 784 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 785 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 832 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 833 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 880 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 881 SQIAENLKLNV 891 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 56 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 115 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 161 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 200 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 64/191 (33%), Gaps = 29/191 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ Y++RV S K Sbjct: 605 FELVKQFVNQVVDQL---DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMNVEY 661 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + + +G + + + TDG + + Sbjct: 662 MEKGTMTGLALKHMVE----------NSFSEAEGARPAEKNIPRVGLVFTDGRSQDDI-- 709 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAKE I + + + + L+ S P ++ + + F I Sbjct: 710 ---QEWAKKAKEAGITMYAVGVG--KAVEDELREIASDPVEKHFFYSAD-------FTAI 757 Query: 406 SQLMVHRKYSV 416 SQ+ + K +V Sbjct: 758 SQIAENLKLNV 768 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 56 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 115 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 161 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 200 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 65/227 (28%), Gaps = 45/227 (19%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + R I V+D+SGSM K+ ++ L ++ Sbjct: 79 AQTSEASRSGVDIVCVIDVSGSMQ--------------------GEKIQLVQTTLNFMVE 118 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMK 299 + + LI ++ K + K ++ + + + T+ ++ Sbjct: 119 RLSPAD------RICLISFSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGLE 172 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKA- 357 Q L + ++ II L+DG++NN D Sbjct: 173 YGLQALRQRR----------------TINQLSSIILLSDGQDNNGTTVLQRAKATMDSIV 216 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + T + L Y V + +++ F N Sbjct: 217 IRDDYSVHTFGYGHGHDSTLLNALAEPKNGAFYYVKDEETIATAFAN 263 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 68/212 (32%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D ++ A K+ F+DS+ + L+ + + P+ Sbjct: 96 DVSRSMEATDVTPNRLDAAKSGAKDFVDSLPSA------FNVALVTFAGTANVKMPPTTD 149 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +++ + + T + + +L + Sbjct: 150 RTQLKAAIDAIRLA---PSTAIGEGIYTSLDVLEKLAPQDPDHPDDPAPGA--------- 197 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--------------NGQRL 378 I+ L+DG N + + +AK+ + I TI+ S N L Sbjct: 198 IVLLSDGATNMGRDS---ADAATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAVNHAEL 254 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + S Y+ + +L V+Q IS+ + Sbjct: 255 SQVAKLSGGKKYSADSMKNLQAVYQTISRQIG 286 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 43/339 (12%), Positives = 100/339 (29%), Gaps = 55/339 (16%) Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 N K+ + K + + + + + L Sbjct: 15 QNFEKKQVQSQILDEDFLDYKPVVDILDDDEPIEYNFQKMGVGQNKQKYNLDKGLSLDVK 74 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLN-QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T+ + +S + + + + + + A + + RP + V+D SG Sbjct: 75 TLHKHFQFNSSASQSIPVMVSVKTLDKTEDSPKTDLEAAKQDRLENRPNLDLVCVIDRSG 134 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM K+ +K L LL + E+ + LI + + Sbjct: 135 SM--------------------SGNKIENVKKTLEY------LLELLGENDRLCLIAFDS 168 Query: 262 RVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 V + + + + T+ M+ A+++L K + ++ Sbjct: 169 CVSRRCHLMKTNSSNKPNLIKIINEIHCHGGTNINSGMELAFRVLKERKYYNPVSS---- 224 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-AKENFIKIVTISINASPNGQRLL 379 I L+DG+ + +++ + +K + I + + +G + Sbjct: 225 ------------IFLLSDGQ--DGGADLRVRQSLEKHLSQECFTIHSFGFGSDHDGPLMN 270 Query: 380 KTCVSSPEYHYNVVNADSLIHV--------FQNISQLMV 410 K C Y V + + F I+Q ++ Sbjct: 271 KICSLKDGNFYYVEKINQVDEFFVDALGGLFSVIAQEII 309 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 72/226 (31%), Gaps = 43/226 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM + K+ K+A +D ++ Sbjct: 399 NKEKERNVAVVLVIDHSGSM--------------GGSNLRNINKLEIAKSAAAKMIDHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + + K + + T P + +A Sbjct: 445 SSDS------VGVIAFDHNFYWASKFGKLKSKNE-VIENISTIQVGGGTAIIPPLTEAVN 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +L K + K I+ LTDG AK N IK Sbjct: 498 LLKKSKAKD------------------KVIVLLTDGYGEEGGYEYP----ASIAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TI + +S N L + Y V +A +LI VF ++++ Sbjct: 536 ITTIGVGSSINAPILSWMAAYTSGRFYYVKDASNLIDVFLKEAKII 581 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 33/259 (12%), Positives = 72/259 (27%), Gaps = 65/259 (25%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + +D+S SM ++ A K + Sbjct: 101 SNETNTQYTEGINMIFAIDVSESMKITD---------------IHPSRFDAAKQICTDII 145 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAM 298 + + + +G++ ++ + ++ + + L+ T A+ Sbjct: 146 NK-------RSNDRIGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSGTAIGTAL 198 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + + + + I+ ++DGEN + + I D Sbjct: 199 GTAINRLKNAETKE------------------RIIVLISDGENTSGLMD--PITAADLCL 238 Query: 359 ENFIKIVTISINASP------------------NGQRLLKTCVSSPEYHYNVVNADSLIH 400 E IKI I + + L ++ Y + SL Sbjct: 239 EYNIKIYCIGLGKDGTHQFKDDNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLDD 298 Query: 401 VFQNISQ----LMVHRKYS 415 V NI Q +V +++ Sbjct: 299 VIANIDQLEKGKIVQLQFT 317 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 80/246 (32%), Gaps = 25/246 (10%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAP--ICQDKKRTKMAALKNALLLFLDSIDLL 245 RP ++++ + M D K T+M K + F+ Sbjct: 67 RPQAGTSYSLEVNEGIAIQMLVDVSSSMDMSVKNFDGKSTTRMEVAKEMVERFIAGDGED 126 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQ 303 + +GLI + + ++G + + Q V + T A+ A Sbjct: 127 LQGRPHDLIGLITFARYADTRSPLTFGHDALLQIVRHLTIQERPNEDGTAYGDALALAAA 186 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K ++ K II LTDGENN+ I+ AK K Sbjct: 187 RL----KNPQELRHGKRPDAQAEAIESKVIILLTDGENNSGSH--LPIEAAGLAKAWDCK 240 Query: 364 IVTISINASPNGQRLL-----------KTCVSSPEYHYNVVNADSLIHVFQNISQ----L 408 I IS+ S + + L + + + +SL+ V++ I + Sbjct: 241 IYAISLGESLDAENPLDALSPAERVLEHISIETGGVFRQAHDFESLLSVYEEIDRLERAE 300 Query: 409 MVHRKY 414 + +R + Sbjct: 301 ISNRSF 306 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 47/382 (12%), Positives = 111/382 (29%), Gaps = 41/382 (10%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 H K + A + + +I+ A + + + Sbjct: 97 RAHSSKAELLQAVRR-----IQPLSTGTMTGLAIQFAITKALSDAEGGRPRSPDISKVVI 151 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 ++ ++ + I + + R+D + + DY + Sbjct: 152 VVTDGRPQDSVRDVSARARAGGIELFAIGVGRVDKATLQQIASEPQDEHVDYVESYSVIE 211 Query: 165 LNQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + Q+ + L + + + +NSD + N Sbjct: 212 KLSKKFQEAFCLVSDLCATGDHDCEQVCVSSPGSYTCACREGFTLNSDGKTCNVCNGGGG 271 Query: 223 KKRTKMAALKNA-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 T + L + + F++ I D L + +GL+ Y++ V + Sbjct: 272 SSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLG 331 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K S + K T + A+K +F Q Sbjct: 332 RFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQ 378 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EY 388 K I TDG + ++ ++ KAK+ K+ + + + L+ S P E+ Sbjct: 379 KVGIVFTDGRSQDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEH 431 Query: 389 HYNVVNADSLIHVFQNISQLMV 410 ++ + ++ + + + + + Sbjct: 432 YFYTADFKTINQIGKKLQKRIC 453 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V++ + K + T + Sbjct: 64 FLSQVIESLDVGPNATRVGLVNYASSVKQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGL 123 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + +V+ Sbjct: 124 AIQFAITKALSDA----------EGGRPRSPDISKVVIVVTDGRPQDSVRDVSAR----- 168 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ I++ I + + L+ S P E+ V + + Sbjct: 169 ARAGGIELFAIGVGRVD--KATLQQIASEPQDEHVDYVESYSVIEK 212 >gi|319780570|ref|YP_004140046.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166458|gb|ADV09996.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 359 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 22/198 (11%), Positives = 58/198 (29%), Gaps = 16/198 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R I S +F+ I L+++ + IGF + +++ ++A +AAIL+ + Sbjct: 1 MRGLAGSFIESRSGSFAPILILAMIPLITAIGFSVDYTSAVQTRSTEQAALDAAILSITT 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 M + ++ + +++ A S + + Sbjct: 61 -MDTASTKPQRQVAMQASYMANGGQGTAT-------LNSFDVSANGTATAQASASFAMPT 112 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + M + + + + + + + A Sbjct: 113 VFMQIARIPSVAVGVTSAVNKAPALVEATFKVTGVSGY--WNKKMTLYGTMFGAAAG--- 167 Query: 186 GERPIFLIELVVDLSGSM 203 +P+ I+ +G Sbjct: 168 --KPLMTIDYAYGKTGDP 183 >gi|295669664|ref|XP_002795380.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] gi|226285314|gb|EEH40880.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] Length = 773 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 65/216 (30%), Gaps = 28/216 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM + D + + K+A +++++ Sbjct: 75 DIVLCIDVSGSMQLSAPLPTTDESGKREETGLSV--LDLTKHAARTIIETLNEND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++ E + S + ++ + L T+ +K +L Sbjct: 128 -RLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVL----- 181 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVT 366 G + + LTDG+ N I ++ K+ I T Sbjct: 182 ----------GKVDLRPQNVQALYVLTDGQPNHMCPRQGYVPKLRPILERQKDRLPLIHT 231 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 232 FGFGYDIRSGLLQSIAEVGGGTYSFIPDAGMIGTVF 267 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 62/192 (32%), Gaps = 25/192 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTRDMDS 286 + +A + +D + + V +G+ Y+ +K + ++ + Sbjct: 1417 SNFTSAKTFMKEIVDSFTISENRVRIGVAQYSANPKKEFFLNEYYSSSDMKKQIDSISQL 1476 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + ++ Q +++I +TDG +N+ + Sbjct: 1477 KAT--TYTGKGLRFVKQFFDPANGG--------------RKNVPQYLIVMTDGMSNDSVN 1520 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +KI +I I + + ++ SP+ Y V +L + + I Sbjct: 1521 ED-----AAALRSSGVKIFSIGIGLRNSFELVM--IAGSPKNVYEVETFQALDSIKRQIV 1573 Query: 407 QLMVHRKYSVIL 418 + I+ Sbjct: 1574 AQVCEPSDQPII 1585 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 56/176 (31%), Gaps = 20/176 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ I + V G++ Y+ S + + T + Sbjct: 860 KFMIELISMFQVGANRVRFGVVQYSDVRRTEFFISEHNTQKMLKDAISQIEQLGGGTLTG 919 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ Q+ + K + ++ +TDGE+ + + Sbjct: 920 EALTSMKQLFVNA-------------AKDRPHKVPQSLVVITDGESQD-----RVTEAAA 961 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + I I I + + ++ S E + V N DSL + ++++ + Sbjct: 962 EIRNDGITIFAIGV--KNAVEEEIRDIAGSNEKMFFVNNFDSLKVIKNDLARELCT 1015 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 54/171 (31%), Gaps = 17/171 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ K++V G + Y T + + + K+ + T + A++ Sbjct: 1239 VNDSLVGKDNVQFGAVVYGTNPAEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYT 1298 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + +I +TDG I + K++ Sbjct: 1299 RIRFGTSYGG--------------RPGISHILILVTDGATTEADRPNLPI-VSKALKDDG 1343 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I + + + + L+ P+ + V N L ++ NI+Q++ Sbjct: 1344 IIVFAVGVG--KAVPQELQQIAGYPDRWFLVQNYKGLDNIHDNITQVVCDE 1392 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 42/351 (11%), Positives = 99/351 (28%), Gaps = 59/351 (16%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + +S++ + + V + E Q+ V + Sbjct: 902 LKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKDRPHKVPQSLVVITDGESQDRVTEAAA 961 Query: 128 SMTHMANNRLDSSNNTIFYN----MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + + + L + + P + Sbjct: 962 EIRNDGITIFAIGVKNAVEEEIRDIAGSNEKMFFVNNFDSLKVIKNDLARELCTPEACK- 1020 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-I 242 I +VD S S++ + F++S + Sbjct: 1021 ----NMKADIVFLVDSSASINSDDY------------------------ETMKEFMESMV 1052 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + V +GLI +++ ++ + K + + T A+K Sbjct: 1053 KQAEIGPDRVQIGLIQFSSETKEEFPLNRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTL 1112 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K T ++++I +TDGE + N +++ + Sbjct: 1113 PYFSASKGGRVNT--------------KQYLIVITDGEAQDAVGNP-----AKAIRDHGV 1153 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF--QNISQLMVH 411 I I + N +LL+ + +Y + DSL F +NI + + Sbjct: 1154 IIYAIGVQQ-ANNTQLLEIAGKQEQVYYE-DSFDSL--AFLNKNIMFEICN 1200 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 63/193 (32%), Gaps = 22/193 (11%) Query: 222 DKKRTKM--AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + T+M A + A D + L + +GL+ Y + + K Sbjct: 241 VESTTRMGDATFEKAKNFLYDLVSNLDVGINKIRIGLVTYNDETNPEFLLNSYSSKTEIL 300 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T + A++ T F + + +I +T+G Sbjct: 301 ESIQNMKYVEGYTYTGRALEY-----------VNTTYFTQAAGSRFEESVAQILIIVTEG 349 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADS 397 + S+ + + K I + + + R L+ S P+ + Y + + D Sbjct: 350 D-----SSDTLTEPAKELKSRGISVYVVG--TNIKYDRQLQEASSKPDEKFFYQLDDFDD 402 Query: 398 LIHVFQNISQLMV 410 +V + + + + Sbjct: 403 SENVTEQLLKNLC 415 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 60/181 (33%), Gaps = 24/181 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTD 293 ++ + + +GLI Y+ + + K V +Y+ T Sbjct: 54 NFLYTLVNGFDVGLDKIQIGLIQYSDNARTEFFLNSYSNKEDVLKYIQNLK--YKGGGTK 111 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + +++ +F + + + +TDG+ + + Sbjct: 112 TGLSLEFMLT-----------QHFSEAAGSRAAEGVPQIAVVITDGQAQDSI-----REP 155 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 K I + I I + + L S P+ + Y+V + ++L + QN+ Q++ Sbjct: 156 AIAVKNAGIILYAIGIKDAVLSE--LNEIASDPDDKHVYSVADFNALQSISQNMIQVLCT 213 Query: 412 R 412 Sbjct: 214 T 214 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 37/352 (10%), Positives = 115/352 (32%), Gaps = 31/352 (8%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS-RIS 128 S+ + + + +++ A ++ +G +I IV Sbjct: 294 SSKTEILESIQNMKYVEGYTYTGRALEYVNTTYFTQAAGSRFEESVAQILIIVTEGDSSD 353 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 L S + Y + YD +LQ ++++ ++ + E+ Sbjct: 354 TLTEPAKELKSRGIS-VYVVGTNIKYDRQLQEASSKPDEKFFYQLDD---FDDSENVTEQ 409 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + +DL+ ++ +D + + +K+ ++ ++ ++ Sbjct: 410 LLKNLCFSIDLNIQVYSKRYADVVFLVDSSTSMGTIF--FQKMKDFIIHIINQLN---VG 464 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +GL Y+ + + K + + T +K Sbjct: 465 INKHRIGLAQYSGLPQTEFLLNHYETK-----EEILKHIKETFTYRGGPLKTG-----HA 514 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + T F + + +F++ +T + KS + ++ K + +++ Sbjct: 515 LEFVRSTFFIEEAGSRINYGNPQFLVVIT-----SSKSEDAVRRHAEELKSVGVTTISVG 569 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH---RKYSVI 417 I S + L+ + P + + + ++ Q+++ +++ ++S + Sbjct: 570 IGNSDRKE--LEKIATDP-FVFQTTGLQHISNLQQDVANVIIAEDMLQFSPV 618 Score = 42.9 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 58/173 (33%), Gaps = 20/173 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L +V +GL+ Y+ ++ + EK + A+ Sbjct: 659 VSALDISLSNVRVGLVLYSDEPRLELKLNTFNEKYEILDFITKLPYRGGKAHTGAALDFL 718 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + + P Q+ + +T+G +S N K K + + Sbjct: 719 RKKMFTKQNG-----------GRPHQGVQQIAVVMTNG-----QSMDNFTKPAAKLRRSG 762 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 +++ + + + L S P ++ NV + L ++ I + + + Sbjct: 763 VEVFAVGFQNINDTE--LDIIASHPPRKHVTNVESFLQLSNLEFRIQKRLCNE 813 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 46/376 (12%), Positives = 108/376 (28%), Gaps = 49/376 (13%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA+L ++ + K + R I + + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDGRPQDSV 161 Query: 119 QNI--------VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 +++ V I + + L + ++ R Sbjct: 162 RDVSARARASGVELFAIGVGRVDKATLQQIASEPQEEHVDYVESYNVIE-----KLSRKF 216 Query: 171 QKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 Q+ + L + + + +N+D + N T + Sbjct: 217 QEAFCVVSDLCATGDHDCEQVCISSPGSYTCACREGFTLNTDGKTCNVCSGGGGSSATDL 276 Query: 229 AALKNA-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 L + + F++ I D L + +GL+ Y++ V + K Sbjct: 277 VFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKK 336 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 S + K T + A+K +F QK I Sbjct: 337 DIKAAVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVF 383 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + ++ ++ KAK+ K+ + + + L+ S P E+++ + Sbjct: 384 TDGRSQDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTAD 436 Query: 395 ADSLIHVFQNISQLMV 410 ++ + + + + + Sbjct: 437 FKTINQIGKKLQKKIC 452 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 54/165 (32%), Gaps = 17/165 (10%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G++ Y + V++ K + T + Sbjct: 63 FLSQVIESLDVGPNATRVGVVNYASTVKQEFPLRAHVSKAALLQAVRRIQPLSTGTMTGL 122 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + +V+ Sbjct: 123 AIQFAITKAFSDA----------EGGRSRSPDISKVVIVVTDGRPQDSVRDVSAR----- 167 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIH 400 A+ + +++ I + + L + E+ V + + + Sbjct: 168 ARASGVELFAIGVGR-VDKATLQQIASEPQEEHVDYVESYNVIEK 211 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 62/231 (26%), Gaps = 42/231 (18%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN----IEPSWGTEKV 276 +M A K A + F + + ++GL +TT E+ + T+ Sbjct: 604 PKVVMKRMDAAKLAAITFNNMLGEGD------FVGLATFTTYAERISVNQTPLKYMTKDK 657 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN-------------FFRQGVK 323 + TD A+ ++ N G Sbjct: 658 LRVNNEIEGLYAKLATDHADALYWGVKVF-PIWPDETQNNCTECINNTRPLMILLTDGET 716 Query: 324 IPSLPFQKFIIFLTDG---ENNNFKSNVNTIKICDKAKEN------FIKIVTISINAS-- 372 + + +N + + D K N I I TI Sbjct: 717 TTCDKNEDYFNNTCKNKCVRDNGHYGAQQALCVADYIKRNIKVNGFNIPICTIGFGTDIG 776 Query: 373 PNGQRLLKTCVS-----SPEYHYNVVNADSLIHVFQNISQ--LMVHRKYSV 416 +GQ L+ S ++ ++ LI ++ I + + S+ Sbjct: 777 SDGQAFLRDIASPRPDNGEACYFFATTSEELIEAYKTIFNIFQIAAKNISI 827 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 69/241 (28%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + L +D+S SM +++AA + F D++ Sbjct: 85 PRNRATVILAIDVSLSMRATDVP---------------PSRLAAAQAGAKTFADNLTPG- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL + + P + + T + A+ A Q + Sbjct: 129 -----INLGLEAFAGTASMLVSPITDHTATDNALDHLQLAER---TATGEAIFTALQAID 180 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 + V P I+ +DG+ + AKE + Sbjct: 181 T----------LAGVVGGGGTPPPARIVLESDGKQTVPTDLNDPRGAFTAARLAKEQGVP 230 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + + L + S + +AD L +QN+ Q + Sbjct: 231 ISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQI 290 Query: 410 V 410 Sbjct: 291 G 291 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 68/211 (32%), Gaps = 56/211 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + QD K +++ K A+ ++ + + +GLI + + + Sbjct: 97 DVSNSMMAQDIKPSRLEKAKMAISRMVEKL-------SNDKIGLIVFAGDAYVQLPITTD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ D + ++ T A+ A + T + K Sbjct: 150 YSSAKLFLSNISTDIVPVQGTAIGSAIDLAARSFTPE------------------TETSK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + + +A E I I TI + Sbjct: 192 AIIVITDGENHQDDA----VAAAKQAHEKGIVIHTIGMGLEQGAPIPEKGKPGQFMQDAQ 247 Query: 374 --------NGQRLLKTCVSSPEYHYNVVNAD 396 + Q L + + N + Sbjct: 248 GNVVISKLDEQTLQDIAKAGEGLYIRASNTE 278 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 79/242 (32%), Gaps = 48/242 (19%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 P + ER + LVVD SGSM K+ +K+ L Sbjct: 677 GPCFEQQAQKERKGVDLVLVVDKSGSMA--------------------GQKLDMVKSTLS 716 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDST 295 +D + + ++ + T+V+ N++ + + ++ + ++ T+ + Sbjct: 717 FMVDQLKEKD------RVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISPGSCTNLS 770 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKI 353 A+ + ++L S ++ +I TDG N +N + Sbjct: 771 GALFTSLKLLASRQQEKNEVT---------------SVILFTDGLANRGLISTNEILQNM 815 Query: 354 CDKAKE----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 D E + + I T + L + + AD + F N+ + Sbjct: 816 QDLMDELLSTSNVTIHTFGFGQDTDANMLTSIAQKGNGLYDYLETADDIPKAFGNVIGNL 875 Query: 410 VH 411 V Sbjct: 876 VS 877 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 39/355 (10%), Positives = 95/355 (26%), Gaps = 26/355 (7%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L+ FR ++ + S+IFA++ + L+ +G + K ++ + +AA+LA Sbjct: 7 LATFRNLCRRFGWASDGQISMIFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLA 66 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + ++ S + A ++ SL+G + Sbjct: 67 GVVRPND--QQISTAAAVFSGAYRGRFDTAATASFASNTDGSLTGT----------ATTS 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + M + L + + S L I Sbjct: 115 VKTSFLNVMGTSALGVTASATAKAGAQAQSSVCILLVSTVNAQSLLVNSGAQLNAPSCEI 174 Query: 184 EMGERPIFLIELVVDLSGSMHC--------AMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + L+ C +P D K+ A+ + Sbjct: 175 HVLSTQSPAAMFNATLNAKRICIKGSNITKNGGVNPPVETGCAAISDPFVGKLPAV--TV 232 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + G+ T + + + + T + Sbjct: 233 GSCTTNNKVFDPGAVTLNPGVYCGATNFNGSGTLTLNPGLY--IIKGGAMTFNSGWTVTG 290 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + + ++ + +F N + + ++F +G + + T Sbjct: 291 SGVT--FYLVDQNATLTFNGNVNATLAAPSTGTYANILMFEPNGLSTSQLPINGT 343 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 60/207 (28%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + S + V + + K+ +K A+ + + + + ++ + Sbjct: 51 TGGDSTSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + TE + Q + T+ + ++ ++L K S Sbjct: 105 LDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ ++DG+ N + K + + T A + L Sbjct: 164 --------------VMLMSDGQQNRGEP-------AANVKIGNVPVYTFGFGADYDPTVL 202 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S V + + L F Sbjct: 203 NAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 68/242 (28%), Gaps = 54/242 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R ++ LV+D+S SM ++ A + A F + Sbjct: 86 PRNRAVVVLVIDMSQSMRATDVP---------------PNRLKAAEEAASQFAAQLTPG- 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+G+ + P+ + + + T + A+ A ++ Sbjct: 130 -----INLGLVGFAGTPYLLVPPTP---QHQATIDALKKLDFADSTATGEAIFTALHAVS 181 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFI 362 + T I+ L+DG N + + A++ + Sbjct: 182 ATAITGGDTPP------------PARIVLLSDGRENKPSNPSDPHDGVYTAARLARDEGV 229 Query: 363 KIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I TIS + ++ S Y N L + I + Sbjct: 230 PISTISFGTKTGEIEMDGQRVAVPVSTDQMKTIAKLSGGQSYTAGNLAELNKSYNAIEKD 289 Query: 409 MV 410 + Sbjct: 290 IG 291 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 75/259 (28%), Gaps = 31/259 (11%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + I S E LV + H + + K+ T Sbjct: 753 GKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHLKALERKLIFHVCALHDCKRITL 812 Query: 228 MAAL----------KNA----LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + K + L + + ++ V G + Y+ + + + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHMINLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYS 872 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T + A+K A + T + N ++ + Sbjct: 873 NRSAIIENLRKRRDTGGNTYTAKALKHANALFTEEHGSRIKQN------------VKQML 920 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 I +TDGE+++ +T + + I I + + Q+ L+ + V Sbjct: 921 IVITDGESHDHDQLNDT---ALELRNKGITIFAVGVG--KANQKELEGMAGNKNNTIYVD 975 Query: 394 NADSLIHVFQNISQLMVHR 412 N D L VF + + M Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 57/177 (32%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 464 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYVTDYSNDIDLRKAIFNIKQLTGGTYTGK 523 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 524 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 565 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I + L+ E N D+L + + + + K Sbjct: 566 LRAEQITVHAVGIG--AANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREICAEK 620 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 792 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 71/252 (28%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + ++LL + E + D L Sbjct: 1120 RYDVAD-----AVKTLKDLGICVLVLGIG-NVYKEQLLPI-TGNSEKIITFQDFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVKKRIIREICQ 1184 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 75/259 (28%), Gaps = 31/259 (11%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + I S E LV + H + + K+ T Sbjct: 753 GKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHLKALERKLIFHVCALHDCKRITL 812 Query: 228 MAAL----------KNA----LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + K + L + + ++ V G + Y+ + + + Sbjct: 813 LDVVFVLDHSGSIKKQYQDHMINLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYS 872 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T + A+K A + T + N ++ + Sbjct: 873 NRSAIIENLRKRRDTGGNTYTAKALKHANALFTEEHGSRIKQN------------VKQML 920 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 I +TDGE+++ +T + + I I + + Q+ L+ + V Sbjct: 921 IVITDGESHDHDQLNDT---ALELRNKGITIFAVGVG--KANQKELEGMAGNKNNTIYVD 975 Query: 394 NADSLIHVFQNISQLMVHR 412 N D L VF + + M Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 57/177 (32%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 464 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYVTDYSNDIDLRKAIFNIKQLTGGTYTGK 523 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 524 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 565 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I + L+ E N D+L + + + + K Sbjct: 566 LRAEQITVHAVGIG--AANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREICAEK 620 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 792 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 71/252 (28%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + ++LL + E + D L Sbjct: 1120 RYDVAD-----AVKTLKDLGICVLVLGIG-NVYKEQLLPI-TGNSEKIITFQDFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVKKRIIREICQ 1184 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 72/218 (33%), Gaps = 50/218 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D SGSM ++ A F+ +ID L++ Sbjct: 768 DIVFVLDNSGSMSSND---------------PNYYRI----EATKKFIQNIDELNN---- 804 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + V + K+ Q + T+ +K A ++ Sbjct: 805 -RVGLVDFDSSVYVRSNLTSDKNKLLQALNAM--RWTGGSTNIGGGLKAALELF------ 855 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K I+ L+DG +N + + + + I + TI++ Sbjct: 856 -------------DQEQSKKIIVLLSDGYHNTGIHPNDVLP---ELIKQEIVVNTIALGK 899 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + L + ++ V N L +++ + + Sbjct: 900 DCDRELLHDIADKTKGDYFYVDNTGGLSQ--EDVDKQI 935 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 60/207 (28%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + S + V + + K+ +K A+ + + + + ++ + Sbjct: 51 TGGDSTSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + TE + Q + T+ + ++ ++L K S Sbjct: 105 LDTANRICPLQQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ ++DG+ N + K + + T A + L Sbjct: 164 --------------VMLMSDGQQNRGEP-------AANVKIGNVPVYTFGFGADYDPTVL 202 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S V + + L F Sbjct: 203 NAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 27/377 (7%), Positives = 93/377 (24%), Gaps = 45/377 (11%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K +I L+++ + + V + K +++ ++A L+ A + + Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQA 71 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 +D ++ + + N++ + Sbjct: 72 EAIVAGNEAINNNLTLDGFGSMTIDNTDIHYEFSDS----LPFDSSTNTATSPYVRVRIE 127 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 +D ++ F + + I Sbjct: 128 DVDVAD-------------YLVAIFN------------IDMSARSSAVAGPSSSITTTCN 162 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VV LS ++ + + K + + + ++ + G Sbjct: 163 VVPLSICEGSESSTTLSGYSEGSLHVLKASSTKDSAIGSGNFMPMALKDADG---NTIPG 219 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + + ++ PT + +F Sbjct: 220 ASSYGDA--LAGSFDTCLTVIEDEMITSEPGNMVGPTR-------GIDTRFGIYEGTFKN 270 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-INASPN 374 + + + ++ + + ++ N + + I + + +A+ + Sbjct: 271 DENKYPADKDTYYDSNNLVEVQTVQIDDGDGNTIEEYQTTLSHDA---IYSFADYSANQD 327 Query: 375 GQRLLKTCVSSPEYHYN 391 + L + ++ Sbjct: 328 FESCLSDSACQSQGYFR 344 >gi|325964113|ref|YP_004242019.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] gi|323470200|gb|ADX73885.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] Length = 352 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 16/177 (9%), Positives = 42/177 (23%), Gaps = 19/177 (10%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 E+ +I+ A+ +++ L + V + +K +++ +A LA A + + ++ Sbjct: 22 ERGAATILVAVLMVALLGCAALAVDVGAMYAEKAQIQNGADATSLAIAEECANGVNCAVA 81 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + A D A + + S + + Sbjct: 82 MAAPANRLADANANDGATGVFSVTQPSPSTIRVETNAREAGSGDNHFSLFFARVLGIDTS 141 Query: 138 DSSNNTIFYNMDVMTSYDYR-------------------LQFIEHLLNQRYNQKIVS 175 + + L I+ Sbjct: 142 QITAAAEASWGPPSSGSTLPWTVSECVFRKYLTPTQLTELDTTGSFTGDPTPTHILL 198 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/345 (9%), Positives = 76/345 (22%), Gaps = 87/345 (25%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 L + + + + L Sbjct: 85 EMEMERQKKEIVLVLDTSGSMNEKVGKVCTNNRGWY-----CKTHNSSDLYHRESLFYHN 139 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + K TKM LK A F+D + + +K + ++ Y++ Sbjct: 140 WINDYCEEHGKVGQHYASYSKSTKMEELKKAANNFIDKMKDVPDLK----ICIVNYSSEA 195 Query: 264 EKNIEPSWGTEKVRQYVTR---------------------------DMDSLILKPTDSTP 296 + +K V L T++ Sbjct: 196 -TINPCGYNGDKNSASVEEDRHHTIPNYKSLGTKFLNSNDNTLHSMINGLKALGGTNTGE 254 Query: 297 AMKQAYQILTSDKKRS-------FFTNFFRQGVKIPSLPFQKFIIFLT------------ 337 +++A +L K + V QK+ L Sbjct: 255 GLRKAEYMLEQGDKDAKKTIVFMSDGLPTYYSVYKNHQNVQKYYWVLKYSWDNGYHWEKE 314 Query: 338 ------DGENNNFKSNV-------------------NTIKICDKAKENFIKIVTISINAS 372 D + ++ KI + K N + ++ Sbjct: 315 YYWKEEDYWDYYTSTDDTFPNYAGTGNSDDQGCCKKYAKKIGEIIKSNNSNVFSLGYGLG 374 Query: 373 P---NGQRLLKTCVSSPEYH---YNVVNADSLIHVFQNISQLMVH 411 + +++K S + + ++ +F I+ ++ Sbjct: 375 NKNSDANKIMKEIHDSMGGAEKDFFATDTGAIDEIFNQIADKIIE 419 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 60/207 (28%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + S + V + + K+ +K A+ + + + + ++ + Sbjct: 51 TGGDSTSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + TE + Q + T+ + ++ ++L K S Sbjct: 105 LDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ ++DG+ N + K + + T A + L Sbjct: 164 --------------VMLMSDGQQNRGEP-------AANVKIGNVPVYTFGFGADYDPTVL 202 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S V + + L F Sbjct: 203 NAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 48/237 (20%), Positives = 76/237 (32%), Gaps = 49/237 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I+++VD+S SM M +++ T+M K + F+ + D Sbjct: 88 IQVLVDVSSSMDINMKYG-----------EERLTRMDVAKIVVEKFIGGDGDELVGRPDD 136 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI + + S E + V T Sbjct: 137 LIGLITFARYADTIAPLSLAHEALISIVQDVTINTRPNEDGTAYG--------------D 182 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + ++ K II LTDGENN ++ AKE IKI TISI Sbjct: 183 ATALAAAQLDLLQGDQDIKSKIIILLTDGENNCGNH--LPLQAASLAKEWGIKIYTISIQ 240 Query: 371 -------------------ASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQ 407 G ++LK S+ + DSL V++ I++ Sbjct: 241 NKPTPERKKTDQGTFFVPPTPSAGDQVLKKMAESTGGVFRLAHDYDSLKSVYKEINK 297 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 75/255 (29%), Gaps = 52/255 (20%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 I + ++ R + LV+D+S SM K T++ A + Sbjct: 72 IFALAGPTAEQKV-PRNRATVMLVIDVSLSMEATDV---------------KPTRLRAAQ 115 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +A F + + V +GLI + P+ E V + + T Sbjct: 116 DAARSFAEGLTPG------VNLGLISFAGTATVLAAPTTEREGVVHAIENLKLAQ---ST 166 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A Q + S I+ +TDG+ + Sbjct: 167 ATGEGIFAALQAIESFSAVIGGAEGP----------PPARIVLMTDGKQTVPQDEYAPRG 216 Query: 353 I---CDKAKENFIKIVTISINAS--------------PNGQRLLKTCVSSPEYHYNVVNA 395 AK+ I I TIS S + + + S Y A Sbjct: 217 AFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASMREIARLSGGDFYKAATA 276 Query: 396 DSLIHVFQNISQLMV 410 + L V+ ++ + + Sbjct: 277 EELKQVYDSLGEQIG 291 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 30/295 (10%), Positives = 77/295 (26%), Gaps = 52/295 (17%) Query: 148 MDVMTSYDYRLQFIEHLL------NQRYNQKIVSFIPA------------------LLRI 183 + L + Y I +++ Sbjct: 548 TRPPVDVTFVLDRSGSMGPHNPTSWSAYEPDYEIDIGEEWEPIPTDEPFRNTHDWKSIQV 607 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--MAALKNALLLFLDS 241 + I +E + + P + + Sbjct: 608 RDDDGTIRTLEHRDFVHPDDWTEIRVHPYHQFGYIPGSIGIYPHPGNDPTNQRVEATRNV 667 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID L + +G+ + + S E ++ V T+ ++ A Sbjct: 668 IDELDPSADR--VGVYDFASSGRALHPLSDDLESAKESVV----GTAYGGTNMAAGLEAA 721 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + ++ +I L+DG+N+N ++ ++ D++ + Sbjct: 722 LN----------------DYATRGTDDRERIVILLSDGKNSNTANDERMDELADRSDDLD 765 Query: 362 IKIVTISIN----ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + T+ ++ S +L + +Y + D L+ +F+ I + Sbjct: 766 YTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDEEIDL 820 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 62/211 (29%), Gaps = 43/211 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + I +D ++ A K F+ + +GLI + + Sbjct: 110 DISDSMIEKDLSPNRLEAAKRMARQFIK-------GRLQDRIGLIVFAGEAVSLCPLTTD 162 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + ++ + T A+ A + + K Sbjct: 163 YELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRM------------------RDTAGESK 204 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQ 376 I ++DG+N + N+ A +K+ TIS+ + Sbjct: 205 VAILISDGDNTSG--NLGPTTSAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDEG 262 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 L ++ + +L VF+ I Q Sbjct: 263 ELQNIAGIGNGKYFRATDNTALESVFKQIDQ 293 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 72/235 (30%), Gaps = 44/235 (18%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 K++ + E P + +VD+SGSM+ K+ Sbjct: 171 AKLIKIGIQAQDLAKKELPPANLVFLVDISGSMN-------------------SPEKLPL 211 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K L + + + + LI Y + E + P+ G K + Sbjct: 212 VKKTLRILTEQLRPQD------KVTLITYASGEELVLPPTSGRNKDE-ILRAINKLQAGG 264 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVN 349 T A+K AY+ ++ I+ TDG+ N S Sbjct: 265 STAGESALKMAYEQAQKAYVKNGIN----------------RILLATDGDFNVGVSSTDA 308 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 + + +++ I + T+ + +++ + + + N V Q Sbjct: 309 LKSMVAEKRKSGISLTTLGFGTGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQ 363 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 69/241 (28%), Gaps = 51/241 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + L +D+S SM +++AA + F D++ Sbjct: 85 PRNRATVILAIDVSLSMRATDVP---------------PSRLAAAQAGAKTFADNLTPG- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL + + P + + T + A+ A Q + Sbjct: 129 -----INLGLEAFAGTASMLVSPITDHTATDNALDHLQLAER---TATGEAIFTALQAID 180 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 + S P I+ +DG+ + AKE + Sbjct: 181 TLAGVLGG----------GSTPPPARIVLESDGKQTVPADLNDPRGAFTAARLAKEQGVP 230 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I TIS + + L + S + +AD L +QN+ Q + Sbjct: 231 ISTISFGTTHGAIDLNGSHIPVPVDDESLRRIAELSGGSFFTATSADELQASYQNLQQQI 290 Query: 410 V 410 Sbjct: 291 G 291 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 75/224 (33%), Gaps = 48/224 (21%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L +DLSGSM S + ++AA+K F+ + Sbjct: 97 IMLALDLSGSMERVDFS----------LDGRNVARLAAVKRVGADFIRR-------RAGD 139 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + + + S+ T V + L+ + T + A + L Sbjct: 140 RIGLVIFADQADVAASLSFDTASVAHALDEAQIGLVGRSTGIGDGLGLALKRLD------ 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 + +K ++ L+DG NN + + A+E I++ TI++ Sbjct: 194 ------------ATPAREKVVVLLSDGANNAGQ--TTPHDVAALARELGIRVHTIALGPR 239 Query: 371 ---------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + L +S + V D L V +I Sbjct: 240 DLSDAEGDPDVVDTEALRDVATTSGGRFFRVRTTDDLAAVADSI 283 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 25/256 (9%), Positives = 75/256 (29%), Gaps = 33/256 (12%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L + + ++ + + ++ GS + D + Sbjct: 796 LVWNGSNWGAEGSSSSLASSGYTRWVNLESNVDSDEISLITSDGSSDINVQKWNGDSWNV 855 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 P + ++ + + + + D+ + + + + + + S Sbjct: 856 PYEVEISSSRYYEVSDIVYPWKDTTYSSTELNWKEWRATVTSSLSNNSLVHLS------- 908 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T + +A L++ S ++ ++ Sbjct: 909 ---NSIDTITADGLTAIDEGLYEANNELSAITGNS-------------------TVVLMS 946 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNA 395 DG +N ++ + +AKE+ I T+ + + + L + + +Y N+ Sbjct: 947 DGLDNAGHHSLIEEAL--RAKEHNTVIYTVGLGNNEDEVDPILCEIANITGGKYYFAPNS 1004 Query: 396 DSLIHVFQNISQLMVH 411 L +F I+ + + Sbjct: 1005 TVLEDIFIGIASEITN 1020 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 69/219 (31%), Gaps = 43/219 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM ++ K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVMLVLDHSGSMADTEDAGIP--------------KLEIAKSASAKMVEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + + K + + T P + +A + Sbjct: 445 SSD------GVGVIAFDHNYYWAYKFGKLVRK-EDVIESISSIEVGGGTAIIPPLSEAVK 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K ++ K ++ LTDG + D+AK N IK Sbjct: 498 TLKKSKAKN------------------KLVVLLTDGMGEQSGYEIP----ADEAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I TI + N L + Y V N L+ VF Sbjct: 536 ITTIGVGKFVNASVLSWIAAYTSGRFYLVSNPSELVDVF 574 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 58/204 (28%), Gaps = 50/204 (24%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +++AA K L F+ ++D +GL+ + R + + ++ V R Sbjct: 112 SRLAAAKRVLAGFIG-------GRKDDRIGLVAFAGRPYPAAPLTSDHQWLQGVVDRLDT 164 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A+ L + +I +TDG NN Sbjct: 165 GAVEDGTALGDAILSGVNRL------------------RRRPAESRALILITDGRNNAG- 205 Query: 346 SNVNTIKICDKAKENFIKIVTISIN----------------------ASPNGQRLLKTCV 383 AK I++ I I A + L Sbjct: 206 --AEPQLAAQAAKALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDAELDAATLKGVAE 263 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 + ++ +A L VF I + Sbjct: 264 LTGGRYFEAGDATVLSRVFAEIDR 287 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 5/105 (4%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENF 361 + K + +K+I+F+TDG+NNN S + T K CD AK Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 64 IEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIG 108 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 E+ + +NSD + N T + L + + Sbjct: 239 EQLCVSSPGSYTCACHEGFTLNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELV 298 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ I D L +GL+ Y++ + + K S + K T + Sbjct: 299 KKFINQIVDTLDVSDRLAQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMT 358 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K +F QK I TDG + ++ ++ Sbjct: 359 GAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 400 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P ++++ + ++ + + + + + Sbjct: 401 RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQKQIC 456 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V+ K + T + Sbjct: 67 FLSQVIESLDVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGL 126 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + + ++ Sbjct: 127 ALQFAITKALSDA----------EGGRARSPDISKVVIVVTDGRPQDS-----VRDVSER 171 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I++ I + + L+ S P E+ V + + + Sbjct: 172 ARASGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYNVIEK 215 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + T + Sbjct: 834 INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + T + N ++ +I +TDGE+++ +T Sbjct: 894 AKALKHANALFTEEHGSRIKQN------------VKQMLIVITDGESHDHDQLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I I + + Q+ L+ + V N D L VF + + M Sbjct: 939 LELRNKGITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 464 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGK 523 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 524 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 565 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I A+ + L+ E N D+L + + + + K Sbjct: 566 LRAEQITVHAVGIGAANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICAEK 620 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 792 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + LL + E + D L Sbjct: 1120 RYDVAD-----AVKTLKDLGICVLVLGIG-DVYKEHLLPI-TGNSEKIITFQDFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVKKRIIREICQ 1184 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + T + Sbjct: 834 INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + T + N ++ +I +TDGE+++ +T Sbjct: 894 AKALKHANALFTEEHGSRIKQN------------VKQMLIVITDGESHDHDQLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I I + + Q+ L+ + V N D L VF + + M Sbjct: 939 LELRNKGITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 464 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGK 523 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 524 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 565 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I A+ + L+ E N D+L + + + + K Sbjct: 566 LRAEQITVHAVGIGAANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICAEK 620 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 792 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + LL + E + D L Sbjct: 1120 RYDVAD-----AVKTLKDLGICVLVLGIG-DVYKEHLLPI-TGNSEKIITFQDFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVKKRIIREICQ 1184 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + T + Sbjct: 779 INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYT 838 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + T + N ++ +I +TDGE+++ +T Sbjct: 839 AKALKHANALFTEEHGSRIKQN------------VKQMLIVITDGESHDHDQLNDT---A 883 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I I + + Q+ L+ + V N D L VF + + M Sbjct: 884 LELRNKGITIFAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 939 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 409 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGK 468 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 469 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 510 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I A+ + L+ E N D+L + + + + K Sbjct: 511 LRAEQITVHAVGIGAANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICAEK 565 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 600 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 659 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 660 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 700 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 701 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 737 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 909 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 963 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 964 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 1016 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1017 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 1064 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + LL + E + D L Sbjct: 1065 RYDVAD-----AVKTLKDLGICVLVLGIG-DVYKEHLLPI-TGNSEKIITFQDFDKLKNV 1117 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1118 DVKKRIIREICQ 1129 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 E+ + +NSD + N T + L + + Sbjct: 195 EQLCVSSPGSYTCACHEGFTLNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELV 254 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ I D L +GL+ Y++ + + K S + K T + Sbjct: 255 KKFINQIVDTLDVSDRLAQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMT 314 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K +F QK I TDG + ++ ++ Sbjct: 315 GAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 356 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P ++++ + ++ + + + + + Sbjct: 357 RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQKQIC 412 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V+ K + T + Sbjct: 23 FLSQVIESLDVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGL 82 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + + ++ Sbjct: 83 ALQFAITKALSDA----------EGGRARSPDISKVVIVVTDGRPQDS-----VRDVSER 127 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I++ I + + L+ S P E+ V + + + Sbjct: 128 ARASGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYNVIEK 171 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 E+ + +NSD + N T + L + + Sbjct: 239 EQLCVSSPGSYTCACHEGFTLNSDGKTCNVCRGGGSGSATDLVFLIDGSKSVRPENFELV 298 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ I D L +GL+ Y++ + + K S + K T + Sbjct: 299 KKFINQIVDTLDVSDRLAQVGLVQYSSSIRQEFPLGRFHSKKDIKARVRNMSYMEKGTMT 358 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K +F QK I TDG + ++ ++ Sbjct: 359 GAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 400 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P ++++ + ++ + + + + + Sbjct: 401 RKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQKQIC 456 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V+ K + T + Sbjct: 67 FLSQVIESLDVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGL 126 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + + ++ Sbjct: 127 ALQFAITKALSDA----------EGGRARSPDISKVVIVVTDGRPQDS-----VRDVSER 171 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I++ I + + L+ S P E+ V + + + Sbjct: 172 ARASGIELFAIGLGRVD--KATLRQIASEPQDEHVDYVESYNVIEK 215 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 71/230 (30%), Gaps = 44/230 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + +DLS SM M+ + ++ K + F+ + Sbjct: 122 DIVIALDLSDSMSNPMDGR----------RGLGLDRLTVAKQVIDEFIRR-------RPH 164 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 + L+G+ + +R + + ++ T + + L + Sbjct: 165 DRIALVGFGAHASTIAPLTLDHAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRLKESQA 224 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K I+ LTDG +N + + A E + I T+ + Sbjct: 225 ------------------ATKIIVLLTDGVHNADGMD--PDTVAQTAAERGVVIYTVLMG 264 Query: 371 ------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +S + +L + ++ Y Y + +L FQ++ + Sbjct: 265 QQTGDRSSVDAGQLERLAGATDGYAYLAEDTQTLETSFQDLLDKLEKSSI 314 >gi|260426558|ref|ZP_05780537.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] gi|260421050|gb|EEX14301.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] Length = 1088 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/402 (10%), Positives = 98/402 (24%), Gaps = 53/402 (13%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL-------SRLGDRFESISNHAKR 88 ++ + + +A + A + G+ ++S + + G Sbjct: 279 AAVISLFGTRTDMAEWQITAAVDGAAIGGSQGVISVVESWTLQFNADGSLASVSEASPSM 338 Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 + + T I + S N D + Sbjct: 339 NVTGLTTGAGDLTVALDFGTIGGTDGLTTGVGITSVSDTHDGTSQCNAADCAQRWNAGTG 398 Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM- 207 T+ + + + PAL + V D+ GS M Sbjct: 399 RFETTQQTLVNGGLSGWDTLVRNYPFTTAPALPAEAPPATCFRAVSFVDDVVGSDQIMMI 458 Query: 208 --------------------NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 N +D + D +++ ++ A F+D Sbjct: 459 VDRSGSMSWSSNSGQAEVCLNGLDDDNDGTVDEGDCADSRIEFVRAAGRAFVDL-----Q 513 Query: 248 VKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + +GL+ + + + Y T A Sbjct: 514 TSQGIDLGLLEFNEGNTLLRPIDTLNAGNAQDYKDAIDALSPGGDTAIGDAFD------- 566 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + ++ + + LTDG N + ++ + ++I Sbjct: 567 ---------ASTGEFTRVAEVGRVRTAYLLTDGFNTAGGD---PVAAAERLDDIGVRIHA 614 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + + L + Y N ++L +F ++ Sbjct: 615 IPAGNDVDREELTDIASGTGGQVYEARNVNALTGIFAELAGS 656 >gi|254776723|ref|ZP_05218239.1| hypothetical protein MaviaA2_18931 [Mycobacterium avium subsp. avium ATCC 25291] Length = 309 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 67/242 (27%), Gaps = 54/242 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R +I LV+D+S SM ++ A + A F + Sbjct: 68 PRNRAVIMLVIDMSQSMRATDVE---------------PNRLKAAEQAASQFASQLTPG- 111 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + +GL+G+ + P+ + + + T + A+ A ++ Sbjct: 112 -----INLGLVGFAGTPYLLVPPTP---QHQATIDALKKLDFADSTATGEAIFTALHAIS 163 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFI 362 + T I+ L+DG N + + AK+ + Sbjct: 164 ATAVAGGDTPP------------PARIVLLSDGGENKPSNPSDPHDGVYTAARLAKDEGV 211 Query: 363 KIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I TI+ + ++ S Y N L + I Sbjct: 212 PISTITFGTKGGEIEMDGQKVAVPVSTDQMKMVAKLSGGQSYTATNLGELQKSYNAIENE 271 Query: 409 MV 410 + Sbjct: 272 IG 273 >gi|291299992|ref|YP_003511270.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290569212|gb|ADD42177.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 316 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 71/212 (33%), Gaps = 38/212 (17%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D + +A+K A L F+D + ++ +GL+ + + P+ +V+ + Sbjct: 103 DVSPDRFSAMKKASLEFVDEL------PKNYNLGLVTFAKSASVAVSPTKDRNQVKSAIK 156 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A Q + S S P I+ L+DG Sbjct: 157 SMKLDRA---TAIGEGIFSALQAIQS------------VPPDGASEPAPARILLLSDGYR 201 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP--------------NGQRLLKTCVSSPE 387 + + AK + + TI+ + + L +T ++ Sbjct: 202 TSGR---LVEDGAKAAKAAKVPVSTIAFGTDTGTVEIEGETQEVPVDRETLSQTAETTGG 258 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 Y + D L V++++ + HR +V + Sbjct: 259 KFYEAASVDDLKGVYEDMGSSIGHRTVAVEIA 290 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 35/387 (9%), Positives = 101/387 (26%), Gaps = 39/387 (10%) Query: 35 LLIGFLIY------VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKR 88 + G I + + S + ++ +S +++ + E Sbjct: 918 VATGAAIDFVRQNSYTRGNGDRT---SVPDLLVVVTSSASTDDVASAQETAEKEGITIYT 974 Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM----THMANNRLDSSNNTI 144 + ++ F + + + + + ++ Sbjct: 975 VGVTNSVSFAELTSTAGSFSRVLRANDFSDLSAIRQPLHETICQAAFCGDPGTPANGFQQ 1034 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + T + F LL+ N + + + + Sbjct: 1035 GTYFEGNT-VTFGCNF-GFLLSGTDNTQCQADGSWSNPLPVCIAITTPAP--TQAPACND 1090 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + V +K + + + + ++ Y+ + Sbjct: 1091 FPLFNGTDLVFLLDGSGSVGSNNFDLVKTFTKNVVQNF---DISETATRVAVVQYSDQFS 1147 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + K Y D S + T + A+ Q + + Sbjct: 1148 TEFSLNAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGERDG--------- 1198 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + ++ +TDG + + S D A+ + I + + + + L + Sbjct: 1199 ----YPDLLVVVTDGLSTDDVSGP-----ADTARAQGVTIYAVGVGSDIDFNTLEQIAGL 1249 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVH 411 + V + SL+ + Q +SQ + Sbjct: 1250 T-SRVSQVSDFSSLVTLSQTLSQDICT 1275 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/369 (10%), Positives = 81/369 (21%), Gaps = 44/369 (11%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--TEIQNIVN 123 S S R + ++ A +++ + + + + + Sbjct: 129 SADSVTSSAATAREQGVTMFAVGVGTGVGLSELQDIAGYTDRVLQLNDFVQLAQSADTIQ 188 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMT------SYDYRLQFIEHLLNQRYNQKIVSFI 177 S+ + + + + + T + S Sbjct: 189 STLCGLAYCGDPGAPVNGYRRGSFFEGTTITFGCNDGYLLV---GSTNATCRGDGTWSHP 245 Query: 178 PALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT--------- 226 P + L S + C N E+ C + Sbjct: 246 LPTCITPCNPNPCLNNGVCLASGNSYTCACTSNYQGENCQYYTPCLVRNPEFDLIFLLDE 305 Query: 227 ----KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K + +G++ Y+ + T+K Sbjct: 306 SGSIGTDNFKLVKSFTERMANNFDISPNSTRVGVVQYSNFPGTEFSLNAFTDKAAVLDAI 365 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + A+ F N V +I +TDG N Sbjct: 366 SKIDYNGGSTFTGAAI-------------DFVRNNEFTSVNGDRDDVPNILIVITDGNPN 412 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + S A I + I ++ + L++ P + L V Sbjct: 413 DDVSGP-----AISANNAGITTYAVGIGSNVDQANLVQMTAGRPGRVLQAADFTDLTTVV 467 Query: 403 QNISQLMVH 411 + + + Sbjct: 468 GTLQENVCD 476 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 33/341 (9%), Positives = 87/341 (25%), Gaps = 33/341 (9%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + N A ID + + + +G ++ + NN Sbjct: 2117 TTPAPAERNTLTGAAIDFVRTSTFSTPAGNRAGQPDFLIVVTDGLSQDNVAVPAQTARNN 2176 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + I +D + + ++ + E+ ++ Sbjct: 2177 GISIFAVGIGSEVDA--------DTLLQIAGTPSRTLQIND--FAGLVNAEEQLASIVCN 2226 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQ---DKKRTKMAALKNALLLFL-DSIDLLSHVKED 251 + + + + D + + + F D + + Sbjct: 2227 TITTTPPATTPTPAPSCSSITPVDLVFLLDGSSSITSPNFQIVKDFTADVVRTFNVSSAA 2286 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ Y+ + + K + T + A+ Sbjct: 2287 TNVGLVQYSDTIRTEFFLNSFDTKSGVLNAIGNIGYLQGNTRTGAAI------------- 2333 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F ++I +TDG + + A+ + I I + I + Sbjct: 2334 DFVRISSFSVPAGNRGNQPDYLIVVTDGLSQDDVVVP-----AQTARNDGISIFAVGIGS 2388 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + LL SP + + L + + ++ ++ + Sbjct: 2389 EIDFATLL-NIAGSPNRILQINDFAGLANAQEQLTDIVCNL 2428 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 46/183 (25%), Gaps = 19/183 (10%) Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGY 259 + + D + A + + F + + D +G++ Y Sbjct: 1365 VCQRYSPCYNRNIELDLVFLLDGSGSVTTANFDIVKEFTRRLANNFDISLADTRVGVVQY 1424 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + T++ A+ F Sbjct: 1425 SDSPTLEFNLNSFNTNELVDLAIRNIQYQQGGTNTGQAI-------------DFVRVNSF 1471 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 +I +TDG +S+ + + A+ I + + I + LL Sbjct: 1472 SANNGDRSDVPNVMIVVTDG-----QSSDDVVGPAQTARNAGISMYAVGIGNGVDTNELL 1526 Query: 380 KTC 382 + Sbjct: 1527 QIA 1529 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 35/385 (9%), Positives = 94/385 (24%), Gaps = 34/385 (8%) Query: 37 IGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 G I + + + ++ + + + ++ N Sbjct: 1459 TGQAIDFVRVNSFSANNGDRSDVPNVMIVVTDGQSSDDVVGPAQTARNAGISMYAVGIGN 1518 Query: 97 FIKNHI------KESLSGYSAVFYNTEIQNIVNSSRI-SMTHMANNRLDSSNNTIFYNMD 149 + + + SA F + + + + + + Sbjct: 1519 GVDTNELLQIAGQVDRVVQSADFSTLFSVGENLQELVCDAGWCGDPGVPVNGFRLGSYFE 1578 Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 +LN + + + + Sbjct: 1579 GH--VVTFGCNAGFILNGASSTICQGDGSWSNAVPVCVAITTPSP---TRAPVCTDLSFG 1633 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + V +K+ + + +G++ ++ + Sbjct: 1634 GLDLVFLLDGSGSVTAVNFDLVKDFASGVVSEFQ---ISTTETRVGVVQFSDTLRTEFFM 1690 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 S + K + I T + A+ A S + F Sbjct: 1691 SSFSTKQQVLQAISDIDYIQGNTLTGAAITFATASSFSTPAGN-------------RANF 1737 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 F+I +TDG + + ++ A++ I I + + + LL+ PEY Sbjct: 1738 PDFMIVVTDGLSQDS-----VVQPAQSARDQGITIFAVGVGNEVDFATLLQI-TGVPEYI 1791 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 V + L+ ++++ + Y Sbjct: 1792 LQVTDFSDLLAAQLQVAEIACNLTY 1816 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 53/187 (28%), Gaps = 22/187 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + + ++ S +GL+ Y+ + + K Sbjct: 2525 FQIVKDFTADVVRTFNVSSAATN---VGLVQYSDTIRTEFFLNSFDTKSEVLNAIGNIGY 2581 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A+ F ++I +TDG + + Sbjct: 2582 LQGNTRTGAAI-------------DFVRISSFSVPAGNRGNQPDYLIVVTDGLSQD---- 2624 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A+ I I + I + LL SP + + L ++ Sbjct: 2625 -EVLGPAQTARFEGINIFAVGIGNEIDFTTLLHI-AGSPNRVLQINDFAGLASATGQLTD 2682 Query: 408 LMVHRKY 414 ++ + K Sbjct: 2683 IVCNCKL 2689 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 19/217 (8%), Positives = 59/217 (27%), Gaps = 25/217 (11%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + + C + V + +K+ D+ + + Y Sbjct: 3 TPAPECTQFGGVDLVFLLDGSASVGASNFELVKDFTQQTTAKFDISDGS---TRVAVAQY 59 Query: 260 TTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 ++ + + + + + + + T + A++ Q S + Sbjct: 60 SSTPQVEFNLNTNSDVDTLSNAIEQITY-MNGDSTFTGFAIEFVRQSAFSSFNGARDDK- 117 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 ++ +TDG++ + ++ A+E + + + + G Sbjct: 118 ------------PDIMVVVTDGQSADSVTSS-----AATAREQGVTMFAVGVGTGV-GLS 159 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 L+ + + + L I + Y Sbjct: 160 ELQDIAGYTDRVLQLNDFVQLAQSADTIQSTLCGLAY 196 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 51/203 (25%), Gaps = 21/203 (10%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + V +K + + +GL+ Y+ Sbjct: 546 CANLAYPGADLVFLLDGSGSIGTDNFQLVKAFTKEVIRNFA---ISPTATRVGLLQYSDT 602 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++ + + Y D T + A++ QI + Sbjct: 603 IDNEFFMNEFNTRDELYTAVDNVVYKTGGTFTGFAVEFTRQIAFRTSAGTRDN------- 655 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + +I +TDG S A + I I + + ++ + LL+ Sbjct: 656 ------YPDILIVVTDG-----NSEDVVTSAVASAIDQGILIYAVGVGSNVDFASLLELT 704 Query: 383 VSSPEYHYNVVNADSLIHVFQNI 405 V + L Q + Sbjct: 705 GGVNSRVLQVSDFTGLSTAAQTL 727 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 40/112 (35%), Gaps = 7/112 (6%) Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 LT T+ F L ++ +TDG + + + A++N I Sbjct: 1955 NTLTGAAIDFVRTSSFSIPAG-NRLTLPDVLVVVTDGLSQDDVAGP-----AQIARDNGI 2008 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 I + I + + LL + + SL+ + +++++ + Y Sbjct: 2009 AIYAVGIGSEVDFATLL-DIAGLQSRVLQINDFSSLLDAEEQLTEIVCNISY 2059 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 38/413 (9%), Positives = 93/413 (22%), Gaps = 62/413 (15%) Query: 35 LLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN---LSRLGDRFESISNHAKRALI 91 GF + + S AG + + G+ + +++ A+ Sbjct: 632 TFTGFAVEFTRQIAFRTS----------AGTRDNYPDILIVVTDGNSEDVVTSAVASAID 681 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY----- 146 + ++ S + A L + Y Sbjct: 682 QGILIYAVGVGSNVDFASLLELTGGVNSRVLQVSDFTGLSTAAQTLPTVLCAAAYCGDPG 741 Query: 147 -NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + + + S + L P+ + D + H Sbjct: 742 APTNGYRRGTFFVDSVVTFGCNDGYLLQGSSNTSCLGTGQWSNPVPVCLDRCDPNPCFHG 801 Query: 206 AMNSDPEDVNSAPICQDKKRTKMA---------------------------ALKNALLLF 238 A S + + + Sbjct: 802 AACSVQGSSFVCVCPPNYEGELCQFYTPCLNRSVEFDLVFLVDKSSSVGPANFELVKEFM 861 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 D + S D +G + + K+ + K Q + + + A Sbjct: 862 YDFTNTFSVGLSDTRIGAVQFADAQTKDFDMDTFATK-EQTLAGIQNIVYTDNQVGGVAT 920 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + + ++ +T + + ++ + A+ Sbjct: 921 GAAIDFVRQNSYTRGNG---------DRTSVPDLLVVVTSSASTDDVASAQ-----ETAE 966 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + I I T+ + S + L T S + L + Q + + + Sbjct: 967 KEGITIYTVGVTNSVSFAELTST-AGSFSRVLRANDFSDLSAIRQPLHETICQ 1018 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 57/382 (14%), Positives = 112/382 (29%), Gaps = 51/382 (13%) Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 A +N S S + N + Sbjct: 120 ADEPVIELATPSEAQPATASATPTQKPTGTENPTTVERSVQSDDDDADTVANQNEAKDDE 179 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ--KIVSFIPALLRIEMG 186 + A+ + + + +S + + + Q ++ I Sbjct: 180 TKNNADKTVRLGIASYRGMLKSASSGLSTPEHTKSIEYQGNGAYILKLNVIGKDASTSTT 239 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD----SI 242 + I LV+D+SGSM+ +K++ALK A+ FLD + Sbjct: 240 DTTPIDIALVLDVSGSMNDDFGGR------------GSPSKISALKTAVNSFLDETAKTN 287 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQA 301 D + + V + L+ Y ++ RQ T + ++ + T +K + Sbjct: 288 DTIEDDNDKVKVALVKYANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTS 347 Query: 302 YQILTSDKKRSFFT--NFFRQGVKIPSLPFQKFIIFLTDGENN-----NFKSNVNTIKIC 354 L + +Q + +K++IF TDGE N + IK Sbjct: 348 VNGLQAAGATYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDGDVANAAIKKS 407 Query: 355 DKAKENFIKIVTISINASPNGQR-----------LLKTCV--------------SSPEYH 389 + K + +I I N + +S +Y+ Sbjct: 408 QELKNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYWNLGDRASGDYY 467 Query: 390 YNVVNADSLIHVFQNISQLMVH 411 Y+ +A L +F +I + + Sbjct: 468 YSASSATQLAQIFNDIQKTITE 489 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 60/207 (28%), Gaps = 30/207 (14%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + S + V + + K+ +K A+ + + + + ++ + Sbjct: 16 TGGDSTSDRSGLDLVAVLDVSGSMQGEKIEKMKTAMKFVVKKLSSID------RLSIVTF 69 Query: 260 TTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + TE + Q + T+ + ++ ++L K S Sbjct: 70 LDTANRICPLRQVTEDSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG- 128 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ ++DG+ N + K + + T A + L Sbjct: 129 --------------VMLMSDGQQNRGEP-------AANVKIGNVPVYTFGFGADYDPTVL 167 Query: 379 LKTCVSS-PEYHYNVVNADSLIHVFQN 404 +S V + + L F Sbjct: 168 NAVARNSMGGTFSVVNDVNLLSMAFSQ 194 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 37/386 (9%), Positives = 95/386 (24%), Gaps = 53/386 (13%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESIS----NHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + A + ++ + + + + D + I + S Sbjct: 2227 ADVANILVVITDGQSVDDVTGPAQDLLREGVTVYALGIGDGIQYSTLEAIAQDQSRVLQA 2286 Query: 113 FYNTEIQNIVNSSRISMT--------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH- 163 T + N + + S+ + + + L Sbjct: 2287 NTFTNLSNTAQALQESLGDARYCGTPGTPQDGYTYGLFFEGSTVTYSCEVGFTLDGASSA 2346 Query: 164 --LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + N P + + + C + + S CQ Sbjct: 2347 QCINGTWDNSPPTCIPPDPCLANPCYNGGTCVRTGLTTDFTCTCPSDYTGDTCQSYTACQ 2406 Query: 222 DKKRTK----------------MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + +K +D+ + D +G++ Y++ Sbjct: 2407 GRSLDFDLALLLDGSGSVGADNFNLVKQFAKRLVDNF---EISQTDTKVGVVQYSSSSNV 2463 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + K + + T++ A+ Q + + Sbjct: 2464 EFYLNAFSTKQAVLDAINAVTYQQGGTNTGAAITYTMQEIFASANG-------------A 2510 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 + +I +TDGE+++ + A+ I + + N LL+ + Sbjct: 2511 RANYPDVLIVVTDGESSDDVAVPAL-----SARNAGTLIYAVGVGNGVNQATLLQI-AGN 2564 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH 411 + L V Q++ Q + Sbjct: 2565 AGQVLQAADFAGLTTVVQSLQQNLCD 2590 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 47/184 (25%), Gaps = 22/184 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK +G++ Y+ + + + + S Sbjct: 1605 FDLLKT---FTTRIATNFDVSTNLTRVGVVQYSDQTNSEFVLNTFSTEAEVLAAIAAISY 1661 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ Q + +I LTDG +++ S Sbjct: 1662 QNGGTSTGAALDYVRQNVFISASG-------------DRPDAANILIVLTDGVSSDDVSF 1708 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A+ I I ++ I + L+ P SL + + + Sbjct: 1709 PAM-----AARNAGITIYSVGIGDGVDYNT-LQQIAGDPNKVLQATGFSSLDDIGGQLEE 1762 Query: 408 LMVH 411 L+ Sbjct: 1763 LVCD 1766 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 56/210 (26%), Gaps = 22/210 (10%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + V +KN L D + Sbjct: 1822 CMAPTTPPPPGCDELSFGGWDLVFLLDGSGSVGSNNFLNVKNFTKLITDLFPVGD---NA 1878 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ ++ ++K + K D S + T + A+ Q+ + Sbjct: 1879 TKVGLVQFSDTIQKEFDLRDYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFNTING 1938 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I LTDGE S A++ I I I + Sbjct: 1939 NRGS-------------HPDMLIVLTDGE-----SFDPVTFASQSARDQGITIFAIGVGT 1980 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + L+ P+ V + L V Sbjct: 1981 GVDYAT-LEEIAGDPQKVQQVTDFADLTSV 2009 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 48/180 (26%), Gaps = 20/180 (11%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 N L F +I + +G++ Y+ + + S Sbjct: 2716 NLLKAFTQNIVGNFDIAVNNTRVGVVQYSDFNNIEFNLNAYATEAEVLAAIGAISYQRGG 2775 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ F G ++ LTDGE+++ + Sbjct: 2776 TFTGAAIDF-----------VRQDVFTTAGGNRADK--PDILLVLTDGESSDSVAGP--- 2819 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I + I + N L+ P V + L + + + + Sbjct: 2820 --AQNTLNAGITIYAVGIGSGVNADT-LQEIAGDPGRVLQVADFQGLAAITNQLQEALCS 2876 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 85/338 (25%), Gaps = 35/338 (10%) Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + IS A A F++ ++G + + + S+ + A + Sbjct: 629 ALDGISYLAGGTFTGFALDFVQQSAFSPVAGARDGYPDILVVVTDGVSQDDVVAPAESAR 688 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + Y LQ + +I+ + + I Sbjct: 689 KEGIAVFAVGIGSAVDYATLLQIAG------IDGRILQINNFVDLASASQTLPDYI---- 738 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + + + + V +K+ + +G++ Sbjct: 739 -CNYASCQGRDVPLDIVFLLDGSGSVGSANFDLVKDFTRTLARNF---DIAANMTQIGVV 794 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y+ V + + + S T + A+ Q + Sbjct: 795 QYSDTVNREFGL-GDFHNRQDVLNAISAVSYQQGGTLTGAAIDFVRQTSFTTGDG----- 848 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +I +TDG + + D A+ I + I + Sbjct: 849 --------DRPDVPNMLIVVTDGVSGDS-----VQGPADAARREGITTFGVGIGNGIDFG 895 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 LL+ + +L V Q + +++ +Y Sbjct: 896 TLLEI-AGDSARVLQADDFGALATVAQRLQEVVCDLEY 932 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 60/227 (26%), Gaps = 26/227 (11%) Query: 161 IEHLL-----NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + N + + + GE + + V Sbjct: 965 SGSVGATCTNGTWDNSVPMCIGEFVSLQDKGEAITTTPPPTTPRAPVCANFPYGGLDLVF 1024 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 T +K+ + + ++ + +G++ Y+ V S K Sbjct: 1025 LLDGSGSVGTTNFELVKDFTSEVVLNFNISADTTN---VGVVQYSDTVRNEFFLSSYDTK 1081 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + S + T + A+ Q S F ++ Sbjct: 1082 LPLIDAINQISYLTGGTLTGFAIDYVRQSSFSRPAG-------------ARNTFPDVLVV 1128 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 LTDG++ + + A+ I I + I + + LL+ Sbjct: 1129 LTDGQSQDDVVSS-----AAAARSQGITIFAVGIGSEVDFTTLLQIS 1170 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 48/184 (26%), Gaps = 22/184 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +G+ Y+ + + K + S Sbjct: 1319 FELVKSFTYSVSRNF---DVSPNATRIGVAQYSDTNSLEFNLNRYSTKDEVLNAVNGISY 1375 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ Q + + ++ TDGE S+ Sbjct: 1376 QGGGTYTGAALDFVRQTMMVESAG-------------DRTMSPNILVVATDGE-----SS 1417 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + + + I A + LL + SL + + + + Sbjct: 1418 DDQRTPAEVLRNAGTLVYAVGIGAGVSSTTLL-DIAGYNSRVLQATDFASLEVIGRELQE 1476 Query: 408 LMVH 411 + + Sbjct: 1477 FICN 1480 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 54/193 (27%), Gaps = 28/193 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKED---------VYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +K+ + S ++ +G + Y+ V S Sbjct: 567 FELVKDFAERVARHFTISSSRNDNMSYRSFTAATNVGAVQYSDTVRSEFFLSSFDTDFEV 626 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 D S + T + A+ F V + ++ +TD Sbjct: 627 VRALDGISYLAGGTFTGFAL-------------DFVQQSAFSPVAGARDGYPDILVVVTD 673 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + + + A++ I + + I ++ + LL+ + N L Sbjct: 674 GVSQDD-----VVAPAESARKEGIAVFAVGIGSAVDYATLLQIAGID-GRILQINNFVDL 727 Query: 399 IHVFQNISQLMVH 411 Q + + + Sbjct: 728 ASASQTLPDYICN 740 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 49/184 (26%), Gaps = 22/184 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK L + +G++ Y+T V E + + Sbjct: 370 FDLLKTFLQNIT---MKFDVSSDITRIGVVQYSTDVNTEFELKTYATEAEVIHAISNITR 426 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + +F ++ +TDG + + + Sbjct: 427 QRGSTFIGAGI-------------NFVRTNSFTVAAGDRPLAPNILVTITDGISADDVAG 473 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A++ I +I I LL + +NV + L + +++ Sbjct: 474 P-----AQAARDQGILTYSIGIGEEIQWPTLLSI-AGARHRVFNVTSFSELPGIEASLTA 527 Query: 408 LMVH 411 L+ Sbjct: 528 LLCE 531 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 39/156 (25%), Gaps = 19/156 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +G++ Y+T + + + + S T + + Sbjct: 166 FEVSSSQTRVGVVQYSTGINTEFDLNSFATEAEVINAIRGLSHQRGSTFTGAGIT----- 220 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 F G +I +TDG + + + A+ + I Sbjct: 221 --------FTRLESFTGASGDRPDAPNVLIVITDGISADSVD-----APAEAARADNITT 267 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +I I N LL NV L Sbjct: 268 YSIGIGDEINYLTLLSIAGMRE-RVLNVTTFGDLND 302 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 32/262 (12%), Positives = 84/262 (32%), Gaps = 51/262 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEM-GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + + +P + + +RP + V+D SGSM Sbjct: 383 AGGYRDTPVEAALPVYMDVRDREQRPDLALVFVIDRSGSMAE---------------PAG 427 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K AL+ + + +G++ + ++ + + G + + + Sbjct: 428 NVQKLDIAKEALVQAIRMLYGED------RVGIVTFDSQAYTTMPITQGVGE-EEVLQAI 480 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ + ++LT + + K +I LTDG Sbjct: 481 ASVTADGGTNIGAGLSAGQRMLTGVEAKI------------------KHMILLTDGWGEG 522 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF- 402 + + + + + I + ++ + + L + +Y ++ + Sbjct: 523 ----NDQLAVVEAMRAQGITLSVVAAGSD-TAEELKTLATAGGGRYYAAAIMQAVPQILV 577 Query: 403 -QNI---SQLMVHRKYSVILKG 420 + I +L+V R++ + G Sbjct: 578 DETITVAGKLIVERRFVPVATG 599 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/256 (15%), Positives = 78/256 (30%), Gaps = 54/256 (21%) Query: 178 PALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALK 232 P I +D+SGSM + VN I + ++ K Sbjct: 73 PRTGEENSYSYKDSVDIVFSLDISGSMSSYDQPEDLAVNRRVIAEAINNKELHPRLHYAK 132 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP- 291 ++ F+D ++ +GL+ + P+ E ++ + + Sbjct: 133 KSIADFIDK-------RKSDRLGLVVFGAEAYSVCPPTNDHEYLQNRLKEISTEYLGDYN 185 Query: 292 --TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ T A+ L S +K II +TDG + SN+ Sbjct: 186 RQTNITAAISGGLARL------------------RKSKAPKKIIILVTDGSHTA-NSNLT 226 Query: 350 TIKICDKAKENFIKIVTISIN--------------------ASPNGQRLLKTCVSSPEYH 389 A ++ I TI + + + + L + + + Sbjct: 227 PRMAAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEELLKEIAEKTGGLY 286 Query: 390 YNVVNADSLIHVFQNI 405 ++V A+ + V + I Sbjct: 287 FSVREAEQMKDVLKKI 302 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/269 (13%), Positives = 77/269 (28%), Gaps = 47/269 (17%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + ++ IP +L + I + + S + S + Sbjct: 39 SMAHTLKSKYYAKDIPFILLMLGLTFSIIGLARPATVDSSANIN---GEGIYISMVVDIS 95 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 L L ++ + + L+ + R ++ + + + Sbjct: 96 PSMMAEDMLPTRLEASKKTMADFIKKRNFDKISLVAFALRASVLSPSTFDYTSLEKEIGN 155 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + A +L S +K II LTDGENN Sbjct: 156 IKI-DEEGSTSIGLGIATAVDMLRS-----------------VKDDAEKVIILLTDGENN 197 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP------------------------NGQRL 378 + + + + A IKI TI I + N + L Sbjct: 198 SGEID--PKLASEIASNFNIKIYTIGIGDAAGSHAWVTYTDPNYGKRRIRADFTLNEKAL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ + ++N + +L +V+ I + Sbjct: 256 IEIASITGGKYFNAKTSSALDNVYNTIDR 284 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 70/173 (40%), Gaps = 19/173 (10%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSL-ILKPTDST 295 F +++L+ ++ ++ + + EK I S T++ R+ V+ + D T+ Sbjct: 136 FSAALNLIDNMDKNNRFSMYKFDDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTNMR 195 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+++AY+ + S + + +I L+DG + + + + Sbjct: 196 DALEKAYEEIKSSETKDKNA----------------MVIMLSDGGDT-YDLSKKFDETLK 238 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 KE I I TI ++ N L + S +YNV L +VF I + Sbjct: 239 PFKEKNISIYTIGMSNGNNFSMLKEIAKESGGNYYNVKEIKDLKNVFNKIYRD 291 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/382 (12%), Positives = 113/382 (29%), Gaps = 41/382 (10%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 H K S+ A + + +I+ A+ + + Sbjct: 95 RAHGSKASLLRAVRR-----IQPLSTGTMTGLAIQFAITKAFSNTEGGRARSPDISKVVI 149 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 ++ ++ ++ I + + R+D + + DY + Sbjct: 150 VVTDGRPQDSVRDVSARARANGIELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIE 209 Query: 165 LNQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + Q+ + L + + + +NSD + N Sbjct: 210 KLSKKFQEAFCVVSDLCATGDHDCEQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGG 269 Query: 223 KKRTKMAALKNA-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 T + L + + F++ I D L + +GL+ Y++ V + Sbjct: 270 SSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLG 329 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K S + K T + A+K +F Q Sbjct: 330 RFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQ 376 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EY 388 K I TDG + ++ +N KAK+ K+ + + + L+ S P E+ Sbjct: 377 KVGIVFTDGRSQDYINN-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEH 429 Query: 389 HYNVVNADSLIHVFQNISQLMV 410 ++ + ++ + + + + + Sbjct: 430 YFYTADFKTINQIGKRLQKKIC 451 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G++ Y + V++ K + T + Sbjct: 62 FLSQVIESLDVGPNATRVGVVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGL 121 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A S+ +G + S K +I +TDG + +V+ Sbjct: 122 AIQFAITKAFSN----------TEGGRARSPDISKVVIVVTDGRPQDSVRDVSAR----- 166 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ N I++ I + + L+ S P E+ V + + Sbjct: 167 ARANGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYSVIEK 210 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 74/243 (30%), Gaps = 58/243 (23%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + I +D+S SM + +D + +++ A K L F Sbjct: 84 ITRAVTEASEKGIDIVFALDISESM---------------LEEDFEGSRLDAAKKIALRF 128 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPA 297 + + GL+ + + + + V + D++ K T A Sbjct: 129 IRE-------RPQDRFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAISDKGTAIGSA 181 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + L + + + ++ LTDGE+N+ + + + A Sbjct: 182 ILVGTNRLRASVSKE------------------RVLLLLTDGEHNSGEV--GPVTASEIA 221 Query: 358 KENFIKIVTISI------NASP---------NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + I+I I + + + Q L + ++ + +SL F Sbjct: 222 QSEGIRIYVIGVRNEEEAGSPESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAF 281 Query: 403 QNI 405 I Sbjct: 282 GEI 284 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 72/232 (31%), Gaps = 54/232 (23%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + L VDLSGSM + + ++ A K + FL + Sbjct: 98 MLLAVDLSGSMSEVDM----------MLGMQPVDRLTAAKAVIADFLQR-------RSGD 140 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL+ + R + VR+ + + L + T A+ A + L Sbjct: 141 RVGLLVFGQRAYMLTPLTLDLSAVREQLRDTVAGLAGRETALGDAIGLAVKRL------- 193 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 Q+ ++ LTDG N + +K + A +++ TI+ Sbjct: 194 -----------RTQPEGQRVLVLLTDGVNTTGV--LQPLKAAELAAAEQVRVYTIAFGGD 240 Query: 373 P-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + + +A+ L ++ + + Sbjct: 241 GGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFRAHDANQLAGIYAELER 292 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/353 (8%), Positives = 88/353 (24%), Gaps = 33/353 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R ++G+ ++ +II L++ + +G + + + ++ ++++ ++ Sbjct: 1 MSSAARHPKLTRRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + ++ T + Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAGHANFA---FFQQNAVQMQTD-------SNVTFSDS 110 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + N + ++ + L + + A+ Sbjct: 111 LTNPFLTKTAVANPANVK-YVKCTAQLSNIAHWFIEVLNT---IPGVQVANASQVAASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + S + + + AAL Sbjct: 167 ATVGAGQTTCAIPVFVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P G+ ++ R ++ T+ Sbjct: 218 -----DGSTNEPTIASELSGNTCNITSPPDLGSTGLKAASLRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 Y + + L F + DG Sbjct: 273 QPDFTGYAYVGPNYGPPGTPGIVGDAYTQFVLDRASFKAYQGDGTPPTGSGIS 325 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 78/227 (34%), Gaps = 46/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P+ + L++D+SGSM T++ A++ ++ F+ Sbjct: 92 QKPMRDVVLILDVSGSMAKNDVQGGA-------------TRLQAVQESVRKFVA------ 132 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ +GL+ + ++ S + ++ + + ++ + T A+ ++L Sbjct: 133 -ARQSDRIGLVIFASQAWPFAPVSEDKQALQTRINQLSPGMVGQQTAIGDALGVGVKLLD 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + K I LTDG + + A +++ T Sbjct: 192 ----------------NTTNTEASKLAILLTDGNDTASQLAPALAAQLAAAH--HVQVHT 233 Query: 367 ISINA-SPNGQR-----LLKTCV--SSPEYHYNVVNADSLIHVFQNI 405 I+ + G L++ + + + +L V++ I Sbjct: 234 IAFGDINSTGDDKVDLPLMQNIAQITGGQSWTAANSGAALDSVWKEI 280 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/277 (13%), Positives = 81/277 (29%), Gaps = 61/277 (22%) Query: 151 MTSYDYRLQFIEH--------LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 SY + + + I A + + + +VD+SGS Sbjct: 148 PYSYALPTDGSPFGVTTEVAPSPWNPHTRLLRIGIKASD-RAVADLAPANLVFLVDVSGS 206 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M +R + +K+ L L +D + V L+ Y Sbjct: 207 M-------------------DRREGLPLVKSTLKLLVDQLRDQDRVS------LVVYAGE 241 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++P+ G +KV T ++ AYQ+ Sbjct: 242 SRVVLKPTSGRDKV-TIRNAIDQLDAGGSTAGASGIELAYQMARESFIDKGIN------- 293 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 I+ TDG+ N S+ + ++ +++ + + T+ + L++ Sbjct: 294 ---------RILLATDGDFNVGVSDFDSLKQMAVDQRKSGVSLTTLGFGVDNYNEHLMEQ 344 Query: 382 CV-SSPEYHYNVVN--------ADSLIHVFQNISQLM 409 + + + N D L +++ + Sbjct: 345 LADAGDGNYAYIDNLLEARKVLVDQLSSTLAVVARDV 381 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 53/414 (12%), Positives = 113/414 (27%), Gaps = 69/414 (16%) Query: 28 LSVMSFLLLIGFLI----YVLDW-----HYKKNSME-SANNAAILAGASKMVSNLSRLGD 77 L+ L++ G + K+S++ A A ++ Sbjct: 11 LAASVLLVVAGCGVSSSPETTVAPPPAQTELKSSVQPEAVMADSAMAKRSALTAPIASFA 70 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + ++ + + +++ K S + + RL Sbjct: 71 PMPAGESYPQGYRDEQREQYAKLADNPIHSVTETPVSTFSADVDTGAYANVRRLLNQGRL 130 Query: 138 DSSNNTIFYNMDV--MTSYDYRLQFIEH--------LLNQRYNQKIVSFIPALLRIEMGE 187 M Y + + + I A + E Sbjct: 131 PPEGAVRLEEMVNYFPYDYALPSDGSPFGVTTELAASPWNPHTRLLRIGIKASD-RAVAE 189 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + +VD+SGSM +R + +K+ L L +D + Sbjct: 190 LAPANLVFLVDVSGSM-------------------DRREGLPLVKSTLKLLVDQLREQDR 230 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V L+ Y +EP+ G EK + T T ++ AYQ+ Sbjct: 231 VS------LVVYAGESSVVLEPTSGREKAKIR-TAIERLTAGGSTAGASGIELAYQMAQQ 283 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVT 366 I+ TDG+ N S+ + ++ ++ I + T Sbjct: 284 AFIPKGIN----------------RILLATDGDFNVGTSDFDSLKQMAVDKRKTGISLTT 327 Query: 367 ISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + L++ + + + N L + + + +V+ K Sbjct: 328 LGFGVDNYNEHLMEQLADAGDGNYAYIDN---LREARKVLVDQLGST-LAVVAK 377 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 74.9 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 46/148 (31%), Gaps = 4/148 (2%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA---SKMVSNLSR 74 E S+I A+ +++ L + + + + ++ +++ +A+ +A A ++ + + Sbjct: 11 EGGAISVIVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCARDATGVDC 70 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + AL +K + K + + S + Sbjct: 71 STTSALASGLANRNALDGMSKVHTIDLDKTTRKVSVTTSAKEVGGADNSVSLFFADALGI 130 Query: 135 NRLDSSNNTIFYNMDVMT-SYDYRLQFI 161 + M + L F Sbjct: 131 PTKEVGARASAVWGSPMAGRTAFPLAFS 158 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 74.9 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 31/245 (12%), Positives = 73/245 (29%), Gaps = 60/245 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + L +D SGSM + T++ A K + + + S+D Sbjct: 83 QTRDGVSVVLAIDDSGSMAANDY---------------QPTRLEAAKESASVLIKSLDPK 127 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + G++ + + S ++V + T + A + Sbjct: 128 DYA------GVVIFESGATTAAYLSPDKDRVMEKTAAIE--QKNGQTALGDGLALAVDM- 178 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +K ++ L+DG N + A +N +++ Sbjct: 179 -----------------ADSIPNQKKVVVLLSDGVGNAGVIS--PEDATAFAAQNKVQVF 219 Query: 366 TISINASP-----------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 T+ + + + L + +Y V+ +L ++ +++ Sbjct: 220 TVGLGSKSPVLLGTDPTGTPQYATLDEAALQSIAEKTGGTYYTSVDEQTLHQIYAGLNKE 279 Query: 409 MVHRK 413 +V K Sbjct: 280 IVREK 284 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 74.9 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 77/238 (32%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 ++ S +NSD + ++ T + L + + Sbjct: 234 QQICNSSPGSYTCSCRDGFTLNSDGKTCSACNGAAGGSATDLVFLIDGSKSVRPENFELV 293 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ I D L + +GL+ Y++ V + K S + K T + Sbjct: 294 KRFINQIVDSLDVSDKLAQVGLVQYSSSVRQEFPLGRYKTKKDIKAAVKKMSYMEKGTMT 353 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K F QK I TDG + ++ ++ Sbjct: 354 GAALKY-----------LIDNTFTISSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 395 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P E+++ + ++ + + + + + Sbjct: 396 KKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKKLQKKIC 451 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 54/166 (32%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G+I Y + V+ K + T + Sbjct: 62 FLSQVIESLDVGPNTTRVGVINYASAVKHEFPLKAHRSKASLLQAVRKIEPLSTGTMTGL 121 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A S+ +G ++ K I +TDG + +V+ Sbjct: 122 AIQFAINKAFSEV----------EGSRLKFPEISKVAIVVTDGRPQDGVKDVSAR----- 166 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 AK++ I++ I + + L+ S P ++ V + + Sbjct: 167 AKQSGIELFAIGVGRVD--KHTLRQIASEPLDDHVDYVESYSVIEK 210 >gi|13473814|ref|NP_105382.1| hypothetical protein mll4535 [Mesorhizobium loti MAFF303099] gi|14024565|dbj|BAB51168.1| mll4535 [Mesorhizobium loti MAFF303099] Length = 373 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 14/143 (9%), Positives = 47/143 (32%), Gaps = 7/143 (4%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 S + +F L ++ + +GF + +++ + A +AA+L+ + M + + Sbjct: 22 FARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAIQTRSNQQQALDAALLSITT-MDTTST 80 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + A + ++G + + + I M Sbjct: 81 LAQRQTALQDSFIANGGLGTA------TLNSFVAGTTTTPATGQASASFSMPTIFMKIAR 134 Query: 134 NNRLDSSNNTIFYNMDVMTSYDY 156 + + + + + + + Sbjct: 135 IDSVPIAVASAVSKPPALVNATF 157 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 76/238 (31%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 E+ + +NSD + N T + L + + Sbjct: 79 EQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENLELV 138 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F+ I D L + +GL+ Y++ V + K S + K T + Sbjct: 139 KKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMT 198 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K +F QK I TDG + ++ ++ Sbjct: 199 GAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 240 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P E+++ + ++ + + + + + Sbjct: 241 KKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKKLQKKIC 296 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 76/216 (35%), Gaps = 53/216 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ +C+D K +++ K L D + ++ +GL+ + I + Sbjct: 98 DISNSMLCEDVKPNRLSFAKQVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 +Q++ + + + T A++ A + + + K Sbjct: 151 LSAAKQFLADISPNMVTAQGTAIGAAIELASKSFSD------------------NKEIGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--------------SPNGQ- 376 II LTDGEN+ + I+ +A E I++ I + G Sbjct: 193 TIIVLTDGENHEGNA----IEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNP 248 Query: 377 -------RLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 ++ + S ++ +A +L+ ++ Sbjct: 249 VVTKFDEKMCRDIASAGEGTFFSGQSASALVRAIES 284 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 76/216 (35%), Gaps = 53/216 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ +C+D K +++ K L D + ++ +GL+ + I + Sbjct: 98 DISNSMLCEDVKPNRLSFAKQVLGKLFDGL-------QNDKVGLVVFAGNAYTQIPITTD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 +Q++ + + + T A++ A + + + K Sbjct: 151 LSAAKQFLADISPNMVTAQGTAIGAAIELASKSFSD------------------NKEIGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--------------SPNGQ- 376 II LTDGEN+ + I+ +A E I++ I + G Sbjct: 193 TIIVLTDGENHEGNA----IEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNP 248 Query: 377 -------RLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 ++ + S ++ +A +L+ ++ Sbjct: 249 VVTKFDEKMCRDIASAGEGTFFSGQSASALVRAIES 284 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 68/220 (30%), Gaps = 48/220 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM + +++ A+K F+ + Sbjct: 82 DIVLAIDLSGSMQKEDF----------QLDGQPISRLDAVKRTASRFVA-------ARRG 124 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R ++ + V + V + + T + + A + L + + Sbjct: 125 DRIGLVIFGDRAYFAQPLTFDVDAVARAVDEAQIGISGRATAISDGLGLAMKRLAASEA- 183 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ ++DG + + + A + I+I TI++ Sbjct: 184 -----------------PTRVVVLMSDGVDTSGNV--QAVDAARLAAGHGIRIHTIALGP 224 Query: 372 SP-----------NGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + L + S + V L Sbjct: 225 EDLENQPRSRDAVDTKTLREVAELSGGTAFRVRGMADLEA 264 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 49/382 (12%), Positives = 113/382 (29%), Gaps = 41/382 (10%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 H K S+ A + + +I+ A+ + + Sbjct: 80 RAHGSKASLLRAVRR-----IQPLSTGTMTGLAIQFAITKAFSNTEGGRARSPDISKVVI 134 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 ++ ++ ++ I + + R+D + + DY + Sbjct: 135 VVTDGRPQDSVRDVSARARANGIELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIE 194 Query: 165 LNQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + Q+ + L + + + +NSD + N Sbjct: 195 KLSKKFQEAFCVVSDLCATGDHDCEQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGG 254 Query: 223 KKRTKMAALKNA-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 T + L + + F++ I D L + +GL+ Y++ V + Sbjct: 255 SSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLG 314 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K S + K T + A+K +F Q Sbjct: 315 RFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPGAQ 361 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EY 388 K I TDG + ++ +N KAK+ K+ + + + L+ S P E+ Sbjct: 362 KVGIVFTDGRSQDYINN-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEH 414 Query: 389 HYNVVNADSLIHVFQNISQLMV 410 ++ + ++ + + + + + Sbjct: 415 YFYTADFKTINQIGKRLQKKIC 436 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G++ Y + V++ K + T + Sbjct: 47 FLSQVIESLDVGPNATRVGVVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGL 106 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A S+ +G + S K +I +TDG + +V+ Sbjct: 107 AIQFAITKAFSN----------TEGGRARSPDISKVVIVVTDGRPQDSVRDVSAR----- 151 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ N I++ I + + L+ S P E+ V + + Sbjct: 152 ARANGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYSVIEK 195 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/345 (10%), Positives = 94/345 (27%), Gaps = 55/345 (15%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 E N F + + + + + +I M + + Sbjct: 79 AETYKEIKENSFVAVAQQPVTTFSADVDRAAYANVRRIIGYGQIPPKDAVRIEEMVNYFD 138 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + V N ++ +I++ + P I Sbjct: 139 YDYPAPEEGSASPLRVSPELAPAP--------WNPNHLLLRIGLQAKKIDLAKAPPSNIV 190 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 ++D+SGSM K+ L+++ + L + + Sbjct: 191 FLIDVSGSMDEE-------------------NKLPLLQSSFKMLLGQLRPDD------KV 225 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ Y + + + +K + + + T ++ AY+ ++ Sbjct: 226 AIVTYANGTKVALPSTSVKDK-EKIIKVLDNLYASGGTSGGKGIQLAYEQAQKSFIKNGN 284 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASP 373 II TDG+ N + + K +K +E+ I + + Sbjct: 285 N----------------RIILATDGDFNIGINNTTDLEKFIEKQRESGIYMSVLGFGMGN 328 Query: 374 NGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +T + + N + + + + ++V Sbjct: 329 YRDDMAETIADKGNGNYAYIDN---ITEAKKVLVNELSGTLFAVA 370 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 74/201 (36%), Gaps = 19/201 (9%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +S+ I + R ++ +K+ +D + ++ ++T V E ++ Sbjct: 122 DSSSSITDENYRKEIDFVKS-------ILDYYYLHPNYTLVSILEFSTDVRVLQELTYDA 174 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 VR+ + D + T+ A+++A++IL + + Sbjct: 175 CDVRKAIDSD---RMSGLTNIAKAIEEAHRILKNSRSDIPDQIVLI----TDGFQTVHSS 227 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYN 391 I D ++ N I+ AK + I+I TI + A+ + L+ SSP Y Sbjct: 228 INCNDHPHDC---NAYAIEKARAAKADDIQIYTIGVGAASYYEDDLRQIASSPSDQYFSL 284 Query: 392 VVNADSLIHVFQNISQLMVHR 412 V + S+ V + + Sbjct: 285 VDDYSSIQTVREKLQNSTCPL 305 >gi|290987786|ref|XP_002676603.1| predicted protein [Naegleria gruberi] gi|284090206|gb|EFC43859.1| predicted protein [Naegleria gruberi] Length = 755 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 24/227 (10%), Positives = 72/227 (31%), Gaps = 30/227 (13%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G+R + ++D+SGSM + + ++ +++ +K+++ ++ ++ Sbjct: 126 GQRQPCNLVCILDVSGSM------GSSAEDLSSSNENTGFSRLDLVKHSVRTLIELMNEK 179 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDSTPAMKQAYQI 304 + LI ++ + + ++ + + T+ ++ + Sbjct: 180 D------QISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMES 233 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FI 362 ++ +I TDGE N + KE+ Sbjct: 234 SLNN---------------PLCAKTNTCLILFTDGEPNINPPRGIVPTLEKYIKEHPLNS 278 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I + S + L ++ + + + + F N+ + Sbjct: 279 TIHSFGFGYSLDSALLKDIAMNGSGAYSYIPDCSMVGTTFVNMMSNI 325 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 70/230 (30%), Gaps = 55/230 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 G+ I LV+D SGSM + K+A F+D + Sbjct: 636 NSGKTVAMDIALVIDSSGSMTWNDPKNLR-------------------KDAAKEFVDKLS 676 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +I + + + N + + + D T T + + + Sbjct: 677 SIDEA------AIIDFDSSSKINRNLTSNRTLLYSAID---DIDSSGGTSLTAGVSKGLE 727 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L+ +K +I LTDG+ KS +A + Sbjct: 728 ALSKSND-------------------KKIMILLTDGKGPYDKSLT------TQAINAGVT 762 Query: 364 IVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I TI + + + + L + +Y+ + F N+S + + Sbjct: 763 IYTIGLGTNNDIDQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSGDLGN 812 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 28/319 (8%), Positives = 83/319 (26%), Gaps = 38/319 (11%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + + ++ + I+ + + +S + V + Sbjct: 592 DDQVYDPVTCTVTAKTTHFSRYILLNKTAFEKIWDKDFAGTSVDN--SGKTVAMDIALVI 649 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + +I+ D S ++ + S+ + SA Sbjct: 650 DSSGSMTWNDPKNLRKDAAKEFVDKLSSIDEAAIIDF--DSSSKINRNLTSNRTLLYSAI 707 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D + ++ LS + M L+ Q Sbjct: 708 DDIDSSGG-----TSLTAGVSKGLEALSKSNDKKIMILLTDGKGPY-------DKSLTTQ 755 Query: 279 YVTRDMDSLILK-PTDST--------PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + T++ A + + + K +F + + Sbjct: 756 AINAGVTIYTIGLGTNNDIDQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSGDLGNKDS 815 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + + +N + + V+ K + K + I + + N + Sbjct: 816 DEDGLPDIVEKNLYWFNGVSLSKAAKEHKPSDIDLY---LGFDHNNK----------NTF 862 Query: 390 YNVVNADSLIHVFQNISQL 408 Y ++ ++ ++ IS+ Sbjct: 863 YKYLSDGDIVVEYKQISKD 881 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 49/391 (12%), Positives = 100/391 (25%), Gaps = 68/391 (17%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A N + + + + + + + + + + Sbjct: 155 AVRQPTAENADVPANGAAPAQDGDRNDHAGENRTDAVANGDAGATADDGKTGDADDADND 214 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVS--FIPA 179 ++ + D M + + + + + ++ + Sbjct: 215 GNTAEDADDAEHIMRDRFTFNRKTVMRAARNVAVPQPDHTKSITYNNGGKYTLNLNVVGK 274 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 R + LV+D SGSM+ M+ S C ++ ALK A F+ Sbjct: 275 DTRESHETTEKIEVVLVLDTSGSMNYCMDG------SQRGCNKSNPKRLTALKEAATSFI 328 Query: 240 D----SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + D + V + + + + T ++ V+R T + Sbjct: 329 DATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLS---ANGATPAD 385 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-----VNT 350 M A L +K +IF DG + + Sbjct: 386 KGMAAAQTALL-----------------RARPGAKKVVIFFADGVPTTQNTFSTRVANDA 428 Query: 351 IKICDKAKENFIKIVTISI------------NASPNGQRLLKTCVSSP------------ 386 + K I +I I N + VSS Sbjct: 429 VTTALAMKSAGTLIYSIGIFEGANPEQQSFGNRENDQANQFMHAVSSNYPNATAYNKTNW 488 Query: 387 ------EYHYNVVNADSLIHVFQNISQLMVH 411 Y+ +AD L +F +I + + Sbjct: 489 GTGSNLGYYKATNSADDLTKIFDDIQKEITT 519 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 81/276 (29%), Gaps = 57/276 (20%) Query: 154 YDYRLQFIEH--------LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y Q + I A + E + +VD+SGSM Sbjct: 146 YALPTDGSPFGVTTEVAATPWNPRTQLLRIGIKASD-RAVAELAPANLVFLVDVSGSM-- 202 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 +R + +++ L L +D + V L+ Y Sbjct: 203 -----------------DRREGLPLVQSTLKLLVDQLREQDRVS------LVVYAGESRV 239 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + P+ G +K + T T ++ AYQ+ Sbjct: 240 VLPPTSGRDKAKIR-TAIDQLTAGGSTAGASGIQLAYQMAREGFIDKGIN---------- 288 Query: 326 SLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV- 383 I+ TDG+ N S+ + ++ + +++ + + T+ + L++ Sbjct: 289 ------RILLATDGDFNVGISDFDSLKQMAAEQRKSGVSLTTLGFGVDNYNEHLMEQLAD 342 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + + L + + + +V+ + Sbjct: 343 AGDGNYAYIDT---LREARKVLVDQLSST-LTVVAR 374 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 68/215 (31%), Gaps = 36/215 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D + ++M A + A F D + + +GLI Y + P+ Sbjct: 15 DVSQSMRATDVEPSRMVAAQEAAKQFADELTPG------INLGLIAYAGTATVLVSPTTN 68 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E + + + + T + A+ A Q + + P Sbjct: 69 REATKNALDKLQFADR---TATGEAIFTALQAI----------ATVGAVIGGGDTPPPAR 115 Query: 333 IIFLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTISIN--------------ASPNG 375 I+ +DG+ + N AK+ + I TIS + Sbjct: 116 IVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDD 175 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + K S YN L V+ ++ Q + Sbjct: 176 ETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIG 210 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/256 (12%), Positives = 82/256 (32%), Gaps = 51/256 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + V + E+P + LV+D SGSM Sbjct: 503 GWGGSTIEQVLPVRFSGER-QREQPTLALILVIDKSGSM-------------------SS 542 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K A ++D +G+I + + + ++R + Sbjct: 543 GDRLDLVKEAARATARTLDPSD------EIGVIAFDNSPQVLVRLQPAANRLR-ISSSIR 595 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T++ PA+++AY L S K +I L+DGE Sbjct: 596 RLSAGGGTNAMPALREAYLQLA------------------GSKALVKHVILLSDGE---- 633 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +++ I + ++ + L++ ++ + + +F Sbjct: 634 SPENGINALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYFYSEDGTDVPRIFSR 693 Query: 405 ISQLMVHRKYSVILKG 420 ++ + ++ +++ +G Sbjct: 694 EAREV--KRNALVERG 707 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 68/236 (28%), Gaps = 48/236 (20%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ + L +DLSGSM + T++ A+K F+ Sbjct: 80 VDALPATGRDLILAIDLSGSMEREDF----------DLDGRTVTRLDAVKAVAKEFVT-- 127 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +GLI + ++ V + V + + T + A Sbjct: 128 -----SRAGDRVGLILFAEFAYTAAPLTFDVAAVSRIVDEATIGISGRSTAIAGGLGLAL 182 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L S + +I L+DG + + A++ + Sbjct: 183 KRL------------------KRSDAQSRVVILLSDGSDTSGNV--LPRDSARLAEQLGV 222 Query: 363 KIVTISINASP-----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + TI++ + L S + V N D L V I + Sbjct: 223 TVHTIALGPEDMETAPQTRDAVDTATLRDIAELSGGRTFRVRNTDDLRAVTAEIDR 278 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 74.1 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 75/231 (32%), Gaps = 34/231 (14%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------LLLFLDSI 242 + +NSD + N + T + L + + F++ I Sbjct: 243 PGSYTCACHEGFTLNSDGKTCNVCRGGGNGSATDLVFLIDGSKSVRPENFELVKKFINQI 302 Query: 243 -DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 D L +GL+ Y++ + + K S + K T + A+K Sbjct: 303 VDTLDVSDRLAQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKY- 361 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +F QK I TDG + ++ ++ KAK+ Sbjct: 362 ----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AARKAKDLG 404 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 K+ + + + L+ S P ++++ + ++ + + + + + Sbjct: 405 FKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQKKIC 453 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I L +GL+ Y + V+ T K + T + Sbjct: 65 FLSQVIKSLDVGPNATRVGLVNYASTVKPEFPLRAHTSKASLLQAVHRIQPLSTGTMTGL 124 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G + S K +I +TDG + + ++ Sbjct: 125 ALQFAITKALSDA----------EGGRSRSSDISKVVIVVTDGRPQDS-----VRDVSER 169 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I++ I + + L+ S P E+ V + + + Sbjct: 170 ARASGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYNVIEK 213 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 63/186 (33%), Gaps = 33/186 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T++ K L F++ + + +GL+ + + E + + + + Sbjct: 116 TRLGVSKKLLKRFIEK-------RNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEE 168 Query: 286 SLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T A+ + L S K I+ +TDG +N Sbjct: 169 ETVTEQGTAIGDAIILSTYRL------------------RNSKARSKVIVLITDGVSNTG 210 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN-----ASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K + + D A++ KI ++ I N + L + ++ + + + + Sbjct: 211 KID--PVTATDLAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMK 268 Query: 400 HVFQNI 405 V +I Sbjct: 269 AVLSSI 274 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 67/231 (29%), Gaps = 60/231 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L VDLS SM D +++ ++ A++ + + + Sbjct: 78 DLLLAVDLSDSMRTPDMLDN----------GEQQARLTAVRQQIKALIAK-------RAG 120 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+I + + + L+ + T A+ A Q + Sbjct: 121 DRVGIIVFADHAYLLSPLTQEIPALLTLSDELDFDLVGRTTALGEAILLARQHGDPGRPT 180 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + ++ +TDG N ++ ++ A I+I T+ + A Sbjct: 181 A--------------------LLLVTDGRNTAGNAD--PLQEAKLAAAQGIRIYTLGVGA 218 Query: 372 SPN--------------------GQRLLKTCVSSP-EYHYNVVNADSLIHV 401 P+ + LLK + ++ L + Sbjct: 219 DPDTFIQPYDEAGSGQADPSSELDEPLLKELAQTGQGRYFRARTQSDLDTI 269 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 74/221 (33%), Gaps = 33/221 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +++ +D ++ A K A + F+D + +GL+ +++ P+ Sbjct: 93 DTSASMTAEDMPPNRLEAAKAAAISFIDQL------PARYNVGLVSFSSSARVVTSPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF--- 329 V + + T A+ + L + + + ++ + Sbjct: 147 HALVVRSIDGL--GPPDGGTAIGEAVYSSIDDLQQILEEAAASGPSQEPEESEGSEKSEE 204 Query: 330 ------QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL----- 378 ++ L+DG N+ +S V + + + TI+ + L Sbjct: 205 SEEERSPAHLVLLSDGGNSAGRSPVAAAEAARE---AGLPTSTIAYGTEGSSATLPGGQS 261 Query: 379 -------LKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVH 411 L+ ++ Y +AD L V+ +I L+ H Sbjct: 262 VEVREDTLRNLADTTGGRFYRASSADELREVYDDIGTLVGH 302 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 63/186 (33%), Gaps = 33/186 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T++ K L F++ + + +GL+ + + E + + + + Sbjct: 108 TRLGVSKKLLKRFIEK-------RNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEE 160 Query: 286 SLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T A+ + L S K I+ +TDG +N Sbjct: 161 ETVTEQGTAIGDAIILSTYRL------------------RNSKARSKVIVLITDGVSNTG 202 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN-----ASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K + + D A++ KI ++ I N + L + ++ + + + + Sbjct: 203 KID--PVTATDLAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMK 260 Query: 400 HVFQNI 405 V +I Sbjct: 261 AVLSSI 266 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 71/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V+D SGS+ + +D + + K Sbjct: 806 VLDVVFVIDSSGSIDYDEYNIMKD-----------------------FMIGLVKKADVGK 842 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + Y E K+ D + T + A+ + + T + Sbjct: 843 AQVRFGALKYADDPEVLFYLGDLDTKMEVISMLQNDQPMGGNTYTAEALAFSDHMFTEAR 902 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I +TDGE+++ + T K ++ I ++ + I Sbjct: 903 GSRL------------HKGVPQVLIVITDGESHDAEKLNGTTKA---LRDKGILVLAVGI 947 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A N LL SS + ++ V L +F ++S + + Sbjct: 948 -AGANPVELLAMAGSS-DKYFFVETFGGLQGIFSDVSASVCN 987 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 68/217 (31%), Gaps = 21/217 (9%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA--ALKNALLLFLDSIDLLSHVKEDVYM 254 D+S S+ + D E + + K + V + Sbjct: 979 SDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIDPSDFKKMKEFVASVVQDFDVSLNRVQI 1038 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G ++ + T++ + I T A+ Q + D Sbjct: 1039 GAAQFSHTYQPEFPLGTFTDEKEVSFHIENIQQIFGYTHIGAALHQVGRYFQPDMGSRI- 1097 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + + ++ LTDG++ + + ++ + N + I ++ I + + Sbjct: 1098 -----------NTGTPQVLLVLTDGQSQD-----EVAQAAEELRHNGVDIYSVGIG-NVD 1140 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 Q+L++ + + V N D L + + I + + Sbjct: 1141 HQQLIQI-TGTADKKLTVDNFDELKKIKKRIVRNICT 1176 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 59/191 (30%), Gaps = 24/191 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQY 279 T+ + + + V +G + Y + + E + + Sbjct: 442 GSGSTQATDFQEMKTFLSEVASMFHIGPHKVRVGAVQYASSWDLEFEIGKYSNKHDLGRA 501 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T++ A+ ++L K++ ++ LT+G Sbjct: 502 IENIRQL--GGDTNTGAALNFTLRLLQKAKQQRGN-------------KVPCHLVVLTNG 546 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + ++ + +E I++ I + Q L+ + Y V + D+L Sbjct: 547 MSKDSI-----LEPAKRLREENIRVYAIGVK--EANQTQLREIAGDEKRVYYVHDFDALK 599 Query: 400 HVFQNISQLMV 410 + + Q + Sbjct: 600 DIRNQVVQEIC 610 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 64/183 (34%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ S+ V +G++ ++ ++ + T + ++ +D + + Sbjct: 640 MKTFMKSLVSKSQIGAHRVQIGVVQFSHINKEEFQL--DTFMSQSDISNAIDRMAHIGET 697 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 698 TLTGGALTFVSQYFSP--------------AKGARPNVRKFLILITDGEAQD-----VVK 738 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 739 EPAVALRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 795 Query: 411 HRK 413 + Sbjct: 796 SPR 798 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 55/197 (27%), Gaps = 27/197 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + LK +S+ L + + +GL+ Y+ E S G + + Sbjct: 238 NGSLEDFDHLK---GFLQESVSALDIKESCMRVGLVAYSNETEVIGSLSEGVNR-SAVLH 293 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R T A ++ F +G + + ++ +T Sbjct: 294 RIQSLSPGAGEAYTGA----------ALRKIRKEVFGARGGSRKNQGVPQIVVLVT---- 339 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL- 398 + + T + + + + T+ + L+ S P Y + L Sbjct: 340 HRASEDNVTRAAVN-LRRQGVTVFTLGVE--GASATQLEKIASHPAEQYVSKLPTFSDLA 396 Query: 399 ---IHVFQNISQLMVHR 412 + + + H Sbjct: 397 AHNQTFVKKLRNQITHT 413 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 71/218 (32%), Gaps = 50/218 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D SGSM ++ A F+ +ID L++ Sbjct: 768 DIVFVLDNSGSMSSND---------------PNYYRI----EATKKFIQNIDELNN---- 804 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + + V + K+ Q + T+ +K A + Sbjct: 805 -RVGLVDFDSSVSVRSNLTSDKSKLLQALNAM--RWTGGSTNIGGGLKAALGLF------ 855 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K I+ L+DG +N + + + + I + TI++ Sbjct: 856 -------------DQEQSKKIIVLLSDGYHNTGIHPNDVLP---ELIKQEIVVNTIALGK 899 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + L + ++ V N L +++ + + Sbjct: 900 DCDRELLHDIADKTKGGYFYVDNTGGLSQ--EDVDKQI 935 >gi|220925466|ref|YP_002500768.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] gi|219950073|gb|ACL60465.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] Length = 359 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/368 (9%), Positives = 97/368 (26%), Gaps = 31/368 (8%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ ++IFA +++ L G + + ++ ++S ++A L A + Sbjct: 6 RHFHSCDRGAVALIFAAAMIPLFGLAGAALDYANARRVRDVLQSISDATALLVA-----D 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 S + + R Y I +N+ I + Sbjct: 61 ADTPTVAERSFKLAENQLISRLGDRSGSGGYTIKGEWLDGSSYKLTISTKINTILIHLLS 120 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDY---------RLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + S T+ + + + + R Sbjct: 121 GKSKQFEISAVTVANRIPPRYETKPPTLSLLSPEAADYNRIYMYCFSSDPKRQAETDGGR 180 Query: 183 IEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + C N P + TK A ++ + Sbjct: 181 RGITPIADNATPPSNYGDYAPPTCGDNEAPSYMLRNVRDARTNPTKWDAKYQSVYEYYTD 240 Query: 242 --IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ID + + G Y++ ++ I ++++I T S Sbjct: 241 VVIDTGTRRQTMNMKGYKVYSSNYKEPINMDKNP---------ILETIICGSTTSCKPQS 291 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-- 357 + + + + + + + + ++ ++ + C K Sbjct: 292 EG--GILPNNHVTHDPQTATTSCEEGKFMYYGWEDRPPNAGSDRDYDDIRLLVSCPKLVK 349 Query: 358 -KENFIKI 364 + I+I Sbjct: 350 VADKKIRI 357 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 48/221 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM S + +++AA++ F+ + Sbjct: 88 DIVLALDLSGSMEREDFSLN----------GQTVSRLAAVQGVAADFVR-------GRTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R + V + + + K T + A + L + Sbjct: 131 DRVGLVVFGDRAYVAAPQTHDVASVARLIDGLQIGVSGKATAIADGLGLAIRRLRERDAK 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + I+ L+DG++ + + A++ +++ TI++ Sbjct: 191 ------------------SRVILLLSDGQDTTGMVD--PVAAAQTARDLGMRVYTIALGP 230 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + L + ++ + V D L V Sbjct: 231 ADLSDDPGARDAVDADTLRRIAQAAGGETFRVRTTDDLQAV 271 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/302 (10%), Positives = 73/302 (24%), Gaps = 32/302 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E+ ++I AL + + G I + + + K M+SA +AA+ G K+ + Sbjct: 5 LGRHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKL---PN 61 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + S + A + Sbjct: 62 QGLAMTTANSFITSN---GFDPNDATITYTQDTVNNPAGSPEINCSMTNQVPTFFLGLFG 118 Query: 134 NNRLDSSNNTI----FYNMDVMTSYDYRLQ----------FIEH------LLNQRYNQKI 173 + + + +Y L + Sbjct: 119 YPNISITVSAKGILQTGGAGGPFNYAIFSDQNLPISGNQKITGSVHTNHQLTISGNTKIS 178 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + A G I ++ + + + + + I +++ A Sbjct: 179 GAAEGATGVRLSGSNTIGSVQADTTANIHVSGSNTIGSQSGGATQIAMPDFTSQIEAAAQ 238 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A S +LS + + + +I S T + + + I T Sbjct: 239 AQGTVYYSAKILSGTNDLDGNIWV------QGDITMSGMTTSGTGAILANGNITISGNTT 292 Query: 294 ST 295 Sbjct: 293 IG 294 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/373 (10%), Positives = 106/373 (28%), Gaps = 50/373 (13%) Query: 40 LIYVLDWHYKKNSM--ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRF 97 I ++ K + E + + L+ + + E +N A + Sbjct: 196 AIDASNYPTIKVKLAVEDGSEQSDLSSGQVAIKENTVAQKTAEVNANTADKTYEIVYDTT 255 Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + N + + + + S + + + + V + Sbjct: 256 VSNTNPPNGEQRVVDLVIGDNKLSESYKSPSQKKLHIDDVSYN---TDEYPKVNVYFSL- 311 Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVN 215 + ++ + ++ G++ + ++ + D D Sbjct: 312 ------YDENNQLVEDMNPVKTAFTVKEGDKETKNASFSKLTEKPQAISTNLVIDVSD-- 363 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + K+ +K+A FL + +GL+ ++ S T + Sbjct: 364 -----SMSEDNKLTKVKDAATQFLSHASFA----SNDVVGLMSFSDASNIRQ--SDFTTE 412 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + T A+ QA S+ K+++ Sbjct: 413 IESIKSSIAGMQTSGCTALYEALNQAVS-----------------NTAYNSVEGSKYVVV 455 Query: 336 LTDGENN--NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NV 392 TDG+N + + V+ + + A + + I I + + L + + ++ Sbjct: 456 FTDGKNTICDGTNWVSPSTVINNALQWGVPIYAIGVEEDAD---LQQIAEQTNGQYHVLG 512 Query: 393 VNADSLIHVFQNI 405 + L ++ +I Sbjct: 513 NDFTDLNAIYSDI 525 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 71/246 (28%), Gaps = 48/246 (19%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 E I I ++ + V+D SGSM Sbjct: 198 SESQRLPAMVSIITKDIEQYVKNNSSIEAGIDLLCVIDKSGSM----------------- 240 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + K+A+++ +L+ LD + + LI + ++ T+ + Y Sbjct: 241 ---EGKKIASVQQSLVQLLDFLSEKD------RLCLITFDGSAQRLTPLKTLTQDNKNYF 291 Query: 281 T-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ + A+ + K ++ T+ I L+DG Sbjct: 292 KKAIYSIRASGQTNIAKGTEIAFNQIQQRKMKNQVTS----------------IFLLSDG 335 Query: 340 ENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 ++ + + + + I + + + + K C Y + + L Sbjct: 336 QDQGAAEYIQRQKDVVEDI----VTIHSFGYGSDHDAALMSKICKVGQGSFYYIEDVKLL 391 Query: 399 IHVFQN 404 F + Sbjct: 392 DEFFAD 397 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 30/255 (11%), Positives = 79/255 (30%), Gaps = 54/255 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I++ + +++ ++ +D+S SM ++ + Sbjct: 68 LILALARPAMDVQVPRERATILV-AIDVSPSMAATDV---------------APDRLTSA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K + F++S+ +GL+ +++ P+ + V + S Sbjct: 112 KESAQSFIESLPP------RFNVGLVAFSSVATVVASPTQDHQAVADSIANLTISS---G 162 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + Q + S +++ I+ L+DGEN + + Sbjct: 163 TAIGEGVFASLQAIRSFDEKATDDPPPAA------------IVLLSDGENTSGRPVAAAA 210 Query: 352 KICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + TI+ A+ + + L + +++ Y + Sbjct: 211 DEARA---AGVPVSTIAFGTGVSIIEIEGHYVPANIDKETLKELAMTTGGRFYEAESTGE 267 Query: 398 LIHVFQNISQLMVHR 412 L V+ +I + Sbjct: 268 LKDVYADIGSSLGTE 282 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 77/238 (32%), Gaps = 34/238 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-----------L 235 E+ + +NSD + N T + L + + Sbjct: 190 EQVCLSSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELV 249 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ I D L + +GL+ Y++ V + K S + K T + Sbjct: 250 KKFINQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMT 309 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K +F QK I TDG + ++ ++ Sbjct: 310 GAALKY-----------LIDNSFTVSSGA--RPGAQKVGIVFTDGRSQDYIND-----AA 351 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P E+++ + ++ + + + + + Sbjct: 352 KKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKRLQKKIC 407 >gi|302405156|ref|XP_003000415.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] gi|261361072|gb|EEY23500.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] Length = 662 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 69/217 (31%), Gaps = 25/217 (11%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R I LV+D+SGSM ++ P V ++ + + K+A L+++D Sbjct: 87 PRAPCDIVLVIDVSGSM---DDAAPAPVIPGQKDENTGLSILDLTKHAARTILETLDERD 143 Query: 247 HVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +G++ +TT + + + + + L T+ + + ++ Sbjct: 144 ------RLGIVAFTTNAKVILSLVEMNPDNKVSAKDKIENLQPLNGTNMWHGITEGIKLF 197 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + S ++ LTDG N+ + I + I Sbjct: 198 SDCDSSSGRVP---------------AMMVLTDGLPNSGCPRLGYIPKLRDMGQLPATIH 242 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 T L + + +A + VF Sbjct: 243 TFGFGYHIRSGLLKSIAEIGGGNYAFIPDAGMIGTVF 279 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 71/242 (29%), Gaps = 73/242 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + VD+S SM +D +++ K + ++++ Sbjct: 92 DVVFAVDVSKSMDA---------------EDIAPSRIDKAKQLVTQIINNLGS------- 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + ++ D L + T A++ A +++ Sbjct: 130 DRVGIIAYAGSAYPQLPITTDYSSAKMFLNAMNTDMLSSQGTAIRDAIELAKTYYNDEEQ 189 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + ++ ++DGE++ I + A E I+I TI + Sbjct: 190 TN------------------RVLVIISDGEDHAG----EVASIAESATEQGIRIFTIGVG 227 Query: 371 ASP----------------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + L + + + N + ++ Sbjct: 228 SEAGDRIPIKRNGVVQSYKKDQNGETVITKLDPATLQEIAAEADGEYINGNSTQEVVDKM 287 Query: 403 QN 404 + Sbjct: 288 AS 289 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 59/176 (33%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + V G + Y+ E + T K D I + T + Sbjct: 821 EFMISLVKKSDVSHDRVQFGAVKYSAEPETFFYLNRYTTKSAIIRAIQNDKSIGETTYTA 880 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + F + + +I +TDG++++ ++ Sbjct: 881 KALRHS------------EGLFSEEHGSRKHRGVPQVLIVITDGDSHDA---AELDEVSK 925 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 K + N I I I I LL S + ++ V + L H++ IS+ + Sbjct: 926 KLRANGIIIYAIGIER-ARPDELLTM-AGSEDKYFYVNTFEGLKHLYPRISEKICS 979 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 60/193 (31%), Gaps = 22/193 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + K + FL + +V +G+ Y+ R K + + K+ Sbjct: 997 GSNSISDSDFTK--MKNFLQDVVRPFDTGHNVQVGIAQYSDRYRKEFSLNMFSHKLELET 1054 Query: 281 TRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T A+ + T +G + Q+ ++ +TDG Sbjct: 1055 QIGRIRQMEGLQTYIGAALDRVRNFFTP------------EGGSRVNENIQQILLVITDG 1102 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 +++ +K + ++ + I I + + L S + Y V N L Sbjct: 1103 RSHD-----KVVKAAEDLRKKGVDIYAIGVGRIDH--LQLSQIAGSSDRKYTVDNFSELK 1155 Query: 400 HVFQNISQLMVHR 412 + + + + Sbjct: 1156 VIKKRLVDDICEE 1168 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 62/178 (34%), Gaps = 22/178 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ + + + +++G I +++R ++ S K SL+ + T Sbjct: 632 MKDFMSELVNKSDIGLDRMHVGAIQFSSRNKEEFRLSQYATKSDIIRAIGRMSLMGQSTL 691 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + +K +K +I +TDGE + Sbjct: 692 TGGALQFVLDYF--------------RPIKGSRPYVKKILILITDGEAQDD-----VKTP 732 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I + ++ + + L PE + V + D L H+ I + Sbjct: 733 AEALRQEGIIVYSVGVF--NANRTQLVEISGKPEMVFYVEDFDILKHLENEILFGICS 788 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 61/224 (27%), Gaps = 43/224 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I ++ S SM+ +D + LK + F+ Sbjct: 422 DTEEADIYFLIGGSSSMNYFDFADLKLF----------------LKEVVKFFM------- 458 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V G++ Y E P + + + + A+K Sbjct: 459 VGPNKVRFGVVQYAEINELEFGPEEYGKTSDILKAIENIRQLRGNPHTGKALK------- 511 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 F R+ S ++ LTD S + K K + I Sbjct: 512 ------FIHPLLRKSQGQHSRNVPCHLVVLTD-----QISEDPVKEPAKKLKNEMVSIYA 560 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I I + + S + Y V + SL H+ + + + Sbjct: 561 IGIR--HANESQIYEIAESKDRAYFVNDFASLKHIRNEVVRDIC 602 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 53/177 (29%), Gaps = 26/177 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L + + +GL+ Y+ + + T+K + S + Sbjct: 247 VSSLDVKDKCMRIGLVTYSNKPQATSLLRMATDKTHVLQSIQSISPKGGKAN-------- 298 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L S + F + S ++ I +T + S + + Sbjct: 299 ---LGSVIHFTKEKVFSKSAGSRKSQGVEQIAIVIT-----HRSSEDDVSGAATALRRAG 350 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE-------YHYNVVNADSL--IHVFQNISQLM 409 + I I I+A N +L + PE ++ N + + I Q + Sbjct: 351 VTIFAIGIDA-ANTTQLAQIVAYPPEKHLTRLKTFSHLPNKTQIFHKKIMNQIQQKL 406 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 53/178 (29%), Gaps = 28/178 (15%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +GL+ + + P+ V+ V + T + A+ A Q + + + Sbjct: 126 VNLGLVSFAGTAAVLVSPTTDRNAVKNGVDNLQLAE---STATGEAIFTAMQSIDTFSRS 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-----KICDKAKENFIKIVT 366 + G P ++ L+DG + A+ I + T Sbjct: 183 ------LQGGPDQQGTPPPARVVLLSDGTQTVPGPDGENEPRGSFTAASDAQRRGIPVST 236 Query: 367 ISINAS--------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 IS S + + + S + L V+ ++S+ + Sbjct: 237 ISFGTSYGSIELDGGRTPVAVDDASMERIASLSGGRFFTAATESELRAVYSDLSEELG 294 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 56/165 (33%), Gaps = 19/165 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D ++ V G + Y+ + + + + D+L+ T + Sbjct: 839 DFVIDIVNKSDVGNNRVQFGAVKYSAYPQILFNLNGNKADIIDKIKG--DTLLNDTTYTA 896 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + T K + ++ +TDG +++ + Sbjct: 897 EALRHAENLFTESKGS------------RKRRGVPQLLMVITDGTSHDK---DKLDAVST 941 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + + I I I I + + L+ + ++ + V D L + Sbjct: 942 RIRNDGITIYAIGIKDAKREE--LEIIAENKDHVFFVDTFDGLKN 984 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 61/184 (33%), Gaps = 22/184 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + ++ + V +G++ ++ ++ + T K + D + Sbjct: 642 DNFKKMKTFIKEVVNRSYIGVDQVQVGVVHFSDTSKEVFSLNKNTSKKSDIIQAVEDMSL 701 Query: 289 LKPT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T ++ A++ + K K ++ +TDGE ++ Sbjct: 702 IGSTTNTGGALRFVSKYFKL--------------AKQARPSVNKILVLITDGE-----AS 742 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + I I ++ + + L+ PE + V N D L + +I Sbjct: 743 DEVTAPATELRNDGIIIYSVGVF--NANKTQLEEISGKPEKVFYVENFDILEDIKGDIIF 800 Query: 408 LMVH 411 + Sbjct: 801 GICS 804 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TD 293 + FL + +++ G+ ++ + K S+ T Sbjct: 1021 VKDFLTKLVDSISFHDNIQFGMAQFSDIYSEEFPLGHYQSKSELKNKIANVSMQAGLHTY 1080 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 +K+ FF+ + + + ++ TDGE+ + + Sbjct: 1081 IGKGLKE-------------VKAFFKSPRRRVARNVHQKLLIFTDGESKDS-----FTQP 1122 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + ++I + + + + L+ SPE Y N L H+ + I++ M Sbjct: 1123 AEDLRREGVEIHAVGVGKIEHAK--LQQITVSPERIYTTANYTGLPHITKGITEEMCKE 1179 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 57/183 (31%), Gaps = 26/183 (14%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLIL 289 K+ + I + + + V G++ Y+ E + + + Sbjct: 461 KDMKTFLKEVIKMFTVGPDHVRFGVVQYSNIHRTEFEIDKHSTISNLEKAINNIQYL--T 518 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A++ + S K ++I LTDG+ ++ Sbjct: 519 GDTYTGAALESMLGLFESA-------------RKQRKNKVPTYLIVLTDGDPHD-----K 560 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + D+ ++ I ++ I + + L + Y V SL + NI Q + Sbjct: 561 VKEPADRLRKAGINVIAIGVG-DIKWKGLQEI---GESNVYYVHQYASLKTIKDNIVQDI 616 Query: 410 VHR 412 Sbjct: 617 CSE 619 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 51/190 (26%), Gaps = 10/190 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + ++ + + NF ++ AL V L + + + + + M+ A +AA + Sbjct: 1 MIKVLTAMQRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIK 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG-----YSAVFYNTEI 118 S R E+ A + ES + + Sbjct: 61 AVRSYGEGESENAVRTEANRLFFANFQTPSATDGYNSASPESPAVEFTFSETGQETRASA 120 Query: 119 QNIVNSSRIS--MTHMANNRLDSSNNTI---FYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 + + + +R + + + + ++ N I Sbjct: 121 SYAAQYNPVFWGLQPFVISRRSVAARLTDREACILALHPTASRAIEVSGSAAVDTSNCTI 180 Query: 174 VSFIPALLRI 183 S I Sbjct: 181 TSNSDDAQSI 190 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 11/73 (15%), Positives = 33/73 (45%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + ++ +E+ N IIFA + + + + + + + K+ ++ A++ A L+ + Sbjct: 15 RQFLRRFANNERGNVMIIFAAAALPMAIGVAGALEISQYSQLKSQLQEASDRAALSAMAA 74 Query: 68 MVSNLSRLGDRFE 80 + + + Sbjct: 75 LREGPRAMRQQAR 87 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 90/287 (31%), Gaps = 32/287 (11%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 F T + + + +L +T V S I ++ Q Sbjct: 4 FQVTTMILQPRLLTWFFSTLLLTQL-MMASTAAAESTVKVSTTPIFPQIPLGQARKDFQV 62 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ + R + V+D+SGSM+ D + P + +++ LK Sbjct: 63 LLRVEAPTAAVRPEARVPIDVVAVLDVSGSMN--------DPAAVPPERRPTTSRLDLLK 114 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLIL 289 A + ++ + ++ + R K + + R+ + Sbjct: 115 TAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEAR 168 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T PA ++A ++L G FI+ LTDGE+ + + Sbjct: 169 GGTALVPAFEEAVKVLD--------------GRVGDGRNRLGFIVLLTDGEDTSGFTLSE 214 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + + + T+ + + + + LL S + V + + Sbjct: 215 RRREVIRGALGRYPVHTLGLGRAHDPEVLLYLAQESHGTYSFVDDDN 261 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 33/204 (16%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 K+ + A +D ++ ++ ++ Y RVE E ++ Sbjct: 90 SGSMSGDKIKYARQAAKFLIDQLNSTD------HLSIVNYDDRVEVTSPSQSVKNKEALK 143 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + D T+ + M + Y + S +K + ++ LT Sbjct: 144 AAIDKIHD---RGSTNLSGGMLEGYTQVKSTRK----------------EGYVNRVLLLT 184 Query: 338 DGENNNF-KSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DG N + ++ + KE+ I + T + A N L + +Y + + Sbjct: 185 DGLANQGITDPLELKRLAENKYKEDGIALSTFGVGADYNEDLLTMLAENGRANYYFIDSP 244 Query: 396 DSLIHVFQNISQLMVHRKYSVILK 419 D + +F + + SV+ + Sbjct: 245 DKIPQIF---AGELKGL-LSVVAQ 264 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 71/235 (30%), Gaps = 50/235 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + DLS SM ++ A K+ + F+D Sbjct: 86 TVNGIDIMIAFDLSYSMETPDMVLNRM----------PINRLVAAKHVITQFVD------ 129 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQIL 305 + D +G++G+ + + + V + + T A+ A L Sbjct: 130 -SRPDDRIGIVGFAGKTKSFCPLTLDHALVNSIIRDFHPRMIQADGTAIGSAIAAAATRL 188 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 K+ K II +TDG +N+ + + + A + IKI Sbjct: 189 DDRKETK-----------------SKIIILVTDGASNSGQISPLV--AAENAAKLGIKIY 229 Query: 366 TISINASP-------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 TI++ + L K + H+ N S F +I + Sbjct: 230 TIAVGTEEGTLANGMVVQSEFDEPTLRKIAQLTGGEHFRATNMASFNKAFTSIGK 284 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 34/323 (10%), Positives = 84/323 (26%), Gaps = 26/323 (8%) Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 K S+S ++ + L++ +N + L Sbjct: 303 AADHKLSVSKSKVTAKFEVKSFSDFILTWNIDATPADPLETGDNAASIEKQINHEKYATL 362 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + ++ A + + + ++D SGSM Sbjct: 363 --------RDDGTYDLTLTVAGKKGTETNKAKLDVIYILDKSGSMKEDFGG--------T 406 Query: 219 ICQDKKRTKMAALKNALLLFLD---SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + + AL +L + +++ T+ + E + Sbjct: 407 SKRIAASNAITALTKSLKQNANIDARFSMVTFSGNKTTGMWGQGDTKTWDDAEVAVSWTT 466 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + R T+ ++ A ++LTS + + F Sbjct: 467 DAGTIERGSKPTSNGGTNYQAGIRTAKELLTSKRAGAMTAVIFISDGDPTFYYNPDGYTR 526 Query: 336 LTDGENNNFKSNV---NTIKICDKAKENFIK-IVTISINASPNGQRLLKTCVSSP---EY 388 + N ++ ++ + T+ + + + L C +S Sbjct: 527 GDGNNDGNGGADNLKVCLDAAKNEIANLGVNYFYTVGVGKANDYVNLSDLCSASGVSGAK 586 Query: 389 HYNVVNADSLIHVFQNISQLMVH 411 +++ N D L F I ++ Sbjct: 587 NFDGTNTDELTKAFSTIESDILT 609 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/386 (11%), Positives = 108/386 (27%), Gaps = 48/386 (12%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 H K ++ A + + +I+ A + + Sbjct: 96 RAHGSKAALLQAVRR-----IRPLATGTMTGLAIQFAITKALSDAEGGRVTSPGISKVVI 150 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 ++ ++ +S + + + + + ++E Sbjct: 151 VVTDGRPQDSVRDVSERARASGVELFAIGVGGRVDKATLRQIASEPQDEHV---DYVESY 207 Query: 165 L----NQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + ++ + L + + + +NSD + N Sbjct: 208 SVIEKLSKKFREAFCVVSDLCATGDHDCEQVCVSSPGSYTCACRDGFTLNSDGKTCNVCS 267 Query: 219 ICQDKKRTKMAALKNA-----------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKN 266 T + L + + F++ I D L +GL+ Y++ V + Sbjct: 268 GGGGSSATDLVFLIDGSKSVRPENFELVKKFINQIVDTLDVSDRLAQVGLVQYSSSVRQE 327 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 K S + K T + A+K +F Sbjct: 328 FPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--R 374 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 QK I TDG + ++ ++ KAK+ K+ + + + L+ S P Sbjct: 375 PGAQKVGIVFTDGRSQDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEP 427 Query: 387 --EYHYNVVNADSLIHVFQNISQLMV 410 E+++ + ++ + + + + + Sbjct: 428 VAEHYFYTADFKTITQIGKKLQKRIC 453 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 53/165 (32%), Gaps = 16/165 (9%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +G++ Y + V + K + T + Sbjct: 63 FLSQVIESLDVGPNATRVGVVNYASAVRQEFPLRAHGSKAALLQAVRRIRPLATGTMTGL 122 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A SD +G ++ S K +I +TDG + + ++ Sbjct: 123 AIQFAITKALSDA----------EGGRVTSPGISKVVIVVTDGRPQDS-----VRDVSER 167 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIH 400 A+ + +++ I + + L + E+ V + + Sbjct: 168 ARASGVELFAIGVGGRVDKATLRQIASEPQDEHVDYVESYSVIEK 212 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 64/189 (33%), Gaps = 24/189 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDM 284 + + L+ I+ + V +G + Y+ EK + + T +R+ ++ Sbjct: 452 RSDHFEQIKKFMLEVIENFDIGPDKVRVGAVQYSDTREKEFDITDYTTDETLRKAISNIR 511 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + A+ QI+ +++ ++I LTDG Sbjct: 512 QL--GGGTYTGEALDFILQIIKKGREQRIN-------------KVPCYLIVLTDGM---- 552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S + ++ +K + I + I I + L+ E N DSL ++ Sbjct: 553 -SMDDVLEPAEKLRAENIAVHAIGIG--EANRTQLQQIAGKEERVSFGQNFDSLKNIKNE 609 Query: 405 ISQLMVHRK 413 + + K Sbjct: 610 VLHSICTEK 618 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + V G + Y+ + + + +Y+ T + A+++A Sbjct: 839 VKKSDVGPNRVQFGALRYSNEPDIIFYLNSNRSAIMEYLRSLSA--KGGDTYTAKALERA 896 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F + ++ +I +TDG++++ + K + Sbjct: 897 ------------NILFTEEHGSRIKQNVKQMLIIITDGKSHDHI---HLSDKASKLRAKG 941 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I + + Q L+T + Y +V N DSL V+Q + + M Sbjct: 942 IIIYAVGVG--EANQEELETMAGNKHYTIHVSNFDSLKDVYQPLQESMCT 989 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/181 (9%), Positives = 54/181 (29%), Gaps = 22/181 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ ++ + + + S I + Sbjct: 643 RTMKSFMKNLLAKIQIGLDKTRIGVVQFSDVTKEEFKLDTYFTQKEISDAIERMSPIEQN 702 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I +TDGE + Sbjct: 703 TLTGKALTS--------------IEPYFTESKGARSMVRKFLILITDGEAQDD-----VR 743 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ + I + + + L+ + V + L + + + Sbjct: 744 NPAKVLRDKGVVIFAVGVFR--ANRTQLEEISGDSSLVFQVESFSDLQEIENKLIFRVCA 801 Query: 412 R 412 Sbjct: 802 L 802 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 73/223 (32%), Gaps = 37/223 (16%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM + + + K +++ LK A+ + +D Sbjct: 45 PIDVVAVLDVSGSMGDPAMASSDFEKN------KPPSRLDVLKEAMKFIIRKLDDGD--- 95 Query: 250 EDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + R K + R + T PA+++A ++L Sbjct: 96 ---RLSIVAFNDRPVKEYSTGLLNISGNGRRIAEKKVDWLEARGGTALMPALEEAIRVLD 152 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + F I+ LTDG++ + + K + T Sbjct: 153 CRPGDSRNSVGF--------------ILLLTDGDDTSGFRWSRDVINGAVGK---YPVHT 195 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + A+ + + LL S + V + + I+ + Sbjct: 196 FGLGAAHSSEALLHIAQESRGTYSFVDDEN-----MDKIAGAL 233 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 83/259 (32%), Gaps = 31/259 (11%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 +T V S I ++ Q ++ + R + V+D+S Sbjct: 23 ASTAAAESTVKVSTTPIFPQIPLGQARKDFQVLLRVEAPTAAVRPEARVPIDVVAVLDVS 82 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM+ D + P + +++ LK A + ++ + ++ + Sbjct: 83 GSMN--------DPAAVPPERRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFN 128 Query: 261 TRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 R K + + R+ + T PA ++A ++L Sbjct: 129 DRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFEEAVKVLD----------- 177 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 G FI+ LTDGE+ + + + + + T+ + + + + Sbjct: 178 ---GRVGDGRNRLGFIVLLTDGEDTSGFTLSERRREVIRGALGRYPVHTLGLGRAHDPEV 234 Query: 378 LLKTCVSSPEYHYNVVNAD 396 LL S + V + + Sbjct: 235 LLYLAQESHGTYSFVDDDN 253 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 71/229 (31%), Gaps = 52/229 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + +D SGSM S K A F+D ++ Sbjct: 66 TTSVPIDVVFALDSSGSMGWNDPSGLR-------------------KTAAKSFVDKLNST 106 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + G++ + ++ + V+ + S TD + A +L Sbjct: 107 TD-----QAGVVSWDNNIDFTQTLTNNFSLVKSKIDAVDSS---GGTDLNVGLNAAISLL 158 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K+ + IIFL++G+ S A + Sbjct: 159 DTGKQANS----------------SWVIIFLSNGQGTYSHST------AVVAANKGYTVY 196 Query: 366 TIS--INASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI I+ + LK ++ +Y+ NA +L VF +I + +V Sbjct: 197 TIGLAISPGSTAESNLKDIANTTGGKYYSSPNATNLDAVFNDIYKEVVT 245 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 23/261 (8%), Positives = 62/261 (23%), Gaps = 30/261 (11%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 N K + +G +A + VN + I + + T Sbjct: 2 NGRKEFAAILFAAITILALAIVPTGATAQIAPPTVSKTVNPTDIVLLTEETTITLTIQGT 61 Query: 144 IFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 V + L + + + + + + Sbjct: 62 GSEWTTSVPIDVVFALDSSGSMGWNDPSGLRKTAAKSFVDKLNSTTDQAGVV-------- 113 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL-----FLDSIDLLSHVKEDVYMGLI 257 + +++ + + ++K+ A+ ++ +I LL K+ +I Sbjct: 114 ---SWDNNIDFTQTLTNNFSLVKSKIDAVDSSGGTDLNVGLNAAISLLDTGKQANSSWVI 170 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + + + ++ + T T + A ++ + Sbjct: 171 IFLSNGQG----TYSHSTAVVAANK-------GYTVYTIGL--AISPGSTAESNLKDIAN 217 Query: 318 FRQGVKIPSLPFQKFIIFLTD 338 G S D Sbjct: 218 TTGGKYYSSPNATNLDAVFND 238 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 66/199 (33%), Gaps = 33/199 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + +D K ++ A K L F++ + + L+ + + Sbjct: 109 DISDSMLAEDLKPNRLEAAKGMALRFIE-------GRTTDRIALVAFAGETATLSPLTTD 161 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 +++Y+ ++I T A+ L + + Sbjct: 162 YTALKEYLASINTNIIRTSGTALGMALSSCVNKLRDVAGK------------------SR 203 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG----QRLLKTCVSSP- 386 I ++DG+N + AK +++ TI+I + ++ L+ P Sbjct: 204 IAIIISDGDNTAGTIP--PETAVELAKSFGVRVYTIAIGKPGSEEGVDEKTLRMLAGGPN 261 Query: 387 EYHYNVVNADSLIHVFQNI 405 + + SL +F I Sbjct: 262 GRFFKAADNSSLSKIFDEI 280 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 71/236 (30%), Gaps = 47/236 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D + ++ A +NA F+ + V +G++ + + P+ Sbjct: 93 DVSGSMRATDVQPDRLTAAQNAAKAFIQDL------PRHVRVGVVAFAGTAQLAQLPTQS 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK--------------RSFFTNFF 318 E + + + T + + A L D R Sbjct: 147 HEDLLKAIDSFQLQR---GTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPIDEVGR 203 Query: 319 RQGVKIPSLPF------QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K P P II LTDG+ ++ A + +++ T+ + Sbjct: 204 ADPAKKPFTPVAPGSYRSAAIIMLTDGQRTTGVD---PLEAAQWAADRGVRVYTVGVGTV 260 Query: 373 P---------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + L + + +++ A L V++ +S + K Sbjct: 261 QGELIGFEGWSMRVRLDEDTLKAVALRTNAEYFHAATAQDLRKVYETLSSRLTVEK 316 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + + V G + Y+ + + + + M T + Sbjct: 834 INLTIHLVKKADVGRNGVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGSTYT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A F + ++ +I +TDG++++ +T Sbjct: 894 AKALKHA------------NALFTEEHGSRIKENVKQMLIVITDGKSHDHDQLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + ++ I I+ + + Q+ L+ + V N D L VF + + M Sbjct: 939 SELRDKGITILAVGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQESMCTE 994 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/282 (12%), Positives = 82/282 (29%), Gaps = 28/282 (9%) Query: 135 NRLDSSNNTIFYNMDVMTS--YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + +I + Y + +L + + S L+ E + Sbjct: 364 EDVTMFALSIQGANNTQLEEIVSYPPEQTISMLKSYADLETYS--TKFLKKLQNEIWSQI 421 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKED 251 + +++ D++ + K +K +L + + Sbjct: 422 STYAEQRNLDKTGCVDTKEADIHFLIDGSSSIQEKEFEQIKRFMLEVTEMFS---IGPDK 478 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ Y+ E S + + + T + A+ QI + K Sbjct: 479 VRVGVVQYSDNAEVEFYISDYSNDIGLRKAIFNIKQLTGRTYTGKALDYILQIXKNGSKD 538 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++I LTDG S ++ + + I I + I A Sbjct: 539 RM-------------SKVPCYLIVLTDGM-----SMDRVVEPGKRLRAEQITIHAVGIGA 580 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L+ E + N D+L + + + + K Sbjct: 581 ANKIE--LQEIAGKEERVHFGQNFDALKSIKNEVVREICTEK 620 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ ++ + + + D SLI K T + A+ Sbjct: 655 LTKIQIGADKTQIGVVQFSDTTKEEFQLNRYFTRQEISDAIDRMSLINKGTLTGKALNFV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG ++ + + Sbjct: 715 GQYFT--------------HSKGARLGAKKFLILITDGVARDYVRDP-----ARILRGKN 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 756 VTIFSVGV-YNANRSQ-LEEISGDGSLVFHVENFDHLKA 792 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + E V L + + + V++ Sbjct: 964 MAGNKNNTIYVDNFDKLKDVFTLVQESMCTEAPEVCSLQEADVIFLCDGSDRVSN----- 1018 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 1019 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKNQ 1071 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 1072 IQNVSKSSGFPRIDFALKKVSNMFN------------LYAGGRRNAGVPQTLVVITSGDP 1119 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 + ++ K+ I ++ + I Q LL + E + D L Sbjct: 1120 HYNVAD-----AVKTLKDLGICVLVLGIGDVSKEQLLL--ITGNSEKIITFQDFDKLKNV 1172 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 1173 DVRKRIVREICQ 1184 >gi|119899154|ref|YP_934367.1| hypothetical protein azo2864 [Azoarcus sp. BH72] gi|119671567|emb|CAL95480.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 339 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 69/228 (30%), Gaps = 39/228 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++A + A F+ + DV +G++ + P+ Sbjct: 93 DVSLSMSAPDVLPDRLSAAQAAARDFVR------NQPPDVRIGIVSFAGTATVVQAPTDN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK- 331 E + + R + T + A L ++ + + Sbjct: 147 REDLLGAIDRLQLARH---TAIGSGIIVALSALFPEESFDPDPTMMSSAEPGRAPNAPRE 203 Query: 332 ----------FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------- 373 +I LTDG + + A I++ T+ + Sbjct: 204 EVAPGSNGSAAVILLTDGRRTSG---PEPVDAARMAAVRGIRVFTVGFGTAEGATIQNEG 260 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI-SQLMVHRK 413 + L + +++ A L ++ ++ ++ ++ R+ Sbjct: 261 WSVFMRFDEGTLRAIADLTQAKYFHAGTAAELQQIYHDLNARYVLERR 308 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 95/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANLGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|291295701|ref|YP_003507099.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470660|gb|ADD28079.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 298 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 62/187 (33%), Gaps = 30/187 (16%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +M A + A +D + + +G++ ++ + P+ + +RQ + Sbjct: 102 APNRMVATQMAAKALVDKL------PRHIKVGVVTFSGYGTLLLPPTTDRKAIRQAIDNL 155 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T + A + L P I+ + G + + Sbjct: 156 DL---GGGFSFTYGLLAALEAL---------------PQTPPEGSRPGVIVLFSHGHDVS 197 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +KI D+A E I++ I + + + + L K + +Y + +A L Sbjct: 198 GND---PLKIADQALERGIQVHAIGVGTHGHNFDEEMLKKVADRTGGRYYPIFSASDLSK 254 Query: 401 VFQNISQ 407 ++ + Sbjct: 255 AHADLGR 261 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 95/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 28/344 (8%), Positives = 82/344 (23%), Gaps = 92/344 (26%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-----------------------P 189 + +++++ K ++ ++ + + Sbjct: 83 EIVLLVDTSGSMIDRKVEGKYR-INYSVDQVFIEKEWKWYRRVSLENGQKYIIHELLYTS 141 Query: 190 IFLIELVVDLSGSM---HCAMNSDPEDVNSAP-ICQDKKRTKMAALKNALLLFLDSIDLL 245 I + + D + +K+ L+ A F++ + Sbjct: 142 ERDIIINGQIYTEYIEIDNKRYYLKYDYSDGSWYAIAYSESKIDELQKAAKNFVNKFET- 200 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST---------- 295 K + +GL+ Y + E + +++ + + T+ Sbjct: 201 ---KSNTKIGLVSYGNKGEVVHSLTNELDRINSSID--YGLSVYGATNIGDGIRKANGLL 255 Query: 296 --------------PAMKQA--------------YQILTSDKKRSFFTNFFRQGVKIP-- 325 + A Y G Sbjct: 256 NNGSNADKYIVLMTDGIPTAATCYSNVKSERTEHYSNFYRRNIYDTPVGAILDGYGRELE 315 Query: 326 -----------------SLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + I + G+N+ + + +A EN + + Sbjct: 316 KFNLITDYNYKFEYNPKDYIYSDENIIINRGDNDYGNIALNYAKESLKRASENGVNNFVV 375 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + +L + + Y+ ++ D L V+ I+ + + Sbjct: 376 GFSNGISRDKLTQIADAGNGYYREAMHGDELEDVYNRIADEIKN 419 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 38/257 (14%), Positives = 82/257 (31%), Gaps = 58/257 (22%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V F + + R + + VD+S SM + D ++ A Sbjct: 68 LVVGFARPTAEVRV-PRERATVMVAVDVSTSM---------------LAGDVDPDRLTAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 K + F+D + ++ +GL+ + + P E + + + R ++ + Sbjct: 112 KESARRFVDGL------PDEFNVGLVAFAGSAAVLVPPDTDREALDEGIDRLVEGATGVQ 165 Query: 291 PTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ A + L + + I+ L+DG N + Sbjct: 166 GTAIGEAINTSLGAVKALDGEAAKDPP---------------PARIVLLSDGANTSGMD- 209 Query: 348 VNTIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVV 393 ++ A E + + TI+ +GQ L + + V Sbjct: 210 --PMEAAADAVEMEVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDAVARETGGQFHEAV 267 Query: 394 NADSLIHVFQNISQLMV 410 + + L V+ +I + Sbjct: 268 SVEELRAVYDDIGSSVG 284 >gi|302832626|ref|XP_002947877.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] gi|300266679|gb|EFJ50865.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] Length = 733 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 42/365 (11%), Positives = 104/365 (28%), Gaps = 53/365 (14%) Query: 50 KNSMESANNAAILAG-ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 ++ ++ A A+ A AS + + + + S + AL + + + + + Sbjct: 101 QSKLQ-ALEKAVRAAPASAVGRESNGVEGNGSAESKVPQTALESPQPKDVDSGLTPGAAP 159 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 +A + + A S + + + Sbjct: 160 GAAPAAAPGAAPGAAPAAVGAATAAEEVDVLSISVVPEYDQYGLEAEAV----------- 208 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + ++ ER + V+D SGSM + Sbjct: 209 -----RAVVSIKAIADVPERARVALTCVLDRSGSM--------------------SGGPI 243 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SL 287 ++ +D + ++GLI Y V +++ T R ++ + Sbjct: 244 RLVRETCHFLIDQLTSDD------FLGLISYAHDVREDLPLLRMTPASRTLAHAVVEELV 297 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII----FLTDGENNN 343 T + + + ++ N G S P ++ TDG+ + Sbjct: 298 AGGSTALYDGLVAGLRQQMAAERDLGGGNGASGGASDSSSPSSLSLVHSCFLFTDGQATD 357 Query: 344 FKSNVNTI----KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 SN +I + + + T + + L + + +Y + + + Sbjct: 358 GPSNPASIIEGLQAAQAPSGQHVTVHTFGFGNGHSVELLQQVAEAQSGVYYYISCEEDIA 417 Query: 400 HVFQN 404 F + Sbjct: 418 CGFGD 422 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 61/186 (32%), Gaps = 22/186 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + V +GL Y + + ++ V + +L Sbjct: 249 QNFQKVKNFLYSVVSGLDVSSDRVRVGLAQYNDDIYPAFQL--NQHPLKSMVLEQIQNLP 306 Query: 289 L--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ + + +I +TDGE+N+ Sbjct: 307 YRTGGTNTGNALEF-----------IRTSYLTEGSGSRAKDRVPQIVILVTDGESND--- 352 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNI 405 + D+ KE+ I + + IN + Q L K E +N+ N + L +I Sbjct: 353 --EVQEAADQLKEDGIVVYVVGINV-QDVQELQKIASEPFEKFLFNIENFNILQDFSGSI 409 Query: 406 SQLMVH 411 Q + Sbjct: 410 LQTLCS 415 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 61/170 (35%), Gaps = 21/170 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ + + + +GL+ +++ ++ + + KV Y + T + A+ Sbjct: 1051 VNQSNIGTDSIQIGLLQFSSIPKEEFRLNRYSSKVDIYNAIFAVQQMRDGTRTGKALNFT 1110 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F + K Q+++I +TDG + N I ++ Sbjct: 1111 L--------------PFFESSKGGRPSVQQYLIVITDGVAQD-----NVILPAKALRDKN 1151 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I I + +LL+ +Y+V + + L ++ + I + Sbjct: 1152 IIIFAIGVG-EAKKSQLLEITNDEDRVYYDV-DFEVLQNLEKEILSKVCD 1199 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/307 (11%), Positives = 87/307 (28%), Gaps = 38/307 (12%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-----LLNQRYNQ 171 + + + + + + + + V E + + + Sbjct: 734 SSHGVQQIAVVITENFSQDSVSGPA-SRLRRAGVTIYAVGTQDVSESKDLEKMASYPPWK 792 Query: 172 KIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDV-----NSAPICQDKKR 225 V L +G + L +VD S+ E D Sbjct: 793 HSVPLESFLQLSIIGSKLTNQLCPEIVDSQVSVRGTSYPVQEGCVHIQKADIYFLIDGSG 852 Query: 226 TKMAALKNALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + +F++ I + + V G+I Y+ ++ S V D Sbjct: 853 SIYPEAFLDMKVFMNEVIKMFQIGPDRVQFGVIQYSDEIKSKFVLSQYPTVAELKVAIDN 912 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + A+ Q+ + +++I +TDG++++ Sbjct: 913 IQQGGGGTTTGEALNNMTQVF----------------ADTARINVARYLIVITDGKSSDP 956 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ + + + + I I + N L + + + V D L + + Sbjct: 957 VAD-----AAEGLRASGVIIYAIGV-REANIDELKEIA---KDKIFFVYEFDLLKDIQKE 1007 Query: 405 ISQLMVH 411 + + + Sbjct: 1008 VVRDICS 1014 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 50/186 (26%), Gaps = 20/186 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ + + + +GL Y+ S K Sbjct: 46 QHTRSVRNFLYILVNSFNVSSKTIRVGLAQYSDVPHSEFLLSTYHRKNDVLRHIRQFQFK 105 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A++ +F S + + ++ G + Sbjct: 106 PGGKKMGLALQF-----------ILDHHFQEAAGSRASQGVPQIAVVMSSGPAEDHVHGP 154 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNIS 406 + + I + + + + + L+ SSP+ ++ V + L + Q + Sbjct: 155 -----GEALRRAGILVYAVGVKDAVWAE--LREIASSPQENFTSFVPSLSGLSSLAQKLR 207 Query: 407 QLMVHR 412 Q + Sbjct: 208 QELCDT 213 >gi|211546|gb|AAA48695.1| cartilage matrix protein [Gallus gallus] Length = 416 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 43/380 (11%), Positives = 108/380 (28%), Gaps = 44/380 (11%) Query: 48 YKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 K + A + + +IS + N + ++ Sbjct: 20 QTKAELLQAVQR-----IEPLSTGTMTGLAIQFAISRAFSDTEGARLRSPNINKVAIVVT 74 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 ++ + I + + R+D + + DY + Sbjct: 75 DGRPQDGVQDVSARARQAGIEIFAIGVGRVDMHTLRQIASEPLDDHVDYVESYSVIEKLT 134 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRT 226 Q+ + G+ I + S C + D + C + Sbjct: 135 HKFQEAFCVV--SDLCATGDHDCEQICISTPGSYKCACKEGFTLNNDGKTCSACSGGSGS 192 Query: 227 KMAAL------KNA-------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + K+ + F++ I + L ++ +GL+ Y++ V + Sbjct: 193 ALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSEKQAQVGLVQYSSSVRQEFPLGQF 252 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K + + K T + A+K S K Sbjct: 253 KNKKDIKAAVKKMAYMEKGTMTGQALKYLVDSSFSIANG-------------ARPGVPKV 299 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHY 390 I TDG + ++ ++ KAK+ ++ + + + L+ S P E+++ Sbjct: 300 GIVFTDGRSQDYITD-----AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYF 352 Query: 391 NVVNADSLIHVFQNISQLMV 410 + ++ ++ + + + Sbjct: 353 YTADFRTISNIGKKLQMKIC 372 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 43/380 (11%), Positives = 108/380 (28%), Gaps = 44/380 (11%) Query: 48 YKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 K + A + + +IS + N + ++ Sbjct: 97 QTKAELLQAVQR-----IEPLSTGTMTGLAIQFAISRAFSDTEGARLRSPNINKVAIVVT 151 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 ++ + I + + R+D + + DY + Sbjct: 152 DGRPQDGVQDVSARARQAGIEIFAIGVGRVDMHTLRQIASEPLDDHVDYVESYSVIEKLT 211 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRT 226 Q+ + G+ I + S C + D + C + Sbjct: 212 HKFQEAFCVV--SDLCATGDHDCEQICISTPGSYKCACKEGFTLNNDGKTCSACSGGSGS 269 Query: 227 KMAAL------KNA-------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + K+ + F++ I + L ++ +GL+ Y++ V + Sbjct: 270 ALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSEKQAQVGLVQYSSSVRQEFPLGQF 329 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K + + K T + A+K S K Sbjct: 330 KNKKDIKAAVKKMAYMEKGTMTGQALKYLVDSSFSIANG-------------ARPGVPKV 376 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHY 390 I TDG + ++ ++ KAK+ ++ + + + L+ S P E+++ Sbjct: 377 GIVFTDGRSQDYITD-----AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYF 429 Query: 391 NVVNADSLIHVFQNISQLMV 410 + ++ ++ + + + Sbjct: 430 YTADFRTISNIGKKLQMKIC 449 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 34/282 (12%), Positives = 93/282 (32%), Gaps = 37/282 (13%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 ++ +T L + + + +++ + P ++ + + Sbjct: 34 TSVTSITDLTPDLDYSVSINSFDGSEESI---PIFGQLTIQSGNSSDRVRRPTDTIKCSV 90 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVE 264 + +D + ++ K + F+ S+ ++ + ++ Y+T Sbjct: 91 SAITDLVFLVDGSWSVGRENFKH------IRSFIASLAGAFDIGEDKTRVAVVQYSTDTR 144 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + T + + T + A+ Q + ++ S Sbjct: 145 TEFPLTRYTRRGDLLQAINSLPYKGGNTMTGDAIDYLLQNIFTEAGGS------------ 192 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCV 383 F K + +TDG++ + + + + ++I + I A + L+ Sbjct: 193 -RKSFPKVAMIITDGKSQDP-----VEEHARRLRNIGVEIFVLGIKGADEDE---LREIA 243 Query: 384 SSP--EYHYNVVNADSLIHVFQNISQLMVH---RKYSVILKG 420 S+P ++ YNV N D + V + I + + + S ++ G Sbjct: 244 STPHSKHMYNVPNFDKIQEVQKKIIREVCSGVDEQLSSLVSG 285 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 55/174 (31%), Gaps = 30/174 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMK 299 ++ V + L+ Y+ T V R + + T++ AMK Sbjct: 422 VNSFDIGPSKVQISLVQYSRDPHTEFAL--NTHHDINAVVRAVRTFPYRGGSTNTGKAMK 479 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + + + ++ +TDG KS+ + + Sbjct: 480 YVKDKIFVASRGARQN-------------VPRVMVLITDG-----KSSDSFKDAATNLRN 521 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ + P + + V + FQ IS+ + Sbjct: 522 IDVEIFAVGVKDAVRSE--LEAIANPPADNHVFEVEDF----DAFQRISKELTQ 569 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 63/170 (37%), Gaps = 16/170 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ S E G+I ++T V+ + K + T++ A+ + Sbjct: 881 VNQSSVGPELTRFGVITFSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYS 940 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q ++ +L + ++ +TDG + ++ Sbjct: 941 LQYF------------GKEHGGRAALKVPQILMVITDGAAQEPSKLPGPSEA---LRKQG 985 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + +I + + + ++L + P + V D+L +++NIS+++ + Sbjct: 986 VSVFSIGV-KNASREQLDIMAGNDPSRVFFVDTFDALETLYKNISKVLCN 1034 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 57/178 (32%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D I+ ++ V++GL +++ + + ++ T Sbjct: 1067 KFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYLNKFFDEQAISAHIKDMQQEEGGTLIG 1126 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + + + ++ +TDG++ + + Sbjct: 1127 LALNSIRKYFEAS------------HGSRKAEGISQNLVLITDGDSQDD-----VEEAAR 1169 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +++ I I + + L +PE + V N D L + Q + + K Sbjct: 1170 LLRGLGVEVFAIGIGNVHDLELL--QIAGTPENVFTVKNFDKLEGIHQKVVDTICQSK 1225 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L+ + V +G++ Y + T+ + T++ Sbjct: 480 KFILEFLQTFRVGPNHVRIGVVKYADSPTLEFDLHTYTDVKSLEKAITNIHQVGGGTETG 539 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + V +++++ +TDG + + + D Sbjct: 540 KALDFMRPQFD-------------RAVTTRGHKVKEYLVVITDGNSTDKVKDP-----AD 581 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 K + + + I + + LL+ P+ + V N D+L + +I + Sbjct: 582 KLRAQGVVVYAIGV-KDAVEKELLEIS-GEPQRTFYVNNFDALKPIKDDIITDICST 636 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 63/178 (35%), Gaps = 22/178 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ S+ + K+ V++G++ Y+T + K + D I T Sbjct: 673 MKDFMKSLVQKSNIGKDQVHVGVLQYSTEQKLVFPLIQYYTKDQLSKAIDDMQQIGGGTH 732 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + + ++ ++ +TDGE+ + Sbjct: 733 TGEAIAVVSKYFDAQNGG--------------RPDLKQRLVVVTDGESQDDVKLP----- 773 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + +I + + N +LL+ + Y + D+L + + ++ + Sbjct: 774 AEALRAKGVIVYSIGV-VAANTSQLLEIS-GDADRMYAERDFDALKDLEKQMALEICD 829 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 15/167 (8%), Positives = 47/167 (28%), Gaps = 19/167 (11%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A+ + +G + + + + EK+ + T+ Sbjct: 29 AVSAQPGLGPQRQPRQHPDRLGSVS-SDEPHQEFLLKYHIEKMNLLAAFESFPYRNGGTE 87 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + + K + + +TDG++ + Sbjct: 88 TGKAINFLRKQYFTKKAG-----------SRADQRVPQIAVVITDGDSTDDVVVP----- 131 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLI 399 + +++ + + I + + N L E + + + +L Sbjct: 132 ARELRKHGVIVFAIGVG-NANQGELKSIANRPSERFKFTIDSFQALK 177 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 64/212 (30%), Gaps = 46/212 (21%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 +++ A + F+ + + +GL+ + + + E + + Sbjct: 116 VGGTSRLDAAREVSRNFV-------LRRSNDRIGLVVFRGKGYTQCPLTLDHEVLAMLLD 168 Query: 282 RDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R +I T A+ A L S K +I +TDGE Sbjct: 169 RLSPGVIQDDGTAIGTAILIAVNRL------------------KASESLHKVLILVTDGE 210 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTI--SINA---------------SPNGQRLLKTCV 383 NN + A + ++I I + + L Sbjct: 211 NNAGEV--GPGTAASIAARSGVRIYVINAGFKVVEDRIDPPEESGRYIQKDEESLQGIAR 268 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ ++ V + + ++I + + ++++ Sbjct: 269 TTGGGYFRVEDPAAFDQTIRSIDR-LEKKRFT 299 >gi|148657117|ref|YP_001277322.1| peptidase M23B [Roseiflexus sp. RS-1] gi|148569227|gb|ABQ91372.1| peptidase M23B [Roseiflexus sp. RS-1] Length = 982 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 34/384 (8%), Positives = 92/384 (23%), Gaps = 47/384 (12%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV--- 112 A A+ ++ +N+ DR S+S K +A R + ++ Sbjct: 329 ATEASAVSYNGASNANVILTLDRSGSMSTDNKMPAAHNAARQFVDLMQVGDGVGVVGFDD 388 Query: 113 -FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 ++ + + + ++S + + + Sbjct: 389 RVTTAFPLTVITDPPPLSSLIFTDTMESG---TGKWIPDPPWGLTSVAYRGSAAWTDSP- 444 Query: 172 KIVSFIPALLRIEM--GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ + + + Sbjct: 445 ----AGNYANNANSVLAIADPIVLPASLTTPALSFWHRYDIENYFDYGRVEVSTDNGATW 500 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-------------TEKV 276 A + I R+ N + + Sbjct: 501 QSLAAYTGVNTTWSRAVIDLSPYRGQTIRLRFRLTTNAYLTRDGWYIDDVTVGPKWVDAR 560 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T ++++ Q+L++ + I+ L Sbjct: 561 ADAIAAIGTLTPRGSTSIGGGLQRSQQLLSASAPGR-----------------TRAIVLL 603 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG+ N + + + + I + TI + + Q +L + + D Sbjct: 604 SDGQENTA---PYVSDVLPQIRASQITVHTIGLGTDADQQLMLSIAAQTGGTYNYAPRPD 660 Query: 397 SLIHVFQNISQLMVHRKYSVILKG 420 L ++ IS + +R+ + + G Sbjct: 661 QLAGIYNTISGAVSNRQTLITIDG 684 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 56/400 (14%), Positives = 113/400 (28%), Gaps = 81/400 (20%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA+ A A + L + + A + A + + + A Sbjct: 9 AAVGAAAGRSGKRLVAVVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + ++ + L + + +Y L Sbjct: 69 LSTEGTN----GVPDDPTLSAPAREKTVTANGDGTYKVALNVTG--------------AK 110 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + I LV+D+SGSM ++ P K+ ALK A+ F Sbjct: 111 SAGTGAIVTNQPLDIVLVLDVSGSMADNLSGGP--------------KKIDALKTAVNGF 156 Query: 239 LDSIDL----LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 +D+ ++ + + L+ + + ++ + E Y + ++ T Sbjct: 157 IDATADENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYD 213 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + L++ S F R P +K +IF TDGE N+ T+ Sbjct: 214 VSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTV 273 Query: 352 KI-----CDKAKENFIKIVTISINASPNG------------------------------- 375 K+ I +I + + N Sbjct: 274 AATAVNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWG 333 Query: 376 ---QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L + Y+ +A L ++F++I Q + Sbjct: 334 KSWNANLGDRAETSSYYKAATDAGQLNNIFESIYQEITKT 373 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 76/266 (28%), Gaps = 60/266 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + I L +D+SGSM + +++ A + A FL Sbjct: 76 MAVITLPSNQQTIMLAIDVSGSMRATDV---------------QPSRLVAAQTAAKAFLT 120 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + V +GL+ + + P+ E + + R T + Sbjct: 121 EL------PRTVKVGLVAFAGSAQVAQIPTVNREDLVSAIDRFQLQR---GTAIGNGIVM 171 Query: 301 AYQILTSDKKRSFFTNFFRQGVKI------------------PSLPFQKFIIFLTDGENN 342 + L D + + + P II LTDG+ Sbjct: 172 SLATLFPDAGIDLQSMQSGRERQRGFAIDQEKKEAKEFTPVAPGSYNSAAIILLTDGQRT 231 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQRLLKTCVSSPE 387 + + A + +++ T+ I + + L ++ Sbjct: 232 TGVDS---LDAAKLAADRGVRVYTVGIGTVDGETIGFEGWSMRVRLDEETLKGIARATQA 288 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRK 413 ++ A L V++ +S + K Sbjct: 289 EYFYAGTATDLKKVYETLSSRLTVEK 314 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 75/258 (29%), Gaps = 44/258 (17%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + +++ S + +R + IE+++D SGSM ++ Sbjct: 132 GRVEDEEKKSNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGK--------------- 176 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT------TRVEKNIEPSWGTEKVRQY 279 KM K A+ F+ + +V VY + + Sbjct: 177 MKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENVYTLQKYNQTTL 236 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T A+K++ + S K + + ++DG Sbjct: 237 RQSLDGFQPVGWTPLAEAIKRSTETFQSAKAND-----------------KNIMYVVSDG 279 Query: 340 ENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVS-SPEYHYNVVNAD 396 N ++ K + IK + I ++ LK S + NA Sbjct: 280 VETCG---GNPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLANNAK 336 Query: 397 SLIHVFQNISQLMVHRKY 414 L F+ + + R+ Sbjct: 337 ELQDQFKETGKDITSRRL 354 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + D +++AA K+A F+ ++ + ++ + + P+ Sbjct: 102 DISQSMMATDVSPSRLAAEKDAATKFVAALPAQ------YNVSVVTLSGHPNTLVPPTTD 155 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V Q + + T ++ + L + Sbjct: 156 RAPVNQGIKTLELAD---GTAIASSIDVGLEAL------------KQAPAGDDGKQAPGL 200 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 ++ L+DG + DKAK+ + I TI+ +P+ L Sbjct: 201 MVLLSDGSETGGGD---PVASADKAKQQNVPIYTIAFGTQNGYVDLDGQRFNVAPDTDML 257 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +S + +A L V++ ++ + Sbjct: 258 KRIADASSGKALDAASASQLDDVYKTLTSDVG 289 >gi|221115612|ref|XP_002166292.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 484 Score = 73.0 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 61/175 (34%), Gaps = 19/175 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + ++ + K + G+I +++R NI+ + ++ + D + T Sbjct: 164 IEFLINLAKTFNISKNGAHAGVITFSSRAVLNIKLNQYYDQEQFEFAIDEIPYMGYVTRI 223 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIK 352 A+++A ++ + + LTDGE + + N Sbjct: 224 DLALRKALEMFDEING--------------ARKDIPQLLFLLTDGEQYAGDGVVDENPAN 269 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL--IHVFQNI 405 + ++ I+I+ + I + N L SS + N D L + + I Sbjct: 270 VAQLVRDRGIEIIAVGIGSGVNQSELNSIAGSSE-KVFLAENFDELINKNFLKRI 323 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 63/225 (28%), Gaps = 43/225 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ERP + V+D+SGSM + ++ + AL D Sbjct: 175 AERQSERPPAALTFVIDISGSM-------------------GEPGRLDLAQEALGTMTDR 215 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + L+ ++ E + + + + T+ M+ Sbjct: 216 LRDDDS------VALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNLGAGMETG 269 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKE 359 Y+ + ++ ++D N ++ +I + +E Sbjct: 270 YETAVEGR----------------REGATNRVVLVSDALANTGDTDADTILERIATERRE 313 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + + + + + + + V + VF Sbjct: 314 HGITLFGVGVGSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 63/225 (28%), Gaps = 43/225 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ERP + V+D+SGSM + ++ + AL D Sbjct: 175 AERQSERPPAALTFVIDISGSM-------------------GEPGRLDLAQEALGTMTDR 215 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + L+ ++ E + + + + T+ M+ Sbjct: 216 LRDDDS------VALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNLGAGMETG 269 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKE 359 Y+ + ++ ++D N ++ +I + +E Sbjct: 270 YETAVEGR----------------REGATNRVVLVSDALANTGDTDADTILERIATERRE 313 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + + + + + + + V + VF Sbjct: 314 HGITLFGVGVGSDYGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 69/213 (32%), Gaps = 43/213 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ D T++ A K + + + S+ G++ + + S Sbjct: 94 DVSASMSASDYSPTRVEAAKGSSEILIRSLSESDTA------GVVIFESGASSAAYLSSD 147 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +V + + S+ T + A ++T Sbjct: 148 KNRVVSRLEQV--SVKTGKTALGDGLALAVDMVT------------------AIPAGTYI 187 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQR 377 ++ L+DG +N+ + + AK + + + TI + + + + Sbjct: 188 VVLLSDGVSNSGMIT--PQEAAEYAKNSGVVVYTIGVGSESPVEVSSDGVQQYASLDEET 245 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 L + ++ V+ +L+ + I ++ Sbjct: 246 LRSIAEITGGEYFRSVDEKTLVQIQNTIQTSII 278 >gi|224824215|ref|ZP_03697323.1| conserved hypothetical protein [Lutiella nitroferrum 2002] gi|224603634|gb|EEG09809.1| conserved hypothetical protein [Lutiella nitroferrum 2002] Length = 421 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 89/363 (24%), Gaps = 64/363 (17%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + RF + +K +II L ++ + +G + + K + A++A LA Sbjct: 1 MKRFAQLPTR--RHQKGVAAIIVGLCIVVLIGFLGLVADLGRLFITKTELSDASDACSLA 58 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS---AVFYNTEIQN 120 A+++ + L + +R +D + + ++ +Y Sbjct: 59 AAAELKGDAESLSRAESAGITVGQRNKVDFQANNVTIVPNQDVTFSDHLNGTYYTKNAVA 118 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY--NQKIVSFIP 178 N + + L + S Sbjct: 119 PANIANMKYAKC------------------------TLPRTGIMPWFMQVMGAGAQSVTS 154 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + I V S + + + ++ A F Sbjct: 155 QAVASLSPSQTNCAIP-VGLCSQTPPASCPDGSAPDSYGFCVGKWYGSRF----TAGGGF 209 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-----RDMDSLILKPTD 293 S +L+ + PS G ++ +T + + T Sbjct: 210 TGSFNLIDYS-------------------PPSGGASELSGQLTGAGQCNLNVTNPVGQTG 250 Query: 294 STPAMKQAYQI---LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 ++ A+ L G ++ + DG + + + Sbjct: 251 MQQSIANAWNTRFGLYQGSYNVSNAPPDFTGYAYTAISWPSQYNAY-DGTDGTRPNFQAS 309 Query: 351 IKI 353 Sbjct: 310 RTA 312 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 80/236 (33%), Gaps = 30/236 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D S SM ++ + D+ ++ + +AL + + Sbjct: 1467 ATVTTKYPVDLVFVIDKSLSMDYDIDGNEIKWWD-----DETESRKDIVNDALEEIIPDL 1521 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + D+ + ++ + ++ S + +Q ++ + T+ + A+ A Sbjct: 1522 CSQQY---DIQIAGYQFSGSSTRVLDWSR---EEQQVLSGLKIARTSSSTEPSQALADAL 1575 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +L + +K++IF+TDGE + Sbjct: 1576 DMLKTG-----------SPAHRNQSNVKKYLIFMTDGEPTEPEDWSY--NAVRNHAVPGA 1622 Query: 363 KIVTISINASPNGQRL--LKTCVSSPEYH----YNVVNADSLIHVFQNISQLMVHR 412 I TI +++ + + +++ S + + +A + F I ++ Sbjct: 1623 SIYTIGVSSDASTNLMEGIRSTALSNGMYAPATFKGTSAQLIRDAFTQIKDEIIST 1678 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 84/266 (31%), Gaps = 49/266 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + G++ + V+D SGSM + S D Sbjct: 1027 NGNGTYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDA-------- 1078 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHV------KEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + +AL LF + + + K + + + + T Sbjct: 1079 ----RREVVNSALELFFNQLSDGDYNIQFGGYKFSDSGERVNFNDQGWETEYWETDTSNA 1134 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + T + ++ A L + + ++++IFL Sbjct: 1135 LSHL-KLTSRETDGSTYPSQTLRSAISALENVELGENG---------------KRYLIFL 1178 Query: 337 TDGEN--NNFKSNVNTIKIC-DKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYH-- 389 TDGE N++ + + C K ++ I + A+ + +++ VS+ Sbjct: 1179 TDGEPGQNSYSFSEKEAENCYSAIKNLDSGTTFYAIQV-ANSDSHGFMESMVSNANSVDG 1237 Query: 390 -----YNVVNADSLIHVFQNISQLMV 410 + +AD L F ++ + Sbjct: 1238 VTAQKFVGNSADELNAAFSQMAAEIS 1263 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 20/249 (8%), Positives = 46/249 (18%), Gaps = 62/249 (24%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + + + S + Sbjct: 591 QQGLDIVLVIDLSNSMDDGISEDSSDSRLK-------------------------VLKDT 625 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + + K F+D + S ++ Y+T + Sbjct: 626 LGYYRTSYGSHGRPSTEGK--------EGFIDELFEQSPNS---RFSIVTYSTDASTELG 674 Query: 269 PS---WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + D T+ + QA ++L S P Sbjct: 675 WTEYGRNGSGQQTIKKAIGDLQANGGTNYEAGLYQAVEVLKERGNSSNIPVVIFLSDGKP 734 Query: 326 SLPFQ-----------------KFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIK 363 + + + DG + + + TI + E I Sbjct: 735 TYYYSDVDEFNGLETGWLPMIGNYDNREGDGGSIDTDTGDGTIFAAGRFHEEMEALQGIT 794 Query: 364 IVTISINAS 372 + Sbjct: 795 -YCVGFGIP 802 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 64/199 (32%), Gaps = 20/199 (10%) Query: 226 TKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD- 283 TKM LK F+ + + MGL+ + + + + + +++ Sbjct: 123 TKMDLLKEVTKGFVLGNKQEGLTGRPQDMMGLVTFARGAQVLAPLTLDHQAIIDQLSKLQ 182 Query: 284 -MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 L T A+ + ++ + + + K +I +TDG Sbjct: 183 YTTDLEQDGTAIGYAIYKTANLIAATRH---YAEELEGAGKPAYTIKNSIMILVTDGLQA 239 Query: 343 NFKSNV-------NTIKICDKAKENFIKIVTISIN-------ASPNGQRLLKTCVSSPEY 388 + + AK+ +K+ I++ S + + K + Sbjct: 240 PNPLDQGKEFRNVELLDAAVYAKKLGVKVYIINVEPRIASEEFSAHRLLMKKITELTGGR 299 Query: 389 HYNVVNADSLIHVFQNISQ 407 Y V N+ +L ++ I Q Sbjct: 300 FYMVDNSLNLSSIYSEIDQ 318 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 61/251 (24%), Gaps = 64/251 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 M + V+D S SM T++ A K + F + Sbjct: 81 SEAMYSSSGQALMFVIDTSPSMAAQDMGTE--------------TRLEAAKRIIKSFAEK 126 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 E +GL + I P+ + + + T + A Sbjct: 127 -------YEGDSLGLTALGSSAAVLIPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASA 179 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 LT II TDG+NN + + D K Sbjct: 180 VLHLTQY------------------STLPSHIILFTDGDNNTGEI--HPRAAADIIKHKK 219 Query: 362 IKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSL 398 I I + S N L K +++ + + L Sbjct: 220 IGFYIIGLGKSGYAPVKYIDPIQKKEISGTLNTVFNETELQKIAGYGNGRYFSAKSPELL 279 Query: 399 IHVFQNISQLM 409 +F Q + Sbjct: 280 TDIFNRFIQKI 290 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 61/181 (33%), Gaps = 19/181 (10%) Query: 237 LFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F + + ++ V G + Y+ E + + + + T + Sbjct: 775 NFTKHLVEKADVGRDRVQFGALKYSDTPEILFYL-NDYQNKKSIIDKLKFQRKGGNTYTA 833 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +++++ T Q ++ +I +TDG+++++ + + Sbjct: 834 KALNRSHELFT------------EQHGSRIKRGVKQMLIVITDGKSHDY---LELETVGK 878 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I I + + + L S + ++ V N + L ++ I + Sbjct: 879 ALRAKNIIIY--GVGVAEASDKELLDITGSKDNYFMVDNFEKLKEIYLPIEKKACENALD 936 Query: 416 V 416 V Sbjct: 937 V 937 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 60/178 (33%), Gaps = 20/178 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ I + + E V G + Y++ ++ S + V + T++ Sbjct: 403 NFMIEVIQMFNVAPEKVRFGAVQYSSTHQQEFGISTYSNVVDLSKAILNIRQLGHTTNTG 462 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + + S K K ++ +TDG +N+ + Sbjct: 463 AALRFMLEEIKSSK-------------KKQHDSVPCHLLVVTDGMSND-----EVSEPAK 504 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +E + I I + LK S + V N +SL + I Q + + Sbjct: 505 RLREEDVIIHAIGV--KGANIIELKDIAGSEDRVKFVYNFESLKGIKNEIVQTICRER 560 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 53/186 (28%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 K+ +K L L+ + + LI + + + Sbjct: 141 NSGSMSGEKIQNVKKTLEYLLELLGDND------RLCLILFNSYATRLCHLMKTNNSNKP 194 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + TD M+ A+++L K + + L+ Sbjct: 195 AFKEIINKIYSTGGTDINSGMELAFRVLKDR--------------KYQNPVSS--VFLLS 238 Query: 338 DGENNNFKSNVNTIKICDK-AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ + +++ + ++ + I + + +G + K C Y V + Sbjct: 239 DGQ--DGSADLRVRQSLERHLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKIN 296 Query: 397 SLIHVF 402 + F Sbjct: 297 QVDEFF 302 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 28/353 (7%), Positives = 84/353 (23%), Gaps = 33/353 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R + + ++ +II L++ + +G + + + ++ ++++ ++ Sbjct: 1 MSSAARHPKLTRHSLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSC 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ A + S +S + I+ + ++ + + Sbjct: 61 ALSAARDLTSAISLQVAEADGIAAGHANYA---FFQQNAVQMQTDSNVTFSDSLTNPFLT 117 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + ++ + L + + A+ Sbjct: 118 KTAVATPANVK--------YVKCTAQLSNIAHWFIEVLNT---IPGVQVANASQVAASAI 166 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + G+ + V + S + + + AAL Sbjct: 167 ATVGAGQTTCAIPVFVCKPASSAPYKVGDWISSPSGSSTTYGPGNFGWAAL--------- 217 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK- 299 + + P G+ ++ R ++ T+ Sbjct: 218 -----DGSTNEPTIASELSGNTCNITSPPDLGSTGLKAASLRAWNTRFGIYTNGANGSSG 272 Query: 300 ----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 Y + L F + DG Sbjct: 273 QPDFTGYAYVGPKYGPPGTAGIVGDAYTQFVLDRASFKAYQGDGTPPTGSGIS 325 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 58 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 117 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 118 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 177 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 178 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 217 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 218 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 264 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ T + I+ Sbjct: 265 AEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN----------------RILL 307 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + S + ++ + + + Sbjct: 308 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 367 Query: 394 NADSLIHVFQN 404 V + Sbjct: 368 TLSEAQKVLNS 378 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 73/221 (33%), Gaps = 48/221 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +DLSGSM + + +++AA+++ F+ + Sbjct: 88 DIVLALDLSGSMEREDF----------VLDGRTVSRLAAVQDVAAQFVR-------GRTG 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + R P+ V Q + + K T + A + L Sbjct: 131 DRVGLVVFGDRAYVAAAPTHDVGAVAQVIGTLQIGVSGKATAIADGLGLAIRRL------ 184 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN- 370 + II L+DG++ + + A+E +++ TI++ Sbjct: 185 ------------RGREAESRVIILLSDGQDTTGAVD--PVAAAQAAQELGMRVYTIALGP 230 Query: 371 ----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + L + + + V + + L V Sbjct: 231 ADLASSPDARDAVDSETLRRIAEVAGGETFRVRSTEDLEAV 271 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ T + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + S + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 58 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 117 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 118 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 177 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 178 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 217 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 218 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 264 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ T + I+ Sbjct: 265 AEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN----------------RILL 307 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + S + ++ + + + Sbjct: 308 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 367 Query: 394 NADSLIHVFQN 404 V + Sbjct: 368 TLSEAQKVLNS 378 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ T + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + S + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|109071570|ref|XP_001110086.1| PREDICTED: collagen alpha-1(XXI) chain-like isoform 2 [Macaca mulatta] Length = 959 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + I Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVIKQKLCEESV 216 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 66/193 (34%), Gaps = 23/193 (11%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDS 286 A+ +F+D + +G++ YT R+++ +++++ + Sbjct: 60 SEAMKMFIDMLSTTGD-----KVGVVSYTDRIQREKALLEIQSEADKTALKEFIDQLDR- 113 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD + + +A ++L + + + + ++ Sbjct: 114 --GPYTDISVGLDEAVKVLKQGMEPAHAPMIVVLADGNND--------LDPNTGKTSQEA 163 Query: 347 NVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + +AK + I I TI +NA N + L + + + +AD L + Sbjct: 164 SDHLNQAVQEAKGSGIPIYTIGLNADGKLNKEALAELANQTGGKSFTTSSADDLPQILSE 223 Query: 405 ISQLMVHRKYSVI 417 I K + Sbjct: 224 IFASHQQLKIVPV 236 >gi|46127789|ref|XP_388448.1| hypothetical protein FG08272.1 [Gibberella zeae PH-1] Length = 774 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/302 (13%), Positives = 80/302 (26%), Gaps = 44/302 (14%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS----YDYRLQFIEHLLN-- 166 +T I + T L S + + + Sbjct: 2 VASTVTSYIHIPTNTPQTVPLFRSLVSGSKSTKEEPIKSEDAQEPIAIISDKDATIRLEP 61 Query: 167 --QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 R I P + + P I LV+D+SGSM ED SA + Sbjct: 62 VPSRNGLLIKIETPKEPSVNIPHVPC-DIVLVIDVSGSMGQPAPVPGEDQESAGLSV--- 117 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 + K+A ++S++ + ++ + ++ + + + + Sbjct: 118 ---LDLTKHAARTIIESMNEND------RLSIVTFASKAKVLQPLLPMNQDNKTRAIKNV 168 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ M +A + +D+ I+ LTDG N Sbjct: 169 KSMEPRDATNLWQGMLEAIKQFNTDESSPNVPA----------------IMILTDGMPN- 211 Query: 344 FKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + K + I T S L + + +A + Sbjct: 212 --HMNPAVGFVPKIRNMGPLPASIHTFGFGYSLKSDLLKSIAEIGNGNYAFIPDAGMIGT 269 Query: 401 VF 402 VF Sbjct: 270 VF 271 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 64/237 (27%), Gaps = 64/237 (27%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM T+ A K+A+ F++ +E Sbjct: 92 DIVFVLDQSPSMIVRDFGGD--------------TRFDAAKHAIRTFVE-------GREH 130 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GL+ + P+ + + + + M A L Sbjct: 131 DPLGLVIFGDEAALVTPPTLDYTSFLSRMDAVRVMKLGRGSALGLGMAVATVHLEKSSAE 190 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K ++ ++DGENN + A I+I + + Sbjct: 191 R------------------KVMVIVSDGENNAGEIT--PESAARVAASLGIRIYAVGVGG 230 Query: 372 SPNGQR----------------------LLKTC-VSSPEYHYNVVNADSLIHVFQNI 405 + LLK S+ + + +L VF+ I Sbjct: 231 EGSVATEFTDPETGKSYRGTYEGKIDMELLKAVTESTRGQAFLAGSPGALSQVFREI 287 >gi|253701795|ref|YP_003022984.1| von Willebrand factor A [Geobacter sp. M21] gi|251776645|gb|ACT19226.1| von Willebrand factor type A [Geobacter sp. M21] Length = 368 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 20/221 (9%), Positives = 46/221 (20%), Gaps = 10/221 (4%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA----AILAGASKMVSNLSR 74 K ++ A + + L + + K + +A +A A A A + Sbjct: 9 KGQILLVVAAVMFFGIFLAALAVDAGRAYGVKAKLHAAVDAASYEAAKALAHGEDEDDME 68 Query: 75 LGDRFESISNHAKRALIDDAKRFIKN-HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ S + + + Sbjct: 69 EKASEAARDYFRANFPAAYFGAQCSGPELELSERKSGKKMRALTVSATATLPNVFAGILG 128 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + +D + D L+ + L R + L+ Sbjct: 129 WDSIDLPAQSRAVRTDA--DVVLVLESSDVLRESFPQ-VKQRVANFSDRFSQHYDRMSLV 185 Query: 194 ELV--VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 D S+ + + K K Sbjct: 186 TFAAGADPVISICGVYKPAKDRPGAGTFNCGSGYQKSNFAK 226 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 53/186 (28%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 K+ +K L L+ + + LI + + + Sbjct: 129 NSGSMSGEKIQNVKKTLEYLLELLGDND------RLCLILFNSYSTRLCHLMKTNNSNKP 182 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + TD M+ A+++L K + I L+ Sbjct: 183 AFKEIINKIQATGGTDINSGMELAFRVLKER--------------KYQNPVSS--IFLLS 226 Query: 338 DGENNNFKSNVNTIKICDK-AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ + +++ + ++ + I + + +G + K C Y V + Sbjct: 227 DGQ--DGSADLRVRQSLERHLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKIN 284 Query: 397 SLIHVF 402 + F Sbjct: 285 QVDEFF 290 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 42/140 (30%), Gaps = 12/140 (8%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 +K I E +++FALS+ + L + ++ + +A ++A+LAGA ++ N Sbjct: 5 RKLIGGEGGTAAVLFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQELPGN 64 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + D F S + Sbjct: 65 PD--AALAVARTYAEANGVTDAEAVF----------SVSEDGRQIVGEATRRVGLYFARV 112 Query: 132 MANNRLDSSNNTIFYNMDVM 151 + + + + Sbjct: 113 LGFTQGEVKARAAARVGALT 132 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 95/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMIKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 64/211 (30%), Gaps = 26/211 (12%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M + + + K + + + + Y+ ++ + T Sbjct: 1 MSDLIFLVDTSGSLQYWSGGGWKNGFDDEKVFVNSL---LSHIRVSYKSTYVSVVLFGTS 57 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLI----LKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 +I + D T+ A + AY I+ Sbjct: 58 ATIDINYIFNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIF----------GK 107 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 G K P+ + + LTDG+ N N + I + K+ I+I TI + N L Sbjct: 108 YSGHKRPTHQVKTAVFLLTDGQWNW---NGDPWPIAKRLKDRGIEIFTIGVTNGVNVNTL 164 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 SP +++ + F+ ++ + Sbjct: 165 RSLA--SPNNYFHYNDFTQ----FRELATCI 189 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 80/255 (31%), Gaps = 45/255 (17%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ P L+ ++ RP + V+D SGSMH Sbjct: 20 SVTLDVLIKIEPPLVELDNNARPPLNLGFVLDKSGSMH--------------------GN 59 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K A+ ++ + + L + T+VE I + T+K ++ + Sbjct: 60 KLDYAKQAIAYAIEQLLPSD------RLSLTLFDTQVETKIPSTLATDK-QRLLETIKLI 112 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + + G + +I L+DG N ++ Sbjct: 113 RSGSSTALHDG---------------WVQGGIQVGQ-YLNNDHLNRVILLSDGLANVGET 156 Query: 347 NVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-N 404 N + I + I + + + L S +++ + + L +F+ Sbjct: 157 NPDVIASDVHGLMKTGISTSALGVGRDYDEDLLEAIARSGDGNFFHIASPEDLPQIFETE 216 Query: 405 ISQLMVHRKYSVILK 419 + L SV LK Sbjct: 217 LQGLATTIGRSVTLK 231 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 28/245 (11%), Positives = 67/245 (27%), Gaps = 54/245 (22%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + +R + V+D+SGSM Sbjct: 44 SMAPHDQELLLELRGSSS----STDRAGLDLVAVIDVSGSMD------------------ 81 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 ++ +K AL + + L + ++ + T + + T + + Sbjct: 82 --GDRIDKVKTALQFVIRKLSDLD------RLCIVTFCTNATRLCPLRFVTAAAQAELKA 133 Query: 283 DMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ ++ + ++ ++DG Sbjct: 134 LVDGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAV---------------SVMLMSDGYQ 178 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIH 400 N+ + + T S AS + L S V ++ +L Sbjct: 179 NHGGD-------ARDVHLKNVPVYTFSFGASHDSNLLEAIARKSLGGTFNYVADSANLTG 231 Query: 401 VFQNI 405 F + Sbjct: 232 PFSQL 236 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 80/261 (30%), Gaps = 50/261 (19%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + +++ S + +R + IE+++D SGSM ++ Sbjct: 153 GRVEDEEKKSNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGK--------------- 197 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKNIEP-SWGTEKV 276 KM K A+ F+ + +V VY +E + Sbjct: 198 MKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQTTF 257 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ + + T A+K++ + S K+ + I + Sbjct: 258 RQSLDGF---QPVGWTPLAEAIKKSTETFQSAKEND-----------------KNIIYIV 297 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 +DG N ++ K + IK + I ++ LK S + Sbjct: 298 SDGVETCG---GNPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLAN 354 Query: 394 NADSLIHVFQNISQLMVHRKY 414 +A L F+ + + R+ Sbjct: 355 SAKELQDQFKETGKDITSRRL 375 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 80/261 (30%), Gaps = 50/261 (19%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + +++ S + +R + IE+++D SGSM ++ Sbjct: 153 GRVEDEEKKSNGTKGEQKNKEDRGNYNIEILLDASGSMAGKIDGK--------------- 197 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKNIEP-SWGTEKV 276 KM K A+ F+ + +V VY +E + Sbjct: 198 MKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQTTF 257 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ + + T A+K++ + S K+ + I + Sbjct: 258 RQSLDGF---QPVGWTPLAEAIKKSTETFQSAKEND-----------------KNIIYIV 297 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 +DG N ++ K + IK + I ++ LK S + Sbjct: 298 SDGVETCG---GNPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLAN 354 Query: 394 NADSLIHVFQNISQLMVHRKY 414 +A L F+ + + R+ Sbjct: 355 SAKELQDQFKETGKDITSRRL 375 >gi|226314649|ref|YP_002774545.1| hypothetical protein BBR47_50640 [Brevibacillus brevis NBRC 100599] gi|226097599|dbj|BAH46041.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 513 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 38/339 (11%), Positives = 97/339 (28%), Gaps = 57/339 (16%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + K + K+ + + Y I + + + + Sbjct: 72 NDMYFKDYGTNQFVSTAKDRLSTFAADVDTASYTIMRHFIKDGNLPPAEAVRVEEF-INF 130 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 Y ++ + + N +IV + ER + V+D+SG Sbjct: 131 FPTSYPAPTNQTFAIQADSGPS--PFQKNLQIVRIGIKGKELSPKERKPANLVFVIDVSG 188 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 SM+ ++ +K +L + +D + +G++ Y + Sbjct: 189 SMNQE-------------------NRLELVKKSLHVLVDQLQPTDS------VGIVVYGS 223 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + P+ +K + ++ + T++ + Y++ K Sbjct: 224 EGRVLLPPTSTEDK-QAILSAIDELQPEGSTNAEQGLVLGYEMAARSFK----------- 271 Query: 322 VKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 P +I +DG N + + I D A++ I + + ++ Sbjct: 272 -----PPAINRVILCSDGVANVGETGAEGILRSIEDYARK-DIYLSSFGFGMGNYNDVMM 325 Query: 380 KTCVS-SPEYHYNVVNADS--------LIHVFQNISQLM 409 + + + + L Q I++ + Sbjct: 326 EQLANKGEGSYAYIDTFSEARRIFTESLTGTLQTIARDV 364 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 72/230 (31%), Gaps = 44/230 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + ER +I L +D S SM K KM A+ +A ++ + Sbjct: 35 NVEERKPLVIGLAIDKSWSM--------------------KGEKMEAVIDASCALVNWLT 74 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V ++ Y+ V+ + TEKV + + + T+ + A + Sbjct: 75 RHDAVS------IVAYSADVQLIQPVTHLTEKV-SVTDKIRNIQVATSTNLSGGWLSALK 127 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFI 362 L K K ++ LTDG + + + I I Sbjct: 128 SLNQSKI----------------PNAYKRVLLLTDGNPTSGIKDKEALVTIAADHLSMGI 171 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 TI + N + L++ + Y + N ++ +F + Sbjct: 172 STTTIGVGNDFNEEMLVEIAKAGGGNFYYIDNPENASDIFFEEFGDIGAL 221 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 76/244 (31%), Gaps = 51/244 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + LVVD+S SM +++ A + A F +++ Sbjct: 85 PRNRATVMLVVDVSLSMESTDV---------------SPSRLEAAQQAATTFANNLTPG- 128 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V +GL+ Y + P+ V + V R + T + A+ A Q +T Sbjct: 129 -----VNLGLVSYAGTASMLVAPTTDRGPVVRAVDRLSL---DERTATGEAIYTATQAIT 180 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 + F + + P I+ L+DG+ ++A E I Sbjct: 181 T----------FTESLGGPDQAPPARIVLLSDGKETVPADPTEERGAFTAAERAAEAGIP 230 Query: 364 IVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + TIS + L S + + L V++ + + + Sbjct: 231 VSTISFGTLYGTVDIQGRPQPVPVDDASLRTIAELSGGDFFTASTLEELDSVYRTLEEQI 290 Query: 410 VHRK 413 + Sbjct: 291 GYEW 294 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 51/396 (12%), Positives = 107/396 (27%), Gaps = 75/396 (18%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A + + + + + + S + Sbjct: 9 AAVGAAAGRKGKRLVAIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 124 SSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S + ++ L + + +Y L + Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANEDGTYTVALNVTG--------------AKSAGT 114 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD-- 240 E+ I LV+D+SGSM + S + TK+ +LK A+ F++ Sbjct: 115 GEIVTNQPLDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINAT 163 Query: 241 --SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + L+ + + ++ + E Y + ++ T + Sbjct: 164 AAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGL 220 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-- 353 L++ S F R P +K +IF TDGE N+ T+ Sbjct: 221 TSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATA 280 Query: 354 ---CDKAKENFIKIVTISINASPNG----------------------------------Q 376 K+ I +I + + N Sbjct: 281 VNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWN 340 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L + Y+ +A L ++F++I Q + Sbjct: 341 ANLGDRAETSSYYKAATDAGQLNNIFESIYQEITKT 376 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 72.2 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 51/396 (12%), Positives = 107/396 (27%), Gaps = 75/396 (18%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A + + + + + + S + Sbjct: 9 AAVGAAAGRKGKRLVAIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 124 SSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S + ++ L + + +Y L + Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANEDGTYTVALNVTG--------------AKSAGT 114 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD-- 240 E+ I LV+D+SGSM + S + TK+ +LK A+ F++ Sbjct: 115 GEIVTNQPLDIVLVLDVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINAT 163 Query: 241 --SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + L+ + + ++ + E Y + ++ T + Sbjct: 164 AAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGL 220 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-- 353 L++ S F R P +K +IF TDGE N+ T+ Sbjct: 221 TSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATA 280 Query: 354 ---CDKAKENFIKIVTISINASPNG----------------------------------Q 376 K+ I +I + + N Sbjct: 281 VNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWN 340 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L + Y+ +A L ++F++I Q + Sbjct: 341 ANLGDRAETSSYYKAATDAGQLNNIFESIYQEITKT 376 >gi|238060066|ref|ZP_04604775.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881877|gb|EEP70705.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 316 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/234 (11%), Positives = 67/234 (28%), Gaps = 53/234 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D+S SM ++ A + A F+ + + Sbjct: 87 ATVMLAIDVSLSMQADDV---------------APNRLEAAQEAAKQFVAEL------PQ 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + P+ + V + + + T + A+ Sbjct: 126 TYNLGLVSFAKSANVLVPPTKDRDAVTTAIDGLVLAEA---TATGEAVFT---------- 172 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ L+DG + +S + A+ + + TI+ Sbjct: 173 --CLEAIRSVPADGAAGIPPARIVLLSDGFRTSGRS---VEEAAAAAQAANVPVSTIAFG 227 Query: 371 ASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + ++ Y Y + L V+Q++ + Sbjct: 228 TDAGQVDIGGQLQRVPVDRMALAELAETTEGYFYEAASVSELKQVYQDMGSSIG 281 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 68/190 (35%), Gaps = 23/190 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K K + ID L + + L+ Y T + K E S +++++ + Sbjct: 590 GSDSVKADNFKKLKTWTGEFIDKLGVQEYGAQVALVKYATSIIKVSELSSDVDELKEKL- 648 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I T++ A+++A Q+L K I+ +TDG+ Sbjct: 649 -MKVPFIQGKTNTGGALERAQQMLAEG-----------------RPSVPKIILLITDGDA 690 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + +K K++ I I TI + + L + + ++ Y + DS+ + Sbjct: 691 TDK---ERLDAQIEKLKKSNILIYTIGVGDLIDRNELNRI-ATDEDFVYETRDFDSISKI 746 Query: 402 FQNISQLMVH 411 ++ + Sbjct: 747 KSSLLGRVCK 756 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/181 (8%), Positives = 47/181 (25%), Gaps = 17/181 (9%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A + + +I YT+ + + L T Sbjct: 403 ARDWLISVTKSFQIGPVATQVSVIQYTSEPIPEFDLNDFKTAAEVEDGIKRMRLQNGSTR 462 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A + F ++ K ++ ++DG++N ++ Sbjct: 463 TDKA-----------RALDFVSDVIFTKNFGARSQAPKVLVLISDGQDN---HPERVLEA 508 Query: 354 CDKAKENFIKIVTISINASPNGQ-RLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 K + + I I +++ + P ++ + + ++ + Sbjct: 509 AKKLHGQDVSVFVIGIGNESQMNIKMMNQIATEPITKHIKYANTVEGINKFKNALTGQIC 568 Query: 411 H 411 Sbjct: 569 E 569 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 24/165 (14%) Query: 235 LLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 F+ ++ DV +G++ Y+T V + G + + S+ D Sbjct: 791 AKNFVANVSSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGD 850 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A + ++++ + K +IF+TDG + Sbjct: 851 THTG--TALRYISTNNR--------------WREEVPKILIFVTDGTP---QDRAIVPAA 891 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 ++ ++I I + + LK S P + + AD Sbjct: 892 ARSLRDKGVRIFAIGVG--NAVESELKEIASEPYENHAIFIQGAD 934 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 25/245 (10%), Positives = 73/245 (29%), Gaps = 52/245 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 K + + + + I VVD SGS+ + + Sbjct: 175 KRIGILGDVCQEAECPDIFADIVFVVDSSGSI--------------------GPKRFDYM 214 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDM----DS 286 KN + S + ++ YT + ++ + + Q + + + Sbjct: 215 KNWVKSIAASFK---VGENFARFSVVQYTKTAKTVVDFQTLDLSSISQKIDSMIYFQGRN 271 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+++A+ +L + ++ ++ LTDG +++ Sbjct: 272 GRGGKTFTGNALERAHTLLKESEPGR-----------------KRIVLLLTDGSSDDEYG 314 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + +++ + I + + + L + + +++ Q + Sbjct: 315 P-----VAKAIRDDKVDIFAVGVGR--ARKNELVEITGDEQRVWQTRTFNNIGQFNQKLL 367 Query: 407 QLMVH 411 + Sbjct: 368 AEVCS 372 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 38/128 (29%), Gaps = 28/128 (21%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+ Y+ + + + ++ +I +TDG+ N Sbjct: 69 NGDTCIGEALDYFYRNMFTSQAGQ-------------RSDVEQRVIVMTDGKRNCPAEIA 115 Query: 349 NTIKICDKAKENFIKIVTISINAS---------PNGQRLLKTCVSSPEYH-YNVVNADSL 398 + + +I I I + Q L + + + + + N D L Sbjct: 116 KP---AELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQL 172 Query: 399 IHVFQNIS 406 + + I Sbjct: 173 --ILKRIG 178 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 90/291 (30%), Gaps = 32/291 (10%) Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE- 187 M + + + +QR + V ++ E Sbjct: 1 MASSSAGKTKMEATSKAAIDRQEVRVSTTPIRAAIARDQRKDDFEVLVTVEAPKVVAPEK 60 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V+D+SGSM+ + +++ LK A+ + + Sbjct: 61 RAPIDLVAVLDVSGSMNKEEF----------VRGKHMSSRLDLLKIAMKYIIKLVRDAD- 109 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILT 306 + ++ + V + + R+ + +D TD PA+K+A +IL Sbjct: 110 -----RLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKAVEILD 164 Query: 307 SDKK---RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 K + + + + E ++V+ ++ ++ Sbjct: 165 GRGKEEKKKRVGFILLLSDGVDQFQYSRINW-----EKVAKSTDVDHSEVGAMLRKYA-- 217 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRK 413 + T +AS + L + S + V N D++ F ++ + + Sbjct: 218 VHTFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAF---ARCLGGLR 265 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 64/248 (25%), Gaps = 64/248 (25%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I ++D+S SM S T++AA K + F+ Sbjct: 84 VYTDAGSSIMFLLDISPSMAAKDMSGE--------------TRIAAAKKIIRKFVAK--- 126 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 GL ++ + P+ + + + T + Sbjct: 127 ----YPGDSFGLTALSSSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLA----- 177 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +I+ LTDGENN + N I Sbjct: 178 -------------VSSAYMTRTKLNSSYIVLLTDGENNTGEIN--PKTAAKVLVNKNIGF 222 Query: 365 VTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I I +S + L K + + + + L ++ Sbjct: 223 YVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDELELKKIAQYGNGKYASASSPEILENI 282 Query: 402 FQNISQLM 409 F IS+ + Sbjct: 283 FNTISKQV 290 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/294 (9%), Positives = 85/294 (28%), Gaps = 20/294 (6%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 F K +SE+ NF+I A +++ ++ + + ++ + + ++++ +AA LA +K Sbjct: 2 FALKAFWSSERGNFAITTAFAMLPIMIGLAGAVDLIGTSHDASQLQNSLDAAGLAIGTKF 61 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 +++ + + A D + ++ + + + + + S Sbjct: 62 SPDMAAGDVQQLGLQFFAANMSAADQQEYLGSVSAFAATASGSPSAYFISLSSSISRPSF 121 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ A + S + L VS ++ Sbjct: 122 ISASAPWQAYRSASVKIKPGAQAC--VLALDPHASAAVNLQGSTNVSMDNCVIAANSDAS 179 Query: 189 PIFLIELVVDLSGSMHCAMNS-----DPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +S + P + + + L + + Sbjct: 180 DSVNRGGSALVSAGCVSTVGGTSGLLPPSASLACGTPHEHRYASFDPLADVV-------- 231 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + L + ++ + + +T + +++ T P Sbjct: 232 -----PPPYTLCLPVPNGKTYTLSPGTYCDKTLSGNITLNPGVYLMRGTTIKPG 280 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 39/162 (24%), Gaps = 15/162 (9%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 AN +I A S++ + ++G + ++ + +++ + L+ Sbjct: 1 MAKDVTANTIVISAASLVPLMAMVGGGVDASRYYMAETRLQA----------ACDAGALA 50 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ-----NIVNSSRIS 128 + + A R + L F T+ Sbjct: 51 ARRSMADDNFSRADRITGEKFFDENYPDGTFGLEDLERSFTATQSGQVNGEASGTLPTAI 110 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 M + S T + + + N Sbjct: 111 MAPFGYDEFSLSVTCEADVNISNTDVLFVVDVTGSMNCAPDN 152 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 24/209 (11%), Positives = 52/209 (24%), Gaps = 29/209 (13%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLI 288 L+NA DS + + + + + YV Sbjct: 437 VLRNATWKREDSSGNVLGHITQTGNENRPGYSCPAAAFKLTDISRTDLETYVDGLT---P 493 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL--------------------- 327 T M + ++ + + G I Sbjct: 494 RSNTYHDFGMIWGARFISPNGIFAASNATAPNGDAISRHIVFMTDGLLVPNQEIYSMYGI 553 Query: 328 -PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + + I G + C A++ I + I+ + L C +P Sbjct: 554 EWWDRRITNDGSGGQARDRHATRFQVACRAARQENISVWVIAFGTTLTQN--LIDCA-TP 610 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + +L F+ I+Q + + + Sbjct: 611 GRAFQANDTAALETRFEQIAQEIAALRLT 639 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/380 (11%), Positives = 109/380 (28%), Gaps = 44/380 (11%) Query: 48 YKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 K + A + + +IS A + N + ++ Sbjct: 97 QTKAGLLQAVRR-----IEPLSTGTMTGLAIQFAISRAFSDAEGARLRSSNINKVAIVVT 151 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 ++ + I + + R+D + + DY + Sbjct: 152 DGRPQDGVQDVSARARQAGIEIFAIGVGRVDMHTLRQIASEPLDDHVDYVESYSVIEKLT 211 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRT 226 Q+ + G+ I + S C + D + C + Sbjct: 212 HKFQEAFCVV--SDLCATGDHDCEQICISTPGSYKCACKEGFTLNSDGKTCSACSGGSGS 269 Query: 227 KMAAL------KNA-------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + K+ + F++ I + L ++ +GL+ Y++ V + Sbjct: 270 ALDLVFLIDGSKSVRPENFELVKKFINQIVESLEVSEKQAQVGLVQYSSSVRQEFPLGQF 329 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K + + K T + A+K S K Sbjct: 330 KNKKDIKAAVKKMAYMEKGTMTGQALKYLVDNSFSIANG-------------ARPGVPKV 376 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHY 390 I TDG + ++ ++ KAK+ ++ + + + L+ S P E+++ Sbjct: 377 GIVFTDGRSQDYITD-----AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYF 429 Query: 391 NVVNADSLIHVFQNISQLMV 410 + ++ ++ + + + Sbjct: 430 YTADFRTISNIGKKLQMKIC 449 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 71/222 (31%), Gaps = 40/222 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I V+D SGS+ K + ++ + Sbjct: 192 VADIVFVIDSSGSIDYTEY-----------------------KEMQNFMVSLVNKSAVGP 228 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++V G + Y+ + + T KV + D+ T + A++ + + T Sbjct: 229 DNVQFGALKYSDYNTELFYLNRYTNKVDIINHINKDTTQGGNTYTAGAVRFSKEFFTEK- 287 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + ++ +TDG++++ T + ++ I I I I Sbjct: 288 -----------HGSRKARGVPQIVMVITDGDSHDKDKLNET---ARQLEQEGIIIYAIGI 333 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L+T + + V N L + +S+ M + Sbjct: 334 --DQANTNELETLAGTEGKWFMVANFSGLQDILVQVSEAMCN 373 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 61/215 (28%), Gaps = 19/215 (8%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 V +S +M + E + + T + + +D + +V++ + Sbjct: 365 VQVSEAMCNKTDCKVEQADLLFLIDGSSNTSEETFREIKNFVVSVMDDFNVGPVNVHIAV 424 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y+ + I + TE+ A+ + S Sbjct: 425 SQYSESCIREINFDYSTERGTLKNEIINIRKTKGRRHIGAALDFTKSTVYS--------- 475 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + ++ ++ +T G ++ + ++ + I + I + Sbjct: 476 --PSSDNRLNQGVKQLLVVITAGNASDQVARP-----AKALRDRGVDIYAVGIG--NICK 526 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L SPE Y + L + + + + Sbjct: 527 TQLTQITGSPEKIYT-DDVSGLKAIKKRLVRDTCS 560 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 58/191 (30%), Gaps = 23/191 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ V + ++ +++ + + K + + Sbjct: 14 SSSGMGSDKFSKMKTFMKDLVNKTEVGLTGVQFAAVQFSNSIKEEFQFNKHATKNAIWDS 73 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D +L+ + A+ + F PS K ++ +TD + Sbjct: 74 IDNMNLMGNVAKTGNALA------------NVADYFTEAKGARPSSKVSKILLLITDNPS 121 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 + D + N + I +I S N + L + +P +Y + D L Sbjct: 122 QD-----EVKVPADSLRSNGLIIYSIG-GFSANKKELQEISGKITP--YY--QSFDKLQT 171 Query: 401 VFQNISQLMVH 411 + ++ + + Sbjct: 172 IENDLLFRICN 182 >gi|91201647|emb|CAJ74707.1| hypothetical protein kuste3944 [Candidatus Kuenenia stuttgartiensis] Length = 339 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 68/214 (31%), Gaps = 38/214 (17%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T + +K + F+ + + +GL + T P+ TE + + + Sbjct: 93 STTGTAIETIKEVVAAFIKK-------RSNDIIGLSIFGTDAALITMPTMETELLEKSLE 145 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV-----------------KI 324 R S + T + + L + S FT + K Sbjct: 146 RVHSSQMGYQTSIGEGIFTSITALFEKEMGSRFTFTELRNSINKQYLDDYAVSFVKEMKK 205 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI------NASPNG--- 375 L K II TDG N S +++ + IK +++ P+ Sbjct: 206 RDLLKNKLIILFTDGIYNIGISPDRPLRLLQRM---GIKAYVVAVKASDVYGIDPDIAAQ 262 Query: 376 --QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 Q L + S+ +Y+ N + + + I + Sbjct: 263 HIQELQEAIESTGGKYYHADNFEEVAKFYDEIDR 296 >gi|332970884|gb|EGK09861.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 440 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 24/220 (10%), Positives = 67/220 (30%), Gaps = 34/220 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--------EDVYMGLIGYTTRVE 264 D + + Q KM K+A+ F+ + ++V + + + +E Sbjct: 135 DASGSMAAQVSGGRKMDVAKDAVETFVSTFPEEANVSLIAYGHKGSNSQKDKVLSCSEIE 194 Query: 265 KNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + ++ + T A+K++ + L ++ Sbjct: 195 EIYPLSTYDQST---FSNALKTVDATGWTPLADAIKKSGKTLKAN--------------- 236 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK- 380 + + ++DG + K + +++ I + I + + Q+ LK Sbjct: 237 -ADKNSKNVVYIVSDGLETCG---GHPAKEAEALQKDGISATVNIIGFDVNNAEQQALKE 292 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + + + L F++ + + +K Sbjct: 293 VAEAGGGTFTSANSKSDLESYFESEYDELKREWFDYSIKA 332 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 97/371 (26%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A +K + R++ ++ + + + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 187 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 188 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 227 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 228 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 274 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ T + I+ Sbjct: 275 AEIN-AAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGIN----------------RILL 317 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T + S + ++ + + + Sbjct: 318 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYID 377 Query: 394 NADSLIHVFQN 404 V + Sbjct: 378 TLSEAQKVLNS 388 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 13/139 (9%), Positives = 40/139 (28%), Gaps = 7/139 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R +K + + + A S++ L + + +++A +A LAGA Sbjct: 1 MRRENRKELKRSRGAVLVWVAASMVVLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAW 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + +++ + + K+ + + ++ + Sbjct: 61 ELGN-----KSASQALREASAAQVAGSVASDNKSE--GAYAVDFPDADTCHVEGQETIAM 113 Query: 127 ISMTHMANNRLDSSNNTIF 145 + N Sbjct: 114 TFARILGVNESTVLAEAAA 132 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 65/207 (31%), Gaps = 30/207 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 SA + +K K A +L L ++ +G++ + T + Sbjct: 402 DRSASMSATFGISKFDMAKEAAILSLTTLQPGD------RVGVLAFDTETIWTVPFRTVG 455 Query: 274 E--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + + + + T+ A+ + Sbjct: 456 EGVSLVELQDQIATMSLGGGTNIERALSVGL------------------PALANEPYSTR 497 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + + ++ + A+ I + TI+I + + + L + +Y Sbjct: 498 HAVLLTDGRSYSNNYPRY-QQLVETARAAQITLSTIAIGSDSDTELLNQLASWGNGRYYF 556 Query: 392 VVNADSLIHV-FQNISQLMVHRKYSVI 417 V +A L + FQ + + +V Sbjct: 557 VADATDLPRITFQE--SEIATAELTVE 581 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 69/219 (31%), Gaps = 43/219 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM ++ K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVVLVLDHSGSMADTEDAGIP--------------KLEIAKSASAKMVEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + + K + + T P + +A + Sbjct: 445 SSD------GVGVIAFDHNYYWAYKFGKLVRK-EDVIESISSIEVGGGTAIIPPLSEAVK 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K ++ K ++ LTDG + D+AK N IK Sbjct: 498 TLKKSKAKN------------------KLVVLLTDGMGEQSGYEIP----ADEAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I TI + N L + Y V N L+ VF Sbjct: 536 ITTIGVGKFVNASVLSWIADYTSGRFYLVSNPSELVDVF 574 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 70/244 (28%), Gaps = 64/244 (26%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I + +D+SGSM + +++A + F+ Sbjct: 21 SNKGVDIVMAIDISGSMLAMDFAPK--------------NRLSAAVSVAKDFVKR----- 61 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + GL+ ++ + ++ + + + + T + +A L Sbjct: 62 --RPNDRFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLAKAVARL- 118 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S K II +TDG +N + + + AKE IK+ Sbjct: 119 -----------------KNSTAKSKVIILITDGVSNTGEID--PLTAAGMAKELGIKVYP 159 Query: 367 ISINA----------------------SPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQ 403 I + + + + L K ++ +A L + Sbjct: 160 IGVGSKGLVPFPYSDPIFGTRYINTYIDLDMETLNKIAETTGTGKAALATDAKGLADIMN 219 Query: 404 NISQ 407 I + Sbjct: 220 EIDR 223 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/281 (13%), Positives = 75/281 (26%), Gaps = 61/281 (21%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP---ALLRIEMGERPIFLIELV 196 + + + + V+ + G + V Sbjct: 464 NGTAVATVDSFSYFSVFWVDN-----WNDAITDTVNLAEHPEYVANETEGSIEPIDLAFV 518 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SGSM ++ K + F+ + L Sbjct: 519 IDESGSM--------------------GGARIQDAKASAKRFVGGLYEDDRA------AL 552 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + + + V + + ++ +++A LTS+ + Sbjct: 553 VSFAGGATLGQSLTTDHGAVNASIDQLNAGGGT---NTGAGLQKAVDELTSNGEGD---- 605 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + II L DG + I A E+ I I TI + + Q Sbjct: 606 -------------TQEIILLADGGTGLGPD---PVTIAQTADEHRITINTIGMGTGIDAQ 649 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 L ++ Y V ++ L VF + Q + S++ Sbjct: 650 ELTSIADATGGEFYQVSDSSELPEVFDRVEQN----RISLV 686 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 14/240 (5%), Positives = 46/240 (19%), Gaps = 3/240 (1%) Query: 58 NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 +A + ++ A + + F Sbjct: 421 ESAT--VEMPLDDDVDPSDGNLSVYKWDQHDEKPWHAVETDVHVENGTAVATVDSFSYFS 478 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + N + + + + + R S Sbjct: 479 VFWVDNWNDAITDTVNLAEHPEYVANETEGSIEPIDLAFVIDESGSMGGARIQDAKASAK 538 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + + L+ + + + + + A L+ A+ Sbjct: 539 RFVGGLY-EDDRAALVSFAGGATLGQSLTTDHGAVNASIDQLNAGGGTNTGAGLQKAVDE 597 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + + G G + + + + T A Sbjct: 598 LTSNGEGDTQEIILLADGGTGLGPDPVTIAQTADEHRITINTIGMGTGIDAQELTSIADA 657 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 18/296 (6%), Positives = 54/296 (18%), Gaps = 20/296 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 Y + K +I+ + + I ++ + + A AGA Sbjct: 8 LLHYLRLFCDDRKGGIAIMTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGAL 67 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S ++ + + + T + Sbjct: 68 AYSNANSEDQMTDAALRVAKLNGVDAANVVVSLTASPKDSRVQAVHVDVTTT------NT 121 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSY--DYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + T Y+ L Sbjct: 122 LFLAPILGVDSKLNIATEAYSSLGAAESGCIIALDKSAS-----GVTLSGGVQANASSCY 176 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-- 242 + + ++ + + + K L+ Sbjct: 177 VASNSNLVAPCGTKITAKSASYYEGSSQPCPWSTNIVRADGSPAPVTKQYTSDPLEGNAG 236 Query: 243 -----DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 ++E + + + S + S + Sbjct: 237 VDKLSGRFDTIREATWPSKVSTGNGTDIEFGGSSSPSVTAAAIDAVGCSYDPANYN 292 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 51/185 (27%), Gaps = 24/185 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + K+ +K L L+ + + LI + ++ K + Sbjct: 96 NSGSMEGEKIQNVKKTLEYLLELLGDND------RLCLILFNSKATKLCHLMRTNNSNKP 149 Query: 279 YVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 ++ + TD M+ A+++L K + ++ F K Sbjct: 150 AFKEIINKIEANGGTDINSGMELAFRVLKDRKYHNPVSSVFLLSDGQDGSADLKV----- 204 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + C I + + +G + K C Y V + Sbjct: 205 -----RQSLERHLPQEC-------FTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQ 252 Query: 398 LIHVF 402 + F Sbjct: 253 VDEFF 257 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/245 (11%), Positives = 67/245 (27%), Gaps = 54/245 (22%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + +R + V+D+SGSM Sbjct: 44 SMAPHDQELLLELRGSSS----STDRAGLDLVAVIDVSGSMD------------------ 81 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + K AL + + L + ++ ++T + + T + + Sbjct: 82 --GDGIDKAKTALQFVIRKLSDLD------RLCIVTFSTNATRLCPLRFVTAAAQAELKA 133 Query: 283 DMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ + ++ + + ++ ++DG Sbjct: 134 LVDGLKADGMTNMKAGLETSMSVVDGRRLAAGRAV---------------SVMLMSDGYQ 178 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIH 400 N+ + + T AS + L S V ++ +L Sbjct: 179 NDGGD-------ARDVHLKNVPVYTFGFGASHDSNLLEAIARKSLGGTFNYVADSANLTG 231 Query: 401 VFQNI 405 F + Sbjct: 232 PFSQL 236 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 64/248 (25%), Gaps = 64/248 (25%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I ++D+S SM + T++AA K + F+ Sbjct: 34 VYTDAGSSIMFLLDISPSMAAKDMNGE--------------TRIAAAKKIIRKFVAK--- 76 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 GL ++ + P+ + + + T + Sbjct: 77 ----YPGDSFGLTALSSSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLA----- 127 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + +I+ LTDGENN + N + I Sbjct: 128 -------------VSSAYMTRTKLNSSYIVLLTDGENNTGEIN--PKTAAEVLVNKNIGF 172 Query: 365 VTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I I +S + L K + + + + L + Sbjct: 173 YVIGIGSSGYTTLEYTDRKTGKTYSGSIFSKFDEVELKKIAQYGNGKYASASSPEILEDI 232 Query: 402 FQNISQLM 409 F IS+ + Sbjct: 233 FNTISKQV 240 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 2/70 (2%) Query: 347 NVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + IC A+ I I I ++ N LLK C S+ ++++V + F I Sbjct: 336 DRRLRNICAAAQRAGIVIYAIGMDVDSQNSLDLLKECASTEAHYFDVDGL-EIQTAFDMI 394 Query: 406 SQLMVHRKYS 415 + + + + Sbjct: 395 AASISMLRLT 404 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 52/205 (25%), Gaps = 27/205 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + ++ +S IP ++ GE + + D + Sbjct: 29 LNDSEEDRVSISIIPYSTKVNAGEDLL-------------EQYTVTQEHDYSHCVDFNAD 75 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTR 282 T +A L D G + S +++ + Sbjct: 76 DYTDLAISPRTELQRTGHFQFQQMSTSDPRSGQWVCRHDGGFSITPLSKSVSDLKRQIAA 135 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKFI 333 T K +L + +F + + K + Sbjct: 136 LT---PEGSTSIDMGAKWGLALLDPSARGPVSALIASGQVDASFRERPHPHDAENSMKVL 192 Query: 334 IFLTDGENNNFKSNVNTIKICDKAK 358 + +TDG+N+N + + AK Sbjct: 193 VLMTDGKNDN-EYRLQPEYASGDAK 216 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 42/347 (12%), Positives = 101/347 (29%), Gaps = 48/347 (13%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +A ++ + G ++ A + + +IK L + + E Sbjct: 275 TVAANNQATAYTQVAGAPVVLVAEGHPGAASNLIQALEAGNIKVELRDLALLPKELEGYK 334 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF---IEHLLNQRYNQKIVSFI 177 + I + + + ++ + + + Sbjct: 335 --QFASIVLADVPATSMTDADMERMRTAVRDLGIGLIMTGGKDSFGMGGWFQTPIEEALP 392 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + P ++LV+D SGSM KMA + A + Sbjct: 393 VHMDLKGKEQLPSLGLQLVIDKSGSMSSDARGAD---------------KMALAREAAIR 437 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + +G+I + + P K+ + + TD PA Sbjct: 438 ATTMMNAQDY------IGVIAFDDTPWDVVAP-QSVTKLDEIQQQISRIQADGGTDIFPA 490 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ Y+ + + +K +I LTDG+ + + + + Sbjct: 491 LQLGYERVKAMN------------------TQRKHVILLTDGQ---SALDDDYEGLLQQM 529 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + T+++ + L +Y +A+S+ +F Sbjct: 530 TAENITVSTVALGDDSDRGLLEMIAELGKGRYYFANDAESIPKIFSK 576 >gi|116201805|ref|XP_001226714.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] gi|88177305|gb|EAQ84773.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] Length = 777 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 61/211 (28%), Gaps = 25/211 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +D+SGSM + +D + +K+A + ++D Sbjct: 74 DLVLSIDISGSMA--DEAPAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRD----- 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ +T R + I P K + T+ ++ + + + Sbjct: 127 -RLGIVTFTNRSKVGIPPYENKAKT---LENIESMEPFSSTNMWHGIRDGLSLFSEAEGG 182 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S ++ LTDG N + + + I T Sbjct: 183 STGRVPA--------------LLVLTDGMPNYMCPPKGYVPMLRSMEPLPATIHTFGFGY 228 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A L VF Sbjct: 229 ELRSGLLKSIAEVGGGNYSFIPDAGMLGTVF 259 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 61/187 (32%), Gaps = 25/187 (13%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMD 285 K+ +K L+ ++ + + + ++ + ++ + + T++ +Q + Sbjct: 238 KIQLVKETLVKIINLMSSMD------RICIVCFNESGDRPLTFTRVTDENKQTLLNLIQQ 291 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ + + A + + + K ++ T+ + + + ++ Sbjct: 292 IYAGGGTNISEGINHALKAIQNRKFKNNVTSILLLSDGQDTKAYTRVKAYIDKY------ 345 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + K+ I TI + + L + + + L F NI Sbjct: 346 ----------QIKDA-FNIETIGFGEDHDPKLLRTLSDLRNGTFNFMQDVNYLDTAFINI 394 Query: 406 -SQLMVH 411 + ++ Sbjct: 395 FAGMIST 401 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 75/231 (32%), Gaps = 42/231 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + ++ +++ A K L F+ ++ Sbjct: 97 DVLMVVDLSGSMEEKDFATKS---------GEQLSRLTAAKKVLRDFVTQ-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI + + + + + + T+ A+ ++ Sbjct: 141 DRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS--- 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 PS + ++ LTDG + S V+ + A I+I I++ Sbjct: 198 -------------PSTSQDQIMLVLTDGNDTG--SFVSPVDAAKIAAAKGIRIYVIAMGD 242 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + ++ L +Q I + +KYS Sbjct: 243 PENVGEQPLDMDVVSRVSSLTQARSFVAIDQSQLNEAYQVI-DQLEPQKYS 292 >gi|260467412|ref|ZP_05813583.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259028808|gb|EEW30113.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 354 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 39/130 (30%), Gaps = 7/130 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R S + +F L ++ + +GF + +++ + A +AA+L+ + Sbjct: 1 MRRVASGFARSRSGSMMPLFFLMLVPIISAVGFSVDYTSAVQTRSNQQQALDAALLSITT 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 M + + + + A + ++G + Sbjct: 61 -MDTTSTLAQRQAALQDSFIANGGQGTA------TLNSFVAGTTTAAATARATASFAMPT 113 Query: 127 ISMTHMANNR 136 I M + Sbjct: 114 IFMKIARIDT 123 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 73/235 (31%), Gaps = 48/235 (20%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +E + L +DLSGSM + + +++ A+K F+ Sbjct: 81 AVEALPASGRDLVLAIDLSGSMEQEDF----------VLDGRTVSRLDAVKAVARDFVR- 129 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +GL+ + ++ E V + + + + + T + + A Sbjct: 130 ------ARAGDRVGLVIFAETAYFAAPLTFDVEAVGRLIDQATIGISGRSTAISDGLGLA 183 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L S + ++ L+DG NN A+ Sbjct: 184 LKRLA------------------RSDARSRVVVLLSDGVNNAGAV--QPRDAGSLAERLG 223 Query: 362 IKIVTISIN-----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I++ TI++ + + L +S + V D L V + I Sbjct: 224 IRVHTIALGPADLETDPKSRDAVDTATLRAIAETSGGETFRVRTTDDLRQVARAI 278 >gi|302868694|ref|YP_003837331.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504835|ref|YP_004083722.1| von willebrand factor type a [Micromonospora sp. L5] gi|302571553|gb|ADL47755.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315411454|gb|ADU09571.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 316 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 67/234 (28%), Gaps = 53/234 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D+S SM ++ A + A F+ + E Sbjct: 87 ATVMLAIDVSLSMQADDV---------------SPNRLEAAQEAAKQFVGEL------PE 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + P+ V + + + T + A+ Sbjct: 126 SYNLGLVSFAKSANVLVPPTKDRAAVTTAIDGLVLAEA---TATGEAVFT---------- 172 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ L+DG + +S + A+ + + TI+ Sbjct: 173 --CLEAIRSVPADGAAGIPPARIVLLSDGYRTSGRS---VEEAAAAAQAANVPVSTIAFG 227 Query: 371 ASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + ++ + Y +A L V+Q++ + Sbjct: 228 TDSGQVDIGGQLQRVPVDRTALSQLAETTQGFFYEAASASELKQVYQDMGSSIG 281 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 70/198 (35%), Gaps = 26/198 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K T++ A + ++ V ++ + TR + P+ + R Sbjct: 115 SGSMKGTRLTNAVQAATTAVSRLNDGDVVS------VVTFDTRTSVVVPPTTVGPETRGR 168 Query: 280 VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + L T + +++ +L ++ L+D Sbjct: 169 ILASVRGISLGGDTCISCGIEEGLSLLGQTSAGV------------------SRMLVLSD 210 Query: 339 GENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G+ N+ +V + +A++ + I TI ++ N + L + S HY V N + Sbjct: 211 GDANHGVRDVPGFRAMAQRARDRGVAITTIGVDVDYNEKILSAIALDSNGRHYFVENDAA 270 Query: 398 LIHVFQNISQLMVHRKYS 415 L +F+ ++ + S Sbjct: 271 LARIFEAEAEQLTTSVAS 288 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 68/236 (28%), Gaps = 38/236 (16%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I + V D+ A D++++ D + K A F+ ++ Sbjct: 69 TIAIARPVADVRVPRERATVMVAMDISNSMAATDVSPDRFTVAKEAATEFVRNL------ 122 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 E +GL+ + PS + V + T A+ + Q Sbjct: 123 PEQFNVGLVSFARTATVVAPPSTNH---QAAVDAIEQLTLTDSTAIGEAVLTSLQ----- 174 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + P I+ L+DG N + + + +A + + TI+ Sbjct: 175 -------AVRSLDAQAAEDPPPARIVLLSDGGNTSGRPIDEGARAATEA---GVPVSTIA 224 Query: 369 IN--------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + L ++ Y + + L V+ ++ + Sbjct: 225 YGTPEGTIDLEGRSIPVPADTESLRGLADATSGSFYAAESDEELRDVYSDLQSSIG 280 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 79/255 (30%), Gaps = 53/255 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I A + E P + +VD+SGSMH + Sbjct: 172 PWNPQTRLLRIAIKASD-RSVEELPPANLVFLVDVSGSMH-------------------R 211 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 R + ++ L L +D + V L+ Y + ++ + G++K + Sbjct: 212 REGLPMVQGTLKLLVDQLRPQDRVS------LVTYAGATQVVLDSTPGSDKAKIR-AAID 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T ++ AYQ + I+ TDG+ N Sbjct: 265 QLTAGGSTAGESGIQLAYQQASKHLIEGGIN----------------RILLATDGDFNVG 308 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVN-------- 394 S+ + ++ ++ + + T+ +RL++ + + + N Sbjct: 309 ISDFDSLKQLAADKRKTGVSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDNLREARKVL 368 Query: 395 ADSLIHVFQNISQLM 409 D L ++ + Sbjct: 369 VDQLASTLATVASDV 383 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 13/116 (11%), Positives = 38/116 (32%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + ++ F LS + L L G I +++ +++A +A +LA + Sbjct: 1 MLRAGSRAVTDRSGSVALTFGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGN 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + ++ + + + ++ + S E + Sbjct: 61 YLKLAVATSAAAKSIVVDTVHAQAAPRPESPYALRVEVADDKTSVATTVDETVKLA 116 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 51/151 (33%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R R + + I+FA + + LL IG I + + +++A + A++A Sbjct: 8 RVRRAVPGWRSDARGTIGIMFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMAVV 67 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + F++ ++ A + + + E + ++ Sbjct: 68 GNTSFDADFGRQAFKTSASFAYALDGAAPGSDPLTITRLTFTQNPDGTVTAEATATMRTT 127 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 +S+ + + L + +++ + Sbjct: 128 VVSIIGVRSLDLSFRSTAKGTTTLKISTITF 158 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 48/239 (20%), Positives = 90/239 (37%), Gaps = 53/239 (22%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + L++D SGSM+ KM + N+ F+D Sbjct: 58 GTPPVNVIRPNDVILIIDRSGSMNDE-------------------NKMQSAINSAKGFID 98 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +DL +G++ Y++ + S E V+ YV T + A+K Sbjct: 99 LMDL-----SKHKVGIVDYSSANNISSFPLSTDKEAVKNYVNGL---RANGGTATGDAIK 150 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKA 357 +A ++L + Q I+ LTDG+ N + + ++A Sbjct: 151 KARELLVN-----------------HRPDAQPVIVLLTDGDATEPNGNAYNYALTNSNEA 193 Query: 358 KENFIKIVTISI---NASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K+ I TI++ NA+P+ LLK ++ +H+ V+ + L ++ I Q + Sbjct: 194 KQEGIVFYTIALLNTNANPDTSGPNLLLKQMATTSHHHHFVLGSVGLGDIYAAIVQEIG 252 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 18/203 (8%), Positives = 54/203 (26%), Gaps = 7/203 (3%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + G A + V+ + I+ L+ + + Sbjct: 19 MGIYPLGAQAAENYVSVTKSVSPTTITTEEETLVTLNIKGTPPVNVIRPN-DVILIIDRS 77 Query: 162 EHLLNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + ++ Q ++ + +++ + + +++ + S N Sbjct: 78 GSMNDENKMQSAINSAKGFIDLMDLSKHKVGIVDYSSANNISSFPLSTDKEAVKNYVNGL 137 Query: 221 QDKKRTKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + T A+K A L ++ V + G N + E ++ Sbjct: 138 RANGGTATGDAIKKARELLVNHRPDAQPVIVLLTDGDATEPNGNAYNYALTNSNEAKQEG 197 Query: 280 VT----RDMDSLILKPTDSTPAM 298 + +++ T + Sbjct: 198 IVFYTIALLNTNANPDTSGPNLL 220 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 33/264 (12%), Positives = 73/264 (27%), Gaps = 69/264 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM A ++ A ++A F+ + Sbjct: 87 TIMLAMDVSGSMRAADVH---------------PDRLTAAQDAAKAFIAEL------PRH 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ + + P+ E + + + T + + + L D Sbjct: 126 VRVGIVAFAGSAQLAQLPTQNHEDLFKAIDSFQLQR---GTATGNGILLSLATLFPDTGI 182 Query: 312 SFFT----------------------NFFRQGVKIPSLPFQKFI-----IFLTDGENNNF 344 +G P+ I LTDG+ Sbjct: 183 DVSALGGRQAMPRPQSMDEIGRPPHRGSNGRGADRPAPVAPGSYSSAAIIMLTDGQRTTG 242 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP---------------NGQRLLKTCVSSPEYH 389 ++ A + +++ T+ + + L + + Sbjct: 243 VD---PMEAAQWAADRGVRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEY 299 Query: 390 YNVVNADSLIHVFQNISQLMVHRK 413 ++ A L V++ +S + K Sbjct: 300 FHAATAADLKKVYETLSSRLTVEK 323 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V +G + Y+ E + + T + Sbjct: 825 MNLTIHLVKKADIGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYT 884 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + F + + ++ +I +TDG +++ + Sbjct: 885 AKALQHS------------NILFTEEHGSRLTQNVRQLMIVITDGVSHD---RDKLDEAA 929 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + + + + Sbjct: 930 RELRDKGITIFAVGVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 984 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID+ V +G++ Y+ + E S T+ + + T + A+ Sbjct: 460 IDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKAL--- 516 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G + ++I LTDG++N+ ++ ++ + Sbjct: 517 ----------DFILPLIKKGKTERTDRAPCYLIVLTDGKSNDS-----VLEPANRLRAEQ 561 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I I I + L+ E N DSL + I + K Sbjct: 562 ITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEK 611 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 22/170 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ + + + + Y D S I + Sbjct: 636 ETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRN 695 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I LTDG+ + Sbjct: 696 TLTGGALTFVNEYFDLS--------------KGGRPQVRKFLILLTDGKAQDEVGGPAM- 740 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 741 ----ALRSKSVTIFSVGV-YGANRAQ-LEEISGDGSLVFHVENFDHLKAI 784 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V +G + Y+ E + + T + Sbjct: 613 MNLTIHLVKKADIGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYT 672 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + F + + ++ +I +TDG +++ + Sbjct: 673 AKALQHS------------NILFTEEHGSRLTQNVRQLMIVITDGVSHD---RDKLDEAA 717 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + + + + Sbjct: 718 RELRDKGITIFAVGVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 772 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID+ V +G++ Y+ + E S T+ + + T + A+ Sbjct: 248 IDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKAL--- 304 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G + ++I LTDG++N+ ++ ++ + Sbjct: 305 ----------DFILPLIKKGKTERTDRAPCYLIVLTDGKSNDS-----VLEPANRLRAEQ 349 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I I I + L+ E N DSL + I + K Sbjct: 350 ITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEK 399 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 22/170 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ + + + + Y D S I + Sbjct: 424 ETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRN 483 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I LTDG+ + Sbjct: 484 TLTGGALTFVNEYFDLS--------------KGGRPQVRKFLILLTDGKAQDEVGGPAM- 528 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 529 ----ALRSKSVTIFSVGV-YGANRAQ-LEEISGDGSLVFHVENFDHLKAI 572 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 80/255 (31%), Gaps = 53/255 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I A + E P + +VD+SGSMH + Sbjct: 172 PWNPQTRLLRIAIKASD-RSVEELPPANLVFLVDVSGSMH-------------------R 211 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 R + ++ L L +D + V Y G ++V + P K+R + + Sbjct: 212 REGLPMVQGTLKLLVDQLRPQDRVSLVTYAG----DSQVLLDSAPGSDKAKIRAAIDQLT 267 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T ++ AYQ + I+ TDG+ N Sbjct: 268 ---AGGSTAGESGIQLAYQQASKHLIDGGIN----------------RILLATDGDFNVG 308 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVN-------- 394 S+ + ++ +++ + + T+ +RL++ + + + N Sbjct: 309 ISDFDSLKQLAADKRKSGVSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDNLREARKVL 368 Query: 395 ADSLIHVFQNISQLM 409 D L ++ + Sbjct: 369 VDQLSSTLATVASDV 383 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/260 (16%), Positives = 81/260 (31%), Gaps = 58/260 (22%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V F + + R + + VD+S SM + D +++A Sbjct: 68 LVVGFARPTAEVRV-PRERATVMVAVDVSTSM---------------LATDVDPDRLSAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 K+A F+D + + +GL+ + + PS E + + R + + + Sbjct: 112 KSAARDFVDGL------PREFNVGLVAFAGSAAVLVPPSTDREALHDGIRRLAEGITGVQ 165 Query: 291 PTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ A + L + P II L+DG N + Sbjct: 166 GTAIGEAISTSLGAVKALDEQA---------------TTQPPPARIIVLSDGANTSGMDP 210 Query: 348 VNTIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVV 393 + + + TIS +G+ L + + Sbjct: 211 MEAAAEAVAFEV---PVHTISFGTPGGFVDRGGRPIQVPVDGETLQAVAEQTGGAFHQAD 267 Query: 394 NADSLIHVFQNISQLMVHRK 413 +D L V+ +I + RK Sbjct: 268 TSDELHAVYDDIGSSVGWRK 287 >gi|78484443|ref|YP_390368.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362729|gb|ABB40694.1| Hypothetical protein; predicted integral membrane protein with a von Willebrand factor type A domain [Thiomicrospira crunogena XCL-2] Length = 363 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 70/241 (29%), Gaps = 54/241 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + I VV+ S SM + ++++ ++ L F+ + Sbjct: 111 QTKTVRDIVFVVETSVSMVLEDY----------QIDGEPQSRIKVIQTVLDQFISGLA-- 158 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--DSTPAMKQAYQ 303 G I Y + + R + R L T + A+ A Q Sbjct: 159 -----GNRFGFILYADDAYTLMPLTSDATTARLMLKRLKPYLA-GRTDEATGEALGLALQ 212 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 Q K + ++ ++DG + + + + + A+ I Sbjct: 213 ----------------QAEKSTDSTENRIVVLISDGSTRDSRLPI--AEAINYAQGLNIP 254 Query: 364 IVTISINASPNGQR----------------LLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I TI + A+ L + + +Y + + L V Q I Q Sbjct: 255 IYTIGVGANSKDADKREFRGLLYEALESSSLKQIADQTQGRYYQIGSGQDLQKVLQAIDQ 314 Query: 408 L 408 Sbjct: 315 T 315 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 74/247 (29%), Gaps = 46/247 (18%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL-IELVVDLSGSMHCAMNSDPEDVNSA 217 ++L ++ +P + + L + LV+D SGSM Sbjct: 373 DNS-YMLGGYMGTQLEKMLPVDMDLSKKADIPSLGLVLVIDKSGSMTDGQYG-------- 423 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 TK+ K A I ++ +G+I + + + + Sbjct: 424 -------ITKLEMAKEAA------IRSTEALRPTDSVGVICFDDAASWVVGMRQA-DDLA 469 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T+ PA+ AY+ L + K II LT Sbjct: 470 EIQDSIGTIRPGGGTNMYPALDLAYKALEEADTKL------------------KHIIVLT 511 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++ + I + E+ I + ++++ + L + +Y + Sbjct: 512 DGQSATGDFD----GIAHRMAEDGITLSSVAVGMDADKNLLSRLAEIGNGRYYYTDEFSN 567 Query: 398 LIHVFQN 404 + + Sbjct: 568 IPKILTK 574 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V +G + Y+ E + + T + Sbjct: 866 MNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYT 925 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + F + + ++ +I +TDG +++ + Sbjct: 926 AKALQHS------------NVLFTEEHGSRLTQNVRQLMIVITDGVSHD---RDKLDEAA 970 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + + + + Sbjct: 971 RELRDKGITIFAVGVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID+ V +G++ Y+ + E S T+ + + T + A+ Sbjct: 501 IDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKAL--- 557 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G + ++I LTDG++N+ ++ ++ + Sbjct: 558 ----------DFILPLIKKGKTERTDRAPCYLIVLTDGKSNDS-----VLEPANRLRAEQ 602 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I I I + L+ E N DSL + I + K Sbjct: 603 ITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEK 652 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 22/170 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ + + + + Y D S I + Sbjct: 677 ETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRN 736 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I LTDG+ + Sbjct: 737 TLTGGALTFVNEYFDLS--------------KGGRPQVRKFLILLTDGKAQD-----EVG 777 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 778 GPATALRSKSVTIFSVGV-YGANRAQ-LEEISGDGSLVFHVENFDHLKAI 825 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V +G + Y+ E + + T + Sbjct: 866 MNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYT 925 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + F + + ++ +I +TDG +++ + Sbjct: 926 AKALQHS------------NVLFTEEHGSRLTQNVRQLMIVITDGVSHD---RDKLDEAA 970 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + + + + Sbjct: 971 RELRDKGITIFAVGVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID+ V +G++ Y+ + E S T+ + + T + A+ Sbjct: 501 IDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKAL--- 557 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G + ++I LTDG++N+ ++ ++ + Sbjct: 558 ----------DFILPLIKKGKTERTDRAPCYLIVLTDGKSNDS-----VLEPANRLRAEQ 602 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I I I + L+ E N DSL + I + K Sbjct: 603 ITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEK 652 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 22/170 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ + + + + Y D S I + Sbjct: 677 ETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRN 736 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I LTDG+ + Sbjct: 737 TLTGGALTFVNEYFDLS--------------KGGRPQVRKFLILLTDGKAQD-----EVG 777 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 778 GPATALRSKSVTIFSVGV-YGANRAQ-LEEISGDGSLVFHVENFDHLKAI 825 >gi|149773093|emb|CAO01896.1| collagen type VI alpha 6 [Mus musculus] Length = 1120 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 51/183 (27%), Gaps = 19/183 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + V +G+ ++ T + + I Sbjct: 25 DDFQKMKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQI 84 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+++ D + ++ LTDG + + Sbjct: 85 FGYTHIGDALRKVKYYFQPDMGSRINAG------------TPQVLLVLTDGRSQD----- 127 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ + + I ++ I + + L + E V N D L V + I + Sbjct: 128 EVAQAAEELRHKGVDIYSVGIGDVDDQE--LVQITGTAEKKLTVHNFDELKKVKKRIVRN 185 Query: 409 MVH 411 + Sbjct: 186 ICT 188 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 26/225 (11%), Positives = 64/225 (28%), Gaps = 43/225 (19%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ +R + LV+D SGSM K+A K A + Sbjct: 85 AQVPGAQRSPVNLALVIDRSGSM--------------------SGYKLAQAKQAARHLIG 124 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + +I Y + V+ + T+ + Sbjct: 125 LLNDQD------RLAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSA 178 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKE 359 L++ ++ +I ++DG+ ++ ++ + + Sbjct: 179 GRYQLSTAQRTYGVN----------------RLILMSDGQPTEGLTADEELTRMARELRA 222 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + I + N + + + +A L +FQ Sbjct: 223 TGLTLSAIGVGTDFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQK 267 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 71/235 (30%), Gaps = 35/235 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + R + V+D+SGSM+ +AP + + +++ LK A Sbjct: 63 LVRVEAPARPEARIPIDVVAVLDVSGSMNNP--------AAAPTERTRTTSRLDLLKTAA 114 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPT 292 + ++ + ++ ++ R + + + R + T Sbjct: 115 KFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEARGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 PA ++A ++L + F I+ LTDG + S + + Sbjct: 169 ALVPAFEEAVKVLDGRQGDGGDRLGF--------------IVLLTDGAEDASGSFTLSER 214 Query: 353 ICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSLIHVFQ 403 + + + + + + LL S + V N + Sbjct: 215 RREVIRGALRKYPVHAFGLGTAHGPEVLLYLAQESRGTYSFVDGDNVGEVAGALA 269 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 64/237 (27%), Gaps = 55/237 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++ A K A + + V +G++ + + P+ Sbjct: 93 DTSRSMAATDVAPTRIGASKQAARDLIVGLPAS------VRLGMVSFAATATVVLPPTDN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD------------------------ 308 + + + R L T + + QA +L D Sbjct: 147 RQDMLDAIDRFQLQL---GTATGSGLIQALAVLFPDDGIDLEAILFSGESLAPGPGGRSL 203 Query: 309 ----KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + + P +I L+DG + A + +++ Sbjct: 204 TEAAAADAVRKREQERPAAQPGSYRHGAVILLSDGRRTTGPD---PLDAARMAAQRGVRV 260 Query: 365 VTISINASP---------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + L + ++ +A L V++ +S Sbjct: 261 YTVGFGTPQGGAAAESGLSYYMQLDEPALRAVAAITNGEYFQAGSAADLSQVYRQLS 317 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 87/253 (34%), Gaps = 49/253 (19%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 R + V + LR E+ E P+ + +V+D+S SM Sbjct: 337 DAFGLGGWYRTPVEAVLPVTTDLRTEV-EVPLVALVIVMDVSQSMTA------------- 382 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +++ K + +D E +G I ++ R E P T + ++ Sbjct: 383 ----GNPSRLELAKEGAVGVVDLA------YERDMLGFITFSDRPEWVFRPRQATLQGKR 432 Query: 279 YVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T + ++ T PA ++A +L + + K +I LT Sbjct: 433 EMTAAILNVAPQGGTIFEPAYREALDVLMAQEAAV------------------KHVIVLT 474 Query: 338 DGENNNFKSN------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 DG+ + + ++ + + I TI+I + Q+L + +Y Sbjct: 475 DGKFADGTGPFSRGPAPDFGRLAALGRRSGITTSTIAIGDGADPQQLTTIARAGGGRYYE 534 Query: 392 VVNADSLIHVFQN 404 ++ +L +F Sbjct: 535 ALDVSTLPRIFTT 547 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 81/257 (31%), Gaps = 47/257 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++V + + E P + ++D+SGSM+ Sbjct: 150 SSAPWNPQHQLVHIGLQGKTLAIEELPPSNLVFLLDVSGSMN------------------ 191 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ LK L +D + + + Y + P+ G EK ++ + Sbjct: 192 -QPNRLPLLKEGFKLLVDQLSEQD------TVAIAVYAGAAGVVLPPTPGNEK-QKIIAA 243 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T +K AY++ T ++ S +I TDG+ N Sbjct: 244 IDGLQAQGSTAGGEGIKLAYELAT----------------RMLSEGKNNRVILATDGDFN 287 Query: 343 NF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIH 400 S+ +++ + ++ I + + + K + + N L+ Sbjct: 288 VGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDN---LME 344 Query: 401 VFQNISQLMVHRKYSVI 417 + +S + +++ Sbjct: 345 AKKVMSTELTGTLFTIA 361 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 79/224 (35%), Gaps = 45/224 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 LVVD+SGSM ++ K+ + A L ++ + Sbjct: 96 NVARPAAVLVVDVSGSMEDSIPGGV---------------KIEVARRAATLLVERMPGG- 139 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V +GL+ ++ R+ ++ P+ R+ + T T ++ A L Sbjct: 140 -----VDVGLLAFSDRIVLSLPPTGDR---RRVLDAIESLKPGGGTMYTYPLQAALSWLK 191 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K + T ++F++DG + + + + I + T Sbjct: 192 PYKLFNAST----------------LVVFVSDGLPADA---ATYRTLLSEFRSLGIPVYT 232 Query: 367 ISINASPN-GQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQL 408 + I + G+R LK S+ Y +A+ L+ F+ +++ Sbjct: 233 VYIGPGGDEGERELKLIAGSTGGEEYTAGSAEELLKAFKTLAEK 276 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 75/261 (28%), Gaps = 66/261 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + I V+D+S SM ++T++ A K+ + F+ + Sbjct: 81 TEYVYTETANSIMFVIDISPSMAAKD--------------INEKTRIQAAKDIITDFVQT 126 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 GL + I P+ E+ + + + T + A Sbjct: 127 -------YPADAFGLTALASTAALVIPPTIQHEQFFARLNSLQIGELGEGTALGMGLAVA 179 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + II LTDGE+N + + N + K Sbjct: 180 AAHFAK------------------NTVKTQSIILLTDGESNTGEIHPNL--AAELIKSKK 219 Query: 362 IKIVTISINASP-----------------------NGQRLLKTCVSSPEYHYNVVNADSL 398 I I I N + L + + + + SL Sbjct: 220 IGFYIIGIGKDGYANLEYVDPSTGEKREGTLQTIFNERELRELAHRGNGIYVSAKSFASL 279 Query: 399 IHVFQNISQLMVHR--KYSVI 417 +F+NISQ + ++S + Sbjct: 280 QEIFKNISQNISPTPARFSEV 300 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 69/244 (28%), Gaps = 54/244 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + +V+D SGSM K+A +K L + ++ Sbjct: 47 KRAPIDLVVVIDKSGSMA--------------------GEKLALVKKTLEFVVSQLNEKD 86 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + LI + T V + + + T + Q + D T+ + + + Sbjct: 87 ------RLCLITFDTSVYLDFKLTPMTPMNKYQTLKIIKDISPGSMTNLCGGLMKGLCEV 140 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK------- 358 ++ TDG N AK Sbjct: 141 IDRADEEKNEVAS--------------VLLFTDGFANKGGLTNIYCSSSQTAKYTIGIVG 186 Query: 359 --ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T ++ N Q L + + +Y + N D + F Q + S Sbjct: 187 PKTADASIYTFGFGSNHNAQMLKEISDAGSGMYYYIENVDMIAEAF---GQCLGGL-LST 242 Query: 417 ILKG 420 + +G Sbjct: 243 VAQG 246 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ +I+ + P I ++D SGSM + K Sbjct: 227 NPDNYLLRIGLQAKKIDFTKAPPSNIVFLIDTSGSM-------------------DEPNK 267 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M LK + L LD++ + ++ Y ++ + + EK + D + Sbjct: 268 MPLLKASFKLLLDNLRPED------RIAIVVYASQTGIALPSTPAKEK-EKISKVIDDLV 320 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY++ + II TDG+ N S Sbjct: 321 ASGSTAGGAGLQTAYEVAEKNFL----------------PKGNNRIILATDGDFNVGISS 364 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 ++ ++ + N I I + + +T + + + N Sbjct: 365 RDELQRLVEEKRNNGIYISVLGYGMGNYRDDMAETIANKGNGNYAYIDNFTE 416 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 37/339 (10%), Positives = 84/339 (24%), Gaps = 20/339 (5%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 + E S N + + NR + S + Sbjct: 29 SINTSFVSAEGNSSSSEKTITNSIQIDNMNEGEVRVFKTAKPIPNSINRWEISIDVFGRL 88 Query: 148 MDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVD 198 + L + R ++ I + I + L+ Sbjct: 89 KREPSDIVLVLDTSGSMDPQKNPQGIDRISKAKREAIHFVNEIFERDASARVALVSYGTK 148 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 +S + +N + + T + L+ + + G Sbjct: 149 VSSNSFHTKQESNLLINEIKSLKAEGGTFTQGALYEAKMLLNQSSAPNKTIVLLSDGQPT 208 Query: 259 YTTRVEKNIE---PSWGTEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 Y ++ + + V++ + T S+ S + F Sbjct: 209 YRYPLKAKVNQDLLRYDGNVIVQKRYNGQQRPFDIGITSSSN-QAIPGYRFKSRPNTNVF 267 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTISINA 371 + + G N F + ++ + I + I + Sbjct: 268 DYNAMVYGTGNEYYLDELGELRSQGNQNYFVYMSSADAAIIESNQIHQEQIHLYAIGFDT 327 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVFQNISQLM 409 G +LK S +Y+ ++ D+L +F+ IS + Sbjct: 328 DARGTDILKRI--SNNNYYDASSSRDNLDDIFKKISNNI 364 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 36/254 (14%), Positives = 85/254 (33%), Gaps = 52/254 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + N K++ + + + P + +VD+SGSM A K Sbjct: 151 KENAKLIRIGIQAKDLALQQLPPANLVFLVDVSGSMSAAD-------------------K 191 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K L + + + + +I Y + + +EP+ G +K + + Sbjct: 192 LPLVKQTLRILTEQLRAQD------KVTIITYASGEKLVLEPTSGEQK-EKILAVINGLR 244 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T A++ AY+ ++ I+ TDG+ N Sbjct: 245 AGGATAGEQAIQLAYKQAEKAFVKNGIN----------------RILLATDGDFNVGITD 288 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + + +++ I + T+ ++L++ + + + N + Sbjct: 289 FSTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNE-------- 340 Query: 406 SQLMVHRKYSVILK 419 ++ +V R+ S L Sbjct: 341 AKKVVQRQLSSTLA 354 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 68/200 (34%), Gaps = 39/200 (19%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSL 287 A+ +F+D + +G++ YT R+++ +++++ + Sbjct: 61 EAMKMFIDMLATTGD-----KVGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDR-- 113 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN------ 341 TD + + +A ++L I+ L DG N Sbjct: 114 -GPYTDMSVGLDEAVKVLKQGM----------------DPAHAPMIVVLADGNNDLDPNT 156 Query: 342 --NNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADS 397 + +++ + +AK + I I TI +NA N + L + + + +AD Sbjct: 157 GRTSKEASEQLAQAVKEAKGSGIPIYTIGLNADGKLNKETLAELANQTGGKSFTTSSADD 216 Query: 398 LIHVFQNISQLMVHRKYSVI 417 L + I K + Sbjct: 217 LPQILSEIFASHQQLKIVPV 236 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 67/212 (31%), Gaps = 35/212 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D + ++ A++ A F+D + + + +GL+ + + P+ Sbjct: 93 DVSLSMQATDIEPSRFEAMQVAAKEFVDVL------PDRINLGLVSFAGTATTVVTPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +V + + T A+ + + NF Sbjct: 147 RGQVSTAIDNLELAE---STAIGEAVFTSLTAIE---------NFQSSLDADGEEVPPAR 194 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------PNGQRL 378 I+ L+DG N + + + + I + TI+ + L Sbjct: 195 IVLLSDGYNTVGRPDTQAVSAAL---DAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATL 251 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + + +A L V+Q++ + Sbjct: 252 EEIADQTGGSYSEAASAAELEQVYQDLGSQIG 283 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 29/242 (11%), Positives = 68/242 (28%), Gaps = 64/242 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I +D+S SM +++ + FL+ + Sbjct: 89 MDIIFALDVSKSMDATDMM---------------PSRLLRAILQIGSFLEQVKT------ 127 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDK 309 +G+I + + E VR + + + + TD A++ A Sbjct: 128 -DRIGIIAFAGTATLQCPLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLAENAFPEGS 186 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K K ++ ++DGE+ + K I++ T+ + Sbjct: 187 KS-------------------KTLVLISDGEDLQHSALRE----ARILKTKGIRVYTMGV 223 Query: 370 NASP-----------------NGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVH 411 + + L + + +Y V + + + + I + Sbjct: 224 GSPEGTIIRHPETGEEVKSKLDEATLQEIARITEGEYYRVTPGGEEIQLILKRIYESEST 283 Query: 412 RK 413 R+ Sbjct: 284 RR 285 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 30/336 (8%), Positives = 82/336 (24%), Gaps = 24/336 (7%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ SI+ AL + + +G + + + ++ ++++ ++ LA A + ++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINL 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY-SAVFYNTEIQNIVNSSRISMTHMA 133 I+ ++ + S + N +S S+ ++ Sbjct: 76 SVPEAAGITAGHLNYA---LFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVK 132 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI---------VSFIPALLRIE 184 + F M + Sbjct: 133 CMTSQTGIVNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPP 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSD-PEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + + + +GS + ++ AL + Sbjct: 193 VAGATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVG 252 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + +PS+GT T + P+ AY Sbjct: 253 TPGNKAATTNAYNTRFGIYANPYKDPSYGTPDFT--------GYAYDAT-TWPSQSNAYA 303 Query: 304 ILTSDKKR-SFFTNFFRQGVKIPSLPFQKFIIFLTD 338 S ++ + + G+ + D Sbjct: 304 DFVSKRQTFTSYQGDLITGINTGGTYNPNYYAAGAD 339 >gi|242051338|ref|XP_002463413.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] gi|241926790|gb|EER99934.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] Length = 491 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 16/180 (8%), Positives = 48/180 (26%), Gaps = 13/180 (7%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RD 283 K+ LK A+ + + + ++ ++ + I + ++ + Sbjct: 88 GQKLELLKQAMCFVIHQLGPAD------RLSIVTFSRHATRQIRLARMSDVGKASAKFAV 141 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++ T+ ++ Q+L ++R+ + + DG Sbjct: 142 GALCAVRGTNIGQGLRVGAQVLAGRRERNAVAGMILLSDGQDTS--GCWTTVRPDGTKTY 199 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + I T + + ++ V N ++ F Sbjct: 200 ANLVPPS----TSFSSRPAPIHTFGFGTDHDAAAMHAIAEATGGTFSFVGNEAAIQDSFA 255 >gi|120609754|ref|YP_969432.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] gi|120588218|gb|ABM31658.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] Length = 355 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/264 (12%), Positives = 73/264 (27%), Gaps = 69/264 (26%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM A ++ A ++A F+ + Sbjct: 87 TIMLAMDVSGSMRAADVH---------------PDRLTAAQDAAKAFIADL------PRH 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ + + P+ E + + + T + + + L D Sbjct: 126 VRVGIVAFAGSAQLAQLPTQNHEDLFRAIDSFQLQR---GTATGNGILLSLATLFPDTGI 182 Query: 312 SFFT----------------------NFFRQGVKIPSLPFQKFI-----IFLTDGENNNF 344 +G P+ I LTDG+ Sbjct: 183 DVSALGGRQAMPRPQSMDEIGRPPHRGGNGKGADRPAPVAPGSYTSAAIIMLTDGQRTTG 242 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP---------------NGQRLLKTCVSSPEYH 389 ++ A + +++ T+ + + L + + Sbjct: 243 VD---PMEAAQWAADRGVRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEY 299 Query: 390 YNVVNADSLIHVFQNISQLMVHRK 413 ++ A L V++ +S + K Sbjct: 300 FHAATAADLKKVYETLSSRLTVEK 323 >gi|187470892|sp|A6NF34|ANTRL_HUMAN RecName: Full=Anthrax toxin receptor-like; Flags: Precursor Length = 565 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 62/187 (33%), Gaps = 29/187 (15%) Query: 228 MAALKNALLLFLDSIDLLSHV-----KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + ++D + ++ M I Y+T + + + +++ + + Sbjct: 81 LDKSGSVNNNWIDLYMWVEETVARFQSPNIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ 140 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T ++A Q + S + II +TDGE Sbjct: 141 LQKIVPDGHTFMQAGFRKAIQQIESFNSGN---------------KVPSMIIAMTDGE-L 184 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +T++ KA++ + T+ + + + SP + + V N F Sbjct: 185 VAHAFQDTLREAQKARKLGANVYTLGV--ADYNLDQITAIADSPGHVFAVENG------F 236 Query: 403 QNISQLM 409 + + + Sbjct: 237 KALRSTI 243 >gi|156742635|ref|YP_001432764.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233963|gb|ABU58746.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 777 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 44/136 (32%), Gaps = 20/136 (14%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T ++++ +LTS + + I+ L+ Sbjct: 374 RAQAAIDTLNSRGATSIGGGLQRSQHLLTS-----------------ANPAIPRAIVLLS 416 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG+ N + + ++ + TI + + + +L + + D Sbjct: 417 DGQENT---SPYVADVLPPIRDAQTTVHTIGVGQDADQRLMLSIAAQTGGTYNYAPTPDQ 473 Query: 398 LIHVFQNISQLMVHRK 413 L ++ IS + +R+ Sbjct: 474 LARIYNTISGNVSNRQ 489 >gi|145547190|ref|XP_001459277.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427101|emb|CAK91880.1| unnamed protein product [Paramecium tetraurelia] Length = 603 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 75/271 (27%), Gaps = 46/271 (16%) Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + S + + + V + +RP + Sbjct: 65 IVQFTISEGITAEELIQNLDIRLYPERDP--IKNNKFVPAVLSLKTKKVSNNLDRPPIDL 122 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 VVD+SGSM K+ +K++L + + Sbjct: 123 VCVVDVSGSM--------------------IGRKINLVKDSLRYLMKILGPED------R 156 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRS 312 + +I +TT T++ + + + + L T+ + M +A +L + K ++ Sbjct: 157 ICIIVFTTVAHIVTSFIRNTQENKPLLKKAILELKGLASTNISDGMNKALWMLKNRKYKN 216 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISINA 371 I L+DG+++ + K E I T Sbjct: 217 PV----------------SCIFLLSDGQDDYKGAEQRVFDQLQLLKIEEKFVIHTFGYGQ 260 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + Y + N + F Sbjct: 261 DHDAYVMNQIAKYREGNFYYIDNINKASDYF 291 >gi|145500364|ref|XP_001436165.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403304|emb|CAK68768.1| unnamed protein product [Paramecium tetraurelia] Length = 604 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 75/261 (28%), Gaps = 30/261 (11%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 YN++ + ++ L + + + + + IE +M Sbjct: 119 AYNINENIEFYLTSEYKT--LRFNDRKALSCLVGIKAKEVHQPQVPQQIEGDAQNQFNMD 176 Query: 205 CAMNSDPEDVNSAPICQDKKRT--KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 +S I + + K+ +K L + L+ + + LI + Sbjct: 177 QQQHSKVGVDLLCVIDRSGSMSGEKIEMVKQTLNILLNFLGPKD------RLCLIQFDDT 230 Query: 263 VEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 ++ T++ + Y T + A + + K + T Sbjct: 231 CQRLTNLRRVTDENKTYYSDIISKIYANGGTVIGLGTQMALKQIKYRKSVNNVTA----- 285 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 I L+DG++ S+ ++ + + I + + + + + K Sbjct: 286 -----------IFVLSDGQDEAAISS-LQKQLAYY--KQTLTIHSFGFGSDHDAKLMTKI 331 Query: 382 CVSSPEYHYNVVNADSLIHVF 402 Y V N L F Sbjct: 332 SNLGKGSFYFVNNISLLDEFF 352 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 44/340 (12%), Positives = 95/340 (27%), Gaps = 61/340 (17%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + +S KI + + E Sbjct: 26 IIPAAGTNLIVEAENNPGSTTSSDGLVTVNKTSERVGDNRYKINLDLATGKAQDSVEAIG 85 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I LV D+S SM + + N + ++ KNA + FL++ + + K Sbjct: 86 NDIVLVFDISNSMAEDEHGNSTSSND--------KKRLTKAKNAAIEFLNNSKISGNKKN 137 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ + + E +Q + ++ + T+ + +A +L + K Sbjct: 138 RY--SIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAK 195 Query: 310 KRSFFTNFFRQGVKIPSLPFQK---FIIFLTDG--------------------------- 339 + G S + D Sbjct: 196 SENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDS 255 Query: 340 ---------------ENNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLK 380 N+++ N + + AK++ I TI + LK Sbjct: 256 VIKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLK 315 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + E + ++ L +++NI+ +V R ++I G Sbjct: 316 NVATQGEGYAY-SSSSDLSGIYENIANEIVTRYETIIKNG 354 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/345 (8%), Positives = 82/345 (23%), Gaps = 20/345 (5%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD--- 138 + +N + + + + + ++ Sbjct: 538 SGYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDEKTNYLPSGSETINNYE 597 Query: 139 --SSNNTIFYNMDVMTSYDYRLQFIEHL----LNQRYNQKIVSFIPALLRIEMGERPIFL 192 S N + L + + + + + P Sbjct: 598 YNVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTHLKNNVIKFITKLYEHNPDSR 657 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + ++ + N++ ++ + T + + S + Sbjct: 658 VSVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYE 717 Query: 253 YMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA-YQILTSDK 309 + T I + G + + T D A+ QA + + + Sbjct: 718 VRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNL 777 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 G K ++ +N+ + N K I TI + Sbjct: 778 TGINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTN-----RSGKGLGKTIFTIGL 832 Query: 370 NASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N++ + L S ++ N+ ++ + F +I + Sbjct: 833 NSNNSSLFDSFLTRLAS-EGHYTKANNSSAMENAFNSIFTSITTM 876 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/341 (12%), Positives = 88/341 (25%), Gaps = 57/341 (16%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + L + D + K + + Sbjct: 460 SIRNIEKEDTLSANVVGALSAPIEITSDASFTSAKITFTYDEALLGDTPAENLAVMWYDR 519 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + N S + Y+ + Sbjct: 520 ENRRYVILDKDTVVDPNAHTVSYTTTHFSTYLVVD-----REVWYDCWRENIDYRSGDAG 574 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + I L VD+SGSM ++ K AL F+D++ Sbjct: 575 TSQLEPYDIGLCVDVSGSM--------------------YGDRLEKAKTALNTFIDAMLP 614 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ ++ E +R + D T++ + + I Sbjct: 615 QDNA------CMVSFSDNAYLVAGYGASKEVMRSRTNQLRDLY---GTNTDVGLSKTISI 665 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + K II + DG+ N + D AK I + Sbjct: 666 LADQGRSD----------------ASKMIIMICDGDVNYIQ------GTVDAAKAAGIAV 703 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI++ S + L K + +Y + ++ + I Sbjct: 704 YTINV-VSGDNDLLQKIADETGGEYYYAATTEEVVSQVEAI 743 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 81/257 (31%), Gaps = 47/257 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++V + + E P + ++D+SGSM+ Sbjct: 150 SSAPWSPQHQLVHIGLQGKTLAIEELPPSNLVFLLDVSGSMN------------------ 191 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ LK L +D + + + Y + P+ G EK ++ + Sbjct: 192 -QPNRLPLLKEGFKLLVDQLTEQD------TVAIAVYAGAAGVVLPPTPGNEK-QKIIAA 243 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T +K AY++ T ++ S +I TDG+ N Sbjct: 244 IDGLQAQGSTAGGEGIKLAYELAT----------------RMLSEGKNNRVILATDGDFN 287 Query: 343 NF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIH 400 S+ +++ + ++ I + + + K + + N L+ Sbjct: 288 VGVSSDAELVRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDN---LME 344 Query: 401 VFQNISQLMVHRKYSVI 417 + +S + +++ Sbjct: 345 AKKVMSTELTGTLFTIA 361 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 44/340 (12%), Positives = 95/340 (27%), Gaps = 61/340 (17%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + +S KI + + E Sbjct: 26 IIPAAGTNLIVEAENNPGSTTSSDGLVTVNKTSERVGDNRYKINLDLATGKAQDSVEAIG 85 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I LV D+S SM + + N + ++ KNA + FL++ + + K Sbjct: 86 NDIVLVFDISNSMAEDEHGNSTSSND--------KKRLTKAKNAAIEFLNNSKISGNKKN 137 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ + + E +Q + ++ + T+ + +A +L + K Sbjct: 138 RY--SIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNIQAGLYKARTVLKNAK 195 Query: 310 KRSFFTNFFRQGVKIPSLPFQK---FIIFLTDG--------------------------- 339 + G S + D Sbjct: 196 SENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGYSGRYTFGENAINYDS 255 Query: 340 ---------------ENNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLK 380 N+++ N + + AK++ I TI + LK Sbjct: 256 VIKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIFTIGYTTGSSVNSFLK 315 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + E + ++ L +++NI+ +V R ++I G Sbjct: 316 NVATQGEGYAY-SSSSDLSGIYENIANEIVTRYETIIKNG 354 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 30/345 (8%), Positives = 82/345 (23%), Gaps = 20/345 (5%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD--- 138 + +N + + + + + ++ Sbjct: 538 SGYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDEKTNYLPAGSETINNYE 597 Query: 139 --SSNNTIFYNMDVMTSYDYRLQFIEHL----LNQRYNQKIVSFIPALLRIEMGERPIFL 192 S N + L + + + + + P Sbjct: 598 YNVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTHLKNNVIKFITKLYEHNPDSR 657 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + ++ + N++ ++ + T + + S + Sbjct: 658 VSVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYE 717 Query: 253 YMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA-YQILTSDK 309 + T I + G + + T D A+ QA + + + Sbjct: 718 VRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNL 777 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 G K ++ +N+ + N K I TI + Sbjct: 778 TGINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTN-----RSGKGLGKTIFTIGL 832 Query: 370 NASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N++ + L S ++ N+ ++ + F +I + Sbjct: 833 NSNNSSLFDSFLTRLAS-EGHYTKANNSSAMENAFNSIFTSITTM 876 >gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 416 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/269 (12%), Positives = 78/269 (28%), Gaps = 49/269 (18%) Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 D L L + ++ + A ++E ERP+ + V+D SGSM Sbjct: 4 IQIDLALDKTYLLPGNKQVAYLMVKLTAPKQVEK-ERPVQNLSFVIDRSGSMA------- 55 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 K+ K A+ + + + + ++ + Sbjct: 56 -------------GEKLDYTKKAVAFAVGHLSPQDYCSVVAFDDMVTMVASSHQV----A 98 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ V T+ + M + + K + Sbjct: 99 NKDALKMAVESI---YPGGSTNLSGGMLLGVREVKLAHKENQIN---------------- 139 Query: 332 FIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 ++ LTDG N ++ ++ + + + T + L + Y Sbjct: 140 RVLLLTDGMANVGVTDHSALVEKSREMAAGGVNLSTFGLGEDFEEDLLQAMVEAGGGNFY 199 Query: 391 NVVNADSLIHVFQNISQLMVHRKYSVILK 419 + D + +F+ Q + S++ + Sbjct: 200 YIEKPDQIPGIFE---QELTGL-LSIVAQ 224 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 77/250 (30%), Gaps = 52/250 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + K++ + +RP + ++D SGSM K+ Sbjct: 182 DAKLIRIAIQAADLAPEKRPPANLVFLIDTSGSMDDPD-------------------KLP 222 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K + F +++ + LI Y+ + + P+ G +K + Sbjct: 223 LVKKTVCHFAEALRADD------RISLITYSGSTAEILPPTAGDQK-ETIIAALKPLRAH 275 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T A++ AY + ++ I+ TDG+ N S+ Sbjct: 276 GATAGGEALRMAYDAAAKNYRKDGIN----------------RILLATDGDFNVGISDPA 319 Query: 350 T-IKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADS--------LI 399 T +++ I + T+ + +++ + + + + L Sbjct: 320 TLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYIDSEAEAKKVLVRQLT 379 Query: 400 HVFQNISQLM 409 +++ + Sbjct: 380 STLATVARDI 389 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 73/256 (28%), Gaps = 61/256 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM + ++ A +NA F+ + Sbjct: 17 TIMLAMDVSGSMRATDV---------------QPDRITAAQNAAKAFIAEL------PRH 55 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ + + P+ E + + + T + + A L D Sbjct: 56 VRVGIVAFAGSAQLAQLPTQSHEDLAKAIDSFQLQR---GTATGNGIMLALATLFPDAGI 112 Query: 312 SFFTNFFR--------------QGVKIPSLPFQKFI-----IFLTDGENNNFKSNVNTIK 352 R K + I LTDG+ ++ Sbjct: 113 DIAALGGRQAMHPRPLDEVTRQDPAKPFTPVAPGSYTSAAIIMLTDGQRTTGVD---PLE 169 Query: 353 ICDKAKENFIKIVTISINASP---------------NGQRLLKTCVSSPEYHYNVVNADS 397 A + +++ T+ + + + L + +++ A Sbjct: 170 AAQWAADRGVRVYTVGVGTVQGETIGFEGWSMRVRLDEETLKAVAGRTHAEYFHAATAAD 229 Query: 398 LIHVFQNISQLMVHRK 413 L V++ +S + K Sbjct: 230 LKKVYETLSSRLAVEK 245 >gi|116751034|ref|YP_847721.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116700098|gb|ABK19286.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 479 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 70/257 (27%), Gaps = 44/257 (17%) Query: 164 LLNQRYNQKIVSFI-PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + ++ + R + +V+D SGSM A Sbjct: 64 MGGDGTVTVALTLECDRAPGGNVEARRELDMVVVMDRSGSMADA---------------- 107 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K+ + A+L L + L+ Y+ V+++ T R + R Sbjct: 108 ---GKLTHARQAVLNLLSRLSETD------RFALVSYSDHVQRHGGLLPITPANRATLER 158 Query: 283 DMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T+ +++ L ++ +I ++DG Sbjct: 159 IVRGIQPGGATNLGGGLQEGISQLAELQQNGRL----------------SRLILISDGLA 202 Query: 342 NNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N + A E + T+ + N + + + +A + Sbjct: 203 NRGVTDPSALGTMASVAAERGYAVSTVGVGLDFNEHLMTSIADKGAGNYTFMESASAFAQ 262 Query: 401 VFQNISQLMVHRKYSVI 417 VF + S + Sbjct: 263 VFDKEFRDAGTVVASSV 279 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 52/164 (31%), Gaps = 6/164 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + FR +K +EK + AL ++ + + + + K +++A +A LA Sbjct: 1 MDAFRR-IRKL--NEKGFAIVYIALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAGALA 57 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G +K+ ++ + N + + V Sbjct: 58 GVAKLSDTVTARQSAKLFAERNKAAGESVKVALNETNSADGDIVVGYWDKVTRTMSATVP 117 Query: 124 SSRI--SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 + ++ ++ +A ++ N V + L + + Sbjct: 118 TGKVANAVKVVARRTTETGTGISAENKQVDLFFGQVLNW-GQMS 160 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 36/254 (14%), Positives = 85/254 (33%), Gaps = 52/254 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + N K++ + + + P + +VD+SGSM A K Sbjct: 151 KENAKLIRIGIQAKDLALQQLPPANLVFLVDVSGSMSAAD-------------------K 191 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K L + + + + +I Y + + +EP+ G +K + + Sbjct: 192 LPLVKQTLRILTEQLRAQD------KVTIITYASGEKLVLEPTSGEQK-EKILAVINGLR 244 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T A++ AY+ ++ I+ TDG+ N Sbjct: 245 ARGSTAGEQAIQLAYKQAEKAFVKNGIN----------------RILLATDGDFNVGITD 288 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + + +++ I + T+ ++L++ + + + N + Sbjct: 289 FNTLKGMVAEKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNE-------- 340 Query: 406 SQLMVHRKYSVILK 419 ++ +V R+ S L Sbjct: 341 AKKVVQRQLSSTLA 354 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 74/231 (32%), Gaps = 42/231 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM ++ +++ A K L F+ ++ Sbjct: 133 DVLMVVDLSGSMEEKDF---------STAAGEQLSRLTAAKRVLRNFVTQ-------RQG 176 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI + + + + + + T+ A+ ++ Sbjct: 177 DRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS--- 233 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 PS + ++ LTDG + S V+ + A I+I I++ Sbjct: 234 -------------PSTSQDQIMLVLTDGNDTG--SFVSPVDAAKIAAAKGIRIYVIAMGD 278 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + ++ L +Q I + +KYS Sbjct: 279 PENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVI-DQLEPQKYS 328 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 68/200 (34%), Gaps = 39/200 (19%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQYVTRDMDSL 287 A+ +F+D + +G++ YT R+++ +++++ + Sbjct: 61 EAMKMFIDMLATTGD-----KVGIVSYTDRIQREKALLEIQSEADKTALKEFIDQLDR-- 113 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN------ 341 TD + + +A ++L I+ L DG N Sbjct: 114 -GPYTDMSVGLDEAVKVLKQGM----------------DPAHAPMIVVLADGNNDLDPNT 156 Query: 342 --NNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADS 397 + +++ + +AK + I I TI +NA N + L + + + +AD Sbjct: 157 GRTSKEASEQLNQAVKEAKGSGIPIYTIGLNADGKLNKETLAELAKQTGGKSFTTSSADD 216 Query: 398 LIHVFQNISQLMVHRKYSVI 417 L + I K + Sbjct: 217 LPQILSEIFASHQQLKIVPV 236 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 70.7 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 78/233 (33%), Gaps = 45/233 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E+P + L+VD+SGSM+ K+ K AL L +D +D Sbjct: 184 TEQPPLNLTLLVDVSGSMNHED-------------------KLPLAKKALKLLIDKMDED 224 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ ++ Y +EP+ G+EK + + T ++ AY + Sbjct: 225 D------HISVVVYAGAAGTVLEPTKGSEKSK-IFAALDNLSAGGSTAGGEGLRLAYSLA 277 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKI 364 + + ++ LTDG+ N S+ + +E+ + + Sbjct: 278 EQNYDAASVN----------------RVMLLTDGDFNVGVTSDERLEDFVARKRESGVYL 321 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +++ + D+L + ++ + +++ Sbjct: 322 SVLGFGRGNYNDAMMQKIAQAGNG--MASYIDTLNEARKVLNDDLSGSMFTIA 372 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/274 (11%), Positives = 75/274 (27%), Gaps = 75/274 (27%) Query: 172 KIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 +V+ + + M I +D+S SM +D +++ Sbjct: 69 LVVALVNPKMGTTMETVKREGVDIVFAIDVSKSMDA---------------EDIAPSRLE 113 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLI 288 K + L + +G+I Y + + + ++ D + Sbjct: 114 KSKQIVRQILGGLGS-------DRVGIIAYAGSAFPQLPITTDYAAAKMFLQALNTDMIS 166 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A+ A D + + + + ++DGE++ Sbjct: 167 SQGTAIGDAIDLATTYYDDDNQTN------------------RVLFIISDGEDHEGNV-- 206 Query: 349 NTIKICDKAKENFIKIVTISINASP----------------------------NGQRLLK 380 I D+A E I+I TI + + L + Sbjct: 207 --ESIADEAAEKGIRIYTIGVGTEKGGPIPIKRNGVVQNYKKDQNGETVITKLDPNTLQQ 264 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + + Q++ + + ++ Sbjct: 265 IASEANGEYIEGNVTAKVAETVQDLLKNIEKTEF 298 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/385 (10%), Positives = 109/385 (28%), Gaps = 61/385 (15%) Query: 34 LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAK-----R 88 +++ G + +++ ++ + ++M + ++ + Sbjct: 140 IVVTGMRAELSQAEERQHKAKAIADRQRRMAEAQMAAKPMAAAPTAAVHADAYAPADILQ 199 Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT-HMANNRLDSSNNTIFYN 147 A + + + ++T ++ ++ + +L + Sbjct: 200 ATTREYRDRFNQVDDNPVIATRDNPFSTFSIDVDTAAYSFTRRLLNQGQLPPKDAVRIEE 259 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL----------RIEMGERPIFLIELVV 197 M Y Y L + S ++ G P + ++ Sbjct: 260 MVNYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNKLLHIGIQGYQLPAGHIPQSNLVFLL 319 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D+SGSM + +K+ +K ++ L L ++ + ++ Sbjct: 320 DVSGSM-------------------DEPSKLPLVKQSMELLLSTLKPED------TVAIV 354 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 Y +EP+ EK + + + T + AYQ+ ++ Sbjct: 355 VYAGAAGTVLEPTKVREKSK-ILAALHNLQAGGSTAGGEGLALAYQLAEANF-------- 405 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + II TDG+ N + + ++ + I + + Sbjct: 406 --------NPKGVNRIILATDGDFNVGQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQD 457 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHV 401 L++T + D+L Sbjct: 458 ALMQTLAQNGNG--TAAYIDTLSEA 480 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 78/248 (31%), Gaps = 52/248 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V + +P + +VD+SGSM+ K+ Sbjct: 180 VKVGIKAEDVLAAKQNQPAANLVFLVDVSGSMNSDD-------------------KLQLA 220 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K +L + + + LI Y E + + G + ++ + + Sbjct: 221 KASLKMLTKQLRAQD------TITLITYAGNTEVVLPATSG-NQTQKILNAIDNLSANGS 273 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-T 350 T+ A+K AYQ + K+ I+ LTDG+ N SNV Sbjct: 274 TNGEAAIKLAYQQAEENFKKQGIN----------------RILMLTDGDFNVGVSNVKDM 317 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVN--------ADSLIHV 401 + I ++ I + T+ +++ + + + + D + Sbjct: 318 LDIIRNNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEAKKVLIDEMSST 377 Query: 402 FQNISQLM 409 F +++ + Sbjct: 378 FNTVAKDV 385 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 67/239 (28%), Gaps = 44/239 (18%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + +V+D S SM D T+M A K A + Sbjct: 53 STSSNDATGGDSKVAVVLDASDSMAEKDTGDG-------------GTRMDAAKKAANDTI 99 Query: 240 DSIDLLSHVKEDVYMGLIGYT------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 D++ + Y + S G K + T Sbjct: 100 DTLADSAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDLEDKINGLEPKGYTP 159 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+K+A + L S ++ II ++DG + V + Sbjct: 160 IGNAIKKAAEELGSSG--------------------KRNIILVSDGIDTCAPPPVC--DV 197 Query: 354 CDKAKENFIK--IVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + I I T+ Q+ L+ S + + + ++L + +Q + Sbjct: 198 AEDIAGDGIDLAIHTVGFKVDDKAQKELECISEVSGGTYTSADDTEALTEALTDAAQRV 256 >gi|302917449|ref|XP_003052439.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] gi|256733379|gb|EEU46726.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] Length = 764 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 68/212 (32%), Gaps = 29/212 (13%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D+SGSM V + + + K+A +++++ Sbjct: 86 DIVLVIDVSGSMA-----GAAPVPGEETNESTGLSILDLTKHAARTIIETMNESD----- 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 +G++ + ++ + T + ++ ++ S+ + T+ + + ++ + K Sbjct: 136 -RLGIVTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWHGLLEGIKLFKNVKS 194 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ LTDG N+ + + I T Sbjct: 195 SNVPA-----------------IMVLTDGMPNHMNPAAGFVPKLRAMGQLPASIHTFGFG 237 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 238 YHLRSGLLKSIAEIGGGNYAFIPDAGMIGTVF 269 >gi|126178503|ref|YP_001046468.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] gi|125861297|gb|ABN56486.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] Length = 1002 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/378 (8%), Positives = 91/378 (24%), Gaps = 51/378 (13%) Query: 61 ILAGASKMVSNLSRLG--------DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 +LA AS + ++ + A R ++ + Sbjct: 402 VLAAASGVTDIDGYATVEFHPGAFTTDPDVAGWSDTATGKATVRATWGNVSRDIELTWKN 461 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + ++ V+ +++ + + + D +L ++ Sbjct: 462 YPYLSVETEVSPETVAVNDTVDVTIRLKGDGWALQPD-PIDVVLCTDRSGSMLKDYPDR- 519 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + +D S + ++ + + + + + Sbjct: 520 --------------------MVKAMDASRIFNVQIDHGRDRLGLVSFGGQGQTNIIQYSQ 559 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 N + YT ++ S + + + Sbjct: 560 NVGGWLGRDSGTWDDLSYRTANYGNNRYYTEYATLDLGLSDDQSAINTTIAGMI---PDS 616 Query: 291 PTDSTPAMKQAYQILTSDKKRS----------FFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + +A L + + N++ + + KF T + Sbjct: 617 GTPMRYGIYKAITELKDNGRSDAVKAVVVLGDGEYNYYGDPLARGTGRTTKFDWSNTQND 676 Query: 341 NNNFKSNVNTIK-ICDKAKENFIKIVTISINAS-----PNGQRLLKTCVSSPEYHYNVVN 394 F + + A + +++ IS + + + HY+ + Sbjct: 677 YTYFSDLGAAEQRMSTYANNHNVRLYMISFSDDIVNGSHTWNTMETLASETGGTHYHAPS 736 Query: 395 ADSLIHVFQNISQLMVHR 412 L V+ I+ + Sbjct: 737 DSDLAQVYTEIAGELKTE 754 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 58/197 (29%), Gaps = 9/197 (4%) Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVE 264 + + + Q TK+ ++SID + ++ + + Sbjct: 42 DQAKNFLDKTYKVNQISTITKIVEEHIDYKETINSIDRNGETIDIPMDDILDPFFCLKET 101 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 ++ + + ++ T ++ + ++LT + + G Sbjct: 102 NAKSLNFDPNS-KGDINEILNMKAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDN 160 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTC 382 N + T +C K K+N IK+V I I N K C Sbjct: 161 TQKMSSPH---DQKAGIINITQKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDC 217 Query: 383 VSSPEYHYNVVNADSLI 399 V + Y NA L Sbjct: 218 VGT-GNFYLAKNAHELE 233 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 69/236 (29%), Gaps = 58/236 (24%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I + + V+D SGSM K+ +++ +L Sbjct: 171 DIEQYAKNNSSIEAGIDLICVIDKSGSM--------------------SGQKIESVQQSL 210 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDS 294 + L+ + + LI + +++ TE ++Y T+ Sbjct: 211 VQLLNFLSEKD------RLCLIVFDGGAKRHTPLKTLTEGNKKYFKGAIAAISAGGSTNI 264 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+Q + K ++ T+ I L+DG+++ Sbjct: 265 AAGTDIAFQQIQQRKMKNQVTS----------------IFLLSDGQDSG---------AA 299 Query: 355 DKAKENF------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ ++ + I + + + K C Y + N L F + Sbjct: 300 ERIQKQKDRISDVVTIHSFGYGNDHDADLMSKICKVGQGSFYYIENVKLLDEFFAD 355 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 74/231 (32%), Gaps = 42/231 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + ++ +++ A K L F+ ++ Sbjct: 97 DVLMVVDLSGSMEEKDFATES---------GEQLSRLTAAKKVLRNFVTQ-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI + + + + + T+ A+ ++ Sbjct: 141 DRFGLILFGDAAFIQTPFTADQNVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS--- 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 PS + ++ LTDG + S V+ + A I+I I++ Sbjct: 198 -------------PSTSQDQIMLVLTDGNDTG--SFVSPVDAAKIAAAKGIRIYVIAMGD 242 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + ++ L +Q I + +KYS Sbjct: 243 PENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVI-DQLEPQKYS 292 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 KM + A LD + + ++ Y V + ++V+ + R Sbjct: 54 GDKMVHAREAAKQALDRLGAGD------MVSVVAYDDAVSLISPATDLTDRDRVKAAIDR 107 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + + + L + ++ L+DG N Sbjct: 108 I---QAGGSTALFSGISKGAEEL----------------RRNKRPNQVNRVVLLSDGMAN 148 Query: 343 NF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 S + ++ + I + T+ + N + + + S H + N+ +L + Sbjct: 149 VGPSSPQDLGRLGASLAKEGITVTTLGLGLGYNEDLMTELALRSDGNHAFIENSQNLAGI 208 Query: 402 FQNISQLMVHRKYSVILK 419 FQ + SV+ + Sbjct: 209 FQTEFGDI----LSVVAQ 222 >gi|148657120|ref|YP_001277325.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569230|gb|ABQ91375.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 774 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 43/143 (30%), Gaps = 20/143 (13%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T ++ + ++L + + II L+ Sbjct: 373 RAQAAIDTLNSRGATSIGGGLQSSQRML-----------------DTANPDLPRVIILLS 415 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG+ N + + + + TI + + Q +L + + + Sbjct: 416 DGQENT---RPFVADVLPQIRAAQTTVHTIGLGRDADQQLMLSIAAQTGGTYNYAPTPEQ 472 Query: 398 LIHVFQNISQLMVHRKYSVILKG 420 L ++ IS + +R+ V G Sbjct: 473 LSGIYNTISGAVSNRQTLVTASG 495 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 45/372 (12%), Positives = 107/372 (28%), Gaps = 42/372 (11%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV---FYN 115 AA+L ++ + K + R I + + + Sbjct: 102 AALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDGRPQDNV 161 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ +S + + + R+D + + DY + R Q+ Sbjct: 162 QDVSARARASGVELFAIGVGRVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQEAFC 221 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCA-MNSDPEDVNSAPICQDKKRTKMAALKNA 234 + G+ + + S + C + D + R + + Sbjct: 222 VV--SDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNGQWGGHRLALERVWRG 279 Query: 235 LLLFLDS--------------IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + +D L + +GL+ Y++ V + K Sbjct: 280 GDGASATPRNHSXXXXXXNQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKA 339 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + K T + A+K +F QK I TDG Sbjct: 340 AVRNMSYMEKGTMTGAALKY-----------LIDNSFTVSSGA--RPRAQKVGIVFTDGR 386 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + ++ ++ KAK+ K+ + + + L+ S P E+++ + ++ Sbjct: 387 SQDYIND-----AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTI 439 Query: 399 IHVFQNISQLMV 410 + + + + + Sbjct: 440 NQIGKKLQKKIC 451 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 59/185 (31%), Gaps = 29/185 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M ++K AL + + +GL+ ++ ++ I P+ + + R Sbjct: 119 MDSVKYALRTMVSLF------NNTIDIGLVEFSHSIKSAIPPTPNRSYIDMVIDRM---E 169 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ A L L F +F+TDG + + Sbjct: 170 AGGGTMYSFALSTALSWLRP----------------YRELNVSAFTVFITDGLPGDPQDY 213 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 + +K I I T+ I P G K S + + V + D L I+ Sbjct: 214 RPLLDEYNKL---GIPIYTVFIGEDPRGIDETKLIASKTGGEQFTVESIDRLSDTLNTIA 270 Query: 407 QLMVH 411 + Sbjct: 271 SKINT 275 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 49/275 (17%), Positives = 87/275 (31%), Gaps = 51/275 (18%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 V A + LV+D+SGSM M TK+ A Sbjct: 581 TVNVDVTGAASSSTITTTQPVDFTLVLDVSGSMRENM---------------GSVTKLQA 625 Query: 231 LKNALLLFLDSIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 L++A+ FLD ++ E V +GL+ + K I +K Y + Sbjct: 626 LQSAVNNFLDEAAKINKGAQSGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTY---NYSQ 682 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENN-- 342 ++ K T T +K LT+ F +K +IF TDG+ Sbjct: 683 IVKKLTADTDGLKNEVNKLTAGGATRADYGFQHAFTVMSEARTEAKKVVIFFTDGKPTSE 742 Query: 343 ---NFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--------LLKTCV-------- 383 + K + ++ + K++ + +I + N + Sbjct: 743 KTFDGKVANDAVEYAKQLKDSGAIVYSIGVFDGANPASTATSENKFMHAVSSNYPNAANY 802 Query: 384 ------SSPEYHYNVVNADSLIHVFQNISQLMVHR 412 S+ Y+ +A L +F+ I + Sbjct: 803 EDLSEGSNAGYYKTATDASGLNSIFEEIRKSETTT 837 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 29/245 (11%), Positives = 68/245 (27%), Gaps = 33/245 (13%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + +D+SGSM +++ A + AL LFL Sbjct: 79 DLVTQLPAKGIALVVALDVSGSMGAEDVVWT--------PGAPSVSRLEAARRALKLFLA 130 Query: 241 SIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDST 295 + +GL+ + E + + + T+ Sbjct: 131 GGAAPDGTAFDPRPGDAVGLVAFAAVPETVCPATLNHSVLFKVADALQPKGGADAGTNIG 190 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ +A L + ++S G + D + + + Sbjct: 191 DSLAEAVIRLDAADQKSRVLILLSDG------EHNILKEDVRDAQRPGIDRTLKPREAAQ 244 Query: 356 KAKENFIKIVTI---S---INASPNG--------QRLLKTCVSSPEYHYNVVNADSLIHV 401 A +++ TI + A P+ + L + + + L+ Sbjct: 245 LAANLGVRVYTIDAGGDPPLGAPPDAVAQRFAGRKALKDVAEMTGGKSFQATSGAELLSA 304 Query: 402 FQNIS 406 ++ IS Sbjct: 305 YREIS 309 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 74/231 (32%), Gaps = 42/231 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM ++ +++ A K L F+ ++ Sbjct: 97 DVLMVVDLSGSMEEKDF---------ATEAGEQLSRLTAAKKVLRDFVTQ-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI + + + + + + T+ A+ ++ Sbjct: 141 DRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQS--- 197 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 PS + ++ LTDG + S V+ + A I+I I++ Sbjct: 198 -------------PSTSQDQIMLVLTDGNDTG--SFVSPVDAAKIAAAKGIRIYVIAMGD 242 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + ++ L +Q I + +KYS Sbjct: 243 PENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVI-DQLEPQKYS 292 >gi|290960274|ref|YP_003491456.1| lipoprotein [Streptomyces scabiei 87.22] gi|260649800|emb|CBG72916.1| putative lipoprotein [Streptomyces scabiei 87.22] Length = 537 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 23/251 (9%), Positives = 66/251 (26%), Gaps = 46/251 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + + ++ A +RP + V+D+SGSM Sbjct: 152 FSVTVDGARTGPGNWSLVRVGLATRAAGDR---QRPPAALTFVIDVSGSMAE-------- 200 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ ++AL + + + ++ ++ E + + Sbjct: 201 -----------PGRLDLAQDALRTMTNRLRDDDS------VAVVTFSDEAETVLPMTRLD 243 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T+ ++ Y+ + Sbjct: 244 GNRGEIREAVAGLEPTDSTNLAAGVETGYETAVEGL----------------RKGATNRV 287 Query: 334 IFLTDGENNNFK--SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + L+D N ++ +I + +E+ I + + + + + + + Sbjct: 288 VLLSDALANTGSTDADTILERIAGERREHGITLFGVGVGSDYGDALMEQLADKGDGHTTY 347 Query: 392 VVNADSLIHVF 402 V + VF Sbjct: 348 VSTEEEAEKVF 358 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 69/190 (36%), Gaps = 45/190 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +D+S SM + +D ++ K + ++++ Sbjct: 91 DVVFAIDVSKSM---------------LAEDIAPNRIEKSKQLVTQIINNLGS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + ++++ D L + T A++ A +++ Sbjct: 129 DRIGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ +S+ I ++A + I+I TI + Sbjct: 189 TN------------------RVLFIISDGEDHVGESSN----IAEQANDEGIRIFTIGVG 226 Query: 371 ASPNGQRLLK 380 S G LK Sbjct: 227 KSEGGPIPLK 236 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 95/352 (26%), Gaps = 53/352 (15%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A A K + ++ +L + +++ K + I Sbjct: 52 AAAVKEYAGAPAYAALEKTARIAPAPSLPQNTEKYGKIESNPVQAVARNPVSTFSIDVDT 111 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSY--DYRLQFIEHLLNQRY-------NQKI 173 S + + +NRL ++ + Y + + K+ Sbjct: 112 GSYANARRFLNDNRLPPAHAVRIEELINYFDYGYAPPSDGKPFAVYTETIDSPWQADAKL 171 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + I P + +VD+SGSM K+ +K Sbjct: 172 IKIAIKAKEIRSSALPPANLVFLVDVSGSMQAQD-------------------KLPLVKK 212 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 L + + + LI Y + + + P+ G +K + T Sbjct: 213 TLRILTKRLRAED------KVTLITYASNEKLVLPPTSGKDKD-TILQAINQLEAGGSTA 265 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIK 352 A++ AY ++ I+ TDG+ N Sbjct: 266 GEQALQMAYAQAQKAYIKNGIN----------------RILLATDGDFNVGITDFNTLKD 309 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ 403 + ++ I + T+ ++L++ + + + N V Q Sbjct: 310 TVAEKRKAGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNETEAKKVLQ 361 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 65/231 (28%), Gaps = 42/231 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D + ++ A + A F+ + V +G++ + P+ Sbjct: 100 DVSGSMRATDVEPNRLVASQVAAKNFVKDL------PRHVRVGVVSFAGTAAVVQAPTHS 153 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI-------- 324 + V + R T + + L + + + Sbjct: 154 RDDVFAAIDRFQLQR---GTAIGSGIVLSLATLFPEAGIDLSDITGERRMPKGIGESDKQ 210 Query: 325 -------PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---- 373 P +I LTDG+ + A + +K+ T+ + Sbjct: 211 DDFKPVAPGSYGSAAVILLTDGQRTTGPD---PMDAAKMAADRGVKVYTVGFGTTSGEII 267 Query: 374 -----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + L + ++ +A L V+ +S +V + Sbjct: 268 GFEGWSMRVRLDEATLKNIANLTQAEYFYAGSATDLQKVYDTLSSRLVFER 318 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 35/240 (14%), Positives = 72/240 (30%), Gaps = 44/240 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V R+ + ER + +VD SGSM Sbjct: 275 FTSGHHLVRVAVQGKRVPVKERTPVHLVYLVDTSGSMQ-------------------SPD 315 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K +L + D++ + L Y V + + P+ K + + D Sbjct: 316 KIELAKKSLKMLTDTLKPGD------TVALCTYAGSVREVLAPTGIESKGK-ILAALADL 368 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-K 345 T + + AY + + +I L+DG+ N Sbjct: 369 TAGGSTAMSSGIDLAYSL----------------AERTLVKGHVNRVIVLSDGDANVGPT 412 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 S+ +K +A++ I + T+ +++ + + + + VF Sbjct: 413 SHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSE 472 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 49/363 (13%), Positives = 113/363 (31%), Gaps = 69/363 (19%) Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE-------SLSGYSAVFYNTEIQNI 121 ++ E+ A + + K + + SL+ + + Sbjct: 269 AEVITEGDSIIENNEAFAFSNVKGEPKVLVVEDEQGQGKNLIDSLTASGLSVDVIPTELL 328 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-IEH-LLNQRYN--------Q 171 + + + ++ ++ + F + + Sbjct: 329 PTVLSGYVQYQSIVFVNVPATSVTQQQMDAI-HSAVKDFGTGFIMAGGDSSFGLGGYFKT 387 Query: 172 KIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 I +P + ++ + P + +V+D SGSM K+ Sbjct: 388 PIEKLLPVDMDLKGKKELPSLGMVIVLDRSGSMA--------------------GYKIQL 427 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K A I ++E +G I + R + I+ +K + + + Sbjct: 428 AKEAA------IRSAELLREKDTLGFIAFDDRPWQIIDTEPIKDK-EKVIEKINGLTSGG 480 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ P+++ AY+ LT +K II LTDG++ + T Sbjct: 481 GTNIFPSLELAYEQLTPL------------------ELQRKHIILLTDGQSATSPDYLTT 522 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I + KEN I + T++I + L + Y+V ++ ++ + + ++ Sbjct: 523 I---QEGKENNITLSTVAIGEGSDSVLLEELSDEGGGRFYDVNDSSTIPSILSR--ETVL 577 Query: 411 HRK 413 + Sbjct: 578 TTR 580 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 72/235 (30%), Gaps = 35/235 (14%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + R + V+D+SGSM+ D +AP + + +++ LK A Sbjct: 63 LVRVEAPARPEARIPIDVVAVLDVSGSMN--------DPAAAPTERTRTTSRLDLLKTAA 114 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPT 292 + ++ + ++ ++ R + + + R + T Sbjct: 115 KFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEARGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 PA ++A ++L + F I+ LTDG + S + + Sbjct: 169 ALVPAFEEAVKVLDGRQGDGGDRLGF--------------IVLLTDGAEDASGSFTLSER 214 Query: 353 ICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSLIHVFQ 403 + + + + + + LL S + V N + Sbjct: 215 RREVIRGALGRYPVHAFGLGTAHGPEVLLYLAQESRGTYSFVDGDNVGEVAGALA 269 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/410 (9%), Positives = 100/410 (24%), Gaps = 106/410 (25%) Query: 23 SIIFALSVMSFLLLIGFL------IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 +I+ L L G + V ++ +A + A + + Sbjct: 8 AIMPLLCGFILLAACGGAPPPASQVDV----------QATIDAGVRATLAAQPTEAPSPT 57 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 S + A + + T ++ + + Sbjct: 58 TPPPSPTAVPPTATTAPTSPPVAPTVAPFTPATP-----TATGADAPTTVPESSSPPTDT 112 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 V T+ Sbjct: 113 TTIFRPAEGEAAQVTTNIQLVFDA------------------------------------ 136 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 SGSM + + TK+ A + A+ +D++ + + Sbjct: 137 ---SGSMAQRIGGE---------------TKIQAARRAMERIIDTLPDNPDLNVGFRVFG 178 Query: 257 IGYTTR-------VEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + + + + T + A+++A + + Sbjct: 179 HEGDSSEAQKARSCQSTALLVPMQGVNKALLRQQAQAWQPTGWTPISLALQRAGEDFQAG 238 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + II +TDGE K +++ ++I + Sbjct: 239 EN------------------VRNVIIMVTDGEETCGGDPCAVAKALAESQAE-VRIDVVG 279 Query: 369 INASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN-ISQLM--VHRKY 414 +P+ + L+ +S + + N D+L+ + I+ + ++ Sbjct: 280 FGTTPDVAKTLRCIAENSGGVYTDAQNGDALVQTLEELIAATLKRSTLRF 329 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 80/243 (32%), Gaps = 49/243 (20%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +R + LV+D SGSM K+A K A + Sbjct: 33 SEAPASADRVPLNLSLVIDRSGSMA--------------------GDKIAYAKKAAQFIV 72 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D++ V ++ Y VE + K ++ R T+ + M Sbjct: 73 DNLSPEDRVS------IVQYDDIVEVLSPSAPVLNK-QELRQRIALMEARNMTNLSGGML 125 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--ICDKA 357 Y+ + K+ F ++ L+DG N+ ++ ++ + +K Sbjct: 126 AGYEQVERTKQARFVN----------------RVLLLSDGLANHGITDPTVLQQMVQEKF 169 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + I + T + A N + +Y + + D + +F ++ + +V+ Sbjct: 170 RNAGIAVSTFGVGADFNELLMTSLSEYGGANYYFIESPDKIPGIF---AEELRGL-LAVV 225 Query: 418 LKG 420 +G Sbjct: 226 AQG 228 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 50/381 (13%), Positives = 105/381 (27%), Gaps = 75/381 (19%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RL 137 + + + + + S + S + ++ L Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + +Y L + E+ I LV+ Sbjct: 62 SAPAREKTVTANEDGTYTVALNVTG--------------AKSAGTGEIVTNQPLDIVLVL 107 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVY 253 D+SGSM + S + TK+ +LK A+ F++ ++ + Sbjct: 108 DVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L+ + + ++ + E Y + ++ T + L++ S Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGLTSTVNGLSASGATSA 213 Query: 314 FTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-----CDKAKENFIKIV 365 F R P +K +IF TDGE N+ T+ K+ I Sbjct: 214 DYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIY 273 Query: 366 TISINASPNG----------------------------------QRLLKTCVSSPEYHYN 391 +I + + N L + Y+ Sbjct: 274 SIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKA 333 Query: 392 VVNADSLIHVFQNISQLMVHR 412 +A L ++F++I Q + Sbjct: 334 ATDAGQLNNIFESIYQEITKT 354 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 50/381 (13%), Positives = 105/381 (27%), Gaps = 75/381 (19%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RL 137 + + + + + S + S + ++ L Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + +Y L + E+ I LV+ Sbjct: 62 SAPAREKTVTANEDGTYTVALNVTG--------------AKSAGTGEIVTNQPLDIVLVL 107 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVY 253 D+SGSM + S + TK+ +LK A+ F++ ++ + Sbjct: 108 DVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L+ + + ++ + E Y + ++ T + L++ S Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGLTSTVNGLSASGATSA 213 Query: 314 FTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-----CDKAKENFIKIV 365 F R P +K +IF TDGE N+ T+ K+ I Sbjct: 214 DYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIY 273 Query: 366 TISINASPNG----------------------------------QRLLKTCVSSPEYHYN 391 +I + + N L + Y+ Sbjct: 274 SIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKA 333 Query: 392 VVNADSLIHVFQNISQLMVHR 412 +A L ++F++I Q + Sbjct: 334 ATDAGQLNNIFESIYQEITKT 354 >gi|260459671|ref|ZP_05807925.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259034473|gb|EEW35730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 718 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 35/361 (9%), Positives = 100/361 (27%), Gaps = 50/361 (13%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A SK+++ + I+ + + H + Sbjct: 219 AAESKLMTPQQPATAPADQIAPQEENRDRVQDFKTNPVHAALEDPVSTFSIDVDTASYSF 278 Query: 123 NSSRISMTHMAN-NRLDSSN--NTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIP 178 + + + + N Y+ S + + + Sbjct: 279 VRRSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTPFNSTVSVMPTPWNTHTKLMHVA 338 Query: 179 ALL-RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 ++ E+P + ++D+SGSM K+ LK+A L Sbjct: 339 IKGFDVKPTEQPKANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRL 379 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + ++ Y + P+ EK + + + T Sbjct: 380 LVSKLKADD------TISIVTYAGDAGTVLMPTKIAEKDK-ILNAIDNLQPGGSTAGEAG 432 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDK 356 +K+AY++ + ++ TDG+ N + + + ++ ++ Sbjct: 433 IKEAYKLAQQSFIKDGVN----------------RVMLATDGDFNVGQTDDDDLKRLIEQ 476 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 ++ + + +++T + D+L + + + +++ Sbjct: 477 ERKTGVFLSVFGFGRGNLNDEMMQTIAQNGNG--TAAYIDTLAEAEKVLVEDASSTLFTI 534 Query: 417 I 417 Sbjct: 535 A 535 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 50/381 (13%), Positives = 105/381 (27%), Gaps = 75/381 (19%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RL 137 + + + + + S + S + ++ L Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + +Y L + E+ I LV+ Sbjct: 62 SAPAREKTVTANEDGTYTVALNVTG--------------AKSAGTGEIVTNQPLDIVLVL 107 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVY 253 D+SGSM + S + TK+ +LK A+ F++ ++ + Sbjct: 108 DVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L+ + + ++ + E Y + ++ T + L++ S Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGLTSTVNGLSASGATSA 213 Query: 314 FTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-----CDKAKENFIKIV 365 F R P +K +IF TDGE N+ T+ K+ I Sbjct: 214 DYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIY 273 Query: 366 TISINASPNG----------------------------------QRLLKTCVSSPEYHYN 391 +I + + N L + Y+ Sbjct: 274 SIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKA 333 Query: 392 VVNADSLIHVFQNISQLMVHR 412 +A L ++F++I Q + Sbjct: 334 ATDAGQLNNIFESIYQEITKT 354 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 23/179 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ I D L +++ +GL+ Y++ V + T K + + K T Sbjct: 36 VKKFINQIVDSLDVSEQNAQVGLVQYSSSVRQEFPLGRFTSKRDIKAAVKKMTYMEKGTM 95 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ F QK I TDG + ++ S+ Sbjct: 96 TGTALNY-----------LIDNTFAISSGA--RPGAQKVGIVFTDGRSQDYISD-----A 137 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 KAK+ K+ + + + L+ S P E+++ + ++ + + + + + Sbjct: 138 AKKAKDLGFKMFAVGVG--NAVEDELREIASDPVAEHYFYTADFKTINQIGKKLQKKIC 194 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 67/239 (28%), Gaps = 44/239 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ PA + ++ +RP + V+D SGSM Sbjct: 20 TTTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSM--------------------SGL 59 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + K A + + V ++ + RV+ + S + Sbjct: 60 PLQMAKQAAIAAVRQARPDDRVS------VVAFDDRVDVIVP-SQLATSREAVIQAIGTI 112 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + + + + +I L+DG+ N + Sbjct: 113 DDRGSTNLHGGWLEGATQV----------------AQHLTPGALNRVILLSDGQANVGVT 156 Query: 347 NVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I E I TI + + + + LL + +V + L F+ Sbjct: 157 DRREIARQVRGLTERGISTTTIGLGSHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEE 215 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 74/217 (34%), Gaps = 37/217 (17%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 D S N+ + V + + +K + +D +D+ ++GL Sbjct: 758 TDDGRSCRACSNAATDVVFLIDGSKSVRPENFELVKKWINQIIDKLDVSD---NKAHVGL 814 Query: 257 IGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + Y++ V++ + +++ V + + + T + A++ LT + Sbjct: 815 VQYSSAVKQEFPLGRYNNKKDLKEAVKKMA--YMERGTMTGQALRY----LTDNSFG--- 865 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + K I TDG + ++ + KAK+ K+ + + Sbjct: 866 ------PGQGARPGVTKVGIVFTDGRSQDYIGD-----AAKKAKDQGFKMYAVGVG--NA 912 Query: 375 GQRLLKTCVSSP--EYHYNVVN--------ADSLIHV 401 + LK S P E+++ + D L + Sbjct: 913 VEDELKEIASEPTAEHYFYTADFKTDGQRREDELKEI 949 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 56/161 (34%), Gaps = 19/161 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ L +G++ Y +RV+ + K + T + A++ A Sbjct: 571 IEGLDVGPNATRVGVVNYASRVKNEVSLKTHRTKAGLIKAVTKIEPLSTGTMTGLAIQFA 630 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + S+ +G ++ S K I +TDG + N + +A++ Sbjct: 631 MNVAFSEA----------EGARLRSPDISKVAIVVTDGRPQD-----NVKDVAQRARDAG 675 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 I+I I + L+ S P ++ V + + Sbjct: 676 IEIFAIGVGRV--EMSTLRQMASDPLDDHVDYVESYSVIEK 714 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 50/381 (13%), Positives = 105/381 (27%), Gaps = 75/381 (19%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN-RL 137 + + + + + S + S + ++ L Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + +Y L + E+ I LV+ Sbjct: 62 SAPAREKTVTANEDGTYTVALNVTG--------------AKSAGTGEIVTNQPLDIVLVL 107 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD----SIDLLSHVKEDVY 253 D+SGSM + S + TK+ +LK A+ F++ ++ + Sbjct: 108 DVSGSMAEKIASGW-----------NQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L+ + + ++ + E Y + ++ T + L++ S Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSY---NYTQIVSNLTYDVSGLTSTVNGLSASGATSA 213 Query: 314 FTNFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI-----CDKAKENFIKIV 365 F R P +K +IF TDGE N+ T+ K+ I Sbjct: 214 DYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIY 273 Query: 366 TISINASPNG----------------------------------QRLLKTCVSSPEYHYN 391 +I + + N L + Y+ Sbjct: 274 SIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGDRAETSSYYKA 333 Query: 392 VVNADSLIHVFQNISQLMVHR 412 +A L ++F++I Q + Sbjct: 334 ATDAGQLNNIFESIYQEITKT 354 >gi|330469087|ref|YP_004406830.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812058|gb|AEB46230.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 316 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 64/234 (27%), Gaps = 53/234 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L +D+S SM T++ + A F+ + E Sbjct: 87 ATIMLAIDVSLSMQADDVP---------------PTRLEGAQEAAKQFVREL------PE 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + P+ V + + + T + A+ Sbjct: 126 TYNVGLVSFAKSANVLVPPTKDRPAVTNAIDGLVLAEA---TATGEAVFT---------- 172 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ L+DG +S + A+ + + TI+ Sbjct: 173 --CLEAIRSVPADGAAGIPPARIVLLSDGFRTAGRS---VEEAAAAAQAANVPVSTIAFG 227 Query: 371 ASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L ++ Y Y + L V+Q++ + Sbjct: 228 TDAGHVAIGGQLQRVPVDRMALAALAETTEGYFYEAASVSELKQVYQDMGSSIG 281 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 60/178 (33%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R + + + T++ Sbjct: 59 QWVANLVDTFEIGPDKTRVGVVRYSDRPTTEFDLGRYQTREQIKEAAKNIKYYGGNTNTG 118 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + ++ G + +K I LTDG + + + Sbjct: 119 DALRY----------INTYSFSEEAGGRPTDSAIKKVAILLTDGRSQDHVLDP-----AT 163 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A + I+I + + + + L S P+ + ++V + +++ + + + + Sbjct: 164 AAHKAGIRIFAVGVGEALKEE--LDEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCE 219 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 69.9 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 62/178 (34%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 67 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSREAVKAAARHLAYHGGNTNTG 126 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + S + G + F++ I LTDG + + + Sbjct: 127 DALRFITRHSFSRQ----------AGGRPGDRAFKQVAILLTDGRSQD-----LVLDAAA 171 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A I+I + + A+ + L+ S P+ + ++V + D++ + + + + Sbjct: 172 TAHRAGIRIFAVGVGAALREE--LEEIASEPKSAHVFHVSDFDAIDKIRGKLRRRLCE 227 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 53/191 (27%), Gaps = 10/191 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++I AL+ + + L+G I + ++ A +A ++AG + + +S Sbjct: 30 FHQDRGGTVTVIVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKLVVS 89 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + D AK + + + S + ++ + Sbjct: 90 NTASVIGLTTQTIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQTVHL-----SFGPFVG 144 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + S + M L +K + + Sbjct: 145 MSESKVSAKAKASVVGKMRLCMLALDPAA--AGAFNLEKSAQVTAYDCALYSNSSNSGGM 202 Query: 194 ELVVDLSGSMH 204 V + SM Sbjct: 203 ---VGRNNSMA 210 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V +G + Y+ E + + T + Sbjct: 866 MNLTIHLVKKADVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYT 925 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + F + + ++ +I +TDG +++ + Sbjct: 926 AKALQHS------------NVLFTEEHGSRLTQNVRQLMIVITDGVSHD---RDKLDEAA 970 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + + + + Sbjct: 971 RELRDKGITIFAVGVG--NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ID+ V +G++ Y+ + E S T+ + + T + A+ Sbjct: 501 IDMFPIGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKAL--- 557 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G + ++I LTDG++N+ ++ ++ + Sbjct: 558 ----------DFILPLIKKGKTERTDRAPCYLIVLTDGKSNDS-----VLEPANRLRAEQ 602 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I I I + L+ E N DSL + I + K Sbjct: 603 ITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNEIVHRICSEK 652 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 22/170 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ ++ + + +G++ ++ + + + + Y D S I + Sbjct: 677 ETMKTFMKNLVGKIQIGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRN 736 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ + K +KF+I LTDG+ + Sbjct: 737 TLTGGALTFVNEYFDLS--------------KGGRPQVRKFLILLTDGKAQD-----EVG 777 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 778 GPATALRSKSVTIFSVGV-YGANRAQ-LEEISGDGSLVFHVENFDHLKAI 825 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 79/236 (33%), Gaps = 30/236 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V+D S SM ++ D +D+ ++ + +AL + + Sbjct: 1516 ATVTTKYPVDLVFVIDKSLSMDYDIDGDEIKW-----WEDETESRKDIVNDALDEIIPDL 1570 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + D+ + ++ + ++ S + +Q + S T+ + A+ A Sbjct: 1571 CSQQY---DIQIAGYQFSGSSTRVLDWSR---EEQQVLNNLKISNTSYNTEPSQALADAL 1624 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +L + + +K++IF+TDGE + + Sbjct: 1625 DMLKTGSQ-----------AHQNQSNVKKYLIFMTDGEPTESEELSYYAISKNPV--PGA 1671 Query: 363 KIVTISINASPNGQRLLKTCVSSPEY------HYNVVNADSLIHVFQNISQLMVHR 412 I TI +++ + + ++ + +A + F I ++ Sbjct: 1672 SIYTIGVSSDASTDLMEGIRSTAEGNGMTAPATFKGTSAQLIKDAFTQIKDEIIST 1727 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 85/266 (31%), Gaps = 49/266 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIF--LIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + G++ + V+D SGSM + S D Sbjct: 1076 NGNGTYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDA-------- 1127 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHV------KEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + +AL LF + + + K + + ++ T Sbjct: 1128 ----RREVVNSALELFFNQLSDGDYNIQFGGYKFSDSGERVNFNDWGWQDKYWETDTSNA 1183 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + T + ++ A L + + ++++IFL Sbjct: 1184 LSHL-KLTSWETDGSTYPSQTLRSAISALENVELGENG---------------KRYLIFL 1227 Query: 337 TDGEN--NNFKSNVNTIKIC-DKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYH-- 389 TDGE N++ + + C K ++ I + A+ + +++ VS+ + Sbjct: 1228 TDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQV-ANSDSHGFMESMVSNANFVDG 1286 Query: 390 -----YNVVNADSLIHVFQNISQLMV 410 + +AD L F ++ + Sbjct: 1287 VTAQKFVGNSADELNAAFSQMAAEIS 1312 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 42/366 (11%), Positives = 107/366 (29%), Gaps = 55/366 (15%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 AA ++ ++ + R + S + + + + + Y Sbjct: 54 MEAAAMSKMARYSPIMPMPMPRQQDNSEKYQHLAANPIHQTATDAVATFSIDTDTGSYAN 113 Query: 117 EIQNIVNSSRISMTHMANNRL-DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + + N + L + N + + + N+ I Sbjct: 114 VRRFLNNGQLPPTDAVRIEELINYFNYDFSQAKRLANAPFLVSTETVAAPWRTANRIIKV 173 Query: 176 FIPALL--RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 I A + P + +VD+SGSM K+ +K+ Sbjct: 174 AIKADDPTITKQSTLPPANLVFLVDVSGSMSDND-------------------KLPLVKS 214 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 +L + + + ++ Y R + + + G++ + + T+ Sbjct: 215 SLKMLTKQLRPQD------TISIVTYAGRTQVTLPATRGSDTDK-ILAAIDSLDASGSTN 267 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIK 352 A+K AYQ K+ I+ +TDG+ N S+ + Sbjct: 268 GEAAIKLAYQQAKIHYKKDGIN----------------RILMMTDGDFNVGVSDVDEMLD 311 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVN--------ADSLIHVFQ 403 I + +++ + + T +++ + + + + D + F Sbjct: 312 IIRRERDSGVSLSTFGFGEGNLNDHMMEQVADNGNGNYSYIDSLSEAKKALVDEMSATFN 371 Query: 404 NISQLM 409 +++ + Sbjct: 372 TVAKDV 377 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 75/245 (30%), Gaps = 60/245 (24%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I + D+S SM +++ +M A K L+ F+ + + Sbjct: 96 IVVAFDVSLSMRIRDF----------YIGNRQVDRMTAAKRVLVDFIK-------GRPND 138 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + P+ + + + R ++ T + A + L + + Sbjct: 139 RIGIVAFGGAPYNPCPPTLDHDWLLNNMDRIQTGIMEDGTAIGSGIAAAARRLDQLEVK- 197 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K I+ +TDG NN+ K + A I+I ISI Sbjct: 198 -----------------SKVILLMTDGANNSGKLS--PQDAARLAATLGIRIHAISIGTP 238 Query: 373 P-------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ----LM 409 + + L + + + +L +F+ + + + Sbjct: 239 GMHPIYMPNGPPINSGRQEFDPETLQEVANIGSGSFFRAEDLSTLERIFKTVDEMERTEI 298 Query: 410 VHRKY 414 RK Sbjct: 299 ERRKI 303 >gi|156405834|ref|XP_001640936.1| predicted protein [Nematostella vectensis] gi|156228073|gb|EDO48873.1| predicted protein [Nematostella vectensis] Length = 250 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 75/220 (34%), Gaps = 21/220 (9%) Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 S P DV + + I+ + +E ++G+I Y+T+ E + Sbjct: 25 CSKPSDVALLIDASGSIGRR--RWPKVVEFTQAIINSFNVSEEGSHVGIILYSTKTELLV 82 Query: 268 EPSWGT------EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + + + V T A+K A + L F +G Sbjct: 83 KFNTFQGSELTADNINAKVAAVNYRDWGGLTYIDRALKLANEQL-----------FSPEG 131 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-AKENFIKIVTISINASPNGQ-RLL 379 S K + TDG+ K ++I + K+ +++ + I + + Sbjct: 132 GMRASKDILKVAVVFTDGKQTKDKGPFTELQIASQPLKDKDVQVYGLGIGDETTIDVQEM 191 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + PE + L ++ I+Q + KY +++ Sbjct: 192 QEMANKPENVLTAKTFEELKNLAAQITQGVCEIKYQYVIQ 231 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 32/267 (11%), Positives = 79/267 (29%), Gaps = 57/267 (21%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V+ + + I L +D+SGSM ++AA + Sbjct: 70 LVAVARPSALVTL-PVTDQTILLAMDISGSMRATD---------------IAPNRLAAAQ 113 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A F++ + D +G++ + P+ + V + R T Sbjct: 114 AAARSFVEVL------PSDTRVGVVAFAATAALIQAPTRNHDDVLAAIDRVQLQR---GT 164 Query: 293 DSTPAMKQAYQILTSDKK---------RSFFTNFFRQGVKIPSLPFQKF-----IIFLTD 338 M + L + S + G + I+ LTD Sbjct: 165 AIGSGMVLSLATLLPEAGIDLRLLAGDGSPPADKPPPGEPTHAPVPPGSHAYGAIVLLTD 224 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQRLLKTCV 383 GE ++ ++ +++ T+ + + + L Sbjct: 225 GERTTGPPLDFATRLAA---DHGVRVYTVGVGTAEGGVVGYEGWSMRVRLDEAALKSIAD 281 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMV 410 + +++ +A++L +++ + + Sbjct: 282 ETRGEYFHAQSAEALRTIYRKLGTRLT 308 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 69/238 (28%), Gaps = 49/238 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D ++ A +NA FL + +V +G++ + P+ Sbjct: 97 DVSGSMRATDVLPNRLVASQNAAKAFLADL------PRNVRVGVVAFAGTAAVVQPPTVS 150 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI-------- 324 E + + + T + + L + + + K Sbjct: 151 REDLTAAIDKFQLQR---GTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSLDQAG 207 Query: 325 --------------PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 P II LTDG+ + + A + I++ T+ + Sbjct: 208 KDDGNGKKAFTPVAPGSYTSAAIILLTDGQRTTGIDS---LDAAKVAADRGIRVYTVGVG 264 Query: 371 ---------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L ++ ++ A L V+Q +S + K Sbjct: 265 TVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTLSSRLTVEK 322 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 72/252 (28%), Gaps = 48/252 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ P + P + L++D SGSM Sbjct: 260 NEDGTLLLLASPTIETTPNKTPPAKTVVLILDRSGSM--------------------SGK 299 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDM 284 K+ + A+ +++++ LI Y VE E + + Sbjct: 300 KIEQARAAMKFVVENLNQDD------LFNLILYDDTVEMFKPELLRCNAENRAEALRFIE 353 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 TD ++ +++ + + ++IFLTDG + Sbjct: 354 GVRPGGSTDIDQGLRAGLKLIADESR-------------------PNYVIFLTDGLPTSG 394 Query: 345 KSNVNTIKICDKAKE-NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++N I +A K+ + N + L + + + V D+L Sbjct: 395 ETNELKIAEAARAANPLKAKLFVFGVGYDVNARLLDRLSGENGGVSFYVKPDDNLEVAVS 454 Query: 404 NISQLMVHRKYS 415 + + + Sbjct: 455 RFYERISTPALT 466 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 16/158 (10%), Positives = 50/158 (31%), Gaps = 25/158 (15%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 M ++ Y + + ++ + D T+ + + A L ++K Sbjct: 585 MMIVSYDNEAYLEQSLTSRSGTLKNSIAAISD---GGGTNISAGLNLALDNLEAEKGS-- 639 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + +I ++DG++ + + D+A + I + T+ Sbjct: 640 -----------------RAVILMSDGQDGGSEED--MQAATDRAAKLGISVYTVGFG-EC 679 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + L ++ + + +V+ Sbjct: 680 DDAYMQAIAEVTGGKFVKASASTELSDIYLYLQKYIVN 717 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 66/233 (28%), Gaps = 44/233 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D ++ A + A F+ + V +G++ + + P+ Sbjct: 93 DVSGSMRAADVLPNRLVAAQEAAKSFIKDLPRT------VKVGIVAFAGSAQVAQLPTTN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK--------- 323 + + + T + A+ + L D + Sbjct: 147 HDDLVTAIDSFQLQRA---TATGNAIVVSLATLFPDAGIDVEQFSAPSRQRGTPIDQTEK 203 Query: 324 --------IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-- 373 P II LTDG+ + A + ++I T+ + Sbjct: 204 KLKDFTPVAPGSFTSAAIIMLTDGQRTTGVD---PLDAAKAAADRGVRIYTVGVGTVDGE 260 Query: 374 -------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + ++ A+ L V++ +S + K Sbjct: 261 TIGFEGWSMRVRLDEETLKAVANKTNAEYFYAGTANDLKKVYETLSSKLTVEK 313 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 71/238 (29%), Gaps = 31/238 (13%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-AALKNALLLFLD 240 + F + + + C D V +++ +KN F+ Sbjct: 8 SVATTATLFFCLCFSSFKAETPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERFV- 66 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + I +++R E + + ++ + + + T +K Sbjct: 67 --------SPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKL 118 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + + K I+ LTDG+ + TI D A++ Sbjct: 119 ATEQM-----------------KKEPKKSSSIIVALTDGKLETYIH-QLTIDEADSARKY 160 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHRKYSVI 417 ++ + + + L S E + V +L + +I + +V Sbjct: 161 GARVYCVGVKD--FDEEQLADVADSKEQVFPVKGGFQALKGIVNSILKQSCTEILTVE 216 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 71/238 (29%), Gaps = 31/238 (13%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-AALKNALLLFLD 240 + F + + + C D V +++ +KN F+ Sbjct: 8 SVATTATLFFCLCFSSFKAETPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERFV- 66 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + I +++R E + + ++ + + + T +K Sbjct: 67 --------SPNLRVSFIVFSSRAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKL 118 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + + K I+ LTDG+ + TI D A++ Sbjct: 119 ATEQM-----------------KKEPKKSSSIIVALTDGKLETYIH-QLTIDEADSARKY 160 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHRKYSVI 417 ++ + + + L S E + V +L + +I + +V Sbjct: 161 GARVYCVGVKD--FDEEQLADVADSKEQVFPVKGGFQALKGIVNSILKQSCTEILTVE 216 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 51/152 (33%), Gaps = 6/152 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ + + +++ L +I V +++A +A LAGA Sbjct: 2 LSRVVRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + +I A A F SL + +V YN + Sbjct: 62 ELDGRDDAITRARTAIEKIANSA------AFSGGGTGMSLGSHISVVYNAGNDAGSTVTV 115 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + ++ N + +++ + SY + + Sbjct: 116 LFLKNIPANDDTAIPSSMQTTVASEASYAWVI 147 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 72/245 (29%), Gaps = 49/245 (20%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 R + +S E ERP + V+D+SGSM Sbjct: 33 GARESSAYISCRLTAPDFEPVERPAIDLVAVIDVSGSMA--------------------G 72 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K+ +++ L + ++ L+ + + V+ + T ++ D+ Sbjct: 73 QKLKMVQSTLEFLMRNLKDTD------RFALVTFDSDVKTVFDLRPMTTAHKEACLADVQ 126 Query: 286 SLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 L T+ + + + +++ I+ +TDG N Sbjct: 127 KLRAGSCTNLSGGLFRGVELMQQRGAT---------------KGAVSSILLMTDGIANEG 171 Query: 345 KSNVNTIKICDKAK-----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + +C + I T N L + + +Y + + D + Sbjct: 172 VRDK--DDMCRALRGLMGPAPDYTIYTFGYGKDHNENMLRQLSETGNGMYYFIESNDIIP 229 Query: 400 HVFQN 404 F + Sbjct: 230 ESFGD 234 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 10/96 (10%), Positives = 31/96 (32%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K +I L+++ + + V + K +++ ++A L+ A + S Sbjct: 12 RAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDSGSDHA 71 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + ++D + + Sbjct: 72 AAILAGNAAINNNLILDGFGSMTIDDTDIHYEFSES 107 >gi|297567411|ref|YP_003686383.1| hypothetical protein Mesil_3037 [Meiothermus silvanus DSM 9946] gi|296851860|gb|ADH64875.1| conserved hypothetical protein [Meiothermus silvanus DSM 9946] Length = 351 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 27/247 (10%), Positives = 72/247 (29%), Gaps = 48/247 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D ++ A + F+ +GLI ++ + P+ Sbjct: 93 DASKSMLAGDLNPNRLEAARAIAKEFVRLAPAT------TQIGLITFSDSASVVVAPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN---------------F 317 +++ + ++ T A+ ++L K+ Sbjct: 147 RAVLQEALDNVK---PVQNTSLPSAIVTGVRLLPGRKEVQPPKELQPQNPQNPQPQNPLV 203 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-----IKICDKAKENFIKIVTISIN-- 370 IP ++ ++DG N + A++N +KI ++ Sbjct: 204 QPDTPPIPREFPPGSLLVISDGATNVNSNPRLPNQTALEAAAKFAQDNGVKIYAFAVGKE 263 Query: 371 -------------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM----VHRK 413 + L + + + ++L V++ + ++ + Sbjct: 264 GGAVVRIEGQDYFVPFEPRSLQQLAERTGGKYVYPPTEEALRAVYRELGTVIRWEATKLE 323 Query: 414 YSVILKG 420 S +L G Sbjct: 324 VSSLLSG 330 >gi|218670888|ref|ZP_03520559.1| hypothetical protein RetlG_04158 [Rhizobium etli GR56] Length = 133 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 43/137 (31%), Gaps = 8/137 (5%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + I +II L + LL +G I + + + +++A ++A+LA A+K Sbjct: 2 SFLSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKY 61 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S + + + + + + + + + Sbjct: 62 KSGMPEATIAKTINAFLSAN---GEFETAVAGKPEVASDESELCLDVADA-----VPTTF 113 Query: 129 MTHMANNRLDSSNNTIF 145 M + S + Sbjct: 114 MKLANIQSVPISIRSCA 130 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 74/231 (32%), Gaps = 49/231 (21%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + +V+DLSGSM + D+ T++ A+K F Sbjct: 83 VSALKVSGRDLAIVLDLSGSMVRDDFN----------LDDRAVTRLEAVKAVGADFARR- 131 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + L+ + + ++ TE V + + + + T + + A Sbjct: 132 ------RAGDRLALVVFGSEAYFASPFTFDTESVARRIEEATIGISGRATSISDGLGLAL 185 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L++ + +I L+DG NN N + + A + Sbjct: 186 KRLSTSTAT------------------SRVVILLSDGINNAGA--TNPRGVAELAARYGV 225 Query: 363 KIVTISINASP------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ TI++ + L S + V + L+ V Sbjct: 226 RVHTIALGPKDLTTAEVGERGVVDAATLRAISQISGGESFRVRTTEDLVAV 276 >gi|149559056|ref|XP_001512734.1| PREDICTED: similar to collagen, type XXI, alpha 1, partial [Ornithorhynchus anatinus] Length = 225 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 59/179 (32%), Gaps = 26/179 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDS 294 ++ + + +G++ Y+ I + + + + T + Sbjct: 70 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGS-HDSLENLIQAMESIQYLGGNTRT 128 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A L + +R K + LTDG++ + Sbjct: 129 GKAIQFALDHLFAKSQR----------------FLTKIAVVLTDGKSQD-----EVKDAA 167 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 + A+++ I + I + S L+ + P Y + V + ++ + + I Q + Sbjct: 168 EAARDSKITMFAIGVG-SETEDAELRAIANKPSYTYVFYVEDYIAISKIREVIKQKLCE 225 >gi|149638912|ref|XP_001511941.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ornithorhynchus anatinus] Length = 3176 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 206 NNFKYILDFIAALVSAFDVGEEKTRVGVVQYSSDTRTEFNLNQYYQRNELLSAIKKIPYK 265 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + +D + F K I +TDG++ + Sbjct: 266 GGNTMTGDAIDYLIKNTFTDAAG-------------ARIGFPKVAIVITDGKSQD----- 307 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 308 EVEIPARELRNRGVEVFSLGIKAAD--AKELKQIASTPSLQHVFNVANFDAIVDIQNEII 365 Query: 407 QLMVH 411 + Sbjct: 366 SQVCS 370 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 54/173 (31%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ T KV + T++ AM Sbjct: 519 AKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDIIEAINTFPYRGGSTNTGKAMTY 577 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K S K +I +TDG KS+ + K + + Sbjct: 578 VREKIFVASKGS-------------RSNVPKVMILITDG-----KSSDAFKEPAIKLRNS 619 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+T S P + Y V + FQ IS + Sbjct: 620 DVEIFAVGVKDAVRSE--LETIASPPAETHVYTVEDF----DAFQRISFELTQ 666 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 17/186 (9%), Positives = 55/186 (29%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + Y+ + + + +K + Sbjct: 2392 DNFNKVVKFIFNTVGAFDLINPAGIQVSFVQYSDEAKSEFKLNTYADKAQALGALQNIRY 2451 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2452 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKAVPKVLVVVTDGRSQD---- 2494 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2495 -EVRKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2547 Query: 406 SQLMVH 411 ++ Sbjct: 2548 EDNLIT 2553 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 76/235 (32%), Gaps = 48/235 (20%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +++ + + L +D+S SM R+++ A+K+ F+ Sbjct: 88 VVKQTVAHSRGIDVILALDISESMQLKDAGG--------------RSRLDAVKSVAREFV 133 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAM 298 + +G++ + + + + D + + T A+ Sbjct: 134 TR-------HSNDRIGVVVFKGKGYTLSPLTLDHRVTGMLIDNVSPDVIRDEGTAVGTAV 186 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + + QK II +DG +N + + + A Sbjct: 187 LIAVNRLRASQSD------------------QKVIILFSDGVSNAGEID--PVTAASFAA 226 Query: 359 ENFIKIVTISINASP------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I+I T ++ + L + +++ ++ + SL F++I + Sbjct: 227 AQGIRIYTAGAGSASSASSALDEGELRRVALTAGGRYFRAGTSASLAEAFESIDR 281 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 76/249 (30%), Gaps = 52/249 (20%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 K++ + P + +VD+SGSM K+ Sbjct: 176 AKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMTDPD-------------------KLPL 216 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K L + + + + LI Y + + + P+ G +K + Sbjct: 217 VKKTLRILTEQLRPQD------KVTLITYASGEQLVLPPTSGKDKD-TILRALNALHAGG 269 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ AY+ ++ II TDG+ N S+ T Sbjct: 270 ATSGERALRMAYEQAEKAYVKNGIN----------------RIILATDGDFNVGVSDTET 313 Query: 351 -IKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNA--------DSLIH 400 + + +++ I + T+ + +++ + + + + L Sbjct: 314 LKSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIADAGDGNYSYIDSEKEARKVLRHQLTS 373 Query: 401 VFQNISQLM 409 ++Q + Sbjct: 374 TLATVAQDV 382 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 73/213 (34%), Gaps = 35/213 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +T+M A K A+ F +S+ ++V VY + E + S G Sbjct: 179 DASGSMAAKSNGKTRMDAAKEAIQAFAESLPEQANVALRVYGHK---GSGKESDKTLSCG 235 Query: 273 TEKV---------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + ++ + T ++++A + L+ K Sbjct: 236 SSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLS----------------K 279 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKT 381 +P I ++DG + + ++ + ++ I I I GQ+ LK Sbjct: 280 LPGDKNTNMIFLVSDGIETC---DGDPVEAAKQLAQSEITPIINVIGFGVDGPGQQQLKE 336 Query: 382 CV-SSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + L F + + ++ Sbjct: 337 VAKAAGGRYVLIQDQKELQDEFNR-GKEIANKW 368 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 58/178 (32%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + ++ +G++ Y+ +I + + T++ Sbjct: 62 KWLVNITMSFNIGQKFTQVGVVQYSDDPFLHIPLGKHFSSSDLIKAMESIEYMGGNTNTG 121 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K A L + + K + LTDG++ + + + Sbjct: 122 RAIKFANDKLFALSE-------------RGPNGIAKIAVVLTDGKSQD-----EVLAAAE 163 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 A++ I + I + + +L + ++V + ++ + + I Q + Sbjct: 164 AARKKGIILFAIGVGSETEEAQLRAIANKPSSTYVFSVKDYKAIAKIREVIRQKLCEE 221 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 69.5 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 68/245 (27%), Gaps = 39/245 (15%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +RP + ++D SGSM + + K+ K A+ Sbjct: 398 DVRNRQQRPDIALVFIIDKSGSMDACHCNGG----DMAAREGGGTRKIDIAKEAVAQAAA 453 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +G++ + IE + V T+ M Sbjct: 454 VLGKDD------KLGVVTFDDSAHWTIELDKVPSQD-DVVAALAPVPPSGQTNVVSGMNA 506 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY+ L + K I LTDG + I + ++ Sbjct: 507 AYEQLRQSDAKI------------------KHAILLTDGWGHATDIGS----IAENMNKD 544 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-----QNISQLMVHRKYS 415 I + ++ + L + +Y + + +F Q + +V +++ Sbjct: 545 GITLSVVAAGNGSD-NALQRYAELGGGRYYPARVMEEVPQIFLQETIQAVGTYIVEEQFT 603 Query: 416 VILKG 420 G Sbjct: 604 PAYAG 608 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 73/221 (33%), Gaps = 39/221 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D K +++ A + A + F+D + V +G++ + + + PS Sbjct: 104 DISKSMAATDVKPSRVDAARAAAIKFVDGMAPT------VQLGVVTFAGSAQPLVRPSTD 157 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E ++ + + + + K T + + A Q + + + Sbjct: 158 HETAKKVIDQMIRADKLEKQTATGEGIYTALQQIETIAGALGGK----------NHTPPA 207 Query: 332 FIIFLTDGENNNFKSNVNTI---KICDKAKENFIKIVTISIN--------------ASPN 374 I+ ++DG+ AKE I + T++ + Sbjct: 208 RIVLVSDGKETVPDDLNAPRGAYAAARTAKEKHIPVCTVAFGTKSGKITIDNQVDEVPVD 267 Query: 375 GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQLMV 410 L K S + + L ++Q++++ + Sbjct: 268 LDSLKKISDLSNSPGNSCRFFPAESQAELAQIYQSLNEDIG 308 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 54/406 (13%), Positives = 114/406 (28%), Gaps = 75/406 (18%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--TEIQNIVNSSRISMT 130 S + S A + I + S + + T+ NS + + Sbjct: 203 SEATSSEINSSRVVVGASNLEYLSGISAFSSINYSNIAPEYTTDATKGTFSTNSWQPAGQ 262 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE------ 184 N + V + + Q + Sbjct: 263 TNVINHQGNKERNFATWDGVTSWNGDPSDTTHSYIQYGEKQTDFAIRKYAKETSEPGLYD 322 Query: 185 ---------MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + I LVVD+SGSM + ++ D + A +N + Sbjct: 323 VYLNVKGNEQQDIKPVDIVLVVDMSGSMESSQSNGWND-------------RAGAARNGV 369 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRDMDSLI-- 288 FL +I + V +GL+G+++ + G ++ D+L Sbjct: 370 KNFLQTIKDAGIG-DYVNVGLVGFSSPGYVTGPNGYLTVPIGKASDTSHINAINDALKPK 428 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI------------PSLPFQKFIIF 335 T + ++Q Q+L + GV + KF Sbjct: 429 FTGGTYTQIGIEQGQQMLAGSSNENKMMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNT 488 Query: 336 LTDGENNNFKSNVN-----------------TIKICDKAKENFIKIVTISINASPNGQRL 378 L + N + ++ T+ AK+ + I T+ I S +G L Sbjct: 489 LDEPRNTSKLNSSYEVGSWGNRTNINSTWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFL 548 Query: 379 LKT-------CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +++P + + + + ++ +V +++ Sbjct: 549 TEQQVRDRASLIATPGKYKDAETTNDVSDYLNEQAKNVVKSFNTIV 594 >gi|302527162|ref|ZP_07279504.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436057|gb|EFL07873.1| von Willebrand factor [Streptomyces sp. AA4] Length = 326 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 78/256 (30%), Gaps = 52/256 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 VS ++ R + LV+D+S SM T++ A Sbjct: 71 LTVSLAGPTAEQKV-PRNRATVMLVIDVSLSMEATDV---------------APTRLKAA 114 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++A F ++ V +GLI + + P+ V + + + Sbjct: 115 QDAAKQFAQNMTPG------VNLGLISFAGTATVLVNPTTDRAGVTKAIDNLKLAQ---S 165 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A Q + S F V P I+ ++DG+ + Sbjct: 166 TATGEGIYAAMQSIQS----------FSAVVGGADGPPPARIVLMSDGKQTVPEDLYAPR 215 Query: 352 KICDKAKENF---IKIVTISINASP--------------NGQRLLKTCVSSPEYHYNVVN 394 A+ + I +IS + + L + S Y + Sbjct: 216 GAYTAAQAAKQAQMPISSISFGTEHGSVDIEGKQQDVRVDDESLREIARLSGGEFYKAAS 275 Query: 395 ADSLIHVFQNISQLMV 410 AD L V+ ++ + + Sbjct: 276 ADELKRVYADLGEQIG 291 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 58/183 (31%), Gaps = 23/183 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ I + V G++ Y+ + + + Sbjct: 636 HDDFAEMKTFMIELISTFRVGADHVRFGVVQYSDSPTVEFDIRQHSSVAQLKSAITKIWQ 695 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ ++ + + +F+I +TDG++ + + Sbjct: 696 TGGGTRTGEALTFMKRLFSE----------------VARDKVLRFLIVITDGQSQDQVA- 738 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++ ++ I I I + ++ + L S + V + DSL + Q + Q Sbjct: 739 ----QAAEELRQENITIYAIGVKSAVTKELL--EISGSQNRMFFVNDFDSLKPIQQEVIQ 792 Query: 408 LMV 410 + Sbjct: 793 DIC 795 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 64/181 (35%), Gaps = 24/181 (13%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLIL 289 A+ F+ I + + K+ V +G+ Y+T +K + ++ Q++ + Sbjct: 1203 TAMKTFMKQIVNSFTIGKDRVRIGVAQYSTNPQKEFYLNTFYSGAEINQHIDKITQLRT- 1261 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + ++ + + ++ +TDG +N+ Sbjct: 1262 -QTYTGKGLRFVKSFFEPANGSRKNLH------------VLQSLVVITDGMSNDS----- 1303 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 ++ + + I+I +I I + L+ + + + V + L + + + + + Sbjct: 1304 VVEAANDLRNEKIQIFSIGIGVINLFE--LQLIAGNVKRVFVVGDFGQLGSIERKVVREL 1361 Query: 410 V 410 Sbjct: 1362 C 1362 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 20/168 (11%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ V +GL + + K + K T + A+ Sbjct: 48 EIGRDQVRVGLAQFNDNIYKAFLLNQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEF 107 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 F + +I +TDGE+N+ + K K + I Sbjct: 108 -----------FTESAGSRAKDNVPQIVILVTDGESND-----EVAEAASKLKGQGVSIY 151 Query: 366 TISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + IN + LKT S P ++ +++ + + L + NI + Sbjct: 152 VVGINVQDVQE--LKTIASKPLEKFLFSIEDFNILEGLSGNILPTLCS 197 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 67/242 (27%), Gaps = 39/242 (16%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L+ ++ + + I I + G + Sbjct: 765 LEISGS------QNRMFFVNDFDSLKPIQQEVIQDICFLEVCKGMKADILF--------L 810 Query: 218 PICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 ++ T+ +K + ++ E V +GL+ +++ ++ + KV Sbjct: 811 VDGSERINTRDFDKMKE---FMMQMVNKSDLGPEKVQIGLLQFSSNPQEEFRLNTYYSKV 867 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 I A+ + + +++I + Sbjct: 868 DILRAITGMVQIRAGARVGSALSFSLPYFERSRGGRLN--------------VPQYLIII 913 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 G+ + ++ IKI I + N +LL+ + + +Y N D Sbjct: 914 ISGKTGDAVKMP-----AKALRDKGIKIFAIGV-HKANNSQLLEITGAQDKVYYE-ENFD 966 Query: 397 SL 398 SL Sbjct: 967 SL 968 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 57/187 (30%), Gaps = 19/187 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A+ L I L V +GL+ Y+ + + + Sbjct: 428 LENFQLAVELLRKIIHTLIIGPNKVRVGLVLYSDEPRLEFGLNTFLSQSEILSHLNKLPF 487 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I T + A+ + + +K S + Q+ + +T+G + + Sbjct: 488 IGGKTKTGAALDFLRNTVFTQQKGSRY-----------RQGVQQLAVVITEGYSQDEVDR 536 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + + + + +G R L S P ++ + + L + + I Sbjct: 537 P-----ASLLRRAGVTVFAVG-TLKASGSRDLNKIASHPPRKHAIYLESFLQLSVITEKI 590 Query: 406 SQLMVHR 412 + + Sbjct: 591 KKRVCTE 597 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 55/171 (32%), Gaps = 15/171 (8%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ K++V G I Y+ E K + + + A+K A Sbjct: 1020 INDSLVGKDNVRFGAISYSDNSEVLFSLDTYITKAQIRDAVFHLKPKVGKAHTATALKFA 1079 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + SL + ++ +T+ + + + +E Sbjct: 1080 KERFS------------EMHGGRQSLAVTQILVLITN-KPTESEEKKYLQESAQTLQEAG 1126 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I + I I + L+ + + V + + L ++ + ++ ++ + Sbjct: 1127 IDVFAIGIKNVKRPE--LQAITKHRDRSFMVQSYNELYNLHEKVTHIICNE 1175 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 55/170 (32%), Gaps = 21/170 (12%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 N + F+ ID L ++ +GL Y + + V + L Sbjct: 229 NQMKDFICRVIDTLEVGRDKDQIGLAQYGNQGHVEFLLNAYQNPVEMISHIQQNFLPRGG 288 Query: 292 TD-STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + ++ + ++ S F ++ + +T G +S Sbjct: 289 ARKTGNGLQYIQETFFQEEAGSRFLQGI-----------PQYAVVITSG-----QSEDLV 332 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ K KE +KI+ + I + + LK + P + + D + Sbjct: 333 LEKAQKLKERGVKIMVVGIQDFDSRE--LKAMATPP-LVFEIEGQDGIRQ 379 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 48/166 (28%), Gaps = 10/166 (6%) Query: 15 IASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + E + +II AL + S L G I + KN +++ +AA LAGA ++ N Sbjct: 11 LRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNGP 70 Query: 74 RL---GDRFESISNHAKRALIDDAKRFIKN---HIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + +N + IK S + N +S Sbjct: 71 ANVKSEAKEAAENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSVKIMVNNHSINSFF 130 Query: 128 SMTHMANNRLDSSNNTIFYNMDVM---TSYDYRLQFIEHLLNQRYN 170 + + + + + + + Sbjct: 131 ASALNFQQTVSAEATAVIGPLSGKRHLIPIAVTEDEADTMSEAGEG 176 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 74/231 (32%), Gaps = 42/231 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +VVDLSGSM + ++ +++ A K L F+ ++ Sbjct: 97 DVLMVVDLSGSMEEKDFATES---------GEQLSRLTAAKKVLRDFVTQ-------RQG 140 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI + + E + + + T+ A+ ++ Sbjct: 141 DRFGLILFGDAAFIQTPFTADQEVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSPAT 200 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + ++ LTDG + S V+ + A I+I I++ Sbjct: 201 SQD----------------QIMLVLTDGNDTG--SFVSPVDAAKIAAAKGIRIYVIAMGD 242 Query: 372 SP-------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + ++ L +Q I + +KYS Sbjct: 243 PENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVI-DQLEPQKYS 292 >gi|163760702|ref|ZP_02167782.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] gi|162282024|gb|EDQ32315.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] Length = 668 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/264 (13%), Positives = 83/264 (31%), Gaps = 45/264 (17%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y +R N +++ ++ RP + L+VD+SGSM Sbjct: 260 YPAPESVETPFRATVTVTPTPWNANTRLLHIGVKGYDVKPAARPQANLVLLVDVSGSMQE 319 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ LK+A L + ++ + ++ Y Sbjct: 320 TD-------------------KLPLLKSAFRLLIQKLEPED------TVSIVTYAGDAGT 354 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 +EP+ ++K + + D T +++AY++ + Sbjct: 355 VLEPTPASDKAK-ILDALDDLRPGGSTAGAAGIEEAYRLAEKARVNGGVN---------- 403 Query: 326 SLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 ++ TDG+ N + + ++ +E+ + + +L++T Sbjct: 404 ------RVLLATDGDFNVGASDDDALKSLIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQ 457 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 + D+L + ++Q Sbjct: 458 NGNGV--AAYIDTLAEAEKTLAQE 479 >gi|260808371|ref|XP_002598981.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] gi|229284256|gb|EEN54993.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] Length = 193 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 54/173 (31%), Gaps = 15/173 (8%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D D L+ +GLI YT + + T K + S T + A+ Sbjct: 30 VDLSDGLNISPTATRVGLIEYTDSPTVEFKLADHTNKASLATAINNVSYQSGGTQTGRAL 89 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + + F I +TDG + + N + + Sbjct: 90 DAARTQMDWRQPPVPNVCFSLLQAA----------IVVTDGMSGD-----NVQQPAKALR 134 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +N I + I + N L + + + + N D L + IS + Sbjct: 135 DNDISAYGVGIGPAINANELNEIAGGDAGHVFYIPNYDKLEKEMEKISNSVCS 187 >gi|260892924|ref|YP_003239021.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] gi|260865065|gb|ACX52171.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] Length = 298 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 36/131 (27%), Gaps = 12/131 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 SE+ +++ A ++ L L ++ + + +A +AA LAG + + S Sbjct: 3 WKSERGTVAVLVAAALTFLLGLAALVVDGGGLLLARERLVNAVDAAALAGVQFLPGDPSG 62 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 D R + + ++ + + Sbjct: 63 AVQTAL------------DYARLNGADPAQVTAEVEPDGRTLAVRADRSVPFFLARVLGL 110 Query: 135 NRLDSSNNTIF 145 + + Sbjct: 111 EKGEVKAQAKA 121 >gi|126310280|ref|XP_001371684.1| PREDICTED: similar to collagen XXI [Monodelphis domestica] Length = 957 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + + +G++ Y+ I + + T + Sbjct: 58 KWLVNISNNFDIGPKFIQVGVVQYSDYPVLEIPLGSHHSGENLMEAMESIQYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDHLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A++N I + I + + L + + V + ++ + + I Q + Sbjct: 157 AARDNRITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVIKQKLCEESV 216 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/325 (11%), Positives = 83/325 (25%), Gaps = 54/325 (16%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 D N D F + + S S + + M + + Sbjct: 167 TEDYSPVNENRFHTVASDPLSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFD 226 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + + + ++V + P + Sbjct: 227 YKYSNPTGNTPVAVRTDMAICPW----------NTAHQLVRIALKGKDVAKDNLPPSNLV 276 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 ++D+SGSM K+ +K A L ++ + + + Sbjct: 277 FLIDVSGSM-------------------SDAKKLPLVKQAFKLLVNQLRPVD------RV 311 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ Y + + G K + T ++ AY+ T +S Sbjct: 312 AIVVYAGAAGLVLPSTSGDHKT-AILDALDKLEAGGSTAGGEGVQLAYKTATEYLLKSGN 370 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASP 373 +I TDG+ N S+ +I +K +E I + + Sbjct: 371 N----------------RVIIATDGDFNVGPSSDGELQRIIEKKREKGIFLSVLGFGMGN 414 Query: 374 NGQRLLKTCVSSP-EYHYNVVNADS 397 L+ + + N + Sbjct: 415 YKDNKLELLADKGNGNYAYIDNFEE 439 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 69/235 (29%), Gaps = 46/235 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 P + E+ I I++ +D SGSM + A+ Sbjct: 73 AGPKIKGKPEDEKIISNIQICLDSSGSMRADFGGK---------------NRYEVAMQAV 117 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL---KPT 292 F + +E GL + T + + T + + T Sbjct: 118 KEFTEY-------REGDAFGLTVFGTEYINWVPVTKDTSAIALATPFLAPDRMSKWFGGT 170 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A++ + Q L + + II ++DG + + T+ Sbjct: 171 NIAKALRGSQQQLLQQEDGD------------------RMIILVSDGVSGSPND---TVD 209 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + N I I I + + + + V N +L F+ I + Sbjct: 210 MAQELRNNKIVAYCIYIGSGNGSPEMNALAAITGGQVFGVNNPKALDETFRFIDK 264 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 73/236 (30%), Gaps = 43/236 (18%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + I LV+D SGSM P + + A K A Sbjct: 344 IEGFSWVNLSRVQDPLNIMLVIDTSGSM-------------GPSKEGLTDGGLDAAKIAA 390 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L F+D + + +GLI + T V + + VRQ ++ T Sbjct: 391 LDFIDHL------PSNANVGLIHFGTLVTVDHSLTNDIGAVRQSISELK---PEGQTAIY 441 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +Y L Q FI+ ++DG + K + + Sbjct: 442 DALAISYTQL-------------------RRAKGQTFIVLISDGADTASKGDNYDSIVAK 482 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMV 410 K I I + + +LL+ + Y + + L + ++Q + Sbjct: 483 ATKA-NIPTYIIGLTSPEFDGQLLEDLQRDTKAMIYQTPSKEQLGGFYTEVAQEVS 537 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 40/105 (38%), Gaps = 2/105 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV--SNLSRL 75 + F I FAL ++ L +G + + K +++A ++ LA A ++ S+ Sbjct: 10 QHGAFLITFALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAARELNGQSDAITR 69 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + ++ A + A + + + + Y T + Sbjct: 70 AQNAGMAAGNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSD 114 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 74/251 (29%), Gaps = 36/251 (14%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + P VVD S SM+ Sbjct: 27 LFLSAPLTAGANGTAEAGVDAVFVVDTSNSMNKTDPGKT-------------------AA 67 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 + +F+D +G + Y R V+ S + R+ + R + L Sbjct: 68 EVMSMFIDM-----SEATRTRIGFVAYNDRIVQAQSPASMAEARNREQLKRTIQGLRYSG 122 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 +D +++ +++ K + + G + SN + Sbjct: 123 YSDLGLGLRRGAEMIEKAKDPARKPFLILLSDGGTD-------LRQNAGGRSVAASNKDV 175 Query: 351 IKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNADSLIHVFQNI-SQ 407 + KAK I TI +N + ++L K ++ + + D L +F I ++ Sbjct: 176 ETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAEATGGTSFVTQSTDDLPEIFNQIFAK 235 Query: 408 LMVHRKYSVIL 418 + + SV Sbjct: 236 HIQSQLVSVAA 246 >gi|113476847|ref|YP_722908.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167895|gb|ABG52435.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 477 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 20/197 (10%), Positives = 60/197 (30%), Gaps = 34/197 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ ++ A F++ + ++ +++ + +++Q Sbjct: 54 TSSSMWGGKLPEVQAAATGFVER-----QNLTVNNLAIVEFSSNSQVLTNFDADKTELKQ 108 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ + +K +L I+ TD Sbjct: 109 AIANLT---PSGGTNLSQGLKTVASLL--------------------RNSNTPNILLFTD 145 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G+ N+ +++ I + +E I +VT+ L + +P+ + + + Sbjct: 146 GQPNDPRAS---KSIAREIREAGINLVTVG--TGDANSNYLTSLTENPDLVFFANSG-EI 199 Query: 399 IHVFQNISQLMVHRKYS 415 F+ + + + Sbjct: 200 DQAFRAAEKAISQLSDT 216 >gi|332244061|ref|XP_003271190.1| PREDICTED: collagen alpha-1(XII) chain [Nomascus leucogenys] Length = 3100 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPNKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAPALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTVNL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|297678514|ref|XP_002817115.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 1 [Pongo abelii] Length = 3115 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2334 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2393 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2394 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2436 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2437 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2489 Query: 406 SQLMVH 411 ++ Sbjct: 2490 EDNLIT 2495 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1213 RTVRSFISRIVEVFDIGPKRVQVALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1272 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1273 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1314 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1315 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1372 Query: 410 VH 411 + Sbjct: 1373 CN 1374 >gi|297291177|ref|XP_001109727.2| PREDICTED: collagen alpha-1(XII) chain-like [Macaca mulatta] Length = 3095 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2314 DNFNKVVKFVFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2373 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2374 RGGNTRTGKAL-----TFIKEKILTWESGM--------RKNVPKVLVVVTDGRSQD---- 2416 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K+ +++ + + + + L S P + + V + +S F+ I Sbjct: 2417 -EVKKVALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2469 Query: 406 SQLMVH 411 ++ Sbjct: 2470 EDNLIT 2475 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 56/148 (37%), Gaps = 15/148 (10%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMK------QAYQILTSDKKRSFFTNFFRQGVKIP 325 VR +++R ++ + P A+ + L + + + Sbjct: 1214 NFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEGQLNAHRDKKSXXXXXXXTQAGM 1273 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 +K + +TDG++ + K K+ +++ I I + + LK + Sbjct: 1274 RPRARKIGVLITDGKSQDD-----VEAPSKKLKDEGVELFAIGIKNADEVE--LKMIATD 1326 Query: 386 PE--YHYNVVNADSLIHVFQNISQLMVH 411 P+ + YNV + +SL + +++ + + Sbjct: 1327 PDDTHAYNVADFESLSRIVDDLTINLCN 1354 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 65/195 (33%), Gaps = 29/195 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ K A ++ + +G I + R + IE + + Sbjct: 376 SGSMSGSKLELAKEAAARSVEMLRDED------TLGFIAFDDRPWEIIE-TGPLNNKEEA 428 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ ++ +AY+ L +K II LTDG Sbjct: 429 VDTILSVTPGGGTEIYGSLAKAYENLADM------------------KLQRKHIILLTDG 470 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ + + ++ K+N I + T++I + L YNV++ ++ Sbjct: 471 QSQPGNYD----DLIEQGKDNGITLSTVAIGQDADANLLEALSEMGSGRFYNVIDEQTIP 526 Query: 400 HVFQNISQLMVHRKY 414 + + ++ Sbjct: 527 SILSRETAMISRTYI 541 >gi|119569135|gb|EAW48750.1| collagen, type XII, alpha 1, isoform CRA_c [Homo sapiens] Length = 3063 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|114608138|ref|XP_518589.2| PREDICTED: collagen, type XII, alpha 1 isoform 3 [Pan troglodytes] Length = 3119 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|114608140|ref|XP_001142912.1| PREDICTED: collagen alpha-1(XII) chain isoform 2 [Pan troglodytes] Length = 3063 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|93141047|ref|NP_004361.3| collagen alpha-1(XII) chain long isoform precursor [Homo sapiens] gi|146345397|sp|Q99715|COCA1_HUMAN RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor gi|55662663|emb|CAH71310.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203512|emb|CAI19898.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203521|emb|CAI19908.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 3063 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|1846005|gb|AAC51244.1| collagen type XII alpha-1 [Homo sapiens] Length = 3063 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2337 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2396 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2397 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2439 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2440 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2492 Query: 406 SQLMVH 411 ++ Sbjct: 2493 EDNLIT 2498 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHDYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAASQIANPATARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 96/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A + + R++ ++ + + + Sbjct: 74 QQYSDKQTLQGRLQEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 133 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 134 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 193 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 194 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 233 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 234 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 280 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 281 AEIN-AAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILL 323 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T+ + S + ++ + + + Sbjct: 324 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYID 383 Query: 394 NADSLIHVFQN 404 V + Sbjct: 384 TLSEAQKVLNS 394 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 42/162 (25%), Gaps = 26/162 (16%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ + + + + + TD + A S+ Sbjct: 532 DVDRAAVVDFDSSAYVAQDLTSDFGAANSTLDNL---GSGGGTDIGSGLSTANSQFASNS 588 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + +I LTDG N I A + T+ Sbjct: 589 ----------------NDSRAQVMILLTDGRGNGG------ISEAQTAANQNTTVYTVGF 626 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + N +L + V + L +VF I++ Sbjct: 627 -DNANRDKLRDIANITDGEFNYVTDRSELPNVFSRIAENTTE 667 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 64/198 (32%), Gaps = 29/198 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ +K+ L D + + L+ + + + +++ + E R Sbjct: 58 SGSMAGSKIEMVKSTLAFMFDQLKPTD------RIALVEFDSNISTSLQFTNMNESGRSK 111 Query: 280 VTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ T+ + A+ + +++ + T+ ++ TD Sbjct: 112 AKQVVSNIRAGSCTNLSGALFEGLRLIGQRTNANEVTS----------------LLLFTD 155 Query: 339 GENNNF--KSNVNTIKICDKAKEN---FIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 G N +N K+ E + + T + L + +Y + Sbjct: 156 GLANEGITNTNEIVKKMTTMIHEEIRTNLTVFTFGFGTDTDANMLTSISQAGNGLYYFLQ 215 Query: 394 NADSLIHVFQN-ISQLMV 410 D + F N I L+ Sbjct: 216 TTDDIPKAFGNVIGGLIS 233 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 14/141 (9%), Positives = 44/141 (31%), Gaps = 6/141 (4%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ +++ +++ FL + + + K +++A ++A LA A+ + ++ + Sbjct: 10 KKQQGLVAVMITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAATILDNSKDKD 69 Query: 76 GDRFESISNHAKRALI------DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 E + A D + I ++ ++ R+ Sbjct: 70 AVDAEVGTALNAMAASTGNQEIDFSTASISIDYSNDPKDFTGTATFDSTDDVYVRVRVDA 129 Query: 130 THMANNRLDSSNNTIFYNMDV 150 M + + Sbjct: 130 LEMDEFFIQLFGLEKVVSASA 150 >gi|328951281|ref|YP_004368616.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451605|gb|AEB12506.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 328 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 67/204 (32%), Gaps = 27/204 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + I D+ +++ A + + FLD + +GL+ ++ + P+ Sbjct: 93 DTSKSMIAVDQSPSRLEAARAIVRTFLDRV------PRGARVGLVSFSAYASVLVLPTAR 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 +VR+ + T A+ A + L ++ Sbjct: 147 HVEVRKALEALEPQEA---TSLGAAILAAVRALPGRERAGEELLGRDPVPPELQELPPAT 203 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQR 377 ++ ++DG + + ++ A+ + ++I T+ + + + Sbjct: 204 VLLISDGVSTSGLD---PLEAARVARAHQVRIYTVGVGSPRGSIQEVDGQLSFIPFDPSG 260 Query: 378 LLKTCVSSPEYHYNVVNADSLIHV 401 L + + + D+L V Sbjct: 261 LEQIAALTGGRYVYPPTPDALEAV 284 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 42/346 (12%), Positives = 100/346 (28%), Gaps = 63/346 (18%) Query: 73 SRLGDRFESISNHAKRALID--DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + E+ S ++ + + + Sbjct: 273 TEEDAYLENNSLYSITEVAGNPEVLVVNGGEPSPIPALLDTGNLRVTELTSTQLPSSLSA 332 Query: 131 HMANNRL---DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN---------QKIVSFIP 178 + + + + S I + R + ++ I +P Sbjct: 333 MLRYDSIIFDNVSGTAIGEQQMAIIEQAVRQFGVGFMMVGGDQSFGLGGYFKTPIERLLP 392 Query: 179 ALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ++ P + +V+D SGSM KM K A Sbjct: 393 VEMEVKGKHELPSLGLMIVMDRSGSMM--------------------GLKMELAKEAAAR 432 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + +G+I + + + + + + ++ + + T+ + Sbjct: 433 SVELLRSDD------TLGVIAFDDQPWEILP-TGKVDDPKKAADKILSITPGGGTEIYRS 485 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++QAY L +K II LTDG+++ I + Sbjct: 486 LEQAYTELEDL------------------ELQRKHIILLTDGQSSTSNDYDALI---ENG 524 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 K++ I + T+SI + L + + Y+V +A ++ + Sbjct: 525 KDHNITLSTVSIGQDADRNLLEQLAGTGSGRFYDVTDATTIPAILS 570 >gi|320589835|gb|EFX02291.1| von willebrand factor type a domain containing protein [Grosmannia clavigera kw1407] Length = 735 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 69/212 (32%), Gaps = 26/212 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D+SGSM P + T + +K+A L++++ Sbjct: 96 SIVLVIDVSGSMQED---APVPATKGEPMESNGLTVLDLVKHAARTILETLNEHDC---- 148 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++ + + T+ + + + +D L T+ + +I +S + Sbjct: 149 --LGIVTFSEDANVLLMLTPMTQVNKAKALQVILDLEPLTVTNLWKGLTAGIEIFSSKAQ 206 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S I+ LTDG N I + I T Sbjct: 207 FSSV----------------PSIMLLTDGLPNFMHPPQGYIPKLRTFGKLPAPIHTFGFG 250 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + + + +A L VF Sbjct: 251 YNLRSGLLKSISELTGGNYAFISDAGMLGTVF 282 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/251 (13%), Positives = 68/251 (27%), Gaps = 46/251 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + + K+V + + ++D+SGSM+ A Sbjct: 214 VAIVAETTDS--PWNPGLKLVHIGLQAKTVSAENLSASNLVFLIDVSGSMNEA------- 264 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 K+ LK A L D + + + ++ Y + P+ G+ Sbjct: 265 ------------NKLPLLKQAFKLLADQLRVED------KISIVAYAGSAGMVLAPTSGS 306 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 EK + T ++ AY + + Sbjct: 307 EK-KTIKDALDKLEAGGSTAGGEGIELAYDLAKKHFL----------------PKGNNRV 349 Query: 334 IFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQR-LLKTCVSSPEYHYN 391 I TDG+ N SN K+ ++ ++ I + + + + Sbjct: 350 ILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADKGNGNYAY 409 Query: 392 VVNADSLIHVF 402 + N VF Sbjct: 410 IDNIQEARKVF 420 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 65/236 (27%), Gaps = 54/236 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D +++A + A + + V +G++ + + P+ Sbjct: 93 DTSRSMEAADVPPNRISAAQQAARDLVVGLPAS------VRLGIVSFAGTAAVVLPPTDN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD------------------------ 308 + + + + T + + QA +L + Sbjct: 147 R---QDMLDAIERFQLQRGTATGSGLFQALAVLFPEDGIDLEVILFGSRSDRAGRGTSLD 203 Query: 309 ---KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + Q P +I L+DG + A + +++ Sbjct: 204 EAAAADAARRREQGQQAAQPGSYRHGAVILLSDGRRTTGPD---PLDAARMAAQRGVRVY 260 Query: 366 TISINASP---------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + + L + +Y+ +A L V++ +S Sbjct: 261 TVGFGSQQVTSAPESSLSYFMQLDEPALRAVASITGGEYYHAGSAADLSQVYRQLS 316 >gi|296198464|ref|XP_002746714.1| PREDICTED: collagen alpha-1(XXI) chain isoform 2 [Callithrix jacchus] Length = 954 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLINITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|296198462|ref|XP_002746713.1| PREDICTED: collagen alpha-1(XXI) chain isoform 1 [Callithrix jacchus] Length = 957 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLINITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 77/228 (33%), Gaps = 49/228 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + V+D+SGSM + ++ K AL +F++ + Sbjct: 376 QNKKRVDVMAVLDVSGSM-------------------EDEGRLEQAKAALRIFVEQLQDD 416 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 GL ++ + S + + + R T + +AYQ L Sbjct: 417 D------GFGLTIFSDQATVLTPISPIGSRRTEVLNRIAGLTPRGGTRLLDTVVEAYQEL 470 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----F 361 T+ + ++ LTDG +N + + + D +++ Sbjct: 471 TATPPGQRI----------------RAVVVLTDGLDNRSQRS--AEDVLDLLRQDREGYS 512 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD--SLIHVFQNISQ 407 IK+ TI+ + L + ++ Y + ++ ++Q+I+ Sbjct: 513 IKVFTIAFGGDADVHLLKEIASATGAKSYVGKPGERGAIERIYQDITT 560 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 71/251 (28%), Gaps = 49/251 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 QR VS I + L++D SGSM Sbjct: 22 QRQLSIAVSAIHDTGLNNGTRNAPLNLCLILDHSGSM--------------------TGR 61 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDM 284 + +K A +D ++ + ++ + + + E+++ + R Sbjct: 62 PLTTVKEAAQSLIDRLNPGD------RIAVVAFDHHAKVLVPNQLVEDPEQIKALIQRL- 114 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T MK + L K+ + F LTDGE N Sbjct: 115 --EPKGGTAIDDGMKLGIEELAVGKQGTISQAF-----------------LLTDGE-NEH 154 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N + + A I + T+ + N L S + ++ + VF + Sbjct: 155 GDNQRCQQFAELAAGYNITLNTLGFGSHWNEDVLEGIADSGGGSLSFIEKPENAVDVFNS 214 Query: 405 ISQLMVHRKYS 415 + + + Sbjct: 215 LFTRIETVSLT 225 >gi|296198593|ref|XP_002806760.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XII) chain-like [Callithrix jacchus] Length = 3113 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + ++ +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEDKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIVITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLTHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 80/255 (31%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 MLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKXFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T++ AM + + K K +I +TD Sbjct: 504 IEAINNFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNTHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQSFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 71/228 (31%), Gaps = 47/228 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ERP + ++D+SGSM + K+ LK AL+L +D Sbjct: 361 EERPASNLVFLLDVSGSM-------------------SQPNKLPLLKEALMLLTRRLDSR 401 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + ++ Y + + T++ ++ AYQ+ Sbjct: 402 D------RVAIVVYAGASGLVLPSTTA-NNTATIEHALTQLQAGGSTNAGAGIELAYQVA 454 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN--VNTIKICDKAKENFIK 363 +I TDG+ N ++N + D+AK+ + Sbjct: 455 REHFIEDGNN----------------RVILCTDGDFNVGQTNRGDLAQIVADQAKD-GVS 497 Query: 364 IVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVF-QNISQLM 409 + + +L+ + + V + VF Q+++ + Sbjct: 498 LTVLGFGMGNYKDNMLEELSNKGKGTYAYVDSEAEARKVFLQDLASNI 545 >gi|46143335|ref|ZP_00135441.2| COG4961: Flp pilus assembly protein TadG [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 520 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/507 (7%), Positives = 123/507 (24%), Gaps = 108/507 (21%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLG------DRFESISNHAKRALIDDAKRFIKNHIK------------------- 103 S E+ S + + + ++ Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDNQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 104 ----ESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---M 151 S+ + V + N ++D ++ + + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI------------------EMGERPIFLI 193 + N+KI++ I + Sbjct: 187 IDLMVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAVA 246 Query: 194 ELVVDLSGSMHCAMNS------------DPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + S + + + + + A + L+ + Sbjct: 247 PFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLSANNNISAKDFALSLSYLVDIDKT 306 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAM 298 I+ + + + ++ + + + ++ + + T ++ + Sbjct: 307 INSIGGTFSSNSIIFNKNKFCLGRSNKNTHHWYNRDESSNFFSFIKRLHAEGSTLASSGL 366 Query: 299 KQAYQ---------------------ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 A +L+ + + + + Sbjct: 367 ITASNIMLKEESRSKSLGEQTKRVILVLSDGNDELRLNDEGTPFTQYSRITENLLLGQEE 426 Query: 338 DGENNNFKSNVNT-------IKICDKAK--------ENFIKIVTISINASPNGQRLLKTC 382 + I +CD+ + + KIV + + ++ + C Sbjct: 427 QTTDTYPYFMSKPPKKLTSNINVCDRIRNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHC 486 Query: 383 VSSPEYHYNVVNADSLIHVFQN-ISQL 408 V +Y+ + SL++ F+ I + Sbjct: 487 VG-NGNYYSANDKASLLNSFKQAIGET 512 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 68/185 (36%), Gaps = 18/185 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVR 277 + + +K+ + F D + D+ +G+I + + V + I + ++ Sbjct: 42 SRSVRPKNFQTVKDYVKNFTDIFEAFGP--NDMQVGVIQFGSGVREEILLNQFYVRHELM 99 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T + A+++ T G ++ + ++ +T Sbjct: 100 EAIDNIRYMET--GTMTGLALRKLVT----------ETLTVEHGARVDNPIVHTVVVIIT 147 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD 396 DG++ ++ T +AK +I I I N + LL+ E + + V N + Sbjct: 148 DGKSQDYSRGGVTK-WTKEAKARGFEIFAIGIGRKANRKELLEMASEPKELHTFRVQNFN 206 Query: 397 SLIHV 401 ++ V Sbjct: 207 AIKRV 211 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 43/142 (30%), Gaps = 2/142 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ + + +++ L +I V +++A +A LAGA Sbjct: 2 LSRVVRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 KMV--SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 ++ + ++ A + +HI + + + + N Sbjct: 62 ELDGRDDAISRAQTAIEKIANSAAFSGGGAGMSLGSHISVTYDAGNDAGSTVTVFFLKNI 121 Query: 125 SRISMTHMANNRLDSSNNTIFY 146 T + ++ + + Y Sbjct: 122 PANDDTSIPDSMKTTVASEASY 143 >gi|291295619|ref|YP_003507017.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470578|gb|ADD27997.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 412 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 46/172 (26%), Gaps = 20/172 (11%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + ++ Y VE + + + T + + Sbjct: 85 DRVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTIRTGGSTALHAGWLEGATQV------ 138 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 ++ L+DG N ++N I + + T+ + Sbjct: 139 ----------AAYQEAGRLNRVVLLSDGLANRGETNPGVIAEQVRELARRGVSTSTLGVG 188 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQLMVHR--KYSVILK 419 N + + +Y + + L +F Q ++ L + + L+ Sbjct: 189 LDYNEDLMTTMADAGEGNYYFIESPADLPRIFAQELAGLAGTLGTRVRLWLR 240 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 47/146 (32%), Gaps = 5/146 (3%) Query: 11 FKKGIAS--EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS-- 66 ++ A EK S+I A+ +++ L + + V + ++ ++S +A+ +A A Sbjct: 1 MRRLGADNTEKGAVSVIVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKC 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN-SS 125 + + + A + +D + ++ S E + N S Sbjct: 61 ARDTANADCSTTSTLAGSLANQNSLDGMSNVYSIQLDKTARTVSVTTSAKETGSPDNSVS 120 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM 151 + + + Sbjct: 121 LFFAKAIGIPSKEVGAKASATWGNPS 146 >gi|311260227|ref|XP_003128387.1| PREDICTED: collagen alpha-1(XXI) chain-like [Sus scrofa] Length = 957 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + ++V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFHVEDYIAISKIREVMKQKLCEESV 216 >gi|293339632|gb|ADE44108.1| collagen type XXI alpha 1 [Sus scrofa] Length = 895 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + ++V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFHVEDYIAISKIREVMKQKLCEESV 216 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/247 (12%), Positives = 87/247 (35%), Gaps = 31/247 (12%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + I +G V ++ + + + P+++ D ++ Sbjct: 259 YVQASITESGTYVVGADKSVNEPAVTRVAFLIDNSGSMYPKELCPTSSENDVDFKRLDFT 318 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 ++ + F D +G+ +T K + + ++R+ + R + Sbjct: 319 QSLIDKFDDDF----------RIGISKFTGTYTKMCDFTDDRTELRKVLNRIRTEDEIFD 368 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+K+ ++ + I+ L+DGE++ + Sbjct: 369 GTYNQTALKKCINEFSAAGDG----------------KYVNIIVMLSDGESDEVDAETI- 411 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + A E + ++T+ + + L + S+ +Y+ +A SL V++ I + Sbjct: 412 ESLSNLANEKSVIVLTVGLGREIDRAWLQEVAYSTGGKYYSASDATSLDDVYKQI---VT 468 Query: 411 HRKYSVI 417 Y ++ Sbjct: 469 TLNYDIV 475 >gi|313225343|emb|CBY06817.1| unnamed protein product [Oikopleura dioica] Length = 321 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 59/185 (31%), Gaps = 26/185 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDS 294 + S + +G+ Y+ + + + V R T + Sbjct: 157 FLSALLKHFSIGHDASMIGIAQYSDNPRLEFGLNEHYDFPSLNAAVNRMK--YKGGNTAT 214 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A + + QK ++ +TDGE+ Sbjct: 215 GKALTFALDHVF---------------GRSSRPNAQKVVLIITDGESLQDTVTEP----A 255 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 + +EN ++I +I + N + LK + P + + V +++ + + + + Sbjct: 256 RRLRENGVEIFSIGVGDEINLEE-LKDMATDPDSNHVFQVGGYNAITGITTQVLKDICRI 314 Query: 413 KYSVI 417 K V+ Sbjct: 315 KVRVV 319 >gi|166031603|ref|ZP_02234432.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] gi|166028580|gb|EDR47337.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] Length = 685 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 44/336 (13%), Positives = 106/336 (31%), Gaps = 34/336 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ S + DR + + + DD K ++KE + + S + Sbjct: 207 ADDSYVEDRVKMYEGLDLKDVGDDVVSGYKKNLKEIQDLTGKGKKDYKAIKEAFSKMDQI 266 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + +I + + ++ + ++ I + A I + Sbjct: 267 VYQYIEPKNQAVVSI--QQIDASEFPTVKLYM-SIKDKTTGNVIENLDDAFFYINKQDAN 323 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 ++ VV + ++ + V D + K + F+ S+ + Sbjct: 324 AKYVKQVVKSANQLNEKEALKVDMVADVSGSMDGSP--LNEAKQVMSDFVGSVQFDA--- 378 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + L ++T V E S + + + + T A+ A + + + Sbjct: 379 -GDLVELTSFSTGVCLEQEFSDDAATLTNDINNLVTGDM---TSLYDALYTAVERVAAQN 434 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +I TDG +N SN + + A + + I I Sbjct: 435 GA-------------------RCVIAFTDGNDNY--SNCTKEDVVNVANRYHVPVFIIGI 473 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 S + + + +YNV + S+ +++ I Sbjct: 474 G-SIDYADVNDIATQTGGMYYNVSDVTSMDKIYEEI 508 >gi|126208028|ref|YP_001053253.1| tight adherence protein G [Actinobacillus pleuropneumoniae L20] gi|126096820|gb|ABN73648.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 520 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/507 (7%), Positives = 123/507 (24%), Gaps = 108/507 (21%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLG------DRFESISNHAKRALIDDAKRFIKNHIK------------------- 103 S E+ S + + + ++ Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDTQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 104 ----ESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---M 151 S+ + V + N ++D ++ + + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI------------------EMGERPIFLI 193 + N+KI++ I + Sbjct: 187 IDLMVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAVA 246 Query: 194 ELVVDLSGSMHCAMNS------------DPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + S + + + + + A + L+ + Sbjct: 247 PFALGAQHSNNQCIIPFILKKYSKDRISEKNIKSYLSANNNISAKDFALSLSYLVDIDKT 306 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPTDSTPAM 298 I+ + + + ++ + + + ++ + + T ++ + Sbjct: 307 INSIGGTFSSNSIIFNKNKFCLGRSNKNTHHWYNRDESSNFFSFIKRLHAEGSTLASSGL 366 Query: 299 KQAYQ---------------------ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 A +L+ + + + + Sbjct: 367 ITASNIMLKEESRSKSLGEQTKRVILVLSDGNDELRLNDEGTPFTQYSRITENLLLGQEE 426 Query: 338 DGENNNFKSNVNT-------IKICDKAK--------ENFIKIVTISINASPNGQRLLKTC 382 + I +CD+ + + KIV + + ++ + C Sbjct: 427 QTTDTYPYFMSKPPKKLTSNINVCDRIRNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHC 486 Query: 383 VSSPEYHYNVVNADSLIHVFQN-ISQL 408 V +Y+ + SL++ F+ I + Sbjct: 487 VG-NGNYYSANDKASLLNSFKQAIGET 512 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 65/233 (27%), Gaps = 44/233 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D ++ A + A F+ + V +G++ + + P+ Sbjct: 93 DVSGSMRAADVLPNRLVAAQEAAKSFIKDL------PRHVKVGIVAFAGSAQVAQLPTTN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK--------- 323 + + + T + A+ + L D + Sbjct: 147 HDDLITAIDSFQLQRA---TATGNAIVVSLATLFPDAGIDVSQFSAPSRQRGTPIDQAEK 203 Query: 324 --------IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-- 373 P +I LTDG+ + A + ++I T+ + Sbjct: 204 QAKEFTPVAPGSYTSAAVIMLTDGQRTTGVD---PLDAAKAAADRGVRIYTVGVGTVDGE 260 Query: 374 -------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + L + ++ A L V++ +S + K Sbjct: 261 TIGFEGWSMRVRLDEETLKAVANKTQAEYFYAGTAADLKKVYETLSSRLTVEK 313 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 80/253 (31%), Gaps = 52/253 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 V L ++P + +VD+SGSM Sbjct: 191 PTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMDTED------------------- 231 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K++L + + + LI Y + + + G + ++ + + Sbjct: 232 KLQLAKSSLKMLTKQLRAQDS------ITLITYAGNTKVVLPSTPG-NQTQKILNAIDNL 284 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+K AYQ T K+ I+ LTDG+ N S Sbjct: 285 TASGSTNGEAAIKLAYQQATEHFKKDGIN----------------RILMLTDGDFNVGVS 328 Query: 347 NVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVN--------AD 396 +V ++I ++ I + T+ +++ + + + + D Sbjct: 329 SVKDMLQIIRSNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEAKKVLID 388 Query: 397 SLIHVFQNISQLM 409 + F +++ + Sbjct: 389 EMSATFNTVAKDV 401 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 96/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A + + R++ ++ + + + Sbjct: 67 QQYSDKQTLQGRLKEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 126 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 127 VDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 186 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 187 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 226 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 227 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 273 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 274 AEIN-AAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILL 316 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T+ + S + ++ + + + Sbjct: 317 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYID 376 Query: 394 NADSLIHVFQN 404 V + Sbjct: 377 TLSEAQKVLNS 387 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 72/207 (34%), Gaps = 34/207 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K++ K+ K L FL D +G++ + + + + + Sbjct: 92 VSNSMKEKNKLKISKEILRDFLLKRDEED------RIGILVFDNLPFRLMPLTSDRGALL 145 Query: 278 QYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ +++ T + +A + D++ K II L Sbjct: 146 RVISIIRPAMVDVGGTAMYDGLVEALNMFMKDRR-------------------NKIIILL 186 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-- 394 TDG + N K + ++ KI TI +++ N L + ++ + V Sbjct: 187 TDGGDINSKYT--LEDVVRFNQDIGAKIYTIGVSSGMNFYVLERLSEATGGKAFFVTKDY 244 Query: 395 ADSLIHVFQNISQL----MVHRKYSVI 417 +L VF I++L + ++ V Sbjct: 245 QKALRSVFDEINRLEPSYIQEYRFDVE 271 >gi|296815358|ref|XP_002848016.1| U-box domain containing protein [Arthroderma otae CBS 113480] gi|238841041|gb|EEQ30703.1| U-box domain containing protein [Arthroderma otae CBS 113480] Length = 748 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 75/259 (28%), Gaps = 33/259 (12%) Query: 152 TSYDYRLQFIEHL-LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L I + N +VS P L + I LV+D+S SM+ A Sbjct: 30 VEENMTLDPILSVHSIPNKNSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSA-API 88 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P + + K+A + +++ + ++ + T + E Sbjct: 89 PTGESGGEDTGLSI---LDLTKHAAKTIIQTLNEND------RLAVVTFCTEIRVAFELE 139 Query: 271 WGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL-- 327 + +E+ + V +D L T+ +K+ ++L ++ + P+ Sbjct: 140 FMSEENKSKVLAAIDCLHGISSTNLWHGIKEGLKVLATNSTQGNVQALLVLTDGAPNHMC 199 Query: 328 ----PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 K L D + + I T L Sbjct: 200 PAQGYVPKLRQTLLDHRDLTGSLPL---------------IHTFGFGYYLRSPLLQSIAE 244 Query: 384 SSPEYHYNVVNADSLIHVF 402 + +A + VF Sbjct: 245 IGGGTFAFIPDAGMIGTVF 263 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 63/178 (35%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + +G++ Y+ R + + T++ Sbjct: 59 QWVSNLVETFEIGPDKTRVGVVRYSDRPTTEFDLGKYKTREEIKEAARKIRYYGGNTNTG 118 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + ++ G ++ +K I LTDG + ++ + + Sbjct: 119 DALRY----------INTYSFSKEAGGRLSDRTVKKVAILLTDGRSQDYVLDP-----AN 163 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A++ I+I + + + + L S P+ + ++V + +++ + + + + Sbjct: 164 AARQAGIRIFAVGVGEALKEE--LDEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCE 219 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 69/231 (29%), Gaps = 35/231 (15%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +P + + + +V+D SGSM + T+M + Sbjct: 8 LSAGALPVAAVPAVTDDAGGSLVMVLDSSGSMGEDDGTGS--------------TRMESA 53 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLIL 289 + A+ +D++ VY + + P V+ V Sbjct: 54 RRAVGAVVDALPDGYPTGLRVYGADRPQGCADTRLVRPVRPLDRAAVKSAVAGV---RPT 110 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T ++++A + L + I+ ++DGE+ Sbjct: 111 GDTPIGLSLRKAAEDL---------------PAPRDGAARTRTIVLVSDGEDTCGTPPPC 155 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 + + ++I T+ + L+ + +Y+ +AD+L Sbjct: 156 EVAARLAGQGAGLRIDTVGFQVKGAAREQLECVAEAGNGRYYDAPDADALA 206 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 39/377 (10%), Positives = 110/377 (29%), Gaps = 59/377 (15%) Query: 56 ANNAAILA---------GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 A + A+ A SK+++ + + + + + H Sbjct: 189 ALDGAVSAPSTSRARMPAESKLMAPQQPSTLPADQMQPQPENRDRIEDFKTNPVHAALED 248 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMAN-NRLDSSN--NTIFYNMDVMTSYDYRLQFIEH 163 + S + + + + N Y+ S Sbjct: 249 PVSTFSIDVDTASYSFVRSSLKQGTLPQVDTVRVEEMINYFPYDWKGPESASTPFNSTVS 308 Query: 164 -LLNQRYNQKIVSFIPALL-RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + ++ E+P + ++D+SGSM Sbjct: 309 VMPTPWNTHTRLMHVAIKGFDVKPTEQPKANLVFLIDVSGSMDEPD-------------- 354 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ LK+A L + + + ++ Y +EP+ ++K + ++ Sbjct: 355 -----KLPLLKSAFRLLVSKLKADD------TISIVTYAGDAGTVLEPTKASQKDK-ILS 402 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T +K+AY++ + ++ TDG+ Sbjct: 403 AIDNLTPGGSTAGEAGIKEAYRLAQKSFVKDGVN----------------RVMLATDGDF 446 Query: 342 NNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N +S+ + ++ +K ++ + + ++++T + D+L Sbjct: 447 NVGQSDDDDLKRLIEKERKTGVFLSVFGFGRGNLNDQMMQTIAQNGNG--TAAYIDTLAE 504 Query: 401 VFQNISQLMVHRKYSVI 417 + + + +++ Sbjct: 505 AEKVLVEDASSTLFTIA 521 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 59/173 (34%), Gaps = 23/173 (13%) Query: 228 MAALKNALLLFLDSIDLLSHV-----KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + ++D + D+ M I Y+T + + + +++ + + Sbjct: 79 LDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ 138 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T ++A Q + + + II +TDGE Sbjct: 139 LRKIVPDGHTFMQAGFRKAIQQIETFNSGN---------------KVPSMIIAMTDGE-L 182 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + +T++ KA++ + T+ + + + SPE+ + V N Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTVGV--ADYKLDQITAIADSPEHVFAVENG 233 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 59/173 (34%), Gaps = 23/173 (13%) Query: 228 MAALKNALLLFLDSIDLLSHV-----KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + ++D + D+ M I Y+T + + + +++ + + Sbjct: 79 LDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ 138 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T ++A Q + + + II +TDGE Sbjct: 139 LRKIVPDGHTFMQAGFRKAIQQIETFNSGN---------------KVPSMIIAMTDGE-L 182 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + +T++ KA++ + T+ + + + SPE+ + V N Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTVGV--ADYKLDQITAIADSPEHVFAVENG 233 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 73/246 (29%), Gaps = 50/246 (20%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + K+ + +RP + +V+DLSGSM Sbjct: 32 MNSTTAPLEESKLKVMLELTGGDSSNDRPGLDLVVVLDLSGSM----------------- 74 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + K+ LK A+L + + + + ++ ++ + TE ++ + Sbjct: 75 ---EGEKIEKLKAAILFMIKKLSSID------RLSIVTFSRDARRLCPLRQITENSQKDL 125 Query: 281 TRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 ++ L + T ++ ++L + I+ ++ Sbjct: 126 ENLINGLHAYGAANITAGLQTGLKVLNDRRFTGGRVA---------------TIMLVSSS 170 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSL 398 E NN D+ + + T A L +S +V N D+L Sbjct: 171 EQNNGDD-------ADQILVGNVPVHTFGFGAYHEPGVLKAIAHNSIGGTFSDVQNMDNL 223 Query: 399 IHVFQN 404 F Sbjct: 224 NKAFSQ 229 >gi|319777804|ref|YP_004134234.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171523|gb|ADV15060.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 343 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 15/206 (7%), Positives = 46/206 (22%), Gaps = 18/206 (8%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 + +F + + IGF + +++ + A +AA+L+ M + + + Sbjct: 1 MAPLFVFMLAPIITAIGFSVDYTRAVQTRSNEQQALDAAVLSITG-MDTTSTLAQRQTML 59 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + S + + + + M N + + Sbjct: 60 QDTFIANHGLGTPT-------LNSFVVSANGTATAQAMASYSMPTVFMQIARINTVPVAV 112 Query: 142 NTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 + + + + + L+ + + Sbjct: 113 GSAASKTPALVQTTFKVSKVSG--WWNKTMTLY-------GTTFGATVAKPLMSIEYTYN 163 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRT 226 G +T Sbjct: 164 GFGDPKGYGTTNVYTITNNGGADIKT 189 >gi|12052774|emb|CAB66559.1| hypothetical protein [Homo sapiens] Length = 957 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLFDKS----------------SRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 43/289 (14%), Positives = 92/289 (31%), Gaps = 35/289 (12%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE-RPIFLIELVVDL 199 + +QR + V ++ E R + V+D+ Sbjct: 53 ATSKAAIDRQEVRVSTTPIRAAIARDQRKDDFEVLVTVEAPKVVAPEKRAPIDLVAVLDV 112 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM+ + +++ LK A+ + + + ++ + Sbjct: 113 SGSMNKEEF----------VRGKHMSSRLDLLKIAMKYIIKLVRDAD------RLAIVSF 156 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 V + + R+ + +D TD PA+K+A + + ++ Sbjct: 157 NHAVVSEYGLTRNSADSRKKLENLVDKLKASGNTDFRPALKKAVEDMNIQNIKNSSAYNN 216 Query: 319 RQGVKIPSLPFQK----FIIFLTDGENNNFKSNVNTIKICDK---------AKENFIKIV 365 Q + +K FI+ L+DG + S +N K+ A + Sbjct: 217 FQILDGRGKEEKKKRVGFILLLSDGVDQFQYSRINWEKVAKSTDVDHSEVGAMLRKYAVH 276 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRK 413 T +AS + L + S + V N D++ F ++ + + Sbjct: 277 TFGFSASHDPVPLRQISALSYGLYSFVCKNLDNITEAF---ARCLGGLR 322 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 58/190 (30%), Gaps = 28/190 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 M K+A FL+ +D + +I + + V I + + Sbjct: 182 NSGSMSGNPMTQAKSAAKQFLNYVD----FSNGDQVEIIEFNSDVYIRIPYGSDIKSLNT 237 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T A+ V+ S K I+ TD Sbjct: 238 AID---TMESNSQTALYDALYTGL-------------------VRAYSQSGPKCILAFTD 275 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE N S + ++ + ++ I I I + + + + L + + ++ A L Sbjct: 276 GEENA--SIRSVSEVTELSRATSIPIFIIGVGSLIDEESLKEIAEQTGGEYFYSPTAVEL 333 Query: 399 IHVFQNISQL 408 +++ + Sbjct: 334 EQIYKTVYDQ 343 >gi|322697455|gb|EFY89235.1| U-box domain-containing protein [Metarhizium acridum CQMa 102] Length = 757 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/241 (11%), Positives = 65/241 (26%), Gaps = 18/241 (7%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V +P E I LV+D+S SM +D ++ T Sbjct: 30 PSRRSLVVKVVPPRAPAAQIEHVPCDIVLVLDVSTSM-------EDDAPVPGEAEETGLT 82 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 + K+A L +++++ +G++ + T + + + Sbjct: 83 VLDLTKHAALTIIETLNEKD------RLGIVSFATNSTIVQTLTHMDISNKEEARRKIKA 136 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ Q + +P+ ++ E+++ Sbjct: 137 LDPNGSTNLWHGIRDGIQAFEQSSENGNIRAMMVLTDGMPNHMCVSPLLLWAHREHSSNS 196 Query: 346 SNVNTIKI----CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I T L + + +A + V Sbjct: 197 VDRCPQQGYIPKLKTLSRLPATIHTFGFGYGLRSGLLKSLAEYGHGNYAFIPDAGMIGTV 256 Query: 402 F 402 F Sbjct: 257 F 257 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 53/159 (33%), Gaps = 30/159 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D K ++ K + L + E +GLI + R + Sbjct: 96 DTSRSMLADDVKPNRLEVAKREIEDLLKIL-------EGDRVGLIAFAGRAFTYCPLTSD 148 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R ++ + + + T A+ + + K Sbjct: 149 YSAFRLFLNDLNVNIIPVGGTAIAEAIYKGIDAFGE------------------NENNHK 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +I +TDGEN+ + +K KAKE I I T+ + Sbjct: 191 AMIIITDGENH----ETDPLKAASKAKEKGIVIYTVGVG 225 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/322 (8%), Positives = 71/322 (22%), Gaps = 16/322 (4%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 +II LS+ + +G + + + K+ ++++ +A LA A + L Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEA 78 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 ++ + A + E + + +T ++ + Sbjct: 79 AGLTTGTRNA------ALFQGKAVEMFENLNVSYSDTPDNTFYTKDKVPYSLDKIK---- 128 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + L + + Q ++ P+++ Sbjct: 129 YVKCTAERTGIAQWFIQVLNTLPGMNIQPSTVNAMAVATTTSAQTACAIPVYICTPQTAN 188 Query: 200 SGSMHCAMNSD--PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 +D + D + V + Sbjct: 189 PVRTAYNRGDWIKSKDEKDPYGPGSFGWADLTPPGGGASELADLLAGSGQCDLSVVGSKV 248 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 G + I + T + + + +F Sbjct: 249 GQPGSISSLIPAWNTRFGIYT----GSYKGPQDGTPDFTGYAYSVTKWNPPNGGNAYADF 304 Query: 318 FRQGVKIPSLPFQKFIIFLTDG 339 ++ T G Sbjct: 305 IQKRGTNEPYQTDTNSGLSTQG 326 >gi|311244457|ref|XP_001927071.2| PREDICTED: collagen alpha-1(XII) chain [Sus scrofa] Length = 1894 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/243 (11%), Positives = 78/243 (32%), Gaps = 28/243 (11%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +++ + ++ I + + ++ L+ A V+ + + Sbjct: 82 SELIPEVEYVVTITSYDEVEESVPVIGQLTKCSVSAWTDLVFLVDGSWSVGRNNFKYI-- 139 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L + +E +G++ Y++ + ++ Sbjct: 140 ----LDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQFYQRDELLAAIKKIPYKGG 195 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ + ++ + F K I +TDG++ + Sbjct: 196 NTMTGDAIDFLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD-----EV 237 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 238 EIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQ 295 Query: 409 MVH 411 + Sbjct: 296 VCS 298 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 79/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + + ++ PI ++E + + C+ D + + Sbjct: 368 MLSVRDLSADTEYQISV-SAMKGLTASEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 426 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 427 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 483 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 484 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 530 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 531 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 582 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 583 ---DAFQRISFELTQ 594 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1113 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIQY 1172 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1173 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1215 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1216 -EVKKAALIIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1268 Query: 406 SQLMVH 411 ++ Sbjct: 1269 EDNLIT 1274 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 73/228 (32%), Gaps = 46/228 (20%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM T++ K A+ + +D + Sbjct: 751 MVFVLDRSGSMSQT---------------VGDVTRLDVAKQAVSAAANLLDPQTGS---- 791 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + T+ P ++ A+Q L + + Sbjct: 792 -LGVVMFGSEAEVALPLGPLPDAA-GIAAALGHLQPGGGTNIYPGLQLAFQALRASDADA 849 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + I+ +TDG ++ + I + +++I ++ Sbjct: 850 ------------------RHIVVMTDGMSDEADFPGLLAA----IRAEGITVSSVAIGST 887 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + +N + +L + ++Q + + +VI +G Sbjct: 888 SETSIAEDIALLGGGRFHNTRDFGALPSI---LAQEALMLRGAVIEEG 932 >gi|149188854|ref|ZP_01867144.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] gi|148837274|gb|EDL54221.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] Length = 505 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 15/175 (8%), Positives = 50/175 (28%), Gaps = 7/175 (4%) Query: 34 LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + + + + + + A+ AA LA + L+ + Sbjct: 1 MGFTFWAVEGTRYVQETSRLRDASEAAALAVTIEDQPTLANNLATKYIENYVRDIKSTAL 60 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-NRLDSSNNTIF-----YN 147 + ++ + Y + +S S + ++ D + ++ Y Sbjct: 61 TAQRFHQAENQNAGVLEYIQYTVNAKTTHDSWFASSFIPSFGDQQDLAGRSLARKYPVYL 120 Query: 148 MDVMTSYDYRLQFIEHLLNQR-YNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 D + F + ++ N+ ++ ++ + ++G Sbjct: 121 GDNNIDIVFVSDFSGSMNSRWGTNRNRKIDDLKTAINQISDKILCTSIRRDQVNG 175 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 71/204 (34%), Gaps = 14/204 (6%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPS 270 DV S + + +F D ++ + + G+ + Sbjct: 297 NDVLSRSYYYPDDYNYVDFNQTVSTMFSDKSGLSSNYYRINGVQLFAGFGDQSSTQFHNI 356 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K+ + T + + + QIL S + + K Sbjct: 357 ALTNKL-SDLNAISPMWANGSTAAFQGILRGAQILKQGDPNSSDQDKQQAYTKK-----I 410 Query: 331 KFIIFLTDGE---NNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSS 385 K ++ L+DG+ NN + +C+KA++ + I I IN + Q + CV Sbjct: 411 KMLLILSDGQESPNNGILRGLVNAGMCNKARQQIPGLYIGVIGINFQASQQSGFQDCVVD 470 Query: 386 PEYHYNVVNADSLIHVFQNISQLM 409 P ++++ +L + + I +L+ Sbjct: 471 PNE--DIIDVSNLDELIKKIEELI 492 >gi|332210152|ref|XP_003254169.1| PREDICTED: collagen alpha-1(XXI) chain [Nomascus leucogenys] Length = 957 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|297678422|ref|XP_002817079.1| PREDICTED: collagen alpha-1(XXI) chain-like [Pongo abelii] Length = 612 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/350 (7%), Positives = 89/350 (25%), Gaps = 21/350 (6%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + N I+F +++ + + + + K +++ ++A L A+++ + Sbjct: 9 LRSKQNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATELDEGAT 68 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 R ++ +D S + +N + Sbjct: 69 HEQARAAVVALIQLNIAHNDHHELASA---IDFSIVNNGLDQMT--AQLNVEFSQLPDPF 123 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + S + ++ + ++ + P+ + + ++ Sbjct: 124 IQDNNESAKYVKVSLSQLELDNFLADVFSF---NKQVSATALSGPSSDISNCYQDLVPMV 180 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK------MAALKNALLLFLDSIDLLSH 247 + ++ A ++ + + L+ L + D+ + Sbjct: 181 VCADTANAGVNGANEYGLKEYSLNLMKIGSNSNSPIGPGNFQLLR--LGSNTGAADIRAA 238 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + D +G + +T + P+ V + + L Sbjct: 239 MAGDTDLGTVCFTPGISNQSVPTEPGNTVGPVAQGLNTRMGEW-----HGPVNSTDHLRD 293 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 T D +++ + + CD Sbjct: 294 TNICEGATIELDANGDALEAGAAAKSYTFNDYDSDGAQYLSDANTSCDAI 343 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 72/274 (26%), Gaps = 46/274 (16%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 L V + +L ++ + ++ R + LV Sbjct: 21 LPLFVMAKAITPAVTVKSELSAPV---MLANSEDKNYLKISLTGFNLDSKRRSPINLALV 77 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D S SM ++ + A +L ++ +D + + Sbjct: 78 IDRSTSM--------------------TGDRIEKAREAAILAVNMLDASD------TLSV 111 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + Y E I + K + T + + Sbjct: 112 VAYDNNAEVIIPATKVNNKPALIAKIQQHIHPMGMTALFAGVSKGI-------------- 157 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNG 375 Q K + II L+DG+ N S + A + I I TI + N Sbjct: 158 --GQVDKNLNPEQVNRIILLSDGQANTGPTSISELSDLARMAAKKGIAITTIGLGEDYNE 215 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + S H V N+ L F Q + Sbjct: 216 DLMTAIAGYSDGNHSFVANSADLESAFTKEFQDV 249 >gi|118575253|ref|YP_874996.1| hypothetical protein CENSYa_0043 [Cenarchaeum symbiosum A] gi|118193774|gb|ABK76692.1| conserved hypothetical protein [Cenarchaeum symbiosum A] Length = 311 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 68/214 (31%), Gaps = 46/214 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++ A K A + + +G+I + + + Sbjct: 92 DGSGSMAADDYAPTRLDAAKAAAAQLVGRLAPGD------RVGVILFGSSAITISYLTSD 145 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + T + ++ + ++S Sbjct: 146 RAEAAGRIGEIVQGD--GATALGDGLALGVEMAAAGPEKS-------------------T 184 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-----------------NG 375 II L+DG +N+ + + + A + I++ TI + + + Sbjct: 185 IILLSDGVHNSGR--TVPGEALELAIQGNIRVHTIGMGSDEPVRVGDDIFGEPRYAELDE 242 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + + +Y V+ +L +F+ +S+ + Sbjct: 243 DTLREIADRTGGMYYTSVDNPTLDGIFEALSEDI 276 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 39/356 (10%), Positives = 92/356 (25%), Gaps = 49/356 (13%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 G + + AKR + + ++ + SA ++ +S+ + Sbjct: 183 AGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVG- 241 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + T + ++ I Sbjct: 242 -EVPRITVTNTVVTAPRYRKYIKANNDGTY------DLSLNVTGTQSGSSQTTVSPADIV 294 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDV 252 +V D SGSM M + +++ K A+ + + ++ Sbjct: 295 VVFDTSGSMSNPMGHN---------------SRLEVAKTAVNSMAQHLLTSENQGKDSNI 339 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 M L+ ++T V + + V T+ A+K A LTS +K Sbjct: 340 RMALVPFSTTVGNVSNFTDNAMDIVSAVNGL---RADGGTNWEAALKAANAKLTSGRKGV 396 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-------------------VNTIKI 353 F G + + + Sbjct: 397 KKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAV 456 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + ++ +++ P ++ + +Y+ + D L F +I + Sbjct: 457 AEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDELNKAFADIIGQI 510 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 75/270 (27%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 51 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 105 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ ++ + ++ Y Sbjct: 106 TSM--------------------SGERIEKAREAAILAVNMLNTTD------TLSVVAYD 139 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 140 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 183 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 184 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 243 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 244 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 273 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/371 (9%), Positives = 96/371 (25%), Gaps = 61/371 (16%) Query: 45 DWHYKKNSMESANNAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + K +++ A A + + R++ ++ + + + Sbjct: 74 QQYSDKQTLQGRLKEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 133 Query: 102 IKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + F N + + + S + + Sbjct: 134 VDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELA 193 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + I + E P + ++D SGSM Sbjct: 194 PA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------------ 233 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 234 -------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHK 280 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T+ ++ AYQ + I+ Sbjct: 281 AEIN-AAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILL 323 Query: 336 LTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVV 393 TDG+ N + +I+ + K +E+ + + T+ + S + ++ + + + Sbjct: 324 ATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYID 383 Query: 394 NADSLIHVFQN 404 V + Sbjct: 384 TLSEAQKVLNS 394 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 48/141 (34%), Gaps = 14/141 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 +K +++ N +II L + + L +++ + + +K + +A ++ LA A ++ Sbjct: 1 MRKL--NDRGNVAIISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALELPD 58 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 N R D + +L A + +I+ + N + Sbjct: 59 NEVRARTAAV------------DYLQKNNVDPSLALITVGADHKSIQIEEVKNVKHLFAQ 106 Query: 131 HMANNRLDSSNNTIFYNMDVM 151 + N + T Sbjct: 107 IIGINSSNIKAKTKAVVAPAK 127 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 47/382 (12%), Positives = 105/382 (27%), Gaps = 64/382 (16%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALID------DAKRFIKNHIKESLSG 108 + + A A A + + A D + + N + Sbjct: 82 QSVDQAADAAAPMSTMGGAVGLAARSRMETIPAPAPADMLPPPAENRERFGNADANPVKS 141 Query: 109 YSAVFYNTEIQNIVNSSRISMT-HMANNRLDSSNNTIFYNMDVMTSYDYRLQFI------ 161 +A +T ++ +S + + + + + M YD+ Sbjct: 142 VAAEPVSTFSVDVDTASYSFVRRSLMAGEMPNPDAVRVEEMVNYFPYDWPRPTTAAEPFK 201 Query: 162 -----EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + I + E P + ++D+SGSM Sbjct: 202 ATVTVTPTPWNAGTRLMHVAIKGYEVV-QKEAPRANLVFLIDVSGSMDEPD--------- 251 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 K+ LKNA L +D + + ++ Y +EP+ +K Sbjct: 252 ----------KLPLLKNAFRLLVDRLRPDD------TVSIVTYAGNAGTVLEPTAVKDKT 295 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ T + AYQ+ R I+ Sbjct: 296 K-ILSAIDTLQPGGSTAGAAGIDAAYQLAEKAFVRDGVN----------------RILLA 338 Query: 337 TDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 TDG+ N S+ ++ + + + I + + L++T + Sbjct: 339 TDGDFNVGPSSDEELKRMVETKRRSGIFLSVLGFGRGNYNDALMQTIAQNGNGV--AAYI 396 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 D+L + + + + + Sbjct: 397 DTLAEAQKTLVEEAGSSLFPIA 418 >gi|260782980|ref|XP_002586557.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] gi|229271674|gb|EEN42568.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] Length = 534 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 57/179 (31%), Gaps = 25/179 (13%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + + D L +G++ Y+T V T K + T Sbjct: 165 SVMKFAADMSLRLDISATTTRVGMVQYSTNVTPEFMLKEHTTKKSVEKAIGDVKRLGGGT 224 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ A+K F R + P ++ I +TDG++ + Sbjct: 225 NTGKALK-----------------FVRTEMDWRDPPTKRVAIVVTDGKSQDDVGTP---- 263 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ + + + + + LK P Y + + D L + Q+IS +V Sbjct: 264 -ATALRQAGVVLYAVGVGLPTDE---LKEITGDPTKVYALNSYDELQDIIQDISNSVVE 318 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 70/235 (29%), Gaps = 46/235 (19%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 I+ P RP + V+D SGSM K+ Sbjct: 24 DLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGH-------------------NKITY 64 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A+ +D + ++ + + +V+ I + K + R + + Sbjct: 65 ARQAVCYAIDQLSPGD------HLSVTIFDDQVQTLIPST--LVKDKAQFKRLVQGINPG 116 Query: 291 P-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD Q ++ + II L+DG N ++N + Sbjct: 117 GCTDLHGGWLQGGIQVSQNLSAEL-----------------NRIILLSDGLANRGETNPD 159 Query: 350 TIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + T+ + N L S +Y V +A+ L +F+ Sbjct: 160 IIATDVHGLAQRGASTTTLGLGDDYNEDLLEAMARSGDGNYYYVADAEQLPTIFE 214 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 83/244 (34%), Gaps = 49/244 (20%) Query: 172 KIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 I++ + + +M I VD+S SM + +D K ++ Sbjct: 69 VIIAIVNPKMGTKMETVKRQGIDIVFAVDISKSM---------------LAEDIKPNRLE 113 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-I 288 K + ++ + + +G++GY + + + Y+ ++ Sbjct: 114 KTKQLVSQIINQLG-------NDRVGIVGYAGSAYPILPMTTDYSIAKMYLQSMNTNMVS 166 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A+K A K II ++DGE++ ++ Sbjct: 167 SQGTAFNDAIKLAVDYF-------------------DVKDTSKLIILVSDGEDHGEGAS- 206 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQNISQ 407 + D AKE ++I+TI + LK + + N ++ + ++ ++ + Sbjct: 207 ---EAIDLAKEKGVRILTIGVGTEKGALIPLKDNKGTISSYKKDQNGENVITKLYPDVLK 263 Query: 408 LMVH 411 + Sbjct: 264 NIAT 267 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/373 (9%), Positives = 94/373 (25%), Gaps = 62/373 (16%) Query: 44 LDWHYKKNSMESANNAAI----LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + K +++ AA A + R++ ++ + + + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYKQFDDNPVKQVAQNPLATFS 134 Query: 100 NHIKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + F N + + S T + Sbjct: 135 LDVDTGSYANVRRFLNHGLLPPPDAVRVEEIVNYFPYDWDIKDKQSIPATKPIPFAMRYE 194 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + I + E P + ++D SGSM Sbjct: 195 LAPA-------PWNEQLTLLKIDIL-AKDHKSEELPASNLVFLIDTSGSM---------- 236 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ ++++L L + + + + ++ Y + G+ Sbjct: 237 ---------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGS 281 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K T+ ++ AYQ + I Sbjct: 282 HKAEIN-AAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGIN----------------RI 324 Query: 334 IFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYN 391 + TDG+ N + +I+ + K +E+ + + T + S + ++ + + Sbjct: 325 LLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSY 384 Query: 392 VVNADSLIHVFQN 404 + V + Sbjct: 385 IDTLSEAQKVLNS 397 >gi|218778177|ref|YP_002429495.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759561|gb|ACL02027.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 558 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 64/218 (29%), Gaps = 44/218 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + ++D+SGSM+ K+ +K ++ + + + Sbjct: 191 DVKPSNLVFLLDVSGSMN-------------------SENKLPLVKRSMEMLVKELGAGD 231 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V ++ Y + + K R+ +T T ++ AY++ Sbjct: 232 RVS------IVTYAGSAGLVLPSTSARNK-RKIITALDRLEAGGSTAGGEGIELAYRV-- 282 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIV 365 + +I TDG+ N S+ +++ ++ + I + Sbjct: 283 -AWENLIPEG-------------NNRVILCTDGDFNVGVSSTPELVRMIEEKRRAGIYLT 328 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVF 402 ++ + Y + + VF Sbjct: 329 ICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVF 366 >gi|149922129|ref|ZP_01910569.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] gi|149817066|gb|EDM76548.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] Length = 546 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 40/257 (15%), Positives = 79/257 (30%), Gaps = 41/257 (15%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 Q + + + A G RP + +V+D SGSM Sbjct: 105 AGQPFEMPAIIRLSADDEAGQGPRPGLDLAIVLDRSGSM--------------------G 144 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 K+ K A L ++ +D + LI Y V + + + Sbjct: 145 GDKLRFAKQAGLDLVNRLDEQD------RVTLISYDDTVTPLSNLQRVDDDGIEVLRRQL 198 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +D + T PA+ Q L + F Q + +I L+DG N Sbjct: 199 LDIQVGGTTALGPALFMGLQRLAA------PEPFGPQTRTEARHDRLRHVILLSDGIANV 252 Query: 344 FKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I + E+ + + T+ + N + + ++ + +A+S+ Sbjct: 253 GE--TRPEVIGGRVAEHFGGGVSVSTLGMGLDYNEDLMTRIADEGGGRYHFIEDAESIP- 309 Query: 401 VFQNISQLMVHRKYSVI 417 + + +V Sbjct: 310 --AMLGDELAGLTATVA 324 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 63/206 (30%), Gaps = 24/206 (11%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 ++ + + LK A F + S E + L+ + + + Sbjct: 60 TDNKERLYYTTNDANDRFYYLKQAATNFTTQLAQSSPNSE---IALVTFNKTATEQFDFK 116 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + T + +AY+IL +D+ S + Sbjct: 117 NVGKDSAYITETINAMETSGGTHQNEGLDRAYKILNNDQNTSN---------------LK 161 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK-----TCVSS 385 ++++ LTDG N + T I +K K K++T+ + L + Sbjct: 162 RYVVLLTDGCPNGVTYDQITTSI-NKIKSTNTKLITVGVGLDETNTGLKAAKDYLQANAD 220 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH 411 YN +A L +F I + Sbjct: 221 DNMAYNANDASHLNTIFTQILGQTTN 246 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/384 (9%), Positives = 84/384 (21%), Gaps = 42/384 (10%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDD-----AKRFIKNHIKESLSGYSAVFYN 115 +L S + + + AL DD + + + Sbjct: 185 VLTDGSWTSQDGTSPLSPASQNPSITAGALYDDQDIPTYVVAVAGDPQAEAAADEVAAAG 244 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIF--YNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 I ++ + N + ++ + + L +LN Sbjct: 245 GTTAAIDGNTPEELGTALNEIVTDIIESVVAPECVGGLPRAMIILDASSSMLNTDGGTMF 304 Query: 174 VSFIPA----LLRIEMGERPIFLIELVVD-----------LSGSMHCAMNSDPEDVNSAP 218 + +GE +F I++ + G + Sbjct: 305 GAMGETGWDQAGGALVGEMGLFDIDVGTGTAEDLTHLGLTVFGYDTPNLAGAESLEEDPG 364 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLL-SHVKEDVYMGL--------IGYTTRVEKNIEP 269 + L D+ G I + + P Sbjct: 365 YLPAYDAGEPRVLVGYGPCSKDNFAWALDPNSSCTEPGCTDPWGGPPIAWDYDDGQANPP 424 Query: 270 SWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + T + ++ + + + Sbjct: 425 EFDAPTFSHMPQCAGNGFCSGSGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTAD---E 481 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCV-- 383 F I +TDG+ N + +N ++ + I I + Q L+ Sbjct: 482 TLYFNILITDGQYNGYSTNAQVQGELEEMYNSGITTYVIGFGDGVDTAAAQAQLQNMAQW 541 Query: 384 --SSPEYHYNVVNADSLIHVFQNI 405 S +Y+ N L I Sbjct: 542 GSGSQNNYYDANNQTELEQALTTI 565 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|301780200|ref|XP_002925521.1| PREDICTED: collagen alpha-1(XII) chain-like [Ailuropoda melanoleuca] Length = 3172 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIGALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLIKNTFTESAG-------------ARAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 52/173 (30%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 I V + L+ Y+ T KV + T++ AM Sbjct: 467 IKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDIIEAINTFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPSKGSRGNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 2339 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEIFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGI--KNADEDELKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 75/270 (27%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 39 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 93 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ ++ + ++ Y Sbjct: 94 TSM--------------------SGERIEKAREAAILAVNMLNTTD------TLSVVAYD 127 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 128 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 171 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 172 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 231 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 232 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 261 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 55/185 (29%), Gaps = 29/185 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ K+A I + + +G+ + + + + T Sbjct: 422 SGSMSGEKLEMAKSAA------IATAEVLTRNDSIGVYAFDSEAHVVVPMTRLTSS-SAV 474 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ PA +A L K + K +I LTDG Sbjct: 475 AGQIAGLTSGGGTNLHPAFTEARNALQRTKAKI------------------KHMIILTDG 516 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + + + + + I T++I + L Y ++A +++ Sbjct: 517 QTSGQGY----EALASQCRAEGVTISTVAIGDGAHVGLLQAIASLGGGKSYTTLDAANIV 572 Query: 400 HVFQN 404 +F Sbjct: 573 RIFTQ 577 >gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella aurantiaca DW4/3-1] Length = 749 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 37/327 (11%), Positives = 87/327 (26%), Gaps = 59/327 (18%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 R + K + + V +++I + + D+ D Sbjct: 176 RLVGGSAKNDFTFSAKVSSKVPLKSIYSPTHPMDVSRRGEAEAVVGLEQVNGADLSKDLD 235 Query: 156 YRLQFIEHLLNQRYNQKI--------VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + + ++ I + E + V+D SGSM Sbjct: 236 LYFSVSDKAVGLSLLTYKQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQ--- 292 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 ++M K+AL + ++ ++ ++T VE Sbjct: 293 -----------------GSRMQIAKDALKYCVTRLNPQDTFN------VVRFSTDVEALF 329 Query: 268 EP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 S E +++ V + T A+ + Q Sbjct: 330 PALKSAQPENIQKAVAFVEQLEAIGGTAIDEALVRGLQ------------------DNDG 371 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVS 384 ++F+TDG+ +++ I ++ ++ T + N + L + Sbjct: 372 KSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSD 431 Query: 385 SPEYHYNVVNADS----LIHVFQNISQ 407 V + + + +S Sbjct: 432 GAGTSDFVRDGKEFETKISSFYDKVSN 458 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/354 (10%), Positives = 93/354 (26%), Gaps = 54/354 (15%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--- 115 A+ + A R+E + + + + F N Sbjct: 103 RAVASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQ 162 Query: 116 ---TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 E + + I + + + Y L Sbjct: 163 LPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTL 219 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ + P + ++D SGSM ++ ++ Sbjct: 220 LKVDVQ-ARDMQTRDLPPANLVFLIDTSGSMQ-------------------PAERLPLIR 259 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +AL L ++ + ++ Y G T V T ++ + T Sbjct: 260 SALKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGST 312 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ AY+ + I+ TDG+ N ++ I+ Sbjct: 313 GGEAGLRLAYEQAEKGFIKGGVN----------------RILLTTDGDFNLGITDPKDIE 356 Query: 353 I-CDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 K +E I + T+ + + ++ + + + + V ++ Sbjct: 357 ALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 410 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/337 (9%), Positives = 87/337 (25%), Gaps = 58/337 (17%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R++ ++ + + + + F N + + Sbjct: 113 TARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQGQLPPPDAVRVEEMVNYFP 172 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + N S + + + I ++ P Sbjct: 173 SDWVINDKQSIPASKPIPFAMRYELAPA-------PWNEQRTLLKVDIL-AQDLKSEALP 224 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ ++++L L + + + Sbjct: 225 ASNLVFLIDTSGSM-------------------YSDERLPLIQSSLKLLVKELREQDN-- 263 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ Y + + G K T+ ++ AYQ Sbjct: 264 ----ISIVTYAGDSRIALPSTSGNHKDEIN-AAIDSLNARGSTNGGAGLEMAYQQAAKGF 318 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTIS 368 + I+ TDG+ N + +I+ + K +E+ + + T+ Sbjct: 319 IKGGVN----------------RILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLG 362 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + V + Sbjct: 363 VGRDNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLNS 399 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/272 (13%), Positives = 78/272 (28%), Gaps = 72/272 (26%) Query: 173 IVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +V+F + + E I L VDLS SM+ +++ Sbjct: 58 LVAFAGPSIGTSVKEIKEEGKDIFLAVDLSQSMNATDIG---------------PSRLQR 102 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-IL 289 +K L S +GLI +++ ++ ++ Y+ L Sbjct: 103 IKFELKELTKSFPS-------DRIGLIIFSSEAFMQCPLTFDQSVLQLYIDGLNTGLVPN 155 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD ++ A +D+ + K +I ++DGEN Sbjct: 156 FGTDLNAPLRIALDRFQNDESQEVK---------------SKSVILISDGEN----FGDE 196 Query: 350 TIKICDKAKENFIKIVTISINASP------------------------NGQRLLKTCVSS 385 I + K +K+ + I + + L + + Sbjct: 197 LENIGSELKNLGVKVFALGIGTESGSTIPRGNGIVMDPQTGEPAQTVLDKRPLQQIAAET 256 Query: 386 PEYHYNVVNA----DSLIHVFQNISQLMVHRK 413 ++ + + LI + + + + Sbjct: 257 DGQYFEISDEVQEVADLIKRLERLEGGITGSR 288 >gi|219848048|ref|YP_002462481.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542307|gb|ACL24045.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 446 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 27/277 (9%), Positives = 69/277 (24%), Gaps = 47/277 (16%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + + ++ + Sbjct: 30 STQGDRTLPLHLVVVADASRSMRIPIVDEHRF----------------RELVRNGGAHEV 73 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + + AL ++ + M LI + Sbjct: 74 LVDGVPVWQLANPLSSEARSQFSSPIDYTVRALHSVVERLTPDD------RMALIACASD 127 Query: 263 VEKNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + G + + R + + T+ ++ A Sbjct: 128 ALVLAPSTPGHRRTDLIGAIARLPVLRLGESTNLAQGLQLALAQFV-------------- 173 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + P + ++ LTDG + + +A + I I TI + + L + Sbjct: 174 ---VTDEPAVRRVVLLTDGFTTDT---TMCTALAREAADRSITISTIGLGNTFEETLLTQ 227 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 S V A + + IS + H + + I Sbjct: 228 IADLSGGRASFVQEAGHIPTI---ISAELEHARQTTI 261 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/247 (12%), Positives = 81/247 (32%), Gaps = 73/247 (29%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D ++ K + ++S+ Sbjct: 91 DIVFAVDVSKSM---------------LAEDIAPNRLDKSKQLVTQIINSLAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ D L + T + A++ A +++ Sbjct: 129 DRVGIIAYAGKAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAISEAIELAKTYYDDEEQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + +I ++DGE++ ++ + I ++A E I+I+T+ + Sbjct: 189 TN------------------RVLIIISDGEDHGGEA----VDIAEEANEEGIRILTVGVG 226 Query: 371 A-------------------SP---------NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + + + N N + ++ Sbjct: 227 DVKGGPIPIKRNGVVLNYKKDNKGETVITRLDETTLKEIAEETNGIYINGSNTNEVVDAI 286 Query: 403 QNISQLM 409 +++ M Sbjct: 287 KDVLDKM 293 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/246 (12%), Positives = 70/246 (28%), Gaps = 74/246 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D ++ K + ++ + Sbjct: 91 DIVFALDVSKSM---------------LAEDIAPNRLEKSKRIITEIVNKLTA------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+GY + + + ++ D + + T + A+ A Sbjct: 129 DRVGLVGYAGSAFPQVPITTDYASTKTFLQSMNTDMVSSQGTAISQAIDLAKSYYND--- 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K +I L++GE++ + N + + A IKI TI + Sbjct: 186 ---------------DDQTNKVLIILSEGEDH----DSNVESMAETAAAEGIKIYTIGVG 226 Query: 371 AS----------------------------PNGQRLLKTCVSSPEYHYNVVNAD-SLIHV 401 + + L K + + N ++ + Sbjct: 227 TERGDPIPIKKEGRIQSYLKDDNGEIVITKRDTETLQKIAEIGNGAYIDGTNTSNAVDDI 286 Query: 402 FQNISQ 407 + + + Sbjct: 287 LKELQK 292 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 76/238 (31%), Gaps = 37/238 (15%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + E + ++D SGSM+ N D ++ +D Sbjct: 274 ADASVNEAATTRVAFLIDNSGSMYPKELCPTSSEN------DVDFKRLDF----TQSLID 323 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMK 299 D D +G+ +T + + + + R + T + A+K Sbjct: 324 KFDS------DFRIGISKFTGTYTRMCGFTDDRTALSDVIKRIRTEDEIFDGTHNQTALK 377 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + I+ L+DGE++ + + A E Sbjct: 378 ----------------RCIEEFTATGDGKYVNIIVMLSDGESDETNAESI-KNLARLANE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + ++T+ + + L + S+ +Y+ A++L V++ I + Y ++ Sbjct: 421 KSVIVLTVGLGREIDRAWLQEMAYSTGGKYYSASEANALDDVYKQI---VTTLNYDIV 475 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 11/107 (10%), Positives = 33/107 (30%), Gaps = 3/107 (2%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + ++F + + S + + + K +++A +A+ L A ++ Sbjct: 19 RPYRKQGGAILVMFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKELQDGA 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 S R + + + + + + LS Sbjct: 79 SLYEAREAATTLLLQNL---QYQENGELNSSIDLSAPDYNSTQVAAN 122 >gi|328953619|ref|YP_004370953.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] gi|328453943|gb|AEB09772.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] Length = 333 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 41/138 (29%), Gaps = 10/138 (7%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K + +I FAL++ + + K ++ +A LAGA+ +V Sbjct: 11 KLVRDASGAVAISFALTLAILCGFVALAFDIGHLVMVKAELQRTADAGALAGATGLVPYT 70 Query: 73 S----------RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + +++ N+ + + + + + V Sbjct: 71 GPVTSQTPDWLQGQSKAQTLINNTANLADNQVFTTTDSAVTYGYWLLNPPEGYVQSLPTV 130 Query: 123 NSSRISMTHMANNRLDSS 140 + + R+ S Sbjct: 131 RPTTSAYLPQPAIRVTLS 148 >gi|189066649|dbj|BAG36196.1| unnamed protein product [Homo sapiens] Length = 957 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|119624855|gb|EAX04450.1| collagen, type XXI, alpha 1, isoform CRA_c [Homo sapiens] Length = 552 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|119624854|gb|EAX04449.1| collagen, type XXI, alpha 1, isoform CRA_b [Homo sapiens] Length = 567 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|119624853|gb|EAX04448.1| collagen, type XXI, alpha 1, isoform CRA_a [Homo sapiens] Length = 429 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|114607957|ref|XP_001157649.1| PREDICTED: collagen alpha-1(XXI) chain isoform 3 [Pan troglodytes] Length = 957 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|114607959|ref|XP_001157475.1| PREDICTED: collagen, type XXI, alpha 1 isoform 1 [Pan troglodytes] gi|114607961|ref|XP_518554.2| PREDICTED: collagen, type XXI, alpha 1 isoform 4 [Pan troglodytes] gi|114607963|ref|XP_001157591.1| PREDICTED: collagen, type XXI, alpha 1 isoform 2 [Pan troglodytes] Length = 954 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|18780273|ref|NP_110447.2| collagen alpha-1(XXI) chain precursor [Homo sapiens] gi|74752071|sp|Q96P44|COLA1_HUMAN RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|15593270|gb|AAL02227.1|AF414088_1 collagen XXI [Homo sapiens] gi|19310967|gb|AAL86699.1|AF438327_1 alpha 1 type XXI collagen precursor [Homo sapiens] gi|55665071|emb|CAH73913.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202573|emb|CAI22496.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202937|emb|CAI22395.1| collagen type XXI alpha 1 [Homo sapiens] gi|116496597|gb|AAI26109.1| Collagen, type XXI, alpha 1 [Homo sapiens] gi|215434893|gb|ACJ66843.1| alpha 1 type XXI collagen precursor [Homo sapiens] Length = 957 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|17974510|gb|AAL50033.1|AF330693_1 alpha 1 chain-like collagen COLA1L precursor [Homo sapiens] gi|55665070|emb|CAH73912.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202574|emb|CAI22497.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202938|emb|CAI22396.1| collagen type XXI alpha 1 [Homo sapiens] Length = 954 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAAYQIANPGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFVKGGVN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|281348096|gb|EFB23680.1| hypothetical protein PANDA_015036 [Ailuropoda melanoleuca] Length = 3047 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 138 NNFKYILDFIGALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 197 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ F K I +TDG++ + Sbjct: 198 GGNTMTGDAIDYLIKNTFTESAG-------------ARAGFPKVAIIITDGKSQD----- 239 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 240 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 297 Query: 407 QLMVH 411 + Sbjct: 298 SQVCS 302 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 52/173 (30%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 I V + L+ Y+ T KV + T++ AM Sbjct: 451 IKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDIIEAINTFPYRGGSTNTGKAMTY 509 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 510 VREKIF-------------VPSKGSRGNVPKVMILITDG-----KSSDAFRDPAIKLRNS 551 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 552 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 598 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1200 RTVRSFISRIVEIFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1259 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1260 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1301 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + LK + P+ + YNV + +SL + +++ + Sbjct: 1302 APSKKLKDEGVELFAIGI--KNADEDELKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1359 Query: 410 VH 411 + Sbjct: 1360 CN 1361 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 2323 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 2382 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2383 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2425 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2426 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2478 Query: 406 SQLMVH 411 ++ Sbjct: 2479 EDNLIT 2484 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 74/261 (28%), Gaps = 75/261 (28%) Query: 171 QKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 +++ + + +M I +D+S SM +D ++ Sbjct: 68 SLVIALVNPKMGTKMETVKREGVDIVFAIDVSKSMDA---------------EDIAPNRL 112 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSL 287 K + L S+ +G+I Y + + + ++ D + Sbjct: 113 EKSKQLVSQILSSLGS-------DRVGIIAYAGGAYPQLPITTDFSAAKMFLQALNTDMI 165 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ A D++ + + + ++DGE++ Sbjct: 166 SSQGTAISDAIELATTYYDDDQQTN------------------RVLFIISDGEDHEGNV- 206 Query: 348 VNTIKICDKAKENFIKIVTISINASP----------------------------NGQRLL 379 I ++A E I+I TI + N L Sbjct: 207 ---EDIAEQAAEKGIRIFTIGVGTEKGGPIPIKRNGVVQNYKKDRQGETVITKLNPATLQ 263 Query: 380 KTCVSSPEYHYNVVNADSLIH 400 + ++ + ++ Sbjct: 264 EIANATDGSYIEGNVTSEVVD 284 >gi|118349478|ref|XP_001008020.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289787|gb|EAR87775.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 642 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 66/180 (36%), Gaps = 22/180 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +KN LL L+ ++ + L+ + + EK +Q + R +++ Sbjct: 214 KILNVKNTLLYLLEMLNSND------RLSLVLSNNNPTTLFDLKYLDEKNKQDLKRIINN 267 Query: 287 LI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T+ T +M +A+ IL + ++ ++ F + S ++ +++ Sbjct: 268 ISITQNTNITKSMIKAFNILQFRQSQNKVSSIFLLSDGVDSSAEKQIQNYISSQ------ 321 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I + + + + K C Y + N + + F ++ Sbjct: 322 ---------QSLQNKNFAIHSFGYGFDQDAEMINKICSLKNGNFYYIQNMNQVDQYFADV 372 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 62/177 (35%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + ++ ++ V G + Y+ E + + + T + Sbjct: 860 INLTIHLVEKADVGRDRVQFGALMYSNNPEILFYLNTYSSRSAITEHLKRPRDTRGDTYT 919 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A F + ++ +I +TDGE+++ +T Sbjct: 920 AKALQHA------------NILFMEEHGSRLKQNVRQLMIVITDGESHDRDKLNDT---A 964 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + +++ + Sbjct: 965 RELRDKGITIFAVGVGR--ANQDELETMAGKKENTIHVDNFDKLRDIYLPLQEILCN 1019 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 59/172 (34%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D+ +V +G++ Y+ R E S T+ + + T + A+ Sbjct: 495 VDMFPIGPNNVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKAL--- 551 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G ++I LTDG++N+ ++ ++ + Sbjct: 552 ----------DFILPIIKKGKSERIHEVPCYLIVLTDGKSNDS-----VLEPANRLRAEH 596 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I + I + L+ E N DSL ++ I + K Sbjct: 597 ITIHAVGIG--EANKTQLQQIAGKDERVSFGQNFDSLKYIKNEIVHRICSDK 646 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 58/168 (34%), Gaps = 22/168 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + + +G++ ++ ++ + + + + Y D S I + T Sbjct: 673 MKTFMKNLVGKIEIGADRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTL 732 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + K +KF+I LTDGE + Sbjct: 733 TGSALTFVNEYFDIS--------------KGGRPQVRKFLILLTDGEAQDEVGGPAM--- 775 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 776 --ALRSKSVTIFSVGV-YGAN-RTQLEEISGEGSLVFHVENFDHLKTI 819 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 62/177 (35%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + ++ ++ V G + Y+ E + + + T + Sbjct: 860 INLTIHLVEKADVGRDRVQFGALMYSNNPEILFYLNTYSSRSAITEHLKRPRDTRGDTYT 919 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A F + ++ +I +TDGE+++ +T Sbjct: 920 AKALQHA------------NILFMEEHGSRLKQNVRQLMIVITDGESHDRDKLNDT---A 964 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ I I + + Q L+T E +V N D L ++ + +++ + Sbjct: 965 RELRDKGITIFAVGVGR--ANQDELETMAGKKENTIHVDNFDKLRDIYLPLQEILCN 1019 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 59/172 (34%), Gaps = 20/172 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D+ +V +G++ Y+ R E S T+ + + T + A+ Sbjct: 495 VDMFPIGPNNVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKAL--- 551 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F ++G ++I LTDG++N+ ++ ++ + Sbjct: 552 ----------DFILPIIKKGKSERIHEVPCYLIVLTDGKSNDS-----VLEPANRLRAEH 596 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I I + I + L+ E N DSL ++ I + K Sbjct: 597 ITIHAVGIG--EANKTQLQQIAGKDERVSFGQNFDSLKYIKNEIVHRICSDK 646 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 58/168 (34%), Gaps = 22/168 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + + +G++ ++ ++ + + + + Y D S I + T Sbjct: 673 MKTFMKNLVGKIEIGADRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTL 732 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + K +KF+I LTDGE + Sbjct: 733 TGSALTFVNEYFDIS--------------KGGRPQVRKFLILLTDGEAQDEVGGPAM--- 775 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I ++ + N + L+ ++V N D L + Sbjct: 776 --ALRSKSVTIFSVGV-YGAN-RTQLEEISGEGSLVFHVENFDHLKTI 819 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 39/357 (10%), Positives = 96/357 (26%), Gaps = 56/357 (15%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY-SAVFYNTEIQNIV 122 GA +V + S++ + + IK + + + SA+ + Sbjct: 1824 GARVVVVSYDSDISYLIRGSDYHSKKHLLQLLSQIKYQVPQKIRDIGSAMRFVARNVFKR 1883 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S + +A + + + + L + + Sbjct: 1884 TSLGTNTRRVAVFFSNGQAASRASILTATMEFS-ALDISLAVFAHNERVFLDEAFGQCET 1942 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP-----------------------I 219 R + + + N Sbjct: 1943 AVDPVRSWQHFPPPIQYMLHEQLVSWTPVTNQNKCFPNTCAEEPFFPENSYMDVAFLLDN 2002 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSH---VKEDVYMGLIGYT--------TRVEKNIE 268 ++ A+K + +D + S+ + + L+ Y+ V+ Sbjct: 2003 SRNIASDDFQAVKGLVSSVIDGFHITSNPSTSESGDRVALLSYSPSENSRRKGSVKTEFG 2062 Query: 269 PS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + +E + + + A++ A + L + P+ Sbjct: 2063 FTTFDSESIMKNYIHTSLQQLNGDAAIGLALEWAMEGLF---------------LGTPNP 2107 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 K II ++ GEN+ K V T+ + +AK I +S+ + + ++ S Sbjct: 2108 RKHKVIIVISAGENHEEKEFVKTVAL--RAKCQGYIIFVVSLGS--TRRDEMEELAS 2160 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 71/223 (31%), Gaps = 35/223 (15%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + LV+D SGSM+ + DP +KM ALK A +F+D +DL + + Sbjct: 419 NTDVVLVMDRSGSMNLSSAPDP------------SVSKMDALKYAANVFMDFLDLDAGHR 466 Query: 250 EDV--YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + ++ + + T T+ + + LT+ Sbjct: 467 AGLVQFHEVVVPFSPAFNLQPV--NAASLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTT 524 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ ++ LTDG +N T + + ++ Sbjct: 525 AV----------------DPSDRQIMLLLTDGLHNRPVGTSVTDITAPLLASE-VTLYSV 567 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLI--HVFQNISQL 408 S N L +S+ H + L F +I+ Sbjct: 568 GFGTSTNEAELTPLALSTGGVHLENKDVSDLQLRKHFLSIAAS 610 >gi|327310820|ref|YP_004337717.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] gi|326947299|gb|AEA12405.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] Length = 509 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 64/196 (32%), Gaps = 17/196 (8%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+A A ++S+ + + + +V+ + P+ EK R+ + + Sbjct: 55 FDKLAHAIEAAKGLVESMAPEDI------VAVYVFDEKVKALVPPTP-VEKARKMLGKIE 107 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-N 343 T A+ Q L R + K I+ +TDGE Sbjct: 108 KIKPGTYTLLYQALLQVIDDL-----RGIKRGLPLMPRRAVPENIPKRIVVITDGEPWPY 162 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + A I I I I N + L SS Y++ + + Q Sbjct: 163 YTEERWYEHLGKAAARYGITISAIGIGDDYNEKILYALANSSGGAWYHI---SQIRDISQ 219 Query: 404 NISQLMVHRKYSVILK 419 ++ + K +V+ + Sbjct: 220 VLANELRRAK-TVVAR 234 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 76/232 (32%), Gaps = 28/232 (12%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD-SIDLLSHVKED 251 I LV+D SGSM + ++ + K+ + F+ + Sbjct: 103 IYLVLDQSGSMAQKIPLSSDEKGRISVP------KIDFIVEITKDFVKGDPTKGLKGLHN 156 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--DMDSLILKPTDSTPAMKQAYQILTSDK 309 +GLIG+ + + + + + + + T A+ + ++ S K Sbjct: 157 DLVGLIGFARTAQVLSPLTLDHQAIIDQLNKFSIVKHQDEDGTSIGYAIFKTANLIASTK 216 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-------NFKSNVNTIKICDKAKENFI 362 + P ++ +TDG + + ++ AKE + Sbjct: 217 HFA-----EELKEASPYTIKNSIMLIVTDGFQDPNPLDREDQYRSIELEDAAKYAKEQGV 271 Query: 363 KIVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ I++ + + K + Y + + + L +++ +I + Sbjct: 272 RVYIINVEPRIASEEFGSERRVMQKVTEITGGKFYLLDHIEELKNIYADIDK 323 >gi|119569133|gb|EAW48748.1| collagen, type XII, alpha 1, isoform CRA_a [Homo sapiens] Length = 821 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNVGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 56/185 (30%), Gaps = 18/185 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + ++ + V +GL+ Y+ + K Sbjct: 45 LDNFQQIRQFLSSLVENFEVAPDKVRIGLVQYSDTPRTEFSLNTYQNKEEILDYIRNLRY 104 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + ++ +F + + I +TDG++ + Sbjct: 105 KTGGTHTGQGLEF-----------ILKQHFIEEAGSRAQQNVPQIAIVITDGDSQDEVDL 153 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNIS 406 + ++ IKI I I + + L + + + Y+V + +L + Q++ Sbjct: 154 Q-----AQELRQRGIKIFAIGI-KDADVRLLRQIANEPYDQYVYSVSDFAALQGISQSVV 207 Query: 407 QLMVH 411 + + Sbjct: 208 RELCT 212 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 74/224 (33%), Gaps = 23/224 (10%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 V +S + +D + + +++ ++I L K+ + + + Sbjct: 599 VAVSKDCEFDVVADIAFIVDQSSSIKSRN--FQLVRD---FLENTIGRLDVGKDKIQIAV 653 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 I Y+ ++ + + K + T + A++ A + + + + Sbjct: 654 ILYSDFPRADVYLNTFSNKNDILRYINTLPYGRGKTYTGAALRFAKEHVFTKARG----- 708 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 Q+ + +TDG++ + ++ + + + I + I Sbjct: 709 ------SRRDKYVQQVAVVITDGKSTDDAASAAAE-----LRRSGVSIFALGI-KDTKED 756 Query: 377 RLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKYSVILK 419 L + P+ NV N D L + +++ + + I+ Sbjct: 757 DLREIASYPPKKFVLNVENFDQLNSLAGILTKTLCNEISDAIIP 800 >gi|194216197|ref|XP_001914777.1| PREDICTED: collagen, type XII, alpha 1 [Equus caballus] Length = 3120 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDPRTEFNLNQYYQRDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + T G ++ K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLIK----------NTFMESAGARVGFP---KVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRSIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/255 (13%), Positives = 79/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + + ++ P+ ++E + + C+ D + + Sbjct: 388 MLSVRDLSADTEYQISV-SAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHKDKQSLLEAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ + I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAVGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 72/233 (30%), Gaps = 49/233 (21%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 +R + L++D SGSM +A +K A +D + Sbjct: 33 QQQRAPLNLCLILDHSGSMAW--------------------QPLAMVKQAAASLVDRLLP 72 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +I + + + + W E ++ + T MK Sbjct: 73 SD------RLSVIAFDHKAKVLVPNQTVWDKEAIKAQIA---TLEPGGGTAIDEGMKLGL 123 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + K+ + I LTDGE N N +++ A E I Sbjct: 124 KEIAAGKQGTISQ-----------------IFLLTDGE-NEHGDNQRCLELAKLAAEYNI 165 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + N L + ++ + A+ I FQ++ + Y+ Sbjct: 166 TLNALGFGVHWNQDVLEQIADAAGGRLVFIEYAEQAIACFQSLFSHISSVDYT 218 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 57/180 (31%), Gaps = 20/180 (11%) Query: 233 NALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + F+ D +D ++ V +G++ Y+ ++ + + + Sbjct: 1228 STMKTFMKDVVDSFDISRDKVRLGVVQYSQEPQREFYLNEFYSDTIIKEQINRIEQLRSS 1287 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + ++ + F + + ++ +TDG + + + Sbjct: 1288 TFTGKGLRF------------VQSLFESANGGRKNQGVSQNLVVITDGYSADSVDD---- 1331 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + N I + + + N LL+ + V N ++L + I + Sbjct: 1332 -AAMALRSNGIHVFAVGVGI-VNSFELLRI-AGDARRVFTVENFNALKTIKSTIINEICE 1388 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 67/227 (29%), Gaps = 26/227 (11%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + S C + + + +K + + L Sbjct: 624 EIVTKTFVVPLQSRSLKKGCMDTEEADIYFLIDGSGSIYPSDFKDMKTFMNEVIRIFQLG 683 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +V G++ Y + + I ++ +R + + I T + A++ + Sbjct: 684 A---NNVRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNALRSMKSLF 740 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +I +TDG KS + ++ I I Sbjct: 741 Q----------------MAYRENVPQILIVITDG-----KSEDKVNQAARDLRQQGIVIY 779 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I I Q+ L+ + + V + DSL H+ I Q + Sbjct: 780 AIGIKD--AVQQELEEIAETKNRMFFVNDFDSLKHIKHEIVQEVCST 824 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 64/181 (35%), Gaps = 14/181 (7%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 KN L L ++ K++V G++ Y++ E + K + + Sbjct: 1033 FKNMLQLVEAVVNNSVVGKDNVQFGVLVYSSNPEVQFSLNSYASKSQIREAVFSLKPLSG 1092 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + A+ A Q G + SL + ++ +TD E N Sbjct: 1093 QPFTARALSFARQTF-----------GVNYGGRASSLAVARILVLITD-EPTVPSDRDNL 1140 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 KE+ I + I++ S + L+ + + + D+L ++ +N++Q + Sbjct: 1141 PMAIRALKEDKIVL--IAVGVSKASREELEEITEDQKRLFFAQSYDALENMHENLTQTVC 1198 Query: 411 H 411 Sbjct: 1199 E 1199 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 60/224 (26%), Gaps = 43/224 (19%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + + +VD S S+ +KN L Sbjct: 33 DAQQTACSKATVADVVFIVDTSTSIAQEN--------------------FQKVKNFLSSL 72 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + S+ + + +GL Y+ + + K T + A+ Sbjct: 73 VSSL---DIGLDMIRVGLAQYSDEAYQVFLLNQYLLKSDVLDQIGNLPYRGGETYTGRAL 129 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 F + + +T GE+N+ + K + Sbjct: 130 DF-----------VSTRYFTESAGSRAKGYVPQLAVLITSGESND-----EVEQPAKKLR 173 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 I I + I + L+ S P Y Y++ + D L Sbjct: 174 YRGISIYVVGIGIQNTTE--LQQIASKPFRRYLYSIGSFDDLPD 215 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/321 (10%), Positives = 88/321 (27%), Gaps = 57/321 (17%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 A R I ++ + + + I + + + + Sbjct: 743 AYRENVPQILIVITDGKSEDKVNQAARDLRQQGIVIYAIGIKDA--VQQELEEIAETKNR 800 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + F + + + + I +VD S S+ A Sbjct: 801 MFFVNDF------DSLKHIKHEIVQEVCSTNVCKNVRADIVFLVDSSNSIRAA------- 847 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + +K+ + F+ + ++V +GLI +++ + + + + Sbjct: 848 -------------EFQKIKDFMQSFVIKV---DVGLDNVRIGLIQFSSEIREEFQLDRYS 891 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 I T + A+ A K ++++ Sbjct: 892 TIADVQRAIQEMQQIKLGTLTGKALTFAASYFD--------------RPKGGRPELKQYL 937 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 I +TDGE + + ++ I I I + N +L++ + + + Sbjct: 938 IVITDGEAQDSVKSP-----ARAIRDKGITIYAIDM-LQANNSQLVEITGAQDKVFFESE 991 Query: 394 -NADSLIHVFQNISQLMVHRK 413 N + I + + + Sbjct: 992 MNFSE-----KQILFEICNLQ 1007 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 57/180 (31%), Gaps = 17/180 (9%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L ++ L +V +GL+ Y+ +K+ + + T Sbjct: 464 FLLKIVNALDIAPSNVRVGLVLYSNEPRLEFTLDTFKDKLE-ILNYLKNLPYRGGQAYT- 521 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A + L F ++ Q+ + +TDG++ + I+ K Sbjct: 522 --GIAIEFL-------RNKVFTQEAGSRKKQGVQQIAVVITDGQSLDDY-----IEPASK 567 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + + + G +L K P + + L ++ I + + + + Sbjct: 568 LRRESVTVYAVGVKNITEGSKLDKIATYPPRNHVTTLKYFLQLSNIRWKIKKQLCNEIVT 627 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 62/201 (30%), Gaps = 26/201 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K+ LKNAL + +D + VY G + V + T + Sbjct: 260 VSGSMSSQDKLPLLKNALKMLSQQLDEGDRISIVVYAGA----SGVVLDGVKGNDTLAIS 315 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q + + T+ ++ AYQ+ +I T Sbjct: 316 QALDKLK---AGGSTNGGAGIELAYQLAQKHFIAGGVN----------------RVILAT 356 Query: 338 DGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ N S+ + ++ ++ I + T+ L++ HY ++ Sbjct: 357 DGDFNVGVSDQQALEDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDT- 415 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L + + + ++ Sbjct: 416 -LNEARKVLVDEISATLLTIA 435 >gi|281345873|gb|EFB21457.1| hypothetical protein PANDA_003362 [Ailuropoda melanoleuca] Length = 399 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 56/164 (34%), Gaps = 21/164 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ +++ +GL+ +++ ++ + KV + + T + A+ Sbjct: 249 VNQADISTDEIQIGLLQFSSTPQEEFRLDQYSSKVDIHRAITNVQQMNDGTRTGKALNFT 308 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S + Q+++I +TDG + ++ Sbjct: 309 RPFFDSSRGG--------------RPNVQQYLIVITDGVAQDDVVMP-----AKALRDRN 349 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I I I + + N Q L P+ Y N +SL ++ + I Sbjct: 350 IVIFAIGVGEAKNAQLL--QITDDPQKVYYEENFESLQNLEKKI 391 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 62/204 (30%), Gaps = 23/204 (11%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYV 280 + + + + ++ L+ + V +GL+ Y+ + +Y+ Sbjct: 11 FSWDRQSNFQQVVNFLKSTVSSLNVHPDGVRIGLVFYSEEPRLEFSLDAFQNPASILEYL 70 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R T + A+ F + Q+ + +T+G Sbjct: 71 DRLTYRRRSGRTKTGAALDF-----------LRNEVFIEERGSRSKHGVQQMAVVITEGF 119 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + + S K + + I + + + + L+ S P ++ ++ + L Sbjct: 120 SQDQLS-----KSASLLRRAGVTIYAVGTHL-ASESKDLENIASYPPWKHVISLESFLQL 173 Query: 399 IHVFQNISQLMV--HRKYSVILKG 420 V I + SV + G Sbjct: 174 SVVGNKIKNQLCPETLDRSVSISG 197 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 72/223 (32%), Gaps = 37/223 (16%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM + + + K +++ LK+A+ + ++ Sbjct: 45 PIDVVEVLDVSGSMGDPAMASSDFKKN------KPPSRLDVLKDAMKFIIRKLEDGD--- 95 Query: 250 EDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + R K R + T PA+++A ++L Sbjct: 96 ---RLSIVAFNDRPVKEYSTGLLDISGNGRRIAEKKVDWLEGRGGTALMPALEEAIRVLD 152 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S F I+ LTDG++ + + K + T Sbjct: 153 CRPGDSRNRVGF--------------ILLLTDGDDTSGFRWSRDVINGAVGK---YPVHT 195 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + A+ + + LL S + V + + I+ + Sbjct: 196 FGLGAAHSSEALLYIAQESRGTYSFVDDEN-----MDKIAGAL 233 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 53/182 (29%), Gaps = 25/182 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K A + ++ + + ++ + + + + + + Sbjct: 59 LETVKQAAVQLVEKLKEGD------RLSVVAFDHQAQVIVP-NQMINDSASIKGKINKLR 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K + L +K S F LTDGEN + ++ Sbjct: 112 ASGGTAIDKGLKLGIEELNKGRKESISQAF-----------------ILTDGENEHGDND 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + A + I + ++ N L K + + + I F + Sbjct: 155 LCLK-LAKLATDYNITLNSLGFGDDWNQDVLEKIADAGGGNLSYIQQPEQAIEEFSKLFN 213 Query: 408 LM 409 + Sbjct: 214 RI 215 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 61/208 (29%), Gaps = 32/208 (15%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G + V + K+A LK A+L + + + + ++ + Sbjct: 51 TGGDSSNDRPGLDLVAVLDVSGSMAGEKIAKLKMAMLFMIKKLSPID------RLSIVTF 104 Query: 260 TTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 +T + TE ++ T+ T ++ A ++L Sbjct: 105 STDSTRLCPLRQITENSQKEFENLINRLKADGWTNITAGLETALKVLNDRSFNGGRVVG- 163 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 I+ ++DGE+N + + T + R+ Sbjct: 164 --------------IMLMSDGEHNTDGDPAEVPL-------GNVPVHTFGFGRN-YEPRV 201 Query: 379 LKTCV--SSPEYHYNVVNADSLIHVFQN 404 LK S +V N ++L F Sbjct: 202 LKAVAHKSIGGTLSDVQNTNNLGKAFSQ 229 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 57/178 (32%), Gaps = 20/178 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + S + V +G + Y+ ++ + + V + T + Sbjct: 463 EFMLAVTGMFSIGPDKVQVGAVQYSDKMRVEFYINDNSNNVNLKNAILNIEQLQGNTYTG 522 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ I+ D K+ + ++I LTDG + + ++ + Sbjct: 523 EALNFTLSIIKED-------------RKLRTSQVPCYLIVLTDGRSTDD-----VLEPAE 564 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + I + I + Q L+ E N DSL + + + K Sbjct: 565 RLRAEQVTIHAVGIGEAIKVQ--LQQIAGGEERVSFGQNFDSLRSIKNEVVHRICTEK 620 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 21/160 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + ++G+I ++ + + + + + D SLI K T + A+ Sbjct: 655 LAKIQIGPDRTHIGVIQFSDKTREEFQLNKYFTQNEISDAIDRMSLIDKNTLTGNALISV 714 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K +KF+I +TDGE + + ++ Sbjct: 715 DQYFTPAKGARIG--------------IKKFLILITDGEAQDAVRDP-----AKALRDKG 755 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I ++ + N + L+ + V + D L + Sbjct: 756 VVIFSVGV-YGAN-RTQLEEISGDGNLVFQVESFDDLKAI 793 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 52/169 (30%), Gaps = 17/169 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V +G + Y+ E + + T + A+ Sbjct: 841 VKKADVGSDRVRIGALKYSDYPEILFHLGKYSNRSSVIEHLRRRRSTGGDTYTARALDHT 900 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + T + N ++ +I +TDG +++ T K + Sbjct: 901 NMMFTEEYGSRIQQN------------VKQMLIVITDGVSHDRNLLNET---ALKLRNKG 945 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I I + + Q L+ + ++V N + L ++ + + M Sbjct: 946 IDIYAVGVG--QADQLELEAMAGNKSKTFHVDNFNKLKDIYLPLQESMC 992 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 73/210 (34%), Gaps = 47/210 (22%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 ++ +V + + ++ + +D+S SM + + Sbjct: 61 MICIAIAFLVVGLVNPKMGTKLETVKREGVDVVFAIDVSKSM---------------LAE 105 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ K + ++S+ +G+I Y + + + +++ Sbjct: 106 DIAPNRIEKSKQLVTQIINSLGS-------DRIGIIAYAGSAYPQLPITTDYSSAKLFLS 158 Query: 282 RDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + D L + T A++ A +++ + + + ++DGE Sbjct: 159 QMNTDMLSSQGTAIGEAIELAKTYYNDEEQTN------------------RVLFIISDGE 200 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN 370 ++ +++ + ++A + I+I TI + Sbjct: 201 DHVGEASS----LAEQANKEGIRIFTIGVG 226 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 55/186 (29%), Gaps = 38/186 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +++ A++ F+ + +GL+ + R P++ V + Sbjct: 110 SRLDAVRRVASRFVA-------ARRGDRIGLVIFGDRAYVAQPPTFDVGSVAHAIEAAQI 162 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T + + A + L K ++ L+DG + + K Sbjct: 163 GISGRSTAISDGLGLATRRLLQSDAT------------------SKVVVLLSDGVDTSGK 204 Query: 346 SNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVS----------SPEYHYNVVN 394 A + I++ TI++ Q + V + V N Sbjct: 205 V--QAGDAARLAASHGIRVHTIALGPEDLENQPESRDAVDAAALHAMAEAGGGTSFRVRN 262 Query: 395 ADSLIH 400 + L Sbjct: 263 MEDLET 268 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 38/354 (10%), Positives = 92/354 (25%), Gaps = 54/354 (15%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--- 115 A + A R+E + + + + F N Sbjct: 101 RASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQ 160 Query: 116 ---TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 E + + I + + + Y L Sbjct: 161 LPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTL 217 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ + P + ++D SGSM ++ ++ Sbjct: 218 LKVDVQ-ARDMQTRDLPPANLVFLIDTSGSMQ-------------------PAERLPLIR 257 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +AL L ++ + ++ Y G T V T ++ + T Sbjct: 258 SALKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGST 310 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ AY+ + I+ TDG+ N ++ I+ Sbjct: 311 GGEAGLRLAYEQAEKGFIKGGVN----------------RILLTTDGDFNLGITDPKDIE 354 Query: 353 I-CDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 K +E I + T+ + + ++ + + + + V ++ Sbjct: 355 ALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 408 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/296 (12%), Positives = 83/296 (28%), Gaps = 75/296 (25%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + L + C +N Sbjct: 34 SQPVNLLLVLDVSGSMAWTT----------------------------------DACRLN 59 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-----TRV 263 + S + +++++ +K AL LFLD + ++V +G++ Y+ + Sbjct: 60 RWGQPYPSCYP-GNGEKSRLDIMKEALELFLDDL------PDNVKVGILTYSAGNNIDLL 112 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL---------------TSD 308 + + S K +T T + A+ +A ++ Sbjct: 113 HEVKQLSDNNHKA-TLLTTIDGLEANGGTLTAGALYEAGSYFRGQYDNLPSPITPGCSNA 171 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS----------NVNTIKICDKAK 358 F T+ + ++ II +T ++T+ + Sbjct: 172 SNIVFLTDGQPNSMSYNGYSYRNSIINMTGSSCARSDDGKECSEKLAGFLSTVDQIEDLT 231 Query: 359 ENFIKIVTISINA-SPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQ-NISQLMVH 411 + +K TI+ N + L+ + Y + D L+ F+ +I + Sbjct: 232 PSKVKTHTIAFALEDNNARTFLENVADAGNGQSYTADSTDGLVDAFKSSIQTDIEQ 287 >gi|317505805|ref|ZP_07963650.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316255887|gb|EFV15112.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 343 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 71/212 (33%), Gaps = 39/212 (18%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D K +++ A + A + F+D + V +G++ + + + PS E ++ + Sbjct: 113 DVKPSRVEAARAAAIKFVDGMAPT------VQLGVVTFAGNAQPLVRPSTDHETAKKVID 166 Query: 282 RDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + K T + + A Q + + + I+ ++DG+ Sbjct: 167 QMIRPDKLEKQTATGEGIYTALQQIETIAGALGGK----------NHAPPARIVLVSDGK 216 Query: 341 NNNFKSNVNTI---KICDKAKENFIKIVTISIN--------------ASPNGQRLLKTC- 382 AKE + + T++ + + L K Sbjct: 217 ETVPDDLNAPRGAYAAARTAKEKHVPVCTVAFGTKAGKITLDNQVDEVPVDLESLKKISD 276 Query: 383 -VSSPEY---HYNVVNADSLIHVFQNISQLMV 410 +SP + + L ++Q++++ + Sbjct: 277 LSNSPGNSCRFFPAESQGELAQIYQSLNEDIG 308 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 38/354 (10%), Positives = 92/354 (25%), Gaps = 54/354 (15%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--- 115 A + A R+E + + + + F N Sbjct: 101 RASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQ 160 Query: 116 ---TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 E + + I + + + Y L Sbjct: 161 LPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTL 217 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ + P + ++D SGSM ++ ++ Sbjct: 218 LKVDVQ-ARDMQTRDLPPANLVFLIDTSGSMQ-------------------PAERLPLIR 257 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +AL L ++ + ++ Y G T V T ++ + T Sbjct: 258 SALKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGST 310 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ AY+ + I+ TDG+ N ++ I+ Sbjct: 311 GGEAGLRLAYEQAEKGFIKGGVN----------------RILLTTDGDFNLGITDPKDIE 354 Query: 353 I-CDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 K +E I + T+ + + ++ + + + + V ++ Sbjct: 355 ALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 408 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/281 (9%), Positives = 68/281 (24%), Gaps = 12/281 (4%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ +++ LL++G I + K+ + +A + A+ R Sbjct: 13 RGLVVLMSVIALPFILLVVGLSIDAGRAYIVKSKLFAA--VDAASIAAARAVANGEDAGR 70 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + A D N + + S + +I + + + + + Sbjct: 71 AAAQKYFAANIPADFYSA-TPNLGDVNFAYDSFGNISIDISATAQVPTVFLPLIGLDTFN 129 Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR------IEMGERPIFL 192 + + L I + Sbjct: 130 PGVSAQSIRR--PVDLVLVIDNTTSLRLGSIGDVTQDVIDRSKSFIENFHEGFDRISLVK 187 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 ++ + + S + T A NA + + L V + Sbjct: 188 FAFGSEVPVGFNATRGHSRSTIKSEIDSFNFGSTSNAQYTNASEGMYRAFNELRTVTDPA 247 Query: 253 YMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + ++ +T ++ E + S T Sbjct: 248 NLKVIVFFTDGAPNTFASTFDFEGGGTHTGAIRSSDGTSGT 288 >gi|73973308|ref|XP_539002.2| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 1 [Canis familiaris] Length = 3065 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + Sbjct: 154 NNFKYILDFIGALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + I + ++ P+ ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISI-SAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRGNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 2339 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGI--KNADEDELKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|118347390|ref|XP_001007172.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288939|gb|EAR86927.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 669 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/255 (10%), Positives = 70/255 (27%), Gaps = 29/255 (11%) Query: 164 LLNQRYNQKIVSFIPALLRIEM-GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + +S +R+ + + + +M S ++ Sbjct: 1 MEKNTKLDLQLSQTKNYVRVSVIPPDDLERHPCNIVCLVDGSLSMGSKLVIHQKNGGKKE 60 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVT 281 T + +K+ + S++ + L+G++T + E + ++ + T Sbjct: 61 SDMTTLDLVKHTVKTIASSLNPQD------RLALVGFSTHSKIYFELTEMDDQGKNVAFT 114 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T+ ++ + +++ I TDG Sbjct: 115 EIDKMWAGGQTNIWGGLQDSLEVIKKGF----------------RPNQNVCIFLFTDGRP 158 Query: 342 NNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + + + KE I T + +L+ + ++ Sbjct: 159 TMIPAIGHVE-MLRRWKEQHPAIQFSIFTFGFGNDLDTDLMLELSQEQNGIFSFISDSSM 217 Query: 398 LIHVFQNISQLMVHR 412 L VF N ++ Sbjct: 218 LGTVFSNALANILST 232 >gi|225024147|ref|ZP_03713339.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] gi|224943172|gb|EEG24381.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] Length = 573 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 80/255 (31%), Gaps = 52/255 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 R K++ + P + +VD+SGSM R Sbjct: 187 FRSGAKLIRIGIQAKEVSQAALPPANLVFLVDVSGSM-------------------YSRD 227 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +K L + L+ Y + + P+ G + ++ + Sbjct: 228 KLPMVKYTLCTLAHQTRAQD------RITLVTYADGNKVVLPPTPG-NQRQKILAALDSL 280 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-K 345 T A++QAYQ R+ I+ TDG+ N Sbjct: 281 KAGGSTAGENAIQQAYQAAQRAYIRNGIN----------------RILLATDGDFNVGIT 324 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQN 404 + + +++ I + T+ + +RL++ + + + + + Sbjct: 325 DFNTLRSMVAEKRKSGISLTTLGFGSGNYNERLMEQLADAGDGNYSYIDSPEE------- 377 Query: 405 ISQLMVHRKYSVILK 419 +Q ++HR+ S L Sbjct: 378 -AQKVLHRQLSSTLA 391 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 66/228 (28%), Gaps = 42/228 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + LVVD SGSM + + +++ L+ + F+ Sbjct: 92 EPVAGRDLVLVVDTSGSMLVRDY----------RAEGRPVSRIEVLQGVVTRFVR----- 136 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E LI + + E V + R ++ T + A + L Sbjct: 137 --AREGDRFALIPMAEEAATLVPLTGDRELVASQLARLRAGMLGDDTAIGDGIALALRQL 194 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +I +DGE+N + + A+ + + Sbjct: 195 QASGAERRPA-----------------LILFSDGESNAGL--LRPSEALALARAAGVALY 235 Query: 366 TISINASPN------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T+ I L ++ H++V + L V I + Sbjct: 236 TVEITGGQALAPVEGEPSLADMAETTGGRHFHVTRSADLEAVIATIDR 283 >gi|291396486|ref|XP_002714579.1| PREDICTED: collagen, type XII, alpha 1 [Oryctolagus cuniculus] Length = 3117 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKRIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + T G ++ K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVK----------NTFLESAGARVGFP---KVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ PI ++E + + C+ D + + Sbjct: 388 TLSIRDLSADTEYQISV-SAMKGLTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLAKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ NF Q +K + +TDG++ + Sbjct: 1276 TLTGMAL-----------NFIRQQNFKTQAG--MRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + DSL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFDSLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 2339 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 76/270 (28%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 51 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 105 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ +++ + ++ Y Sbjct: 106 TSM--------------------SGERIEKAREAAILAVNMLNITD------TLSVVAYD 139 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 140 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 183 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 184 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 243 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 244 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 273 >gi|73973418|ref|XP_532180.2| PREDICTED: similar to alpha 1 type XXI collagen precursor [Canis familiaris] Length = 961 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGKNLVAAMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 34/361 (9%), Positives = 93/361 (25%), Gaps = 58/361 (16%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 +++A A + R++ ++ + + + + Sbjct: 83 RLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPIKQVAQNPLVTFSLDVDTGSYANVR 142 Query: 112 VFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 F N + + + S + + Sbjct: 143 RFLNQGLLPPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPA-------P 195 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I + E P + ++D SGSM Sbjct: 196 WNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM-------------------ISD 235 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ++++L L + + + + ++ Y + G+ K Sbjct: 236 ERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGSHKAEIN-AAIDS 288 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ AYQ + I+ TDG+ N Sbjct: 289 LDAEGSTNGGAGLEMAYQQAAKGFIKGGIN----------------RILLATDGDFNVGI 332 Query: 346 SNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQ 403 + +I+ + K +E+ + + T+ + S + ++ + + + V Sbjct: 333 DDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDTLSEAQKVLN 392 Query: 404 N 404 + Sbjct: 393 S 393 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 64/195 (32%), Gaps = 29/195 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ K A ++ + +G I + R + IE + + Sbjct: 416 SGSMSGSKLELAKEAAARSVEMLRDED------TLGFIAFDDRPWEIIE-TGPLNNKEEA 468 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ ++ +AY+ L +K II LTDG Sbjct: 469 VDTILSVTPGGGTEIYGSLAKAYENLADI------------------KLQRKHIILLTDG 510 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ + ++ K+N I + T++I + L YNV++ ++ Sbjct: 511 QSQPGNY----EDLIEQGKDNGITLSTVAIGQDADANLLEALSEMGSGRFYNVIDEQTIP 566 Query: 400 HVFQNISQLMVHRKY 414 + + ++ Sbjct: 567 SILSRETAMISRTYI 581 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 74/238 (31%), Gaps = 53/238 (22%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +R + +VD SGSMH + K+ K A+ + + ++ Sbjct: 206 VSRSQRKPAHLVFLVDTSGSMH-------------------SQDKLPLAKEAMKVAVRNL 246 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + ++ Y + + P+ T +V++ T T M+ AY Sbjct: 247 NEND------TVAIVTYAGSTQDVLPPTPAT-EVQRIHTAIDLLQSGGGTAMGSGMELAY 299 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--EN 360 + VK S +I LTDG+ N + + K Sbjct: 300 R----------------HAVKKASGNAISRVIVLTDGDANIGPNLSAESMLSGIEKYVAE 343 Query: 361 FIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN--------ISQLM 409 + + TI L++ + V + VF+ I++ + Sbjct: 344 GVTLSTIGFGMGNYRDDLMERLADKGNGNCFYVDSYQEAKKVFEAQLTGTLEVIAKDV 401 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 67/225 (29%), Gaps = 59/225 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + QD ++ K A+ +D++ +G+I + + + + Sbjct: 97 DVSNSMLSQDLSPNRLENAKRAISQLIDNLHD-------DRIGIIVFAGQAYVQLPITTD 149 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ ++ + T A+ Q K Sbjct: 150 YSAAKLFLNTINTNMVPTQGTAIGAAIDLGMQSFDFKNG------------------MSK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + + + A++ + + I + + Sbjct: 192 AMIVITDGENHEDDA----VSAANHARDKDVTVNVIGVGSEEGAPIPIIKDGKQAGFHTD 247 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVNA-DSLIHVFQNISQ 407 + + + + NA L V + + + Sbjct: 248 SAGKVVVSKLDENMGREIAAAGNGVYVRATNANSGLNIVMEQMGK 292 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 54/151 (35%), Gaps = 22/151 (14%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++G+ T+ E ++ + V + +T+ M TD A++ ++ Sbjct: 672 DRVGIVGFNTKAEIVVDITSD---VEEIITKVMSLKPGGATDIGDAIRVGTELF------ 722 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI--CDKAKENFIKIVTISI 369 + +I LTDG + + T + A + I TI I Sbjct: 723 ----------RRCGRPDRDWHMILLTDGVPTKGEPDPETKALSEATAASRMGVTISTIGI 772 Query: 370 NASPNGQRLLKTCVS-SPEYHYNVVNADSLI 399 G RL++ S +++ + + L Sbjct: 773 KLPEEGIRLIEHIAGISGGRSHHITDPEELT 803 >gi|292627386|ref|XP_001332035.3| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 1056 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 55/185 (29%), Gaps = 20/185 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A ++ +G++ Y+ I + + S + Sbjct: 118 FETAKRWLINVTSGFDVSSHYSQVGVVQYSDTPRLEIPLGQHKTTQQLIEAIEKISYLGG 177 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+K A + +R + + + +TDG++ + ++ + Sbjct: 178 NTQTGRAIKFAVDHVFPSSRR--------------NDVKNRIAVVVTDGKSQDDVTDASL 223 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLM 409 A+ I + + + + L+ + + + ++ + + Q + Sbjct: 224 -----DARTQGITVFAVGVGSEITNSELVTIANTPAGDYVLFAEDYTNIERIRDAMEQKL 278 Query: 410 VHRKY 414 Sbjct: 279 CEESV 283 >gi|219520386|gb|AAI43866.1| COL21A1 protein [Homo sapiens] Length = 957 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 55/181 (30%), Gaps = 24/181 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQDDVKD-----AAQ 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRK 413 A+++ I + I + S LK + P Y + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVG-SETEDAELKAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEES 215 Query: 414 Y 414 Sbjct: 216 V 216 >gi|118359381|ref|XP_001012930.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294697|gb|EAR92685.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 713 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 32/246 (13%), Positives = 75/246 (30%), Gaps = 39/246 (15%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 NQ + + + ++ I VVD+SGSM + NS + + Sbjct: 46 ENQVRIQILSPKGKSKVSN----SICCVVDVSGSMGSRAVTKQSGGNSELG-----YSVL 96 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSL 287 +K++L + ++D ++ ++ + T ++ V Sbjct: 97 DIVKHSLNTIVQNLDEGD------EFSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQ 150 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ ++Q + + +D ++ +I LTDG+ N Sbjct: 151 PDASTNIWAGIEQGLEQMQNDSNKNKNQQ----------------LIVLTDGQPNVNPPR 194 Query: 348 VNTIKICDKAKENFIK-----IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + I I T + L + + ++ + +F Sbjct: 195 GILTTL-NNFYNKNIISPKPSINTFGFGYYLDSHLLFNIAQDCQGIYSFIPDSSFVGTIF 253 Query: 403 -QNISQ 407 +I+ Sbjct: 254 TNSIAS 259 >gi|73973312|ref|XP_867438.1| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 3 [Canis familiaris] Length = 2989 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + Sbjct: 154 NNFKYILDFIGALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + I + ++ P+ ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISI-SAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRGNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 2339 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 1276 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGI--KNADEDELKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|149188837|ref|ZP_01867127.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] gi|148837257|gb|EDL54204.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] Length = 504 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 75/238 (31%), Gaps = 14/238 (5%) Query: 179 ALLRIEMGERPIFLIEL--VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + D ++ ++ + + + Sbjct: 260 TSGEVYDCSQDRDDCPNGRNGERRQAQRLVSVFDIDEDDNDRSYHVEVYDYVDFDLSVAE 319 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +F++ + L +N S R + D T S Sbjct: 320 MFINKFPDARTEYRLDSLELYRGYGSSNENQFYSIDLTSDRTEIDVIDDMWADGSTASFQ 379 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE---NNNFKSNVNTIKI 353 M + +Q + + K + + + K ++ L+DG NN + + Sbjct: 380 GMLRGFQHMLAGKPNTSDEDELAEYNDK-----IKMVLVLSDGVESPNNGILKGLVDAGM 434 Query: 354 CDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 CDKA+E + I I I+ + + Q + CV +P+ ++ + + I +L+ Sbjct: 435 CDKAREEIPGLYIAVIGIDFAASEQSGFQDCVLNPDE--DITDVTDTEEFIEKIEELI 490 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/252 (10%), Positives = 61/252 (24%), Gaps = 10/252 (3%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 A +++ L + F + + + N + A AA A + S + + Sbjct: 1 MAFTLIPVLGMTFFAVEGTRYIQETNRLRDAAQAAASAVTIEDQSANANEMAKDYIRDYV 60 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS-RISMTHMANNRLDSSNNTI 144 + Y E NS ++ + D + Sbjct: 61 RDINSETVVATRFYQAPDPENDVDEFIQYTVEATTNHNSWFASNLIPVFGETQDLKGVAV 120 Query: 145 F-----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 D + F + Q +I + + I + Sbjct: 121 AKKYPFNLGDKNIDIVFVSDFSGSMSWQWGGNSSDPCTATNCKIADLKVAVKEIADKLLC 180 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S + + A Q + +K+ + A++ F + L Sbjct: 181 S---DIQTDPATNEDYCADDDQPELTSKLDN-RVAVVPFNIRTRESNGSNVFTVTQLRYR 236 Query: 260 TTRVEKNIEPSW 271 E + ++ Sbjct: 237 DDVDEDDSPRTY 248 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 76/270 (28%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 88 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 142 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ +++ + ++ Y Sbjct: 143 TSM--------------------SGERIEKAREAAILAVNMLNITD------TLSVVAYD 176 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 177 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 220 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 221 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 280 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 281 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 310 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSPR-----------AGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|238915987|ref|YP_002929504.1| hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] gi|238871347|gb|ACR71057.1| Hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] Length = 745 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 49/404 (12%), Positives = 102/404 (25%), Gaps = 62/404 (15%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA + + N+ + + E K +G + ++ Sbjct: 36 ALAAENVLSDNIILVTEN-EVSDTAKDGYSKVTFKIGDHTKDVYIKNGNTVKLKSSNKDY 94 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 V + I + Y + + + F Sbjct: 95 GVIAGGIENGEAITENTVIYAKSYSYQGTLSLEKSI-----QSETPDENGYYTLQFSAKS 149 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + LV+D S SM C+++ D + + ++T+ + NA+ FL+ Sbjct: 150 QNLPSIKSENQNVILVLDRSFSMACSVDEDVDSD---AMAPTYEKTRWSVTINAVEKFLN 206 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + +I Y I +K+ + + + Sbjct: 207 EFL---PEGTSNKVSVISYCGSARTEITNESSKDKIMSKLNSIYNRNMYNEDYKNSSKRY 263 Query: 289 ---------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ +KQ +I S + + + + Sbjct: 264 NVTQIGRGLGSATNIQQGLKQVSEIAGDTTGASVILFTDGGANRYDNGEIGGYYYIKNNN 323 Query: 340 ENNNFKSNV-------NTIKICDKAKENFIKIVTISINASPNGQRLLKTCV--------- 383 E N + K ++ K I TI + + + L Sbjct: 324 ETNRYNKPRDEVNGSYYAGKAGEELKAAGADIYTIVLMSKESDITDLVKVSLGNKSLTYE 383 Query: 384 ----------SSPEY---HYNVVNADSLIHVFQNISQLMVHRKY 414 S Y Y NA+ L + F+ I M + Sbjct: 384 KSWEKTYFTFSDGGYAKEFYTAANAEQLNNRFKQIMTEMTSLPF 427 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/368 (11%), Positives = 107/368 (29%), Gaps = 45/368 (12%) Query: 60 AILAGASK---MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 A+L K + + +++N + K N + ++ Sbjct: 100 ALLQAVKKVQPLSTGTMTGLAIQYAMNNAFTESEGARIKSPGINKVAIVVTDGRPQDTVK 159 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 +I S + + + R+D + + + DY + + Q+ Sbjct: 160 DISARARESGLEIYAIGVGRVDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQEAFCV 219 Query: 177 IPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 L + + + +N D + N+ + K+ Sbjct: 220 TADLCSTGDHDCEQICVSTPGSYTCACRDGFTLNEDGKTCNACGASAVDLVFLIDGSKSV 279 Query: 235 -------LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDM 284 + F++ I D + + ++GL+ Y++ V + + K ++ V + Sbjct: 280 RPENFELVKQFINQIVDSMDVGERRAHVGLVQYSSSVRQEFPLGRYSSKKDIKSAVKKMS 339 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + A++ +F K I TDG + ++ Sbjct: 340 YMEK--GTMTGQALQY-----------LVDNSFAISSG--GRPAVPKVGIVFTDGRSQDY 384 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNADSLIHV 401 ++ +AKE K+ + + + L+ S P+ Y Sbjct: 385 IND-----AAARAKELGYKMFAVGVG--NAVEEELRMIASEPQAEHSFYTAD-----FKA 432 Query: 402 FQNISQLM 409 + I + + Sbjct: 433 MKEIGKKL 440 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 62/183 (33%), Gaps = 23/183 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + + + + A + I+ L +GL+ Y + V+ K + Q Sbjct: 45 SSRSVRPSEFEQAKVFLSQVIESLDVGANATRVGLVNYASTVKNEFSLKTHKAKPALLQA 104 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T + A++ A +G +I S K I +TDG Sbjct: 105 VKKVQPLST--GTMTGLAIQYAMN----------NAFTESEGARIKSPGINKVAIVVTDG 152 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS 397 + + I +A+E+ ++I I + + L+ S P E+ V + Sbjct: 153 RPQDTVKD-----ISARARESGLEIYAIGVGRVD--KNTLRQIASEPLDEHVDYVESYSL 205 Query: 398 LIH 400 + Sbjct: 206 IEK 208 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 74/254 (29%), Gaps = 48/254 (18%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + Q ++S +I + LV+D SGSM Sbjct: 19 SSSQRQLMLSIAATSEQIN--TNLPINLCLVLDHSGSMQ--------------------G 56 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K A L ++S+ + + +I + R + + + + ++ Sbjct: 57 KPLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQSREDDLTLIRSKIQQ 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K Q S + I LTDGEN + Sbjct: 111 LQAGGGTAIDEGIKLGIQE-----------------SSTGSKGYVSHIFLLTDGENEHGN 153 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQN 404 N +K+ + A E I + T N L K + + + LI F Sbjct: 154 -NQRCLKLAEVAAEYGITLNTFGFGDHWNQDILEKIADIAGGSLSYIERPEQALIE-FTR 211 Query: 405 ISQLMVHRKYSVIL 418 + + + + Sbjct: 212 LFNRLQSVRLTNAF 225 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 25/230 (10%), Positives = 75/230 (32%), Gaps = 52/230 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D + +++ K + FL+ + + +GL+ + + + Sbjct: 96 DVSNSMLADDIQPSRLQKSKYTISNFLERLG-------NDRVGLVVFAGQSFVQCPITSD 148 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ ++ D++ + T+ + A++++ + L + + K Sbjct: 149 KSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALE--------RIEEGAEAEEKNRVRNK 200 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +DGE++ + + I+I T+ + ++ Sbjct: 201 VILIFSDGEDHEAGIDEVLEEAA----SKNIRIYTVGVGSAEPTPIPVLNKDGKRVDFKR 256 Query: 374 -----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L K + +Y + F+ I+ + Sbjct: 257 DSQGSVVTTHLQEALLRKIAEQTKGNYYRIAPQG---SDFELIADDINKL 303 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 62/178 (34%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + +G++ Y+ R + T++ Sbjct: 59 QWVSNLVETFEIGPDKTRVGVVRYSDRPTTEFDLGKYKTCEEIKEAARKIRYYGGNTNTG 118 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + ++ G ++ +K I LTDG + ++ + + Sbjct: 119 DALRY----------INTYSFSKEAGGRLSDRTVKKVAILLTDGRSQDYVLDP-----AN 163 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A++ I+I + + + + L S P+ + ++V + +++ + + + + Sbjct: 164 AARQAGIRIFAVGVGEALKEE--LDEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCE 219 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 46/376 (12%), Positives = 114/376 (30%), Gaps = 19/376 (5%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + D S S++ + D + K + + A T + + Sbjct: 67 NNGRKDN-DYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNEEKMHLIPTCK 125 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-IEHLLNQRYNQKIVSFIPALLRIEMGE 187 N + +S++ + + + + + + + S A + Sbjct: 126 TKTDTNKKGHTSSSEVTCTVSGTIEHKSWFPLKVGSVEVIPHEVNVASKSKAFKKNTFNI 185 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + +V DLSGSM+ +++ E V +K+ L++ L + L Sbjct: 186 --PIDLMVVADLSGSMNYDLSNKNEIV-------GSPMSKLGILQDVLSELAEKTLLSEE 236 Query: 248 VKEDVYMGLIGYT-----TRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPA--MK 299 + + + + + I SW K Q + K + A + Sbjct: 237 ANHNNRIYVTPFALGAEISSSNCAIPYSWDMNKNNQELENAKNTLSNSKNSQYYRAEFIN 296 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L + + +K F + + + + + IK K Sbjct: 297 NLVYKLNTRDTLTNIGGQKDYKLKYSKGAFCLKNMRTQNKGWYSHNNKSDFIKYVKALKA 356 Query: 360 NFIKIVTISINASPNG 375 + + + + + N Sbjct: 357 DGATLASSGLLVAANN 372 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 68/219 (31%), Gaps = 43/219 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM ++ K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVVLVLDHSGSMADTEDAGIP--------------KLEIAKSASAKMIEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + + K + + T P + +A + Sbjct: 445 SSD------GVGVIAFDHNYYWAYKFGK-ISKKEDVIESISSIEVGGGTAIIPPLSEAVK 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S I+ LTDG + ++AK N IK Sbjct: 498 TLKKSKAKSKL------------------IVLLTDGMGEQGGYEIP----ANEAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I TI + N L + Y V N L+ VF Sbjct: 536 ITTIGVGKYVNATVLSWIASFTSGRFYLVSNPSELVDVF 574 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/356 (10%), Positives = 92/356 (25%), Gaps = 49/356 (13%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 G + + AKR + + ++ + SA ++ +S+ + Sbjct: 501 AGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVG- 559 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + T + ++ I Sbjct: 560 -EVPRITVTNTVVTAPRYRKYIKANNDGTY------DLSLNVTGTQSGSSQTTVSPADIV 612 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDV 252 +V D SGSM M + +++ K A+ + + ++ Sbjct: 613 VVFDTSGSMSNPMGHN---------------SRLEVAKTAVNSMAQHLLTSENQGKDSNI 657 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 M L+ ++T V + + V T+ A+K A LTS +K Sbjct: 658 RMALVPFSTTVGNVSNFTDNAMDIVSAVNGL---RADGGTNWEAALKAANAKLTSGRKGV 714 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-------------------VNTIKI 353 F G + + + Sbjct: 715 KKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAV 774 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + ++ +++ P ++ + +Y+ + D L F +I + Sbjct: 775 AEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDELNKAFADIIGQI 828 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 63/184 (34%), Gaps = 28/184 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ K A ++ + +G I + R +E + E + Sbjct: 412 SGSMAGSKLELAKEAAARSVELLREKD------TLGFIAFDDRPWVIVE-TGPLEDKKDA 464 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ ++++AY+ L + +K II LTDG Sbjct: 465 VDKIGSVTPGGGTEIFTSLEKAYEELENL------------------KLQRKHIILLTDG 506 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ + + KEN I + T+++ + + L + Y+V ++ + Sbjct: 507 QSARSTDY---ESMIETGKENNITLSTVALGSDADRNLLEELAGLGAGRFYDVTDSSVIP 563 Query: 400 HVFQ 403 + Sbjct: 564 SILS 567 >gi|293361231|ref|XP_243912.5| PREDICTED: collagen alpha-1(XII) chain [Rattus norvegicus] Length = 3119 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 51/173 (29%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + + T++ AM Sbjct: 467 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINNFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2338 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2397 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2398 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2440 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2441 -EVKKAAFIIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2493 Query: 406 SQLMVH 411 ++ Sbjct: 2494 EDNLIT 2499 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 74/230 (32%), Gaps = 44/230 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 R + +V+D+SGSM +K+ K A L + Sbjct: 420 TQVRKPARVLVVMDVSGSMA-------------SESGYGSESKLDLAKKAATSALGQLTD 466 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGT-------EKVRQYVTRDMDSLILKPTDSTPA 297 MGL +TT + + + + + L T A Sbjct: 467 TD------QMGLWAFTTDLPTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAA 520 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--NNFKSNVNTIKICD 355 ++A + + + K ++ LTDG N + + ++ Sbjct: 521 TREAAKAMNAQK----------------DPNSINAVVVLTDGRNEYTDNDLDGLLRELNA 564 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 A+E+ +++ TI+ + L + +S Y+ N S+ VF ++ Sbjct: 565 SAEEDGVRVFTIAYGPDADLATLQEISEASRAAAYDARNPTSIDKVFSDV 614 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 60/190 (31%), Gaps = 27/190 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-- 271 SA + +K K A +L L ++ +G++ + T I Sbjct: 402 DRSASMSASFGVSKFDLAKEAAILALTALQAGD------RIGVLAFDTDTIWVIPFQAVG 455 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V + TR I T+ A+ L ++ + Sbjct: 456 EGAAVAELQTRIATMAIGGGTNIERALAVGLPALAAEPHSV------------------R 497 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + + ++ + A+ I + TI+I + L + +Y Sbjct: 498 HAVLLTDGRSYSNNYPRY-QQLVETARAAQITLSTIAIGTDADTDLLEQLARWGNGRYYF 556 Query: 392 VVNADSLIHV 401 V +A L + Sbjct: 557 VPDAADLPRI 566 >gi|301753369|ref|XP_002912539.1| PREDICTED: anthrax toxin receptor 2-like [Ailuropoda melanoleuca] Length = 611 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ + + + T Sbjct: 185 IYNFVQQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYI 243 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K II LTDG + K Sbjct: 244 HEGLKLANEQIQ----------------KAGGFKASSIIIALTDG-KLDGLVPSYAEKEA 286 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQ----LM 409 ++ ++ + + Q L+ S + + V +L + +I + Sbjct: 287 KISRSFGARVYCVGVLD--FEQAQLERIADSKDQVFPVKGGFQALKGIINSILDRSCTEI 344 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 345 LELRPSSVCVG 355 >gi|281346820|gb|EFB22404.1| hypothetical protein PANDA_000280 [Ailuropoda melanoleuca] Length = 482 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ + + + T Sbjct: 55 IYNFVQQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYI 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K II LTDG + K Sbjct: 114 HEGLKLANEQIQ----------------KAGGFKASSIIIALTDG-KLDGLVPSYAEKEA 156 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQ----LM 409 ++ ++ + + Q L+ S + + V +L + +I + Sbjct: 157 KISRSFGARVYCVGVLD--FEQAQLERIADSKDQVFPVKGGFQALKGIINSILDRSCTEI 214 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 215 LELRPSSVCVG 225 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 66/194 (34%), Gaps = 34/194 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + KM K A++ ++ + + + + ++ + R + WG Sbjct: 304 DTSGSMKEFSGAYMKMDIAKEAIVKYIRYL-----SRTNDRLSMVLFNFRADIL----WG 354 Query: 273 TEKVRQYVTRDMDS----LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 V++Y+ + T+ A+++A IL+ Sbjct: 355 PHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSN------------------Y 396 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 K II +TDG N S IK K + + + T+++ + + L++ Sbjct: 397 PNKHIICITDGRTVNASS---CIKEAVKLRRMGVTLSTVAVGDNSDFDLLMRLSKIGNGL 453 Query: 389 HYNVVNADSLIHVF 402 + + +L Sbjct: 454 FIKINDISNLDKAL 467 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 72/259 (27%), Gaps = 52/259 (20%) Query: 158 LQFI-EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L +QR VS IP + + + L++D SGSM Sbjct: 14 LDITQGS--SQRQLSMSVSAIPNSVDRHV----PLNLCLILDHSGSM------------- 54 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + + +K A +D ++ + ++ + R + I + + Sbjct: 55 -------SGSPLETVKKAAGELIDRLNPGD------RISVVVFDHRAKVLIP-NQDIDDP 100 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T +K + L K F L Sbjct: 101 ESIKKQINRLRTSGGTSIDEGLKLGIEELGKGKVERISQAF-----------------LL 143 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDGEN + +N + A + + + ++ N L K + + Sbjct: 144 TDGENEHGDNNRCLK-LAKLATDYNLTLNSLGFGNDWNQDILEKIADEGGGTLAYIEYPE 202 Query: 397 SLIHVFQNISQLMVHRKYS 415 +I F + M + Sbjct: 203 QVIDEFSRLFNRMQSVGLT 221 >gi|240140254|ref|YP_002964732.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] gi|240010229|gb|ACS41455.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] Length = 441 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 45/137 (32%), Gaps = 5/137 (3%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 +IFAL+ + + L+G I + + +++SA +A +LAG + + +S + Sbjct: 1 MIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTT 60 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + A + + ++ + E + + + S Sbjct: 61 QTIQAEAKAGADAPVSIQVT-----VASDKTSVEARAEQVIKLTFGAFVGMASIPISARA 115 Query: 144 IFYNMDVMTSYDYRLQF 160 + M L Sbjct: 116 RASVVGRMRLCMLALDP 132 >gi|194215132|ref|XP_001499245.2| PREDICTED: similar to Collagen alpha-1(XXII) chain [Equus caballus] Length = 1632 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 65/218 (29%), Gaps = 21/218 (9%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 SG + + + + +D + +G+ Sbjct: 29 SGDSGCQAQRAGCKTVHYDLVFLLDTSSSVGKEDFEKVQQWVANLVDTFEVGADHTRVGV 88 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AYQILTSDKKRSFFT 315 + Y+ + E + + T++ A++ + Sbjct: 89 VRYSDQPTTAFELGHFRSREAVKAAARRLAYHGGNTNTGDALRYITRHSFSP-------- 140 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + G + F++ I LTDG + + + A I+I + + + Sbjct: 141 ---QAGGRPGDRAFKQVAILLTDGRSQD-----LVLDAAATAHRAGIRIFAVGVGEALKE 192 Query: 376 QRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 + L+ S P+ + ++V + +++ + + + + Sbjct: 193 E--LEEIASEPKSAHIFHVSDFNAIDKIRGKLRRRLCE 228 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 90/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + I + + + Y L + + ++ + P Sbjct: 175 APQPVADEEDNTKPIAARIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTRDLP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIRSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 405 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADPVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDMGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 61/217 (28%), Gaps = 32/217 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT------TRVEKN 266 D + + + + KM K A+ F+ + + + VY + Sbjct: 149 DASGSMAGRIRDGVKMDLAKEAIENFVSDMPENAKISLRVYGHKGSNRKQDQKESCASTE 208 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + G+ ++ T AM++A Q L Sbjct: 209 VVYPHGSYVKGKFGKALNSFEPTGWTPLAAAMEEARQDLKP----------------YAG 252 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLK-TCV 383 + I ++DG +K + I+ + I + GQ+ LK Sbjct: 253 EDAENIIYVVSDGIETCGGD---PVKAAKSLYNSDIQAVVNIIGFDVDDAGQQALKKVAE 309 Query: 384 SSPEYHYNVVNADSLIHVF----QNISQLMVHRKYSV 416 + + + L F I + + Y+V Sbjct: 310 AGGGEYKTANTREELNQSFGIDWDEIEKEVSKVWYNV 346 >gi|293349337|ref|XP_001060689.2| PREDICTED: collagen, type XII, alpha 1 [Rattus norvegicus] Length = 3064 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 51/173 (29%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + + T++ AM Sbjct: 467 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINNFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2338 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2397 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2398 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2440 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2441 -EVKKAAFIIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2493 Query: 406 SQLMVH 411 ++ Sbjct: 2494 EDNLIT 2499 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 64/187 (34%), Gaps = 19/187 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSREEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ L+ R G + +++ I LTDG + + + Sbjct: 119 DALRY-ITALSFS---------PRAGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA--- 165 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRK 413 I+I + + + + L+ S P+ + ++V + +++ + + + + + Sbjct: 166 --HRAGIRIFAVGVGKALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCESE 221 Query: 414 YSVILKG 420 S +G Sbjct: 222 CSRAPRG 228 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 25/205 (12%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 ++ I + + K + F+ + L + ++GLI Y + V+ Sbjct: 7 NSCNNKHLDLVFIIDSSRSVSHYDFEK--VKEFILTILQFLDISPDATHVGLIQYGSTVK 64 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + +Q + R + ++ T + + Y + +G + Sbjct: 65 QEFSLKTFR--RKQDIERAVKRMMHLGTGTMTGLALQYAV--------NIAFSETEGARP 114 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + I+ +TDG + + +I KA+ + I I I + + LK+ S Sbjct: 115 LRQNVPRIIMIVTDGRPQDPVA-----EIAAKARNSGILIFAIGVGR-VDMNT-LKSIGS 167 Query: 385 SP--EYHYNVVNADSLIH---VFQN 404 P E+ + V N + FQ Sbjct: 168 EPHEEHVFLVANFSQIETLTSAFQT 192 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 38/354 (10%), Positives = 92/354 (25%), Gaps = 54/354 (15%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN--- 115 A + A R+E + + + + F N Sbjct: 109 RASASKAFAAQGGNVMGTARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQ 168 Query: 116 ---TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 E + + I + + + Y L Sbjct: 169 LPLPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTL 225 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ + P + ++D SGSM ++ ++ Sbjct: 226 LKVDVQ-ARDMQTRDLPPANLVFLIDTSGSMQ-------------------PAERLPLIQ 265 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +AL L ++ + ++ Y G T V T ++ + T Sbjct: 266 SALKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGST 318 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ AY+ + I+ TDG+ N ++ I+ Sbjct: 319 GGEAGLRLAYEQAEKGFIKGGVN----------------RILLTTDGDFNLGITDPKDIE 362 Query: 353 I-CDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 K +E I + T+ + + ++ + + + + V ++ Sbjct: 363 ALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 416 >gi|301767086|ref|XP_002918971.1| PREDICTED: collagen alpha-1(XXI) chain-like [Ailuropoda melanoleuca] Length = 957 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLVAAMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSKITLFAIGVGSETEDAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 35/348 (10%), Positives = 89/348 (25%), Gaps = 20/348 (5%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + ++ I + ++ I ++ + Sbjct: 195 ASQNPANTTADLFDNQDISTYVVAVAGDPAAEMAADETAAAGGTDAAIDGNTPEELEQAL 254 Query: 134 NNRLDSSNNTIF--YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGE 187 N + + +++ + + L +LN GE Sbjct: 255 ANVVQNIISSVVAPECIGGLPRVMILLDASSSMLNIGGGTMAGGMGETGWDQAREALAGE 314 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL-- 245 + +F I++ + + + + AP Q+ K+ LD Sbjct: 315 QGLFDIDVGIGAAEDVTQLGLAVFGHNQPAPGEQNILVQYAPCAKDNFAWALDPNSSCVE 374 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQI 304 + I +T + P + ++ T + + Q+ Sbjct: 375 PGCTDPWGGPPIAWTYSDGQQDPPGFDIPTTSHMPQCAGNTFCSGSGTYTHLGL----QL 430 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + ++ + + F I +TDG+ N + +N ++ + I Sbjct: 431 IKDNQVQYQADGLMDGAEFPTNDETIYFNILITDGQYNGYSTNAQVQGELEEMYNDGITT 490 Query: 365 VTISINASPNGQRLLKTC-------VSSPEYHYNVVNADSLIHVFQNI 405 I + + E +Y+ N L +I Sbjct: 491 YVIGFGDGVDTPAAMAQLQNMAQWGSGDSENYYDANNQAELEAALTSI 538 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 77/235 (32%), Gaps = 45/235 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 N+ ++ ++ + P + ++D+SGSM + Sbjct: 156 PWNPENKLML-VGLQTKKLSTEQLPPSNLVFLIDVSGSM-------------------DE 195 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ LK+A L +D +D V ++ Y ++ + G EK + + M Sbjct: 196 PNKLPLLKSAFKLLVDELDEDDRVS------IVVYAGAAGLVLDSTPGNEKDK-ILDALM 248 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T +K AY + + +S +I TDG+ N Sbjct: 249 NLEAGGSTAGAEGIKLAYDVAKKNFIKSGNN----------------RVILATDGDFNVG 292 Query: 345 KSNV-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 S+ +++ +K ++ I + + +++ + + N Sbjct: 293 ISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDNIAE 347 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 55/168 (32%), Gaps = 21/168 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I + + + +G+I + + V+ + + + +R T+ + Sbjct: 574 ISMAESMSGNARLGVILFNSSVQGLTDFTVQPDVIRSTAESMT---ANGGTNIFDTVVHG 630 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKICDKAKEN 360 + ++ ++DG ENN + I AK+ Sbjct: 631 LESF-----------------PKNGPEVLNTLVVMSDGQENNAHSAEEIQTAIGQAAKDK 673 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + + + + + L S+ + V ++ SL +QN++ Sbjct: 674 SILVHCLGLGSEVDANYLQTIAQSAGGTYQYVTDSSSLAVFYQNLASQ 721 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVGIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 49/168 (29%), Gaps = 28/168 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + T++ AM Sbjct: 471 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINTFPYRGGSTNTGKAMTY 529 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDGE S+ K + + Sbjct: 530 VREKIF-------------VPNKGSRSNVPKVMILITDGE-----SSDAFRDPAIKLRNS 571 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++I + + + + L+ S P + + V + FQ IS Sbjct: 572 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRIS 613 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1220 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPNKGGN 1279 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1280 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1321 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1322 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1379 Query: 410 VH 411 + Sbjct: 1380 CN 1381 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 54/186 (29%), Gaps = 28/186 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 2343 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2402 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ F ++ V +K + L G + +S Sbjct: 2403 RGGNTRTGKALT-----------------FIKEKVLTWESGMRKNVRVL--GVVTDGRSQ 2443 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2444 DEVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2497 Query: 406 SQLMVH 411 ++ Sbjct: 2498 EDNLIT 2503 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 69/227 (30%), Gaps = 47/227 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP + V+D SGSM K+ K A+ ++++ Sbjct: 50 ERPPLNLAFVLDRSGSMA--------------------GDKLKFAKKAVAYAVENLRPHD 89 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ Y +VE + + K + + R T+ Sbjct: 90 ------RVAVVIYDHQVEVVVPSTLAENK-EEILRRLRPVRPRGSTNLHAG--------- 133 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 + Q +I L+DG N ++N N I + + Sbjct: 134 -------WLEGSTQVAAHLDAKRLNRVIVLSDGLANTGETNPNVIAEQVRGLSQRGVSTS 186 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 T+ + N ++ + +Y + + D L +F +Q + Sbjct: 187 TLGVGLDYNEDLMMAMAEAGQGNYYFIESPDDLPGIF---AQELSGL 230 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 75/238 (31%), Gaps = 34/238 (14%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + E P + + +D SGSM + + P Sbjct: 378 LGGYFNTPLEQTLPVDSQIRNPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIP- 436 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K + +L +++ G++ + T + P T+K + Sbjct: 437 ---KVDIAKESAILSSETLGPNDI------FGVVAFDTAPRWVVRPEPVTDK-SSIAEKV 486 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ + +A L K ++ K +I LTDG +N Sbjct: 487 AGIQGSGGTNIYGGLAEAIDSLIKVKAKN------------------KHVILLTDGWSNV 528 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + ++ KA+ + I I T+S A + Q L YN ++ + + Sbjct: 529 GNYD----ELISKARRHGITISTVS-AAGGSAQLLRSIAEKGGGTFYNTRDSADIPQI 581 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 75/229 (32%), Gaps = 49/229 (21%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + + V+D+SGSM ++ K AL +F++ + Sbjct: 374 VENKKRVDVMAVLDVSGSMADEA-------------------RLEQAKTALRIFIEQLQD 414 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 GL ++ S K + + R T +AYQ Sbjct: 415 DD------GFGLTIFSDSATVLTPVSPIGPKRAEILNRIAGLTPRGGTRLLDTTVEAYQE 468 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---- 360 +++ + ++ LTDG +N + N + + +++ Sbjct: 469 MSATPPGQRI----------------RAVVVLTDGLDNKSQRN--AQDVLNLLRQDREGY 510 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD--SLIHVFQNISQ 407 IK+ T++ + L + ++ Y + S+ V+Q+I+ Sbjct: 511 SIKVFTVAFGGDADVNLLKEIAEATGAKSYVGKPGERGSIERVYQDIAT 559 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 57/203 (28%), Gaps = 48/203 (23%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K+ L ID L + + LI + + T + L+ + Sbjct: 106 KDRLSAVRQQIDRLIAARPGDRIALIVFADHAYLLSPLTQETNALLGLTRELDFELVGRT 165 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ A Q ++ + ++ +TDG N ++ + Sbjct: 166 TALGEAILLARQHKAPERSTA--------------------LLLVTDGRNTAGSAD--PL 203 Query: 352 KICDKAKENFIKIVTISINASP----------------------NGQRLLKTCVSSPEYH 389 + +A I++ T+ + A P + L + + Sbjct: 204 REAKQAAAAGIRLYTLGVGADPDTFAEAMTPAQTPAQSDPSAELDEALLQQLAEVGHGRY 263 Query: 390 YNVVNADSLIHVFQNISQLMVHR 412 + L + I+Q + Sbjct: 264 FRARTQGDL----EAINQALDTL 282 >gi|316975135|gb|EFV58594.1| putative calcium binding EGF domain protein [Trichinella spiralis] Length = 2537 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 64/183 (34%), Gaps = 21/183 (11%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLIL 289 K + + + L ++ +G+I YT +V+ + ++++Q + + Sbjct: 500 KYVMRFVHEFVTLFDINEDRTRVGIIQYTGQVKPEFYLNQHKNIDQLQQAIRNI--RYVG 557 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A++ + T + G + K + +TDG + Sbjct: 558 GLTKTGAALQFMTK----------NTFTAQMGARTRDPNVYKIGVVITDGRAQDNVKIP- 606 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-NISQL 408 D+A+ + I + + + + + L+ S + ++ V L + I + Sbjct: 607 ----ADEARRHNISLYAVGV-TNHVLESELEQIAGSKDRYFIVGTFAELNTRLRAKIQKE 661 Query: 409 MVH 411 M Sbjct: 662 MCK 664 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 32/344 (9%), Positives = 80/344 (23%), Gaps = 24/344 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 A ++ +I+ L + + IG + + K ++++ +A LA + ++ S Sbjct: 15 RARQRGAVAIVVGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSI 74 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 F + + F + I S S + + ++ Sbjct: 75 APASFARAEAAGQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSS 134 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 ++M + L ++ + I Sbjct: 135 K----YVRCTITRSNIMPWFMQVLMPS---------LDTLTVSSLATATLAPAQNNCGIP 181 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + GS + + V+ + F S + + Sbjct: 182 MAICSKGSAPSYGMTPGQWVSGFFGAGGGVTGSFNWI-----DFTPPAGGTSELAALLTG 236 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK--QAYQILTSDKKRS 312 + EP + T+ T A Y ++ Sbjct: 237 NGVCTLNVPTPVGEPGALGAAAAKAWNTRFGLYQTGSTNVTTAPPDFTGYSYTPTNWPSK 296 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTD----GENNNFKSNVNTIK 352 + + + G NN++ + Sbjct: 297 ANALADFLSRRSAHASYGATVSVGNTITGLGINNSYNPTTTVAQ 340 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 64/195 (32%), Gaps = 19/195 (9%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D ++ A K A+ L ++ +GL+ Y T + Sbjct: 64 DASGSMNQDDAPGLRIDAAKAAVTDLLGTL------PAPTQVGLMVYGTS-------TGS 110 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T+ R +D+ +L T + + +T+ +P+ + Sbjct: 111 TDAERAAGCQDIKTLAPVGTLNAATLTSQVAGITASGYTPIGNALRAAAQALPNEG-PRS 169 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYH 389 I+ ++DGE+ + + + E + + T+ + L ++ + Sbjct: 170 IVLVSDGEDTC--APPAPCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQATGGTY 227 Query: 390 YNVVNADSLIHVFQN 404 + NA L Q Sbjct: 228 SDAGNATGLTDALQA 242 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 44/384 (11%), Positives = 97/384 (25%), Gaps = 89/384 (23%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + + + + D + L SA E + Sbjct: 158 SVQSDDDDADTVANQNEAKDDETKDNADKTVHLGIASYRGMLKSASAGLSTPEHTKSIEY 217 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + + L + + + Sbjct: 218 QGNGAYTLKLD-VTGKDASTSTTDTTPIDIALVLDVSGSMNDDFGGR------------- 263 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD---- 240 +K++ALK A+ FLD Sbjct: 264 --------------------------------------GSPSKISALKTAVNSFLDETAK 285 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMK 299 + D + V + L+ Y ++ RQ T + ++ + T +K Sbjct: 286 TNDTIEDDNNKVKVALVKYANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLLK 345 Query: 300 QAYQILTSDKKRSFFT--NFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKIC-- 354 + L + +Q + +K++IF TDGE N + + Sbjct: 346 TSVNGLQAAGATYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDDDVANAAIK 405 Query: 355 --DKAKENFIKIVTISINASPNGQR-----------LLKTCV--------------SSPE 387 + K + +I I N + +S + Sbjct: 406 KSQELKNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYRSLGDRASGD 465 Query: 388 YHYNVVNADSLIHVFQNISQLMVH 411 Y+Y+ +A L +F +I + + Sbjct: 466 YYYSASSATQLAQIFNDIQKTITE 489 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 61/188 (32%), Gaps = 22/188 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + L + V +GL+ Y+ + + ++ V + +L Sbjct: 248 LQNFQKVKNFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQ--SSLKSVVLEQIRNL 305 Query: 288 IL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T++ A++ + +I +TDGE+N+ Sbjct: 306 PYNMGSTNTGSALEF-----------IRANYLTEMSGSRAKDGVPQIVILVTDGESND-- 352 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQN 404 D+ K + + + + IN + Q L K E + N + L + + Sbjct: 353 ---EVQDAADQLKRDGVFVYVVGINI-QDVQELQKIASEPFEEFLFTTENFNILQELSGS 408 Query: 405 ISQLMVHR 412 +SQ + Sbjct: 409 LSQAVCST 416 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/343 (9%), Positives = 98/343 (28%), Gaps = 41/343 (11%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 ++ + + + K + + S++++ R + + I Sbjct: 913 YSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQYLI 972 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG--S 202 + + L + N + + E+ E + D S Sbjct: 973 VITDGQSSD--PVADAAQGLRDTGINIYAIG-VRDANTTELEEIANNRVFFTDDFHFLKS 1029 Query: 203 MHCAMNSDPEDVNSAPICQDK-----------KRTKMAALKNALLLFLDSIDLLSHVKED 251 +H + D + + +K+ + ++ + + Sbjct: 1030 IHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEKIKDFVKRMVNQ---SNIGADK 1086 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKK 310 + +GL+ +++ + + + + I T++ A+ + Sbjct: 1087 IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQG 1146 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +++I +TDG++++ + ++ I I I + Sbjct: 1147 G--------------RPGVHQYLIVITDGDSHDDIVSP-----AKALRDRNIIIFAIGVG 1187 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 Q L + + Y+ N +SL ++ + I + + Sbjct: 1188 KIQRAQLL--AITNDQDKVYHEENFESLQNLEKEILYEVCTSQ 1228 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 59/226 (26%), Gaps = 31/226 (13%) Query: 189 PIFLIELVVDLS-GSMH--CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 P + + + SM C + +K + + Sbjct: 843 PGGKMPPLTGCTPQSMQKVCTHVEKADIYFLIDGSGSISPKDFTEMKEFMKDVIKMFH-- 900 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + V G++ Y+ ++ + T + A+ Q Sbjct: 901 -IGPDGVRFGVVQYSDKIISQFLLTQYTSMEK----------------LGTAIGNIQQGG 943 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +++I +TDG++++ ++ ++ I I Sbjct: 944 GGTTTGEALSKMALVFRNTARTNVAQYLIVITDGQSSDPVAD-----AAQGLRDTGINIY 998 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + N L + + + L + Q + + + Sbjct: 999 AIGV-RDANTTELEEIA---NNRVFFTDDFHFLKSIHQEVVRDICS 1040 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 17/190 (8%), Positives = 46/190 (24%), Gaps = 18/190 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ + L +++ + L Y S K Sbjct: 46 QHAQSVRNFLYILANSLQVGGDNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFK 105 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 +++ + + S + + ++ G + Sbjct: 106 PGGNRMGQSLQFILEHHFPEGAG-----------SRASQGVPQVALVMSTGV-----AED 149 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQ 407 + + + K I + I + L + S + + V N L + Q + Sbjct: 150 HFREPAEALKREGILLYAIGV-KDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRP 208 Query: 408 LMVHRKYSVI 417 + V+ Sbjct: 209 ELCTTLAKVV 218 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 18/161 (11%), Positives = 47/161 (29%), Gaps = 18/161 (11%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLI 288 + + L ++ L+ + V + L+ Y+ + + + +V + + Sbjct: 650 FQQVIKLLKTTVHSLNIHPDIVRVSLVFYSEKPQLKFSLNTFQNAAQVLTSLDQLTFRAR 709 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + F + Q+ + + + S Sbjct: 710 RGRTKAGAALDFLRK-----------EVFLPEKGSRSIWGVQQIAVIIME-----SPSLD 753 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 N + + I + I ++ + L K P H Sbjct: 754 NVSTPASHLRRTGVTIYAVGIQSASESKDLEKIATYPPGKH 794 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 61/188 (32%), Gaps = 22/188 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + L + V +GL+ Y+ + + ++ V + +L Sbjct: 248 LQNFQKVKNFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQ--SSLKSVVLEQIRNL 305 Query: 288 IL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T++ A++ + +I +TDGE+N+ Sbjct: 306 PYNMGSTNTGSALEF-----------IRANYLTEMSGSRAKDGVPQIVILVTDGESND-- 352 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQN 404 D+ K + + + + IN + Q L K E + N + L + + Sbjct: 353 ---EVQDAADQLKRDGVFVYVVGINI-QDVQELQKIASEPFEEFLFTTENFNILQELSGS 408 Query: 405 ISQLMVHR 412 +SQ + Sbjct: 409 LSQAVCST 416 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/343 (9%), Positives = 98/343 (28%), Gaps = 41/343 (11%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 ++ + + + K + + S++++ R + + I Sbjct: 710 YSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQYLI 769 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG--S 202 + + L + N + + E+ E + D S Sbjct: 770 VITDGQSSD--PVADAAQGLRDTGINIYAIG-VRDANTTELEEIANNRVFFTDDFHFLKS 826 Query: 203 MHCAMNSDPEDVNSAPICQDK-----------KRTKMAALKNALLLFLDSIDLLSHVKED 251 +H + D + + +K+ + ++ + + Sbjct: 827 IHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEKIKDFVKRMVNQ---SNIGADK 883 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKK 310 + +GL+ +++ + + + + I T++ A+ + Sbjct: 884 IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQG 943 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +++I +TDG++++ + ++ I I I + Sbjct: 944 G--------------RPGVHQYLIVITDGDSHDDIVSP-----AKALRDRNIIIFAIGVG 984 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 Q L + + Y+ N +SL ++ + I + + Sbjct: 985 KIQRAQLL--AITNDQDKVYHEENFESLQNLEKEILYEVCTSQ 1025 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 59/226 (26%), Gaps = 31/226 (13%) Query: 189 PIFLIELVVDLS-GSMH--CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 P + + + SM C + +K + + Sbjct: 640 PGGKMPPLTGCTPQSMQKVCTHVEKADIYFLIDGSGSISPKDFTEMKEFMKDVIKMFH-- 697 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + V G++ Y+ ++ + T + A+ Q Sbjct: 698 -IGPDGVRFGVVQYSDKIISQFLLTQYTSMEK----------------LGTAIGNIQQGG 740 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + +++I +TDG++++ ++ ++ I I Sbjct: 741 GGTTTGEALSKMALVFRNTARTNVAQYLIVITDGQSSDPVAD-----AAQGLRDTGINIY 795 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + N L + + + L + Q + + + Sbjct: 796 AIGV-RDANTTELEEIA---NNRVFFTDDFHFLKSIHQEVVRDICS 837 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 17/190 (8%), Positives = 46/190 (24%), Gaps = 18/190 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ + L +++ + L Y S K Sbjct: 46 QHAQSVRNFLYILANSLQVGGDNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFK 105 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 +++ + + S + + ++ G + Sbjct: 106 PGGNRMGQSLQFILEHHFPEGAG-----------SRASQGVPQVALVMSTGV-----AED 149 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQ 407 + + + K I + I + L + S + + V N L + Q + Sbjct: 150 HFREPAEALKREGILLYAIGV-KDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRP 208 Query: 408 LMVHRKYSVI 417 + V+ Sbjct: 209 ELCTTLAKVV 218 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 18/161 (11%), Positives = 47/161 (29%), Gaps = 18/161 (11%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLI 288 + + L ++ L+ + V + L+ Y+ + + + +V + + Sbjct: 447 FQQVIKLLKTTVHSLNIHPDIVRVSLVFYSEKPQLKFSLNTFQNAAQVLTSLDQLTFRAR 506 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + F + Q+ + + + S Sbjct: 507 RGRTKAGAALDFLRK-----------EVFLPEKGSRSIWGVQQIAVIIME-----SPSLD 550 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 N + + I + I ++ + L K P H Sbjct: 551 NVSTPASHLRRTGVTIYAVGIQSASESKDLEKIATYPPGKH 591 >gi|89095762|ref|ZP_01168656.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89089508|gb|EAR68615.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 432 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 58/207 (28%), Gaps = 21/207 (10%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + Q KM K A+ F ED M L Y + G Sbjct: 141 DSSGSMNGQVSGGVKMNLAKEAVENFAAGF------PEDTIMTLRTYGHKGT-------G 187 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT---NFFRQGVKIPSLPF 329 +K + + + T A K A + + K Sbjct: 188 DDKDKAMSCASTEVMYDANTYDQAAFKAALEKFKPSGWTPLAASIKAGYEDLKKKAGEDT 247 Query: 330 QKFIIFLTDGENNN-FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPE 387 + + ++DG + D + +K+ I + GQ LK + Sbjct: 248 ENILYIVSDGIETCEGDPVKEAKALADS--DLNMKVHIIGFDVDDAGQDQLKKTAEAGNG 305 Query: 388 YHYNVVNADSLIHVFQNI-SQLMVHRK 413 +Y V + L + + + + + Sbjct: 306 KYYTVNSKLELTNTLNELMGEAISSIR 332 >gi|325089698|gb|EGC43008.1| U-box domain-containing protein [Ajellomyces capsulatus H88] Length = 759 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 61/216 (28%), Gaps = 28/216 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + + K+A +++++ Sbjct: 75 DIVLCIDVSYSMQSSAPLPTTDESGEREETGLSV--LDLTKHAARTIIETLNEND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E + S + + L T+ +K + ++++ Sbjct: 128 -RLGIVAFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERR 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN----FKSNVNTIKICDKAKENFIKIVT 366 + + LTDG N+ I I T Sbjct: 187 T---------------PQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHT 231 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + +A + VF Sbjct: 232 FGFGYNIRSGLLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|332831234|ref|XP_528236.3| PREDICTED: collagen alpha-1(XXII) chain, partial [Pan troglodytes] Length = 695 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSPR-----------AGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 55/197 (27%), Gaps = 9/197 (4%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + +S+ L +I V +++A +A LAGA Sbjct: 3 LRKAINRFWDDNRGYVIALTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGAR 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + +I A A F SL + V Y+ + Sbjct: 63 ELDGRDDAITRADAAIEALANSA------AFGGGGNGMSLGSHITVAYDAGNAAGSTVAV 116 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY---NQKIVSFIPALLRI 183 + + + D ++ SY + + + + + A + Sbjct: 117 TYLKEIPADDDDPITASMETTDPNEASYAWVIANDQAMTTIFPVPVGFNRDTINVAAEAV 176 Query: 184 EMGERPIFLIELVVDLS 200 + + + + Sbjct: 177 AVYRSSACDVTPIYICN 193 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 66.8 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 76/232 (32%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 K++ ++ + P + ++D+SGSM + K Sbjct: 160 NPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSM-------------------DEPNK 200 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L +D +D V ++ Y ++ + G EK + + M+ Sbjct: 201 LPLLKSAFKLLVDELDEDDRVS------IVVYAGAAGLVLDSTPGNEKDK-ILDALMNLE 253 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K AY + + +S +I TDG+ N S+ Sbjct: 254 AGGSTAGAEGIKLAYDVAKKNFIKSGNN----------------RVILATDGDFNVGISS 297 Query: 348 V-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 +++ +K ++ I + + +++ + + N Sbjct: 298 EAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDNIAE 349 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 34 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 93 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 94 DALRYITARSFSPRT-----------GGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-- 140 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 141 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 194 >gi|28210485|ref|NP_781429.1| membrane-associated protein [Clostridium tetani E88] gi|28202922|gb|AAO35366.1| membrane-associated protein [Clostridium tetani E88] Length = 842 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 81/253 (32%), Gaps = 46/253 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRI-EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 L N K +P + ++ I L++D SGSM Sbjct: 380 FALGSYENTKFEELLPVSCNVKNKRKQGDAGIVLLIDCSGSMDDESGG------------ 427 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ K + + +++ + +G++G++ ++ + K + + Sbjct: 428 ---VKKIELAKQGAIETIKALESEDY------IGILGFSDTIDWVVPFQKAENK-EKLIK 477 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T P + + + L+S K + K +I LTDG+ Sbjct: 478 EVGKLKPKGGTLIIPGLIEGVKTLSSAKTKV------------------KHMILLTDGQA 519 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + K+N + + T+ + + + L + Y + S+ + Sbjct: 520 EKNGFDKYL----ENMKKNNMTLSTVGLGEDSDREVLTHLSDFTGGRKYFSNDFKSVPII 575 Query: 402 FQNISQLMVHRKY 414 F + +KY Sbjct: 576 FAK-ETRISQKKY 587 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 76/270 (28%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 16 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 70 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ +++ + ++ Y Sbjct: 71 TSM--------------------SGERIEKAREAAILAVNMLNITD------TLSVVAYD 104 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 105 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 148 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 149 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 208 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 209 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 238 >gi|146325834|sp|Q60847|COCA1_MOUSE RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor Length = 3120 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 50/173 (28%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + T++ AM Sbjct: 467 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINTFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|326504464|dbj|BAJ91064.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 720 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 78/261 (29%), Gaps = 54/261 (20%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + Q+++ + R + V+D+S SM Sbjct: 35 SDRAPLHENRQQVLLEVIDASSAGGDRRLGLDLVAVLDVSKSMRKND------------- 81 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 ++ +K A+ ++ + + + ++ ++ E+ T + + Sbjct: 82 ------RLGKMKTAMHFVINKLGHMD------RLSIVKFSEEAERLCPLLSVTPPNKARL 129 Query: 281 TRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +D ++ PT+ ++ +L + I L+DG Sbjct: 130 NHLVDGLQVIDPTNIRDGLEAGLSVLAGRRITGGRVA---------------SIFLLSDG 174 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSL 398 + N + + + + + T + + L + S V + +++ Sbjct: 175 DENRGHATTVDV--------SDVPVYTFGFGTDYDPKVLDEIARRSKGGTFNFVDDEENM 226 Query: 399 IHVFQNISQLMVHRKYSVILK 419 F I + S++++ Sbjct: 227 TEPFSQILGGL----LSIVVQ 243 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 76/270 (28%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 16 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 70 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + A +L ++ +++ + ++ Y Sbjct: 71 TSM--------------------SGERIEKAREAAILAVNMLNITD------TLSVVAYD 104 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 105 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 148 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 149 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 208 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 209 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 238 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 50/436 (11%), Positives = 112/436 (25%), Gaps = 92/436 (21%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 +++ A++ A GA + S L+ L S+ + R + Sbjct: 7 RSAQSGASDHAGRKGARPLRSVLASLCAVAMSLGMASASVAAFADDRQPAATADPQAATA 66 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR- 168 SA + S T+ + + ++ V L + Sbjct: 67 SAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSG 126 Query: 169 -------YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + + + + + +S + Sbjct: 127 KLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGR-CTTWQDQDSAGQKYTVTYNWI 185 Query: 222 DKKR-----------------TKMAALKNALLLFLDSIDLLSH----------------- 247 T++ ALK+A+ FLD ++ + Sbjct: 186 GGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYA 245 Query: 248 VKEDVYMGLIGYT------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 K +G Y + +W E +++ T + ++ A Sbjct: 246 GKNSDKIGNDTYNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHA 305 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-----VNTIKICDK 356 + L S QK +F +DG + N IK + Sbjct: 306 VKQLNSG-----------------RPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQ 348 Query: 357 AKENFIKIVTISINASPN------GQRLLKTCVSS--------------PEYHYNVVNA- 395 K + ++++I + + + S+ +Y V+A Sbjct: 349 LKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSAR 408 Query: 396 DSLIHVFQNISQLMVH 411 L +F+ I ++ Sbjct: 409 TDLQTIFKEIISIVTS 424 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 66.8 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 72/228 (31%), Gaps = 38/228 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-------LLLFLD-SIDLL 245 + + +D ++ + K+ + +++ ID L Sbjct: 257 PGSFKCACREGFTLMNDSRSCSACSNAATDVVFLIDGSKSVRPENFELVKKWINLIIDKL 316 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + ++GL+ Y++ V++ K +++ V R + + T + A+ Sbjct: 317 DVSETNTHVGLVQYSSTVKQEFPLGRHNSKRSLKEAVKRMN--YMERGTMTGHALSFLVD 374 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K I TDG + ++ + KAK K Sbjct: 375 -------------NSFGPNQGARPGVPKVGIVFTDGRSQDYIGD-----AAKKAKALGFK 416 Query: 364 IVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + + + + L+ S P ++++ + I++ + Sbjct: 417 MYAVGVG--NAVEDELREIASEPIADHYFYTADF----KTMNQIAKKL 458 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 64/201 (31%), Gaps = 30/201 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ +K L +D + + +G++ Y +RV+ + K + Sbjct: 63 SSRSVRPSEFEQVKVFLAKVIDGLS---VGPDATRVGVVNYASRVKNEVSLKS--HKTKA 117 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T + + + + F S K I +TD Sbjct: 118 ALVKAVSKIEPLSTGTMTGLAIQFAM---------NVAFSEAEGGRKSPDISKVAIIVTD 168 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + N I +A+E I+I I + L+ S P ++ V + Sbjct: 169 GRPQD-----NIRDIAARAREAGIEIFAIGVGRVD--MTTLRQMASEPLEDHVDYVESYS 221 Query: 397 SL-------IHVFQNISQLMV 410 + F + + Sbjct: 222 LIEKLTKKFQEAFCAVVSDLC 242 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 55/181 (30%), Gaps = 24/181 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDS 294 ++ + + +G++ Y+ I E + + T + Sbjct: 65 SWLVNITRNFDIGPKFIQVGVVQYSDYPVLEIPL-GTHESTENLIKEMESIHYLGGNTKT 123 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ AY L S K + LTDG++ + + Sbjct: 124 GRAIQFAYDHLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDVA 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRK 413 +A++N I + I + + L + + V + ++ + + I Q + Sbjct: 163 AEARKNKITLFAIGVGSEIEEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEES 222 Query: 414 Y 414 Sbjct: 223 V 223 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 62/181 (34%), Gaps = 45/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +D+S SM + +D ++ + + ++S+ Sbjct: 90 DVVFAIDVSKSM---------------LAEDVAPNRLEKSQQLVTQIINSLAS------- 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + ++ D L + T A++ A D++ Sbjct: 128 DRVGIIAYAGSAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAINEAIQLAKTYYNDDEQ 187 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ S + I ++A E I+I TI + Sbjct: 188 TN------------------RVLFIISDGEDHEGDS----VNIAEEASEEGIRIFTIGVG 225 Query: 371 A 371 Sbjct: 226 T 226 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 63/168 (37%), Gaps = 23/168 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 +D I+ L +G++ Y+++V+ + + + + + T Sbjct: 53 KFMIDIINSLEVGLSTTRVGVVQYSSQVQTVFSLKTFSNKSDMEKAINEIIPL--AQGTM 110 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + + +G + S + I +TDG + ++ Sbjct: 111 TGLAIQYAMNVAFT----------EEEGARPLSKNIPRVAIIVTDGRPQD-----RVTEV 155 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 +A+E I+I + + + L+ S P ++ ++V + D + Sbjct: 156 AVQAREAGIEIYAVGVQRADVSS--LRAMASHPLDDHVFHVESFDLIQ 201 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 26/185 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + ++ +D + + ++GL+ Y++RV S T Sbjct: 478 RPQNFELVKEFVINIVDSSAISAQGTHIGLVQYSSRVRTEFPLSQYTNGQDIKTAVKNIQ 537 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + K T + A+K + S+ + K + TDG + + S Sbjct: 538 YMEKGTMTGLALKHMVEQSFSEAEG-------------ARKNVPKIGLVFTDGRSQDDIS 584 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 + KAKE I + + + + L S P ++ + + Sbjct: 585 -----EWAKKAKEAGITMYAVGVG--KAVEDELNEIASDPVNKHSFYTADFS----TMNL 633 Query: 405 ISQLM 409 I++ + Sbjct: 634 IAEDL 638 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 46/155 (29%), Gaps = 12/155 (7%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + +++ L +I V +++A +A LAGA Sbjct: 2 LSRVVIRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 KM---VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++ ++R E I+N A + N +G A Sbjct: 62 ELDGRDDAITRAQTAIEKIANSAAFSGGGTGMSLGSNSSVTYEAGNDAGNTV-------- 113 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + + + + ++ SY + + Sbjct: 114 -TVLFLKSIPADDDTPIPASMETTEPSEASYAWVI 147 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 68/205 (33%), Gaps = 25/205 (12%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 T K + V R + + T + + Y + +G + Sbjct: 107 NEFSLK--TFKRKSEVERAVKRMRHPSTGTMTGLAIQYAL--------NIAFSEAEGARP 156 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + I+ +TDG + + ++ KA++ I I I + LK+ S Sbjct: 157 LRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSIGS 209 Query: 385 SP--EYHYNVVNADSLIH---VFQN 404 P ++ + V N + VFQ Sbjct: 210 EPHEDHVFLVANFSQIETLTSVFQK 234 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 66.4 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 22/191 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 KN L+ +G++ Y+ + I T + Sbjct: 73 YKDFDTAKN---WLLNITSSFDIGPSYTQVGVVQYSDFPQLEIPLGHNTSYQQLLSALKS 129 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T++ A+K A + + KR + K I +TDG++ + Sbjct: 130 IKYLGGNTNTGRAIKFATEEVFPTSKRLNVSK-------------NKIAIVITDGKSQD- 175 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQ 403 N + I A+ I + + + + L+ ++ + ++ + + Sbjct: 176 ----NVVNISSSARAQGIILFAVGVGSEITKSELVAIANMPSTHYVLYAEDYTTIDRIKE 231 Query: 404 NISQLMVHRKY 414 + Q + Sbjct: 232 TMRQKICEESV 242 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 78/236 (33%), Gaps = 57/236 (24%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + +V+D+SGSM ++ LK+A+ + + + Sbjct: 102 RAALDLVVVLDVSGSMR-------------------DFGRLDKLKSAMRFIIKKLAPMD- 141 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILT 306 + ++ + + +E +T +D L T+ +K Q+L Sbjct: 142 -----RLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLD 196 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-NFIKIV 365 + T +I ++DGE N+ + + + Sbjct: 197 GRRYTGARTAG---------------VILMSDGEQNSGD--------ATRVRNPQNYPVY 233 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLI--HVFQNISQLMVHRKYSVILK 419 T+S ++ + LL+ + V+++ + VF + + +V+++ Sbjct: 234 TLSFGSNADMN-LLQKLAGGGGTYNPVLDSGGMSMLDVFSQLMAGL----LTVVVR 284 Score = 39.8 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 11/161 (6%), Positives = 38/161 (23%), Gaps = 16/161 (9%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS----- 200 L + + K+ S + +++ + ++ + Sbjct: 99 TAGRAALDLVVVLDVSGSMRDFGRLDKLKSAMRFIIKKLAPMDRLSVVTFNGGATRECPL 158 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKR------TKMAAL--KNALLLFLDSIDLLSHVKEDV 252 +M D+ + + + L + + L+S +++ Sbjct: 159 RAMSEDAVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTGARTAGVILMSDGEQNS 218 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + S+G+ + + T Sbjct: 219 GDATRVRNPQNYPVYTLSFGSNADMNLLQKLAG---GGGTY 256 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/373 (10%), Positives = 99/373 (26%), Gaps = 58/373 (15%) Query: 44 LDWHYKKNSMESANNAAI----LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + K +++ AA A + R++ ++ + + + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 100 NHIKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + F N + + + N SNN Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDK--SNNKEPVPASKPIP 192 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + R + + I + E P + ++D SGSM Sbjct: 193 FAMRYELAP-APWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM---------- 240 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ ++++L L + + + + ++ Y + G+ Sbjct: 241 ---------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPSISGS 285 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K T+ ++ AYQ + I Sbjct: 286 HKAEIN-AAIDSLDADGSTNGGAGLELAYQQAAKGFIKGGIN----------------RI 328 Query: 334 IFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYN 391 + TDG+ N + +I+ + K +E+ + + T + S + ++ + + Sbjct: 329 LLATDGDFNVGIDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRIADVGNGNYSY 388 Query: 392 VVNADSLIHVFQN 404 + V + Sbjct: 389 IDTLAEAQKVLNS 401 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 19/198 (9%), Positives = 59/198 (29%), Gaps = 47/198 (23%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 ++R +++A++ + +++ + + L+ + + T+ + Sbjct: 103 GEQRDRLSAVRQQISRLIET-------RPGDRIALVVFADHAYLLSPLTQETKALLTLTR 155 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 L+ + T A+ A Q + + ++ +TDG N Sbjct: 156 ELDFDLVGRTTALGEAILLARQHADPKRPTA--------------------LLLVTDGRN 195 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP------------------NGQRLLKTCV 383 ++ + +A + + + T+ + A P + L + Sbjct: 196 TAGSAD--PLSEARRAAASGMTLYTLGVGADPDTFVEALQPAQSDPSAELDEALLQQLAK 253 Query: 384 SSPEYHYNVVNADSLIHV 401 ++ L + Sbjct: 254 VGQGRYFRARTQGDLDAI 271 >gi|111074529|ref|NP_031756.2| collagen alpha-1(XII) chain [Mus musculus] Length = 3061 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 50/173 (28%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + T++ AM Sbjct: 467 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINTFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 70/207 (33%), Gaps = 31/207 (14%) Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEK 265 + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 203 SCGNKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVKN 260 Query: 266 NIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 +V + V R T + A++ A I S+ +G + Sbjct: 261 EFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGAR 308 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTC 382 + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 309 PLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKAI 360 Query: 383 VSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 361 GSEPHEDHVFLVANFSQIESLTSVFQN 387 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 63/185 (34%), Gaps = 19/185 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + ID L+ + +GL+ Y+T+V + + Sbjct: 822 DNFEIVKQFLTAIIDSLAVSPKAARVGLLQYSTQVRAEFTLRSFSTAKDMKKAVAHMKYM 881 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K + + A+K ++ + +G + S + I TDG + S Sbjct: 882 GKGSMTGQALKHMFERSFTQ----------LEGARPLSARVPRVAIVFTDGRAQDDVS-- 929 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNIS 406 + +KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 930 ---EWANKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMNEISEKLK 984 Query: 407 QLMVH 411 + Sbjct: 985 MGICE 989 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 64/191 (33%), Gaps = 18/191 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ L +A + ++ + + +I + VE + + + + Sbjct: 25 SGSMSGEKIENLYDATNEMIKVFSDAVSKEKIIDIAIITFGENVELHTPYTSVVDFKSRG 84 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + L T A++ A ++ + PS ++ ++ ++DG Sbjct: 85 LNPF---LASGMTPLGTALRMAKDMIEDKE-------------TTPSNIYRPAVVLVSDG 128 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + + + + + ++I + ++LK+ E + N ++ Sbjct: 129 VPTD-EWRGPLDNFKNNGRSSKCQRFAVAIGNDAD-NQMLKSFAECNENFFIAENVSDIV 186 Query: 400 HVFQNISQLMV 410 F+ IS + Sbjct: 187 DKFKQISMSVS 197 >gi|198435715|ref|XP_002125840.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 3908 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 68/250 (27%), Gaps = 39/250 (15%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + R + V D + + + Sbjct: 360 FDVSPSVGGSSAMG--------AGRTSDIDSDTQTGRFVQDCKKA---------QKTDLV 402 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K + +D S +V MG Y+ + K Sbjct: 403 VLTDGSWSVGPQNFKKIQAFLVSLVDAFSIGFNNVLMGYAQYSDDARTEFNLNEHVTKDD 462 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + A+ L + + K I +T Sbjct: 463 LIRAINQVQYKGGNTATGGALDYIRTNLFTSEGG-------------TRRGVLKTAIVIT 509 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNA 395 DGE + + KE +++ +I + A+ + L+ SSP ++ ++V N Sbjct: 510 DGE----SILDDVTEPARMLKEIGVEVFSIGVAAALRSE--LEDIASSPASDHVFSVDNF 563 Query: 396 DSLIHVFQNI 405 D + ++ +NI Sbjct: 564 DDIKNI-KNI 572 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 22/221 (9%), Positives = 61/221 (27%), Gaps = 33/221 (14%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAAL-----------KNALLLFLDSIDLLSHVKEDV 252 + C+ + + + A ++ ++ Sbjct: 3403 CLNDGTWSRSNPVCRACKTAVTDVIFIVDGSWSVGEINFRKAKDFLKALVEPFEVGWDNS 3462 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 ++ Y+ + D T++ A+ + Sbjct: 3463 RFAVVQYSDDPRTEFLMNEHFTVTDVLNAIDAIPYKGGNTNTGKALAFSLYT-------- 3514 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K + LTDG + + N + ++ +K++T+ + + Sbjct: 3515 -----ALSPANGARPYVNKVALVLTDGRSQDEVGNP-----ARELRQAGVKVLTVGVGDA 3564 Query: 373 PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + LK+ S P Y+V + DS+ + +++ + Sbjct: 3565 DKNE--LKSIASPPYDSSVYHVSDYDSISEIKAHLAAKLCE 3603 >gi|124003889|ref|ZP_01688737.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123990944|gb|EAY30411.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 704 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 19/202 (9%), Positives = 60/202 (29%), Gaps = 27/202 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ LK + + L + + ++ Y + + Sbjct: 351 ASGSMDSEDKLPLLKRSFKVLLKQLTD-----SRTKIAIVAYAGASGLVLPATS-VSHRE 404 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T + T ++ AY+I +I T Sbjct: 405 KILTALENIESGGSTAGGEGIELAYKIAQQAFIAGGNN----------------RVILAT 448 Query: 338 DGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNA 395 DG+ N S+ +++ +++ + + + +++ + +Y + Sbjct: 449 DGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDSMMEKLTNAGNGNYYYI--- 505 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 D + + +++ + Y++ Sbjct: 506 DGINEAKKVLAKNLTGTLYAIA 527 >gi|322711218|gb|EFZ02792.1| U-box domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 734 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 63/237 (26%), Gaps = 30/237 (12%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V +P + I LV+D+S SM E + T Sbjct: 27 PSKRSLVVKVVPPRAPAAQIDHVPCDIVLVLDVSTSMEDNAPVPGETERTGL-------T 79 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 + K+A L +++++ +G++ + T + + + Sbjct: 80 VLDLTKHAALTIIETLNDRD------RLGIVSFATNSTIVQTLTHMDISNKDEARRKIKA 133 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ ++ QI + + ++ LTDG N+ Sbjct: 134 LDPNGSTNLWHGIRDGIQIFEQSAENGNI----------------RAMMVLTDGMPNHMC 177 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I I T L + + +A + VF Sbjct: 178 PQQGYIPKLKTLPRLPAAIHTFGFGYGLRSGLLKSLAEYGHGNYAFIPDAGMIGTVF 234 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 37/144 (25%), Gaps = 9/144 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++K ++ ++ ++ L + F I + K +++A + A LAGA Sbjct: 19 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 78 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ LS T ++ + Sbjct: 79 DQAEAAVIAT---------LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPT 129 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRL 158 D +M + Sbjct: 130 GEYDIYVRVAVTDMGISQYLSAVF 153 >gi|302551540|ref|ZP_07303882.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] gi|302469158|gb|EFL32251.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] Length = 518 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/223 (10%), Positives = 63/223 (28%), Gaps = 43/223 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 GER + V+D+SGSM + ++ + +L + D Sbjct: 159 AERTGERLPAALTFVIDVSGSM-------------------SEPGRLDLAQESLSVMTDR 199 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ ++ E + + + D T+ ++ Sbjct: 200 LRDDDS------VAIVTFSDEAETVLPMTRLDGNRDEIQDVISDLATQDSTNLGAGVETG 253 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKE 359 Y+ ++ ++D N ++ +I + +E Sbjct: 254 YETAVEGL----------------REGATNRVVLVSDALANTGDTDADTILERIAGERRE 297 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + I + + + + + + + V D VF Sbjct: 298 HGITLFGVGVGSDYGDALMERLADRGDGHTVYVSGPDEAHEVF 340 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 86/265 (32%), Gaps = 65/265 (24%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + R + E + LV+D+S SMH + ++++ A+K Sbjct: 90 RIVQRQTVAETRGIDLLLVLDVSRSMHQQDFNG--------------QSRLEAVKGVGKQ 135 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTP 296 F+ + +GL+ ++ + + + + +I + T Sbjct: 136 FV-------LSRSADRIGLVVFSGKGYTPCPLTLDHLTLGTVLDNISSEVIQEEGTAIGT 188 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A L S QK II LTDG+NN + + Sbjct: 189 AILIAVNRL------------------RASESRQKAIILLTDGQNNAGDID--PLTAAGF 228 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVS----------------------SPEYHYNVVN 394 A ++ IKI TI+ + + + +++ S + + V + Sbjct: 229 ALQDGIKIYTIA-ATAQDARPFVRSAESLPAGGALSGLPAEDVLVSISRLTQGRSFRVGD 287 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 L F +I +L S + + Sbjct: 288 KAGLAETFDDIDRLEKSSLRSPVSQ 312 >gi|323699770|ref|ZP_08111682.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323459702|gb|EGB15567.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 2034 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 15/177 (8%), Positives = 43/177 (24%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + D + ++ + T+ + + T Sbjct: 433 NALDATRKFIAALPADARVRVVDFDTKPRALPGETRD-----AALKGLAGIKANGATCLN 487 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K+ + ++ D Sbjct: 488 DAVLLGLHMLAGAKRPALLVFTDGFDANFNDTGPGSKATR---------------REVLD 532 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 K + + TI + L + S +Y + +L F ++ + + Sbjct: 533 AVKTGGVPVFTIGFGKGHDVSTLDRIASLSGGRYYPASDPGALDKAFAVVNANLANT 589 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 44/404 (10%), Positives = 100/404 (24%), Gaps = 70/404 (17%) Query: 39 FLIYVLDWHYKKNSMESANNAA--ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 L+ K M +A +A +A D R Sbjct: 419 LLVDSSGSM--KGQMRNALDATRKFIAALPADARVRVVDFDTKPRALPGETRDAALKGLA 476 Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY-- 154 IK + L+ + + + + N D+ + +V+ + Sbjct: 477 GIKANGATCLNDAVLLGLHMLAGAKRPALLVFTDGFDANFNDTGPGSKATRREVLDAVKT 536 Query: 155 ---------------DYRLQFIEHLLNQR-----------YNQKIVSFIPALLRIEMGER 188 L I L R +V+ A ER Sbjct: 537 GGVPVFTIGFGKGHDVSTLDRIASLSGGRYYPASDPGALDKAFAVVNANLANTFTARYER 596 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P V + M ++ + C +M +K L FL ++ Sbjct: 597 PTRGRPSNVPVVTCMV-DISGSMDKTPDFSGC----NYRMDKVKAILHDFLAAL------ 645 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++V + ++ + + T + + D T+ A+ + Sbjct: 646 PDEVLAQGMTFSDQNVIEQVTTANTGE---MLAAMNDLYADGGTEIAGAVAAVLET---- 698 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKICDKAKENFIKIVT- 366 + +++++F+TD + + K ++ + Sbjct: 699 --------------QRAIPSTRRYLLFITDAALDVEPEDKLFFETTLAKLRDEG--VYCL 742 Query: 367 -ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I + + S + + L F + + Sbjct: 743 WVGIG-ELDPAPFKRAAEISGGSYVLTEDPAELGRAFDGLVADI 785 >gi|330417948|ref|NP_001193426.1| collagen alpha-1(XII) chain [Bos taurus] Length = 3065 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGEAIDYLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ P+ ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHKDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + DSL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFDSLSRIVDDLTNNL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 55/186 (29%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ + + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFHTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 >gi|296484270|gb|DAA26385.1| collagen, type XII, alpha 1 [Bos taurus] Length = 3115 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + +E +G++ Y++ + ++ Sbjct: 154 NNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 214 GGNTMTGEAIDYLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/255 (13%), Positives = 78/255 (30%), Gaps = 32/255 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + ++ P+ ++E + + C+ D + + Sbjct: 388 TLSVRDLSADTEYQISV-SAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKADIVFLVD 446 Query: 221 QDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + FL+ + V + L+ Y+ T KV Sbjct: 447 GSYSIGIANFVK--VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-KVEDI 503 Query: 280 VTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T++ AM + + K K +I +TD Sbjct: 504 IEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMILITD 550 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 G KS+ K + + ++I + + + + L+ S P + + V + Sbjct: 551 G-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF- 602 Query: 397 SLIHVFQNISQLMVH 411 FQ IS + Sbjct: 603 ---DAFQRISFELTQ 614 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHKDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + DSL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFDSLSRIVDDLTNNL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 55/186 (29%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ + + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFHTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 18/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ + E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDQPTTAFELGLFGSREEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPGDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 29/341 (8%), Positives = 77/341 (22%), Gaps = 33/341 (9%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 I + + + ++ T + + F Sbjct: 23 ILVSINSSVFRVKADIASKPQFTVTIDSYTPKNPKLGEEITINGTIHPQP-FKISIPPKE 81 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP-----------IFLIELVVDLSGS 202 L + + + + + +I + D+S S Sbjct: 82 IVLVLDSSGSMADNYKLTNLKKAATDFITKMSTVKNLKIAIVDFDTQATIINKLTDVSSS 141 Query: 203 MH---CAMNSDPEDVNSAPICQDKKRTKMAALKNA------LLLFLDSIDLLSHVKEDVY 253 + + + + R L N+ + + + Sbjct: 142 TNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKNIIFMSDGEPTYYNWQ 201 Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKR 311 G+ ++ R D + + T Y+ S Sbjct: 202 TANSGWIWGDYNGTYYTYYDWSFNRVNYGFDSNHMPSGYRTGPYTGNYTGYKYQGSSYSN 261 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + S T + + KS + + K + +I Sbjct: 262 YYTDI-------TQSNVASNVSQSGTGYSDTDGKSLNYAKIMGEIIKGKGYNVFSIGYGL 314 Query: 372 SPNGQRLLKTC---VSSPEYHYNVVNADSLIHVFQNISQLM 409 +G ++ +S+ ++ ++ ++ VF NI + Sbjct: 315 DSDGNTKMQQIHSSMSTNNSNFYETDSGAIDAVFSNIGDDI 355 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 72/216 (33%), Gaps = 44/216 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P + +V+D SGSM + +R K+ K A+ + + Sbjct: 405 QQPDLALVMVIDRSGSMAEPVAGG-------------RRNKLDLAKEAVYQASLGLTPID 451 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GL+ + ++ V I T+ P ++QA L Sbjct: 452 ------QVGLVVFDDTANWVLQLQPLPSMVEIE-RALGSFGIGGGTNIRPGIEQAALALA 504 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + K ++ LTDG + S++ + + + I I T Sbjct: 505 STDAKI------------------KHVLLLTDGIAESNYSDLI-----AQMRASGITIST 541 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +++ N L+ + Y V + D + +F Sbjct: 542 VAVGLDANPN-LVDVANAGGGRSYRVTSIDEVPRIF 576 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 82/294 (27%), Gaps = 51/294 (17%) Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + A L S + + L L I + A Sbjct: 5 SFILLSAILSLGVSGLVQAAPL---VNCQVELDRGVLLAGPAQKTVIKIALDAPRAPRTA 61 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +RP + LV+D SGSM K+A + A + + + Sbjct: 62 QRPPVNLALVLDRSGSM--------------------SGNKIAKAREAAIEAVRRLSDGD 101 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L+ Y VE + + + R T A+ Q Sbjct: 102 ------LFSLVVYDDSVETLVP-AQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAA--- 151 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIV 365 + K P+ ++ L+DG N S + ++ + I + Sbjct: 152 -------------EVRKHSDAPYVNRVVLLSDGLANVGPSRPADLARLGAALLKEGISVT 198 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 T+ + N + + S HY V ++ L +F + SV+ + Sbjct: 199 TVGVGTDFNEDLMTQLAERSDGNHYFVESSRDLPRIFAAELGDV----LSVVAR 248 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 90/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 118 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 177 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + +++ + P Sbjct: 178 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQVRDLP 233 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 234 PANLVFLIDTSGSMQ-------------------PAERLPLIQSALKLLVNDLRAQDNIT 274 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 275 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 327 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 328 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 371 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 372 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 408 >gi|297299828|ref|XP_002805471.1| PREDICTED: matrilin-2-like [Macaca mulatta] Length = 897 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 64/199 (32%), Gaps = 24/199 (12%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + + I + K +D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK-VKEFLMDILQFLDIGPDVTRVGLLQYGSTVKN 107 Query: 266 NIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 +V + V R T + A++ A I S+ +G + Sbjct: 108 EFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGAR 155 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 156 PLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAIG 208 Query: 384 SSP--EYHYNVVNADSLIH 400 S P ++ + V N + Sbjct: 209 SEPHEDHVFLVANFSQIET 227 >gi|154280917|ref|XP_001541271.1| predicted protein [Ajellomyces capsulatus NAm1] gi|150411450|gb|EDN06838.1| predicted protein [Ajellomyces capsulatus NAm1] Length = 759 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 60/216 (27%), Gaps = 28/216 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + + K+A +++++ Sbjct: 75 DIVLCIDVSYSMQSSAPLPTTDESGEREETGLSV--LDLTKHAARTIIETLNEND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E E S + + L T+ +K + +++ Sbjct: 128 -RLGIVAFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERH 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN----FKSNVNTIKICDKAKENFIKIVT 366 + + LTDG N+ I I T Sbjct: 187 T---------------PQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHT 231 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + +A + VF Sbjct: 232 FGFGYNIRSGLLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 64/218 (29%), Gaps = 49/218 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 E+P + LV+D SGSM K+ K+A ++ + Sbjct: 459 EKPSLAMMLVIDKSGSM--------------------GGQKIELAKDAAQAAVELLGPKD 498 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +G+I + E T T+ PAM AY+ L Sbjct: 499 A------IGVIAFDGDSYTVSELRS-TSDRGAISDAISTIEASGGTNMYPAMADAYEALL 551 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + K +I +TDG S + + + I + T Sbjct: 552 GATAKL------------------KHVILMTDGV----SSPGDFQGVAGDMSASRITLST 589 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +++ + L + +Y + S+ VF Sbjct: 590 VALGQGSSEDLLEELAQIGGGRYYFCDDPQSVPQVFAK 627 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 37/144 (25%), Gaps = 9/144 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++K ++ ++ ++ L + F I + K +++A + A LAGA Sbjct: 6 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ LS T ++ + Sbjct: 66 DQAEAAVIAT---------LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPT 116 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRL 158 D +M + Sbjct: 117 GEYDIYVRVAVTDMGISQYLSAVF 140 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 38/356 (10%), Positives = 92/356 (25%), Gaps = 49/356 (13%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 G + + AKR + + ++ + SA ++ +S+ + Sbjct: 524 AGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVG- 582 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + T + ++ I Sbjct: 583 -EVPRTTVTNTVVTAPRYRKYIKANNDGTY------DLSLNVTGTQSGSSQTTVSPADIV 635 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDV 252 +V D SGSM M + +++ K A+ + + ++ Sbjct: 636 VVFDTSGSMSNPMGHN---------------SRLEVAKTAVNSMAQHLLTSENQGKDSNI 680 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 M L+ ++T + + V T+ A+K A LTS +K Sbjct: 681 RMALVPFSTTAGNVSNFTDNAMDIVSAVNGL---GADGGTNWEAALKAANAKLTSGRKGV 737 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-------------------VNTIKI 353 F G + + + Sbjct: 738 KKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAV 797 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + ++ +++ P ++ + +Y+ + D L F +I + Sbjct: 798 AEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDELNKAFADIIGQI 851 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 61/176 (34%), Gaps = 19/176 (10%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D I L +G+I Y+++V+ + + + + T + Sbjct: 56 FMMDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRADMERAINSIIPLAQGTMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + + ++G + + I +TDG + ++ + Sbjct: 116 AIQYAMNVAFT----------TQEGARPLHKRIPRIAIVVTDGRPQD-----RVTEVATQ 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 A+ I+I + I + L+ S P E+ + V + + + + + Sbjct: 161 ARNAGIEIYAVGIQRADMNS--LRAMASPPLEEHVFLVESFELIQQFAKQFQDKLC 214 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 52/159 (32%), Gaps = 20/159 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + +GL+ Y++RV + Sbjct: 361 FELVKQFVNRIVDLL---EVSPHGTRVGLVQYSSRVRTEFPLNKYHSADEIKKAVMDVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + S +G + S + + TDG + + S Sbjct: 418 MEKGTMTGLALKHMVEHSFS----------ELEGARPLSYNIPRIGLVFTDGRSQDDIS- 466 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + +AKE+ I + + + + L+ S P Sbjct: 467 ----EWARRAKESGIVMFAVGVG--KAVEEELRAIASEP 499 >gi|262371960|ref|ZP_06065239.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] gi|262311985|gb|EEY93070.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] Length = 537 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 33/254 (12%), Positives = 83/254 (32%), Gaps = 52/254 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + N K++ + + + P + +VD+SGSM K Sbjct: 152 KENAKLIKIGIQAKDLSVKQLPAANLVFLVDVSGSMDDPD-------------------K 192 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K L + + + + +I Y + + +EP+ G +K + + + Sbjct: 193 LPLVKQTLRILTEQLRPQD------KVTIITYASGEKLVLEPTSGDQKDK-ILRVIDELR 245 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T A++ AY+ ++ I+ TDG+ N Sbjct: 246 AGGATSGEQAIQLAYKQAEKAFIKNGIN----------------RILLATDGDFNVGITD 289 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + + +++ + + + + L++ + + + N + Sbjct: 290 FSTLKGMVAEKRKSGVSLTALGYGTGNYNEELMEQIADAGDGNYSYIDNKNE-------- 341 Query: 406 SQLMVHRKYSVILK 419 ++ +V R+ S L Sbjct: 342 AKKVVQRQLSSTLA 355 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 50/436 (11%), Positives = 112/436 (25%), Gaps = 92/436 (21%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 +++ A++ A GA + S L+ L S+ + R + Sbjct: 33 RSAQSGASDHAGRKGARPLRSVLASLCAVAMSLGMASASVAAFADDRQPAATADPQAATA 92 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR- 168 SA + S T+ + + ++ V L + Sbjct: 93 SAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSGSMNELSG 152 Query: 169 -------YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + + + + + +S + Sbjct: 153 KLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGR-CTTWQDQDSAGQKYTVTYNWI 211 Query: 222 DKKR-----------------TKMAALKNALLLFLDSIDLLSH----------------- 247 T++ ALK+A+ FLD ++ + Sbjct: 212 GGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYA 271 Query: 248 VKEDVYMGLIGYT------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 K +G Y + +W E +++ T + ++ A Sbjct: 272 GKNSDKIGNDTYNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHA 331 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-----VNTIKICDK 356 + L S QK +F +DG + N IK + Sbjct: 332 VKQLNSG-----------------RPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQ 374 Query: 357 AKENFIKIVTISINASPN------GQRLLKTCVSS--------------PEYHYNVVNA- 395 K + ++++I + + + S+ +Y V+A Sbjct: 375 LKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSAR 434 Query: 396 DSLIHVFQNISQLMVH 411 L +F+ I ++ Sbjct: 435 TDLQTIFKEIISIVTS 450 >gi|124008506|ref|ZP_01693199.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123986014|gb|EAY25864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 425 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 76/236 (32%), Gaps = 49/236 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ER I LVVD SGSM K+ +K A+ +D++ Sbjct: 40 QERIPLNISLVVDRSGSM--------------------SGDKLNYVKKAVDFVIDNLKSD 79 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + ++ Y ++ + T K + + T+ + M + Y + Sbjct: 80 D------VLSIVQYDDEIDVVASSAKVTNK-KALHEKVKGIQARNMTNLSGGMMEGYAQV 132 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKIC-DKAKENFIK 363 S S + ++ L+DG N + +I K +E I Sbjct: 133 KS----------------TQSNGYVNRVLLLSDGLANAGITAPEQLQQIAQKKFREAGIA 176 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + T + + N + +Y + D + +F +Q + SV+ + Sbjct: 177 LSTFGVGSDFNEVLMTNLSEYGGANYYFIDMPDKIPQIF---AQELEGL-LSVVAQ 228 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 33/87 (37%), Gaps = 5/87 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 SR + ++ +II LS+ + +G + + + K+ ++++ +A LA Sbjct: 8 RSRLTRRAR-----QRGAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALA 62 Query: 64 GASKMVSNLSRLGDRFESISNHAKRAL 90 A + L ++ A Sbjct: 63 AARDVTGATPLLVSEAAGLATGTSNAA 89 >gi|261194779|ref|XP_002623794.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588332|gb|EEQ70975.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 756 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 64/216 (29%), Gaps = 29/216 (13%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + P + K+A +++++ Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSV--LDLTKHAARTIIETLNDND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E + S + + + L T+ +K + + L Sbjct: 128 -RLGVVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEE--- 183 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVT 366 P + + LTDG N I + K+ I T Sbjct: 184 ------------VTPIPQNVQALYILTDGMPNHMCPRQGYVPKLRSILQQ-KDRLPMIHT 230 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 231 FGFGYYIRSGLLQAISEVGGGTYSFIPDAGMIGTVF 266 >gi|148694466|gb|EDL26413.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] gi|148694467|gb|EDL26414.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] Length = 2886 Score = 66.0 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +E +G++ Y++ + + Sbjct: 154 NNFKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYK 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ S F K I +TDG++ + Sbjct: 214 GGNTMTGDAIDYLVKNTFTESAGS-------------RAGFPKVAIIITDGKSQD----- 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S+P + +NV N D+++ + I Sbjct: 256 EVEIPARELRNIGVEVFSLGIKAAD--AKELKQIASTPSLNHVFNVANFDAIVDIQNEII 313 Query: 407 QLMVH 411 + Sbjct: 314 SQVCS 318 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 50/173 (28%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + T++ AM Sbjct: 467 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINTFPYRGGSTNTGKAMTY 525 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 526 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 567 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 568 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 614 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 2339 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2442 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2494 Query: 406 SQLMVH 411 ++ Sbjct: 2495 EDNLIT 2500 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 1216 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1276 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1318 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSKIVDDLTINL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 59/167 (35%), Gaps = 19/167 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D I L +G+I Y+++V+ + + + + T + Sbjct: 56 FMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + + ++G + P + I +TDG + ++ Sbjct: 116 AIQYAMNVAFT----------VQEGARPPHKKIPRIAIIVTDGRPQD-----RVSEVAAH 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 A+ I+I + I + L+ S P E+ + V + + + Sbjct: 161 ARNAGIEIYAVGIQRADMNS--LRAMASPPLEEHVFLVESFELIQQF 205 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 59/167 (35%), Gaps = 19/167 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D I L +G+I Y+++V+ + + + + T + Sbjct: 56 FMIDIIGNLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGL 115 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A + + ++G + P + I +TDG + ++ Sbjct: 116 AIQYAMNVAFT----------VQEGARPPHKKIPRIAIIVTDGRPQD-----RVSEVAAH 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 A+ I+I + I + L+ S P E+ + V + + + Sbjct: 161 ARNAGIEIYAVGIQRADMNS--LRAMASPPLEEHVFLVESFELIQQF 205 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 20/159 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + ++GL+ Y++RV + Sbjct: 361 FELVKQFVNRIVDLL---EVSPDGTHVGLVQYSSRVRTEFPLNKYHSAEEIKAAVMKMEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K + S +G + S + + TDG + + S Sbjct: 418 MEKGTMTGLALKHMVEHSFS----------ELEGARPLSHNVPRIGLVFTDGRSQDDIS- 466 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + +AKE+ I + + + + L+ S P Sbjct: 467 ----EWAQRAKESGIVMFAVGVG--KAVEEELRAIASEP 499 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 61/180 (33%), Gaps = 22/180 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ ++ + +++ +GL+ +++ ++ + + KV + T Sbjct: 1049 MKDFMERMVNQSNIGADEIQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTH 1108 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ S + +++I +TDG + + + Sbjct: 1109 TGKALNFTLPFFDSSRGG--------------RPRVHQYLIVITDGVSQDNVAPP----- 1149 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ I I I + Q L + + + N +SL + + I + + Sbjct: 1150 AKALRDRNIIIFAIGVGNVQRAQLL--EITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 65/222 (29%), Gaps = 28/222 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I S C + K +K+ + + Sbjct: 827 MSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFH---IGP 883 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V G++ Y+ ++ + D + T + A+ + + Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQ--- 940 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +++I +TDG++ + + + ++ + I I + Sbjct: 941 -------------NTARIDVARYLIVITDGQSTDPVA-----EAAQGLRDIGVNIYAIGV 982 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L+ S + + + DSL + Q + + + Sbjct: 983 RDANTTE--LEEIAS--KKMFFIYEFDSLKSIHQEVIRDICS 1020 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 52/172 (30%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L + V +GL+ Y+ + + K + T + A++ Sbjct: 262 VSGLDVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEF- 320 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + ++ +TDGE S+ + D+ K + Sbjct: 321 ----------IRANSLTEMSGSRAKDGVPQIVVLVTDGE-----SSDEVQDVADQLKRDG 365 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 + + + IN + Q L K E + N L + + Q + Sbjct: 366 VFVFVVGINI-QDVQELQKIANEPFEEFLFTTENFSILQALSGTLLQALCST 416 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 45/172 (26%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L ++++ +GL Y+ S K A++ Sbjct: 59 ANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGNRMGQALQF- 117 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +F S + + ++ G + + + + Sbjct: 118 ----------ILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHI-----REPAEALRRAG 162 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 I + I + + L + S + + V N L + Q + + Sbjct: 163 ILVYAIGV-KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCST 213 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 61/180 (33%), Gaps = 22/180 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ ++ + +++ +GL+ +++ ++ + + KV + T Sbjct: 191 MKDFMERMVNQSNIGADEIQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTH 250 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ S + +++I +TDG + + + Sbjct: 251 TGKALNFTLPFFDSSRGG--------------RPRVHQYLIVITDGVSQDNVAPP----- 291 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ I I I + Q L + + + N +SL + + I + + Sbjct: 292 AKALRDRNIIIFAIGVGNVQRAQLL--EITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 349 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 59/178 (33%), Gaps = 26/178 (14%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ I + + V G++ Y+ ++ + D + T Sbjct: 10 MKDFMKEVIKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTT 69 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + + + +++I +TDG++ + + + Sbjct: 70 TGKALSKMVPVFQ----------------NTARIDVARYLIVITDGQSTDPVA-----EA 108 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ + I I + + + L+ S + + + DSL + Q + + + Sbjct: 109 AQGLRDIGVNIYAIGVRDANTTE--LEEIAS--KKMFFIYEFDSLKSIHQEVIRDICS 162 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 54/180 (30%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 SWLVNITRNFDIGPKFIQVGVVQYSDYPVLEIPLGTHESTENLIREMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A+ L S K + LTDG++ + + Sbjct: 118 RAIQFAFDHLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDVAA 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 +A++N I + I + + L + + V + ++ + + I Q + Sbjct: 157 EARKNKITLFAIGVGSEIEEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEESV 216 >gi|327351891|gb|EGE80748.1| U-box domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 756 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 64/216 (29%), Gaps = 29/216 (13%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + P + K+A +++++ Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSI--LDLTKHAARTIIETLNDND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E + S + + + L T+ +K + + L Sbjct: 128 -RLGVVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEE--- 183 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVT 366 P + + LTDG N I + K+ I T Sbjct: 184 ------------VTPIPQNVQALYILTDGMPNHMCPRQGYVPKLRSILQQ-KDRLPMIHT 230 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 231 FGFGYYIRSGLLQAISEVGGGTYSFIPDAGMIGTVF 266 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 GSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 671 FEIVKQFVTGIIDSLA---ISPKAARVGLLQYSTQVRTEFTLRNFGSA--KDMKKAVAHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSARVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEINEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADVVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 GSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 630 FEIVKQFVTGIIDSLA---ISPKAARVGLLQYSTQVRTEFTLRNFGSA--KDMKKAVAHM 684 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 685 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSARVPRVAIVFTDGRAQDDVS- 735 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 736 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEINEKL 789 Query: 406 SQLMVH 411 + + Sbjct: 790 KKGICE 795 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/237 (11%), Positives = 65/237 (27%), Gaps = 55/237 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D T++ A + A + + V +G++ + + P+ Sbjct: 93 DVSRSMAASDVAPTRIGAAQQAARDLIVGLPPG------VRLGIVSFAATAIVVLPPTDN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD------------------------ 308 + + + T + + Q+ +L D Sbjct: 147 R---LRMLDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLEGILFGGESLAPGTGGRSL 203 Query: 309 ----KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + Q P +I L+DG + A + +++ Sbjct: 204 TEAAAADAVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTGPD---PLDAARMAAQRGLRV 260 Query: 365 VTISINASPNGQR---------------LLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ A + L + + ++ +A L V++ +S Sbjct: 261 YTVGFGAVQDAGTEGSSLSYEMQVDEPTLRQIATLTDGEYFQAGSAADLTRVYRQLS 317 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 68/246 (27%), Gaps = 32/246 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + IVS P L I LV+D+SGSM+ S +D Sbjct: 43 YSIPNRDSMIVSIQPPLKPENDVPHVPCDIVLVIDISGSMN----SAAPIPTGERGGEDT 98 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTR 282 + + K+A +++++ + ++ + T V E ++ + + Sbjct: 99 GLSILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLGA 152 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL------PFQKFIIFL 336 T+ MK+ IL ++ + + P+ K L Sbjct: 153 IDKLYGKSSTNLWHGMKKGLNILATNPAQGKIQSLLVLTDGAPNHMCPAQGYVPKLRQTL 212 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 D N + I T L + +A Sbjct: 213 LDHHNLTGTLPL---------------IHTFGFGYYLRSPLLQSIAEIGGGTFAFIPDAG 257 Query: 397 SLIHVF 402 + VF Sbjct: 258 MIGTVF 263 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 64/195 (32%), Gaps = 29/195 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +K+ K A ++ + +G I + R + IE + + Sbjct: 416 SGSMQGSKLELAKEAAARSVEMLRDED------TLGFIAFDDRPWEIIE-TGPLSSKEEA 468 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ ++ +AY+ L +K II LTDG Sbjct: 469 VDTILSVTPGGGTEIYSSLAKAYENLADL------------------KLQRKHIILLTDG 510 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ + + KE+ I + T++I + L Y+V++ ++ Sbjct: 511 QSQAGNY----EDLITEGKEDGITLSTVAIGQDADANLLEALSDMGSGRFYDVIDEQTIP 566 Query: 400 HVFQNISQLMVHRKY 414 + + ++ Sbjct: 567 SILSRETAMISRTYI 581 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 58/180 (32%), Gaps = 19/180 (10%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + + G++ +++ E +I+ + ++ + D + Sbjct: 108 RQEVEFLKSLARTFKISNNGAHAGVVTFSSIAELSIKLNQYYDQEQFERAVDDIPYMGYV 167 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVN 349 T A++++ ++ + + LTDGE K + N Sbjct: 168 TRIDLALRKSLEMFDEING--------------ARKSIPQILFLLTDGEQYAGKGVVDEN 213 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL--IHVFQNISQ 407 + I ++ I I I I ++ Q L S E + N + L + I + Sbjct: 214 PVSIAKLLRDKGIVIFAIGIGSAVR-QSQLNDIAGSSEKAFLAKNFNELVNSDFLKKIKE 272 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 56/182 (30%), Gaps = 34/182 (18%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + ++ + K + G++ ++ +I+ + + + S + T Sbjct: 738 VEFLINLASTFNISKYGAHAGVVTFSYDAFLSIKLNDYFNQAQFNNAVKDISYLNGGTRI 797 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++++ ++ + + + LTDGE + N + I Sbjct: 798 DLALEKSLEMFDE--------------LNGARKNTPQILFLLTDGE---QSGDKNPVDIA 840 Query: 355 DKAKENFIKI-------VTISINASPNGQRLLKTCVSSPEYHYNVVNADSL--IHVFQNI 405 + ++ I I + L V S + + N + L + I Sbjct: 841 KRLRDRGIIIFAIGIGSYV--------NKTELNNIVGSNDKAFLAENFNELVNNDFLKKI 892 Query: 406 SQ 407 + Sbjct: 893 KE 894 >gi|219841908|gb|AAI44536.1| COL22A1 protein [Homo sapiens] Length = 1319 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/356 (10%), Positives = 86/356 (24%), Gaps = 33/356 (9%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + D I + + + N + H Sbjct: 446 FMTKDNDGDKAYEEVTLITTTETNIGTGGDHAPTSNADVIVNGNSGHDNFVGDPGGATES 505 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV-VDLSGSM 203 + + + R ++ + +++ + L + SG++ Sbjct: 506 TQTTYTDLNVALVVDTSGSMDGTRMSETKEALKDLCDQLKEHADEGADVNLSLIGFSGAL 565 Query: 204 HCAM----------NSDPEDVNSAPICQDKKRTKMAALKNALLL---------FLDSIDL 244 + + N + K T+ + Sbjct: 566 NINLPVEDITGIEENYRKFTLRDGDTIIGKVGTEFSVTSYVPYDDWSGGQISSTTTYRIT 625 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + G +G+ E + + + D + T + + A Sbjct: 626 EDGTLQRWESGFMGWGGSWEDVRTFSTESLTGYEAVLAQIDDMEAIGGTVYSDGYEAAKD 685 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + + +IF+TDGE NN S N A + I Sbjct: 686 WF----------GGKTSPDSLQNNGGENIVIFVTDGEPNNEWSAKNAYNQLVAAVD-NIT 734 Query: 364 IVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + T+ I + LL ++ + + +A L VF I + + Sbjct: 735 VETVGIAITDKDATDLLNGLTTNNNGAHFIEDASKLGDVFGEIVSDITTSTVTPAA 790 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 1/74 (1%) Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N +++ IC A+ + I T++ A GQ L+ C SSP +H++V D + Sbjct: 2 NGTEADARLSDICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDVNGTD-ISSA 60 Query: 402 FQNISQLMVHRKYS 415 F I+ + K + Sbjct: 61 FSAIASDIRALKLT 74 >gi|119612600|gb|EAW92194.1| collagen, type XXII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1626 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|40805823|ref|NP_690848.1| collagen, type XXII, alpha 1 [Homo sapiens] gi|296434458|sp|Q8NFW1|COMA1_HUMAN RecName: Full=Collagen alpha-1(XXII) chain; Flags: Precursor gi|225000822|gb|AAI72420.1| Collagen, type XXII, alpha 1 [synthetic construct] Length = 1626 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|22652113|gb|AAN03620.1|AF406780_1 alpha 1 type XXII collagen [Homo sapiens] Length = 1626 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 69/246 (28%), Gaps = 61/246 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D ++AA + A+ F++ D +GL+ ++ E + P+ Sbjct: 93 DVSGSMCSTDVVPNRLAAAQEAVRRFVEEQDSG------TRIGLVVFSGFAELAVAPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT----SDKKRSFFTNFFRQGVKIPSLP 328 + + + + T A+ ++ ++ + P Sbjct: 147 RDAITRALDGLTTGR---GTTVGSAILKSVDAISEIDPDVAPSDPAPGTVPPPPRAPGTY 203 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS---------------- 372 + ++ LTDG N A E +++ I Sbjct: 204 APEIVVLLTDGANTTGV---TPEDAAKTAAERGVRVYPIGFGTDEPSSMVCSAAQLGRSS 260 Query: 373 -----------------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + L + ++ ++ +AD L V Sbjct: 261 DGAFGGFGGYAGPSGGSGGGGGGGRGYLVADEGSLQQVADTTGGRYFAAADADRLQTVLA 320 Query: 404 NISQLM 409 ++ + + Sbjct: 321 DLPRTV 326 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 30/228 (13%), Positives = 71/228 (31%), Gaps = 38/228 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-------LLLFLDSI-DLL 245 + +N D + N+ + K+ + F++ I + + Sbjct: 239 PGSYTCACRDGFTLNEDGKTCNACGASAVDLVFLIDGSKSVRPENFELVKQFINQIVESM 298 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++GL+ Y++ V + T K ++ V + T + A++ Sbjct: 299 DVGDHRAHVGLVQYSSSVRQEFPLGRYTSKKDIKSAVKKMSYMEK--GTMTGQALQY--- 353 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +F K I TDG + ++ ++ +AKE K Sbjct: 354 --------LIDNSFAISSG--GRPAVPKVGIVFTDGRSQDYIND-----AALRAKELGYK 398 Query: 364 IVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + + + + L+ S P E+ + + + I + + Sbjct: 399 MFAVGVG--NAVEEELRMIASEPVVEHSFYTADF----KAMKEIGKKL 440 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + ++ +K L ++S+ +GL+ Y + V+ K + Sbjct: 45 SSRSVRPSEFEQVKVFLSQVVESL---DVGANATRVGLVNYASTVKNEFSLKTHKAKPAL 101 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 Q V + T + A++ A I + +G ++ S K I + Sbjct: 102 LQAVKKVQPLST--GTMTGLAIQYAINIAFT----------EPEGARLKSPGINKVAIIV 149 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + I +A+E+ ++I I + + L+ S P E+ V + Sbjct: 150 TDGRPQDAVKD-----ISARARESGLEIYAIGVGRVD--KNTLRQIASEPLDEHVDYVES 202 Query: 395 ADSLIH 400 + Sbjct: 203 YSLIEK 208 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 91/253 (35%), Gaps = 38/253 (15%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 I+++ P + LVVD+S SM+ + + Sbjct: 19 IMTSILAWQPQMANASSPSASKVDAVLVVDVSNSMNTSDPGKIGN--------------- 63 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSL 287 A+ +F+D + ++ +G++ YT V++ + +E +Q + +D L Sbjct: 64 ----EAMKMFIDMLST-----QNDKVGIVAYTDVVQREKALLNITSEADKQELKTFIDGL 114 Query: 288 ILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD++ +K+A +IL K + + + +S Sbjct: 115 NRGAYTDTSVGVKEALRILQDG-KTAGHAPMIVMLADGNNDFNKTTGR-------TESQS 166 Query: 347 NVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + +AK + + I TI +NA N +L + + +AD L ++ Sbjct: 167 DQDMAQAVAEAKNSGVPIYTIGLNADGKLNKNKLADIAQQTGGKSFITSSADDLPNILSE 226 Query: 405 ISQLMVHRKYSVI 417 I + K V+ Sbjct: 227 IFA--SNLKLKVV 237 >gi|296474583|gb|DAA16698.1| collagen, type XXI, alpha 1 [Bos taurus] Length = 507 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L + R K + LTDG++ + + Sbjct: 118 KAIQFALDYLFAKSAR----------------FLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSRITLFAIGVGSETEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|151556217|gb|AAI49226.1| COL21A1 protein [Bos taurus] Length = 518 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L + R K + LTDG++ + + Sbjct: 118 KAIQFALDYLFAKSAR----------------FLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+++ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARDSRITLFAIGVGSETEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 76/232 (32%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 K++ ++ + P + ++D+SGSM + K Sbjct: 146 NPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSM-------------------DEPNK 186 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L +D +D V ++ Y ++ + G EK + + M+ Sbjct: 187 LPLLKSAFKLLVDELDEDDRVS------IVVYAGAAGLVLDSTPGNEKDK-ILDALMNLE 239 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K AY + + +S +I TDG+ N S+ Sbjct: 240 AGGSTAGAEGIKLAYDVAKKNFIKSGNN----------------RVILATDGDFNVGISS 283 Query: 348 V-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 +++ +K ++ I + + +++ + + N Sbjct: 284 EAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDNIAE 335 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 89/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + ++ + P Sbjct: 175 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTRDLP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIRSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 405 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 42/527 (7%), Positives = 122/527 (23%), Gaps = 130/527 (24%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISN-------------------------HAKRALIDDAKRFIK---- 99 + + SN DD I Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---M 151 + S+ + + + ++D ++ + + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLLNQRYN---------------------------------QKIVSFIP 178 + + + Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLNGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E + ++ V+ + +H S + ++ ++ + L Sbjct: 247 FAMGAEHPKENKCVLPFVLQNN--LHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTE 304 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + ++ ++K + + E+ ++ R T ++ + Sbjct: 305 ATLNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGL 364 Query: 299 KQA-YQILTSDKKRS--------------------------------------------- 312 A Q+L+ + Sbjct: 365 LTASNQMLSEKSRSEELNQETKRVILVLSDGNDDMSNLNLADLERNSIPFTNFSRITKNL 424 Query: 313 --FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFI 362 + + F + N +++ +C K+ Sbjct: 425 ILGKKEDLSSTATSNRAYYNRHSTFNYNTYLTNKTKDISRKGMCSIIQEKLNTLNKDKNT 484 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 K+V + KTCV + ++ N +SL++ F+ I + Sbjct: 485 KLVFVEFGYRSESADAWKTCVGN-GNYFYADNRESLLNSFKQAIGET 530 >gi|119512059|ref|ZP_01631153.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] gi|119463285|gb|EAW44228.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] Length = 435 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 49/193 (25%), Gaps = 9/193 (4%) Query: 228 MAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +A + A+ LD + + ++ + + I + Sbjct: 59 IATVIQAVEQLLDRLQPSDSETPTSGDRISVVAFAGEAQVIIPNQTLQDTASIKAQIHKK 118 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++ L K + F S D Sbjct: 119 LKASGGTAIAEGLQLGITELMKGTKGAVSQAFLLTDGHGESSLKIWKFEIGKD------- 171 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N ++ KA + + I T+ N L K + + + + F+++ Sbjct: 172 DNKRCLEFAHKATKINLTINTLGFGNDWNQDLLEKIADAGGGTLAYIERPEQALEQFRHL 231 Query: 406 SQLMVHRKYSVIL 418 Q + + + Sbjct: 232 LQRIQSVRLTNAY 244 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/263 (10%), Positives = 64/263 (24%), Gaps = 51/263 (19%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + Q + + G +P + ++D Sbjct: 185 YARPTAQEPPFKPTVTVVPS-----PWSQDRQLMHIGVQGYATPRAG-QPPLNLVFLIDT 238 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM ++ K AL + +D + V ++ Y Sbjct: 239 SGSMSGPD-------------------RLPLAKKALNVLIDQLRPQDRVS------MVAY 273 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + P+ G K++ T ++ AY + + Sbjct: 274 AGSAGAVLSPTDGKSKLKMRC-ALTALRSGGSTAGGQGLELAYALARQNL---------- 322 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRL 378 +I +TDG+ N ++ +++ + + + Sbjct: 323 ------DPKAVNRVILMTDGDFNVGIADPTRLKDFVADQRKSGVYLSVYGFGRGNYNDTM 376 Query: 379 LKTCVSSPEYHYNVVNADSLIHV 401 ++ + D L Sbjct: 377 MQALAQNGNG--TAAYVDGLQEA 397 >gi|310115525|ref|XP_003120135.1| PREDICTED: collagen alpha-1(XXII) chain-like [Homo sapiens] Length = 788 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + +G++ Y+ R E + + T++ Sbjct: 59 QWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + +++ I LTDG + + + Sbjct: 119 DALRYITARSFSP-----------HAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAA-- 165 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + + L+ S P+ + ++V + +++ + + + + Sbjct: 166 ---HRAGIRIFAVGVGEALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 219 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 75/226 (33%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + ++ +GLI + + + Sbjct: 98 DISNSMLAEDVSPSRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITGD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A++ + T + K Sbjct: 151 YVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTP------------------NSKVGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE+N + + +A+ IK+ + I + NG+ Sbjct: 193 AIILITDGEDNEGGA----EAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKDSNGE 248 Query: 377 RL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + + +V N++ V + + + + Sbjct: 249 PVKTHLNEDMCRRIAAAGQGVYVHVDNSNVADAV---LGKELGKLQ 291 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/377 (9%), Positives = 97/377 (25%), Gaps = 66/377 (17%) Query: 44 LDWHYKKNSMESANNAAI----LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + K +++ AA A + R++ ++ + + + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 100 NHIKESLSGYSAVFYN------TEIQNIVNSSRISMTHMANNRLDSS----NNTIFYNMD 149 + F N + + + N ++ + Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNKEPVPASKPIPFA 194 Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + + I + E P + ++D SGSM Sbjct: 195 MRCELAPA-------PWNEQRTLLKVDIL-AKDRKSEELPASNLVFLIDTSGSM------ 240 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ ++++L L + + + + ++ Y + Sbjct: 241 -------------ISDERLPLIQSSLKLLVKELREQDN------IAIVTYAGDSRIALPS 281 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 G+ K T+ ++ AYQ + Sbjct: 282 ISGSHKAEIN-AAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGIN-------------- 326 Query: 330 QKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPE 387 I+ TDG+ N + +I+ + K +E+ + + T + S + ++ + Sbjct: 327 --RILLATDGDFNVGIDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRIADVGNG 384 Query: 388 YHYNVVNADSLIHVFQN 404 + + V + Sbjct: 385 NYSYIDTLAEAQKVLNS 401 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/380 (9%), Positives = 106/380 (27%), Gaps = 30/380 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S ++ + +D I + + + F T + + Sbjct: 67 NSG-------RKANDYKLGGSNPNDDSFNISSEVGKRDHAIVTTFVKTFLPQTNDDKMNL 119 Query: 129 MTHMANNR-----LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + +S++ + + + + + + + Sbjct: 120 IPVCKTVNNTSGKGHTSSSEVTCTVSGTVEHKSWFPLK---VGNLEVIPKQVDVASKSKA 176 Query: 184 EMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + +V DLSGSM+ ++++ + K +K++ LK L+ D Sbjct: 177 FKKNTFNIPIDLMVVADLSGSMNFDLDNNETK-------KTGKPSKISILKEVLVELADK 229 Query: 242 IDLLSHVKEDVYMGLIGYT-----TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT-DST 295 L + + + + + + SW E + + Sbjct: 230 TLLSEDANQHNRIYVTPFALGAEINKNSCALPYSWDIESSSKIENIKKILNKENSQYNRA 289 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + +++ + + + V P F + ++ Sbjct: 290 DLINNLVYRISTKETLNNINGKQKYNVTFPKNTFCLKDMKTSNQGWYTRSDKSKFTSYVQ 349 Query: 356 KAKENFIKIVTISINASPNG 375 K + + + + + N Sbjct: 350 SIKASGATLASSGVLVAANN 369 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 630 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 686 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 687 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 735 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 736 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 789 Query: 406 SQLMVH 411 + + Sbjct: 790 KKGICE 795 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 669 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 726 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 774 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 775 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTTKHLFYAEDFSTMDEISEKL 828 Query: 406 SQLMVH 411 + + Sbjct: 829 KKGICE 834 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 63/195 (32%), Gaps = 25/195 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV--EKNIEPSWGTEKV 276 K+ + +D I V +G++ Y+ + S+ E V Sbjct: 1847 ASASFYEEGFQQEKDFIKSVIDKI---ELSSSGVRVGVLTYSDEAKIRIRFDYSFDKEDV 1903 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + T ++ A ++ + +K +I L Sbjct: 1904 KKAIDNIP--YDSMGTRIDLGLEAAKELFLEKSGGRGSS--------------KKVLILL 1947 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG+ + + + E + I I I N L+ +S P++ + +D Sbjct: 1948 TDGQQTYIPDAKDPVDYAKELAEYGVDIFAIGIGDEINKVD-LEDLISKPQHIFL---SD 2003 Query: 397 SLIHVFQNISQLMVH 411 + + ++S+ + Sbjct: 2004 DINSLITDLSKDIST 2018 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 72/221 (32%), Gaps = 20/221 (9%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D C DP D+ + + T+ + +E G+I Sbjct: 1239 DAPPPPGCFTFGDPTDIA-FILDSSRSVTR-DHFNRQKEFVKTILGEFPLGEELTQAGII 1296 Query: 258 GYTTRVEKNIEPSWGTEKVR--QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + I + + + S + A+K+A++ L + + Sbjct: 1297 KYGRTADIEINFGEFLTQTDLFNAIDKIKHSQADES-RLDLALKKAHEELFTSQGARSDK 1355 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + + V + +++ G N + + K K +E + I TI + A PN Sbjct: 1356 DIEKAIVILGD-------GYISGGGNRSRDLIESAKKEAAKLRELGVLIFTIGVGAEPNS 1408 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 LL+ S Y+ V + L I + + K S+ Sbjct: 1409 -LLLQNFASKKTYYITVKDYGQL------IGK-IGALKTSI 1441 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 49/172 (28%), Gaps = 17/172 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D S +++ + Y + + S + T A++ A Sbjct: 31 VDSFSISQQNARFAALVYGSNASVEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVA 90 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L S + + + DG S+ + K+ Sbjct: 91 KSDLFSLQGKV-------------RSRRPMILYVFFDGTVTRSMSD--LESVVQPLKDYG 135 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +KI+ I + N +L K S ++ + L + +I + K Sbjct: 136 VKIIAIGVGPEVNRYQLKKI--SEDNAIFSGKSFKELAPLLYSIVEQSCSEK 185 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 53/167 (31%), Gaps = 20/167 (11%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQA 301 + +G+I + + E I+ S + ++ + + T A+++A Sbjct: 1601 NFDVDNQLTRIGIIRFDSDAEIIIQLS-DHKTLKDLLNDIDSIRYNEGIQTRIDKALERA 1659 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + K ++ L DG+N+ + + + + + Sbjct: 1660 MEAFSEKNGGRADA--------------TKALVLLADGQNSFIEGSQDLNEELKPLIDAK 1705 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-HVFQNISQ 407 + I I + + L + + D L V + IS+ Sbjct: 1706 VFRYVIGIGRELDLKELEDIA--TNNIAIYADSFDELKFKVDEQISR 1750 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 68/205 (33%), Gaps = 29/205 (14%) Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 53 CENKRADVVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNE 110 Query: 267 IEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 +V + V R T + A++ A I S+ +G + Sbjct: 111 FSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGARP 158 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + I+ +TDG + + ++ KA++ I I I + LK S Sbjct: 159 LRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKAIGS 211 Query: 385 SP--EYHYNVVNADSLIH---VFQN 404 P ++ + V N + VFQ Sbjct: 212 EPHEDHVFLVANFSQIETLTSVFQK 236 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 63/186 (33%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T V + + + + + + Sbjct: 673 FEIVKQFVTGIIDSLA---ISPKAARVGLLQYSTLVRTEFTLRNFSSA--KDMKKAVAHM 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 728 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSARVPRVAIVFTDGRAQDDVS- 778 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KA+ + I + + + + + L+ S + ++ + + ++ + + Sbjct: 779 ----EWASKAQASGITMYAVGVGKAIEEE--LQEIASEPTEKHLFYAEDFSTMGEISDKL 832 Query: 406 SQLMVH 411 + + Sbjct: 833 QKGICE 838 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 66/241 (27%), Gaps = 64/241 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I + +D+S SM + + ++ + F+ Sbjct: 88 DPPTIVIALDVSPSMGAMDIPGKQ--------------RFQVARDVIREFVR-------S 126 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ E + + + T + + L+ Sbjct: 127 YPHMAVGLVLFGKEAMLEVPPTIDVEYFLERLEAVRLFSLGDGTALGMGVGISLLHLSRV 186 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + ++ LTDG+N + + AK+ I + T+ Sbjct: 187 NASF------------------RAVVILTDGKNTTGEI--LPETAAEMAKDLDIPVFTVG 226 Query: 369 INAS-----------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + + L + S ++ SL +FQ I Sbjct: 227 VGSDLPVSLDVIDPSTGTRYAGVLEEGYDEETLRRMAEMSGGQFFSGYTPTSLHRIFQYI 286 Query: 406 S 406 Sbjct: 287 G 287 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/526 (7%), Positives = 117/526 (22%), Gaps = 127/526 (24%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDR---------------------FESISNHAKRALIDDAKRFI-------KN 100 S + + ++ K L ++ + Sbjct: 67 NSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPLCKTI 126 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---MT 152 + S+ + V + N +++ ++ + + Sbjct: 127 NNNSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTFNIPI 186 Query: 153 SYDYRLQFIEHLLNQ---------------------------------RYNQKIVSFIPA 179 + + + Sbjct: 187 DLMVVADLSGSMKDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQESNEYNRIGITAF 246 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E + ++ V+ + + + K+ + L + Sbjct: 247 AMGAEHPKENKCVLPFVLQDNLHGMSKIKIKQYLASRYNRTTLKRTEFVDNFVALLDIEK 306 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + + ++ + + + ++ R T ++ + Sbjct: 307 TLNSIGQTNYDITFPKSSICLEGLKNASQFWYTKTENDKFRDRVNSLKANGGTLASSGLL 366 Query: 300 QA-YQILTSDKKRSFFT------------------------------------------- 315 A Q+L+ + Sbjct: 367 TASNQMLSEKSRSEELNQETKRVILVLSDGNDDMSNLNLADLERNSIPFTNFSRITKNLI 426 Query: 316 ----NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFIK 363 K + + + + N+ +C K+ K Sbjct: 427 FGRKEDLSSTPKTNKAYNTRSSTYQYNTYLTDKTKNIIGKGMCSIIQEKLNALNKDENTK 486 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 +V + KTCV ++ N +SL++ F+ I + Sbjct: 487 LVFVEFGYRSESADAWKTCV-GNGNYFYADNRESLLNSFKQAIGET 531 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 L+ ++ S + V +G++ Y+ E + + + + T + Sbjct: 117 FMLEVTEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGK 176 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ QI+ + K ++I LTDG + + ++ + Sbjct: 177 ALDYILQIIKNGMKDRM-------------SKVPCYLIVLTDGMSTD-----RVVEPAKR 218 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I A+ + L+ E N D+L + + + + K Sbjct: 219 LRAEQITVHAVGIGAANKIE--LQEIAGKEERVSFGQNFDALKSIKNEVVREICAEK 273 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 45/136 (33%), Gaps = 15/136 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + T + Sbjct: 487 INLTIHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYT 546 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + T + N ++ +I +TDGE+++ +T Sbjct: 547 AKALKHANALFTEEHGSRIKQN------------VKQMLIVITDGESHDHDQLNDT---A 591 Query: 355 DKAKENFIKIVTISIN 370 + + I I + + Sbjct: 592 LELRNKGITIFAVGVG 607 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 21/159 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +G++ ++ + ++ + + + D SLI + T + A+ Sbjct: 308 LTKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFV 367 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q T K L +KF+I +TDG + + + Sbjct: 368 GQYFT--------------HSKGARLGAKKFLILITDGVAQDDVRDP-----ARILRGKD 408 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ + + N + L+ ++V N D L Sbjct: 409 VTIFSVGV-YNANRSQ-LEEISGDSSLVFHVENFDHLKA 445 >gi|74002027|ref|XP_544943.2| PREDICTED: similar to anthrax toxin receptor 2 [Canis familiaris] Length = 646 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 60/191 (31%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ + + + + T Sbjct: 220 IYNFVQQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKNVSPVGETYI 278 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 279 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 321 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ ++ + + Q L+ S E + V +L + +I + Sbjct: 322 KISRSFGARVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 379 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 380 LELRPSSVCVG 390 >gi|75076662|sp|Q4R7B7|ANTRL_MACFA RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|67969305|dbj|BAE01005.1| unnamed protein product [Macaca fascicularis] Length = 557 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 59/173 (34%), Gaps = 23/173 (13%) Query: 228 MAALKNALLLFLDSIDLLSHV-----KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + ++D + D+ M I Y+T + + + +++ + + Sbjct: 79 LDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTDGQTVLPLTSDKNRIKNGLDQ 138 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T ++A Q + + + II +TDGE Sbjct: 139 LRKIVPDGHTFMQAGFRKAIQQIETFNSGN---------------KVPSMIIAMTDGE-L 182 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + +T++ KA++ + T+ + + + SPE+ + V N Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTV--DVADYKLDQITAIADSPEHVFAVENG 233 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/255 (12%), Positives = 75/255 (29%), Gaps = 60/255 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+SGSM ++M A + A FL + Sbjct: 87 TIMLAIDVSGSMRATDV---------------APSRMEAAQAAARAFLADL------PRH 125 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V +G++ + + P+ E + + R T + + + + Sbjct: 126 VKVGIVAFAGSAQIAQLPTTNREDLNAAIDRFQLQR---GTAIGNGIVLSLAAIFPEVGI 182 Query: 312 SFFTNFFRQGVKIPSL------------------PFQKFIIFLTDGENNNFKSNVNTIKI 353 + + + II LTDG+ ++ Sbjct: 183 DLGQFSYGRPGQPRGPSLDMPPAGPPPTPVAPGSYGSAAIILLTDGQRTTGID---PLEA 239 Query: 354 CDKAKENFIKIVTISINASP---------------NGQRLLKTCVSSPEYHYNVVNADSL 398 A E +++ T+ + + + L + +++ +A++L Sbjct: 240 AKLASERGVRVYTVGVGTVEGVTVGFEGWSMHARLDEESLKHIAQQTRAEYFHAASAEAL 299 Query: 399 IHVFQNISQLMVHRK 413 V++ +S + K Sbjct: 300 TQVYETLSSRLTIEK 314 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 71/219 (32%), Gaps = 43/219 (19%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + LV+D SGSM +D +K+ K+A ++ ++ Sbjct: 399 NKEKEKNIDVVLVLDHSGSMA--------------DTEDAGISKLEIAKSASAKMIEHLE 444 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G+I + E S K + + + T P + +A + Sbjct: 445 SSD------GVGVIAFDHNYYWAYEFSKLVRK-KDVIESISSIEVGGGTAIIPPLSEAVK 497 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K +S I+ LTDG + ++AK N IK Sbjct: 498 TLKKSKAKSKL------------------IVLLTDGMGEQGGYEIP----ANEAKRNNIK 535 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I TI + N L + Y V N L+ VF Sbjct: 536 ITTIGVGKFVNLPVLSWIASFTSGRFYLVSNPYELVDVF 574 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/252 (14%), Positives = 80/252 (31%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K+V I + P + ++D+SGSM ++ K Sbjct: 205 NSDTKLVKIGLQGKTIPLENVPASNLVFLLDVSGSM-------------------GQQNK 245 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L +++ + ++ Y +EP+ G +K + Sbjct: 246 LPLLKSAFKLLTNNLREQD------KISIVVYAGSSGVVLEPTSGDQKTKIE-EALDKLS 298 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY+I + ++ +I TDG+ N S Sbjct: 299 AGGSTAGGEGIELAYKIAKDNFIKNGNN----------------RVILATDGDFNVGLSS 342 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS----LIHV 401 + + + +E+ I + + L++ + H + + L Sbjct: 343 DKAMEDLIKEKRESGIFLTALGFGMGNYKDSKLESLAQTGNGNHAYIDSMQEAQRVLETE 402 Query: 402 FQ----NISQLM 409 F I++ + Sbjct: 403 FGGTLYTIAKDV 414 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 60/180 (33%), Gaps = 22/180 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ ++ + +++ +GL+ +++ + + + KV + T Sbjct: 1049 MKDFMERMVNQSNIGADEIQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTH 1108 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ S + +++I +TDG + + + Sbjct: 1109 TGKALNFTLPFFDSSRGG--------------RPRVHQYLIVITDGVSQDNVAPP----- 1149 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ I I I + Q L + + + N +SL + + I + + Sbjct: 1150 AKALRDRNIIIFAIGVGNVQRAQLL--EITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 65/222 (29%), Gaps = 28/222 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I S C + K +K+ + + Sbjct: 827 MSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFH---IGP 883 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V G++ Y+ ++ + D T + A+ + + + Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTA 943 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +++I +TDG++ + + + ++ + I I + Sbjct: 944 RVD----------------VARYLIVITDGQSTDPVA-----EAAQGLRDIGVNIYAIGV 982 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L+ S + + + DSL + Q + + + Sbjct: 983 RDANTTE--LEEIAS--KKMFFIYEFDSLKSIHQEVIRDICS 1020 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 50/168 (29%), Gaps = 18/168 (10%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + V +GL+ Y+ + + K + T + A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEF----- 320 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + ++ +TDGE S+ + D+ K + + + Sbjct: 321 ------IRANSLTEMSGSRAKDGVPQIVVLVTDGE-----SSDEVQDVADQLKRDGVFVF 369 Query: 366 TISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 + IN + Q L K E + N L + + Q + Sbjct: 370 VVGINI-QDVQELQKIASEPFEEFLFTTENFSILQALSGTLLQALCST 416 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 44/172 (25%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L ++++ +GL Y S K A++ Sbjct: 59 ANSLQVGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGNRMGQALQF- 117 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +F S + + ++ G + + + + Sbjct: 118 ----------ILEHHFREGAGSRASQGVPQVAVVVSSGLAEDHI-----REPAEALRRAG 162 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 I + I + + L + S + + V N L + Q + + Sbjct: 163 ILVYAIGV-KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCST 213 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 60/180 (33%), Gaps = 22/180 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F++ ++ + +++ +GL+ +++ + + + KV + T Sbjct: 1049 MKDFMERMVNQSNIGADEIQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTH 1108 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ S + +++I +TDG + + + Sbjct: 1109 TGKALNFTLPFFDSSRGG--------------RPRVHQYLIVITDGVSQDNVAPP----- 1149 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ I I I + Q L + + + N +SL + + I + + Sbjct: 1150 AKALRDRNIIIFAIGVGNVQRAQLL--EITNDQDKVFQEENFESLQSLEKEILSEVCSSQ 1207 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 65/222 (29%), Gaps = 28/222 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + I S C + K +K+ + + Sbjct: 827 MSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPNDFIEMKDFMKEVIKMFH---IGP 883 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + V G++ Y+ ++ + D T + A+ + + + Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTA 943 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +++I +TDG++ + + + ++ + I I + Sbjct: 944 RVD----------------VARYLIVITDGQSTDPVA-----EAAQGLRDIGVNIYAIGV 982 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L+ S + + + DSL + Q + + + Sbjct: 983 RDANTTE--LEEIAS--KKMFFIYEFDSLKSIHQEVIRDICS 1020 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 50/168 (29%), Gaps = 18/168 (10%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + V +GL+ Y+ + + K + T + A++ Sbjct: 266 DVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEF----- 320 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + ++ +TDGE S+ + D+ K + + + Sbjct: 321 ------IRANSLTEMSGSRAKDGVPQIVVLVTDGE-----SSDEVQDVADQLKRDGVFVF 369 Query: 366 TISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 + IN + Q L K E + N L + + Q + Sbjct: 370 VVGINI-QDVQELQKIASEPFEEFLFTTENFSILQALSGTLLQALCST 416 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 44/172 (25%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L ++++ +GL Y S K A++ Sbjct: 59 ANSLQVGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGNRMGQALQF- 117 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +F S + + ++ G + + + + Sbjct: 118 ----------ILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHI-----REPAEALRRAG 162 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 I + I + + L + S + + V N L + Q + + Sbjct: 163 ILVYAIGV-KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCST 213 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 60/179 (33%), Gaps = 27/179 (15%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +K +L++ L + ++ LI + + T + + Y T+ + Sbjct: 176 KIEMVKASLIVLLQFLGDNDRLQ------LITFDNDAHRLTPLKTVTNQNKSYFTQIIKQ 229 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + A K A+ L S K + T+ + L+DG + + Sbjct: 230 IKANGGNRISEATKMAFYQLKSRKYINNVTS----------------VFLLSDGVDYTYP 273 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N I+ ++ + T + Q + + C Y V + L F + Sbjct: 274 EVKNQIQTVNEV----FTLHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 328 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 89/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + ++ + P Sbjct: 175 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTRDLP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIRSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 405 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 630 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 686 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 687 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 735 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 736 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 789 Query: 406 SQLMVH 411 + + Sbjct: 790 KKGICE 795 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|303252178|ref|ZP_07338346.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302648961|gb|EFL79149.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 532 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 44/518 (8%), Positives = 122/518 (23%), Gaps = 118/518 (22%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISNH-------------AKRALIDDAKRFIKNHI------------- 102 S + SN + + Sbjct: 67 NSGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMATKFVKAFLPQTNEEKMHLTPVCKT 126 Query: 103 ---KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL-----DSSNNTIFYNMDV---M 151 S+ + + + + + ++ + + + Sbjct: 127 INNTNGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPHEVNVASKSKAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLL----------------NQRYNQKIVSFIPALLRIEMGERPIFLI-E 194 + + I LL + + + Sbjct: 187 IDLMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTP 246 Query: 195 LVVD---LSGSMHCAMNSDPEDVNSAPICQDKKRTK------------MAAL--KNALLL 237 + + S + D P+ +K + K Sbjct: 247 FALGAEIDASSCALPYHWDKGKDPDTPLKIKNILSKKEGNNQQSRAEFIDNFVYKMNTQA 306 Query: 238 FLDSIDLLSHVKEDVYMGLIG----------------YTTRVEKNIEPSWGTEKVRQYVT 281 LD+I + K G+ + + Sbjct: 307 TLDNIGEKQNYKVTFPKGVFCLKNMKNKNHGWFSRKDKSDFTGYVRSLRADGATLASSGV 366 Query: 282 RDMDSLILKPTDSTPAMKQAYQ----ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF-- 335 + ++K T +K+ + +L+ + + + + +I+ Sbjct: 367 LVAANNMIKGGSRTKELKEQTKRVILVLSDGNDEIVKGDPNSKVPFLNYTRITENLIYGR 426 Query: 336 ----------LTDGENN------NFKSNVNTIKICDKAKEN--------FIKIVTISINA 371 ++ G + + T +C+ ++ KIV + Sbjct: 427 QEEFLSGKKKVSFGNTTIETYLTDTQPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGY 486 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 + + K CV + +Y+ + SL++ F+ I + Sbjct: 487 ASKSKEAWKHCVEGDKNYYSANDKASLLNSFKQAIGET 524 >gi|119612173|gb|EAW91767.1| matrilin 2, isoform CRA_c [Homo sapiens] Length = 451 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 630 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 686 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 687 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 735 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 736 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 789 Query: 406 SQLMVH 411 + + Sbjct: 790 KKGICE 795 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPFSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|77553311|gb|ABA96107.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] Length = 529 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 15/155 (9%), Positives = 37/155 (23%), Gaps = 11/155 (7%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 G ++ V + T+ + ++L + + + Sbjct: 150 DGKATAKRAVDALV---ANGNTNIRDGLDVDAKVLDGRRHTDAVASVILLSDGQDNQTMG 206 Query: 331 KFIIF-LTD--GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 F +TD ++ V E + + + + Sbjct: 207 YRGRFHMTDFKAAATSYDVLVPPSFTRAGGGERCAPVHAFGFGTDHDAAAMHSISEITGG 266 Query: 388 YHYNVVNADSLIHVFQN-ISQLMV----HRKYSVI 417 + N + F I L+ + + SV Sbjct: 267 TFSFIENLAVIQDTFARCIGGLLSVAAQNARISVE 301 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 109 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 166 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 167 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 214 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I I + LK+ Sbjct: 215 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDFN--TLKSI 267 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 268 GSEPHEDHVFLVANFSQIETLTSVFQK 294 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 731 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 787 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 788 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 836 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 837 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 890 Query: 406 SQLMVH 411 + + Sbjct: 891 KKGICE 896 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/253 (13%), Positives = 75/253 (29%), Gaps = 48/253 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ + + L++D SGSM + + K Sbjct: 48 TTQEETEIALNITGTPPANVVMPNDVVLIIDKSGSMAPTYGPNNGED------------K 95 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M K A F+D +D+ H + + + + + + Y+ Sbjct: 96 MTNAKEAAKGFVDLMDMTKHRV------AVVDFSSSASSFPFTVDKDAAKSYINTINS-- 147 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----N 343 T + A+ A +L Q I+ +TDG N Sbjct: 148 -GGGTATGNAIDAAVALLAD-----------------HRTEAQPVIVLMTDGAATESPKN 189 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNG------QRLLKTCVSSPEYHYNVVNADS 397 ++ AK+ + TI++ L+K ++ +H+ V+ + Sbjct: 190 TDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPITSAPNVLMKNMATTATHHHFVLGSKG 249 Query: 398 LIHVFQNISQLMV 410 L ++ I + + Sbjct: 250 LNQIYAAIVKEIG 262 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 65/219 (29%), Gaps = 57/219 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + QD K ++ + A++ F+ ++ +GLI + + Sbjct: 102 DTSRSMLSQDLKPNRLERARYAVMDFVATLG-------GDRVGLIPFAGSSYLMCPLTLD 154 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + + + T+ + A + + K Sbjct: 155 YQAFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSN-------------------HK 195 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I LTDGEN +K D AK+N + I TI + + Sbjct: 196 ILIILTDGENLQGDV----LKAADLAKKNGLTIYTIGVGTAAGELIPGGPGGAFIRDSSG 251 Query: 374 -------NGQRLLKTCVSSPE-YHYNVVNADSLIHVFQN 404 + + L + + N L ++ + Sbjct: 252 KYVKSKLDEETLQEIAEKTGGISVLLGNNNQGLKKIYTD 290 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 72/228 (31%), Gaps = 38/228 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA-------LLLFLD-SIDLL 245 + + +D ++ + K+ + +++ ID L Sbjct: 239 PGSFKCACREGFTLMNDSRSCSACSNAATDVVFLIDGSKSVRPENFELVKKWINLIIDKL 298 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + ++GL+ Y++ V++ K +++ V R + + T + A+ Sbjct: 299 DVSETNTHVGLVQYSSTVKQEFPLGRHNSKRSLKEAVKRMD--YMERGTMTGHALSFLVD 356 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K I TDG + ++ + KAK K Sbjct: 357 -------------NSFGPNQGARPGVPKVGIVFTDGRSQDYIGD-----AAKKAKALGFK 398 Query: 364 IVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + + + + L+ S P ++++ + I++ + Sbjct: 399 MYAVGVG--NAVEDELREIASEPIADHYFYTADF----KTMNQIAKKL 440 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 64/201 (31%), Gaps = 30/201 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ +K L +D + + +G++ Y +RV+ + K + Sbjct: 45 SSRSVRPSEFEQVKVFLAKVIDGLS---VGPDATRVGVVNYASRVKNEVSLKS--HKTKA 99 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T + + + + F S K I +TD Sbjct: 100 ALVKAVSKIEPLSTGTMTGLAIQFAM---------NVAFSEAEGGRKSPDISKVAIIVTD 150 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + N I +A+E I+I I + L+ S P ++ V + Sbjct: 151 GRPQD-----NIRDIAARAREAGIEIFAIGVGRVD--MTTLRQMASEPLEDHVDYVESYS 203 Query: 397 SL-------IHVFQNISQLMV 410 + F + + Sbjct: 204 LIEKLTKKFQEAFCAVVSDLC 224 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 76/246 (30%), Gaps = 43/246 (17%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + R +L + + I + + S D++++ + D Sbjct: 49 MASPRRKLVKFVLAELILTLLILIVARPQIGNKIANNTSREGIEVIMALDISNSMLATDV 108 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +++ K + ++ +GLI + + + + ++ Sbjct: 109 VPSRLDKSKLMVEGLMNKFTK-------NKLGLIVFAGDAFVQLPITSDYVSAKMFLDNI 161 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 SLI + TD A+ A T + K I+ +TDGE+N Sbjct: 162 NPSLIGTQGTDIGKAINLAMHSFTP------------------NTQTGKAIVVITDGEDN 203 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + +A+E IK+ + I S+ + N + L Sbjct: 204 EGGA----EAMAKQAQEKGIKVFILGIG-------------STQGTTIPMPNGEDLRDAN 246 Query: 403 QNISQL 408 NI + Sbjct: 247 GNIVKT 252 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 72/244 (29%), Gaps = 51/244 (20%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 Q + + + + E+P I LVVD SGSM Sbjct: 427 DSFGSGGYQGTRIEKIMPVRF-DSEKQREQPHVAIALVVDRSGSM--------------- 470 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ A K + + + + ++ + + + + ++R Sbjct: 471 -----SGLKIEAAKESARATAEVLSPSD------LITVVAFDNQPTTIVRLQRASNRMRI 519 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 T+ PA+++AY+IL + K +I L+D Sbjct: 520 ATD-IARLQAGGGTNIYPALREAYEILQGANAKV------------------KHVIVLSD 560 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G+ +C + + I + + I + L + Y + +L Sbjct: 561 GQA----PYDGIADLCQEMRSARITVSAVGIG-DADRNLLNLITDNGDGRLYMTDDLAAL 615 Query: 399 IHVF 402 +F Sbjct: 616 PRIF 619 >gi|32475885|ref|NP_868879.1| hypothetical protein RB9427 [Rhodopirellula baltica SH 1] gi|32446428|emb|CAD76256.1| hypothetical protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 484 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 78/271 (28%), Gaps = 50/271 (18%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + ++ V ++ ++ + + ERP + LV Sbjct: 33 MTATPLHQASAEQVKLDVRLVHPVMK--AGEKQTNHLRIALTGFELKSAEERPPVNVCLV 90 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SGSM K+A K A +D + V + Sbjct: 91 LDHSGSM--------------------SGQKLARAKEAAEAAIDRLSDDDIVS------V 124 Query: 257 IGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + Y + V + + ++Q + T + + Sbjct: 125 VLYDSNVTVLVPATKATDRSSIKQKIRGI---QAGSSTALFAGVSKGAA----------- 170 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASP 373 + K + +I L+DG N KS + + I + T+ + + Sbjct: 171 -----EVRKFLADEQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSGY 225 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N ++ H + +ADSL+ VF Sbjct: 226 NEDLMVALASVGGGNHAFIEDADSLVSVFNQ 256 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 72/213 (33%), Gaps = 28/213 (13%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 S ++ +S I + + K+ + + S +E ++G+I Y+T Sbjct: 77 SCPIPIDLAMVLDSSGSIGKKDWVKLLEFTKSVVDAY-------SVSEEATHVGVITYST 129 Query: 262 RVEKNIEPSWGT----EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 +I + V D+ T A++ A +L ++ + Sbjct: 130 EATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKALELANSVLFTEARGM----- 184 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 ++ +FLTDG + + K K+ I + T+ + + Sbjct: 185 --------RPNKKQVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGIDVYTVGVGDDVDLF 236 Query: 377 RLLKTCVSSPEYH-YNVVNADSLIHVFQNISQL 408 LL SS + + Y+ N D L Q I Q Sbjct: 237 ELLSI--SSGDKYTYSAKNFDELQAKVQEILQE 267 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 60/208 (28%), Gaps = 27/208 (12%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-- 274 +++ +K + ++GL+ Y + Sbjct: 281 GIPNSQSLGSEITNVKQVMSRLAGLF---DVANSKAHIGLVAYDQGAHMALPLDKAYSVG 337 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V+Q + S D+ A+K A S T + Sbjct: 338 AVQQAINNLATSKKHGF-DTAAAIKVAADHAFSMFGGVRQTQPK--------------VF 382 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV- 393 L + + + +K K+N I+++ + I+ S + + V+ P + + Sbjct: 383 VLFAPRGSTSTA-AEIKEAAEKLKKNGIRLMVVGIDNSADTATY-SSAVTQPAKRFLMNT 440 Query: 394 -NADSLIHVFQNISQLMVHRKYSVILKG 420 + D L I+ + S + G Sbjct: 441 KDYDDLNAAVWEIADTVCK---SAVTPG 465 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 56/183 (30%), Gaps = 24/183 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPT 292 + F + +E+ ++ + Y + + + + + R T Sbjct: 808 VKDFTSKLIGAFDISRENTHVAVGTYRSSGRSVVRFNDAASSSKDSVLARLERVTFQGGT 867 Query: 293 D--STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + A+ A I + K K ++ + G + Sbjct: 868 YGLTKAALSTANDIFDAK--------------KGMRPGQNKTLVIIGTGHVD---EPQQA 910 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 ++ + N I I ++I N + L+ VS + + V + D+L+ + + Sbjct: 911 RELAQDLRSNMIDIFAVTIGDDANYRA-LRDVVSREAAKNVFAVTSGDALVAQLRALVNT 969 Query: 409 MVH 411 + Sbjct: 970 VCD 972 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 40/377 (10%), Positives = 96/377 (25%), Gaps = 35/377 (9%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A AI A+ + + + + S Sbjct: 338 AVQQAINNLATSKKHGFDTAAAIKVAADHAFSMFGGVRQTQPKVFVLFAPRGSTSTAAEI 397 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL---QFIEHLLNQRYNQK 172 E + + I + + + + + + + + + Sbjct: 398 KEAAEKLKKNGIRLMVVGIDNSADTATYSSAVTQPAKRFLMNTKDYDDLNAAVWEIADTV 457 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGS---MHCAMNSDPEDVNSAPICQ-------- 221 S + + VD + C S +A Sbjct: 458 CKSAVTPGKCRTPDAGDTGGAQCEVDKDCGVDQLCCDDGSGKTSCKTAIKNCFVPFEMAI 517 Query: 222 ----DKKRTKMAALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + ++ ++ + F+ + + + + +++ GL+ + + + G K Sbjct: 518 AMDASETVSRQDFVR--MKSFVRDLMWPNANSENNIHFGLMTF---AGTTKKVTEGFRKF 572 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R D ++ T + + K + F +G + ++ Sbjct: 573 RSEQELDELLDKIEKTQDPQ------RRVDKTFKFASKEFFSMEGGTRHGHE-RYLLVLA 625 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +D + K D I + T S L+ S +Y Y + D Sbjct: 626 SDATSPGSGDLNEAAKDLDLLNVKRIAVAT----NSDVQSTFLRAVASDDKYVYQAKSTD 681 Query: 397 SLIHVFQNISQLMVHRK 413 L V I+Q++ K Sbjct: 682 ELGQVSTQINQILCKEK 698 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 64/189 (33%), Gaps = 31/189 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A FLD +GLI Y+ +V + + KV + R Sbjct: 528 LDEAREAARSFLDQ-----CDFTTTRVGLISYSDQVVLQTDLTDNVRKVEAGLARL---E 579 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A++ + L K+++ LTDG ++ + Sbjct: 580 ADGTTNLAGALELGRRKL-----------------ATVPTGHVKYLVVLTDGYPDD--PD 620 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ AK + I+IV I + L + + + L+ F +I++ Sbjct: 621 NALLEAAH-AKGSGIEIVAIGTG-EADQAYLDRIASTQAGSIFA--RKGELVRAFGHIAR 676 Query: 408 LMVHRKYSV 416 ++ S+ Sbjct: 677 VIAEGGRSL 685 >gi|149921110|ref|ZP_01909568.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] gi|149817997|gb|EDM77456.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] Length = 532 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 39/356 (10%), Positives = 100/356 (28%), Gaps = 52/356 (14%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LA S +++ + + A + D + + + Sbjct: 8 ALAALSMTLTSTAVWQITAPADEVVADASDADASDTDPVAAEEPGPTADGGDTKVEADAP 67 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 ++ S + + + T ++ +F A+ Sbjct: 68 PLDQSTFLAGKTLMVEGRVGHGRMLADARGETFLYVDVRNELSAAGSNTLDARGNFRDAV 127 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 R + +V+D SGSM ++R + A Sbjct: 128 ARTSAEP---LNLAIVIDHSGSM----------------KGQRERNALDAA--------- 159 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTP 296 ++S +++ + ++ Y T+ + + ++V + + S T + Sbjct: 160 -AGMISRLRDGDTVSVVSYNTKAHTIVPVTTLDARNRDRVISDLRVGVASRPSGNTCVSC 218 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICD 355 ++ Q L + ++ L+DGE N + ++ Sbjct: 219 GVEAGLQTLQGRRPGI------------------DRMLLLSDGEANRGVRDEPGIRRLAR 260 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +A+ + I +I ++ N + + HY +L +F ++ Sbjct: 261 EARNRGVSISSIGVDVDYNEVLMSAIAREANGRHYFSETGSNLDAIFDQELDSLIQ 316 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 71/227 (31%), Gaps = 31/227 (13%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LV+D SGSM ++ T++ K A + LD Sbjct: 407 RKPANVLLVIDTSGSMQESV-------------PGTGSTRLELAKEAAITSLDEFSDSD- 452 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GL ++T + R+ V + T + + L Sbjct: 453 -----RVGLWMFSTD------LEDNGQDWRELVPLGPLGASVNGTPRREELAERISNLPP 501 Query: 308 DKKRSFFTNFFRQG---VKIPSLPFQKFIIFLTDGEN---NNFKSNVNTIKICDKAKENF 361 + + ++FLTDG+N N I + + Sbjct: 502 GGGTGLYDTALAAHTLVAEHSRPDAINAVVFLTDGKNEDLNGISLEKLLDSITPEPGQQG 561 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++I TIS + + + + ++ Y+ + S+ VF+ + Sbjct: 562 VRIFTISYGEDADLKTMTQIAEATNAAAYDASDPQSIDEVFEAVISN 608 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 46/193 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 E+ + + ++D S SM + D + ++++ K A+L F+ Sbjct: 86 TTEVSKALGEDVMFLLDCSKSM---------------LAADVQPSRLSRSKYAILDFVQQ 130 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQ 300 +GL+ + + ++ + R + ++ + TD A+ + Sbjct: 131 HGRG-------RVGLVAFAGQAFLQCPLTFDYDAFRDALLAIDEQTIPVGGTDIGRALDE 183 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 AY+ + + + K ++ +TDGE+ IK E Sbjct: 184 AYRAMEKNDR-------------------HKILVLITDGEDLEKAG----IKTAQALAEK 220 Query: 361 FIKIVTISINASP 373 I + TI + + Sbjct: 221 GIVVYTIGVGTAA 233 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 89/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + ++ + P Sbjct: 175 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTRDLP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIRSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGAN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 405 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 78/231 (33%), Gaps = 56/231 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + ++ K A+ + + + +GL+ + + + + Sbjct: 97 DVSNSMLAEDFEPNRLERTKYAIGKLFEGL-------QQDRVGLVVFAGEPKVQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R + R SL+ + T A++QA + + Sbjct: 150 YRMARAFARRIDPSLVSVQGTAIGKALEQALLAFS----------------GDTEQSHGR 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + + I + ++A + +KI TI I Sbjct: 194 VIILITDGENH----DDDAIAVAERAAQMGVKIFTIGIGTPEGAPIQIGGEFIKDEAGEM 249 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ----LMVHRKY 414 N + L + + + L + + I++ + ++ Sbjct: 250 VVSKLNEEMLARIADITGGAYVRSSKQSIGLDEIVKAINEMEQTELSTVRF 300 >gi|145491135|ref|XP_001431567.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398672|emb|CAK64169.1| unnamed protein product [Paramecium tetraurelia] Length = 590 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 55/179 (30%), Gaps = 27/179 (15%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-D 285 K+ +K +L++ L + ++ LI + + T + + Y T+ + Sbjct: 184 KIEMVKASLIVLLQFLGDNDRLQ------LITFDNDAHRLTPLKTVTNQNKSYFTQIIKQ 237 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + A K + L G K + + L+DG + + Sbjct: 238 IQADGGNRISEATKMTFYQLK--------------GRKYINNVTS--VFLLSDGVDVTYP 281 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 IK ++ + T + Q + + C Y V + L F + Sbjct: 282 EVKKQIKTVNEV----FTLHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 336 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 86/242 (35%), Gaps = 48/242 (19%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + FIP+ P + V+D SGSM + ++ RT Sbjct: 17 LQLGFIPSQKVYASTSSPNVDVVFVLDSSGSMKQSD-------------PEEIRT----- 58 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILK 290 A+ +FLD + GL+ Y+ + K +E ++ + ++ L Sbjct: 59 -EAIKMFLDM-----GQVQGNKAGLVAYSDTIIKEHNLDAINSESDKERIEEMASNIPLG 112 Query: 291 P-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD + + ++L S + II L+DG+N++ +S Sbjct: 113 QKTDIGRGLLEGAKVLDSG----------------HDSNNRPLIILLSDGKNDSQRSASE 156 Query: 350 TIKICDKA----KENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++K + A K + TI +N + +L + + +Y A L + + Sbjct: 157 SLKDLNSAISTCKSKGYPVYTIGLNYDGTVDKAQLTQIASETKGKNYITNKASDLTDILK 216 Query: 404 NI 405 +I Sbjct: 217 DI 218 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/338 (8%), Positives = 89/338 (26%), Gaps = 23/338 (6%) Query: 15 IASEKANFSIIFALSVMSFLL-LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + + ++ L + S ++ L+G I + +++A + A + + + Sbjct: 8 RSRKSGSVMVLITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLLGTPAV 67 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 E ++++ + A + + + ++ + +I + + + Sbjct: 68 PETLAAEFLASNFRTDGS--AGTWGAHDLHSTIVYTPGITKIIDIDATAQVPLLFLRILG 125 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR--------YNQKIVSFIPALLRIEM 185 + + + N K V+ + Sbjct: 126 KTSATVRARGSGTR--TDSRVMLVIDRSGTMDVSDGTGLPTRIENAKTVAQTLFIPAFTE 183 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G I L+ D S + + D + + N ++ ++ + Sbjct: 184 GADEIGLVAF--DGSAYVAYPPSQPGWDPTTTSSSRGGPDMYFKDPNN-PNNMINQVNAI 240 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 G T ++ +D + T + QA + Sbjct: 241 DA-------GSYTGTAEALWMAYIELQKAHLKDLAQDGVDLRMNSILLLTDGLPQALAVS 293 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + ++ + F+T G + Sbjct: 294 LNTTGSNVLKATSPCTNRVGPPNKPAMVGFVTGGPSTT 331 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D ++ K + LD + +G++ + + + Sbjct: 97 DISNSMLAEDLSPNRLQFAKLTIHRLLDYLAES-------KVGVVVFAGNAYMQLPITTD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ V D L + T A+ + + K Sbjct: 150 LAMAKKMVDDANPDMLSNQGTAIASAIDLSLGSFSD------------------RHDVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 II TDGEN+ + ++ KAK +K+ TI++ + Sbjct: 192 AIILFTDGENHEGDA----LEAAKKAKSQGVKVYTIAVGS 227 >gi|149701450|ref|XP_001492635.1| PREDICTED: anthrax toxin receptor 2 [Equus caballus] Length = 488 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ Q + + T Sbjct: 62 IYDFVKQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRGKISQGLEDLKHVRPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + K L II LTDG + K Sbjct: 121 HEGLKLANDQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ ++ + + Q L+ S E + V +L + +I + Sbjct: 164 KLSRSLGARVYCVGVLD--FEQAQLERIADSKEQVFPVTGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELRPSSVCVG 232 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 76/240 (31%), Gaps = 48/240 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + ++ E P + LV+D SGSM Sbjct: 380 MGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSMD------------------- 420 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 TK+ K + + ++ + +G++ + + + P K + ++ Sbjct: 421 -GTKIELAKESAMRTVELLRSKD------TVGVVAFDDQPWWVVPPQKLGNK-EEVLSSI 472 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ PA+ A + + ++ II +TDG++ Sbjct: 473 QSIPSAGGTNIYPAVSSALEEMLKI------------------KSQRRHIILMTDGQSAM 514 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + D EN I + ++++ + L ++ +Y V + +L VF Sbjct: 515 NSGYQDLT---DTMVENKITMSSVAVGTDADTHLLQSLAEAAKGRYYFVEDETTLPAVFS 571 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/270 (13%), Positives = 75/270 (27%), Gaps = 48/270 (17%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 ++V + + +L ++ + ++ R + LV+D S Sbjct: 51 AKAATQVVNVKSELAAPV-----MLANSEDKNYLKISLTGFNLDSTRRSPINLALVIDRS 105 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 SM ++ + +L ++ +++ + ++ Y Sbjct: 106 TSM--------------------SGERIEKAREEAILAVNMLNITD------TLSVVAYD 139 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 E I + T+K + T + Q Sbjct: 140 NHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGI----------------GQ 183 Query: 321 GVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + II ++DG+ N S + A + I I TI + N + Sbjct: 184 VDKHLNREQVNRIILISDGQANTGPTSISELSDLARMAAKKGIAITTIGLGQDYNEDLMT 243 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 S H V N+ L F Q + Sbjct: 244 AIAGYSDGNHTFVANSADLEKAFTKEFQDV 273 >gi|319954909|ref|YP_004166176.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319423569|gb|ADV50678.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 703 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 72/231 (31%), Gaps = 52/231 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + ++D+SGSM+ + K+ LK+A L + + V Sbjct: 343 PASNLTFLIDVSGSMN-------------------NQNKLPLLKSAFKLLVHQLREKDRV 383 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ Y +EP+ G +K + + + T ++ AY + + Sbjct: 384 S------IVVYAGAAGVVLEPTGGNDK-EKIIKALDNLSAGGSTAGGEGIELAYALAEKN 436 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTI 367 K +I TDG+ N S+ + + ++ +++ I + + Sbjct: 437 FK----------------PNKNNRVIMATDGDFNVGASSDKDMETLIEEKRKSGIFLSVL 480 Query: 368 SINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN--------ISQLM 409 L+ H + +F I++ + Sbjct: 481 GFGMGNYKDSKLEKLADKGNGNHAYIDTMQEAQKIFGEEFGGTLYTIAKDV 531 >gi|153006798|ref|YP_001381123.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] gi|152030371|gb|ABS28139.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] Length = 381 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 23/164 (14%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +G E+ ++I A+ ++ + + V + +++A++A L GA + + Sbjct: 4 RGKRRERGAVAVIVAIVLLVLGGFMALSLNVGHLLSVRGELQNASDAGALGGAWDLDCTV 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKES---------------------LSGYSA 111 + + ++A R D + + E+ L + Sbjct: 64 DGIAKARVAALDYATRHSTDYQPLLPSDTMVETGYWSVDKERFWPISDPGSNPELVSLTN 123 Query: 112 VFYNTEIQNIVNSSRISMTHM--ANNRLDSSNNTIFYNMDVMTS 153 + N++ + NN + I T Sbjct: 124 AVRVRSTRVDANAAPVFFPVFLGGNNTANVGAGAIAVRGGPCTE 167 >gi|315647020|ref|ZP_07900133.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] gi|315277222|gb|EFU40551.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] Length = 471 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/389 (11%), Positives = 103/389 (26%), Gaps = 55/389 (14%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSM--ESANNAAILAGASKMVSNLSRLGDRFESI 82 +F L L G KN + + A+ + + Sbjct: 6 LFILIAFIVF-LAGCS-----HEQDKNVLTNQDASES-------IQKDEPENNNSNNHNS 52 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + + + ++ E+ + + + S + +N Sbjct: 53 PDSDLNEPHSEIQNPPDSNHNETQTSLTDEEILLKPPGRFAGSNYDEQKVQAALDQLPSN 112 Query: 143 TIFYNMDVMTSYDYRLQF----IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + + V+ +I + I +++D Sbjct: 113 LTTDQYKDELLLLLAEDYRPYVTTFM--NFDTEISVNNERPEGKITLPVNRNLHISILLD 170 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 SGSM +N +KM + K A+ F D + + + L Sbjct: 171 ASGSMKAQINGK---------------SKMDSAKEAIQTFADKL------PNNAEVSLRV 209 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 Y + + + R + + + +K A Q + Sbjct: 210 YGHKG------TGDQKDKRVSCDSTEEIFHGQGDQTNQ-IKTALQKVEPAGWTPIANALQ 262 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQ 376 I + ++DG ++ + ++ +K + + + GQ Sbjct: 263 SVKKDINPETTDSVVYVVSDGIETCG---GKPAQVAKELHQSKVKTIVNIVGFDVDNEGQ 319 Query: 377 RLLKTC-VSSPEYHYNVVNADSLIHVFQN 404 +LL+ S +V N ++L +V Sbjct: 320 KLLRQIAASGGGEFMSVNNDEALKNVLNK 348 >gi|194289206|ref|YP_002005113.1| lipoprotein, von willebrand factor type a domain [Cupriavidus taiwanensis LMG 19424] gi|193223041|emb|CAQ69046.1| putative lipoprotein, Von Willebrand factor type A domain [Cupriavidus taiwanensis LMG 19424] Length = 570 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/251 (10%), Positives = 73/251 (29%), Gaps = 52/251 (20%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + ++ + G P + +VD+SGSM+ + + Sbjct: 172 PSNVLLRIGIKGKDMASGALPAANLVFLVDVSGSMNTPDKLPLLKSSLKLL--------- 222 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ + L+ Y + + P+ G++K V + Sbjct: 223 ----------------VNQLRAQDRITLVTYASGTRVALPPTPGSDKG-AIVAAIDQLVA 265 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN- 347 T + AYQ ++ TDG+ N ++ Sbjct: 266 GGSTAGASGIALAYQAAQQSYIAGGIN----------------RVLLATDGDFNVGVTDF 309 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVN--------ADSL 398 + + +++ + + T+ ++L++ + + + N + Sbjct: 310 RQLKSMVEDKRKSGVSLSTLGFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEI 369 Query: 399 IHVFQNISQLM 409 I++ + Sbjct: 370 SSTLATIARDV 380 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 47/373 (12%), Positives = 104/373 (27%), Gaps = 66/373 (17%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A+ ++ +S + A R N K + S+V TE Sbjct: 90 AATSDAVVVTGNRPALQSSVQMKRSAAPAYEAREAPNTEKYNGESVSSVMRVTETPVSTF 149 Query: 124 SSRISMT-------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ--------- 167 S + + + + + YDY L Sbjct: 150 SVDVDTGAYANVRRMLNDGTTPTEAAVRTEELLNYFRYDYPLPQDRSKPFSITTDVAQTP 209 Query: 168 -RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +++ + ERP + +VD+SGSM+ Sbjct: 210 WNAQTRLMRVGLRAYDVPRSERPAANLVFLVDVSGSMNDPD------------------- 250 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +K AL + D++ + ++ Y + P+ + V+Q + Sbjct: 251 KLPLVKTALSMLSDNLRPDD------KVSIVVYAGAAGMVLAPTHEGKYVKQALECLS-- 302 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-K 345 T M AY ++ + +I TDG+ N Sbjct: 303 -AGGSTAGGQGMALAYATAEANFIKGGIN----------------RVILATDGDFNVGIS 345 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 S + + +E+ + + + + L++ + + +A + + Sbjct: 346 SIGEVEALVKQNRESGVTLTALGFGTGNYNEALMEKMADVGNGNYAYIDSA---MEARKV 402 Query: 405 ISQLMVHRKYSVI 417 + + ++V Sbjct: 403 LDDELSSTLFTVA 415 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 70/226 (30%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GL+ + + + + Sbjct: 97 DISNSMLAEDVAPSRLTKSKMLVENLVDHF-------NNDQIGLVVFAGQSYVQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A+ T + K Sbjct: 150 YVSAKMFLQDIQPSLIQTQGTDIAGAINTCMHAFTP------------------NDKVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + I+ A + I + + I +G Sbjct: 192 AIIVITDGEDHEGGA----IEAAKAAHDRGINVFILGIGDSKGAPIPMGNGDYLKDRSGN 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N ++ + + Sbjct: 248 TVMTAVNEQMCKQIAEAGSGKYIHVDNT---KDAENQLNDDLSRLQ 290 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 17/186 (9%), Positives = 51/186 (27%), Gaps = 23/186 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ K A + + + ++ Y + V+ + Sbjct: 104 SGSMSGYKLEQAKQAARHLVTLLKDDD------RLAIVHYGSDVKSLPGLQATPANRERM 157 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ + + + + + S +I ++DG Sbjct: 158 IQYIEGIWDEGGTNISAGLLAGQAQVETAR----------------SDYRVNRLILISDG 201 Query: 340 ENNNFKSNV-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + ++ + ++ + I + +I + N + + + +A L Sbjct: 202 QPTEGSTDEGSLKQVVKDIRTRGITVSSIGVGTDFNEDLMQAFAEYGAGSYGFLEDAGKL 261 Query: 399 IHVFQN 404 +FQ Sbjct: 262 ATLFQK 267 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/360 (10%), Positives = 98/360 (27%), Gaps = 61/360 (16%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV-FYNTEIQNIV 122 + N R E + + + + A F + ++ + I + Sbjct: 253 AVKGLAKNPGLHRYRAEIFMDGDEASANNAAFDFTRVEGPPNVLIVEGTPGTSGNITAAL 312 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S I + L ++ + + + + I + + Sbjct: 313 QSGMIGTEVIPPELLPLEAAKYAVYDSIIFN-----NVSGSDVGGKQMELIEQAVRSFG- 366 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI-------------------CQDK 223 I + + S M + E + Sbjct: 367 -------IGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSM 419 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K + + ++ + +G++ + + + P +K + ++ Sbjct: 420 DGNKIELAKESAMRTVELMRAKD------TVGVVAFDDQPWWVVPPQKLGDK-EEVLSSI 472 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ PA+ A + + ++ II +TDG++ Sbjct: 473 QSIPSAGGTNIYPAVSSALEEMLKI------------------DAQRRHIILMTDGQSAM 514 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + D EN I + ++++ + L ++ +Y V + +L VF Sbjct: 515 NSGYQDLT---DTMVENKITMSSVAVGMDADTNLLQSLADAAKGRYYFVEDETTLPAVFS 571 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 47/138 (34%), Gaps = 13/138 (9%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I S+K ++ AL + F+ + +I + + ++ + + +AA LAG + + Sbjct: 3 LINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDSQ 62 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + ++ L ++ I + + + Sbjct: 63 QAETVA--LDYASRNGLDNNVTVEITDD-----------DHQIGLSGSKQVGMNFAVIFG 109 Query: 134 NNRLDSSNNTIFYNMDVM 151 ++++ + ++ V Sbjct: 110 IDQVEVAASSKARVGHVT 127 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 90/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 118 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 177 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + +++ + P Sbjct: 178 ALQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQVRDLP 233 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 234 PANLVFLIDTSGSMQ-------------------PAERLPLIQSALKLLVNDLRAQDNIT 274 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 275 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 327 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 328 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 371 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 372 VGDDNFNEAMMVRIADVGNGNYSYIYSLSEAQKVLKD 408 >gi|326478189|gb|EGE02199.1| U-box domain-containing protein [Trichophyton equinum CBS 127.97] Length = 741 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 69/243 (28%), Gaps = 32/243 (13%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + +VS P L I LV+D+SGSM+ S +D + Sbjct: 46 PNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMN----SAAPIPTGERGGEDTGLS 101 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMD 285 + K+A +++++ + ++ + T V E ++ + ++ Sbjct: 102 ILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLSAIDK 155 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL------PFQKFIIFLTDG 339 T+ +K+ +L ++ R + P+ K L D Sbjct: 156 LYGKSSTNLWHGIKKGLNVLATNPVRGNIQSLLVLTDGAPNHMCPVQGYVPKLRQTLLDH 215 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + I T L + +A + Sbjct: 216 RNLTGTLPL---------------IHTFGFGYYLRSPLLQSIAEIGGGIFAFIPDAGMIG 260 Query: 400 HVF 402 VF Sbjct: 261 TVF 263 >gi|326474578|gb|EGD98587.1| hypothetical protein TESG_05957 [Trichophyton tonsurans CBS 112818] Length = 741 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 69/243 (28%), Gaps = 32/243 (13%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + +VS P L I LV+D+SGSM+ S +D + Sbjct: 46 PNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMN----SAAPIPTGERGGEDTGLS 101 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMD 285 + K+A +++++ + ++ + T V E ++ + ++ Sbjct: 102 ILDLTKHAAKTIIETLNEKD------RLAVVTFCTEVNVAFELDSMNKENKSTVLSAIDK 155 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL------PFQKFIIFLTDG 339 T+ +K+ +L ++ R + P+ K L D Sbjct: 156 LYGKSSTNLWHGIKKGLNVLATNPVRGNIQSLLVLTDGAPNHMCPVQGYVPKLRQTLLDH 215 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + I T L + +A + Sbjct: 216 RNLTGTLPL---------------IHTFGFGYYLRSPLLQSIAEIGGGIFAFIPDAGMIG 260 Query: 400 HVF 402 VF Sbjct: 261 TVF 263 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 42/527 (7%), Positives = 122/527 (23%), Gaps = 130/527 (24%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISN-------------------------HAKRALIDDAKRFIK---- 99 + + SN DD I Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---M 151 + S+ + + + ++D ++ + + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLLNQRYN---------------------------------QKIVSFIP 178 + + + Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E + ++ V+ + +H S + ++ ++ + L Sbjct: 247 FAMGAEHPKENKCVLPFVLQNN--LHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTE 304 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + ++ ++K + + E+ ++ R T ++ + Sbjct: 305 ATLNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGL 364 Query: 299 KQA-YQILTSDKKRS--------------------------------------------- 312 A Q+L+ + Sbjct: 365 LTASNQMLSEKSRSEELNQETKRVILVLSDGNDDMSNLNLADLERNSIPFTNFSRITKNL 424 Query: 313 --FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFI 362 + + F + N +++ +C K+ Sbjct: 425 ILGKKEDLSSTTTSNKAYYNRHSTFNYNTYLTNKTKDISRKGMCSIIQEKLNTLNKDKNT 484 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 K+V + KTCV + ++ N +SL++ F+ I + Sbjct: 485 KLVFVEFGYRSESADAWKTCVGN-GNYFYADNRESLLNSFKQAIGET 530 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D E +G++ Y+ R E + + T + Sbjct: 59 QWVANLVDTFEVGPERTRVGVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ T G + F++ I LTDG + + + Sbjct: 119 DALRFITRHSFTPR-----------AGGRPGDRAFKQVAILLTDGRSQD-----LVLPAA 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 A+ I+I + + + + L+ S P + ++V + D++ + + + + Sbjct: 163 TAARRAGIRIFAVGVGEALREE--LEEIASEPTAAHVFHVSDFDAIDKIRGKLRRRLCE 219 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D E +G++ Y+ R E + + T + Sbjct: 59 QWVANLVDTFEVGPERTRVGVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ T G + F++ I LTDG + + + Sbjct: 119 DALRFITRHSFTPR-----------AGGRPGDRAFKQVAILLTDGRSQD-----LVLPAA 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 A+ I+I + + + + L+ S P + ++V + D++ + + + + Sbjct: 163 TAARRAGIRIFAVGVGEALREE--LEEIASEPTAAHVFHVSDFDAIDKIRGKLRRRLCE 219 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 65/218 (29%), Gaps = 55/218 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + QD K T++ K ++ + I GL+ + R + Sbjct: 96 DCSRSMLAQDIKPTRLERAKREIIDLMGMIQS-------DRAGLVAFAGRAILQCPLTLD 148 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ D L + TD A++ A + + K Sbjct: 149 HSAFNLFLNALEPDYLPVGGTDLGGAIETALNGFEKEVESE------------------K 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN S I++ KA + +KI I + + Sbjct: 191 AIILITDGENTTGDS----IEMAKKAADQGVKIFCIGVGSPEGAPVPDSAGGFKKDRSGK 246 Query: 374 ------NGQRLLKTCVSSPEYHYN-VVNADSLIHVFQN 404 + L K + + V L ++ Sbjct: 247 IIISRVDEPALKKIAAMTQGVYVRSVAGDMDLDRIYDQ 284 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/344 (10%), Positives = 94/344 (27%), Gaps = 50/344 (14%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A + + I+ + + H + Sbjct: 140 AAESKLMAPQPTMAPADQIAPQEENRNRVQDFKTNPVHAALEDPVSTFSIDVDTASYSFV 199 Query: 124 SSRISMTHMANNR---LDSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPA 179 + + ++ N Y+ S + Q + + Sbjct: 200 RRSLKEGSVPQADTVRVEEMINYFPYDWKGPDSVSTPFNSTVSVMPTPWNAQTKLMHVAI 259 Query: 180 LL-RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 I+ E+P + ++D+SGSM K+ LK+A L Sbjct: 260 KGFDIKPTEQPKANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRLL 300 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + ++ Y + P+ EK + + + T + Sbjct: 301 VSKLRADD------TISIVTYAGEAGTVLMPTRAAEKDK-ILNAIDNLTPGGSTAGEAGI 353 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKA 357 K+AY++ + ++ TDG+ N +S+ + ++ ++ Sbjct: 354 KEAYKLAQQSFVKDGVN----------------RVMLATDGDFNVGQSDDDDLKRLIEQE 397 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +++ + + ++++T + D+L Sbjct: 398 RKSGVFLSVFGFGHDNLNDQMMQTIAQNGNG--TAAYIDTLAEA 439 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 75 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 132 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 133 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 180 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 181 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 232 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 233 IGSEPHKDHVFLVANFSQIESLTSVFQN 260 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|116003875|ref|NP_001070294.1| anthrax toxin receptor 2 [Bos taurus] gi|115305014|gb|AAI23758.1| Anthrax toxin receptor 2 [Bos taurus] gi|296486409|gb|DAA28522.1| anthrax toxin receptor 2 [Bos taurus] Length = 488 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRGKISEGLDNLKHVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIE----------------KARDLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ ++ + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGARVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALRGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELRPSSVCVG 232 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + + + + + + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSA--KEMKKAVTHM 725 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + PS + I TDG + S Sbjct: 726 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|162448748|ref|YP_001611115.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] gi|161159330|emb|CAN90635.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 79/246 (32%), Gaps = 44/246 (17%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + ++++ I + + + + ERP + + VD SGSM Sbjct: 100 IMGNMISGSPCTLIQIGMNSPVDLGALERPPLHLVIAVDTSGSM---------------- 143 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +A ++ L+ +D++ + L+ Y+ E +E + G+++ Sbjct: 144 ----EGDPIAYVRAGLVEMIDALQPTD------RISLVRYSDAAEVVLEQAEGSDR-EAL 192 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ + AY + ++ Q +IFL+DG Sbjct: 193 TEAFEGLTARGSTNLYEGLFTAYALAEQHLDPAW----------------QNRVIFLSDG 236 Query: 340 ENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 S + + E I + I + A + + Y + + ++ Sbjct: 237 VATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGAEFDVDAMRGISEVGAGNFYFLEDPKAV 296 Query: 399 IHVFQN 404 VF Sbjct: 297 EEVFAE 302 >gi|315052466|ref|XP_003175607.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] gi|311340922|gb|EFR00125.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] Length = 740 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 70/243 (28%), Gaps = 32/243 (13%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + IVS P L I LV+D+SGSM+ + P Sbjct: 46 PNKDSMIVSIQPPLKPENDVPHVPCDIVLVIDISGSMN-SSAPIPTGERGGEDTGLSI-- 102 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMD 285 + K+A +++++ + ++ + T V+ E + ++ + + Sbjct: 103 -LDLTKHAARTIIETLNEND------RLAVVTFCTEVKVAFELDFMNKENKSMVLRAINK 155 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL------PFQKFIIFLTDG 339 T+ +K+ ++LT+ R + P+ K L D Sbjct: 156 LYGTSSTNLWHGIKEGLKVLTATPVRENVQSLLVLTDGAPNHMCPAQGYVPKLRQTLLDH 215 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N+ + I T L + +A + Sbjct: 216 RNSTGSLPL---------------IHTFGFGYYLRSPLLQSIAEIGGGIFAFIPDAGMIG 260 Query: 400 HVF 402 VF Sbjct: 261 TVF 263 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 82/242 (33%), Gaps = 49/242 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L+++ + ++ E I L +D+SGSM N P Sbjct: 291 LSRKSMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSNGMP------------- 337 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ K+A+ ++ + + + ++ + R + WG +VR+Y + Sbjct: 338 --KIEIAKDAVAQYIQFLSKTND-----RLAMVLFNFRADVL----WGLHQVRRYWQQMN 386 Query: 285 DS----LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T+ A++++ ++LT + K +I +TDG Sbjct: 387 YMLKYVYAGGGTNLANALERSREVLTRSRSN------------------SKHVICVTDGR 428 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N IK + + + I TI+I + + + L++ + + L Sbjct: 429 TVNSS---MCIKEAVRLRRSGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIHDLGK 485 Query: 401 VF 402 Sbjct: 486 AL 487 >gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 426 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 31/262 (11%), Positives = 72/262 (27%), Gaps = 38/262 (14%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + Q +Q+ ++ + E + + L++D SGSMH Sbjct: 11 DNNVDMA-QTNSQRQLAMTIFAIAGEFDQNLPLNLCLILDRSGSMH-------------- 55 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + A+ +D + + + ++ ++ VE I E + Sbjct: 56 ------GQPIKTVIQAVEGLIDRLKVGD------RISVVAFSGSVEVIIPNQVIEDPESI 103 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T ++ L + + F S Sbjct: 104 KSQIK--SKLSASGGTAIAEGLELGITELMKGTRGAVSQAFLLTDGHGESS-------LR 154 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + N +K+ KA + + I T S N L K ++ + + Sbjct: 155 IWKWDIGRDDNKRCLKLAQKAAKLNLTINTFGFGNSWNQDLLEKIADVGGGTLAHIEHPE 214 Query: 397 SLIHVFQNISQLMVHRKYSVIL 418 + F + + + Sbjct: 215 QAVEQFSRLFGRIQSIGLTNAY 236 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDLNT---LKA 206 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 207 IGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 67/186 (36%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + Sbjct: 671 FETVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRGFSSAKEMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + PS + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQV----------EGARPPSTQVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KEGICE 836 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 89/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSRAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + ++ + P Sbjct: 175 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTRDLP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIRSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---TYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVLKD 405 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 12/108 (11%), Positives = 40/108 (37%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ +II L ++ + IG + + + K+ +++ ++ LA A + Sbjct: 18 QRGAVAIIVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSCALAAARDLTGATPLTVS 77 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ A+ ++ + +I + S + + + + Sbjct: 78 EAAGLTAAARNLVLFQGNLEQQPNITSAESVTYSDSLANPFLDKNSVT 125 >gi|225559690|gb|EEH07972.1| U-box domain-containing protein [Ajellomyces capsulatus G186AR] Length = 759 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 60/216 (27%), Gaps = 28/216 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + + K+A +++++ Sbjct: 75 DIVLCIDVSYSMQSSAPLPTTDESGEREETGLSV--LDLTKHAARTIIETLNEND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E + S + + L T+ +K + +++ Sbjct: 128 -RLGIVAFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERH 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN----FKSNVNTIKICDKAKENFIKIVT 366 + + LTDG N+ I I T Sbjct: 187 TL---------------QSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHT 231 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + +A + VF Sbjct: 232 FGFGYNIRSGLLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|327540682|gb|EGF27254.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 887 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 27/252 (10%), Positives = 77/252 (30%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N ++V I+ +RP + ++D SGSM K+ K Sbjct: 482 NENNRLVRVGIQAKDIDRKKRPRCNLVFLIDTSGSM-------------------KRPNK 522 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + + LD ++ + ++ Y ++ + +K ++ + Sbjct: 523 LPLVIEGMKVLLDQLNKKD------RVAIVVYAGSSGLVLDSTPVKQK-KKIIRALSALS 575 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ ++ AYQ + +I +DG+ N Sbjct: 576 AGGSTNGGAGLQLAYQTARENFIEDGVN----------------RVILCSDGDFNVGMTG 619 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNI 405 + + ++ ++ + + ++ + S + V + + + Sbjct: 620 TDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDT---IAEAKKVL 676 Query: 406 SQLMVHRKYSVI 417 + + ++V Sbjct: 677 ADQVAGTLFTVA 688 >gi|212636466|ref|YP_002312991.1| type IV pilin biogenesis protein [Shewanella piezotolerans WP3] gi|212557950|gb|ACJ30404.1| Type IV pilin biogenesis protein, putative [Shewanella piezotolerans WP3] Length = 1175 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/302 (8%), Positives = 64/302 (21%), Gaps = 51/302 (16%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + + I S + + + ++ Sbjct: 167 LPVDSLGSKAAPIPYTHISSSASSSDIENA-----IDAAFLTKFGTGKSVTLYTERYVNW 221 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 +T+M K + + + + T + + Sbjct: 222 YHGYKSSSWQTRMNIAKRVMEDTVVTTPSVDFGLAIFNYNTSKRTDGGRIVSGINRLSTG 281 Query: 276 VR-QYVTRDMDSLILKPTDSTPAMKQAYQILT---------------SDKKRSFFTNFFR 319 + V + D T + +AY+ + + K + Sbjct: 282 DKISLVDKINDLNAETNTPLCETLYEAYRYFSGSAVHFAKEAGSMTPTRDKSIESNGNYV 341 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNV------------------NTIKICDKAKENF 361 K + +++++TDG + + Sbjct: 342 SPFKSSQCSNRSYVVYITDGSPTQDSAANTKVSKMSGYKSSERVDGSYLPALASIMHRQD 401 Query: 362 I----------KIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + TI + L KT S ++ NA L Q + + Sbjct: 402 VNPSLAGEQFVTTYTIGFSDGAADAEPILKKTAELSGGTYFPAKNATQLQSALQQVFSQI 461 Query: 410 VH 411 + Sbjct: 462 LE 463 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 20/188 (10%), Positives = 57/188 (30%), Gaps = 37/188 (19%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +++A K+ ++I + + +G+ + + + + V + Sbjct: 240 GSRIANTKSGAKQLAETILDANPDNQ---VGVTRFNNGASTPQQLTDDLDDVEAAIDGLS 296 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T++ + L + ++ DG+ N Sbjct: 297 ---ASGGTNAQAGVDAGQAELE------------------NCPHDNRVMVVFGDGDINTD 335 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 S AK +I I + + L+ S P E+ + ++ ++ +F Sbjct: 336 GS---------AAKVAGTEIFAIGVGGASFSD--LEDLASDPADEHVFFAIDDGAIEQIF 384 Query: 403 QNISQLMV 410 +++ + Sbjct: 385 GQVAETIT 392 >gi|262193497|ref|YP_003264706.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262076844|gb|ACY12813.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 583 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/266 (12%), Positives = 79/266 (29%), Gaps = 26/266 (9%) Query: 161 IEHLLNQRYNQKIV-SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 +L+ + + + + +G P E + + + C + P DV Sbjct: 46 SLDMLSTYQGRALGCDDGTLDISVGIGSSPDGPFEELPGDAYEVRCTASEAP-DVALVID 104 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 ++ + L+ A + D++ V + +T + + + E +R Sbjct: 105 NSGSEQGYLEWLQEAAHVMTDAVMGRDGRSSLVRV-----STDSDIRLGLTEDEEAIRGA 159 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T ++ + L + + ++ TDG Sbjct: 160 IDELYIL--NGWTALYDGIRLGNETLGAAAATHSDYDSMDDFCDTDRKLA---VVAFTDG 214 Query: 340 ENNN------------FKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSS 385 NN T++ + + I T+ + + L + + Sbjct: 215 NENNSANERLRSDEYPGDGIDTTLEDLHDLRVADVRTPIYTVGLGDEVDHGGLEELAGYT 274 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH 411 H+ + +A L F+ IS+ + Sbjct: 275 GGRHHRIDSAADLPATFEVISEYLAS 300 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 42/239 (17%), Positives = 83/239 (34%), Gaps = 46/239 (19%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + L++D SGSM + N+ + KM K A F+ Sbjct: 59 TGIPPANVVVPNDVVLIIDKSGSMAPSYNNGED--------------KMLNAKEAAKGFV 104 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D +DL H V ++ + N+ + + + Y+ T + A+ Sbjct: 105 DLMDLTKHRVAIVDFS----SSNMIGNLPFTTNPTEAKNYIDTIN---ANGSTATGDAID 157 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN--TIKICDKA 357 A +L + Q I+ +TDG+ ++ + A Sbjct: 158 SAIALLAN-----------------HRPEAQPVIVIMTDGDATQPSTDPYGYAKQKALLA 200 Query: 358 KENFIKIVTISINASPNG------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K+N I TI++ S + LLK ++ ++H+ V+ + L ++ I + + Sbjct: 201 KDNGIIFYTIALLKSTDDPVTSGPNILLKEMATTSDHHHFVLGSTGLSQIYAAIVKEIG 259 >gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 730 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/265 (12%), Positives = 70/265 (26%), Gaps = 51/265 (19%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI--PALLRIEMGER-----PIFLIELVV 197 + + L L+ L + R I ++ Sbjct: 221 ASDYVPRSDVSLALTVTGGTLSSSIMNYWDEADRRGYFLLTLVPPREPERIIPKDIVFIL 280 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D+SGSM K+ K ALL L + + + Sbjct: 281 DISGSM--------------------SGQKIEKAKLALLQVLQMLHEGDRFSIITFNNEV 320 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 E+ + S + V + + T+ A+ + ++L + + Sbjct: 321 --NNLTERLLPFS-DRTEWYPAVKQI---MAGGMTNIHDALLEGIEVLGTQSTDDRY--- 371 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT--IKICDKAKENFIKIVTISINASPNG 375 K ++FLTDG +++ T AK + + + N Sbjct: 372 -------------KVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVGYDVNA 418 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIH 400 + L + +V + + Sbjct: 419 ELLDELAEKGGGKVKYIVENEEIDE 443 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/371 (12%), Positives = 108/371 (29%), Gaps = 60/371 (16%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A A + S E + + + + +T Sbjct: 73 ASEAAVAPDTSGQPAEAAGREYRYVMPVIVPQPEDRERYDGEEVSPVKIAAVEPLSTFSV 132 Query: 120 NIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYD----------YRLQFIEHLLNQR 168 ++ + + ++ ++ YD + + F Sbjct: 133 DVDTGAYANARRFLSQGQMPPKAAVRTEEFINYFRYDYDRPQDRSQPFTVNFDAARTPWN 192 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + +++ A IE ERP + ++D+SGSM + K+ Sbjct: 193 EDTRLIRIGLAGYDIERSERPPANLVFLMDVSGSM-------------------GRPDKL 233 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +K AL + + ++ Y +EP+ T K+R + + Sbjct: 234 PLVKTALAGLAGELQPQD------KVSIVVYAGAAGLVLEPTNDTRKIRAALNQLQAGGS 287 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSN 347 ++ AYQI + +I TDG+ N S Sbjct: 288 T---AGGAGIQLAYQIAEDNFIEGGVN----------------RVILATDGDFNVGVSSR 328 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 I++ +K +++ I + T+ + +++ + + + +A + + + Sbjct: 329 DALIEMIEKKRDSGITLTTLGFGTGNYNEAMMEQIANHGNGNYAYIDSA---LEAKKVLG 385 Query: 407 QLMVHRKYSVI 417 M +++ Sbjct: 386 DEMSSTLFTIA 396 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 17/133 (12%), Positives = 42/133 (31%), Gaps = 3/133 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV---SNL 72 + I AL ++ L +G + K +++A ++ L+ A ++ L Sbjct: 8 HRQSGAVIITVALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDAL 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 +R ++ ++ K +A + + S S + ++ T Sbjct: 68 TRATSAGKTAADLNKINFQGEATGLAQTEVVFSDSLIGTYSHTFTPIANARYAKCLHTKS 127 Query: 133 ANNRLDSSNNTIF 145 + F Sbjct: 128 GIAPWILHALSAF 140 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 31/344 (9%), Positives = 78/344 (22%), Gaps = 34/344 (9%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM---DV 150 + + Y T N + +A + + + +T ++ Sbjct: 29 VLAAGDDTLTPGSKYYDVNGAETSESNAKVVLTKNAQRVAADEWEVTLSTKVNDIKIEQQ 88 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L + + + +E + H Sbjct: 89 PLEVTFVLDVSGSMAWCAEEHRHSADCGYNCGLEEHRHGDDCYTKCTRQNHPDHWENWGL 148 Query: 211 PEDVNSAPICQDKKRTK----MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-K 265 C + ++ + + H + G + Sbjct: 149 VWVHKLGTDCSLSWGSYYYLTCTKAEHTHSSSCYACGKVEHNHDRASGGEPCSMVQSGKA 208 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTD-----STPAMKQAY--------------QILT 306 V+ V + L K T S+ K LT Sbjct: 209 QSRLKIAVSAVKNMVGTLREQLGGKLTAKFVVFSSEGYKNGVDKRASAKVITEAQLDQLT 268 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + S ++ ++ + DG++++ N + K I T Sbjct: 269 AVGGTDLSAGVALGVDQFKSSSARQVLVVVADGDSDDGYPNRT----ANNFKNKDGIIYT 324 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + S + + ++ + L ++IS + Sbjct: 325 VGFTFSSDS---FNNLATDADHALLANSDTELGEAMEDISTDIT 365 >gi|32474636|ref|NP_867630.1| hypothetical protein RB7099 [Rhodopirellula baltica SH 1] gi|32445175|emb|CAD75177.1| conserved hypothetical protein-containing vWFA domain [Rhodopirellula baltica SH 1] Length = 885 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 28/252 (11%), Positives = 76/252 (30%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N ++V I+ ERP + ++D SGSM K+ K Sbjct: 480 NENNRLVRVGIQAKDIDRKERPRCNLVFLIDTSGSM-------------------KRPNK 520 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + + LD + + ++ Y ++ + +K ++ + Sbjct: 521 LPLVIEGMKVLLDQLK------NRDRVAIVVYAGSSGLVLDSTPVKQK-KKIIRALSALS 573 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ ++ AYQ + +I +DG+ N Sbjct: 574 AGGSTNGGAGLQLAYQTARENFIEDGVN----------------RVILCSDGDFNVGMTG 617 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNI 405 + + ++ ++ + + ++ + S + V + + + Sbjct: 618 TDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDT---IAEAKKVL 674 Query: 406 SQLMVHRKYSVI 417 + + ++V Sbjct: 675 ADQVAGTLFTVA 686 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 48/175 (27%), Gaps = 29/175 (16%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 KM K + L LD ++ + GLI + + + T K + + Sbjct: 411 MKMELAKESAQLVLDLLEDKDY------FGLIAFDHSYQWIVPLQPLTNK-EEAASLISR 463 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T P +K A + L K II +TDG+ Sbjct: 464 ISPGGGTALYPPLKSAGESLLKV------------------PIKSKHIIAITDGQTEGGD 505 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I + TI I N L Y+ N +L Sbjct: 506 FYNLVRNLAKY----KITVSTIGIGEDANIPLLKDIANWGNGRFYHTWNIRNLPQ 556 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 70/250 (28%), Gaps = 33/250 (13%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L + + + R G L D D Sbjct: 14 VLLLDLCDSYDLNNPS--YILAQSYAQRRNNGLPHRTLNPAATDSQCRSRPLDLVFIIDS 71 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + ++ K+ D +D L + + ++ Y + V+ Sbjct: 72 SRSVRPGEFEKVKI--------FLADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLT 123 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K + T + A+K+A + G + S K I Sbjct: 124 KDTIKQAITRIEPLAAGTMTGMAIKKAMD----------EAFTEKSGARPKSKNISKVAI 173 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHY 390 +TDG + ++ A+ + I+I + + + + LK S+P ++ + Sbjct: 174 IVTDGRPQDQ-----VEEVSAAARASGIEIYAVGV----DRADMRSLKLMASNPLEDHVF 224 Query: 391 NVVNADSLIH 400 V + Sbjct: 225 YVETYGVIEK 234 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 73/252 (28%), Gaps = 32/252 (12%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP--------IC 220 + +S +I M ++ +I VV + A + V P Sbjct: 46 KDNLSLSIKLLNNKIVMDKKNNQIIPGVVSVEAKDFDADQVKKDKVRYQPLDLIFVIDTS 105 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + K+ +K ++L L I + L+G+ ++ + +E + T+ ++ + Sbjct: 106 GSMQGKKIELVKKSILQVLHIIQGDD------RISLVGFNSQAKVLLELTQLTKNSKKKI 159 Query: 281 TRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + +D T M++A+ I+ + F + F + F+ Sbjct: 160 QKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLASIFLLSDGQDNCGFSQTQHFMNQS 219 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + C I + L K + + + Sbjct: 220 KIEYP--------FC---------IDCFGFGDDHDSLTLSKINQLQQGTFNFIRDISQID 262 Query: 400 HVFQNISQLMVH 411 F I + Sbjct: 263 DAFTIILAGIKT 274 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 68/207 (32%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA++ I I + LK Sbjct: 155 RPLRENVPRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAFGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A++ ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALRHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S P ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTTKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 64.9 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 50/176 (28%), Gaps = 26/176 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K AL +D + + ++ Y +V + + V +D + Sbjct: 63 TKQALRFLVDQMAEED------RLAIVTYDDQVHVPFPSQPVVQ--KDAVRLLVDGITAG 114 Query: 291 PT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNV 348 T + + + Q + ++ +TDG N Sbjct: 115 GTTNLSGGLATGMQQIRP----------------HAGPGRVSRVLLMTDGLANVGVTDPD 158 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +E + + T+ + + L+ + + + N D + +FQ Sbjct: 159 VLAGWARAWREKGLAVSTMGVGPHFSEDLLVALAEAGGGNFHYIANPDQIPRIFQE 214 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 82/242 (33%), Gaps = 49/242 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L+++ + ++ E I L +D+SGSM + P Sbjct: 294 LSRKTMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSSGMP------------- 340 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ K+A+ ++ + + + ++ + R + WG +VR+Y + Sbjct: 341 --KIEIAKDAVSQYIQFLSKTND-----RLAMVLFNFRADVL----WGLHQVRRYWQQMN 389 Query: 285 DS----LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T+ A++++ ++LT S K +I +TDG Sbjct: 390 YMLKYVYAGGGTNLANALERSREVLT------------------RSKSNSKHVICVTDGR 431 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 N +K + + N I TI+I + + + L++ + + L Sbjct: 432 TVNSS---MCVKEAVRLRRNGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIHDLGK 488 Query: 401 VF 402 Sbjct: 489 AL 490 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 71/247 (28%), Gaps = 40/247 (16%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V+FIP + + V+D SGSM + + Sbjct: 17 LQVAFIPFSGVLAAQSGTSMDVIFVLDSSGSMAESDPEKIRE------------------ 58 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILK 290 A+ +FLD + + GL+ Y+ + + + + + + + L Sbjct: 59 -EAIKMFLDMGQIQGNKA-----GLVAYSDSIVREHNLDSINSSEDKDRIKSMASDISLG 112 Query: 291 P-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD+ + +A +++ S K + +S + Sbjct: 113 QRTDTGRGLLEAVKLMESGHKSGNNPVIILLSDGKNDPERSQ------------DESLND 160 Query: 350 TIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K + TI +N + + +L S+ Y A L + +I Sbjct: 161 LKNALQICKSKGYPVYTIGLNYNGTVDKTQLGDISSSTGGKDYITNTASDLPAILTDIYA 220 Query: 408 LMVHRKY 414 K Sbjct: 221 DNSKLKV 227 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 13/104 (12%), Positives = 39/104 (37%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++K ++ ++ ++ L + F I + K +++A ++A LAGA + N + Sbjct: 11 RTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDENGNVS 70 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 + + + + D + +++ + Sbjct: 71 AAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATFVSAA 114 >gi|149916952|ref|ZP_01905453.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] gi|149822230|gb|EDM81621.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] Length = 416 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 62/195 (31%), Gaps = 15/195 (7%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDS 286 + K A + +D + L+ + +R E + + + ++ Sbjct: 16 IEGAKAAAMELVDGLAEGDS------FALVVFHSRAEVLMPSTVINEDSRAAARSKIETM 69 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD ++QA L + P + ++ L DG N+ + Sbjct: 70 QAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERVVLLGDGVPNDAST 129 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ--- 403 +T+ + +I + + L + V + +++ +F+ Sbjct: 130 IPSTVG---QLAARGTQITALGYGIEYDETLLASLAEQTHGSFRFVDDPEAVASLFRDEV 186 Query: 404 -NISQLMVH-RKYSV 416 +I + + + + SV Sbjct: 187 LDIERTVANDLRLSV 201 >gi|326676330|ref|XP_003200547.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 1164 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 48/169 (28%), Gaps = 26/169 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + + ++ S + K S T + AMK Sbjct: 480 DQIGPDGTQVAIAQFSDDARTEFSLSSHSSKEELLTAIQRVSYKGGNTKTGRAMKHVKDS 539 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + V K ++ LTDG + + +++ + + + Sbjct: 540 VF-------------APVGGARRGVPKVLVVLTDGRSQDD-----VLQVSQELQAEGYIV 581 Query: 365 VTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I A + LL S P + + V + F+ I + ++ Sbjct: 582 FAIGF-ADADYGELLSI-ASRPGDRHVFFVDDL----DAFRTIVENILQ 624 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 62/181 (34%), Gaps = 44/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D ++ K + ++ + Sbjct: 91 DIVFAVDVSKSM---------------LAEDIAPNRLEKAKRLVSEIINELAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ D L + T A++ A +++ Sbjct: 129 DRVGIIAYAAQAFPQLPITTDYGAAKMFLQSMNTDMLSSQGTAIHEAIELAATYFDDEEQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ + + ++A + I+I TI + Sbjct: 189 TN------------------RILFLVSDGEDH---AEDQVMDAIEQATDQGIRIFTIGVG 227 Query: 371 A 371 + Sbjct: 228 S 228 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/337 (10%), Positives = 90/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 21 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 80 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + +++ + P Sbjct: 81 ALQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQVRDLP 136 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 137 PANLVFLIDTSGSMQ-------------------PAERLPLIQSALKLLVNDLRAQDNIT 177 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 178 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 230 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+ TDG+ N ++ I+ K +E I + T+ Sbjct: 231 IKGGVN----------------RILLTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 274 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 275 VGDDNFNEAMMVRIADVGNGNYSYIYSLSEAQKVLKD 311 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 59/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ S + + + T + Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPLSA-FSRREDMERAIRALVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPPEARVPRVAVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVDSFDLIQEF 207 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 322 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 378 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 379 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 428 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 429 -----WAARAKEEGIVMYAVGVG--KAVEEELRKIASEPAELHVSYSPDFGTMTHLLENL 481 Query: 406 SQLMV 410 + Sbjct: 482 KGSIC 486 >gi|197117451|ref|YP_002137878.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197086811|gb|ACH38082.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 357 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 18/199 (9%), Positives = 42/199 (21%), Gaps = 10/199 (5%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + K + +A +AA A + + ++ + + Sbjct: 20 VDAGRAYGVKAKLHAAVDAASYEAAKALAQGEDEDDMKEKASEAAFDYFRANFPSDYFGA 79 Query: 101 HIKE-----SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 S + I + +D + D Sbjct: 80 QCSGPELELSERKSEQKMRALSVSATARLPNIFAGILGWKSIDLPARSRAVRKDA--DVV 137 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPED 213 L+ + L + R + + L+ D S+ N + Sbjct: 138 LVLESSDALRDSFPQ-VKQRVATFSDRFSQHDDRMSLVTFAAGADPVISICGVYNPAKDR 196 Query: 214 VNSAPICQDKKRTKMAALK 232 + K K Sbjct: 197 PGAGTFNCGSGYQKSNFAK 215 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 50/516 (9%), Positives = 123/516 (23%), Gaps = 117/516 (22%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLG----------------------DRFESISNHAKRALIDDAKRF------IKN 100 S D + D K Sbjct: 67 NSGRKTTDYKLGGSNPSDESFNISSEVGKRDHAIVTAFVKTFLPQTDEKNMHLTPLCKTI 126 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---MT 152 + S+ + V + N +++ ++ + + Sbjct: 127 NNTSGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASQSRAIKKNTFNIPI 186 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR-------IEMGERPIFLIELVVDLSGSMHC 205 + N +I +E+ ++ + + + + Sbjct: 187 DLMVVADLSGSMNFDLDNNEIKKTGKPSKISILKEVLVELADKTLLSEDANQNNRIYVTP 246 Query: 206 AMNSDPEDVNSA---PICQDKKRTKMAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTT 261 + N+ + ++ +KN L + + + Sbjct: 247 FALGAEINNNNCALPYSWSVESSSRTQNIKNILNKQNSQYNRADLINNLVYKISTKETLD 306 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKP--------TDSTPAMKQAYQILTSDKKRSF 313 + + K + S T+ ++K L S Sbjct: 307 NINGKQNYNVTFSKNAFCLKDMKTSNKGWYSRSDKLEFTNYVQSIKANGATLASSGVLVA 366 Query: 314 FTNFFRQGVKIPS--LPFQKFIIFLTDGENNNFKSNVN---------------------- 349 N R G + ++ I+ L+DG + K + N Sbjct: 367 ANNMIRDGSRTEQLKEQTKRVILVLSDGNDEIVKGDPNNKVPFLNYTRITENLIYGRQEE 426 Query: 350 ----------------------------TIKICDKA--------KENFIKIVTISINASP 373 T +C+ K+ KIV + Sbjct: 427 FSSEKKRVSFGHSTTIETYLTDTQPKKVTDGMCNVIRDKLDTLNKDKNTKIVFVEFGYKS 486 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 ++ + CV +Y+ + SL++ F+ I + Sbjct: 487 TSKQAWEHCV-GNGNYYSANDKASLLNSFKQAIGET 521 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D ++ K A+ +D++ + +G+I + + + Sbjct: 97 DVSNSMLAGDLAPNRLENAKRAISQLIDNL-------HNDRIGIIIFAGEAYVQLPITTD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ D + + T A+ + K Sbjct: 150 YSAAKLFLNNITTDIVPTQGTAIGAAIDMGMKSFNFVNGT------------------SK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +I +TDGEN+ + + +A + I I + + Sbjct: 192 AMILMTDGENHEDDA----VSAAKRASAKDVAIHVIGVGS 227 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 60/180 (33%), Gaps = 44/180 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D ++ K + ++ + Sbjct: 91 DIVFALDVSKSM---------------LAEDIAPNRLEKGKRLISEIINHLGS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ D L + T A++ A + + Sbjct: 129 DRIGIIAYAAQAYPQLPITTDYSAAKMFLQSMNTDMLSSQGTAINEAIELASTYYDDETQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ + T +KA E I+I TI + Sbjct: 189 TN------------------RVLFIISDGEDH---AEGTTEDAVEKATEEGIRIFTIGVG 227 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ +K ++ L L + + ++ Y +EP+ + + Sbjct: 335 VSGSMGSPDKLPLVKQSMELLLSGLQPTD------TVSIVVYAGAAGTVLEPTP-VAEQQ 387 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T ++ AYQ+ ++ +R II T Sbjct: 388 KILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVN----------------RIILAT 431 Query: 338 DGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ N ++ ++ + N I++ + + L++ + D Sbjct: 432 DGDFNVGIADPEQLKGYVERKRANGIELSILGFGSGNYNDALMQQLAQNGNGV--AAYID 489 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 +L + + + ++V Sbjct: 490 TLSEAQKVLVEQASGTLFTVA 510 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 74/254 (29%), Gaps = 48/254 (18%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + Q ++S +I + LV+D SGSM Sbjct: 19 SSSQRQLMLSIAATSEQIN--TNLPINLCLVLDHSGSMQ--------------------G 56 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K A L ++S+ + + +I + R + + + + ++ Sbjct: 57 KPLETVKKAALSLIESLGVND------RLSVIAFDHRAKVILPSQSRQDDLTLIRSKIQQ 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K Q +S S + I LTDGE N Sbjct: 111 LRAGGGTAIDEGIKLGIQESSSG-----------------SKGYVSHIFLLTDGE-NEHG 152 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQN 404 N +K+ A E I + T N L K + + + LI F Sbjct: 153 DNQRCLKLAAVAAEYGITLNTFGFGDHWNQDILEKIADIAGGSLSYIERPEQALIE-FTR 211 Query: 405 ISQLMVHRKYSVIL 418 + + + + Sbjct: 212 LFNRLQSVRLTNAF 225 >gi|149175890|ref|ZP_01854508.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] gi|148845337|gb|EDL59682.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] Length = 368 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/354 (10%), Positives = 102/354 (28%), Gaps = 38/354 (10%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 + ++ A+ ++ L ++ F + K+ ++ A +AA+LA ++ + + Sbjct: 17 HRRGAIAVFTAIMLVPLLGMVAFAVDYGYLLKKRADLQRAADAAVLAAVRDLIPDANGTQ 76 Query: 77 D----RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI----- 127 D R + + + + I+ +V+ N I + + Sbjct: 77 DLSKVRATLRQYADLNIGEIEGFQVLDSDIQIGRYNPESVYDNFTILDWGTFDTVRVTLR 136 Query: 128 ------------SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + N D + + + + + S Sbjct: 137 FDTQANSPVSLFFARLLGINESDLNATSTAILQKG----SLLAPGVGVMPFTIPKAQWDS 192 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM---AALK 232 P + G+ +E + + + NS +D+ + Sbjct: 193 TEPGEVWTIYGD---GRLEDNIGAEIPGNWGTCDIGKSYNSTSDIRDQIEVGLRQEDLDA 249 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A DS + + + G G ++ ++ + + + + DS++ + Sbjct: 250 LAAEGRNDSTEYIDSARPLYLNGDPGISSGMKSAV----MAAEGKTKLIPVYDSVMGNGS 305 Query: 293 DS-TPAMKQAY-QILTSDKKRSFFTNF-FRQGVKIPSLPFQKFIIFLTDGENNN 343 + A +++ S T ++ + +TDG + Sbjct: 306 GVEFHVIGWAVCEVIDSHWGGDKNTYVKIKKSYTYDGFLRPNTDLRVTDGVVDG 359 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 66/208 (31%), Gaps = 41/208 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LV+D S SM A Q +++ A+K ++ LD E Sbjct: 90 DIMLVLDTSQSMEIADIEGQS-------GQAMALSRLDAVKQGVMSLLD-------ASEG 135 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + + + + + VR V++ T + A +L Sbjct: 136 NRIGLIAFGEQSFVMSDLTAYGDTVRYMVSQLETGFAGDSTRLGDGVGYAVSLLADVDSE 195 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ +TDG + S++ ++ AK +K+ ++ A Sbjct: 196 RA------------------IVVLITDGNDTG--SDLPPVEAARLAKALDVKLYVAAVGA 235 Query: 372 SP-------NGQRLLKTCVSSPEYHYNV 392 + L + + + + Sbjct: 236 DVSTDREPIDEALLRRLAERTGGAFFRI 263 >gi|327261831|ref|XP_003215731.1| PREDICTED: collagen alpha-1(XII) chain-like [Anolis carolinensis] Length = 3118 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + ++ +G++ Y++ + + Sbjct: 155 NNFKYILDFIVALVSAFDIGEDKTRVGIVQYSSDTRTEFNLNQYYRQRDLIEAIKNIPYK 214 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ + + S F K I +TDG++ + Sbjct: 215 GGNTMTGEAIEYLMRNTFVESAGS-------------RKDFPKVAIIITDGKSQD----- 256 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S P + +NV N D ++ + I Sbjct: 257 EVEIPARELRAAGVEVFSLGIKAAD--AKELKLIASQPSLTHVFNVANFDGIVDIQNEIV 314 Query: 407 QLMVH 411 + Sbjct: 315 SQVCS 319 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 46/364 (12%), Positives = 110/364 (30%), Gaps = 39/364 (10%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A A K++++ L F + + ++ + + + E L + E Sbjct: 279 AADAKELKLIASQPSLTHVFNVANFDGIVDIQNEIVSQVCSGVDEQLGELVSGEEAIEPP 338 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS---- 175 + +++IS + S + Y +D++ Q + Q + Sbjct: 339 ANLVATQISSRSIRITWSPSPSEITGYKIDMIPMLAGVKQQSLSVGPQTTAFNVKDLSAD 398 Query: 176 ----FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++ PI ++E + + C+ D + + + Sbjct: 399 TEYQINVYAMKGLTPSEPITIMEKTQPVKVQVECSKGVDVKADIVLLVDGSYSIGIANFV 458 Query: 232 KNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LIL 289 K + FL+ + E V + L+ Y+ + ++ + Sbjct: 459 K--VRAFLEVLVKSFEISPEKVQISLVQYSRDPHTEFTLNR-YNRIDDIIQAINTFPYRG 515 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ AM + + + + +I +TDG KS+ Sbjct: 516 GSTNTGKAMTYVREKIFVTGRG-------------ARPNVPRVMILITDG-----KSSDA 557 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQ 407 K + + ++I + + + L+ + P + Y V + FQ IS Sbjct: 558 FKDPAIKLRNSDVEIFAVGVKD--AVRTELEAIATPPAETHVYTVEDF----DAFQRISF 611 Query: 408 LMVH 411 + Sbjct: 612 ELTQ 615 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 65/183 (35%), Gaps = 22/183 (12%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K +++ + V +GL Y+ + + K +Q + + +L K Sbjct: 1215 FKTVRSFIARIVEVFDIGPDKVQIGLAQYSGDPRTEWQL--NSHKTKQSLMDAVANLPYK 1272 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 ++ M +F + +K + +TDG++ + Sbjct: 1273 GGNTLTGMAL-----------NFILRNNFKPEAGMRPGARKIGVLITDGKSQDDIV---- 1317 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQL 408 + K+ +++ + I + LK S P+ + YNV + L+++ +++ Sbjct: 1318 -APSQRLKDLGVELYAVGI--KNADENELKQIASDPDETHAYNVGDFTLLVNIVDDLTVN 1374 Query: 409 MVH 411 + + Sbjct: 1375 LCN 1377 >gi|148657485|ref|YP_001277690.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569595|gb|ABQ91740.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 459 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 63/185 (34%), Gaps = 26/185 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQ 278 TK+A+ K AL L + L+ ++ +V+ +++ S+ K + Sbjct: 100 SGSMSGTKLASAKEALRQALHFLQDGD------VFSLVTFSDQVQTHLKAESYAQRKRDK 153 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + Q + ++ ++ L+D Sbjct: 154 MENLLDEIRASGMTALDGGLAQGIDLGQKKRQ------------------ATTLVLLLSD 195 Query: 339 GENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G+ N ++++ I + KA+++ + + T+ + N +++ Y++ Sbjct: 196 GQANVGETDLEKIGLRAQKARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYHIQEGSQ 255 Query: 398 LIHVF 402 + Sbjct: 256 IPAAL 260 >gi|226498336|ref|NP_001143188.1| hypothetical protein LOC100275688 [Zea mays] gi|195615532|gb|ACG29596.1| hypothetical protein [Zea mays] Length = 599 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/236 (11%), Positives = 71/236 (30%), Gaps = 40/236 (16%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + M + +++++ SM Q + +++ LKNA Sbjct: 31 AVVRVEAPSSMKNHAPIDLVTLININQSMSWP-----------AASQTEIPSRLDLLKNA 79 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKP 291 + + + + ++ + +V K + + + Sbjct: 80 MKFIIRQLGDDD------RLAIVAFNDKVIKENTTGILEISGSGRMAIEKKVDGLVAMGD 133 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T P+++ A ++L FI+ ++DG + K +I Sbjct: 134 TAFKPSLEHAVKLLDDR----------------DDKKRAGFIVLISDGLDGQSKWGDESI 177 Query: 352 KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSLIHVFQ 403 D + + T + + + + L S + ++ N D +I F Sbjct: 178 TPTDPIRGLLRKYPVHTFGLGKAHDPKALHYIADISYGIYSSIVTDNLDKIIEAFA 233 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + VD+SGSM K Sbjct: 141 NKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDAPD-------------------K 181 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 182 LPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPGDQQGR-IVEAIDSLQ 234 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 235 SGGSTAGASGIELAYKAAQQGYLRGGIN----------------RILLATDGDFNVGVTD 278 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 279 FDALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTHE 338 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 339 LGATLATIARDV 350 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 77/233 (33%), Gaps = 59/233 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD T++ K + ++ + +G++ + + + Sbjct: 97 DISNSMLAQDVAPTRLDRSKRLVEDLVNRFT-------NDKIGIVVFAGDAFVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + + TD A++ + + K Sbjct: 150 YISAKMFLNNISPELIGSQGTDIGKAIELSEHSFSEKANFG------------------K 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGEN+ + ++ +A++ I++ + I + +GQ Sbjct: 192 AIIIITDGENHEKGA----EEMAREAQKQGIRVFILGIGSPQGAPIPMGNGSYLQDTSGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVF-QNISQ----LMVHRKYS 415 ++ K + + +V N + + I + + YS Sbjct: 248 TVMTRLNEDMCRKIAEAGKGMYIHVDNTTAAETILDNEIGKMQRGEIESVSYS 300 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/366 (10%), Positives = 94/366 (25%), Gaps = 33/366 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A S++ + + +A + + + +S + + Sbjct: 790 AARDQSSSVFQTSTNLAPRKRVIFQLTYQEALQRKRGIYQYGVSFRMLQPVSMFSITVSI 849 Query: 124 SSRISMTHMA-------NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 S + ++ + + + + + Q Sbjct: 850 SESVPLSTVNALGLETEQTSVPGPVPLQGITTVRNSPVSAVITYTPTSNQQHLISPFGLN 909 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALK 232 ++ ++ + V+D S H ++ P D + K+ ++ Sbjct: 910 GKFVIEYDVFRDRTTEM--VIDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFGIKIDQVR 967 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILK 290 A+ L + + + + + GT + Sbjct: 968 QAMNTILHGLAETDFFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLNTTVVTRG 1027 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 TD A++ A Q+ S + F++ LTDG + + Sbjct: 1028 GTDILQAVEAAIQLFDSAATG-------------GTNTASDFMVLLTDGRPTDGTVSSTA 1074 Query: 351 I-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNI 405 I I TI + L K + ++ + + + ++ I Sbjct: 1075 IISAIRNLNRGRFGINTIGFGTLVDMNLLRKIAAQNSGTSIQIFIDLNSYAQISNFYEEI 1134 Query: 406 SQLMVH 411 SQ ++ Sbjct: 1135 SQPILS 1140 >gi|218196035|gb|EEC78462.1| hypothetical protein OsI_18329 [Oryza sativa Indica Group] Length = 614 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 60/179 (33%), Gaps = 28/179 (15%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ +K A+ +F+ + + ++ + T V + E ++ +E+ R +D Sbjct: 46 RLEHVKEAMEIFIGKLGPDD------RLSVVSFATSVRRLTELTYMSEQGRAVAKEIVDG 99 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T+ A+ + IL K G + S ++FL+DG N+ Sbjct: 100 LVADGSTNMGAALLEGAMILRDRK-----------GARDESNGRVGCMMFLSDGTNDEIY 148 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQ 403 + + T + + N + + + V N + F Sbjct: 149 KEDISGEF---------PAHTFGLGSDHNPNVMRHIADETSATYSFVNRNIADIKGAFD 198 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 13/142 (9%), Positives = 43/142 (30%), Gaps = 2/142 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ + + +++ L +I V +++A +A LAGA Sbjct: 2 LSRIVRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 KMV--SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 ++ + +++ + +HI + + + + + Sbjct: 62 ELDGRDDAITRAQTAIEKISNSAAFSAGGTGMSLGSHISVTYDAGNDAGSTVTVLFLKDI 121 Query: 125 SRISMTHMANNRLDSSNNTIFY 146 T + ++ + + Y Sbjct: 122 PANDDTPIPSSMETTVASEASY 143 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 671 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFNSAKDMKKAVAHMKY 727 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 728 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRVAIVFTDGRAQDDVS- 776 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 777 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMDEISEKL 830 Query: 406 SQLMVH 411 + + Sbjct: 831 KKGICE 836 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 68/207 (32%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIMDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA+ I I I + LK Sbjct: 155 RPLKENVPRVIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQVDFN--TLKAI 207 Query: 383 VSSP--EYHYNVVNADSLIH---VFQN 404 S P ++ + V N + VFQ Sbjct: 208 GSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 61/178 (34%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + +G++ Y+ R + + T++ Sbjct: 61 QWVSNLVETFEIGPDKTRVGVVRYSDRPSTEFDLGKYKTREEIKEAARKIQYYGGNTNTG 120 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + ++ G ++ +K I LTDG + +F + Sbjct: 121 DALRY----------ITTYSFSKEAGGRLSDRTVKKVAILLTDGRSQDFVLDP-----AT 165 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 A + I+I + + + + L S P+ + ++V + +++ + + + + Sbjct: 166 AAHQAGIRIFAVGVGEALKEE--LDEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCE 221 >gi|219886181|gb|ACL53465.1| unknown [Zea mays] Length = 561 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 58/194 (29%), Gaps = 27/194 (13%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGT 273 AP + + +++ LK A + ++ + ++ ++ R + + Sbjct: 7 APTERTRTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTA 60 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + R + T PA ++A ++L + F I Sbjct: 61 DGRRNAIRSLDQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLGF--------------I 106 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + S + + + + + + + + LL S + Sbjct: 107 VLLTDGAEDASGSFTLSERRREVIRGALRKYPVHAFGLGTAHGPEVLLYLAQESRGTYSF 166 Query: 392 V--VNADSLIHVFQ 403 V N + Sbjct: 167 VDGDNVGEVAGALA 180 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/251 (13%), Positives = 70/251 (27%), Gaps = 31/251 (12%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 +K S I + P F E + S + D + + + +K Sbjct: 1 MKKTFSLILMFGLLLTAASPAFAAENANENSFFKPKNNTAILLDASGSMAKRIDGVSKYN 60 Query: 230 ALKNALLLFLDSIDLLSHVK--------EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ ++ F D I S V+ + G + + + + ++ Sbjct: 61 MAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQSCESIRGVYGFQRFDK--QSFLN 118 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T A++ A K + LTDGE Sbjct: 119 SLNGIGPTGWTPIAKALEDAKASFNG-----------------VHKLGSKSVYLLTDGEE 161 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIH 400 IK + ++ IK+ I + + +L + +Y + + Sbjct: 162 TCGGD---PIKTAKELRKQHIKVNVIGFDFNEGFNGQLHAIAGAGGGKYYEAHSQKDMNR 218 Query: 401 VFQNISQLMVH 411 +F+ + + Sbjct: 219 IFKLEASSLGE 229 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/356 (10%), Positives = 98/356 (27%), Gaps = 49/356 (13%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + AKR + + ++ + SA ++ +S+ + Sbjct: 504 ADQYAVLLGSDAKRITESSKYKVKEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVG- 562 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + T + ++ I Sbjct: 563 -EVPRTTVTNTVVTAPRYRKYIKANNDGTY------DLSLNVTGTQSGSSQTTVSPADIV 615 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDV 252 +V D SGSM M + +++ K A+ + + ++ Sbjct: 616 VVFDTSGSMSNPMGHN---------------SRLEVAKTAVNSMAQHLLTSENQGKDSNI 660 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK-- 310 M L+ ++T + + V T+ A+K A LTS +K Sbjct: 661 RMALVPFSTTAGNVSNFTDNAMDIVSAVNGL---GADGGTNWEAALKAANAKLTSGRKGV 717 Query: 311 -------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN----------NNFKSNVNTIKI 353 + V+ + + I DG ++ + + Sbjct: 718 KKYIVFMSDGDPTYRTSSVRTGTDWLGRPIYDADDGWGLPAGVHGSGLSDRYGANLSSAV 777 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + ++ +++ P ++ + +Y+ + D L F +I + Sbjct: 778 AEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDELNKAFADIIGQI 831 >gi|260797332|ref|XP_002593657.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] gi|229278884|gb|EEN49668.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] Length = 949 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 51/193 (26%), Gaps = 24/193 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K L + ++ + +G++ Y+ +K Sbjct: 32 VLDGSGSVGDADFAKT-LEFAENVVNAFDIGTDLTRVGVVQYSDTPTMEFNLGVHADKGS 90 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + A++ A R P K +I +T Sbjct: 91 TIAAVNNIQYQNGGTATGAALEFA-----------------RANANWRGAPVPKVMIVVT 133 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG KS + + + I + + + L + + + + ++ Sbjct: 134 DG-----KSGDDVTAAAQALAGEGVAVYAIGVG-NYDLPELQQIANGNNNNVIELQDYNA 187 Query: 398 LIHVFQNISQLMV 410 L I+ + Sbjct: 188 LTAAIDQIAGQVC 200 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 54/158 (34%), Gaps = 24/158 (15%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G+I Y++ V++ + K D + T + A+ Sbjct: 721 VGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGAAITYMKD---------- 770 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K I +TDG++++ ++ A++ I + I + A+ Sbjct: 771 --------NSQWRPGVAKIAIVVTDGKSSDDVGPPSS-----AAQQTGITMHAIGVGANV 817 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + Q L S+ +Y V + D+L ++ + Sbjct: 818 D-QTELSQIASTSQYVTTVADYDALDAQMAQLTASVCD 854 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 56/161 (34%), Gaps = 24/161 (14%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +G+I Y++ V++ + K D + T + A+ Sbjct: 364 NKQIGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGTAITYMKD------- 416 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K I +TDG++++ + ++ + I + I + Sbjct: 417 -----------NSQWRPNVAKIAIVVTDGKSSDDVAAPSSAAQ-----QAGITMHAIGVG 460 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A+ + Q L S+ +Y NV + D+L ++ + + Sbjct: 461 ANVD-QTELSQIASTSQYVTNVADYDALDAQMAQLTASVCN 500 >gi|46445753|ref|YP_007118.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] gi|46399394|emb|CAF22843.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] Length = 362 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 81/231 (35%), Gaps = 32/231 (13%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L+VD S SM+ ++S+ + + TK+ LK F+ + Sbjct: 98 IYLIVDQSSSMNEKISSNSFFE------RGRSFTKIDLLKVMAGQFI-------LHRPSD 144 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + + E + + + ++S+ T A+ + ++ + K Sbjct: 145 LIGLVAFARVPKILSPLTLDHELLINQLNDLKAINSMEEDGTAMGYAIYKTAHLIVATKH 204 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDG-------ENNNFKSNVNTIKICDKAKENFIK 363 ++ K ++ LTDG + N + + AK I Sbjct: 205 F---AQELQKKGKPAYEIKNAIMVVLTDGFQDPNRLDYGNRLRTIELDEAIAYAKNAGIH 261 Query: 364 IVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I+++ +P+ +++ S+ Y L VF I + Sbjct: 262 LYIINVDPKFSSPQFAPHRRQIETLAESTGGQLYLANQEKDLKRVFDTIDR 312 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + V D + + T + Sbjct: 54 QFLVGLVRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMVRAIRDLVPLAQGTMT 112 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 113 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 157 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 158 AQARARGIEIYAVGVQRADVGS--LRAMASPPLEEHVFLVESFDLIQEF 204 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 401 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 457 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 458 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 507 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 508 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAEQHMSYAPDFGTMTHLLENL 560 Query: 406 SQLMV 410 + Sbjct: 561 KGSIC 565 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + V D + + T + Sbjct: 54 QFLVGLVRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMVRAIRDLVPLAQGTMT 112 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 113 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 157 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 158 AQARARGIEIYAVGVQRADVGS--LRAMASPPLEEHVFLVESFDLIQEF 204 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 58/185 (31%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 360 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 416 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 417 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 466 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + S P + + ++ H+ +N+ Sbjct: 467 -----WAARAKEEGIVMYAVGVGKAVEAD--WREIASEPAEQHMSYAPDFGTMTHLLENL 519 Query: 406 SQLMV 410 + Sbjct: 520 KGSIC 524 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + VD+SGSM K Sbjct: 126 NKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDAPD-------------------K 166 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 167 LPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPGDQQGR-IVEAIDSLQ 219 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 220 SGGSTAGASGIELAYKAAQQGYLRGGIN----------------RILLATDGDFNVGVTD 263 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 264 FDALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTHE 323 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 324 LGATLATIARDV 335 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 62/188 (32%), Gaps = 25/188 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K A +D + + +I + + I T++ + TR Sbjct: 59 LETVKRAAQKLVDRLLPSD------RLAVIVFDHVAKVLIPNQPVTDRDKIK-TRISHLA 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T ++ L + K + I LTDGEN + ++ Sbjct: 112 AMGGTAIDEGLQLGLTELIAAKAGAISQ-----------------IFLLTDGENEHGNNS 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ ++A + I + T+ N L + ++ + ++ F+ + Sbjct: 155 RCL-QLAEEAAKENITLNTLGFGYHWNQDVLEQIADAAGGSLMFIEYPQDVLIGFERLFN 213 Query: 408 LMVHRKYS 415 ++ ++ Sbjct: 214 QIISVGFT 221 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 92/253 (36%), Gaps = 38/253 (15%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 I+++ P LVVD+S SM+ + + Sbjct: 19 IMSSILAWQPQTANAASPSASKVDAVLVVDVSNSMNTSDPGKIGN--------------- 63 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSL 287 A+ +F+D + + +G++ YT V++ + +E +Q + +D L Sbjct: 64 ----EAMKMFIDMLSTQND-----KVGIVAYTDVVQREKALLNISSEADKQELKTFIDGL 114 Query: 288 ILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD++ +K+A +IL ++ + + F K T G + + Sbjct: 115 NRGAYTDTSVGVKEAIRILQDG--KTAGHAPMIVMLADGNNDFNK-----TTGRTESQSA 167 Query: 347 NVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + AK++ + I TI +NA N +L + + +AD L ++ Sbjct: 168 QDMAQAVAE-AKKSGVPIYTIGLNADGKLNKNKLADIAQQTGGKSFITSSADDLPNILSE 226 Query: 405 ISQLMVHRKYSVI 417 I + K V+ Sbjct: 227 IFA--SNLKLKVV 237 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 55/168 (32%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDS 294 ID L + + ++ Y + V+ + +++ V+R T + Sbjct: 76 FLSKMIDTLDVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIQPLSA--GTMT 133 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + + G + + K +I +TDG + + Sbjct: 134 GLAIQAAMDEVFT----------EEMGTRPANFNIPKVVIIVTDGRPQDQ-----VENVA 178 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ I+I + + + L+ S P E+ + V + Sbjct: 179 ANARTAGIEIYAVGVGRADMQS--LRIMASEPLDEHVFYVETYGVIEK 224 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/360 (10%), Positives = 98/360 (27%), Gaps = 61/360 (16%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV-FYNTEIQNIV 122 + N R E + + + + A F + ++ + I + Sbjct: 253 AVKGLAKNPGLHRYRAEIFMDGDEASANNAAFDFTRVEGPPNVLIVEGTPGTSGNITAAL 312 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S I + L ++ + + + + I + + Sbjct: 313 ESGMIGTEVIPPELLSLEAAKYAVYDSIIFN-----NVSGSDVGGKQMELIEQAVRSFG- 366 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI-------------------CQDK 223 I + + S M + E + Sbjct: 367 -------IGFMMAGGEDSFGMGGYFKTPIEKALPVSMELEGKREIPSLGLILVIDRSGSM 419 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ K + + ++ + +G++ + + + P +K + ++ Sbjct: 420 DGNKIELAKESAMRTVELMRAKD------TVGVVAFDDQPWWVVPPQKLGDK-EEVLSSI 472 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ PA+ A + + ++ II +TDG++ Sbjct: 473 QSIPSAGGTNIYPAVSSALEEMLKI------------------DAQRRHIILMTDGQSAM 514 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + D EN I + ++++ + L ++ +Y V + +L VF Sbjct: 515 NSGYQDLT---DTMVENKITMSSVAVGMDADTNLLQSLADAAKGRYYFVEDETTLPAVFS 571 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/273 (12%), Positives = 81/273 (29%), Gaps = 45/273 (16%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y + + + K++ + + + P + ++D SGSM Sbjct: 188 YKVPDSREAPFSTNVSVVETPWNSDTKLLHIGLKGYTVPLDDLPPQNLVFLIDTSGSMSD 247 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ L+ A L L ++ + ++ Y Sbjct: 248 E-------------------NKLPLLQQAFRLLLSTLREDD------TIAIVTYAGNAGV 282 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 +EP+ ++K + T +K+AY++ + Sbjct: 283 LLEPTALSDKSK-IAEAIAALTSGGSTAGHAGLKEAYRL----------------AETMQ 325 Query: 326 SLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + II TDG+ N S+ + + + +++ I + + L++ Sbjct: 326 DDDTKSRIILATDGDFNVGLSSADDMKRFVKEKRDSGITLSVLGFGRGNYNDELMQALAQ 385 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + D+L + + +V ++ Sbjct: 386 NGNGV--AAYIDTLSEARKVLVDQIVSSISTIA 416 >gi|282900568|ref|ZP_06308510.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194368|gb|EFA69323.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 426 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 20/190 (10%), Positives = 49/190 (25%), Gaps = 17/190 (8%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMD 285 M+ + NA+ +D + + ++ + E I + ++ + Sbjct: 59 MSMVINAVEQLIDQLQSGD------RIAIVAFAGSGEVIIPNQIIKDPKTIKSQLH--NK 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + L K + F + + Sbjct: 111 LKAGGGTIIGEGLSLGITELLKGSKGACSHAFLLTDGYGDNGFKIWRLQI-------GPN 163 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N +++ KA + + I + N L K + + + I F I Sbjct: 164 DNQRCLELAQKAAKLNLTINSFGFGDEWNQDLLEKIADAGGGTLAYIETPQNAIEQFNRI 223 Query: 406 SQLMVHRKYS 415 + + + Sbjct: 224 FKRIQSVGLT 233 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 67/237 (28%), Gaps = 49/237 (20%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S + +V D SGSM +T++ A K+A Sbjct: 71 SAEEETNAPSSSSSTMAPTMVVFDSSGSMITNDAGG--------------QTRIDAAKDA 116 Query: 235 LLLFLDSIDLLSHVKEDVYMGLI--------GYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 F+ + + Y G + P G + + Sbjct: 117 ARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNS--EKMIAHMDGL 174 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T ++++A L Q+ II ++DG Sbjct: 175 QPRGFTPIGESLRKAAAEL--------------------PKEGQRSIILVSDGVATCTPP 214 Query: 347 NVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 V + + KE I I T+ N P Q+ L+ ++ + N +ADSL Sbjct: 215 PVC--DVAKELKEQGIDLVINTVGFNVEPEAQQELQCIADATGGTYANASDADSLAK 269 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 69/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFILDILQFLDIGPDITRVGLLQYGSTVK 105 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 V + V R T + A++ A I S+ +G Sbjct: 106 NEFSLKTFKRKSDVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 153 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 154 RPLRENVPRVIMIVTDGRPQDSVA-----EVASKARNTGILIFAIGVGQVDLNT---LKA 205 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 206 IGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 65/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + Sbjct: 629 FEIVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLKGFSSAKDMKKAVAHMKY 685 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K Y+ + +G + S + I TDG + S Sbjct: 686 MGKGSMTGLALKHMYERSFTQV----------EGARPLSTRVPRAAIVFTDGRAQDDVS- 734 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + I + + L+ S P ++ + + ++ + + + Sbjct: 735 ----EWARKAKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 788 Query: 406 SQLMVH 411 + Sbjct: 789 KLGICE 794 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 69/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFILDILQFLDIGPDITRVGLLQYGSTVK 105 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 V + V R T + A++ A I S+ +G Sbjct: 106 NEFSLKTFKRKSDVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 153 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 154 RPLRENVPRVIMIVTDGRPQDSVA-----EVASKARNTGILIFAIGVGQVDLNT---LKA 205 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 206 IGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 65/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + Sbjct: 657 FEIVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLKGFSSAKDMKKAVAHMKY 713 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K Y+ + +G + S + I TDG + S Sbjct: 714 MGKGSMTGLALKHMYERSFTQV----------EGARPLSTRVPRAAIVFTDGRAQDDVS- 762 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + I + + L+ S P ++ + + ++ + + + Sbjct: 763 ----EWARKAKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 816 Query: 406 SQLMVH 411 + Sbjct: 817 KLGICE 822 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 69/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 48 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFILDILQFLDIGPDITRVGLLQYGSTVK 105 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 V + V R T + A++ A I S+ +G Sbjct: 106 NEFSLKTFKRKSDVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 153 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA+ I I I + N LK Sbjct: 154 RPLRENVPRVIMIVTDGRPQDSVA-----EVASKARNTGILIFAIGVGQVDLNT---LKA 205 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 206 IGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 65/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +DS+ + +GL+ Y+T+V + Sbjct: 629 FEIVKHFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLKGFSSAKDMKKAVAHMKY 685 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K Y+ + +G + S + I TDG + S Sbjct: 686 MGKGSMTGLALKHMYERSFTQV----------EGARPLSTRVPRAAIVFTDGRAQDDVS- 734 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + KAK N I + + I + + L+ S P ++ + + ++ + + + Sbjct: 735 ----EWARKAKANGITMYAVGIGKAIEEE--LQEIASEPIDKHLFYAEDFSTMGEISEKL 788 Query: 406 SQLMVH 411 + Sbjct: 789 KLGICE 794 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 75/226 (33%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + ++ +GLI + + + Sbjct: 98 DISNSMLAEDVSPSRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITGD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A++ + T + K Sbjct: 151 YVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTP------------------NSKVGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE+N + + +A+ IK+ + I + NG+ Sbjct: 193 AIILITDGEDNEGGA----EAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKDSNGE 248 Query: 377 RL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + + +V N++ V + + + + Sbjct: 249 PVKTHLNEDMCRRIATAGQGVYVHVDNSNVADTV---LGKELGKLQ 291 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 52/187 (27%), Gaps = 27/187 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--W 271 SA + +K K A L +S+ +GL+ + T + Sbjct: 404 DRSASMLASFGVSKFDMAKEAAQLATESLQPED------RIGLLAFDTETLWVVPFQLIS 457 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G V Q + T A++ L + Sbjct: 458 GGLSVAQIQEQIASLPSGGGTRIERALEVGLPALAE------------------QPTKVR 499 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + N ++ + A+ I + TI+I + L + +Y Sbjct: 500 HAVLLTDGR-SFMNDNALYQRLVETARSQQITLSTIAIGLDSDTALLKQLAAWGGGRYYY 558 Query: 392 VVNADSL 398 + Sbjct: 559 ADQPADI 565 >gi|254414094|ref|ZP_05027862.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] gi|196179230|gb|EDX74226.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] Length = 601 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 69/243 (28%), Gaps = 45/243 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 +V +P + +RP + LV+D SGSM Sbjct: 16 STEALTTLDVLVKIVPPQPEANL-KRPELNLGLVIDRSGSMA------------------ 56 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K+A + A + + V + Y VE I + TEK + Sbjct: 57 --GKKIAYARQAACYAVQQLLGSDRVS------VTIYDDIVETLIPSTLATEKNY-ITRQ 107 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + + ++ + +I L+DG N Sbjct: 108 IERIHPRNMTALHDGWVEGGKQVSQ----------------YLNPEGLNRVILLSDGLAN 151 Query: 343 NFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++N + I + + T+ + N L S +Y + + L + Sbjct: 152 KGQTNADAIASDVYGLAQQGVSTTTMGVGDDYNEDLLEVMANSGDGNYYYIDTPEQLPEI 211 Query: 402 FQN 404 FQ Sbjct: 212 FQT 214 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/402 (10%), Positives = 90/402 (22%), Gaps = 99/402 (24%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A A M+ + + + D + K + + I Sbjct: 21 AAAFAMLPEANARSSTGMDGPTMTGKVVDPDTSNYWK---FWAGGYNGKEVTTQNVGRIW 77 Query: 123 NSSRISMTHMANNRL-----DSSNNTIFYNMDVMTSYDYRLQFIEHLL---NQRYNQKIV 174 + ++ S+ + L + N K + Sbjct: 78 TDKTVKAVESGDSDFLTTLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRI 137 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + + I K+ ++A +K A Sbjct: 138 DALKTAANTFIDAIAAQ------------------------NQSITDASKQHRVAIVKFA 173 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTD 293 D + + G Y + + K + + T Sbjct: 174 GKKKTDKVG-----NDTYRDGRYTY-NYSQTMKNLTSCKGKDADSLKDTVGNINPAGSTQ 227 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT--- 350 + ++ A I + + +K ++F TDG + + Sbjct: 228 ADYGLELAENITINSGRAD----------------AKKIVVFFTDGSPTSSSGFQASVAD 271 Query: 351 --IKICDKAKENFIKIVTISINASPNGQR-------------LLKTC------------- 382 I K N I TI I + N + Sbjct: 272 SAIASAKSLKANGADIYTIGIFSGANPSADPTAEGTSKVNKFMHAVSSNYPGATSSISFW 331 Query: 383 ----------VSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + +Y+ + +A L +F+ IS ++ Y Sbjct: 332 GEWVIDYGTRAENSDYYKSATSASELEKIFEEISGSIIQTGY 373 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 43/355 (12%), Positives = 101/355 (28%), Gaps = 62/355 (17%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + ++ + + N K+A ++ F + K S + N + Sbjct: 222 TKGYTNTINYDEEYAKLSENQFKKATLNPLSTFSIDVDKAGYSNVRRMINNGDNIPYDAV 281 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + + + +++ + ++V Sbjct: 282 KIEEMVNYFDYDYPQPTDEHPFSISTDVAKTPW----------NTQTQLVRIGLQGKEYL 331 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E P + ++D+SGSM K+ L +A L + + Sbjct: 332 NEELPASNLTFLIDVSGSMEDH-------------------NKLPLLISAFKLLVHQLIE 372 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ Y + P+ G +K + + T +K AY++ Sbjct: 373 KD------KVSIVVYAGAAGVVLPPTNGDQK-EKIINALQKLEAGGSTAGGQGIKLAYKL 425 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIK 363 + K++ +I TDG+ N S+ K+ +K + + + Sbjct: 426 AEKNFKKNGNN----------------RVILATDGDFNVGASSDTAMEKLIEKKRASGVF 469 Query: 364 IVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN--------ISQLM 409 + + L+T H + VF + I++ + Sbjct: 470 LSVLGFGMGNYKDSKLETLADKGNGNHAYIDTMQEAQKVFGDEFGGTLYTIAKDV 524 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K L +D++ + + ++ Y + V+ + + Sbjct: 40 SSRSVRPEEFEKVKIFLSEMIDTL---DVGERTTRVAVMNYASTVKVEFPLRTYFDKASM 96 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ V+R T + A++ A + + +++ + F + +I + Sbjct: 97 KEAVSRIEPLSA--GTMTGLAIQTAMEEVFTEEMGTRPAAF----------NIPRVVIVV 144 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + + A+ I+I T+ + + L+ S P E+ + V Sbjct: 145 TDGRPQDQVQD-----VAASARTAGIEIYTVGVGRADMQA--LRIMASEPLDEHVFYVET 197 Query: 395 ADSLIH 400 + Sbjct: 198 YGVIEK 203 >gi|149732296|ref|XP_001503293.1| PREDICTED: collagen, type XXI, alpha 1 [Equus caballus] Length = 957 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + + T + Sbjct: 58 KWLVNITRNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLMAAMESIHYLGGNTRTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L S K + LTDG++ + + Sbjct: 118 KAIQFALDYLF----------------AKSSRFLTKIAVVLTDGKSQD-----EVKDAAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A+E+ I + I + + L + + V + ++ + + + Q + Sbjct: 157 AARESKIILFAIGVGSETEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV 216 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 86/265 (32%), Gaps = 51/265 (19%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF------LIELVVDLSGSM 203 + Q + V +P I + I LVVD+SGSM Sbjct: 325 ANVNVPLSDPISLAYGVQPQGVQSVLPVPPAEVIVAAKNAWSLNRKRADILLVVDVSGSM 384 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + K+ A K L FL I +GL+ ++T Sbjct: 385 --------------------EGDKLEAAKAGLGTFLSRILPED------RVGLVTFSTES 418 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + P+ ++ + T A+ + L S + Sbjct: 419 RLVVPPAPLSDTRIRLDDAIAVMRAQGRTALYDALIDGKEALDS--------------LP 464 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + I+ L+DG +N+ ++ + +++ +E+ I I ++ A + L + Sbjct: 465 STGDDRIRAIVLLSDGLDNSSRATLEQVRLA--FEESGISIFPVAYGADADTDALQQIA- 521 Query: 384 SSPEYHYNVV-NADSLIHVFQNISQ 407 + V +A + +F+N+S+ Sbjct: 522 -TFSRTILVQGDAGDIGQIFENLSR 545 >gi|149923979|ref|ZP_01912364.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149815157|gb|EDM74708.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 785 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 19/196 (9%), Positives = 63/196 (32%), Gaps = 25/196 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 R K+ +K+ ++ + ++ ++ Y + P+ G +K Sbjct: 363 VSGSMSSRGKLPLIKHGFTQLVEQLGAED------HVSIVVYAGAAGVVLPPTSGDQK-E 415 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T+ + + +AY++ ++ +I T Sbjct: 416 TILGALDRLEAGGGTNGSAGIVEAYELAQANFVDGGVN----------------RVILGT 459 Query: 338 DGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ N S+ +++ ++ +E+ + + + + + + + + + + Sbjct: 460 DGDFNVGLSDHDALVELIEQKRESGVFLSVLGVGGHYDDELMEQLADHGNGNYAFLDGKR 519 Query: 397 SLIHVF-QNISQLMVH 411 V + I + Sbjct: 520 EAEKVLVEEIGGTLTT 535 >gi|301610722|ref|XP_002934910.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 957 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 58/191 (30%), Gaps = 29/191 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + T + Sbjct: 58 KWVVNITSNFNIGPKFTQVGVVQYSDYPILEIPLGSYESSDDLSRRMQSIQYLGGNTQTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L + R P K + LTDG++ + + + Sbjct: 118 NAIRFAIDNLFARSLR----------------PLTKIAVVLTDGKSQD-----EVKHVAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQL------ 408 +A++N I + I + + L + + V + ++ + + I + Sbjct: 157 EARKNKITLFAIGVGSEIEESELRAIANKPSSTYVFYVEDYIAISKI-REIMKQKLCEES 215 Query: 409 MVHRKYSVILK 419 + + V + Sbjct: 216 VCPTRIPVAAR 226 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/226 (10%), Positives = 75/226 (33%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++A K + +D+ + +GL+ + + + Sbjct: 97 DISNSMMAEDVVPSRLAKSKLMVENLVDNFT-------NDKIGLVVFAGEAFVQLPITSD 149 Query: 273 TEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ D + + T+ A++ + T + Sbjct: 150 YVSAKMFLQNADPSLITTQGTNIAQAIRLSMSSFTQ------------------QDKVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------ASPNGQ 376 II +TDGE++ ++ ++ +A++ I + + + GQ Sbjct: 192 AIILITDGEDHEGEA----LEAAKEARKKGINVYILGVGETKGAPIPTPDGGYMTDDRGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 248 TVMTALNEKMCQEVAKAGEGTYIHVDNTS---DAQKQLNDELAKLQ 290 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 72/234 (30%), Gaps = 44/234 (18%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++V I+ P ++D+SGSM K+ Sbjct: 222 HRLVKIGLKAREIDTDNLPATNFVFLIDVSGSMDWD-------------------GKLDL 262 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K+++ L ++++ + + ++ Y + + + G+EK + + Sbjct: 263 VKSSMKLLVNNLRPID------RVAIVVYAGAAGQVLPSTPGSEKSK-ILESLNGLTAGG 315 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVN 349 T + AY+I + II TDG+ N SN Sbjct: 316 STAGGEGIVLAYKIAKENLIEGGNN----------------RIILCTDGDFNVGVSSNDG 359 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVF 402 K+ + +++ + + + ++ + H + N V Sbjct: 360 LEKLIENERKSGVFLSILGYGMGNYKDDKMQTLAQAGNGNHAYIDNMQEANKVL 413 >gi|224029935|gb|ACN34043.1| unknown [Zea mays] Length = 598 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/236 (11%), Positives = 71/236 (30%), Gaps = 40/236 (16%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + M + +++++ SM Q + +++ LKNA Sbjct: 31 AVVRVEAPSSMKNHAPIDLVTLININQSMSWP-----------AASQTEIPSRLDLLKNA 79 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRDMDSLILKP 291 + + + + ++ + +V K + + + Sbjct: 80 MKFIIRQLGDDD------RLAIVAFNDKVIKENTTGILEISGSGRMAIEKKVDGLVAMGD 133 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T P+++ A ++L FI+ ++DG + K +I Sbjct: 134 TAFKPSLEHAVKLLDDR----------------ADKKRAGFIVLISDGLDGQSKWGDESI 177 Query: 352 KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSLIHVFQ 403 D + + T + + + + L S + ++ N D +I F Sbjct: 178 TPTDPIRGLLRKYPVHTFGLGKAHDPKALHYIADISYGIYSSIVTDNLDKIIEAFA 233 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 49/428 (11%), Positives = 120/428 (28%), Gaps = 45/428 (10%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M F F + +F I+ A+ + SFL ++ + D +N ++S+ + A Sbjct: 1 MFYYRNFLSVFSRLNHCTNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYA 60 Query: 61 I------LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + L+ + R + I + L + Sbjct: 61 LHNNPNELSVGT----IKQREMLIKKRIGYFLDSNYKGTLLT----EEQIKLIVNQSTVS 112 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 TE I++ N +L S + N + R + L ++ +V Sbjct: 113 ITERSFYPQQFHINIELHKNIQLKSLILHMAMNPKKDFNISQR----KSSLYKKNVALMV 168 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 I + + + + + K Sbjct: 169 VPFTWTGE---------WIPPSLFTTQFTVSQDLLPSDLKTEHFKKTEYFNKRNQFFKMF 219 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ + + ++ YV + +KP++ Sbjct: 220 LSKIKENNLCIAPYHYSAIVYWSEGIFSYKLPFSTTFLYSFRDIYVKQYSTIWDMKPSNY 279 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NT 350 + ++ ++ + +KF++ + G + + N Sbjct: 280 ILDLFAGAELHSNR-------LTPADPCFRRGVIQKKFMLIIAAGNQISDRKNSAEYFKM 332 Query: 351 IKICDKA-KENF------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 C K I + ++ I+ P+ +R L C P+ +Y + + + V Sbjct: 333 KHGCTLMGKNMGKNPQEEITVYSLGISPDPDTKRDLIQCTRHPDRYYEIQSYKDIAPVID 392 Query: 404 NISQLMVH 411 + + + Sbjct: 393 RLERNISS 400 >gi|254447588|ref|ZP_05061054.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] gi|198262931|gb|EDY87210.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] Length = 1197 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 78/271 (28%), Gaps = 60/271 (22%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S + + +K T++ A++N L + ++V GL+ + Sbjct: 231 SEDYSARVVTGNYLNWQENHNDEKTVTRLKAVQNVLHDVIG-------GMDNVNFGLMDF 283 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD----KKRSFFT 315 + S + Q + D T A+ ++ + L D +R + Sbjct: 284 YGKGRVIYPVSLVNDDRVQLLKTVDDLEANGGTPLQEALHESGRYLMGDTYYYGERGVSS 343 Query: 316 NFFRQGVKIPSL----PFQKFIIFLTDGENNNFKSNVNTIKI------------CDKAKE 359 F G S ++ LTDGE ++ ++ + C ++ Sbjct: 344 AFESDGKTYDSPILNECQANAVVLLTDGEPTDYGTDNVIHDMIKPVHEKNGTRRCTSYRD 403 Query: 360 NF-----------------------------IKIVTISINASPNGQRLLKTCVSSP-EYH 389 N + T+ + Q LL+ S + Sbjct: 404 NGNDYTRYCLPEIAQFMRDFDLSSGLDGDQTAQTFTVGFKSD---QELLQDTASKGQGNY 460 Query: 390 YNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + ++ L ++I + + + G Sbjct: 461 FQANDSVELTQALKDIVRKIKSESSTFTTPG 491 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 29/241 (12%), Positives = 65/241 (26%), Gaps = 64/241 (26%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I + +D+S SM R + + + F+ Sbjct: 88 DPPTIVIALDVSPSMGAMDIPG--------------RQRFQVAREVIRGFVR-------S 126 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GL+ + + P+ E + + + T + + L+ Sbjct: 127 YPHMAVGLVLFGKEALLEVPPTIDVEYFLERLEAVRLFSLGDGTALGMGVGTSLLHLSRV 186 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + ++ LTDG+N + + A+E I + T+ Sbjct: 187 NASF------------------RAVVILTDGKNTTGEI--LPETAAEMARELGIPLFTVG 226 Query: 369 INAS-----------------------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + + L + S ++ SL +FQ I Sbjct: 227 VGSDRPVSLDVIDPSTGTRYAGVLEEGYDEETLRRIAEISGGQFFSGYTPTSLHRIFQYI 286 Query: 406 S 406 Sbjct: 287 G 287 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 70/250 (28%), Gaps = 33/250 (13%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L + + + R G L D D Sbjct: 14 VLLLDLCDSYDLNNPS--YILAQSYAQRRNNGLPHRTLNPAATDSQCRSRPLDLVFIIDS 71 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + ++ K+ D +D L + + ++ Y + V+ Sbjct: 72 SRSVRPGEFEKVKI--------FLADMVDTLDVGPDATRVAVVNYASTVKIESLLKSHLT 123 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K + T + A+K+A + G + S K I Sbjct: 124 KDTIKQAITRIEPLAAGTMTGMAIKKAMD----------EAFTEKSGARPKSKNISKVAI 173 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHY 390 +TDG + ++ A+ + I+I + + + + LK S+P ++ + Sbjct: 174 IVTDGRPQDQ-----VEEVSAAARASGIEIYAVGV----DRADMRSLKLMASNPLEDHVF 224 Query: 391 NVVNADSLIH 400 V + Sbjct: 225 YVETYGVIEK 234 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + +VD+SGSM K Sbjct: 210 NKDSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMDAPD-------------------K 250 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 251 LPLLQSSLKLLVRQLRAQD------RITLVTYAGNTSVVLPPTPGDQQGR-IVEAIDALQ 303 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY+ R I+ TDG+ N + Sbjct: 304 SGGSTAGASGIELAYKAAQQGYLRGGIN----------------RILLATDGDFNVGVTN 347 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---------S 397 + + + + I + T+ L++ + + Y ++ Sbjct: 348 FDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHE 407 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 408 LGATLATIARDV 419 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/251 (12%), Positives = 78/251 (31%), Gaps = 51/251 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + I+S + + ++ + +D+S SM + + Sbjct: 61 MLLLGISFLILSLVNPKMGSKLKTVKREGVDVVFALDVSKSM---------------LAE 105 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV- 280 D ++ K + +D + +G+I Y + + ++ Sbjct: 106 DIAPNRLEKAKQIISKIIDKLGS-------DRVGVIIYAGNSYPLLPITTDHAAANMFLQ 158 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + D + + T A++ A +F++ ++DGE Sbjct: 159 NANPDMVSSQGTAINEALELAKTYYN------------------NDEQTNRFLVIISDGE 200 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI- 399 ++ T ++ + +KI TI + G ++ S Y + + Sbjct: 201 DHQ----EETKQVAQNLSNDGVKIYTIGVGTEKGGPIPMRLNGSMIGYKKDNQGETVITK 256 Query: 400 ---HVFQNISQ 407 V ++I+ Sbjct: 257 LTPDVLEDIAD 267 >gi|221130232|ref|XP_002156394.1| PREDICTED: similar to tyrosine kinase receptor [Hydra magnipapillata] Length = 1746 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 18/175 (10%), Positives = 55/175 (31%), Gaps = 18/175 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + ++ +++ +I+ + E+ + + T Sbjct: 1166 EFLINLASTFGINPSGAHAAVVTFSSDAVLSIKLNDYFEQAPFNKSVYEIEHMNGWTRID 1225 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG-ENNNFKSNVNTIKIC 354 A++++ ++ + + LTDG + N + + + Sbjct: 1226 LALRKSLEMFEEING--------------ARKNVPRLLFLLTDGKQETNEGGAEDPVNVA 1271 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL--IHVFQNISQ 407 ++ ++I+ + I N + L S + + N D L + I + Sbjct: 1272 QLLRDRGVEIIAVGIGKGVN-RLELNNIAGSSDKVFLAENFDELIKQDFLKKIKE 1325 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/304 (12%), Positives = 87/304 (28%), Gaps = 21/304 (6%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + D RFI K AV + + + L Sbjct: 1459 AVYAGDQYNLPQDALVRFIVKQEKP--LIEKAVITSLPKLEKEFVLTFDVQFESVQDLYY 1516 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + D+ + + I + + ++ + + + L Sbjct: 1517 NVIKFSGSSVSSIYPDFWFHKSRNSTMLTASTLINNKLVSIDQNPVAFDMWNKVVLSQTF 1576 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +G++H E D + + + K+ + G++ + Sbjct: 1577 NGTVHLED-PSCEGFFDVGFILDSSGSLESNYSQEVDFLKQLASSFGISKQGSHAGVVTF 1635 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 ++ + +I+ D + T A+++A ++ + K Sbjct: 1636 SSEAKLSIQLDKYFTDADFNKAVDDIPYMGGGTRIDLALEKAIELFDTKKGS-------- 1687 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K + LTDG + I + ++ K+ I++ I I ++ N L+ Sbjct: 1688 ------RNEAPKLLFLLTDGV----QEPKMEIPVPNEIKQKIIQLFAIGIGSNVNKDELI 1737 Query: 380 KTCV 383 K V Sbjct: 1738 KIVV 1741 >gi|154149676|ref|YP_001403294.1| von Willebrand factor, type A [Candidatus Methanoregula boonei 6A8] gi|153998228|gb|ABS54651.1| von Willebrand factor, type A [Methanoregula boonei 6A8] Length = 1081 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 53/394 (13%), Positives = 101/394 (25%), Gaps = 40/394 (10%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A ++ + + S L +++ ++ I + ++S + + Sbjct: 448 DATDSGYVTATATWSSISQPLQLSWKNYPYLSELTTISSQNVNVNGKFYVTVSLTADGWN 507 Query: 115 NTEI-QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-------QFIEHLL- 165 T ++V S ++ RL + + T Y Sbjct: 508 LTGTPADVVIVSDLAAGIGGATRLTHTKAAEVGFIKNATDNTYVALASFGSAPNAGSTPY 567 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + Q + + + RP V D + NS +K Sbjct: 568 DSSDTQNLWNLQLSKSNTTYTYRPFNPYGNVWDYNLVNPANWNSISSSTAYCFNSSSQKN 627 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +L L ++ + + N P++ T V Sbjct: 628 PSSQSL--GLRACVN--ATSPYGYTYLNPWSDSKIDADLMNAGPTYKTTNQNALVNTVNA 683 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK---------------IPSLPFQ 330 TD + A Q L S S G PS Sbjct: 684 YTAYGGTDYAAGINAALQELQSKGNPSHNQTIIIMGDGVNMMAPIAPGSFESYWPSDWNP 743 Query: 331 KFIIFLTDGENNNFKSNVNTIKIC-----DKAKENFIKIVTISI------NASPNGQRLL 379 + +++G N + + K AK I I I + N Sbjct: 744 RNGTGISEGPNLWYLDESDVGKAAALNASTTAKNLGITIYGIQFPTPDNYGHNINDTAFF 803 Query: 380 KTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHR 412 + VSSP Y + ++ +FQ I + + Sbjct: 804 QQMVSSPTSTWYYAPDPTTMTGIFQQIEGQIQNT 837 >gi|260881326|ref|ZP_05404133.2| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] gi|260849116|gb|EEX69123.1| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] Length = 428 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 22/323 (6%), Positives = 77/323 (23%), Gaps = 41/323 (12%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ ++ A + + G + + +++ +++A +AA LAGA + N + Sbjct: 5 RRGQQGAILVLTAFLLPFIIAFTGMAVDFGSAYVRRSQLQNAADAAALAGAYHLDDNQAD 64 Query: 75 L---GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ H F + ++ + + Sbjct: 65 DVVLKYLKTNLDPHFTSYSYQTGDDFPDKFETLNY-HTDKQKDELDVTLRSSVEASFLKL 123 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + S + ++ + Sbjct: 124 FDIDTIPVYATAKAKVSKEKESL--------------PTDDMFNYAITVANKSYETGNPS 169 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + ++ + N N D K + Sbjct: 170 IFMVSSGMNIKGNILTNGSILFWNDRVNTLDGK-----------------------IYSA 206 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 V + ++ + N ++ T + + + D + + + + Sbjct: 207 VPLNKQVWSNKAWDNKPLAFATPDGKTFTDIHDYNKNNGIADIRIDSNSGIEKMIRNYRD 266 Query: 312 SFFTNFFRQGVKIPSLPFQKFII 334 + ++ + P + + Sbjct: 267 MAIKDREKEHIYYDDRPNKDYNF 289 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 18/53 (33%), Positives = 26/53 (49%) Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 CD AK I+I TI+ A GQ LL C S +++ + L+ F+ I Sbjct: 2 CDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIG 54 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/337 (11%), Positives = 90/337 (26%), Gaps = 54/337 (16%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISM 129 R+E + + + + F N E + Sbjct: 115 TARYEHYDENPIKQVSQAPLATFSLDVDTGSYANVRRFLNQGQLPPPEAVRVEEMLNYFP 174 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + I + + + Y L + + ++ E P Sbjct: 175 APQPVADKQDNTKPIAACIPMPFAVKYELAPS---PWNAQRTLLKVDVQ-ARDMQTSELP 230 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D SGSM ++ +++AL L ++ + ++ Sbjct: 231 PANLVFLIDTSGSMQ-------------------PAERLPLIQSALKLLVNDLRAQDNIT 271 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 Y G T V T ++ + T ++ AY+ Sbjct: 272 IVTYAG----GTHVALASTAGNNTTAIKAAIDNLD---AYGSTGGEAGLRLAYEQAEKGF 324 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTIS 368 + I+F TDG+ N ++ I+ K +E I + T+ Sbjct: 325 IKGGVN----------------RILFTTDGDFNLGITDPKDIEALVKKEREKGITLSTLG 368 Query: 369 INASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + V ++ Sbjct: 369 VGDDNFNEAMMVRIADVGNGNYSYIDSLSEAQKVVKD 405 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 69/200 (34%), Gaps = 33/200 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D+ ++ +N + ++ E+ +G++ +T+ K Sbjct: 616 DDSGSMVSNDRYNQRLTVAQNLI----------DNLPENSKVGVVKFTSSTTKLTTSLTS 665 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ + T A+ ++ + + K Sbjct: 666 DKETAKSYLTTSYFRSSGGTSMYTAINSSFSMFEA-----------------TDDNILKM 708 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG---QRLLKTCVSSPEYH 389 +I L+DG + + A N +KI T+ + +S + Q L ++ Sbjct: 709 MIVLSDGATSYTY---LHSSVVTTANNNGVKIYTVGLGSSSSSYFTQYLKPLANNTGGAF 765 Query: 390 YNVVNADSLIHVFQNISQLM 409 Y +A L ++++I++ + Sbjct: 766 YLASDASQLEDIYKDINKKI 785 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 62/167 (37%), Gaps = 21/167 (12%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 S++L+ ++ Y+G++ +++ P T + R+ + S+ T+ Sbjct: 326 AVAGSLELIRSARDQDYIGVVAFSSSARWVFRPRPMTPQGRREAESLLLSVRAGGGTEIG 385 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A +A Q L +G+K + LTDG + T++ Sbjct: 386 EAYAEALQAL--------------RGLKTEDKQV----LVLTDGLVQDPTLP--TLQAAH 425 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +A+ N I+ +++ + + L + Y+V + L F Sbjct: 426 QAQANKIRTNAVALGSDADRAFLRELAKQGGGTFYDVPSPKDLPRFF 472 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 60/178 (33%), Gaps = 33/178 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTR 282 K+ + A + ++++ + +G++ + + + + +++ ++ Sbjct: 407 GRKIELARLAAIGVVENLRPIDS------VGVLIFDNSFQWAVPIRKAEDRATIKKLISG 460 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T PA+ +AYQ + K I+ LTDG + Sbjct: 461 IT---PDGGTQIAPALTEAYQRILP------------------QTAMYKHIVLLTDGISE 499 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S T +A+ N + I T+ + N L K ++ Y + + L Sbjct: 500 EGDSMTLT----KEAQANHVTISTVGLGQDVNRAFLEKVASNADGKAYFLNDPSGLEQ 553 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 21/260 (8%), Positives = 70/260 (26%), Gaps = 2/260 (0%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + ++I AL+ + +G W+ + ++ A + + A + + Sbjct: 15 SRALKRDVSGTAAVIAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVRKRA 74 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + N A + + + + + + + + E+ R+ + Sbjct: 75 GDPKSQ-IDAAALNIALNSGMSSSLGNMLANSPPTSGIKAGDTSSLEVILTEVRPRLFSS 133 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 ++ + + + + L +V+ + Sbjct: 134 VFSSEPVSIRARAVASIVAGSQACVLALSPAASGAVTLSGSTVVNLKGCDIASNSTAADA 193 Query: 191 FLIELVV-DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 FL+ LS ++ + +T +++ ++ + Sbjct: 194 FLMSGSSAALSAGCASSVGGAVTTIQLKLTQCATVKTSAPVVRDPYASVVEPAAFGTCQN 253 Query: 250 EDVYMGLIGYTTRVEKNIEP 269 +V T +N Sbjct: 254 SNVGTPSTSTTLTPTENHPI 273 >gi|301609920|ref|XP_002934508.1| PREDICTED: vitrin-like [Xenopus (Silurana) tropicalis] Length = 779 Score = 64.1 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/386 (11%), Positives = 108/386 (27%), Gaps = 49/386 (12%) Query: 51 NSMESANNAAI---LAGASKMVSNLSRLGDR-FESISNHAKRALIDDAKRFIKNHIKESL 106 N M A + + L G + + S K A+ ++ +++ ++L Sbjct: 418 NEMAEALDVGLSGPLMGVVQYGDDPSTEFGLGAHFNVGDLKNAIEKIPQKGGHSNVGKAL 477 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 S + +++ N ++ +++ + D + + + + Sbjct: 478 SYVNKNYFSDSSGNRGSAPNVAVVLVDGWPTDKVEEASRLARESGINIFFV-----TIEG 532 Query: 167 QRYNQKIVSFIPALLRIEM-------GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 N++ P + + L ++ ++ + D + + Sbjct: 533 ADDNERSSVVEPNFVDKAVCRTNGHFSINVPSWFGLHKVVNPLVNRMCDIDKLVCSKTCL 592 Query: 220 CQDKKRTKMA--------ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + L + + D +G + YT Sbjct: 593 NAADIGFVIDGSSSVGTGNFRTVLQFIANITNEFEISDTDTRIGAVQYTYEQRLEFGFDK 652 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + K T + A+ A + L S K +K Sbjct: 653 YSTKQDVMNAIMRIGYWSGGTSTGAAITYASEQLFSKSK----------------PNKRK 696 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YH 389 +I +TDG + + N + + I + Q L++ + P+ + Sbjct: 697 ILIVITDGRSYDD-----VRAPAAAVHRNGVIAYAVGI--AWAAQDELESIATDPDKDHS 749 Query: 390 YNVVNADSLIHVFQNISQLMVHRKYS 415 + V + DSL I Q + S Sbjct: 750 FFVEDFDSLYKFVGKIFQNICTEYNS 775 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 64.1 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 62/201 (30%), Gaps = 20/201 (9%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + Q + +KN + +DS + + + G++ ++ V+ + Sbjct: 47 FDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKF 106 Query: 273 TEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K D + T+ ++ + S + + Q Sbjct: 107 KTK-EDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG------------GNPDKQ 153 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 K +I +TDG++N K K + ++ I I + L+ + Y Sbjct: 154 KIVILMTDGKSNAG---APPQHEAHKLKAEGVTVIAIGIGQG-FVKTELEQIATMKNYVL 209 Query: 391 NVVNADSLIHVFQNISQLMVH 411 + L + + + L Sbjct: 210 TTNSFSELSTLLKLVIDLACE 230 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 70/208 (33%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 50 DSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 107 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 108 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 155 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 156 RPLRENVLRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKA 207 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 IGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 672 FEIVKQFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 726 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 727 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSTRVPRVAIVFTDGRAQDDVS- 777 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KA+ N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 778 ----EWASKAQANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISEKL 831 Query: 406 SQLMVH 411 + + Sbjct: 832 KKGICE 837 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 89/270 (32%), Gaps = 53/270 (19%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + S N+I S + E N R + I + + Sbjct: 517 DTFYDSENSIVSIETEHFSKYMVVDKTEWFNNWRKIYNSYAAIFSAI--------PSYTA 568 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + VD SGSM S +D ++ A+ +++S+ + + Sbjct: 569 ICVDCSGSMSTNDKSFKDDNGVLTCYRNI----------AVQNYVESMFVFDNAS----- 613 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 +I + + + E + ++ ++ T++ A+ A L Sbjct: 614 -IITFESSASEECEMTNN----KRTLSGKASFYNRGGTNANSAIDIAIDELNHVYG---- 664 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 +K II L+DG+ N N IK C K N I+I T+++ + N Sbjct: 665 ---------------KKNIILLSDGDVNVSDDN---IKYC---KNNGIRIHTVALGSGAN 703 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 Q L + + A+ L ++++ Sbjct: 704 SQLLKQYANDTGGTPLTATTAEGLTKIYES 733 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 50/163 (30%), Gaps = 21/163 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL---- 287 KN + +D + ++G+I ++T +I V D Sbjct: 117 KNYVKSIVDM---GDISDQGTHVGIITFSTDPVIDIPFDKYKGVKMNAVNIKRDIDELRR 173 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKS 346 T A+ A + L + + QK + ++DG + Sbjct: 174 KKGYTFIDKALTLADKSLFTQEAGM-------------REDSQKVAVLMSDGIQTKDRGP 220 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 TI ++ K +++ T+ I AS + L+ Sbjct: 221 FTPTIIAANRLKMKGVQVYTVGIGASIDVLELMWIASEDTGLF 263 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 73/224 (32%), Gaps = 27/224 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM+ + + PE A T++ LK ++ + +D + Sbjct: 69 PIDVVAVLDVSGSMNDPVAASPESNLQA--------TRLDVLKASMKFIIRKLDDGDRLS 120 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + G + + R P +++A +IL + Sbjct: 121 IVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGS-ALMPELQEAVKILDERQ 179 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S F I+ LTDG++ + K + T ++ Sbjct: 180 GNSRNRVGF--------------ILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTFAL 222 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 A+ + + LL S + V + + L + ++ + K Sbjct: 223 GAAHDPEALLHIAQESRGTYSFVDDGN-LDKIAGALAVCLGGLK 265 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 70/231 (30%), Gaps = 44/231 (19%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +PA R + L+VD SGSM KM + A Sbjct: 106 IDVPAARAARGQPRAPAAVVLLVDASGSMQGP--------------------KMENARAA 145 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTD 293 F+D + + + + + + P+ G T+ Sbjct: 146 AQAFVDRLPDGD------LVSVASFADTAQARVAPTVLGRSTRPAVARAIAALGPDGSTN 199 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIK 352 +K A + + PS + ++ ++DG+ N S Sbjct: 200 LFAGLKLA----------------EQHALAAPSTHAVRRVVLISDGQANIGPSSPDILGA 243 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + + ++I +I + A + + L V S Y++ A + V + Sbjct: 244 LAQRGAAHGVQITSIGVGADYDERTLNALAVGSSGRLYHLTEAREMSSVLE 294 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 73/224 (32%), Gaps = 27/224 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM+ + + PE A T++ LK ++ + +D + Sbjct: 45 PIDVVAVLDVSGSMNDPVAASPESNLQA--------TRLDVLKASMKFIIRKLDDGDRLS 96 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + G + + R P +++A +IL + Sbjct: 97 IVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGS-ALMPELQEAVKILDERQ 155 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S F I+ LTDG++ + K + T ++ Sbjct: 156 GNSRNRVGF--------------ILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTFAL 198 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 A+ + + LL S + V + + L + ++ + K Sbjct: 199 GAAHDPEALLHIAQESRGTYSFVDDGN-LDKIAGALAVCLGGLK 241 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 43/519 (8%), Positives = 114/519 (21%), Gaps = 121/519 (23%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSR------------------------LGDRFESISNHAKRALIDDAKRFIK----N 100 S DD I Sbjct: 67 NSGRKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTNDDKMNLIPICKTV 126 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---MT 152 + S+ + + + +++ ++ + + + Sbjct: 127 NNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTLEVIPQQVNVASQSRAFKKNTFNIPI 186 Query: 153 SYDYRLQFIEHLL----------------NQRYNQKIVSFIPALLRIEMGERPIFLI-EL 195 + + I LL + + + Sbjct: 187 DLMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTPF 246 Query: 196 VVD---LSGSMHCAMNSDPEDVNSAPICQDKKRTKMA--------------ALKNALLLF 238 + + S + D P+ +K K Sbjct: 247 ALGAEIDASSCALPYHWDKGKDPDTPLKIKNILSKKEGNNQQSRAEFIEHLVYKMNTKAT 306 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA- 297 LD++ + K G KN + + + T ++ Sbjct: 307 LDNVGEKQNYKVTFPKGNFCLKNMKNKNHGW-FSRKDKSDFTNHVRSLRADGATLASSGV 365 Query: 298 MKQAYQILTSDKKRS-------------FFTNFFRQGVKIPSLPFQKFIIFLTD------ 338 + A ++ + N + +T+ Sbjct: 366 LVAANNMIRDGSRTEQLKEQTKRVILVLSDGNDEIVKGDPNNKVPFLNYTRITENLIYGR 425 Query: 339 --------------------GENNNFKSNVNTIKICDKAK--------ENFIKIVTISIN 370 + + T +C+ + + KIV + Sbjct: 426 QEVFLSQKQKISLSHSTIYTYLTTDTQPKETTNGMCEVIRNKLDTLNGDKNTKIVFVEFG 485 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 ++ + CV +Y+ + +SL++ F+ I + Sbjct: 486 YKSTSKQAWEHCVG-NGNYYSANDKESLLNSFKQAIGET 523 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/270 (13%), Positives = 84/270 (31%), Gaps = 51/270 (18%) Query: 145 FYNMDVMTSYDYRLQF---IEHLLNQRYNQKIVSFI----PALLRIEMGERPIFLIELVV 197 +S RL F E + R + ++ + + E R I V+ Sbjct: 12 ATAQAPNSSVAARLTFESTSEFVAYSRGQRHKINSVFHVQAPEYQGEEAARGAVDIVAVI 71 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D+SGSM +K+ K L + ++ +MGL+ Sbjct: 72 DVSGSM--------------------SGSKLDLAKATLEFLIKNLSQTD------HMGLV 105 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y + V + + ++ T + T+ + + + +++ ++ + + Sbjct: 106 VYHSDVSVAFPLTRMDAEGKRTATAALSTLRAQRCTNLSGGLFKGIEMMQGRERSAASVS 165 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPN 374 ++ +TDG N N I + + + T ++ Sbjct: 166 ---------------SVLLMTDGIANEGVRGPNLITATRQLMGDNPSYSLYTFGYGSNHE 210 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + L +Y + N D++ F + Sbjct: 211 EELLKDLSEVGNGMYYYIENNDTIPESFGD 240 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 18/195 (9%), Positives = 54/195 (27%), Gaps = 25/195 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T+ K +S ++ + + T + ++ Sbjct: 153 NSGSINDTEFDNFKEFAKKLAESFT---ISATYTHVAAVYFNTLANFGFNLKYDINVIKT 209 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + I T A+ + P + ++ LTD Sbjct: 210 AIDNLPN--IGGGTHIGKALTYTLDNVFKVA---------------PRQNVKNVLVVLTD 252 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G++++ +V + +++ + + A + L S P ++ ++V + Sbjct: 253 GKSHD---SVTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFS 309 Query: 397 SLIHVFQNISQLMVH 411 + + + Sbjct: 310 QIESTTGAVEDEICK 324 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 62/201 (30%), Gaps = 20/201 (9%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + Q + +KN + +DS + + + G++ ++ V+ + Sbjct: 50 FDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKF 109 Query: 273 TEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K D + T+ ++ + S + + Q Sbjct: 110 KTK-EDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG------------GNPDKQ 156 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 K +I +TDG++N K K + ++ I I + L+ + Y Sbjct: 157 KIVILMTDGKSNAG---APPQHEAHKLKAEGVTVIAIGIGQG-FVKTELEQIATMKNYVL 212 Query: 391 NVVNADSLIHVFQNISQLMVH 411 + L + + + L Sbjct: 213 TTNSFSELSTLLKLVIDLACE 233 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 70/250 (28%), Gaps = 33/250 (13%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L + + + R G L D D Sbjct: 14 VLLLDLCDSYDLNNPS--YILAQSYAQRRNNGLPHRTLNPAATDSQCRSRPLDLVFIIDS 71 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + ++ K+ D +D L + + ++ Y + V+ Sbjct: 72 SRSVRPGEFEKVKI--------FLADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLT 123 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K + T + A+K+A + G + S K I Sbjct: 124 KDTIKQAITRIEPLAAGTMTGMAIKKAMD----------EAFTEKSGARPKSKNISKVAI 173 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHY 390 +TDG + ++ A+ + I+I + + + + LK S+P ++ + Sbjct: 174 IVTDGRPQDQ-----VEEVSAAARASGIEIYAVGV----DRADMRSLKLMASNPLEDHVF 224 Query: 391 NVVNADSLIH 400 V + Sbjct: 225 YVETYGVIEK 234 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 49/154 (31%), Gaps = 1/154 (0%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + ++ + +I+ + + + + + W+ K +++S +AA L GA Sbjct: 3 LKTLIRRLCTGTEGAVAIMVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAF 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ S + +I + + I H + Y+ E+ ++ Sbjct: 63 ELGSGSNSSVISAAAIQDAGRNGFQATGGATIAVHTPPASGKYAGNPQMVEVSVSQPTTL 122 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + L + + V + L Sbjct: 123 LFSALF-LKSLQVNARAVAKTGSVGDACILALDN 155 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + ++ + + V G + Y+ E + ++ T + Sbjct: 835 MNLTIHLVEKANVGRGQVQFGALKYSKEPEDLFYLNTFSKGAAITENLRRRRDTYGETYT 894 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A T + N ++ +I +TDG +++ Sbjct: 895 AKALEHANSQFTEEHGSRIKQN------------VKQMLIVITDGVSHD---REQLSDTA 939 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K + I I + + + Q L+ +V N D L ++Q + + M Sbjct: 940 LKLRNKGIIIYAVGVGEAD--QYELEAIAGDKNNTRHVDNFDKLKDIYQFLQEGMC 993 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 44/382 (11%), Positives = 101/382 (26%), Gaps = 78/382 (20%) Query: 37 IGFLIYVLD--WHYKKN---SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 G I ++ + + + A+L ++ + R + + A+ Sbjct: 313 AGAAIDMMRREAFSESSGSRRAQGVPQIAVL-VTNRPSDDEVREAAQKLLREDVIVFAMS 371 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + S V ++ SRI + N D +T + Sbjct: 372 IQGANNTQLEEIVSYPSRQRVSTLKSYADLERYSRIFQKKLQNEIWDQI-STRAEQRN-- 428 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 L L + + + ++ E E+ + V ++ Sbjct: 429 ------LDETGCLDTKEADLYFLIDGSTSIQNEQFEQIKRFMLEVTEV------------ 470 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 S V +G++ Y+ + + Sbjct: 471 ---------------------------------FSIGPGRVQVGVVQYSHEIREEFSIGV 497 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + V + T + A+ R+G K Sbjct: 498 YSNDVDLRKAVLNIKQLTGDTHTGAALAFML-------------PKIREGRKQRPSKVPC 544 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++I LTDG++ + + T ++ + + I + I + + L+ + E + Sbjct: 545 YLIVLTDGQSQDHHRILET---ANRIRAEGVTIHAVGIGEAD--KTELQQIAGNEERVHF 599 Query: 392 VVNADSLIHVFQNISQLMVHRK 413 N DSL + + + K Sbjct: 600 GQNFDSLKSIKSEVVHSICTEK 621 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 61/168 (36%), Gaps = 22/168 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + + +G++ ++T ++ + + + D SLI + T Sbjct: 648 MKTFMKNLLAKIQIGPDKTQIGVVQFSTDAKEEFQLNKYFTQKEISDAIDRMSLINQNTY 707 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ Q + +K +KF+I +TDGE + + Sbjct: 708 TGHALEFVDQYFS--------------HLKGARPGVKKFLILITDGEARDGVRDP----- 748 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +E + I ++ + N + L+ + V D L + Sbjct: 749 ARALREKEVVIFSVGV-YGAN-RTQLEEISGDGSLVFQVEKFDDLKAI 794 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 18/167 (10%), Positives = 43/167 (25%), Gaps = 19/167 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 D ID + + +G+ + +R ++ I K + Sbjct: 1026 TTFLSDFIDNFDIQSQRMKIGMAQFGSRYQEIIHLQNSLTKTEWKTQIQTVTKSHGHPQI 1085 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + + ++ +T G+ S+ Sbjct: 1086 NLALSKVALMFHPTAGGRGKVG------------VPQTLVVITSGDPLYDVSD-----AG 1128 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 K+ I ++ + I Q L + E + + L Sbjct: 1129 RILKDLGICVLVLGIGDVHKEQLL--PITGNSEKIIAFQDFNKLKDA 1173 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLVGLVRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDVIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ QN+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLQNL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 17/171 (9%), Positives = 52/171 (30%), Gaps = 27/171 (15%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + L+ Y + + P+ G++K + T + AYQ Sbjct: 223 RPQDRITLVTYASGTRVALPPTPGSDKT-AISAAIDQLVAGGSTAGASGIALAYQAAQQS 281 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 ++ TDG+ N ++ + ++ +++ + + T+ Sbjct: 282 FIAGGIN----------------RVLLATDGDFNVGVTDFRQLKSMVEEKRKSGVSLSTL 325 Query: 368 SINASPNGQRLLKTCV-SSPEYHYNVVN--------ADSLIHVFQNISQLM 409 ++L++ + + + N + I++ + Sbjct: 326 GFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEISSTLATIARDV 376 >gi|319941893|ref|ZP_08016214.1| hypothetical protein HMPREF9464_01433 [Sutterella wadsworthensis 3_1_45B] gi|319804546|gb|EFW01416.1| hypothetical protein HMPREF9464_01433 [Sutterella wadsworthensis 3_1_45B] Length = 570 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 15/160 (9%), Positives = 41/160 (25%), Gaps = 7/160 (4%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 HL +R R I + ++ L ++ + L + V ++M+SA + + Sbjct: 15 HLKTRVRH----FITDRRGAAAMFLGLGILPAMALGTGAVLVNREFMNLSAMQSAVDTSS 70 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 +A A + + + + Sbjct: 71 IALAKIQSIGADSYLTTAVAQQWVDEN---SKLFAETLEVKSVLPTFAEKSISVSAQYAS 127 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 ++ + + + + S I + + Sbjct: 128 KPRLTGAIGSLFSKKKELSVQAIAQFFPSSLEVVFTIDNS 167 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 62/201 (30%), Gaps = 20/201 (9%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWG 272 + + Q + +K+ + +DS + + + G++ ++ V+ + Sbjct: 59 FDDSSSIQYDNKENYQMMKDFVKELVDSFTTVGVNGRNGSQFGVVQFSQGVKTAFPLNKF 118 Query: 273 TEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K D + T+ +K + S + + Q Sbjct: 119 KTK-EDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGAEGG------------GNPDKQ 165 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 K +I +TDG++N K K + ++ I I + L+ + Y Sbjct: 166 KIVILMTDGKSNAG---APPQHEAHKLKAEGVTVIAIGIGQG-FVKTELEQIATMKNYVL 221 Query: 391 NVVNADSLIHVFQNISQLMVH 411 + L + + + L Sbjct: 222 TTNSFSELSTLLKLVIDLACE 242 >gi|15674025|ref|NP_268200.1| hypothetical protein L107379 [Lactococcus lactis subsp. lactis Il1403] gi|12725093|gb|AAK06141.1|AE006434_3 unknown protein [Lactococcus lactis subsp. lactis Il1403] Length = 1450 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 30/296 (10%), Positives = 82/296 (27%), Gaps = 35/296 (11%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + N + + + + Sbjct: 310 DVYLNVRGNVQNPIKPVDIVLVIDMSGSMQGAKETAVRQGVSDF--LSTIQNTAYADYVN 367 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V + S + + P +D + SHVK Sbjct: 368 VGIVGYSSPGNYVTGASGYITVP--------------------IDKVSSESHVKSINQAL 407 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + GTE + Q + D +++ TD P + + Sbjct: 408 APQFSGGTFTQLGLRKGTEMLEQD-SSDNQKMMILMTDGVPTFSYKVNSASKVDNVIYGQ 466 Query: 316 NFFRQGVKIPSLP--FQKFIIFLTDGENNNFKSNVNTIKICDK-AKENFI-KIVTISINA 371 +F + + + + +G +N + + + + I +I T+ I Sbjct: 467 SFAESRDEPGNTSKIQSPYYVKDINGGSNIEIRDTWAATLGEAEISKQEISEIHTLGIQL 526 Query: 372 SPNGQRLLKT-------CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 +G L + +++ + + +A+ + +N + +++ +++ I G Sbjct: 527 GNDGSYLSQEEVKSRTSLIATTGLYQDANSANDITDYLKNQANVVLS-RFNTITNG 581 >gi|24431113|gb|AAN61407.1|AF486289_1 matrilin [Biomphalaria glabrata] Length = 394 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 58/189 (30%), Gaps = 17/189 (8%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K A+ + + V + +I Y + I + T + Sbjct: 61 SSSSLRNRDFKTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDE 120 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + + K ++ + + + + + K I +TDG Sbjct: 121 VIEAIGRIPHK-----AGLRT------DTGRAIQYMHEAQLANGVVRPGVTKVSIVITDG 169 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + T + ++A+++ I + I + LL V N + L Sbjct: 170 ---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLS 225 Query: 400 HVFQNISQL 408 + ++++ Sbjct: 226 SIKESLAHQ 234 >gi|89094518|ref|ZP_01167457.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] gi|89081254|gb|EAR60487.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] Length = 445 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 26/182 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 I ++ + ++ + ++ Y +RV + + T + + + R T + Sbjct: 95 IMAINRLSQNDIVSVVSYDSRVNVVVPATKVSDTNTIARAINRI---QANGNTALFAGVS 151 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAK 358 + L K L +I L+DG N S K+ Sbjct: 152 KGANEL----------------RKFLDLNKVNRVILLSDGLANIGPSTPNELGKLGLSLA 195 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + + + TI + N + + S H V NAD L VFQ + SV+ Sbjct: 196 KEGMSVTTIGLGLGYNEDLMTQLAGFSDGNHAFVENADDLARVFQYEFGDV----LSVVA 251 Query: 419 KG 420 +G Sbjct: 252 QG 253 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 76/240 (31%), Gaps = 52/240 (21%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + +V+D SGSM T+M A K A+ +D Sbjct: 23 ASTAAAAGELSPVMVVLDSSGSMTARDAGG-------------SGTRMDAAKRAVGSMVD 69 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----------GTEKVRQYVTRDMDSLILK 290 + + V +Y G ++ EK +++ VT Sbjct: 70 GLPAGAQVGLAIY-GAGTGSSGAEKVAGCKDVRVVQPVGPVNKPALKRAVTATK---ASG 125 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A L Q+ I+ ++DGE+ + Sbjct: 126 YTPIGQALRTAAAQL--------------------PKEGQRSIVLVSDGEDTC--APPQP 163 Query: 351 IKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ + + + + TI + L ++ +++ +ADSL+ V +++ Sbjct: 164 CEVAKELSKQGVDLHVHTIGFRVDAKARAQLACIAQNTGGTYHDASDADSLLGVLGRVTE 223 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 279 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 335 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 336 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 385 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 386 -----WAARAKEEGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 438 Query: 406 SQLMV 410 + Sbjct: 439 RGSIC 443 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 3 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 61 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 62 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 106 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 107 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 153 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 268 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 324 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 325 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 374 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 375 -----WAARAKEEGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 427 Query: 406 SQLMV 410 + Sbjct: 428 RGSIC 432 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLPGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 320 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 376 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 377 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 426 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 427 -----WAARAKEEGIVVYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 479 Query: 406 SQLMV 410 + Sbjct: 480 RGSIC 484 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 430 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 486 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 487 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 535 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 536 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 589 Query: 406 SQLMVH 411 + + Sbjct: 590 KKGICE 595 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 387 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 443 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 444 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 492 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 493 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 546 Query: 406 SQLMVH 411 + + Sbjct: 547 KKGICE 552 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 252 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 308 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 309 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 357 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 358 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 411 Query: 406 SQLMVH 411 + + Sbjct: 412 KKGICE 417 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 456 FEVVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKY 512 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 513 MGKGSMTGLALKHMFERSFTQ----------GEGARPLSTRVPRAAIVFTDGRAQDDVS- 561 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 562 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTNKHLFYAEDFSTMDEISEKL 615 Query: 406 SQLMVH 411 + + Sbjct: 616 KKGICE 621 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 107 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 108 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 155 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 156 RPLRENVLRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKA 207 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 IGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 672 FEIVKQFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 726 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 727 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSTRVPRVAIVFTDGRAQDDVS- 777 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KA+ N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 778 ----EWASKAQANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISEKL 831 Query: 406 SQLMVH 411 + + Sbjct: 832 KKGICE 837 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 107 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 108 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 155 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 156 RPLRENVLRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKA 207 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 IGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 672 FEIVKQFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 726 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 727 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSTRVPRVAIVFTDGRAQDDVS- 777 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KA+ N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 778 ----EWASKAQANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISEKL 831 Query: 406 SQLMVH 411 + + Sbjct: 832 KKGICE 837 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 50 SSCENKRADLVFIIDSSRSVNTHDYAK--VKEFIVDILQFLDIGPDVTRVGLLQYGSTVK 107 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 108 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 155 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 156 RPLRENVLRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKA 207 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 IGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + + + + + Sbjct: 672 FEIVKQFVTGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLRNFNSA--KDMKKAVAHM 726 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 S + A + + +G + S + I TDG + S Sbjct: 727 KYMGKGSMTGL--ALKHMFERSFTQV------EGARPLSTRVPRVAIVFTDGRAQDDVS- 777 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KA+ N I + + + + + L+ S + ++ + + ++ + + + Sbjct: 778 ----EWASKAQANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISEKL 831 Query: 406 SQLMVH 411 + + Sbjct: 832 KKGICE 837 >gi|239613390|gb|EEQ90377.1| U-box domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 766 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/212 (12%), Positives = 60/212 (28%), Gaps = 13/212 (6%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I L +D+S SM + D + P + K+A +++++ Sbjct: 75 DIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSV--LDLTKHAARTIIETLNDND----- 127 Query: 252 VYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +G++ ++T E + S + + + L T+ +K + + L Sbjct: 128 -RLGVVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEE--V 184 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 N + + + + K K+ I T Sbjct: 185 TPIPQNVQALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQ--KDRLPMIHTFGFG 242 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + +A + VF Sbjct: 243 YYIRSGLLQAISEVGGGTYSFIPDAGMIGTVF 274 >gi|147899676|ref|NP_001087858.1| collagen alpha-1(XXI) chain precursor [Xenopus laevis] gi|82234134|sp|Q641F3|COLA1_XENLA RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|51950065|gb|AAH82384.1| MGC81791 protein [Xenopus laevis] Length = 957 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 29/191 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +G++ Y+ I + T + Sbjct: 58 KWVVNITSNFNIGPKFTQVGVVQYSDYPILEIPLGSYESIDDLSRRTQSIQYLGGNTQTG 117 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A L + R P K I LTDG++ + I + Sbjct: 118 NAIQFAIDNLFARSLR----------------PLTKIAIVLTDGKSQDD-----VKHIAE 156 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQL------ 408 +A++N I + I + + L + + V + ++ + + I + Sbjct: 157 EARKNKITLFAIGVGSEIEESELRAIANKPSSTYVFYVEDYIAISRI-REIMKQKLCEES 215 Query: 409 MVHRKYSVILK 419 + + V + Sbjct: 216 VCPTRIPVAAR 226 >gi|108756796|ref|YP_635538.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108460676|gb|ABF85861.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 700 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 68/237 (28%), Gaps = 44/237 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 R +V + +R + V+D+SGSM+ + Sbjct: 311 RKGYHVVHVGVKAREVSRPQRKPSHLVFVIDVSGSMNLE-------------------NR 351 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K AL L ++ +D V ++ Y + +EP+ Sbjct: 352 LGLVKRALHLLVNELDERDQVS------IVVYGSTARLVLEPTSAVHA-HIIRAAIDSLH 404 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ ++ Y + S +I +DG N ++ Sbjct: 405 TEGSTNAQAGLEMGYSLAASHLVEGGIN----------------RVILCSDGVANTGLTD 448 Query: 348 VNTI-KICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVF 402 N+I + I + T+ L+ + + V + +F Sbjct: 449 ANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERLSQVGEGNYAYVDRIEEAHRIF 505 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 71/233 (30%), Gaps = 49/233 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++P + +V+D SGSM + +R ++ K A+ + + Sbjct: 405 QQPDLALVMVIDRSGSMSELVGG-------------SRRNRLDLAKEAVYQASLGLTPID 451 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +GL+ + + V I T+ P ++ Sbjct: 452 ------QVGLVVFDDAANWVLPLQRLPSVVEIE-RALGSFGIGGGTNIRPGIE------- 497 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + K +I LTDG + S++ + + + I T Sbjct: 498 -----------QAAQALASADAKVKHVILLTDGIAESNYSDLI-----AQMRAAGVTIST 541 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF--QNI---SQLMVHRKY 414 ++I N L+ + Y V + + +F + I + +V + Sbjct: 542 VAIGEDANPN-LVDVANAGGGRSYRVTRIEDVPRIFLQETIIAAGRDIVEERI 593 >gi|164425557|ref|XP_960517.2| hypothetical protein NCU05555 [Neurospora crassa OR74A] gi|157070974|gb|EAA31281.2| predicted protein [Neurospora crassa OR74A] Length = 766 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 59/214 (27%), Gaps = 27/214 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--TKMAALKNALLLFLDSIDLLSHVK 249 I L +D+SGSM Q + + + +K+A + +++ Sbjct: 70 DIVLAIDVSGSMSADAPVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSD--- 126 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +G++ ++T + T ++ T+ + + ++ Sbjct: 127 ---RLGIVTFSTEAKVLQPLMPMTALNKKKTERNLGGMQPFSATNLWGGIVEGLKLFD-- 181 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 ++ LTDG N+ + + I T Sbjct: 182 ----------------GQSGRMPALMVLTDGMPNHMCPAQGYVAKLRAMETLPAAIHTFG 225 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 S L + + +A + VF Sbjct: 226 FGYSLRSGLLKSVAEIGGGGYSFIPDAGMIGTVF 259 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 59/192 (30%), Gaps = 28/192 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + LF+D + L + V G T ++ E ++ +V L T Sbjct: 56 EGVKLFVDMLGLTDNQVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEY--T 113 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-------- 344 D T +K+A ++L I+ TDG N Sbjct: 114 DITSGLKEAVKMLNQRNASG----------------NSPLIVVFTDGNNAIGGVANRTPA 157 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + I +A+ I TI +N + N L K V + + + L + Sbjct: 158 DIDKDLAAIISQAQSEGYPIYTIGLNDNGKLNEAYLEKISVDTKAKAFATKDPAELPDIL 217 Query: 403 QNISQLMVHRKY 414 I + K Sbjct: 218 TEIFAAHSNLKV 229 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 13/222 (5%), Positives = 57/222 (25%), Gaps = 3/222 (1%) Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP-ALLR 182 +RI + A L + + D + + N+ + + + Sbjct: 5 INRIFIIGFAIMLLLFPSMLMAATSDAQLDAILVIDASGSMKETDPNKLGLEGVKLFVDM 64 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL--KNALLLFLD 240 + + + + ++ D+S + ++ + D + D + + L + Sbjct: 65 LGLTDNQVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAVK 124 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + + + + + + + T + Sbjct: 125 MLNQRNASGNSPLIVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPIYTIGLND 184 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 ++ + ++ + + + +N Sbjct: 185 NGKLNEAYLEKISVDTKAKAFATKDPAELPDILTEIFAAHSN 226 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 69/192 (35%), Gaps = 31/192 (16%) Query: 235 LLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDSLILKP 291 + F+ D + L + +GL+ Y + V+ +V + V R Sbjct: 76 VKEFILDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLST--G 133 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A++ A I S+ +G + + I+ +TDG + + Sbjct: 134 TMTGLAIQYALNIAFSEA----------EGARPLRENVPRIIMIVTDGRPQDSVA----- 178 Query: 352 KICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN- 404 ++ KA+ I I I + N LK S P ++ + V N L VFQN Sbjct: 179 EVAAKARNTGILIFAIGVGQVDLNT---LKAIGSEPHKDHVFLVANFSQIESLTSVFQNK 235 Query: 405 -ISQLMVHRKYS 415 S+ + H Sbjct: 236 LCSKSVPHLSLR 247 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 72/196 (36%), Gaps = 26/196 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ ALK A+ +FL ++ S V + + N + + V+ V R Sbjct: 545 NRIGALKEAVGVFLGTLPPGSKVAVIEFNSFVNPLVFGPANEIFTTRFDDVKSQVNRF-- 602 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T A+ +A +++ + ++ ++ LTDGE+ + + Sbjct: 603 -RANGGTSYYDAVDRALELI-------------------ANQTGRRAVLALTDGEDTSSR 642 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + KA+ + + T+ + + L + + ++ +A L +F Sbjct: 643 L-AGLDSVILKARNLGLPVHTLGVGREDEIEVGELQRLARETRGRYFPARDATKLRVIFA 701 Query: 404 NISQLMVHRKYSVILK 419 ++Q + YS+ + Sbjct: 702 ELAQSL-RESYSLTYR 716 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 14/130 (10%), Positives = 41/130 (31%), Gaps = 13/130 (10%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 T+ L +L + + + + +G P V++ + ++ + Sbjct: 528 PTTVVLVLDTSGSML----QDNRIGALKEAVGVFLGTLPPGSKVAVIEFNSFVNPLVFGP 583 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 ++ + T+ +K+ + F + + D + LI T + + Sbjct: 584 ANEIFT---------TRFDDVKSQVNRFRANGGTSYYDAVDRALELIANQTGRRAVLALT 634 Query: 271 WGTEKVRQYV 280 G + + Sbjct: 635 DGEDTSSRLA 644 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 72/210 (34%), Gaps = 38/210 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVR 277 ++ L A LFL L V++ ++G++ + + E + R Sbjct: 316 SGSMTIGNRLKRLNQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQSELVQINSATER 370 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T+ + + T ++ A+ ++ II LT Sbjct: 371 DTLTKSLPTTASGGTSICSGLRSAFTVIKKKYPTDGAE-----------------IILLT 413 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA- 395 DGE+N TI C D+ K++ I T+++ Q L + + + Sbjct: 414 DGEDN-------TISACFDEVKQSGAIIHTVALG-PSAAQELEQMSKMTGGLQTYASDQV 465 Query: 396 --DSLIHVFQNIS---QLMVHRKYSVILKG 420 + L+ F +S + + R + +G Sbjct: 466 QNNGLVDAFAALSSGNKAVSQRSIQLESRG 495 >gi|291388581|ref|XP_002710673.1| PREDICTED: collagen, type XXII, alpha 1 [Oryctolagus cuniculus] Length = 1571 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 58/178 (32%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D E +G++ Y+ R E + + + T++ Sbjct: 59 QWVANLVDTFEVGPEHTRVGVVRYSDRPSTAFELGHFSSREEVKAAARRIAYHGGNTNTG 118 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + + G + FQ+ I LTDG + + + Sbjct: 119 DALRFITARSFSP-----------QAGGRPGDRAFQQVAILLTDGRSQDLVRDAAAAAH- 166 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 I+I + + + L + + + ++V + D++ + + + + Sbjct: 167 ----AAGIRIFAVGVG-EALREELHEIASEPTSAHVFHVSDFDAIDKIRGKLRRRLCE 219 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++V I P + ++D+SGSM + Sbjct: 224 NEQHRLVRIGLKAKEIPTENLPASNLVFLIDVSGSM-------------------YGPER 264 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y+ + + + G++K ++ + Sbjct: 265 LDLVKSSLKLLVNNLREKD------KVAIVIYSGAAGEKLASTPGSDK-QKIREAIDELE 317 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY+I + II TDG+ N S Sbjct: 318 ASGSTAGGEGIMLAYKIAQKNFISGGNN----------------RIILCTDGDFNVGVSS 361 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ I + + ++ H + N Sbjct: 362 DKELEKLIEQKRKSGIFLTVLGYGMGNYKDNKMQTLAEKGNGNHAYIDNLQE 413 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RSSIC 525 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 279 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 335 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 336 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 385 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 386 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 438 Query: 406 SQLMV 410 + Sbjct: 439 RGSIC 443 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 400 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 456 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 457 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 506 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 507 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 559 Query: 406 SQLMV 410 + Sbjct: 560 RGSIC 564 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 320 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 376 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 377 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 426 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 427 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 479 Query: 406 SQLMV 410 + Sbjct: 480 RGSIC 484 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 402 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 458 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 459 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 508 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 509 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 561 Query: 406 SQLMV 410 + Sbjct: 562 RGSIC 566 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRSEDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASKPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 57/174 (32%), Gaps = 21/174 (12%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++I +E +GL Y+ + + T + A+ Sbjct: 398 NTISSFPIGEEYTQIGLATYSDNPRIIFHLNKYHKLDDIRKAVLEVEHTSGGTATGKAIL 457 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + + ++ ++ LTDG++ + + AKE Sbjct: 458 YLTNNMFTHENGV-------------RPNAKRLVVVLTDGKSQDDVIVPS-----RIAKE 499 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 + I + I + G+ L+ S P+ + Y + + +L + + +S + Sbjct: 500 SGIVMFAIGVGKVVMGE--LRAIASDPDRYVYKINDFSALESIRRELSHSIASL 551 >gi|327543487|gb|EGF29906.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 464 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 78/271 (28%), Gaps = 50/271 (18%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + ++ V ++ ++ + + ERP + LV Sbjct: 13 MTATPLHQASAEQVKLDVRLVHPVMK--AGEKQTNHLRIALTGFELKSTEERPPVNVCLV 70 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SGSM K+A K A +D + V + Sbjct: 71 LDHSGSM--------------------SGQKLARAKEAAEAAIDRLSDDDIVS------V 104 Query: 257 IGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + Y + V + + ++Q + T + + Sbjct: 105 VLYDSNVTVLVPATKATDRSSIKQKIRGI---QAGSSTALFAGVSKGAA----------- 150 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASP 373 + K + +I L+DG N KS + + I + T+ + + Sbjct: 151 -----EVRKFLADEQVNRVILLSDGLANVGPKSPQELEGLGRSLMKEAISVSTLGLGSGY 205 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N ++ H + +ADSL+ VF Sbjct: 206 NEDLMVALASVGGGNHAFIEDADSLVAVFNQ 236 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 25/228 (10%), Positives = 71/228 (31%), Gaps = 11/228 (4%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + ++ + ++ + + A D + + + S+ + + +G Sbjct: 27 ICTVNQANAASLGTASIEGYDAVFVLDTSYSMRDTDPEGIAAEVISMFMDLSDADRTRVG 86 Query: 256 LIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSF 313 + Y V + + G + + +++ L TD +++ ++L + + Sbjct: 87 FVAYNHHVVASKPLTSIGVAAQKSQIQQEIRMLNRSGYTDLGLGLRKGSELLAAGASQGR 146 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ G + SN + + A+ + TI +N Sbjct: 147 QPFMILLSDGETD-------FGVSSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDG 199 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 N Q L + + + +A+ L + I + K + Sbjct: 200 TVNRQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLVPVAA 247 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 70/239 (29%), Gaps = 56/239 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + I + +D+S SM+ ++ A + A + F+D+ Sbjct: 91 SSRDTVGIDIMIALDVSDSMNTPDFGGK--------------SRFAGARTAAMRFIDN-- 134 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAY 302 + +GL+ ++ + E + + T A+ A Sbjct: 135 -----RPADRIGLVVFSGGSFTRCPLTLDHEVLGRLAETVAPGFFDEPGTAIGTAILTAT 189 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L + + K ++ +TDGENN + A I Sbjct: 190 NRLKASSSKE------------------KALVLITDGENNAGEVT--PETAARLAANYGI 229 Query: 363 KIVTI-------SINASPNGQ-------RLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +I T+ + + N L S ++ + L+ F++I + Sbjct: 230 RIYTVFAGKEARAFENTSNTALNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDIDR 288 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 48/175 (27%), Gaps = 29/175 (16%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 KM K + L LD ++ + GLI + + + T K + + Sbjct: 411 MKMELAKESAQLVLDLLEEKDY------FGLIAFDHSYQWIVPLQPLTNK-EETASLISK 463 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T P +K A + L K II +TDG+ Sbjct: 464 ISPGGGTALYPPLKSAGEALIKA------------------PIKSKHIIAITDGQTEGGD 505 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I + TI I N L Y+ N +L Sbjct: 506 FYNLVKYLAKY----KITVSTIGIGEDANIPLLKDIANWGNGRFYHTWNIRNLPQ 556 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 72/210 (34%), Gaps = 38/210 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVR 277 ++ L A LFL L V++ ++G++ + + E + R Sbjct: 314 SGSMTIGNRLKRLNQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQSELVQINSATER 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T+ + + T ++ A+ ++ II LT Sbjct: 369 DTLTKSLPTTASGGTSICSGLRSAFTVIKKKYPTDGAE-----------------IILLT 411 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA- 395 DGE+N TI C D+ K++ I T+++ Q L + + + Sbjct: 412 DGEDN-------TISACFDEVKQSGAIIHTVALG-PSAAQELEQMSKMTGGLQTYASDQV 463 Query: 396 --DSLIHVFQNIS---QLMVHRKYSVILKG 420 + L+ F +S + + R + +G Sbjct: 464 QNNGLVDAFAALSSGNKAVSQRSIQLESRG 493 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/245 (11%), Positives = 69/245 (28%), Gaps = 72/245 (29%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D ++ K ++ + Sbjct: 91 DIVFAIDVSKSM---------------LAEDVAPNRLEKAKRIAFETINQLK-------G 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 +G++ Y + + + ++ ++ + T A++ A Sbjct: 129 DRVGIVAYAASAYPQLALTTDHSAAKMFLQSMNTNMLSSQGTAIQEAIRMATNYFDDKST 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + ++DGE++ + +I +A+E I I TI + Sbjct: 189 T------------------SRLLFIISDGEDH----EMGATEIAAEAQEKGIHIYTIGVG 226 Query: 371 ASP---------------------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 N + L + +++ + N N + + Sbjct: 227 TEKGSPIPMRELGEQSYKRDRNGEVVITRLNKELLQQIAINAGGQYLNGDNTQEAVSQIE 286 Query: 404 NISQL 408 I + Sbjct: 287 KILES 291 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V I P + ++D+SGSM + Sbjct: 226 NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSMW-------------------GANR 266 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y +E + G++K ++ + Sbjct: 267 LDLVKSSLKLLVNNLRDKD------KVAIVTYAGNAGVKLEATPGSDK-QKIREAIDELE 319 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY+I + II TDG+ N S Sbjct: 320 ASGSTAGGEGIMLAYKIAQKNFISGGNN----------------RIILCTDGDFNVGVSS 363 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ I + + ++ H + N Sbjct: 364 DKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQE 415 >gi|239926966|ref|ZP_04683919.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291435315|ref|ZP_06574705.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291338210|gb|EFE65166.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] Length = 527 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 24/221 (10%), Positives = 59/221 (26%), Gaps = 43/221 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 GERP + VVD SGSM ++ + +L D + Sbjct: 162 GERPPAALTFVVDTSGSMAE-------------------PGRLDLARKSLAAMTDRLRDD 202 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + L+ ++ E + + + + T+ ++ Y+ Sbjct: 203 DS------VALVTFSDEAETVLPMTRLGGNRDEVHDAVDRLEPDRSTNLGAGVEAGYETA 256 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIK 363 ++ ++D N ++ +I + + I Sbjct: 257 VEGL----------------REGATNRVVLVSDALANTGETDADAILERISRSRRAHGIT 300 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + + + + + V + VF Sbjct: 301 LFGVGVGSDYGDALMERLADRGDGHTVYVSGEEDAREVFGE 341 >gi|196233777|ref|ZP_03132616.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196222139|gb|EDY16670.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 883 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 45/362 (12%), Positives = 98/362 (27%), Gaps = 59/362 (16%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNH--AKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 AA G L G + SN + +N + Y Sbjct: 289 AATSGGIGDNSYGLDDRGGIADKASNANSFDTLTENAFLNVPENPLSTFSIDVDTASYAI 348 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR----YNQK 172 + + ++ + L + Y F + + Sbjct: 349 VRRYLNDNHLPPTGAVRIEEL------LNYFPYDYPQPQGAAPFSATMEVATCPWAPEHR 402 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V I ERP + ++D+SGSM+ K+ L+ Sbjct: 403 LVRVGLKGREIPKDERPPSNLVFLIDVSGSMN-------------------MPNKLPLLQ 443 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 L ++ + V ++ Y + + +EP+ E ++ + T Sbjct: 444 KCFSLLVEQLGPKDRVS------IVTYASGTKLVLEPTQDKEAMQTAIDGL---HAGGGT 494 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTI 351 + + AY++ +I TDG+ N +N + Sbjct: 495 HGSSGIDLAYRMAQQSFI----------------PGGTNRVILATDGDWNIGITNQSELL 538 Query: 352 KICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVF-QNISQLM 409 + + ++ + + + +L K ++ + VF +S + Sbjct: 539 SMITRKAKSGVFLTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQEARKVFVDQLSSTL 598 Query: 410 VH 411 V Sbjct: 599 VT 600 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + + V +G + Y + E + + K + + T++ Sbjct: 456 TFLSEVVGMFNIAPHKVRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTG 515 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +L K K ++ LT+G + + ++ + Sbjct: 516 AALNFTLSLLQKAK-------------KQRGNKVPCHLVVLTNGMSKDSI-----LEPAN 557 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +E I++ I I Q L+ + Y V + D+L + + Q + Sbjct: 558 RLREEHIRVYAIGI--KEANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 612 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 28/183 (15%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKP 291 + F+ ++ + V +G++ ++ ++ + + + ++ +D + + Sbjct: 640 MKTFMKNLVSKSQIGPDRVQIGVVQFSDINKEEFQLNRFMS--QSDISNAIDQMAHIGQT 697 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + K +KF+I +TDGE + Sbjct: 698 TLTGSALSFVSQYFSPT--------------KGARPNIRKFLILITDGEAQD-----IVK 738 Query: 352 KICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++ + I ++ ++ L+ PE + V N D L + ++ + Sbjct: 739 EPAVVLRQEGVIIYSVGVFGSNVTQ---LEEISGRPEMVFYVENFDILQRIEDDLVFGIC 795 Query: 411 HRK 413 + Sbjct: 796 SPR 798 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/343 (11%), Positives = 86/343 (25%), Gaps = 43/343 (12%) Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E+ + K+ I L+ + E + + + + Sbjct: 108 KALQEAHRTYFSAPANGRDKKQFPP-ILVVLASSESEDNVEEASKALRKDGVKIISVGVQ 166 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N + + +Q I I G +E Sbjct: 167 KASEENLKAMATSQFHFNLRTVRDL--SMFSQNMTHIIKDVIKYKE----GAVDDIFVE- 219 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + +D + I ++ LK +S+ L + + +G Sbjct: 220 ------ACQGPSMADVVFLLDMSINGSEEN--FDYLK---GFLEESVSALDIKENCMRVG 268 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ Y+ + S G K S + A+K + Sbjct: 269 LVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIK-----------KLRKE 317 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F + + + + +T + + T + + + I T+ I + + Sbjct: 318 VFSARNGSRKNQGVPQIAVLVT----HRDSEDNVTKAAVN-LRREGVTIFTLGIEGASD- 371 Query: 376 QRLLKTCVSSPE--YHYNVVNADSL----IHVFQNISQLMVHR 412 L+ S P Y + L + + + H Sbjct: 372 -TQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHT 413 >gi|150251390|gb|ABR68007.1| matrilin-like 40 kDa protein [Lehmannia valentiana] Length = 390 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 51/176 (28%), Gaps = 16/176 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + G + ++ VE + T T + Sbjct: 229 LNFASRFVSHFLVGNSKARFGALLFSDFVENLFYLNKYTSTADVSKAILRAPYHRGTTLT 288 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A T K ++ TDG++ T+ Sbjct: 289 NEAFDF------------IRTEGVFSTPKGGRSNAPDIVVVFTDGQST---KPALTLAAA 333 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 D K ++IV + I + + L+ SS + + + ++L ++ Q +++ + Sbjct: 334 DNLKRQNVRIVAVGIGNEVSKEE-LRQVASSRDDVFEASSFENLDYIEQKLAKNVC 388 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 48/300 (16%), Positives = 92/300 (30%), Gaps = 41/300 (13%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 ++S +++T + + L + Y L + + + +P Sbjct: 16 GGALSSQTLTITQVDTSSLLFTQTVRAYFYVDNLPGGKVLTAADITAQESPDGASWRLLP 75 Query: 179 -ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 LR + LV+D SGSM D +D R ++ K A+ Sbjct: 76 VRSLRRGVNREEGISFLLVLDASGSM--------WDALDGTPTEDPDRMRITHAKRAIRE 127 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTP 296 FL + +GL + V + + S T+ Sbjct: 128 FLPLLSERD------RVGLAVFNRTYRMIQPIVDDPALVLEKLDAIERPSREQAYTELYR 181 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---NNFKSNVNTIKI 353 +M++A + + ++ ++ L+DGEN + KS Sbjct: 182 SMEEALTSFEEEGR-------------------RRVLVVLSDGENFPVDPEKSPATPGTA 222 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 D A I I + RL+ S + ++ NA L V+ I + ++ Sbjct: 223 VDLAHRYGITCYVIHFGTEKD--RLIGDLASETGGRVFDARNALELASVYTAIQEQVLQE 280 >gi|261880540|ref|ZP_06006967.1| BatB protein [Prevotella bergensis DSM 17361] gi|270332763|gb|EFA43549.1| BatB protein [Prevotella bergensis DSM 17361] Length = 342 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 69/226 (30%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ +D +++ K + + + +GL+ + + + Sbjct: 99 DVSNSMKAEDVAPSRLDKSKMLVESMTEHFT-------NDKLGLVVFAGDAFVQLPITSD 151 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ + T + Sbjct: 152 YVSAKMFLQNIDPSLVSTQGTDIARAITVGMRSFTQ------------------QEKVGR 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ +A++ I + + + + GQ Sbjct: 194 AIIVITDGEDHEGGA----MEAAKEARKRGINVFILGVGSTKGAPIPTGNGGYMTDATGQ 249 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 250 TVMSALNEEMCKEVAKAGSGTYIHVDNTS---DAQEQLNDELTKLQ 292 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V I P + ++D+SGSM + Sbjct: 226 NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSMW-------------------GANR 266 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y +E + G++K ++ + Sbjct: 267 LDLVKSSLKLLVNNLRDKD------KVAIVTYAGNAGVKLEATPGSDK-QKIREAIDELE 319 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY+I + II TDG+ N S Sbjct: 320 ASGSTAGGEGIMLAYKIAQKNFISGGNN----------------RIILCTDGDFNVGVSS 363 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ I + + ++ H + N Sbjct: 364 DKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQE 415 >gi|13471071|ref|NP_102640.1| hypothetical protein mll0948 [Mesorhizobium loti MAFF303099] gi|14021815|dbj|BAB48426.1| mll0948 [Mesorhizobium loti MAFF303099] Length = 408 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 47/358 (13%), Positives = 96/358 (26%), Gaps = 17/358 (4%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R R S +AN + IFALS+ + GF + W+Y +++ +AA Sbjct: 4 NRFTRIRLVATGIRRSTRANVATIFALSLPIVVGAAGFGVETSYWYYNSLRLQATADAAA 63 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 AGA + +S + + + A L SG + E+ Sbjct: 64 YAGALEQISGSDKPTIIAAATQSAATNGLGSGTIVVNTP----PASGPNTAKKAVEVIVA 119 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 N +R+ + ++ + + + L V I + Sbjct: 120 QNINRMFTSIFTQGQVPEQARAVALITNASKACVLALDPSASQAALFSGNTSVKLIGCSV 179 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ L + +C + A L + Sbjct: 180 MSNSIASDAIKLQGSAGLQADCLISGGGVSLSNPVTTVCAAPITQALPAADPFADLPAPA 239 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + Y + + N+ S G V+ + + ++ I Sbjct: 240 ASNPCQNDNKATLSPGTYCSGMTLKGNVTLSPGVYVVQGNLKINANAAITGS-------- 291 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 I S VK+ + + L G+ ++ D Sbjct: 292 -GVTIFMSGSSSVSING--NASVKLSAPTSGAYSGVLFYGDRTGNSASSTFNGTADSL 346 >gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 459 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 32/199 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--------RVE 264 D + + KM K + F S++ + V Y + R++ Sbjct: 160 DASGSMKADVSGGNKMMLAKETIKEFTSSLEDDASVSLMAYGHVGTGNDEDKAESCSRID 219 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + G + + T A+ +A ++L+ Sbjct: 220 EVFPL--GAYEKTAFNKSMDSFEASGWTPLAGAIDKARELLS----------------AY 261 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-T 381 S ++ + ++DG + + ++ + + + I K+ I + GQ+ LK Sbjct: 262 NSTDYKNTLYIVSDGVETC---DGDPVEAAQQLQGSNIEAKVNIIGFDVDDEGQKQLKEV 318 Query: 382 CVSSPEYHYNVVNADSLIH 400 + + V + D L Sbjct: 319 AEAGGGTYATVRDKDELED 337 >gi|145494949|ref|XP_001433468.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400586|emb|CAK66071.1| unnamed protein product [Paramecium tetraurelia] Length = 611 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 65/201 (32%), Gaps = 39/201 (19%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR-DMDS 286 + +K +LLL LD + ++ +I + ++ TEK +QY Sbjct: 207 INMVKKSLLLLLDFLGEQDRLQ------IITFNEHAQRLTPLKCLTEKNKQYFQAVISQI 260 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A A++ L R+ T+ + L+DG + + Sbjct: 261 SAEGLTKISSATYIAFKQLKEKVYRNNVTS----------------VFLLSDGHDGDALF 304 Query: 347 NVNTIKICDKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +I D+ + + I T + Q + Y V + L F + Sbjct: 305 -----EISDQIRHVKEVFTISTFGFGDDHDAQMMTSISNLKNGNFYYVKDITLLDEFFAH 359 Query: 405 --------ISQLM-VHRKYSV 416 I++ + + ++ Sbjct: 360 ALGGIVSVIAEQIQISLSLTL 380 >gi|288942712|ref|YP_003444952.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] gi|288898084|gb|ADC63920.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] Length = 1059 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/279 (10%), Positives = 78/279 (27%), Gaps = 31/279 (11%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 V + + L + + + + + LL M + I + ++ Sbjct: 46 VAPNVLFILDESGSMDDYNRMTDLKNSMKTLLDNPMMDHVQAAIMGYTTANTDIYYNNRG 105 Query: 210 DPEDVNSAPICQDKKRTKMAAL-----KNALLLFLDSIDL--LSHVKEDVYMGLIGYTTR 262 N K + + ++A+ +D + + + +G+ + +R Sbjct: 106 WNTYANVNGSLAIKLHSDFKTVGEGSNRSAMKAIVDGLSPRASTPTVWALKLGMEWFDSR 165 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ----ILTSDKKRSFFTNFF 318 + + + + +LT + S N++ Sbjct: 166 ATQIEGTPSDRTRATS---------PTYSSPMSDPAYWCVPNHLVLLTDGEPNSNDPNYY 216 Query: 319 RQGVKIPSLPFQKFIIFLTDGE------NNNFKSNVNTIKICDKAKENFIKIVTISINA- 371 + + G + +++ T + + + TI++ Sbjct: 217 GLTSYKGTTCVKDSTSIEEKGRCAAEIAAWGYNNDLRTDSVWED--NQNVTTHTIALGDS 274 Query: 372 -SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 N + LK +Y NA +L++ F+ I Sbjct: 275 LGSNDKAFLKNIAIKGGGGYYEADNASTLLNAFETIVDD 313 >gi|148656912|ref|YP_001277117.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569022|gb|ABQ91167.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 543 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 81/265 (30%), Gaps = 51/265 (19%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF------LIELVVDLSGSM 203 Q + V P I + I LVVD+SGSM Sbjct: 321 ANVDVPLADPISLAYGVQPQGVQSVLPTPPAEVIVAAKNAWSINRKRADILLVVDVSGSM 380 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + K+ A K+ L FL I +GLI ++T Sbjct: 381 --------------------EGEKLEAAKSGLGTFLSRILPED------RVGLIVFSTDA 414 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + P+ +E T A+ Q+ + Sbjct: 415 RVVVPPAPLSEARIALDDAIAQLNARGKTALYDALITGKQVFDD--------------LP 460 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 P + I+ L+DG +N ++ ++ +++ E I I ++ + + L + Sbjct: 461 PPDEERIRAIVLLSDGLDNASRTTLDQVRLA--FDETGISIFPVAYGSDADLAALEQIA- 517 Query: 384 SSPEYHYNVV-NADSLIHVFQNISQ 407 + V + + +F+N+S+ Sbjct: 518 -TFSRTIVVQGDTGDIGQIFENLSR 541 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLVGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIAMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLRNL 520 Query: 406 SQLMV 410 + Sbjct: 521 RSSIC 525 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLVGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ QN+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLQNL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 72/239 (30%), Gaps = 55/239 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +R + +VD SGSMH K+ + A+ + + ++ Sbjct: 236 VSRSQRKPAHLVFLVDTSGSMHSED-------------------KLPLAREAIKVAVKNL 276 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + ++ Y + P+ T+ + T M+ AY Sbjct: 277 NEND------TVAIVTYAGNTRDVLPPTPATDA-KSIHAALDSLTAGGGTAMGSGMELAY 329 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---E 359 + VK S ++ LTDG+ N + NV+ + D Sbjct: 330 R----------------HAVKKASGSVVSRVVVLTDGDANIGR-NVSANAMLDSIHKYTA 372 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN--------ISQLM 409 + + T+ L++ + V + VF+ I++ + Sbjct: 373 EGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFETQLTGTLEVIAKDV 431 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + I + K + F+ D + L + +GL+ Y + V+ Sbjct: 49 SSCENKRADLVFIIDSSRSVNTYDYAK--VKEFILDILQFLDIGPDVTRVGLLQYGSTVK 106 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 107 NEFSLKTFKRKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 154 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + I+ +TDG + + ++ KA+ I I I + + + K Sbjct: 155 RPLRENVPRIIMIVTDGRPQDSVA-----EVAAKARNTGILIFAIGVGQ-VDLNTV-KAI 207 Query: 383 VSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQN Sbjct: 208 GSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|320352629|ref|YP_004193968.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121131|gb|ADW16677.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 577 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 80/250 (32%), Gaps = 45/250 (18%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + A + P + ++D+SGSM K+ Sbjct: 179 RDYLLARIGLAAKDLAKEHLPPSNLVFLIDVSGSMQ-------------------DGNKL 219 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 LK AL L + + + L+ Y + P+ G ++ ++ + Sbjct: 220 PLLKQALPLVVRQLGARD------RVALVVYAGADSVVLPPTPG-DRQQEILAALDQLQA 272 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSN 347 T ++ ++ AY++ + +I +DG+ N S Sbjct: 273 GGSTHASSGIRTAYELARKSFIKGGNN----------------RVILASDGDFNVGVTSR 316 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ ++ +++ I + + + ++ +Y DSL+ + + + Sbjct: 317 DELTRLIEEERKDGIYLTVLGLGMGNYHDDTMEVLADKGNGNYA--YIDSLLEAKKVLVK 374 Query: 408 LMVHRKYSVI 417 M +++ Sbjct: 375 EMSGTLFALA 384 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/296 (9%), Positives = 69/296 (23%), Gaps = 27/296 (9%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ I L++ L + G +I + K ++SA ++ LA A ++ L Sbjct: 7 KKERGVILPIVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQELDGAADAL 66 Query: 76 G-DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-- 132 ++ + + S +++ + N+ H+ Sbjct: 67 TRATSAGLTAGNANKVQYQKASASLIDTDVTFSDSLTGAFSSTFTPVANARYAKCGHLTT 126 Query: 133 -----------ANNRLDSSNNTIFYNMDV----MTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + + + + + Sbjct: 127 GILAYLIQMVGGPTSNAVAAIGVATRTHAQSTCPIPVGLLPRTGGTAPDYGFAVGDWVTV 186 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 G + +D GS + + V + + K A+ Sbjct: 187 LYDGTKTAGPGEMGW--YNLD--GSTNANETKNEMSVGYC-NSKVNDTLRTPGAKVAVDD 241 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 +S + D+ +T + + T+ Sbjct: 242 QWNSRFGIYKNNGDMQTMRPDFTGYSYT----TQNWPNGKSAYNGTSGGSDPTATN 293 >gi|284052945|ref|ZP_06383155.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569123|dbj|BAI91395.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 489 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 60/191 (31%), Gaps = 33/191 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +K+ ++ A F+ + + ++ +++R + + +++Q Sbjct: 60 TSGSMSGSKLPEVQRAASEFVSR-----QNLKRDDLAVVEFSSRASVVADFTRDERELQQ 114 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R T+ + A +L + + I+ TD Sbjct: 115 AIARLS---AWGGTNLSEGFNLATSVLQNSDR-------------------PGNILLFTD 152 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE NN + I + + + I + +++ L P+ + N L Sbjct: 153 GEPNNRR---MAASIAQQIRASGINL--VAVGTGDAPVNYLTALTGDPDLVFYA-NFGDL 206 Query: 399 IHVFQNISQLM 409 F+ + + Sbjct: 207 DSAFRGAEKAI 217 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 78/255 (30%), Gaps = 54/255 (21%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + I+ + + R I + +D+S SM A + ++ Sbjct: 66 SLLIIGAARPADSVRV-PRDRATIIIALDISLSMEAADV---------------QPNRIT 109 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A K A F++ + E +G++ + + P+ + V + Sbjct: 110 AAKEAAQKFVEDL------PERFNVGVVAFARSASVVVSPTTDHQAVSASLGNLT---TR 160 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T A+ + + S +++ + P I+ L+DG+N + +S Sbjct: 161 AGTAIGEAVFNSLDAVRSFDQQAV------------TDPPPAAIVLLSDGDNTSGRSVAE 208 Query: 350 TIKICDKAKENFIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVNA 395 + A+ + I TI+ N L + Y + Sbjct: 209 AVDAAMSAR---VPISTIAYGTQEGTVSIDGRDVNVPVNKATLQTLSEGTSGRAYEAESG 265 Query: 396 DSLIHVFQNISQLMV 410 L V++ I + Sbjct: 266 SQLREVYEQIGTSLG 280 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 36/260 (13%), Positives = 77/260 (29%), Gaps = 35/260 (13%) Query: 163 HLLNQRYNQKIVSF-IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + +++ + ++ +R L V L+ + + S+P D+ Sbjct: 3 SMRWDISQKEVTHMGVTNSNKVINTDRIPCDGTLRVTLALTAAPDIVSNPTDIVLVLDRS 62 Query: 222 DK-KRTKMAALKNALLLFLDSIDLLSHVKED------VYMGLIGYTTRVEKNIEPSWGTE 274 T +A +K F+D ID + +D MG++ ++ + + + Sbjct: 63 GSMTGTPLADMKLGAKTFIDLIDEATDSSQDGQIGSGSRMGVVSFSNTAVADTQLITSVD 122 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ V T+ A +A Q+ + K ++ Sbjct: 123 ALKAAVDNLS---AGGSTNHADAFAKAIQLFDPASANA------------------KVMV 161 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPE--YHYN 391 TDG I I I I + L + P+ + Sbjct: 162 MFTDGNTTIGAPPAPVAAAARA---QGIIIYCIGLIGSDGLDITALNDWATDPDASHVAV 218 Query: 392 VVNADSLIHVFQNISQLMVH 411 NA L +F ++ + Sbjct: 219 TPNAADLEELFAELAANISK 238 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 40/350 (11%), Positives = 89/350 (25%), Gaps = 81/350 (23%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-------------------- 179 S + + + + N + Sbjct: 111 STSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPESERYRYEEIYAGNLSETEDQEYYT 170 Query: 180 ---LLRIEMGERPIFLIELVV------DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 G++ +F E + + ++ TK+ A Sbjct: 171 KDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAEDSNPNHIQFYERRDLGTNITKLQA 230 Query: 231 LKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVT 281 L+NA+ F++ D + +K + L+ + + NI ++ ++ + Sbjct: 231 LQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNYNRSQIVTELK 290 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I T + ++ A F G QK ++F TDG+ Sbjct: 291 SYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGG---TREGAQKVVVFFTDGQP 347 Query: 342 NNFKSNVNTIKIC-----DKAKENFIKIVTISINASPNGQRLLKTC-------------- 382 + N++ + K+ I +I + N + Sbjct: 348 TSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNY 407 Query: 383 ---------------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + +Y+ +AD L ++F IS + Sbjct: 408 PDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDLET 457 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 71/199 (35%), Gaps = 27/199 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + R + ++A F+ + L+ + T+ E + TEK+ Sbjct: 93 IDTSNSIRDRFKFEQDAASEFIKGVVHA----NQDKAMLVSFDTKAELVSDLIGDTEKLD 148 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T A+ A + + P F++ I+ ++ Sbjct: 149 HAIRSL---RPGGGTALYDAIFFACR--------------DKLSQDQPKHKFRRAIVIVS 191 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINAS---PNGQRLLKT-CVSSPEYHYNVV 393 DG++N + + + A++ + + +IS N S +G ++LK + + Sbjct: 192 DGDDNQSQYTR--DQALEMAQKADVVLYSISTNISKIESDGDKVLKYYAAETGGKAFFPF 249 Query: 394 NADSLIHVFQNISQLMVHR 412 + L F+NI+ + H+ Sbjct: 250 KVEDLEQSFENIANELRHQ 268 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 40/350 (11%), Positives = 89/350 (25%), Gaps = 81/350 (23%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-------------------- 179 S + + + + N + Sbjct: 111 STSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPESERYRYEEIYAGNLSETEDQEYYT 170 Query: 180 ---LLRIEMGERPIFLIELVV------DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 G++ +F E + + ++ TK+ A Sbjct: 171 KDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAEDSNPNHIQFYERRDLGTNITKLQA 230 Query: 231 LKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVT 281 L+NA+ F++ D + +K + L+ + + NI ++ ++ + Sbjct: 231 LQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNYNRSQIVTELK 290 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I T + ++ A F G QK ++F TDG+ Sbjct: 291 SYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGG---TREGAQKVVVFFTDGQP 347 Query: 342 NNFKSNVNTIKIC-----DKAKENFIKIVTISINASPNGQRLLKTC-------------- 382 + N++ + K+ I +I + N + Sbjct: 348 TSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNY 407 Query: 383 ---------------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + +Y+ +AD L ++F IS + Sbjct: 408 PDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDLET 457 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 67/220 (30%), Gaps = 52/220 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + ++ K A+ D + +GLI + + + + Sbjct: 97 DVSNSMLAEDFEPNRLERTKYAINKLFDGL-------HQDRVGLIVFAGEPKVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + R + ++ T A+ QA + + Sbjct: 150 YRMAKAFAKRIDPSLVPVQGTAIGKALSQALMSFS----------------GETEENHSR 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + + I+I TI I Sbjct: 194 VVILITDGENHEDDALAAARHAAE----MGIRIYTIGIGTPEGAPIQIGGEFIKDEKGDM 249 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ 407 N + L + + + L + ++I++ Sbjct: 250 VVSKLNEEMLAQIADITGGAYVRSSKQSIGLDEIVKSINE 289 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ + I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 320 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 376 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 377 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 426 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 427 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 479 Query: 406 SQLMV 410 + Sbjct: 480 RGSIC 484 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ + I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 361 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 417 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 418 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 467 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 468 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 520 Query: 406 SQLMV 410 + Sbjct: 521 RGSIC 525 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 3 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 61 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 62 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 106 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ + I+I + + + G L+ S P E+ + V + D + Sbjct: 107 AQARASGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 153 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 268 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 324 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 325 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 374 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 375 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 427 Query: 406 SQLMV 410 + Sbjct: 428 RGSIC 432 >gi|239617869|ref|YP_002941191.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] gi|239506700|gb|ACR80187.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] Length = 1706 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 33/186 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M K+A FLD + E+ + LI + T +E + E++++ + Sbjct: 310 MEKAKDAASYFLDLL------PENSELALIAFDTEIEVLKNFTRDREQLKRALAIIK--- 360 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + + ++L+ +F+I +TDG + N+ Sbjct: 361 ARGATPLYDTVAKGIELLSERSG-------------------PRFLILVTDGVDANYGDT 401 Query: 348 VN-----TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A+EN + I I + + L S+ D+L F Sbjct: 402 APGSEKTLSEVIRLARENNVVIFAIGLGTRIDEFSLGTLARSTGGMFLKSPTIDNLKTAF 461 Query: 403 QNISQL 408 ++ + Sbjct: 462 NSLLET 467 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 65/224 (29%), Gaps = 45/224 (20%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 +R + I+ VD+SGSM + + A A+ FLD Sbjct: 91 QQKRSLTNIQFAVDVSGSM---------------LAPFGDGNRYDASMKAIDTFLDF--- 132 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQA 301 ++ GL + + + +R ++ + T A+ Sbjct: 133 ----RKGDAFGLTFFGDAFVHWVPLTTDVTAIRCSPPFMRPETVPPPFGGTAIAKALNGC 188 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L K I+ +TDG + + N +I Sbjct: 189 KTEL------------------RRRDEGDKMIVLITDGFSYDLTGND--EEIARTLSAEG 228 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + I + ++ C + + + D+L VF+ I Sbjct: 229 VAVFCIIVGGFEPQAEIVNICRLTGGEAFRADDPDALPAVFKKI 272 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 40/350 (11%), Positives = 89/350 (25%), Gaps = 81/350 (23%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-------------------- 179 S + + + + N + Sbjct: 113 STSNTVKTVSRPLDIVLVVDTSGSMENSPHMGTRPESERYRYEEIYAGNLSETEDQEYYT 172 Query: 180 ---LLRIEMGERPIFLIELVV------DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 G++ +F E + + ++ TK+ A Sbjct: 173 KDGGKITSEGQKILFWWEFTHWELNGQTVEPKISAEDSNPNHIQFYERRDLGTNITKLQA 232 Query: 231 LKNALLLFLDSI----DLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVT 281 L+NA+ F++ D + +K + L+ + + NI ++ ++ + Sbjct: 233 LQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNYNRSQIVTELK 292 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I T + ++ A F G QK ++F TDG+ Sbjct: 293 SYTTKNISDLTSTVNSLIAAGATRADFGLNQAQRAFQLGG---TREGAQKVVVFFTDGQP 349 Query: 342 NNFKSNVNTIKIC-----DKAKENFIKIVTISINASPNGQRLLKTC-------------- 382 + N++ + K+ I +I + N + Sbjct: 350 TSNSDWSNSVAAAAITNAKELKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNY 409 Query: 383 ---------------------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + +Y+ +AD L ++F IS + Sbjct: 410 PDATATSSTRPNRYSCTLGKRTDNSDYYKAATDADELNNIFNEISSDLET 459 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 44/138 (31%), Gaps = 12/138 (8%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 S+K + FA+ + + + V ++K + + +AA LAGA ++ Sbjct: 6 LFKSKKGTTFVFFAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQEL----- 60 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++++ + + ES + + ++ + Sbjct: 61 -------VTNVSNTNNVVNNYIAKNNSGLNESTIIVNESERSVKVTSSKTVENNFAKVFG 113 Query: 134 NNRLDSSNNTIFYNMDVM 151 NN D ++ Sbjct: 114 NNSQDVIATAKAKVENIS 131 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 71/232 (30%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V I P + ++D+SGSM + Sbjct: 226 NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSMW-------------------GANR 266 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y +E + G+ K ++ + Sbjct: 267 LDLVKSSLKLLVNNLRDKD------KVAIVTYAGNAGVKLEATPGSNK-QKIREAIDELE 319 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY+I + II TDG+ N S Sbjct: 320 ASGSTAGGEGIMLAYKIAQKNFISGGNN----------------RIILCTDGDFNVGVSS 363 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ I + + ++ H + N Sbjct: 364 DKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQE 415 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 66/223 (29%), Gaps = 57/223 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K L +D + MGLI + + + Sbjct: 97 DISNSMMAQDIAPSRLDYAKMLLSQLIDRLTD-------DKMGLIVFAGDAFIQMPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ D + + T A+ A + K+ + Sbjct: 150 KVSAKMFLKTIQPDLIQRQGTAIGSAIDLAVKSFNDTKQSGG-----------------R 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I LTD EN+ + ++ A++ I + + I Sbjct: 193 AIFLLTDAENHEDNA----VEAAKMARDKNITVNVVGIGTPEGSPIPVKGTMSYIKDKDG 248 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + + + N + I +++ + Sbjct: 249 NVVVSKLNEELGKQIAAAGNGIYVRASNTGATIKA---LAKEI 288 >gi|329851995|ref|ZP_08266676.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] gi|328839844|gb|EGF89417.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] Length = 399 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 39/140 (27%), Gaps = 9/140 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R F + + N ++I AL L +I I + S+++A +A L GA+ Sbjct: 1 MRRQFVQWAHCQGGNVAVISALCAAPLLYVITATIDHSAMVKDRFSLQAAADAGALMGAA 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 K+ L A ++ + + Sbjct: 61 KLALGSDEL---------VFGAAEAAAHQQIGDLRNPVTFEVVVDRSTGAVTVTGRSQHA 111 Query: 127 ISMTHMANNRLDSSNNTIFY 146 + M++ S Sbjct: 112 PLIGFMSDGPTPISARATAE 131 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 39/285 (13%), Positives = 78/285 (27%), Gaps = 49/285 (17%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 LQ + N + +S L +G + DV+ Sbjct: 93 LNLQTSTSMSNVTEMKTNMSLGL-TLTRNVGTGGFLTCGPLWAQQCGSQYYTTGVCSDVS 151 Query: 216 SAPICQDKK-------RTKMAAL-----------KNALLLFLDSI-DLLSHVKEDVYMGL 256 + + + + +A+ FL+ L MGL Sbjct: 152 PDFQLRTSFAPAVQTCPSFIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQMGL 211 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFF 314 I Y + K + + T++ A++ A S Sbjct: 212 IQYANNPRVVFNLNTFKSKDE-MIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAG--- 267 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++ +TDGE+++ I C+ ++ I I++ N Sbjct: 268 ----------GRPGATKVMVVVTDGESHDGSKLKAVIDQCN---KDNILRFGIAVLGYLN 314 Query: 375 GQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + + +NV + L+ I + + Sbjct: 315 RNALDTKNLIKEIKAIASIPTERHFFNVSDEADLLEKAGTIGEQI 359 >gi|329849361|ref|ZP_08264207.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841272|gb|EGF90842.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 505 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 77/252 (30%), Gaps = 49/252 (19%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N +++ + ERP + ++D+SGSM K Sbjct: 130 NPNSRLLRVGLRAYDVPRSERPAANLVFLIDVSGSMDEKD-------------------K 170 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++AL L D + V ++ Y +EP+ +VR+ + + Sbjct: 171 LPLVQHALRLVADDMRPRDRVS------IVVYAGAAGIVLEPTANPAQVRRALGQLK--- 221 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + AY + +I TDG+ N S+ Sbjct: 222 AGGSTAGGEGIALAYATARAAYIDGGIN----------------RVILATDGDFNVGISD 265 Query: 348 VNT-IKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + K K++ I + + + L++ + + +A + + Sbjct: 266 PEAIKDLVRKNKDDGITLTALGFGTGNYNEALMEGIADVGNGNYAYIDSASE---ARKVL 322 Query: 406 SQLMVHRKYSVI 417 + ++V Sbjct: 323 DDELSSTLFTVA 334 >gi|297622708|ref|YP_003704142.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297163888|gb|ADI13599.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 329 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 72/290 (24%), Gaps = 44/290 (15%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 N L+ S + R A + +++ + + Sbjct: 51 GNVALNVSALDSAGEPIAGRLSNPRATVTSVTPLPGLAAAQRYTATATITVDITVQEVIN 110 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 L +D SGSM +A FL+ + + Sbjct: 111 AVLNMDRSGSMRLNDPERLR-------------------VDAAKSFLERVTPEDRIAVME 151 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + G + + + R T ++ + Sbjct: 152 FPGQSSGFRASTLLQGFTSDKALLEAALDRVGQ---RGNTPIWDSLLDTLDL-------- 200 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + ++ TDGE + + A E+ +++ TI + + Sbjct: 201 -------HAADEGGQGASRVVLLFTDGEREGGQV--AFGEALAAALESDVRVFTIGLGSD 251 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM-----VHRKYSVI 417 + L + + NV +A L +FQ + + S I Sbjct: 252 IDTAELQELAAETGGTFANVASAAELEELFQRAFNAIRASGTITLSISPI 301 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 71/208 (34%), Gaps = 31/208 (14%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + I + K + F+ D + L + +GL+ Y + ++ Sbjct: 85 SSCENKQADLVFIIDSSRSVNTHDYAK--VKEFILDILQFLDISPDLTRVGLLQYGSTIK 142 Query: 265 KNIEPSWGT--EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +V + V R T + A++ A I S+ +G Sbjct: 143 NEFSLKTFKKKSEVERAVKRMRHLST--GTMTGLAIQYALNIAFSEA----------EGA 190 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKT 381 + + I+ +TDG + + ++ KA++ I I I + N LK Sbjct: 191 RPLRENVLRVIMIVTDGRPQDSVA-----EVAAKARDTGILIFAIGVGQVDLNT---LKA 242 Query: 382 CVSSP--EYHYNVVNADS---LIHVFQN 404 S P ++ + V N L VFQ Sbjct: 243 IGSEPHEDHVFLVANFSQMESLTSVFQK 270 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 63/186 (33%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V + Sbjct: 707 FEIVKQFVAGIIDSLA---VSPKAARVGLLQYSTQVRTEFTLGDFSSARDMKKAVAHMKY 763 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 764 MGKGSMTGLALKHMFERSFN----------PVEGARPVSPGVSRVAIVFTDGRAQDDVS- 812 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 +A+ I + + + + + L+ S + ++ + + ++ + + + Sbjct: 813 ----AWARRARAGGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISEKL 866 Query: 406 SQLMVH 411 + + Sbjct: 867 KKGICE 872 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 51/142 (35%), Gaps = 27/142 (19%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 AA K L FLD +E +GL+ + R + VR + + L Sbjct: 17 AAAKAVLSDFLDR-------REGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLA 69 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A+ + + L K+ Q+ ++ LTDG N N Sbjct: 70 GRETAIGDAIALSVKRLREQKQG------------------QRVVVLLTDGVNTAGVLN- 110 Query: 349 NTIKICDKAKENFIKIVTISIN 370 +K + AK +++ TI+ Sbjct: 111 -PLKAAELAKAEGVRVHTIAFG 131 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 42/418 (10%), Positives = 111/418 (26%), Gaps = 49/418 (11%) Query: 18 EKAN----FSIIFALSVMSFLLLI----GFLIYVLDWHY-KKNSMESANNAAILAGASKM 68 ++ N + + AL +++ L G + + + A+ + Sbjct: 20 QRGNGIPSATNMAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQEASQR---QAEMRD 76 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + + + + +++ L+ + +S Sbjct: 77 AAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAVIPLAQNKFEQQVQNGIMVAGEIPVS 136 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYR-LQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + ++ + + Y + P + E+ Sbjct: 137 TFSIDVDTGSYATLRRMLREGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAP 196 Query: 188 RPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPICQDKKRTKMAALKNALLLFLDSID 243 P + L + L G + K+ L+ AL L + Sbjct: 197 SPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLLTAQLS 256 Query: 244 LLSHVKEDVYMGL---IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 V VY G + ++ T+ + Q Sbjct: 257 AQDKVSIVVYAGAAGVVLDGASGNDTQTLTY----------ALEQLSAGGSTNGGQGITQ 306 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKE 359 AYQ+ + +I TDG+ N ++ + I + +K K+ Sbjct: 307 AYQLAKKHFIPNGIN----------------RVILATDGDFNVGVTDFDDLIALIEKEKD 350 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + I + T+ +L++ +Y ++ L + + + +++ Sbjct: 351 HGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT--LNEARKVLVDELSSTLFTIA 406 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 14/151 (9%), Positives = 39/151 (25%), Gaps = 12/151 (7%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M + + + ++ +++ + + + + K+ + + +AA Sbjct: 1 MQKIKLLKNLLGNFLNNQNGLAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 LAGA ++ N + K D+ S + Sbjct: 61 ALAGAQELPDNP--AQAVQVARDYAGKNGFAPDSLNI----------SISGDNRTISVAG 108 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + I + S + + Sbjct: 109 REVVNTIFARVLGIYSKTVSAGSSASVQGLT 139 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 71/214 (33%), Gaps = 35/214 (16%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 V D SGSM + P ++ AL + FL+ D S + + Sbjct: 334 FVNDKSGSMGGSDAR--------PTSSKYSNDRLGALFESCEKFLEVRDGSSDLVSCIMY 385 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 Y S V+ + T T AM+ +++S Sbjct: 386 DHSAYNC--FTTNPLS------TSLVSTMSSYVAGGGTSFTNAMQSVSSLISS------- 430 Query: 315 TNFFRQGVKIPSLPFQKFIIF-LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 P+ K ++ ++DGE++ ++ T ++ + I + TI + S Sbjct: 431 --------TYPNHQSYKIVVLFMSDGEDSADEAVSITGQLVS---SHDIILHTIQLGGSS 479 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L + + ++ SL ++Q I+ Sbjct: 480 DNTGLRQMAATGRGQFKRANDSASLAGIYQEIAN 513 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 65/174 (37%), Gaps = 27/174 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDST 295 + L+ +++ +G++ + + E + ++ Q +T ++ T Sbjct: 330 NQAAELYLIQIIEKGSLVGMVTFDSSAEIQNNLTKIIDENTYQKITANLPQKPSGGTSIC 389 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 +K +Q ++ + + + II LTDGE+N + C Sbjct: 390 GGLKAGFQAISQSNQSTSGSE----------------IILLTDGEDN-------QMSSCF 426 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQ 407 ++ K++ I TI++ S + + L + + + + LI F IS Sbjct: 427 EEVKQSGAIIHTIALGPSADRE-LETLSNMTRGRRFYAHKDINGLIDAFSRISS 479 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 67/243 (27%), Gaps = 64/243 (26%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + ++ I V+D+S SM + +++ A K A+ + + Sbjct: 82 QEKVYTAKGSEILFVLDVSPSMAAKDIAG--------------MSRLEAAKQAVRVIVPE 127 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 GL+ + + P+ E + + + + A Sbjct: 128 AG-------GTAFGLVALASEAALMVPPTLDREAFFARLNSLQAGELGDGSAIGMGVSTA 180 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L S +K I+ +TDGENN + A EN Sbjct: 181 AYHLISSAA------------------PKKSIVLITDGENNAGSV--HPGTAAQLAFENG 220 Query: 362 IKIVTISIN---------ASP--------------NGQRLLKTCVSSPEYHYNVVNADSL 398 I + + + P + L + +++ ++ V + L Sbjct: 221 ITLYVLGVGTRGSVPLEYVDPATGKTYSGYLDSRFDESPLQEIALTAGGRYFGVESMGEL 280 Query: 399 IHV 401 Sbjct: 281 TAA 283 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 73/216 (33%), Gaps = 54/216 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + D + +GLI + + + Sbjct: 98 DISNSMLAQDVVPSRLDKSKLLIEDLFRIFD-------NDKVGLIVFAGDAFVQLPITSD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A+ A + K K Sbjct: 151 FISAKMFLDNINPSLIGTQGTDIGQAINLAMHSFSPTSKSG------------------K 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL------------ 379 I+ +TDGE+N ++ ++ KA++ I++ + + ++ + L Sbjct: 193 AIVVITDGEDNEGRA----EEMASKAQKAGIQVYILGVGSTSGAEIPLGNGEMLKDNSGN 248 Query: 380 ------------KTCVSSPEYHYNVVNADSLIHVFQ 403 K + + +V N + ++ + Sbjct: 249 VVRTHLNENMCKKIAAAGKGVYIHVDNNNDAQNILK 284 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 57/189 (30%), Gaps = 29/189 (15%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + K L ++ + +G++ Y+TR + K + Sbjct: 13 RANRFKMCLNFINKLVNSFHIGPHNTRIGIVRYSTRPSGIFRFTSYRNKHSTKHRVNRIR 72 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+ A + L + +K +I +TDG++ + Sbjct: 73 YTGGWTRTGAAINYARRYLYQHNR---------------RRGVRKVLIVMTDGKSQDS-- 115 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + K I++ I I L + + + L F+++ Sbjct: 116 ---VVGASRSVKRMGIEVFAIGIGRGYRRSELNQM-ATDRNHV--------LTARFRDLH 163 Query: 407 QLMVHRKYS 415 +++ KY Sbjct: 164 KIIGKIKYR 172 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 69/190 (36%), Gaps = 26/190 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + + A + + + L ++ +GL+ Y + V+ K + + Sbjct: 59 SSRSVRPADFEKVKEFLITMLKFLDIGPDNTRVGLLQYGSTVKNEFSLKTYKRKPDIERA 118 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V R M T + A++ A I S+ +G + + + + +TDG Sbjct: 119 VKRMMHLAT--GTMTGLAIQYAMNIAFSEA----------EGARPLNQYVPRIAMIVTDG 166 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS 397 + + +I KA+ + I I I + LKT S P E+ + V N Sbjct: 167 RPQDPVA-----EIAAKARNSGILIFAIGVGRVDMS--TLKTIGSQPHSEHVFLVANFSQ 219 Query: 398 LIH---VFQN 404 + VFQN Sbjct: 220 IETLTSVFQN 229 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 69/186 (37%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + LDS+ ++ +GLI Y+T V + + Sbjct: 587 FEIVKQFVKGILDSL---EISQKAARVGLIQYSTHVRTEFTMAQYSSAKDVKKAVSQIKY 643 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + + + A+K ++ S+ + G + + + I TDG + Sbjct: 644 MGRGSMTGLALKLMHEKSFSEAQ----------GARARPMRVPRVAIVFTDGRAQD---- 689 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + +KAK++ I I I I + + + L+ S+P ++ + ++ ++ + + Sbjct: 690 -EVSEYAEKAKQSGITIYAIGIGKAIDEE--LQEIASAPQEKHVIYAEDFSAMGYIMEKL 746 Query: 406 SQLMVH 411 M Sbjct: 747 KSSMCE 752 >gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] Length = 540 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 70/241 (29%), Gaps = 51/241 (21%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 ++ I + E + V+D SGSM ++M K+ Sbjct: 53 IALIAPKTEVSASEIAAKRVTFVIDTSGSMQ--------------------GSRMQIAKD 92 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKP 291 AL + ++ ++ ++T VE S E +++ V + Sbjct: 93 ALKYCVTRLNPQDTFN------VVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGG 146 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ + Q ++F+TDG+ +++ I Sbjct: 147 TAIDEALVRGLQ------------------DNDGKSSAPHLLMFITDGQPTIGETDEGAI 188 Query: 352 KI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS----LIHVFQNIS 406 ++ ++ T + N + L + V + + + +S Sbjct: 189 AQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKVS 248 Query: 407 Q 407 Sbjct: 249 N 249 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/282 (13%), Positives = 76/282 (26%), Gaps = 46/282 (16%) Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 S ++ + L+ S ++T + +L + Sbjct: 5 YLLSALAFLSCSTFTLNVSAMPTDNEAKIITKAELSAPI---ILENTQENNYLKISLTGF 61 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + E+ + LV+D SGSM ++ + A ++ ++ Sbjct: 62 KQDETEKSPINLSLVIDRSGSM--------------------SGDRIEKAREAAIMAINM 101 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + V +I Y+ I + K + T + + Sbjct: 102 LKDDDIVS------VIAYSDNAYLIIPATKVKNKNEMIKIINDTIKPGGSTALFAGVSKG 155 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV-NTIKICDKAKEN 360 + K II L+DG+ N S + A + Sbjct: 156 I----------------TEVNKFIKKNQVNRIILLSDGQANIGPSTTKELADLGQVAGKQ 199 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + TI + N + S H V N+ L F Sbjct: 200 GIAVTTIGLGNGYNEDLMTALAGFSDGNHAYVENSADLETAF 241 >gi|170741515|ref|YP_001770170.1| hypothetical protein M446_3336 [Methylobacterium sp. 4-46] gi|168195789|gb|ACA17736.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 407 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 18/214 (8%), Positives = 62/214 (28%), Gaps = 11/214 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + N + IF L+++ + L+G + + ++ +++ + A L+ + + Sbjct: 9 LSAFRTNVSGNVAAIFGLTLLPLMFLLGSSVDLSRTLRERARLQALTDQAALSAVTSSSA 68 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + D +++ + D +R + + + + N + Sbjct: 69 S----QDPADTVQTFFPQPSDPDQRRLAP------TATVTVEGSTVTVTSTQNVATAFTG 118 Query: 131 HMANNRLDSSNNTIFYNMDVMTSY-DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + ++ + + + L V ++ Sbjct: 119 ILGVAQIPITARSTAAPGNDGPPVCVLALNPTATDAILFSGNATVVASNCVIYSNSSAAN 178 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + + +CA+ ++ P Q Sbjct: 179 ALTRQGSASVQATGYCAVGGTNLPTSTTPRPQSG 212 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 33/267 (12%), Positives = 76/267 (28%), Gaps = 33/267 (12%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L + + + R G L D D Sbjct: 14 VLLLDLCDSYDLNNPS--YILAQSYAQRRNNGFPHRTLNPAATDSQCRSRPLDLVFIIDS 71 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + ++ K+ D +D L + + ++ Y + V+ Sbjct: 72 SRSVRPGEFEKVKI--------FLADMVDTLDVGPDATRVAVVNYASTVKIEFLLKSHLT 123 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K + T + A+K+A + G + S K I Sbjct: 124 KDTIKQAITRIEPLAAGTMTGMAIKKAMD----------EAFTEKSGARPKSKNISKVAI 173 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHY 390 +TDG + ++ A+ + I+I + + + + LK S+P ++ + Sbjct: 174 IVTDGRPQDQ-----VEEVSAAARASGIEIYAVGV----DRADMRSLKLMASNPLEDHVF 224 Query: 391 NVVNADSLIHVFQNISQLMVHRKYSVI 417 V + + + + K ++ Sbjct: 225 YVETYGVIEKLTSKFRETLCEVKVEIV 251 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/285 (13%), Positives = 78/285 (27%), Gaps = 49/285 (17%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 LQ + N + +S L +G + DV+ Sbjct: 84 LNLQTSTSMSNVTEMKTNMSLGL-TLTRNVGTGGFLTCGPLWAQQCGSQYYTTGVCSDVS 142 Query: 216 SAPICQDKK-------RTKMAAL-----------KNALLLFLDSI-DLLSHVKEDVYMGL 256 + + + + +A+ FL+ L MGL Sbjct: 143 PDFQLRTSFAPAVQTCPSFIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQMGL 202 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFF 314 I Y + K + + T++ A++ A S Sbjct: 203 IQYANNPRVVFNLNTFKSKDE-MIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAG--- 258 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++ +TDGE+++ I C+ ++ I I++ N Sbjct: 259 ----------GRPGATKVMVVVTDGESHDGSKLKAVIDQCN---KDNILRFGIAVLGYLN 305 Query: 375 GQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + + +NV + L+ I + + Sbjct: 306 RNALDTKNLIKEIKAIASIPTERHFFNVSDEADLLEKAGTIGEQI 350 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 72/224 (32%), Gaps = 27/224 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM+ + + PE A T++ LK ++ + +D + Sbjct: 45 PIDVVAVLDVSGSMNDPVAASPESNLQA--------TRLDVLKASMKFIIRKLDDGDRLS 96 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + G + + R +++A +IL + Sbjct: 97 IVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGSGS-ALMLELQEAVKILDERQ 155 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S F I+ LTDG++ + K + T ++ Sbjct: 156 GNSRNRVGF--------------ILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTFAL 198 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 A+ + + LL S + V + + L + ++ + K Sbjct: 199 GAAHDPEALLHIAQESRGTYSFVDDGN-LDKIAGALAVCLGGLK 241 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V I P + ++D+SGSM + Sbjct: 226 NADHRLVRIGLKAKEIPTDNLPASNLVFLIDVSGSMW-------------------GANR 266 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y + + + G++K ++ + Sbjct: 267 LDLVKSSLKLLVNNLRDKD------KVAIVTYAGNAGEKLASTPGSDK-QKIREAIDELE 319 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY+I + II TDG+ N S Sbjct: 320 ASGSTAGGEGIMLAYKIAQKNFISGGNN----------------RIILCTDGDFNVGVSS 363 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ I + + ++ H + N Sbjct: 364 DKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLSEKGNGNHAYIDNLQE 415 >gi|219850571|ref|YP_002465004.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544830|gb|ACL26568.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 58/183 (31%), Gaps = 29/183 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVT 281 + ++ +K A + LD + ++ L+ + R E + + ++++ ++ Sbjct: 57 RGERLQQVKQAAMQILDLLG------DNESFALVTFNDRAEVVVSSQLARARAEIKRQIS 110 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T+ + Q L ++ LTDG Sbjct: 111 AI---EAAGGTEMATGLALGVQELQRAMM----------------PRAIHRLLLLTDGRT 151 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +S ++I +A+ I I + I + N L + + +A + + Sbjct: 152 YGDES--RCVEIARRAQARGIGITALGIGSEWNEDLLETIAARENSRTHYITSAADITKI 209 Query: 402 FQN 404 F Sbjct: 210 FTA 212 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 76/254 (29%), Gaps = 54/254 (21%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 R ++ ++ ER + +D+SGSM+ Sbjct: 122 NRQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMN-------------------MEN 162 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ +K +L + ++ +D + ++ Y +EP+ ++ R + Sbjct: 163 RLELVKRSLAMLVEKLDSRD------TLAIVVYGDTARTVLEPTRIMDRSR-ILEAINAL 215 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 T+ ++ AY I +I +DG NN Sbjct: 216 HPEGSTNVQAGLQVAYAI----------------AASQVREGATSRVILCSDGVANNGIT 259 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVF- 402 +++ + A + +++ T+ L++ + V +F Sbjct: 260 QADSIFQSVKAYA-QQGVRLTTVGFGMGNYNDELMERLSHVGDGQYAYVDALPEARRIFI 318 Query: 403 -------QNISQLM 409 Q I++ + Sbjct: 319 EQFTGTLQLIARDV 332 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 42/404 (10%), Positives = 108/404 (26%), Gaps = 39/404 (9%) Query: 25 IFALSVMS--FLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESI 82 + AL +++ G V K+ + + A+ + + + Sbjct: 31 MAALLLVAVSLTACGGKGAEVQHRQAKQQAEQRHQEASQRQAEMRDAAKVEMARVAAPMQ 90 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + + +++ L+ + +S + + + Sbjct: 91 MSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFSIDVDTGSYATL 150 Query: 143 TIFYNMDVMTSYDYR-LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL--IELVVDL 199 + ++ + + Y + P + E+ P + L + L Sbjct: 151 RRMLREGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGL 210 Query: 200 SGSMHCAMNSDPED--VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL- 256 G + K+ L+ AL L + V VY G Sbjct: 211 KGYDLPKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAA 270 Query: 257 --IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + ++ T+ + QAYQ+ + Sbjct: 271 GVVLDGASGNDTQTLTY----------ALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGI 320 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASP 373 +I TDG+ N ++ + I + +K K++ I + T+ Sbjct: 321 N----------------RVILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGN 364 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 +L++ +Y ++ L + + + +++ Sbjct: 365 YNDQLMEQLADKGNGNYAYIDT--LNEARKVLVDELSSTLFTIA 406 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 195 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL-GAFSRREDMEGAIRALVPLAQGTMT 253 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 254 GLAIQYAMNVAFS----------VAEGARPPEARVPRVAVIVTDGRPQD-----RVAEVA 298 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 299 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 345 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 501 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 557 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 558 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDVSV 607 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 608 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPDFSTMTHLLENL 660 Query: 406 SQLMV 410 + Sbjct: 661 KGSIC 665 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 76/254 (29%), Gaps = 54/254 (21%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 R ++ ++ ER + +D+SGSM+ Sbjct: 170 NRQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMN-------------------MEN 210 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ +K +L + ++ +D + ++ Y +EP+ ++ R + Sbjct: 211 RLELVKRSLAMLVEKLDSRD------TLAIVVYGDTARTVLEPTRIMDRSR-ILEAINAL 263 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 T+ ++ AY I +I +DG NN Sbjct: 264 HPEGSTNVQAGLQVAYAI----------------AASQVREGATSRVILCSDGVANNGIT 307 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVF- 402 +++ + A + +++ T+ L++ + V +F Sbjct: 308 QADSIFQSVKAYA-QQGVRLTTVGFGMGNYNDELMERLSHVGDGQYAYVDALPEARRIFI 366 Query: 403 -------QNISQLM 409 Q I++ + Sbjct: 367 EQFTGTLQLIARDV 380 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 55/160 (34%), Gaps = 25/160 (15%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + + ++ + R+E + + K V D + T + + L Sbjct: 436 NDKISVVKFDERLETELRLTAQGSKT-DCVKFDGLTRYGGSTALYAGADEGLESLK---- 490 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-----FKSNVNTIKICDKAKENFIKIV 365 + K ++ TDGE N+ K ++ KA+E I++ Sbjct: 491 ---------------NAQNNKVMLLFTDGEENSSLQYFGKRAFRASEVVKKAREKGIRVF 535 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI+ N + L + + Y + N D + V++ + Sbjct: 536 TIAYGTGVNNKTLNALSMLTDGKTYFIENPDEIKQVYEEL 575 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L+ +G+I Y+++V+ + D + + T + Sbjct: 55 QFLMGLLRGLNVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGLQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 320 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 376 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 377 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 426 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 427 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 479 Query: 406 SQLMV 410 + Sbjct: 480 RSSIC 484 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 74/233 (31%), Gaps = 65/233 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D K +++ KN + +D++ E +G I +++ + + Sbjct: 90 DTSKSMLAEDIKPDRLSRAKNIIESIIDNL-------EGDRIGFIPFSSAAYIQMPLTDD 142 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + R Y+ D + T+ A+ A + + Sbjct: 143 YDLARMYLDVIDTDMIAGGGTNVGTALNLAENSFEE------------------TSSADR 184 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I L+DGE +N S + I + +K+ TI I + Sbjct: 185 VVIILSDGEEHNSNS----VDILKSFNDEHLKVFTIGIGTAKGGLVPDYGSDGGQKSGYK 240 Query: 374 ------------NGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHRK 413 N + L K + +Y D + ++++ + K Sbjct: 241 KDSNGEFVMSKLNSETLKKLASTGKGSYYQSSLTGDEI----ASLTKKISSLK 289 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 78/264 (29%), Gaps = 45/264 (17%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y T + + N ++ E E P + ++D+SGSM+ Sbjct: 164 YQTPDSTEQPFAVNTHVFSAPWNSNAYLMEIGIKGFEPEQQELPPSNLVYLIDVSGSMNS 223 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ +K +L L + ++ Y Sbjct: 224 ED-------------------KLGLVKKSLKLLAQESSDQD------RISIVVYAGASGV 258 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 +EP+ G ++ T+ ++ AY++ + Sbjct: 259 VLEPTKGNDR-MAIEQALDRLSAGGSTNGGAGIELAYKLAEQAFIKDGIN---------- 307 Query: 326 SLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +I TDG+ N N I + ++ +E+ I T+ + + L++ Sbjct: 308 ------RVILATDGDFNVGTINREQLIDLVERKRESGISFTTLGFGSGNYNEHLMEQLAD 361 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 +Y DSL + + + Sbjct: 362 KGNGNY--GYIDSLQEARKLLVEQ 383 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 32/243 (13%), Positives = 74/243 (30%), Gaps = 72/243 (29%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D ++ K + ++ + Sbjct: 91 DIVFAVDVSKSM---------------LAEDIAPNRLEKAKRLVSEIINQLAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ ++ + T A++ A +++ Sbjct: 129 DRIGIIAYAGQAFPQLPITTDYGAAKMFLQNMNTNMLTSQGTAINEAIELATTYYDDEEQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ S T+K D A E I+I TI + Sbjct: 189 TN------------------RVLFIISDGEDH---SEGTTLKAVDDAIEEGIQIFTIGVG 227 Query: 371 ASPNG---------------------------QRLLKTCVS-SPEYHYNVVNADSLIHVF 402 S + +L+ S + + N + Sbjct: 228 KSKGAPIPIKRNGVVESLKKDRQGEVVITKLNETILQDIASEGEGEYIDGSNTSDAVEQI 287 Query: 403 QNI 405 + I Sbjct: 288 KEI 290 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 65/224 (29%), Gaps = 34/224 (15%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPIC------QDKKRTKMAALKNALLLFLDSIDLLSHVK 249 S+ D + +C + +K A + + +++ Sbjct: 20 NTQRQFSLAICATGDQDKTLPLNLCLVLDHSGSMAGKPLETVKQAAIELVKQLNVED--- 76 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +I + R + + + G + + + + T +K Q + K Sbjct: 77 ---RLSIIAFDHRAKVLVP-NQGIDNLNTIIEQINSLKPAGGTAIDEGLKLGIQESANGK 132 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 K I LTDGE N N +K+ A + I + T+ Sbjct: 133 KDRVSQ-----------------IFLLTDGE-NEHGDNERCLKLAHVASDYNITLNTLGF 174 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI---SQLMV 410 N L K S+ + D I F + +Q + Sbjct: 175 GNHWNQDVLEKISDSAGGTLCYIETPDKAIEEFSRLFNRAQSIG 218 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 65/218 (29%), Gaps = 49/218 (22%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V D SGSM +T++ A K+A F+ + + Y Sbjct: 1 MVVFDSSGSMITNDAGG--------------QTRIDAAKDAARTFITEAGDDAPLGLVTY 46 Query: 254 MGLI--------GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 G + P G + + T ++++A L Sbjct: 47 GGNTGEAPEDEAAGCQDITVVTPPEAGNS--EKMIAHMDGLQPRGFTPIGESLRKAAAEL 104 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--K 363 Q+ II ++DG V + + KE I Sbjct: 105 --------------------PKEGQRSIILVSDGVATCTPPPVC--DVAKELKEQGIDLV 142 Query: 364 IVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 I T+ N P Q+ L+ ++ + N +ADSL Sbjct: 143 INTVGFNVEPEAQQELQCIADATGGTYANASDADSLAK 180 >gi|163849338|ref|YP_001637382.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527332|ref|YP_002571803.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670627|gb|ABY36993.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451211|gb|ACM55477.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 418 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 38/272 (13%), Positives = 79/272 (29%), Gaps = 59/272 (21%) Query: 154 YDYRLQFIEH-LLNQRYNQKIVSFIPALLRIEMGERP--IFLIELVVDLSGSMHCAMNSD 210 D R+ L + Q + + + + + P + LV+D S SM Sbjct: 4 VDLRITPSRSVLPASQEPQLLYALVELSAQSGATKMPRLPLNLCLVIDRSSSMR------ 57 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-- 268 ++ +K A + LD + L+ + R E + Sbjct: 58 --------------GERLQQVKQAAMQILDLLGDHES------FALVTFNDRAEVVVSAQ 97 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + ++++ ++ T+ + Q L Sbjct: 98 LARARAEIKRQISAI---EAAGGTEMATGLALGVQELQRAMM----------------PR 138 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 ++ LTDG + ++I +A+ I I + I + N L Sbjct: 139 AVHRLLLLTDGRTYGDEG--RCVEIARRAQSRGIGITALGIGSEWNEDLLETIAARENSR 196 Query: 389 HYNVVNADSLIHVFQ-------NISQLMVHRK 413 + + +A + +F NI VH + Sbjct: 197 THYITSAAEITKIFTAEVERMHNIFAQDVHLR 228 >gi|291401551|ref|XP_002717040.1| PREDICTED: anthrax toxin receptor 2 [Oryctolagus cuniculus] Length = 486 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYDFVQKLTER-FVSPEMRLSFIVFSSQATIILPLTGDRGKITKGLEDLKSVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLEKIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELRPSSVCVG 232 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 77/237 (32%), Gaps = 44/237 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + QD K +++ A + A L V +G++ P+ Sbjct: 103 DLSGSMRAQDVKPSRLRAAQQAATTLL------EAQPAGVSVGVVAMAGTAAVAQAPTRA 156 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF------FTNFFRQGVKIPS 326 E V + R T M A L + T R+ + + Sbjct: 157 REAVATAIERL---QPQGGTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPRALAN 213 Query: 327 L-----------PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-- 373 I+ +DGE+N + + ++ A E+ ++I T+ + Sbjct: 214 PPADTEPVKPGSYTSGAIVLFSDGESNAGPAALRAAQL---AAEHGVRIYTVGVGTPEGV 270 Query: 374 -------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + L + ++ ++ + +A L V++ ++ + K S + Sbjct: 271 VLSVDGWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRALNARLAFDKRSQV 327 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 62/181 (34%), Gaps = 45/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D +++ K + ++++ Sbjct: 89 DIVFAVDVSKSM---------------LAEDISPSRLDKSKRIVSEIINNLGS------- 126 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y R + + + ++ ++ + T A++ A ++ Sbjct: 127 DRIGIIAYAGRAVPQLPITTDFSAAKMFLNNLNTNMLSSQGTAIDDAIRLAKTYYDDVEQ 186 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + ++ ++DGE++ + + ++A + IK TI + Sbjct: 187 TN------------------RVLVIISDGEDHTGGAGQ----LAEEATKEGIKTYTIGVG 224 Query: 371 A 371 Sbjct: 225 T 225 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 62/170 (36%), Gaps = 23/170 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 + + L +G+I Y+++V+ E + + + + + T Sbjct: 70 QFLVGLLHSLDVGLNATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPL--AQGTM 127 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + S+ +G + + ++ +TDG + ++ Sbjct: 128 TGLAIQYAMNVAFSEA----------EGARPSEERVPRVLVIVTDGRPQD-----RVAEV 172 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P ++ + V + D + Sbjct: 173 AAQARARGIEIYAVGVQRADVGS--LRAMASPPLDQHVFLVESFDLIQEF 220 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 62/185 (33%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 417 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 473 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + G + L + + TDG + + S Sbjct: 474 MERGTMTGLALRHMVEHSFSEVQ----------GARPRDLNVPRVGLVFTDGRSQDDISV 523 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + +++ H+ +N+ Sbjct: 524 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPDFNTMTHLLENL 576 Query: 406 SQLMV 410 + Sbjct: 577 KGSIC 581 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 62/170 (36%), Gaps = 23/170 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 + + L +G+I Y+++V+ E + + + + + T Sbjct: 57 QFLVGLLHSLDVGLNATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPL--AQGTM 114 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + S+ +G + + ++ +TDG + ++ Sbjct: 115 TGLAIQYAMNVAFSEA----------EGARPSEERVPRVLVIVTDGRPQD-----RVAEV 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P ++ + V + D + Sbjct: 160 AAQARARGIEIYAVGVQRADVGS--LRAMASPPLDQHVFLVESFDLIQEF 207 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 62/185 (33%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 404 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 460 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + G + L + + TDG + + S Sbjct: 461 MERGTMTGLALRHMVEHSFSEVQ----------GARPRDLNVPRVGLVFTDGRSQDDISV 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + +++ H+ +N+ Sbjct: 511 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPDFNTMTHLLENL 563 Query: 406 SQLMV 410 + Sbjct: 564 KGSIC 568 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 75/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + +VD+SGSM K Sbjct: 109 NKDSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMDAPD-------------------K 149 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + + P+ G ++ R V Sbjct: 150 LPLLQSSLKLLVRQLRAQD------RITLVTYAGNISVVLPPTPGDQQGR-IVEAIDALQ 202 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY+ R I+ TDG+ N + Sbjct: 203 SGGSTAGASGIELAYKAAQQGYLRGGIN----------------RILLATDGDFNVGVTN 246 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---------S 397 + + + + I + T+ L++ + + Y ++ Sbjct: 247 FDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLTHE 306 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 307 LGATLATIARDV 318 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/331 (8%), Positives = 67/331 (20%), Gaps = 35/331 (10%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA---------- 65 E+ + L + L + G +I + K ++SA ++ L+ A Sbjct: 7 QRERGAVLPMVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQELDGASDAL 66 Query: 66 -SKMVSNLSRLGDRFESISNHAKRALIDD--AKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + ++ + D + + S+ Y + Sbjct: 67 TRATNAGVTAGNANRVVYQASSASLANADVTFSTALNGTYSAAGVASSSSSYVKCKHSKS 126 Query: 123 NSSRISMTHMAN-NRLDSSNNTIFYNMDV----MTSYDYRLQFIE-------------HL 164 + + + N + + + + + Sbjct: 127 GITARLIKFVGGANSYAVAALAVATRVHAQSTCPIPVALIPKTGGTAPNYGFQVGEWVSM 186 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQD 222 L + L + V+ + Sbjct: 187 LYSGGSTTPGEMGWYNLDGSTNANETKTEMDQGYCNSKIGDQLGTPGAKVSVDDNWNARF 246 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + F ++ V T + + K Y Sbjct: 247 GIYKNKSNASQLQPDFTGYSYTSTNWTNTVPQNAYSGTPASGSDKSAANFKNKRLAYANY 306 Query: 283 DMDSLILKPTDSTPAM--KQAYQILTSDKKR 311 D + D + K Y+ L + Sbjct: 307 DDTGTSVSGGDKITGLNVKGGYKNLATSGSS 337 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 61/179 (34%), Gaps = 22/179 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ ++ + + +G+ ++ ++ + + D LI T Sbjct: 647 MKNFMQTLLAKIQIGADKTQIGVAQFSDYNKEEFPLNKYFTQKEISDAIDRMLLITGNTL 706 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+K T K +KF+I +TDGE + + Sbjct: 707 TGSALKFIDTYFTQS--------------KGARHGVKKFLILITDGEAQDD-----VREP 747 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + I+++ + N + L+ ++V N + L + + + L+ Sbjct: 748 AVALRDKGVIILSVGV-YGAN-RTQLEEISGDGSLVFHVENFEDLKEIERKLIFLVCTL 804 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L + + ++ V G + Y+ + + + + T + Sbjct: 834 INLTIHLVKKSDVGRDRVQFGALRYSDDPDILFYLNTYSNRSAIIEHLRRRRDTGGSTFT 893 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ ++ + N ++ +I +TDGE+++ +T Sbjct: 894 AKALGRSATLFEEQHGSRIKQN------------VKQMLIIITDGESHDRHLLNDT---A 938 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 K + I I I++ Q L+ + E +V + D L V+ + + M Sbjct: 939 LKLRNKGITI--IAVGVGKANQEELEAMAGNKENTIHVKDFDKLKDVYLPLQESMCT 993 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 56/180 (31%), Gaps = 24/180 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 L D+ + + V +G++ Y+ + +R+ + T Sbjct: 463 EFMLAVTDMFNIGPDKVRVGVVQYSNDRAVEFDIDVYHDNSVLRKAIYNIKQLK--GGTL 520 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ F ++G K + +++ LTDG KS + Sbjct: 521 TGKAL-------------DFILPIMKKGRKTRASQVPCYLLVLTDG-----KSEDEVLGP 562 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + I I I + + L E N D+L + + I + K Sbjct: 563 AERIRAEQISTHAIGIGKAHKKELL--QIAGEEERVNFGQNTDALKSIKKEIVHSICTEK 620 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/273 (11%), Positives = 76/273 (27%), Gaps = 74/273 (27%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 +L ++ + ++ I +D+S SM + Sbjct: 60 SVLAVVLVLLSIALANPKIGTKIETVKREGVDIVFAIDVSKSM---------------LA 104 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +D ++ K + + +G++ Y + + + ++ Sbjct: 105 EDVAPNRLEKAKRIAFETISQLK-------GDRVGIVAYAASAYPQLALTTDHSAAKMFL 157 Query: 281 TRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D L + T A++ A + + + + LTDG Sbjct: 158 QGMNTDMLSSQGTAIQEAIRMASNYFDENTPTA------------------RLLFILTDG 199 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------------------- 373 E++ + +I +A+E + I TI I Sbjct: 200 EDH----EMGATEIATEAQEKGVHIYTIGIGTEKGAPIPIKDGGEQTYKRDRNGEVVITK 255 Query: 374 -NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N + L + +++ + + N ++ I Sbjct: 256 LNRELLQQIAINAGGEYLDGDNTQKVVSQINKI 288 >gi|170744040|ref|YP_001772695.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168198314|gb|ACA20261.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 654 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 84/336 (25%), Gaps = 45/336 (13%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A + + + A + + + G Y + Sbjct: 147 AAMARAAGETAPVPSEPVGRDRFANAPEGGFRITREAPVSTVSLGVDTASYGIVRDALNR 206 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + L + Y +R+ +K++ + Sbjct: 207 NHLPPPAAVRTEEL-INYFPYAYPAPASPDAPFRVTASVFPSPWAEGRKLLHIGIRGYAV 265 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 ERP + +VD SGSM ++ +K +L + L ++D Sbjct: 266 APAERPPANLVFLVDTSGSMA-------------------APNRLPLVKQSLAMLLTTLD 306 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + L+ Y V +EP+ E R + T ++QAY Sbjct: 307 ARD------RVALVAYAGEVGTVLEPTPAGEAGR-ILAAIETLQAHGSTAGGEGIRQAYA 359 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFI 362 + + +I TDG+ N + + I Sbjct: 360 L----------------AARHFDPKAVNRVILATDGDFNVGITGRDELTGFVARERRKGI 403 Query: 363 KIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 + + L++ ++ A Sbjct: 404 FLSVLGFGMGNLNDALMQALAKDGNGVAAHIDTAQE 439 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 62/181 (34%), Gaps = 44/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D +++A K + + + Sbjct: 91 DIVFAVDVSKSM---------------LAEDIAPSRLAKAKRIVSEIIAQLGS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ ++ + T A+ A D++ Sbjct: 129 DRIGIIAYAGQAYPQLPITTDYGAAKMFLQGLNTNMLSSQGTAINQALDLASTYYDDDEQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + + ++DGE++ S +T +KA + IKI TI + Sbjct: 189 TN------------------RVLFIISDGEDH---SEGSTEGAVEKAVDQGIKIFTIGVG 227 Query: 371 A 371 Sbjct: 228 T 228 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 71/237 (29%), Gaps = 68/237 (28%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D + ++ K ++ F+ + E +GL+ + + Sbjct: 97 DTSKSMLAEDVRPNRLERSKFGIMDFVSKL-------EGDRVGLLPFAGTAFLMCPLTLD 149 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + R + ++ TD A+ +A + K Sbjct: 150 YDAFRNSLEALDTNIIPQGGTDIASAIYEAEAAF-------------------NNDANHK 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 ++ ++DGE+ ++ + AKE + I T+ + Sbjct: 191 ILVLVSDGEDLEGEA----LSAAQAAKERDLTIYTVGVGTPSGELIPLVQGGKDGAFVKD 246 Query: 374 ----------NGQRLLKTCVSSPEYHYN-VVNADSLIHVFQNISQLMVHRKYSVILK 419 + L K ++ + A+ L +++ K S++ K Sbjct: 247 EKGQPVKSRLDETMLQKIAEATGGRYEPLGQQAEGLEAIYRE--------KLSLVPK 295 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 49/222 (22%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + +DLSGSM + T++ A+ F + Sbjct: 92 DLAIALDLSGSMVRDDF----------YLDGQPITRLEAVTTVGAEFARR-------RAG 134 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + LI + + ++ TE + + + + + T+ + A+ A + + Sbjct: 135 DRVALIVFGSEAYYAAPFTFDTEAIARRIEEATIGISGRATNISDALGLALKRMA----- 189 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 S + +I L+DG NN N + A + +++ TI++ Sbjct: 190 -------------GSDADTRVVILLSDGANNAGA--TNPRGVAQLAAQMGVRVHTIAMGP 234 Query: 372 SP------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + L S + V + LI V Sbjct: 235 KSVDEAEEGERGVVDAETLDAISKVSGGETFRVRTTEDLIAV 276 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 65/190 (34%), Gaps = 34/190 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KV 276 ++A LK A F+ L ++++ ++G++ + V Sbjct: 310 VSGSMSGNNRIARLKQAAETFI-----LQNIEDGSWVGIVTFNNAATIQTGLQQVVSDTV 364 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ + + T+ +++ +Q+ +S + I+ L Sbjct: 365 RKTLNGYLPISANGGTNICAGVQKGFQVFSSKYASTEGCE----------------IVLL 408 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 TDGE++ S C + + + I TI++ S + + + ++ Sbjct: 409 TDGEDSGLSS-------CFAEVQRSGSVIHTIALGPSAAKELEM-LADMTGGLKFSATDS 460 Query: 394 -NADSLIHVF 402 ++ SL F Sbjct: 461 VDSSSLEDAF 470 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 53/178 (29%), Gaps = 29/178 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K + F+ ++ + K LI + + + + +++ + Sbjct: 215 IDTMKRTMSDFVRNL----NYKVGDTGELISFDSYLMYMATYTNDKDRLLTGIDNMT--- 267 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ + P +I TDG++N Sbjct: 268 PYGMTALYDALYTGIT-------------------NASNHPGFNCVIAFTDGQDNESTHT 308 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ AKE I + I + + L + Y +++ + + V I Sbjct: 309 --ADEVISLAKEKGIPVYLIG-TSEADSSTLQNIANETNGYFWDMNSISDMQEVMNRI 363 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 64/178 (35%), Gaps = 27/178 (15%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG--TEKVRQYVTRDMDSLILKP 291 + F+ + + L + +GLI Y + V+ +V + V Sbjct: 30 VKEFIVTILQFLDVAPDVTRVGLIQYGSTVKNEFSLKTYGRKSEVERAVKVMKRL--GTG 87 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A++ A I S +G + + I+ +TDG + + Sbjct: 88 TMTGLAIQYAVNIAFS----------ESEGARPLRENVPRIIMIVTDGRPQDPVA----- 132 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN 404 ++ KA+ + I I I + LK+ S P ++ + V N L VFQN Sbjct: 133 EVAAKARNSGILIFAIGVGQVDYN--TLKSIGSKPHQDHVFLVANFSQIESLTSVFQN 188 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 54/199 (27%), Gaps = 25/199 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +K A +D ++ + ++ + R + + S E + Sbjct: 51 SGSMNGRSLETVKKAANRLVDRLNPSD------RLSVVVFDHRAKVLVP-SQSVEDPEKI 103 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T ++ + L KK + F LTDG Sbjct: 104 KNQINRLAADGGTAIDEGLRLGIEELAKGKKDTVSQAF-----------------LLTDG 146 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 EN + +N A + + T+ + N L K + + + + Sbjct: 147 ENEHGDNNRCLK-FAQLAASYNLTLNTLGFGDNWNQDVLEKIADAGLGTLSYIQKPEEAV 205 Query: 400 HVFQNISQLMVHRKYSVIL 418 F + + + Sbjct: 206 DEFNRLFSRIQTVGLTNAY 224 >gi|326430897|gb|EGD76467.1| hypothetical protein PTSG_07584 [Salpingoeca sp. ATCC 50818] Length = 985 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 66/220 (30%), Gaps = 28/220 (12%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I LV+D+SGSM N + ++T + +K+A + + Sbjct: 217 IKTPVHICLVIDVSGSMDRHA----TQRNKYNQLEKFEQTYLDIVKHAARSIANFLQNED 272 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + Y+ ++ ++ + TE V + T+ + + Q+L Sbjct: 273 N-----YLSVVAFSENASVTLPMERMTEAGVSRATDAVGALQPCSCTNLGDGVLRGMQLL 327 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAK-ENFIK 363 + Q ++ LTDGE N + + K Sbjct: 328 LKG---------------TDTTKAQPVLMVLTDGEPNEGNDARDVLRSFRNTYKHARRFI 372 Query: 364 IVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + T + L + + V ++ + F Sbjct: 373 VNTFGFGFEQIDSPLLSELAALGGGTYAFVPDSSFVGTAF 412 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 75/226 (33%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + + +GLI + + + Sbjct: 98 DISNSMLAEDVAPSRLEKSKLLVENLMSRFSE-------DKIGLIVFAGEAFVQLPITSD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A++ A T + K Sbjct: 151 YVSAKMFLDNINPSLIGTQGTDIGKALQLAANSFTP------------------NSKAGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE+N + + +A+ IK+ + I + + +G+ Sbjct: 193 AIILITDGEDNEGGA----EAMAKQARSKGIKVFILGIGSRQGAVIPMPDGSELKTSDGE 248 Query: 377 RL---------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + + +V N++ V + + + + Sbjct: 249 PVKTHLNEEMCRQIAAAGQGVYVHVDNSNVADAV---LGRELGKLQ 291 >gi|52082207|ref|YP_080998.1| YwmC protein [Bacillus licheniformis ATCC 14580] gi|52787598|ref|YP_093427.1| YwmC [Bacillus licheniformis ATCC 14580] gi|319648080|ref|ZP_08002297.1| YwmC protein [Bacillus sp. BT1B_CT2] gi|52005418|gb|AAU25360.1| YwmC [Bacillus licheniformis ATCC 14580] gi|52350100|gb|AAU42734.1| YwmC [Bacillus licheniformis ATCC 14580] gi|317389715|gb|EFV70525.1| YwmC protein [Bacillus sp. BT1B_CT2] Length = 228 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 61/225 (27%), Gaps = 31/225 (13%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + +++D SGSM + A +K+ + + + Sbjct: 34 EDAPNVAIMLDASGSMAKKI-GGVSKYELAKNEAFSFGSKLENA----NVLMRVFGSEGN 88 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 K + V ++ + R + T A++ A L Sbjct: 89 NKNSGKVQSCNAIRGVYGFQ--TYDEQSFRNSLNGI---GPTGWTPIANALQDAKNALDQ 143 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + LTDGE N +K+ + +++ + I Sbjct: 144 -----------------LDNNGKNVVYLLTDGEETCG---GNPVKVATELRKSNAVVNVI 183 Query: 368 SINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + +L + ++ + + +F + + Sbjct: 184 GFDYEGDFHGQLTSIAAAGGGEYFQAKTKNDIKRIFTQEAIELSK 228 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/307 (12%), Positives = 84/307 (27%), Gaps = 54/307 (17%) Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 K+ +S Y + ++ S + +T + L Sbjct: 319 KQGISMDVRGKYGLPASLQQLNEFDAIMFADIEATHLSARQMDNVRKYVTEFGGGLIMTG 378 Query: 163 HLLNQRYNQKIVSFIPALLRIEM-----GERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + I +L + E+P + LV+D SGSM+ Sbjct: 379 SENSFGLGGYYKTPIEEVLPVTSRYEKEKEQPSLALVLVIDKSGSMN------------- 425 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + A + + +G+I + + + + K Sbjct: 426 -------GQPIVLAREASKAAAELLSSRD------QVGVIAFDGSAKLVTDLTSAANKGE 472 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +++ T+ PAM +L + K +I L+ Sbjct: 473 -VLSQIDGIGAGGGTNLYPAMVMGRDMLGIASAKI------------------KHMIVLS 513 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++ + ++ + I T+S+ + Y NA+ Sbjct: 514 DGQSQGGDFEGISSELAQ----MGVTISTVSLGQGAAVDLMAAIAQIGNGRAYVTNNAEE 569 Query: 398 LIHVFQN 404 + +F Sbjct: 570 MPRIFTK 576 >gi|300812790|ref|ZP_07093186.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] gi|300496224|gb|EFK31350.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] Length = 893 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/418 (7%), Positives = 105/418 (25%), Gaps = 45/418 (10%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA ++ +L G + ++ + A + ++ G + Sbjct: 14 LFAFLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + L D ++ + Y + + + Sbjct: 70 QRGGDQVLGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR---IEMGERPIFLIELVVDL 199 + + + + ++ + + V + Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESNSRGTNRADAVRTGVKNFLTSIQNAGLGDYVNVGLI 189 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S + + I + + + + G Sbjct: 190 GFSSPGYIGGGNKTTGPGYIRVGLGK---------VGNTKQQQAINDALSPMFQGGTYTQ 240 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + T ++ + D + + + ++ S Sbjct: 241 IGLRQGSAMLNADTSGNKKMMILLTDGVPTFSNKVINSEWINGTLYGTNFGSSRDKPGDT 300 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG---- 375 ++ P I+ T T+ AK++ ++ + I + + Sbjct: 301 ARLRWPYSDKSGHRIYDTW---------PATLGEAKIAKDSGNEVHALGIQLADDRYYAD 351 Query: 376 -------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 ++ ++ +SP+ + + +AD++ +N ++ ++ K++ + G Sbjct: 352 DGYYMTYMTKEKIRKNMQLITNSPDLYEDADSADAVEAYLKNQAKDIIK-KFNTVADG 408 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 53/171 (30%), Gaps = 32/171 (18%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + + L+ + + S+ E + + + + + T + + A Sbjct: 126 ADFARRRGGDRVALVVFGSEAYFAAPFSFDVEAIARQIESAQIGVSGRATSISDGLGIAL 185 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + + +I L+DG NN N + + A + + Sbjct: 186 KRMENSEA------------------ASRVVILLSDGVNNAGA--TNPRGVAELAAQMGV 225 Query: 363 KIVTISINASP------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ TI++ + L S + V + L+ V Sbjct: 226 RVHTIALGPKDLSSADPGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/250 (11%), Positives = 67/250 (26%), Gaps = 64/250 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + ++ I V+D+S SM ++++ A KN+++ + Sbjct: 82 QQKVYSSKGASIVFVLDVSPSMAAKD--------------IGSQSRLEAAKNSIVSMAEM 127 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +GL+ + P+ + + + T + A Sbjct: 128 -------NSGSELGLVVMAENAAVLVSPTMDRKFFLNRLKTVSVGELGDGTAIGTGLSSA 180 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L +K I+ +TDGENN+ + A Sbjct: 181 IYHLEKS------------------KSPKKSIVLITDGENNSGAV--HPHTAARLAVNKD 220 Query: 362 IKIVTISIN---------ASP--------------NGQRLLKTCVSSPEYHYNVVNADSL 398 I + + + P + + + + + + +L Sbjct: 221 ISLYILGVGTRGVVPIDYVDPKSNKIYSGYLESKFDTSSIARIASEGNGKFFEIESISAL 280 Query: 399 IHVFQNISQL 408 +IS+ Sbjct: 281 SQAISSISKS 290 >gi|304382531|ref|ZP_07365026.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] gi|304336362|gb|EFM02603.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] Length = 340 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 67/217 (30%), Gaps = 54/217 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D+ + +GLI + + + Sbjct: 97 DISNSMLAQDVVPSRLEKSKMLIENMVDNFT-------NDKVGLIVFAGDAFIQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ A + T K Sbjct: 150 FVSAKMFLQNIDPSLIATQGTDIAKAIDMAMKSFTQ------------------QENVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ AK + + + GQ Sbjct: 192 AIIVITDGEDHEGGA----LEAAKTAKAKGYNVFILGVGTAKGAPIPTGDGSYLKDNTGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQN 404 ++ + + + +V N + + N Sbjct: 248 TVITALNEDMCKEIAAAGSGTYIHVDNTNEAQKILNN 284 >gi|297199802|ref|ZP_06917199.1| lipoprotein [Streptomyces sviceus ATCC 29083] gi|197710264|gb|EDY54298.1| lipoprotein [Streptomyces sviceus ATCC 29083] Length = 506 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/241 (10%), Positives = 67/241 (27%), Gaps = 45/241 (18%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V ERP + V+D+SGSM + ++ Sbjct: 142 VRVGLATRSAGENADERPPAALTFVIDISGSM-------------------SEPGRLDLA 182 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + +L + + + L+ ++ R K + + + Sbjct: 183 QRSLDTMTERLRDDDS------VALVTFSDRARKVLPMTRLGGHRDRIHEAIDGLEPTYS 236 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVN 349 T+ ++ Y+ ++ ++D N+ + Sbjct: 237 TNLGAGVETGYKTAVEGL----------------RKGATNRVVLISDALANDGETDPDAI 280 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQL 408 +I +E+ I + + + + + + + V D+ VF + + Q Sbjct: 281 LERIDTARREHGITLFGVGVGSDYGDALMERLADKGDGHTVYVSGDDA-EKVFCEQLPQN 339 Query: 409 M 409 + Sbjct: 340 I 340 >gi|51095063|gb|EAL24307.1| similar to matrilin 2 precursor [Homo sapiens] Length = 448 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/271 (12%), Positives = 77/271 (28%), Gaps = 29/271 (10%) Query: 156 YRLQFIEHLLN-QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + L Y +S + G + C Sbjct: 65 FPADRDGSLALHSTYESLRLSASSWTVNPLRGINMMPSSLAPSSQGCGPKCKETPLELVF 124 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 +KN + D + L +G+I Y+ +VEK + Sbjct: 125 VIDSSESVGPEN-FQIIKNFVKTMADRVALDLAT---ARIGIINYSHKVEKVANLKQFSS 180 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K + D + + T + A++ A + +K + Sbjct: 181 KDDFKLAVDNMQYLGEGTYTATALQAANDMFED-----------------ARPGVKKVAL 223 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYH 389 +TDG+ + + ++ A + ++I I + + + + + PE+ Sbjct: 224 VITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHV 282 Query: 390 YNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 Y + +L + + Q + S +++ Sbjct: 283 YQFDDFFTLQDTLKQKLFQKICEDFDSYLVQ 313 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 40/146 (27%), Gaps = 15/146 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F+ E+ N +IFA+++++ L +I V + ++ SM + Sbjct: 10 FRYLFKDERGNVLVIFAVALIALLGFAAIVIDVGGMYVERRSM----------VTAADAG 59 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY-----NTEIQNIVNSS 125 L+ + ES + I + + + VN Sbjct: 60 ALAGARELAESGDEALAEQAAALFAQTNGAEITDDIEVLDVEYNGESFKAVRASVGVNRE 119 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM 151 + N D + Sbjct: 120 HFFAKALGFNDSDVFARAVATWGYPK 145 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 77/242 (31%), Gaps = 40/242 (16%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 IP + + V+D SGSM + ++ RT A+ Sbjct: 22 IPLVNAESSNTSSNLDVVFVLDSSGSMKESD-------------PEEIRT------EAIK 62 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP-TDS 294 +FLD + GL+ Y+ V + + + ++ + ++ L TD+ Sbjct: 63 MFLDM-----SQVQGNKFGLVAYSDNVVREHNLDTINSNDDKERIKNMALNIPLGQKTDT 117 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + +A ++ S ++ + ++ K ++I C Sbjct: 118 GAGILEAVNLMNSGHDKNHKPVIILLSDGKNDPQRK---------TEDSLKDLKSSISTC 168 Query: 355 DKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + TI +N + +L + + +Y A L + +I Sbjct: 169 ---KDKGYPVYTIGLNYDGTVDKTQLEEMSNETKGKNYITSTAADLPKILTDIYADNSKL 225 Query: 413 KY 414 K Sbjct: 226 KV 227 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 70/229 (30%), Gaps = 43/229 (18%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWG 272 A + K+ +++A F+++ S + E + G + + Sbjct: 1074 DASASMQENGKKLKDIQDAAKAFVNTTKEKSPISEIAVIWYQGSEGSSSTITDSGFYTLD 1133 Query: 273 TEKVRQYVTR-DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 T + R + T A+++A IL+ S + F G+ + Sbjct: 1134 TSDNVDAINRFISNKNASGGTPMGDALEEANSILSGRPNSSKYALLFTDGMPGYNSSNNS 1193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN----------------- 374 F + + NN + KE K+ TI S + Sbjct: 1194 FNCMVANHANN----------EAKEIKEY-AKLYTIGYKLSGSFKWEEGHSQDSTNNHGS 1242 Query: 375 ------GQRLLKT-CVSSPE----YHYNVVNADSLIHVFQNISQLMVHR 412 LK SSPE Y Y N D L +F++I+ + Sbjct: 1243 HKTETKAADFLKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQIGDL 1291 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 73/237 (30%), Gaps = 24/237 (10%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G++ + LV+D SGSM N + M LK + +D+I Sbjct: 540 GKKAGVDVLLVIDKSGSMGLNDNGRTDSN---------YFNLMPTLKKTVPTLVDTILPD 590 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQI 304 S V +I W + + T+ AM+ A + Sbjct: 591 SDSVNRVAAISFSSDDYTGNDISTDWVDYNGKSGFNRKIEGLGTKGGTNWQLAMRNADKK 650 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF-----LTDGENNNFKSNVNTI----KICD 355 L + P+ ++K + T G + +N+ D Sbjct: 651 LKPRAESQNKKVVVFLSDGEPTYRYEKRSYYPYEEYETGGGQSYSSANLTNAVDEVAGSD 710 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNISQLMV 410 K+ +I ++ + S R+ + ++ + + + ++ Q I +V Sbjct: 711 YLKD--AEIYSVYL-TSQTSTRMTEFANKLTAKNINAHAKDGTNMASALQEIINQIV 764 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/219 (10%), Positives = 69/219 (31%), Gaps = 12/219 (5%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + ++ + ++ + P + A D + + + ++ + + +G Sbjct: 27 ICTVNQANAASLGAVPIEGYDAVFVLDTSYSMRDTDPEGISAEVINMFMDLSDADRTRVG 86 Query: 256 LIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSF 313 + Y V + + + + +D+ +L TD +++ ++L + + Sbjct: 87 FVAYNHNVVASKPLTSIAVAAQKSQIQQDIRTLNRSGYTDLGLGLRRGSELLAAGASQGR 146 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 G + SN + + A+ + TI +N Sbjct: 147 QPFMILLSDGETDFGASS-------GSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDG 199 Query: 374 --NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI-SQLM 409 N Q L + + + +A+ L + I + + Sbjct: 200 TVNRQELERIASQTGGASFITSSAEDLPEILNRIFASQI 238 >gi|47227632|emb|CAG09629.1| unnamed protein product [Tetraodon nigroviridis] Length = 457 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 67/184 (36%), Gaps = 22/184 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ ++ V + + I ++ + + + + + K+++ + + D T Sbjct: 68 IYSFVKNLTDR-FVSPRMRVSFIVFSAQAKVLLPLTGDSYKIKEGLRKLYDVKPAGETFM 126 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K+A ++ P I+ LTDG+ + ++ + Sbjct: 127 HVGIKEA-----------------SVQIRAQPSPTSSIILALTDGKLEVYVHDLTVKE-A 168 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQLMVHRK 413 ++A++ ++ + I ++ L + + + V + +L + +I + Sbjct: 169 NEARKYGARVYCVGIKD--FDEQQLANIADTKDQVFPVKDGFHALKGIVNSILKRSCTEI 226 Query: 414 YSVI 417 SV Sbjct: 227 LSVE 230 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 65/215 (30%), Gaps = 18/215 (8%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G R + S N + V + + T+ K L +DS+ Sbjct: 28 GRRNHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSVRPTEFEKAKEFLQDLVDSL--- 84 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +GL+ Y + V K + T + A++ A + Sbjct: 85 EVGLDSTRVGLVNYASTVRMEFPLKAHFSKPALKGALARVEPLASGTMTGLAIRTAVEKA 144 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + G ++ S + + +TDG + ++ A+E+ I+I Sbjct: 145 FA----------AEAGARLNSTKVARVAVVVTDGRPQD-----EVERVSAAARESGIEIY 189 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++ + L ++ + V + Sbjct: 190 AVGVDRADRTSLRLMASQPHEDHVFYVETYGVIEK 224 >gi|149018695|gb|EDL77336.1| similar to procollagen, type VI, alpha 3 isoform 4 (predicted) [Rattus norvegicus] Length = 719 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/343 (9%), Positives = 98/343 (28%), Gaps = 41/343 (11%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 ++ + + + K + + S++++ R + + I Sbjct: 26 YSDKIISQFLLTQYTSMEKLGTAIGNIQQGGGGTTTGEALSKMALVFRNTARTNVAQYLI 85 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG--S 202 + + L + N + + E+ E + D S Sbjct: 86 VITDGQSSD--PVADAAQGLRDTGINIYAIG-VRDANTTELEEIANNRVFFTDDFHFLKS 142 Query: 203 MHCAMNSDPEDVNSAPICQDK-----------KRTKMAALKNALLLFLDSIDLLSHVKED 251 +H + D + + +K+ + ++ + + Sbjct: 143 IHQEVVRDICSFENCRSQKADIIFLIDGSESISSEDFEKIKDFVKRMVNQ---SNIGADK 199 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKK 310 + +GL+ +++ + + + + I T++ A+ + Sbjct: 200 IQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQG 259 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +++I +TDG++++ + ++ I I I + Sbjct: 260 G--------------RPGVHQYLIVITDGDSHDDIVSP-----AKALRDRNIIIFAIGVG 300 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 Q L + + Y+ N +SL ++ + I + + Sbjct: 301 KIQRAQLL--AITNDQDKVYHEENFESLQNLEKEILYEVCTSQ 341 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 50/172 (29%), Gaps = 25/172 (14%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D I + + V G++ Y+ ++ + T + A+ Sbjct: 7 DVIKMFHIGPDGVRFGVVQYSDKIISQFLLTQYTSMEK----------------LGTAIG 50 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 Q + +++I +TDG++++ ++ ++ Sbjct: 51 NIQQGGGGTTTGEALSKMALVFRNTARTNVAQYLIVITDGQSSDPVAD-----AAQGLRD 105 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I I + N L + + + L + Q + + + Sbjct: 106 TGINIYAIGV-RDANTTELEEIA---NNRVFFTDDFHFLKSIHQEVVRDICS 153 >gi|326407579|gb|ADZ64650.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 1433 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/298 (10%), Positives = 78/298 (26%), Gaps = 39/298 (13%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + N + + + + Sbjct: 293 DVYLNVRGNVQNPIKPVDIVLVIDMSGSMQGAKETAVRQGVSDF--LSTIQNTAYADYVN 350 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V + S + + P +D + SHVK Sbjct: 351 VGIVGYSSPGNYVTGASGYITVP--------------------IDKVSSESHVKSINQAL 390 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + GTE + Q + D +++ TD P + + Sbjct: 391 APQFSGGTFTQLGLRKGTEMLEQD-SSDNQKMMILMTDGVPTFSYKVNSASKVDNVIYGQ 449 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI------KIVTISI 369 +F + + + + D N + E I +I T+ I Sbjct: 450 SFAESRDEPGNTSKIQSPYSVKD--INGGSNIEIRDTWAATLGEAEISKQEISEIHTLGI 507 Query: 370 NASPNGQRLLKT-------CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 +G L + +++ + + +A+ + +N + +++ +++ I G Sbjct: 508 QLGNDGSYLSQEEVKSRTSLIATTGLYQDANSANDITDYLKNQANVVLS-RFNTITNG 564 >gi|209527391|ref|ZP_03275898.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492182|gb|EDZ92530.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 488 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 57/191 (29%), Gaps = 33/191 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ ++ A F+ + + ++ +++R + + +++Q Sbjct: 59 TSGSMSGQKLREVQTAASEFVSR-----QNLKRHDLAVVEFSSRASVVADFTRNETELQQ 113 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R T+ + A +L + + I+ TD Sbjct: 114 AIARLS---ARGGTNLSEGFNLATSVLQNSDRT-------------------PNILLFTD 151 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G NN I + + + I + +++ L P+ + N L Sbjct: 152 GVPNN---PPMAASIAQQIRASGINL--VAVGTGDAQINYLTALTGDPDLVFYA-NFGDL 205 Query: 399 IHVFQNISQLM 409 F+ + + Sbjct: 206 DRAFRGAEKAI 216 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 49/394 (12%), Positives = 90/394 (22%), Gaps = 87/394 (22%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 N +R N+ A D I S + T + ++ Sbjct: 33 NAAENAARTTTTDNVTINNWHDANALDDSTKNVGRIWTDKSVSAGDVTLTSREKESGTAT 92 Query: 127 ISMTHMANNRLDSSNNT----IFYNMDVMTSYDYRLQFIEHLL---NQRYNQKIVSFIPA 179 I ++ + S + I V L + N K + + A Sbjct: 93 IKKGADSDFLVGLSALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKA 152 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + D + K+ ++A +K A Sbjct: 153 AV------------------------NSFIDGSAKVNDQRADVNKQNRIAVVKFAGNK-T 187 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D I + + + + + S + V T + AM Sbjct: 188 DKIGNDQYSQNRYWYNYTQVVSGYKAYT--SGNKSEWETTVNALK---PAGCTAADYAMD 242 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKA 357 K + ++ +IF TDGE N + + A Sbjct: 243 L--------TKTLVDQSKTDANNNADRKNVKRVVIFFTDGEPNHQSGFDDDVANDAITSA 294 Query: 358 K--ENFIKIVTISINASPNG-----------------QRLL------------------- 379 K + I TI I + + + Sbjct: 295 KTIKTDADIYTIGIFSGADVSITGHSGSGSWSAKEKFNAFMHGLSSNYPDAERYKKLGTR 354 Query: 380 -KTCVSSPEYHYN-VVNADSLIHVFQNISQLMVH 411 K +Y AD L ++F I ++ Sbjct: 355 AKDSKGQDATYYKVATKADELKNIFTQIEDEIIS 388 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/257 (12%), Positives = 65/257 (25%), Gaps = 45/257 (17%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y + + + ++IV + +R + +VD+SGSM+ Sbjct: 245 YTRPTSAARPFAVTATTTASPWSEGRRIVHVGLQGYELPENQRRPLNLTFLVDVSGSMN- 303 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ K ++ L +D + + + Y Sbjct: 304 ------------------SPDKLDLAKQSMNLIIDRLRPQD------RVAVAYYAEGAGT 339 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + P+ GT+K++ T M AY + R+ Sbjct: 340 TLAPTAGTQKLK-LRCAVASLRASGGTAGATGMTNAYDQAQASFGRNKVN---------- 388 Query: 326 SLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 I+ TDG+ N + + I + ++ Sbjct: 389 ------RILMFTDGDFNVGVTDDKRLEDYVADKRRTGIYLSVYGFGRGNYQDARMQAIAQ 442 Query: 385 SPEYHYNVVNADSLIHV 401 + D L Sbjct: 443 AGNG--TAAYVDDLKEA 457 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 56/204 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D K ++ + +L FL+ V + +GLI ++ + Sbjct: 76 DVSPSMMVEDIKPNRLERARREVLDFLN-------VVQGDRIGLIAFSGVAFVQCPLTLD 128 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ ++ + + + TD A++ K Sbjct: 129 YGAIQMFLDELKPELIPVAGTDLGAAIEAGISSFDFKSVTD------------------K 170 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGE+N K + KAKE +KI + Sbjct: 171 VIMLITDGEDNEGKG----LIAAQKAKEKGVKIFVFGMGDPSGGPIPATDGKGGFRKDRN 226 Query: 374 --------NGQRLLKTCVSSPEYH 389 + + L K ++ + Sbjct: 227 GNVIMSKMDEESLEKIASATGGRY 250 >gi|260912479|ref|ZP_05919015.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] gi|260633398|gb|EEX51552.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] Length = 591 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 72/228 (31%), Gaps = 57/228 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D +GL+ + + + Sbjct: 134 DISNSMMAQDVVPSRLEKSKLLIENLVDHFTH-------DRIGLVVFAGDAFVQLPITTD 186 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ + + + K Sbjct: 187 YVSAKMFLQNIDPALIATQGTDIAKAINLSMRSFSQ------------------QKDIGK 228 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---------------ASPNGQ 376 +I +TDGE++ + ++ A E I++ + I A +GQ Sbjct: 229 AVIVITDGEDHEGGA----LEAAKAANERGIRVFILGIGSTKGSPIPLAEGGYLADRSGQ 284 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L + + + +V N + ++ + + + Sbjct: 285 TVLTALNESMCKQIAQAGNGTYIHVDNT---NDAQEKLNNELAKLQRA 329 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/273 (11%), Positives = 76/273 (27%), Gaps = 74/273 (27%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 +L ++ + ++ I +D+S SM + Sbjct: 60 SVLAVVLVLLSIALANPKIGTKIETVKREGVDIVFAIDVSKSM---------------LA 104 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +D ++ K + + +G++ Y + + + ++ Sbjct: 105 EDVAPNRLEKAKRIAFETISQLK-------GDRVGIVAYAASAYPQLALTTDHSAAKMFL 157 Query: 281 TRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D L + T A++ A + + + + LTDG Sbjct: 158 QDMNTDMLSSQGTAIQEAIRMASNYFDENTPTA------------------RLLFILTDG 199 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------------------- 373 E++ + +I +A+E + I TI I Sbjct: 200 EDH----EMGATEIATEAQEKGVHIYTIGIGTEKGAPIPIKEGGGQTYKRDRNGEVVITK 255 Query: 374 -NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N + L + +++ + + N ++ I Sbjct: 256 LNRELLQQIAINAGGEYLDGDNTQKVVSQINKI 288 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 68/226 (30%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ +CQD +++ K + +D+ + +GLI + + + Sbjct: 97 DISNSMLCQDVVPSRLDKSKMLIENLVDNF-------NNDKIGLIVFAGDAFVQLPITTD 149 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ LI + T+ A+ A + T + Sbjct: 150 YVSAKMFLQNITPGLIQTQGTNIGAAIDLASKSFTQ------------------QENVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGEN+ + + I + + I G Sbjct: 192 AIIVITDGENHEPGAQEAAAAA----NKKGINVFILGIGNTKGAPIPMGDGSYLKDNAGN 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 248 TVMTALNEQMCKELAQAGKGQYIHVDNTS---DAERALNDDIAKLQ 290 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 43/363 (11%), Positives = 91/363 (25%), Gaps = 64/363 (17%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 +L F++ ++ LI+ + K LS +A I + I Sbjct: 119 NSLYDMQLDLEFKADESYQDNRLINYNLTGQYKNSKNKLSIDTA------ISALNTKQVI 172 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 S + + V T+ + + N+ Sbjct: 173 SKVAKGKKGVALAYRNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGK-------------- 218 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P+ D R +M LK L +D + + + Sbjct: 219 ----------------------RNSQGKWGPLDADNPRARMNILKKKANLLVDDLKEIGN 256 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V ++ + E +E T K+++ + S I T+ ++ L Sbjct: 257 VSVNLVRFSGSASYIQEDFVELDKDTGKIKEKIKSLPTSWI---TNPGDGLRYGLVSLQR 313 Query: 308 DKK---------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + I + ++ T + AK Sbjct: 314 NPAQLKYVVLLTDGIPNAYTGSPDGIGKYDLTANFPTDNKQIKADQPVSLTTEYVGQVAK 373 Query: 359 --ENFI-KIVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + + +I I + +KT + N +L F +I + Sbjct: 374 TFGSGVKRISVIGFSGNVGEIKDGQKIADQIKTVGNVESTFVIATNEAALEQTFADIKKQ 433 Query: 409 MVH 411 + Sbjct: 434 IQQ 436 >gi|297170900|gb|ADI21918.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [uncultured gamma proteobacterium HF0130_26L16] Length = 563 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 68/227 (29%), Gaps = 42/227 (18%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 ++ ++D SGSM + P ++ K L ++ Sbjct: 28 NILYFILDASGSMWERVEGKP---------------RIVIAKETLSSLIEKTPAEIRTGI 72 Query: 251 DVYMGLIGY-TTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 Y + + +++ + S Q R + T T +++Q L ++ Sbjct: 73 TAYGHRRKFDCSDIQEIVPLKSLDPMTKYQVQERISTLSAMGKTPITDSIRQTVDRLKTE 132 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVT 366 + + I+ ++DG + K + K + I + Sbjct: 133 EG-------------------RSTIVLISDGLESCGKDPCALTS---ELKSSGINFVMHV 170 Query: 367 ISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + ++ L + + NA L+ + + +V + Sbjct: 171 VGFGLTQEQEQKLSCISAAGEGKFFTAGNAADLLDALTVVKESVVEQ 217 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 10/77 (12%), Positives = 26/77 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 ++ + + +++ L +I V +++A +A LAGA Sbjct: 2 LSRIVRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAR 61 Query: 67 KMVSNLSRLGDRFESIS 83 ++ + +I Sbjct: 62 ELDGRDDAITRAQTAIE 78 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 68/191 (35%), Gaps = 20/191 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ + +A+ LD+ + + ++++ +I + ++V + + ++ Q Sbjct: 28 SGSMSGEKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVALHQPLASASDIHWQ- 86 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D T A++ A ++ +PS ++ ++ ++DG Sbjct: 87 -----DLSAGGMTPLGTALQMAKAMIEDK-------------DVVPSRAYRPTVVLVSDG 128 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N+ I D + + ++I A + L K + + NA L Sbjct: 129 GPNDAWEKPLNAFISDG-RSAKCDRLAMAIGADADEAVLGKFIEGTSNRLFYAENAKQLR 187 Query: 400 HVFQNISQLMV 410 F+ ++ + Sbjct: 188 DFFKFVTMSVT 198 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 93/306 (30%), Gaps = 41/306 (13%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + ++S +++T + + L + Y L + + + Sbjct: 10 VAVMWLGGALSSQTLTITQVDTSSLLFTQTVRAYFYVDNLPGGKVLTAADIEARESPDGA 69 Query: 173 IVSFIP-ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +P +R + LV+D SGSM D +D R ++ Sbjct: 70 SWRLLPVRSVRRGVNREEGISFLLVLDASGSM--------WDALDGTPTEDPDRMRITHA 121 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILK 290 K A+ FL + +GL + V + + S Sbjct: 122 KRAIREFLPLLSGRD------RVGLAVFNRTYRVIQPIVGDPSLVLEKLDAIERPSREQA 175 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---NNFKSN 347 T+ +M++A + + ++ ++ L+DGEN + +S Sbjct: 176 YTELYRSMEEALTDFGEEGR-------------------RRVLVVLSDGENFPVDPSESP 216 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 D A I I + RL+ S + ++ NA L V+ I Sbjct: 217 STPGTAIDLAHRYGITCYVIHFGTEKD--RLIGDLASETGGRVFDARNALELASVYTAIQ 274 Query: 407 QLMVHR 412 + ++ Sbjct: 275 EQVLQE 280 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 77/239 (32%), Gaps = 35/239 (14%) Query: 193 IELVVDLSGSMHCA----MNSDPEDVNSAPICQDKKRTKMAALKN-ALLLF-------LD 240 I + S + C + D + + K+ + F L Sbjct: 543 ICVNTGDSYACKCHEGFALQEDRKTCKGCTEGPVDLVFVIDGSKSLGVNNFEIVKEFVLG 602 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTRDMDSLILKPTDSTPAM 298 +D L+ + +GL+ Y+T+V + ++ V++ T Sbjct: 603 ILDSLTISPKAARIGLLQYSTQVRTEFTLKQFSTATDMKKAVSQMKYMGKGSMT------ 656 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + +T +G + S + + TDG + + KAK Sbjct: 657 GLALKQMTERSFT------EAEGARHLSAKVPRVCVVFTDGRAQD-----EVSEWAAKAK 705 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I + I I + + L+ S P ++ + + ++ + + + + M K S Sbjct: 706 QRGITMYAIGIGKAIEEE--LREIASDPPEKHLFYAEDFSAMGEITEKLQKRMCEGKSS 762 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 67/176 (38%), Gaps = 23/176 (13%) Query: 235 LLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ + + L + +GLI Y + V+ T +Q + R + ++ T Sbjct: 38 VKEFILNILQFLDIRPDVTRVGLIQYGSTVKNEFSLK--TFARKQDMERAVRRMMYLSTG 95 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + + + +G + S + I+ +TDG + + +I Sbjct: 96 TMTGLAIQFAV--------NIAFSETEGARPLSQNVPRVIMIVTDGRPQDPVA-----EI 142 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH---VFQN 404 KA+ + I I I + + LK+ S P ++ + V N + VFQN Sbjct: 143 AAKARNSGILIFAIGVGR-VDMNT-LKSIGSEPYEDHVFLVANFSQIETLTSVFQN 196 >gi|295688686|ref|YP_003592379.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] gi|295430589|gb|ADG09761.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] Length = 583 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 17/178 (9%), Positives = 49/178 (27%), Gaps = 26/178 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ + AL + +D + + ++ Y + P+ G K++ Sbjct: 234 PDRLPLAQKALNVLIDQLRPQD------RVAMVAYAGSAGAVLAPTDGRSKLKMRC-ALG 286 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T ++ AY + + +I +TDG+ N Sbjct: 287 ALQAGGSTAGGRGLELAYGLAKQNF----------------DKKAVNRVILITDGDFNVG 330 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ +++ + + +++ + D+L Sbjct: 331 IADPSRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGV--AAYVDTLNEA 386 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 72/231 (31%), Gaps = 52/231 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + ++D+SGSM+ K+ LK++L + +D Sbjct: 240 KAANLVFLLDVSGSMNAPD-------------------KLPLLKSSLTMLTKQLDENDS- 279 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + ++ Y + + G + + + T+ ++ AYQI + + Sbjct: 280 -----VAIVVYAGAAGLVLPATKG-NEYQVISNALNNLSAGGSTNGAQGIELAYQIASQN 333 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K+ +I TDG+ N S+ K+ ++ I + T+ Sbjct: 334 FKKEGIN----------------RVILATDGDFNVGMSSVDALKKLIANKRKTGIALTTL 377 Query: 368 SINASPNGQRLLKTCVS-SPEYHYNVVN--------ADSLIHVFQNISQLM 409 L++ + H + D L Q I++ + Sbjct: 378 GFGQGNYNDGLMEQLANIGNGQHAYIDTINEARKVLVDELSSTMQIIAKDV 428 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 60/180 (33%), Gaps = 21/180 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRAVVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPPEAHVPRVAVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 +A+ I+I + + + G L+ S P E+ + V + D + + + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRAMASPPLNEHVFLVESFDLIQEFGRQFQGRLCTL 218 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 363 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 419 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 420 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDNISV 469 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 470 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPDFSTMTHLLENL 522 Query: 406 SQLMV 410 + Sbjct: 523 KGSIC 527 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 60/180 (33%), Gaps = 21/180 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRSLDVGPNATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRAVVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPPEAHVPRVAVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 +A+ I+I + + + G L+ S P E+ + V + D + + + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRAMASPPLNEHVFLVESFDLIQEFGRQFQGRLCTL 218 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 363 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 419 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 420 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDNISV 469 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 470 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPDFSTMTHLLENL 522 Query: 406 SQLMV 410 + Sbjct: 523 KGSIC 527 >gi|149920662|ref|ZP_01909127.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] gi|149818449|gb|EDM77898.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] Length = 540 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 67/213 (31%), Gaps = 31/213 (14%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTK------MAALKNALLLFLDSIDLLSHVKEDV 252 + M+ ++ D + +K + ++ LL + ++ Sbjct: 140 VVVGMNTPIDPAELDRPPLNLTIAVDLSKSMEGEPIDRVRQGLLQMREQLEPED------ 193 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + L+G+ + +E + + V T + T+ ++ A+ Sbjct: 194 RVTLVGFGDEAQVIVE-NADKDSVE-LATAIAALVPWGSTNLYAGLRTAF---------- 241 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINA 371 Q +Q ++ ++DG N + I+ + + + T+ I Sbjct: 242 ------EQTDLYAQEGWQNRVLLVSDGVPTTGIVNSDKIEGLAEAWSGMGYGLTTVGIGN 295 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + Y V + D++I VF Sbjct: 296 DFDIELMRNLSELGSGSFYYVEDPDAVIEVFSE 328 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 48/150 (32%), Gaps = 22/150 (14%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + +I E + PT PA++ AY L + + R+ Sbjct: 469 MLGVLTFNDASNWDIPLGRVRESRPELHDAIGRIKASGPTAIFPALRNAYDALANVRVRA 528 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K +I L+DG+ + + K I + T+++ Sbjct: 529 ------------------KHVILLSDGQ----SDPEDFEGLVRKMSAAHITVSTVALGPD 566 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L Y V +A + +F Sbjct: 567 ADAALLRNLASWGGGRSYVVQDAQQIPEIF 596 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 62/170 (36%), Gaps = 21/170 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ + + + +GL+ +++ ++ + + KV Y + T + A+ Sbjct: 1051 VNQSNIGTDGIQIGLLQFSSIPQEEFRLNQYSSKVDIYSAIFDVQQMRDGTRTGKALNFT 1110 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 F K Q+++I +TDG + N I ++ Sbjct: 1111 L--------------PFFDSSKGGRPSVQQYLIVITDGVAQD-----NVIIPAKALRDKN 1151 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I I I + +LL+ + + Y+ VN ++L ++ + I + Sbjct: 1152 IIIFAIGVG-EAKKSQLLEI-TNDEDKVYHDVNFEALQNLEKEILSKVCD 1199 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 55/180 (30%), Gaps = 30/180 (16%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKP 291 + F+ I + V G+I Y+ +++ S +++ + Sbjct: 862 MKAFIKEVIKMFQIGPNRVQFGVIQYSDKIQSQFILSQYPSVAELKVAIDNIQQGGGG-- 919 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 N + + +++I +TDG KS+ Sbjct: 920 ----------------TATGEALNNMTQVFADTGRINVARYLIVITDG-----KSSDPVA 958 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + +EN + I I + N L + + + V D L + + + Q + Sbjct: 959 EAAEGLRENGVIIYAIGV-REANIDELKEIA---KDKIFFVYEFDLLKDIQKEVVQDICS 1014 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 23/204 (11%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYV 280 R + + + + LS + V GL+ Y+ K+ +++ Sbjct: 641 FSRVRQPNFQQVVNFLKTIVSSLSIHPDTVRFGLVFYSEEPRLEFSLDTFQNPAKILEHL 700 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T + A+ F ++ + Q+ + + +G Sbjct: 701 DKLTYRERRGRTKTGAALDF-----------LRNEVFIQEKGSRSNHGVQQIAVVIMEGF 749 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSL 398 + + S + I I + + + + L+ S P + Y+V + L Sbjct: 750 SQDSVSRP-----ASHLRRAGITIYAVG-TQNVSESKELEKIASYPHWKYSVPLESFLQL 803 Query: 399 IHVFQNISQLMVHRKY--SVILKG 420 V + + V +G Sbjct: 804 SVVRSKLINQLCSEMVDSKVSFRG 827 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 68/219 (31%), Gaps = 23/219 (10%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V +S + A +D + + + +KN L + L + V Sbjct: 219 PRVDHVSPACREAALADIVFLVDSSTSIGPQN--FQKVKNFLYSVV---LGLDISSDHVR 273 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL Y + + + K T++ A++ Sbjct: 274 VGLAQYNDNIYPAFQLNQHPLKSTVLEQIQNLPYRTGGTNTGSALEF-----------IR 322 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + + +I +TDGE+N+ ++ D+ KE+ + + + Sbjct: 323 TNYLTEESGSRAKDRVPQIVILVTDGESND-----EVQEVADRLKEDGVVVYVVG-VNVQ 376 Query: 374 NGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVH 411 + Q L K E +N N + L +I Q + Sbjct: 377 DVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCS 415 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 74/210 (35%), Gaps = 37/210 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 ++ + A FL + V+ ++G++ + + + R Sbjct: 296 SGSMSSSNRLNRMNQAAKYFL-----MQIVENGSWVGMVHFDSTASIRSDLIQITGSNER 350 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + T +++A++++ K+ S I+ LT Sbjct: 351 DKLLGSLPTTASGGTSICSGIRRAFEVV----------------RKLYSHTDGSEIVLLT 394 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N T C D+ K++ I I++ S + + +++ ++ H+ + Sbjct: 395 DGEDN-------TAGACVDEVKQSGAIIHFIALGPSAD-KAVIEMSTATGGVHFYATDEA 446 Query: 397 S---LIHVFQNISQ---LMVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 447 ENNGLIDAFGALASGNTDISQQSLQLESKG 476 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 27/198 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VEK + K + D Sbjct: 814 FQIIKNFVKTMADRVALDLAT---ARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ A + +K + +TDG+ + + Sbjct: 871 LGEGTYTATALQAANDMFED-----------------ARPGVKKVALVITDGQ-TDSRDK 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A + ++I I + + + + + PE+ Y + +L Sbjct: 913 EKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTL 972 Query: 403 QN-ISQLMVHRKYSVILK 419 + + Q + S +++ Sbjct: 973 KQKLFQKICEDFDSYLVQ 990 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI+++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIALSTVVNEAKLRLISGDSSS 208 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 27/198 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VEK + K + D Sbjct: 814 FQIIKNFVKTMADRVALDLAT---ARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ A + +K + +TDG+ + + Sbjct: 871 LGEGTYTATALQAANDMFED-----------------ARPGVKKVALVITDGQ-TDSRDK 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A + ++I I + + + + + PE+ Y + +L Sbjct: 913 EKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTL 972 Query: 403 QN-ISQLMVHRKYSVILK 419 + + Q + S +++ Sbjct: 973 KQKLFQKICEDFDSYLVQ 990 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 62/165 (37%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI ++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIGLSTVVNEAKLRLISGDSSS 208 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 71/231 (30%), Gaps = 49/231 (21%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + +V+DLSGSM ++ T+ A+ F Sbjct: 83 VSALKVTGRDLAIVLDLSGSMVRDDF----------DLDGRQVTRRDAVATVGADFARR- 131 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + L+ + + S+ E + + + + + T + + A Sbjct: 132 ------RGGDRVALVVFGSEAYFAAPFSFDVEAIARQIEGAQIGISGRATSISDGLGLAL 185 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + + +I L+DG NN N + + A + + Sbjct: 186 KRMENSEA------------------ASRVVILLSDGVNNAGA--TNPRGVAELAAQMGV 225 Query: 363 KIVTISINASP------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ TI++ + L S + V + L+ V Sbjct: 226 RVHTIALGPKDLSTADPGERGVVDAATLRAISEISGGESFRVRTTEDLVAV 276 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 27/198 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VEK + K + D Sbjct: 814 FQIIKNFVKTMADRVALDLAT---ARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ A + +K + +TDG+ + + Sbjct: 871 LGEGTYTATALQAANDMFED-----------------ARPGVKKVALVITDGQ-TDSRDK 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A + ++I I + + + + + PE+ Y + +L Sbjct: 913 EKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTL 972 Query: 403 QN-ISQLMVHRKYSVILK 419 + + Q + S +++ Sbjct: 973 KQKLFQKICEDFDSYLVQ 990 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 62/165 (37%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI ++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIGLSTVVNEAKLRLISGDSSS 208 >gi|327282764|ref|XP_003226112.1| PREDICTED: collagen alpha-1(XXI) chain-like [Anolis carolinensis] Length = 956 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 56/179 (31%), Gaps = 22/179 (12%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + + +G++ Y+ I + + + T + Sbjct: 59 WLVNITSNFNIGPKFIQVGVVQYSDYPVLEIPLGFHDSNENLVRGMEYIQYLGGNTQTGK 118 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ A L S K + LTDG++ + ++ + Sbjct: 119 AIQFALDHLF----------------AKSSRFLTKIAVVLTDGKSQDD-----VKEVAAE 157 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHRKY 414 A++N I + I + + L + + V + ++ + + I Q + Sbjct: 158 ARKNRITLFAIGVGSETEEDELRAIANKPSSTYVFYVEDYIAISRIREVIKQKLCEESV 216 >gi|291295700|ref|YP_003507098.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470659|gb|ADD28078.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 354 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 24/232 (10%), Positives = 66/232 (28%), Gaps = 47/232 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D ++ A K F++ +GL+ ++ + P+ Sbjct: 93 DTSRSMLATDLNPNRLEAAKATARKFIELAPPT------TQIGLVSFSDSASALVMPTTD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP---- 328 +K+ + + R + T A+ ++L G+ P Sbjct: 147 RQKLLEAIERLK---PAQNTSIENAIITGVRMLPGRNTLRPPAELQPPGLSQPDPLQGIP 203 Query: 329 --------------FQKFIIFLTDGENNNFKSNVNTIK-----ICDKAKENFIKIVTISI 369 ++ L+DG +N + + AK +++ T + Sbjct: 204 DLPLPQQAQPPANLPPGSLVILSDGASNVSSNPTLPTRTTLEVAARFAKNANVRLYTFPM 263 Query: 370 NASPNG---------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + L + ++ + ++L + + + Sbjct: 264 GQPGGAVTQIEGRHYYIPFEPRNLEQLAQATGGKNTYPPTEEALRAIVKELG 315 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 55/181 (30%), Gaps = 22/181 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + +D + +G++ Y+ V+ + K + Sbjct: 190 ADNFETVKDFVVSVVDGFEIGQSRTRIGVVQYSDEVQNEFNLTEYGNKADVQSAISNITY 249 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ + S G + P K I +TDGE + Sbjct: 250 LQGRTYTGAALRYMTDVSFS----------EEAGARPPYQAIPKVGIVVTDGEATD---- 295 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVN---ADSLIHVF 402 N A E + + I I R L+ + P+ + + V N D + Sbjct: 296 -NVQGPASSAHEAGVNVFAIGIG--GYDVRELRQIATDPDATHVFAVDNFAATDYIKDAL 352 Query: 403 Q 403 + Sbjct: 353 E 353 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 60/207 (28%), Gaps = 40/207 (19%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 C K+ + A+ F + L+ + + E+++ Sbjct: 103 DCSGSMGQKLDKSRQAVSQFFKLANPED------EFFLVQFNDSASLIQPFTRNLEEIQN 156 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ T A+ A + +K ++ ++D Sbjct: 157 HLAFT---QSKGRTALLDAVYLAL------------------HEMKKAKNPRKALLLISD 195 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG-----------QRLLKTCVSSPE 387 G +N+ + IK + KE ++I I I S G L + + Sbjct: 196 GGDNSSRYTEPEIK--NLVKEADVQIYAIGIYESAAGRGRTPEESSGPALLTEIAEQTGG 253 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKY 414 Y V N + L V I + ++ Sbjct: 254 RQYQVDNLNELPDVAAKIGVELRNQYI 280 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 27/198 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VEK + K + D Sbjct: 847 FQIIKNFVKTMADRVALDLAT---ARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY 903 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ A + +K + +TDG+ + + Sbjct: 904 LGEGTYTATALQAANDMFED-----------------ARPGVKKVALVITDGQ-TDSRDK 945 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A + ++I I + + + + + PE+ Y + +L Sbjct: 946 EKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTL 1005 Query: 403 QN-ISQLMVHRKYSVILK 419 + + Q + S +++ Sbjct: 1006 KQKLFQKICEDFDSYLIQ 1023 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 61/165 (36%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI ++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIGLSTVVNEAKLRLISGDSSS 208 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 58/168 (34%), Gaps = 21/168 (12%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDST 295 + + L +G+I Y+++V+ + + + T + Sbjct: 58 FLVGLVRSLDVGPNATRVGVIQYSSQVQSVFPL-GAFSRREDMERALRTLVPLAQGTMTG 116 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + S +G + P + + +TDG + ++ Sbjct: 117 LAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVAA 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 162 QARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 207 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 100/367 (27%), Gaps = 50/367 (13%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A LA + ++ I++ A + + + I+ + + +T Sbjct: 108 ASLAAKQRSAKHVINTHYVAAPIASDAWYGIKQPERNRFEKQIQNGIMVAGEIPISTFSI 167 Query: 120 NIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 ++ S + L M +YDY L + P Sbjct: 168 DVDTGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDYPLPN-------------KNAAP 214 Query: 179 ALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPICQDKKRTKMAALKNA 234 + E+ P + L + L G + K+ L+ A Sbjct: 215 FSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFLLDVSGSMASADKLPLLQTA 274 Query: 235 LLLFLDSIDLLSHVKEDVYMGL---IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 L + + V VY G + + ++ Sbjct: 275 LKMLTQQLSAQDKVSIVVYAGAAGVVLDGASGDDIQALTY----------ALEQLRAGGS 324 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNT 350 T+ + + QAYQ+ + +I TDG+ N + Sbjct: 325 TNGSQGILQAYQLAQKHFIQGGIN----------------RVILATDGDFNVGVTNFDLL 368 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I + +K K+ I + T+ +L++ HY ++ L + + + Sbjct: 369 ISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADKGNGHYAYIDT--LNEARKVLVDELS 426 Query: 411 HRKYSVI 417 ++ Sbjct: 427 STLLTIA 433 >gi|126310411|ref|XP_001373784.1| PREDICTED: similar to collagen type XII alpha 1 [Monodelphis domestica] Length = 3116 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + + +G++ Y++ + ++ Sbjct: 146 NNFKYILDFIAALVSAFDVGEGKTRVGVVQYSSDTRTEFNLNQYYQRKDLLAAIKKIPYK 205 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ + F K I +TDG++ + Sbjct: 206 GGNTMTGDAIDYLIKNTFTESAGARVG-------------FPKVAIIITDGKSQD----- 247 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + L+ S P + +NV N D+++ + I Sbjct: 248 EVEIPARELRNIGVEVFSLGIKAAD--AKELRQIASPPSLKHVFNVANFDAIVDIQNEII 305 Query: 407 QLMVH 411 + Sbjct: 306 SQVCS 310 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 73/237 (30%), Gaps = 31/237 (13%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ PI ++E + + C+ D + + +K + F Sbjct: 397 YSMKGLTSSEPISIMEKTQPMKVQVECSRGVDIKADIVFLVDGSYSIGIANFVK--VRAF 454 Query: 239 LDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTP 296 L+ + V + L+ Y+ T +V + T++ Sbjct: 455 LEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-RVEDIIEAINTFPYRGGSTNTGK 513 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 AM + + K S K +I +TDG KS+ K Sbjct: 514 AMTYVREKIFVASKGS-------------RSNVPKVMILITDG-----KSSDAFRDPAIK 555 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + + ++I + + + + L+ S P + + V + FQ IS + Sbjct: 556 LRNSDVEIFAVGVKDAVRSE--LEAIASPPPETHVFTVEDF----DAFQRISFELTQ 606 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 58/169 (34%), Gaps = 22/169 (13%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + V + L Y+ + + ++ + T + A+ Sbjct: 1221 FEIGPKRVQIALAQYSGDPRTEWQLNAHKDRRSLLDSVANLPYKGGNTLTGMAL------ 1274 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 +F + +K + +TDG++ + K K++ +++ Sbjct: 1275 -------NFIRQNNFRPQAGMRPRARKIGVLITDGKSQDD-----VEAPSKKLKDDGVEL 1322 Query: 365 VTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I I + + LK + P+ + YNV + DSL + +++ + + Sbjct: 1323 FAIGIKNADESE--LKMIATDPDDTHAYNVADFDSLSRIVDDLTVNLCN 1369 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + Y+ V+ + + +K + Sbjct: 2332 DNFNKVVKFIFNTVGAFDLISPAGIQVSFVQYSDEVKSEFKLNTYADKAQALGALQNIRY 2391 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2392 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKAVPKVLVVVTDGRSQD---- 2434 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2435 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2487 Query: 406 SQLMVH 411 ++ Sbjct: 2488 EDNLIT 2493 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 63/170 (37%), Gaps = 23/170 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDSLILKPTD 293 + + L+ +G+I Y+++V+ + E + + + + + T Sbjct: 57 QFLVGLLHGLNVGPNATRVGVIQYSSQVQSVFPLGAFSRPEDMERAIRALVPL--AQGTM 114 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A + S +G + P + + +TDG + ++ Sbjct: 115 TGLAIQYAMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEV 159 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 160 AAQARARGIEIYAVGVQRADVGS--LRAMASHPLDEHVFLVESFDLIQEF 207 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 61/189 (32%), Gaps = 25/189 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + +GL+ +++RV Sbjct: 363 FELVKRFVNQIVDFL---DVSPDGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 419 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 420 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 469 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +AKE I + + + + L+ S P + + D F ++ Sbjct: 470 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPAELHVSYSPD-----FSTMTH 517 Query: 408 LMVHRKYSV 416 L+ + + S+ Sbjct: 518 LLENLRGSI 526 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 62/169 (36%), Gaps = 23/169 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDSLILKPTDS 294 ++ I L +G+I Y+++V+ + E + + + + + T + Sbjct: 58 FLVNIIRGLDIGPNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIHAIVPL--AQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPSQARVPRVAVIVTDGRPQD-----RVTEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 161 AQARNRGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQQF 207 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 62/185 (33%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ Y++RV Sbjct: 404 FELVKRFVNQIVDFL---DVSPEGTRVGLVQYSSRVRTEFPLGRYGTADEVKQAVLAVEY 460 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A++ + S+ + G + + + + TDG + + S Sbjct: 461 MEKGTMTGLALRHLVEHSFSEAQ----------GARPRAQNVPRVGLVFTDGRSQDDISV 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P ++ + ++ H+ +N+ Sbjct: 511 -----WAARAKEEGIIMYAVGVG--KAVEEELREIASDPPEQHVSYSPDFSTMTHMLENL 563 Query: 406 SQLMV 410 + Sbjct: 564 KVNIC 568 >gi|260836190|ref|XP_002613089.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] gi|229298473|gb|EEN69098.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] Length = 267 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 31/178 (17%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ +G++ Y + + T + A+ Sbjct: 29 NTTSDFQIGTAHTQVGVVQYEDNPYDEFPLNQYATLDELLTAIRNITYRGGGTQTGKAID 88 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 ++ K +I +TDG++ + + +A Sbjct: 89 HVVDNSLTESHG-------------ARPGVPKVVIVVTDGQSWDS-----VVAPAQRANH 130 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE----------YHYNVVNADSLIHVFQNISQ 407 + I +V I + + + L++ S+ + V N L +FQ+I++ Sbjct: 131 SGIIMVAIGVGSGYDINELMEIASSNDTLGTIEYFLRCKYLKVNN---LTFLFQDINE 185 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 73/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++V I P + ++D+SGSM + Sbjct: 224 NEQHRLVRIGLKAKEIPTENLPASNLIFLIDVSGSM-------------------YGPER 264 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y+ + + + G++K ++ + Sbjct: 265 LDLVKSSLKLLVNNLRDKD------KVAIVVYSGAAGEKLASTPGSDK-QKIREAIDELE 317 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T +K AY+I + II TDG+ N S Sbjct: 318 AGGSTAGGEGIKLAYKIARKNFITGGNN----------------RIILCTDGDFNMGVSS 361 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADS 397 + K+ ++ +++ + + + ++ H + N Sbjct: 362 DQELKKLIEQKRKSGVFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQE 413 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 43/378 (11%), Positives = 99/378 (26%), Gaps = 40/378 (10%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 +A A A + L + R + F + + Sbjct: 100 AAQEQYSAAVARGESAGLVKATGRKTEQFQVSVNVAPAAKVTFELVYEELLKRQLGVYEL 159 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++Q + M S + + + L + N+ Sbjct: 160 LLKVQPQQLVKHLQMDIHIFEPQGIS-FLKTESTFMTSELANALTISQ---NKTKAHIQF 215 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI--------------- 219 +++ F++ V+ + S + V+ Sbjct: 216 KPQKTAGQLDKVLDGNFIVRYDVNRTLSGGSIQIENGYFVHYFAPEGLPTIPKNVIFVID 275 Query: 220 -CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ + AL+ LD + + G + + ++ ++ Sbjct: 276 KSGSMSGRKIQQTREALIKILDDLKPNDQFNLISFSGDVT-HWKPLLVPASPENVDQAKR 334 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 Y T+ AM A ++L S ++ ++ II LTD Sbjct: 335 YAANI---EAHGGTNINDAMLTAVRLLQSANQKELLSDGSVS-----------LIILLTD 380 Query: 339 GENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVV 393 G+ S K KA + + + + L K + + + + Sbjct: 381 GDPTVGETSPARIQKNVQKAIDGQYSLFCLGFGFDVSYVFLEKLALDNGGLARRIYEDSD 440 Query: 394 NADSLIHVFQNISQLMVH 411 +A L +Q ++ ++ Sbjct: 441 SALQLQDFYQEVANPLLT 458 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 27/198 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VEK + K + D Sbjct: 814 FQIIKNFVKTMADRVALDLAT---ARIGVINYSHKVEKVANLKQFSSKDDFKLAVDNMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ A + +K + +TDG+ + + Sbjct: 871 LGEGTYTATALQAANDMFED-----------------ARPGVKKVALVITDGQ-TDSRDK 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ A + ++I I + + + + + PE+ Y + +L Sbjct: 913 EKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTL 972 Query: 403 QN-ISQLMVHRKYSVILK 419 + + Q + S +++ Sbjct: 973 KQKLFQKICEDFDSYLVQ 990 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 69/196 (35%), Gaps = 29/196 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSL 398 + I + A+ + I +TI ++ N +L S + D + Sbjct: 166 PRNPD--VQSISEDARISGISFITIGLSTVVNEAKLRLISGDSSSEPTLLLSDPTLVDKI 223 Query: 399 IHVFQNISQLMVHRKY 414 N+ + RK Sbjct: 224 QDRLDNLFEKKCERKI 239 >gi|229550752|ref|ZP_04439477.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] gi|229315887|gb|EEN81860.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] Length = 898 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/404 (6%), Positives = 98/404 (24%), Gaps = 48/404 (11%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + ++ + A + ++ G + Sbjct: 14 LFAVLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D + + Y + + + + Sbjct: 70 QRGGDQVSGWDNNTTWDGDATNTTNSYLKFGDPNKPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + + + Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESNSSGTNRAGAVR------------------TGVKNF 171 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + N+ + + + + + + L S + + Sbjct: 172 LTSIQNAGLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKAGNASQQQAINGALSPRFQGG 231 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD-------KKRSFFT 315 I G+ +++ + +++ + T Sbjct: 232 TYTQIGLRQGS--------AMLNADASGNKKMMILLTDGVPTFSNEVINSEWINGTLYGT 283 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 NF + + + + G T+ KAK++ ++ + I + + Sbjct: 284 NFGSSRDEPGNTAQLGWPYIDSSGNRIYDTWPA-TLGEAKKAKDSGNEVHALGIQLADDR 342 Query: 376 QRL--------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + ++ +SP+ + + +AD++ N ++ ++ Sbjct: 343 KYMTKEKIRQNMQLITNSPDLYEDADSADAVEAYLNNQAKDIIK 386 >gi|220913381|ref|YP_002488690.1| hypothetical protein Achl_2636 [Arthrobacter chlorophenolicus A6] gi|219860259|gb|ACL40601.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 310 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/245 (9%), Positives = 70/245 (28%), Gaps = 5/245 (2%) Query: 30 VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRA 89 ++ + + + ++ +++ +A+ LAGA +N S+++ + Sbjct: 1 MVVLIGAGALAVDTGQIYAERAQLQNGADASALAGADLCSANGGCTQAAATSVADALANS 60 Query: 90 LIDDAKRFIKN-HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 +D+K +++ + N +++ + + M Sbjct: 61 NSNDSKSTVQSVDLSVPGQVTVTTSTKDGTTNAGFLTKMFASALNAPAATVGATATAKWM 120 Query: 149 DVMTSY-DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE--LVVDLS-GSMH 204 + L F + L +I + G+ Sbjct: 121 YPIKGITVLPLTFATCEFKDDGLPHKILIQGGALDCNGLNPSNQIIPGGFAWLMPDGNTG 180 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 C + ++ + M ++ L + + V + G+ G+ + Sbjct: 181 CKVTAEVGQWSPTSSGASVPTGCMDLFDPSINPSLANSTVALPVYKYTCKGMPGFGSCTG 240 Query: 265 KNIEP 269 N++ Sbjct: 241 SNVQY 245 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 62/186 (33%), Gaps = 26/186 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ +K L L LD ++ + LI + ++ ++ T+ +QY Sbjct: 177 SGSMNGQKIEMVKQTLALLLDFLNENDRYQ------LITFESQAQRLTPLKRVTDGNKQY 230 Query: 280 VTRDMDSLILKPTDSTPAM-KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + + A++ L K R+ T+ I L+D Sbjct: 231 FKQVIQQINSGGGTTIGTATEIAFKQLQERKYRNNVTS----------------IFLLSD 274 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G+ + ++N + E + T + Q + + C Y V + L Sbjct: 275 GQ--DGQANQRIQEQIKTVNEV-FTLHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVTLL 331 Query: 399 IHVFQN 404 F + Sbjct: 332 DEFFAD 337 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 61/184 (33%), Gaps = 27/184 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---MDSLILKPT 292 LD IDL+ E+V++ ++ + + +Q + T Sbjct: 89 KFVLDFIDLVPISSEEVHLSVVTFADSPQDVFTFKQPQATNKQLAKEAFKYLRYRRGGST 148 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + +A + LT + V K ++ +TDGE++ TI+ Sbjct: 149 ATDKGLIRARRYLT-------------RPVYGTRANVPKVLVLMTDGESDRHYD---TIQ 192 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-----YNVVNADSLIHVFQNISQ 407 D+A+ I + + + N C + + + N + L+ +I Sbjct: 193 AADQARAEGISVFVVGVGM-ANPVECRGVCGC--GRYGPCPQFIMSNWNELVQTVDSIMG 249 Query: 408 LMVH 411 + Sbjct: 250 EVCK 253 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 58/202 (28%), Gaps = 28/202 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 R K+ LK +L + + HV VY G G + Sbjct: 251 VSGSMNSRDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGASG--------VVLDGVKGNDT 302 Query: 278 QYVT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 Q + T+ ++QAY++ + +I Sbjct: 303 QAINQALNSLKAGGSTNGGAGIQQAYRLAQKHFIQGGVN----------------RVILA 346 Query: 337 TDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 TDG+ N ++ + + ++ I + T+ L++ HY ++ Sbjct: 347 TDGDFNVGTTDHQALMDLIAAKRDQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDT 406 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 L + + + ++ Sbjct: 407 --LNEARKVLVDELSSTLLTIA 426 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 61/211 (28%), Gaps = 61/211 (28%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++ A K L + S+ +G I ++ + + Sbjct: 87 DTSRSMLTEDVYPNRLEAGKRVLTNLIQSLK-------GDRVGFIPFSDSAYIQMPLTDD 139 Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ + T+ A++ A + + K Sbjct: 140 YNITQNYINAIDTTLISGGGTELYQALELAEKSFKEIGSEN------------------K 181 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I ++DG + + KS D KEN I + +I + Sbjct: 182 TVIVISDGGDFDKKSL-------DFVKENKIDVYSIGVGTKEGNVIPEYLNGVKRGFIKD 234 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVN 394 N L K + +Y V N Sbjct: 235 ESGSAVISKLNSDFLQKISNENNGKYYEVNN 265 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 51/185 (27%), Gaps = 24/185 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K+ K A F+ + + L+ Y T V E Q Sbjct: 55 SGSMRGQKLDDAKRAAREFITRVSEED------RVALVHYGTDVTVFPSTLATPETREQM 108 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +T T+ + ++ A Q L + II L+DG Sbjct: 109 LTFVNAIEDEGSTNISGGLEAAAQQLQKN----------------ADQFRVSRIILLSDG 152 Query: 340 ENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + + + + + + N L++ + +D L Sbjct: 153 QPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNEN-LMQGIADQGGGFSGFLRSDQL 211 Query: 399 IHVFQ 403 VF Sbjct: 212 AEVFT 216 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 17/233 (7%), Positives = 61/233 (26%), Gaps = 5/233 (2%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 L+ + V + + + Q+ + + + R+ E + L+ Sbjct: 31 LEVKAHAPQERRRVPVNVALVIDRSGSMRGQKLDDAKRAAREFITRVS-EEDRVALVHYG 89 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQ--DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 D++ + + + D+ T ++ A L V + + Sbjct: 90 TDVTVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEAAAQQLQKNADQFRVSRIILL 149 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK--PTDSTPAMKQAYQILTSDKKRS 312 T + + + + +R + + + + + + Sbjct: 150 SDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIADQGGGFSGFLRSD 209 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 F + ++ + + + + V + +A + + Sbjct: 210 QLAEVFTRELEQATSTVARAVEVRLSLPPSVLSVEVMGVNAVREAHAVRVPLY 262 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 54/188 (28%), Gaps = 25/188 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K+A L +D ++ + +I + R + IE Sbjct: 59 LETVKSAALGLIDRLEEDD------RLSVIAFDHRAKIVIENQQVRNGA-AIAKAIERLK 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K Q I LTDGEN + ++ Sbjct: 112 AEGGTAIDEGLKLGIQE-----------------AAKGKEDRVSHIFLLTDGENEHGDND 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + A + + + T+ N L S+ + N +H F+ + Q Sbjct: 155 RCLK-LGTVASDYKLTVHTLGFGDHWNQDVLEAIAASAQGSLSYIENPSEALHTFRQLFQ 213 Query: 408 LMVHRKYS 415 M + + Sbjct: 214 RMSNVGLT 221 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 77/256 (30%), Gaps = 53/256 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +QR +S + L + + L++D SGSM+ Sbjct: 20 SSQRQLAVSISAVGETLDRRV----PLNLCLILDHSGSMN-------------------- 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTR 282 + +K A+ L +D + + ++ + R + + +++Q + R Sbjct: 56 GRALETVKKAVSLLVDQLSSED------RLSIVVFDHRAKILVPNQIISDRNQIKQQINR 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T ++ + L KK + F LTDGEN Sbjct: 110 LT---ADGGTAIDEGLRLGIEELAKGKKDTISQAF-----------------LLTDGENE 149 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +N A + + T+ + N L K + ++ + + + F Sbjct: 150 HGDNNRCLK-FAQLAASYNLTLNTLGFGDNWNQDILEKIADAGLGNLSHIEHPNQAVDKF 208 Query: 403 QNISQLMVHRKYSVIL 418 + M + Sbjct: 209 SRLFSRMQTVGLTNAY 224 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 68/191 (35%), Gaps = 20/191 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ + +A+ LD+ + + ++++ +I + ++V + + ++ Q Sbjct: 28 SGSMSGEKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFGSQVALHQPLASASDIHWQ- 86 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D T A++ A ++ IPS ++ ++ ++DG Sbjct: 87 -----DLSAGGMTPLGTALQMAKAMIEDK-------------DVIPSRAYRPTVVLVSDG 128 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N+ I D + + ++I A + L K + + NA L Sbjct: 129 GPNDAWEKPLNAFISDG-RSAKCDRLAMAIGADADEAVLGKFIEGTSNRLFYAENAKQLR 187 Query: 400 HVFQNISQLMV 410 F+ ++ + Sbjct: 188 DFFKFVTMSVT 198 >gi|313225346|emb|CBY06820.1| unnamed protein product [Oikopleura dioica] Length = 369 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 70/235 (29%), Gaps = 36/235 (15%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-------- 231 ++G F +L GS S + N+ P+C+ K + Sbjct: 41 SDENKVGSNCTFTCPPDQNLIGSKWRECTSTGQWTNTLPVCRQCSHLKSDLVFIIDGSWS 100 Query: 232 ------KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + A ++ + + ++ Y+ +K D Sbjct: 101 VGNVNFRKAKDFMKSLVNPFEIGWDYTRVSVLQYSDDPRIEFYLKDYQDKTTLLNAIDAI 160 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A++ + S + S +K ++ LTDG++ + Sbjct: 161 TYKGGNTRTGEAIRYMMGQIFSVEAGS-------------RPYVKKHMVLLTDGQSQDDV 207 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 AK I+ I + + LK + P + Y+V + D + Sbjct: 208 G-----APARAAKNFNIRTFAIGVGD--AIEDELKLVATPPFSDTLYHVEDYDGI 255 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 76/252 (30%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A I + P + +VD+SGSM + K Sbjct: 128 NNDSLLLRVGVAGRAIATADLPPANLVFLVDVSGSM-------------------ESPDK 168 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R + Sbjct: 169 LPLLQSSLKLLVRQLRAKD------RITLVTYAGNTAVVLPPTPGDQQGR-IIEAIDTVQ 221 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 222 SGGSTAGASGIELAYKAAQQGYLRGGIN----------------RILLATDGDFNVGVTD 265 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 266 FDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKVLTHE 325 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 326 LGATLATIARDV 337 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 48/375 (12%), Positives = 102/375 (27%), Gaps = 64/375 (17%) Query: 60 AILAGASKMVSNLS------RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A LAG S D + + ++ + + I N S + Sbjct: 23 ASLAGCSSAGRTYEATKSAFDSNDAGKMVEYYSPNVGYNTEEYNIINESGFQSVSTSPLS 82 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD---------VMTSYDYRLQFIEHL 164 + + + I + I ++ I Sbjct: 83 TFAADVDTASYANIRRFITDGELPPADAVRIEEMLNYFYYDYPQPKDDEPFSVTTEISSC 142 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I + + ++D+S SM Sbjct: 143 PWNPDTKLMQ--IGLQAKNTDTTTKPSNLVFLIDVSASMDEPD----------------- 183 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +KNA LL D + + ++ Y +E + G++K + ++ Sbjct: 184 --KLPLVKNAFLLLCDELKEND------TISIVTYAGTDSVVLEGAKGSDK-KSIMSAIE 234 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D T + +K AY+I K ++ TDG+ N Sbjct: 235 DLTAGGSTAGSDGIKTAYKI----------------AEKYFKTEGNNRVVLATDGDLNVG 278 Query: 345 -KSNVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 S IK+ K KE+ I + + ++ + + + + Sbjct: 279 ITSEGELIKLIKKEKESNIFLSVLGFGTDNIKDNKMQSLADNGDGNYSYIDSRFE---AK 335 Query: 403 QNISQLMVHRKYSVI 417 + +S + ++V Sbjct: 336 KVLSDELGANFFTVA 350 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL-GAFARREDMEHAIRALVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPPEARVPRIAVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 207 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 25/189 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 363 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 419 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 420 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 469 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +AKE I + + + + L+ S P + + D F ++ Sbjct: 470 -----WAARAKEEGIVMYAVGVG--KAVEEELRQIASEPAELHVSYSPD-----FGTMTH 517 Query: 408 LMVHRKYSV 416 L+ + + S+ Sbjct: 518 LLDNLRGSI 526 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 33 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL-GAFARREDMEHAIRALVPLAQGTMT 91 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 92 GLAIQYAMNVAFS----------VAEGARPPEARVPRIAVIVTDGRPQD-----RVAEVA 136 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 137 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 183 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 25/189 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 380 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 436 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 437 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 486 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +AKE I + + + + L+ S P + + D F ++ Sbjct: 487 -----WAARAKEEGIVMYAVGVG--KAVEEELRQIASEPAELHVSYSPD-----FGTMTH 534 Query: 408 LMVHRKYSV 416 L+ + + S+ Sbjct: 535 LLDNLRGSI 543 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 17/171 (9%), Positives = 56/171 (32%), Gaps = 26/171 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ V + L+ Y+ + + T++ AM Sbjct: 435 VNSFDIGPNKVQISLVQYSRDPHTEFALNKFDDNAAMVKAVRTFPYRGGSTNTGKAMTYV 494 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + + + ++ +TDG++++ + +K ++ Sbjct: 495 REKIFVSGRGARDN-------------VPRVMVLITDGKSSDSFKDP-----ANKLRDTD 536 Query: 362 IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 ++I + + + + L+ + P + + V + F+ IS+ + Sbjct: 537 VEIFAVGVKDAVRSE--LEAIANVPADNHVFEVEDF----DAFERISKELT 581 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/309 (10%), Positives = 83/309 (26%), Gaps = 36/309 (11%) Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV-MTSYDYRLQFI--EHLLNQR 168 + S + + N + S + L Sbjct: 5 SGTVITSSVMEPPSDLKFKILNENTVQMSWRKPSSQIQGYRIQVTSDTDDSKELSLPASS 64 Query: 169 YNQKIVSFIP---ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + I P + I + I ++ ++ A+ V+ + + Sbjct: 65 TSTSITDLTPDVDYSVSINSYDGAEESIPILGQITKCSVSAVADLVFLVDGSWSVGRENF 124 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + + A+ ++ + ++ Y++ + + Sbjct: 125 RFIRSFIGAM------AGAFDIEEDKTRVAVVQYSSDTRTEFSLNTHFRRPDVLRAIKNL 178 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + AM + + F K + +TDG++ + Sbjct: 179 PYKGGNTMTGDAMDYLVKNTFTQAAG-------------ARKGFPKVAMIITDGKSQDP- 224 Query: 346 SNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + ++ + ++I + I A + LK S P ++ YNV N D + V Sbjct: 225 ----VEEYAERLRNIGVEIFVLGIKGADEDE---LKEIASRPHSKHVYNVPNFDMINQVQ 277 Query: 403 QNISQLMVH 411 + + + Sbjct: 278 KALITEVCS 286 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 26/263 (9%), Positives = 76/263 (28%), Gaps = 39/263 (14%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + E + + +S + + + + Sbjct: 1122 TVNVFGMFDGGESMPLAGEEKTTLSDAASE-----------NLPYTPSDAQC-----KTT 1165 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + + ++ K + + + V +GL Y+ + + Sbjct: 1166 AQADIVLLVDGSWSIGRLN-FKTIRNFIARMVGVFDIGPDRVQIGLAQYSGDPKTEWHLN 1224 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + T + A+ IL ++ K + + Sbjct: 1225 AHRTRTQLLDAVANLPYKGGNTLTGLALNY---ILQNNFKPNVGM----------RPNSR 1271 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--Y 388 K + +TDG++ + + ++ I++ I + + L+T S P+ + Sbjct: 1272 KIGVLVTDGKSQDDIVANS-----QNLRDQGIELYAIGV--KNADENELRTIASDPDDIH 1324 Query: 389 HYNVVNADSLIHVFQNISQLMVH 411 YNV + L+ + +++ + + Sbjct: 1325 MYNVADFSFLLDIVDDLTNNLCN 1347 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 51/185 (27%), Gaps = 24/185 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K+ K A F+ + + L+ Y T V E Q Sbjct: 105 SGSMRGQKLDDAKRAAREFITRVSEED------RVALVHYGTDVTVFPSTLATPETREQM 158 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +T T+ + ++ A Q L + II L+DG Sbjct: 159 LTFVNAIEDEGSTNISGGLEAAAQQLQKN----------------ADQFRVSRIILLSDG 202 Query: 340 ENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + + + + + + N L++ + +D L Sbjct: 203 QPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNEN-LMQGIADQGGGFSGFLRSDQL 261 Query: 399 IHVFQ 403 VF Sbjct: 262 AEVFT 266 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 17/233 (7%), Positives = 61/233 (26%), Gaps = 5/233 (2%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 L+ + V + + + Q+ + + + R+ E + L+ Sbjct: 81 LEVKAHAPQERRRVPVNVALVIDRSGSMRGQKLDDAKRAAREFITRVS-EEDRVALVHYG 139 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQ--DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 D++ + + + D+ T ++ A L V + + Sbjct: 140 TDVTVFPSTLATPETREQMLTFVNAIEDEGSTNISGGLEAAAQQLQKNADQFRVSRIILL 199 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK--PTDSTPAMKQAYQILTSDKKRS 312 T + + + + +R + + + + + + Sbjct: 200 SDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIADQGGGFSGFLRSD 259 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 F + ++ + + + + V + +A + + Sbjct: 260 QLAEVFTRELEQATSTVARAVEVRLSLPPSVLSVEVMGVNAVREAHAVRVPLY 312 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 63/186 (33%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 131 FEIVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVASMKY 187 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 188 MGKGSMTGLALKHMFERSFTQI----------EGARPLSARVPRVAIVFTDGRAQDDVS- 236 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + Sbjct: 237 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISDKL 290 Query: 406 SQLMVH 411 + + Sbjct: 291 QKGICE 296 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 63/186 (33%), Gaps = 22/186 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +DS+ + +GL+ Y+T+V Sbjct: 415 FEIVKQFVTGIIDSLT---ISPKAARVGLLQYSTQVRTEFTLRNFGSAKDMKKAVASMKY 471 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K + + A+K ++ + +G + S + I TDG + S Sbjct: 472 MGKGSMTGLALKHMFERSFTQI----------EGARPLSARVPRVAIVFTDGRAQDDVS- 520 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNI 405 + KAK N I + + + + + L+ S + ++ + + ++ + + Sbjct: 521 ----EWASKAKANGITMYAVGVGKAIEEE--LQEIASEPTDKHLFYAEDFSTMGEISDKL 574 Query: 406 SQLMVH 411 + + Sbjct: 575 QKGICE 580 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 78/256 (30%), Gaps = 51/256 (19%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 I+S + ++ R + LVVD+S SM ++ A Sbjct: 73 VIISLAGPISETKVA-RNRATVMLVVDVSLSMSATDV---------------APDRITAA 116 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A F++++ +D+ +GL+ ++ R + P+ + V + + Sbjct: 117 KEAGQEFVENL------PDDLNIGLVTFSGRARTAVSPTTNHDTVNRALQAAELDQATAT 170 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 D+ A A + + I+ L+DG+ + + Sbjct: 171 GDAIAAALDAINQFSDSVQGGGEGAP------------PATIVLLSDGKQTVPQELDDPR 218 Query: 352 KICDKAKEN---FIKIVTISIN--------------ASPNGQRLLKTCVSSPEYHYNVVN 394 A E + + TIS + L + + ++ + Sbjct: 219 GAYTAAAEAAKAGVPVNTISFGTAQGAITVQGELIPVPNDDDSLREIARRTKGEFFSAGS 278 Query: 395 ADSLIHVFQNISQLMV 410 + L + ++ + Sbjct: 279 LEQLRDAYGSLEDDIG 294 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 37/144 (25%), Gaps = 9/144 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++K ++ ++ ++ L + F I + K +++A + A LAGA Sbjct: 6 RRTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDV 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ LS T ++ + Sbjct: 66 DQAEAAVIAT---------LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPT 116 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRL 158 D +M + Sbjct: 117 GEYDIYVRVAVTDMGISQYLSAVF 140 >gi|290981305|ref|XP_002673371.1| predicted protein [Naegleria gruberi] gi|284086954|gb|EFC40627.1| predicted protein [Naegleria gruberi] Length = 353 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 70/205 (34%), Gaps = 29/205 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +K++TKM + +D++ +E +G++ + + E + + + ++ + Sbjct: 2 SNKEKTKMQVANQVICEIIDNL------REFERLGIVLFDDKAETFLPLTIVQDLDKKSL 55 Query: 281 TRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T+ M++ + ++ II+LTD Sbjct: 56 KETVLKIKEKGSTNFEAGMQRGIDLFSTLDSSDLSN--------------SNRIIYLTDA 101 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 N ++ I + A I + I I N + + +++V + + Sbjct: 102 CPNVGGTDTLDI-LTKDANSGPYNIFSTFIGIGLDFNSDIVEELTQVRGCNYFSVKSTED 160 Query: 398 ----LIHVFQNISQLMV-HRKYSVI 417 L F I + + K S+ Sbjct: 161 FTKILNQDFNYIVTPICYNVKLSLE 185 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/247 (15%), Positives = 74/247 (29%), Gaps = 48/247 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + + + E G + LV+D SGSM Sbjct: 17 NQNSSQRQLSLAISAQGEPGRTLPLNLCLVLDHSGSM--------------------GGR 56 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + +K A + + ++ V +I + R + + + G E + + + Sbjct: 57 PLETVKKAAIELVKQLNPEDRVS------VIAFDHRAKVIVP-NQGIEDLNTVIEQIKAL 109 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T +K + K+ I LTDGE N Sbjct: 110 RAAGGTAIDEGLKLGIKESALGKQERVSQ-----------------IFLLTDGE-NEHGD 151 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI- 405 N +K+ A + I + T+ N L K S+ + N + + F + Sbjct: 152 NERCLKLAQVASDYNITLNTLGFGNHWNQDVLEKIADSAGGSLSYIENPEKALEEFSRLF 211 Query: 406 --SQLMV 410 +Q + Sbjct: 212 SRAQSIG 218 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/349 (7%), Positives = 76/349 (21%), Gaps = 73/349 (20%) Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--ESLSGYSAVFYNTEIQNIVNSSR 126 L + + N + + T+ + Sbjct: 20 SETLGFATGGAQDVENFRTNLEAGYLPLPTDLTYEGLAKDYYFDTSSNATKPCTKLFCPL 79 Query: 127 ISMTHMANNRLDSSNNTIFYNMDV-------------MTSYDYRLQFIEHLLNQRYNQKI 173 S+ + L + ++ FY + L + + + Sbjct: 80 YSVGLSPDPLLGTPASSEFYMAVGLDSGMKAADFARKQLNLVVVLDVSGSMGSPFDSYYY 139 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + + + + K+ K Sbjct: 140 DQTVQPTAGV-----------------------------------PDEGETKKKIDVAKE 164 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLIL 289 L + + + ++ ++ +K+++ ++ D+ + Sbjct: 165 VLAGIVGLLRPDDS------LSVVLFSDAACVPKPLGPVRCADVDKLKEQISADVVEM-- 216 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-V 348 T+ + LT + ++FLTD + N + Sbjct: 217 GGTNFQAGIDAGGAQLTGCAACMEANASL----------VENRVVFLTDAQPNAGDDSEQ 266 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + I I + N Q + +++V Sbjct: 267 GLLARIKALSADGIYTTIIGVGLDFNTQLVESIGKVRGSNYFSVHTPGE 315 >gi|148689378|gb|EDL21325.1| procollagen, type VII, alpha 1 [Mus musculus] Length = 2944 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ + Sbjct: 70 SGAASAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSD 129 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + K K + Sbjct: 130 RVFLPRL---------------TRPGVPKVCILITDGKSQD-----LVDTAAQKLKGQGV 169 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + IS+ + Sbjct: 170 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 >gi|115647999|ref|NP_031764.2| collagen alpha-1(VII) chain precursor [Mus musculus] gi|143955303|sp|Q63870|CO7A1_MOUSE RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor Length = 2944 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ + Sbjct: 70 SGAASAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSD 129 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + K K + Sbjct: 130 RVFLPRL---------------TRPGVPKVCILITDGKSQD-----LVDTAAQKLKGQGV 169 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + IS+ + Sbjct: 170 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 >gi|2326168|gb|AAB66593.1| type VII collagen [Mus musculus] Length = 2944 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ + Sbjct: 70 SGAASAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSD 129 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + K K + Sbjct: 130 RVFLPRL---------------TRPGVPKVCILITDGKSQD-----LVDTAAQKLKGQGV 169 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + IS+ + Sbjct: 170 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 >gi|257062762|ref|YP_003142434.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] gi|256790415|gb|ACV21085.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] Length = 2281 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 57/225 (25%), Gaps = 28/225 (12%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNI 267 D +T+MAA +NA+ S+ + + V M L+G++T Sbjct: 81 DNSGSMDTRTGGYGSQTRMAAAQNAVNNLARSLYAYNTTEFPDLVQMALVGFSTTGSVVQ 140 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA----YQILTSDKKRSFFTNFFRQGVK 323 P+ V R T+ A++ A + Sbjct: 141 GPTNSYNTFSGAVNRLD---ADGGTNWEDALQDAAGINFNDDDPTYVIFVSDGNPTFRNT 197 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNT-------------IKICDKAKENFIKI-----V 365 + + T G N + D A+ + Sbjct: 198 RGNYNPMDNYYYNTWGVYGNGSDSQTVAGIAAATTIARCYEHAVDDAESLATSVGADHFY 257 Query: 366 TIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 TI + L + +++ N L + I + Sbjct: 258 TIGAYGNVDRMRSLTTDAGAPAGNYFSAANTTDLQNALAAILAQI 302 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 22/241 (9%), Positives = 72/241 (29%), Gaps = 33/241 (13%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +V+ + + VV+L+ + A+ + V + + K Sbjct: 15 GSTKGLVTITTPTYSKDAAALTADEVTAVVELNATSSTAVREGLDLVVVLDVSGSMRGEK 74 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++K A+ + + + + ++ +++ ++ T+ + + +D L Sbjct: 75 LQSMKRAMQFVIMKLTPVD------RLSVVSFSSSATRHCPLRSVTQAAQAELKGIVDGL 128 Query: 288 -ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + A ++ + + ++DG+ + + Sbjct: 129 VANGGTNIKAGLDTALAVIAGRA---------------TTKARTPNVFLMSDGQQTDGDA 173 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQN 404 + + T + LL S +V + ++ F Sbjct: 174 RQVDPG--------NVAVYTFGFGKDAD-HALLSDVAKKSPGGTFNSVPDGGNVSAPFSQ 224 Query: 405 I 405 + Sbjct: 225 L 225 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/291 (8%), Positives = 77/291 (26%), Gaps = 30/291 (10%) Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 S+ + + +L + S + + + L+ Sbjct: 48 SKSYIGNDIEVKLKLTPAGSINVSRSPVSVVLVIDSSGSMSASSKMTAAKNAAKNLIDSF 107 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 +VD ++ D ++ + +L + Sbjct: 108 KNSAKSGDKLGIVD--------FDTFVNDNSNFYVKG---------------FYLQNGSW 144 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +Y T ++ + T + + T+ A+ +A + Sbjct: 145 QKGNSSTIYGPYSLPNTCTSSLLDLT-NTSAINSAKNLIDNMNASGGTNMEAALNKAKNL 203 Query: 305 LTSDKKRSFFTNFF---RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L + + F + + + G N + T+ ++ Sbjct: 204 LNASPSGNDKYVIFITDGMPTFYLNGTHNGYPLVDGPGLQPNNTTKSETLSAVQSLSQSG 263 Query: 362 IKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K+ + + A + + ++ Y + + ++L + Q+I +++ Sbjct: 264 TKLFVVGVDTTGADVDKTFIELMASTANGKSYYISSTNALNSILQDIFKII 314 >gi|320010752|gb|ADW05602.1| Protein of unknown function DUF3520 [Streptomyces flavogriseus ATCC 33331] Length = 528 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/356 (10%), Positives = 97/356 (27%), Gaps = 47/356 (13%) Query: 51 NSMESANNAAILAG--ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + +SA + + +G A + +R + ++ A+ + + Sbjct: 30 TTADSAVDRSTGSGGPAGGRQPSGARSAAPQQEEDGASREEGDGPARESAAPDYLSTFAL 89 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + ++ ++ Y + + Sbjct: 90 DVDTAGYGYARRTLGDGQLPAADTVRPEEFVNSFRQGYERPKGNGFAVSVDGARSDAAGW 149 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 ++ A GERP + VVD+SGSM ++ Sbjct: 150 SLVRVGLATRAAS--NTGERPPAALTFVVDISGSMAE-------------------PGRL 188 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K +L + D + V L+ ++ E + + + + Sbjct: 189 DLAKTSLGILADELRDDDSVS------LVTFSEEAETRLPMTRLRGNRTKLRDAIEEMEP 242 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KS 346 T+ +++ Y + V+ ++ L+D N ++ Sbjct: 243 ADSTNVAAGVERGY----------------EEAVEGHRKGATNRVVLLSDALANTGETEA 286 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ D +E I + + + + + + + + + VF Sbjct: 287 DAILERVGDARQEYGITLFGVGVGSDYGDELMERLTNKGDGNTTYIADEAQARKVF 342 >gi|194220937|ref|XP_001501805.2| PREDICTED: matrilin 3 [Equus caballus] Length = 450 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + Sbjct: 54 SSRSVRPLEFTKVKTFVSQIIDTL---DIGPADTRVAVVNYASTVKIEFHLNTHSDKQSL 110 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G ++PS K I + Sbjct: 111 KQAVARITPLST--GTMSGLAIQTAMD----------EAFTVEAGARVPSYNIPKVAIIV 158 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 159 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKVIASEPLDEHVFYVET 211 Query: 395 ADSLIH 400 + Sbjct: 212 YGVIEK 217 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 68/225 (30%), Gaps = 21/225 (9%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + L + + + ++S+ + + LK Sbjct: 34 IVVKSGLFNKELWNENDSVLADKKNEVCNEGIVDIGFIMDSSGSLGKNYKNEKDLLKTLA 93 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 LF S G+I ++ E +I+ + +++ D L+ T Sbjct: 94 SLF-------SIKPNGSQAGVITFSFYTEHSIKLNQFSDQDSFNDAVDRIPLMGHTTRID 146 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ A + + + K ++ LTDG K ++ I D Sbjct: 147 KGLRLAQKEMFKVENG-------------GRPGVSKLLVLLTDGSQTQGKGVIDPAIIAD 193 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++ + I+ I I N L+K Y+ + + L Sbjct: 194 EIRKQGVPIIAIGIGKEINKNELIKI-GGGEANTYSADDFEKLKE 237 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 12/90 (13%), Positives = 26/90 (28%), Gaps = 3/90 (3%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I+ NF I+ AL ++ + G + + + +A A + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSG 64 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIK 103 + N DA+ + + Sbjct: 65 AVAAAMAMNGNGTISLGKTDARDIFMSQVS 94 >gi|149915102|ref|ZP_01903630.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] gi|149810823|gb|EDM70662.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] Length = 444 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 67/256 (26%), Gaps = 51/256 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 N I P+ E RP + LV+D S SM Sbjct: 19 NTLNALIRIVAPSAPVTETEPRPPLNLALVLDRSSSMR--------------------GQ 58 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDM 284 + K A + + + ++ + E P + R ++R Sbjct: 59 PLHEAKRAADQIVAGLRPSD------RLAIVAFDNATEVMFSGGPRGDGQAARAALSRI- 111 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T Q + + + L+DG N Sbjct: 112 --HARGMTALHDGWLLGV----------------EQSIAMREAGTPARVFLLSDGVANVG 153 Query: 345 KSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ + I C + E+ I T + N + + + Y A+ L F+ Sbjct: 154 LTDASAIAADCTRMAEHGITTSTCGLGMGFNEDLMAEMARAGRGNAYYGETAEDLQDPFE 213 Query: 404 N---ISQLMVHRKYSV 416 + + + R + Sbjct: 214 QEFDLLRNICARGLRL 229 >gi|116671476|ref|YP_832409.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] gi|116611585|gb|ABK04309.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] Length = 353 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 45/140 (32%), Gaps = 10/140 (7%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA--------ILAGASKMV 69 E+ S+I A+ ++ L + V + ++ + + +AA + S Sbjct: 21 ERGAVSVIVAIMLVVLLGFGAVAVDVGMLYAERTQLRNGADAAALAVAQKCAKSAPSSSD 80 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ S S++N + K I + ++ + + N V+ Sbjct: 81 ADCSNTSTLAASLANSNANDGRSNIKSIILDTTNRKVTVTAGAQEKGKSPNEVSL--FFA 138 Query: 130 THMANNRLDSSNNTIFYNMD 149 + N + + + Sbjct: 139 RVLGMNSAEVNAPSTVVWGS 158 >gi|119613999|gb|EAW93593.1| hCG19532, isoform CRA_a [Homo sapiens] Length = 302 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 24/173 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+I Y+ +VEK + K + D + + T + A++ A + Sbjct: 13 RIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFED----- 67 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +K + +TDG+ + + ++ A + ++I I + Sbjct: 68 ------------ARPGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKK 114 Query: 373 PNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 + + + + PE+ Y + +L + + Q + S +++ Sbjct: 115 NDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICEDFDSYLVQ 167 >gi|119614000|gb|EAW93594.1| hCG19532, isoform CRA_b [Homo sapiens] Length = 304 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 24/173 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+I Y+ +VEK + K + D + + T + A++ A + Sbjct: 13 RIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFED----- 67 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +K + +TDG+ + + ++ A + ++I I + Sbjct: 68 ------------ARPGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKK 114 Query: 373 PNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 + + + + PE+ Y + +L + + Q + S +++ Sbjct: 115 NDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICEDFDSYLVQ 167 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 58/208 (27%), Gaps = 30/208 (14%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE- 268 D + K L + L + V+ ++ + Sbjct: 349 RGTDWTLLLDISGSMQGKFQTLIEGVKKGLKRFNPQDRVR------VVLFNDYASNLTGG 402 Query: 269 -PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + + + L T ++ A L +D+ Sbjct: 403 FLPATQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLDADR------------------ 444 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 I +TDG N + K D K+ I++ T + N L +S Sbjct: 445 --TSAIWLVTDGVTNVGE--TKQRKFVDLLKQKDIRVFTFIMGNGANRPLLKAITKASNG 500 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYS 415 + NV N+D +I + + + H Sbjct: 501 FAINVSNSDDIIGQLEKAASKVTHEALR 528 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 30/329 (9%), Positives = 80/329 (24%), Gaps = 24/329 (7%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ ++ LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGY-SAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 I+ ++ + S + N +S S+ ++ + Sbjct: 61 ITAGHLNYA---LFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTG 117 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI---------VSFIPALLRIEMGERPIF 191 F M + + Sbjct: 118 IVNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPPVAGATYN 177 Query: 192 LIELVVDLSGSMHCAMNSD-PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + + + +GS + ++ AL + + Sbjct: 178 IGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAA 237 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +PS+GT T + P+ AY S ++ Sbjct: 238 TTNAYNTRFGIYANPYKDPSYGTPDFT--------GYAYDAT-TWPSQSNAYADFVSKRQ 288 Query: 311 R-SFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + G+ + D Sbjct: 289 TFTSYQGDLITGINTGGTYNPNYYAAGAD 317 >gi|326385750|ref|ZP_08207379.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] gi|326209729|gb|EGD60517.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] Length = 543 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 52/183 (28%), Gaps = 4/183 (2%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM--ESANNAA 60 L R R ++ AN ++I AL+++ L +GF I ++ + + A Sbjct: 11 LALRSRALLTDIAKADDANVTLITALAIIPALFGLGFTIDYARAEMLQSRINAVADAAAL 70 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + + I D + G + T + Sbjct: 71 AATDTTYISKTSTVAQAASTQIFTTQVTDYADFVYTPASDLTVTITDGGTLNLGRTALVK 130 Query: 121 IV-NSSRISMTHMANNRLDSSNNTIFYNM-DVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 +S+ + + L ++ Y ++ + +L + I + Sbjct: 131 WRGSSTNLFSGILGVATLPIGGSSTAYASFFPNINFYLVMDKSPSMLLPSTSSGISAIQK 190 Query: 179 ALL 181 Sbjct: 191 VTG 193 >gi|311070192|ref|YP_003975115.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] gi|310870709|gb|ADP34184.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] Length = 227 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 55/206 (26%), Gaps = 33/206 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--------EDVYMGLIGYTTRVE 264 D + + + +K K + F S+ S VK + G + + Sbjct: 42 DASGSMAKRIDGVSKFNLAKKEIFQFAKSLPTDSQVKMSVFGSEGNNKNSGKVQSCEAIR 101 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 E + + T A+ +A Sbjct: 102 NVYGFQGFDE--QSFRNSLNTIGPTGWTPIAKALNEAKSSFDQ----------------- 142 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCV 383 + + LTDGE N IK + ++ I + I + +L Sbjct: 143 LDKKGENVVYLLTDGEETCG---GNPIKTAKELHKHNITVNVIGFDFKEGYKGQLNAIAK 199 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLM 409 ++ + + +F+ ++ + Sbjct: 200 VGGGEYFPASSQSDIKQIFK--AESI 223 >gi|320106177|ref|YP_004181767.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924698|gb|ADV81773.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 370 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 76/255 (29%), Gaps = 60/255 (23%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + P ++ + I +V DLSGSM +K Sbjct: 124 NFYLYENNQPQTIKTFSTDDAPVSIGIVFDLSGSM---------------------MSKF 162 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + AL F+ + + ++G+ R ++ + + V R + Sbjct: 163 GRARKALSEFMRTSNPQD------EFFVVGFNDRPAVIVDYTSNVDDVDA---RMVMLRP 213 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A L +K ++ ++DG +N + Sbjct: 214 ERRTALIDAAYLGLNKLKDA------------------KYERKALLIISDGGDNRSRYVE 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQR---------LLKTCVSSPEYHYNVVNADSLI 399 + ++ +E+ +I +I I L+ C + + V +AD L Sbjct: 256 SELRRA--VRESDTQIYSIGI-FDVYAATPEEKSGPTLLMDICEMTGGRMFRVTDADELG 312 Query: 400 HVFQNISQLMVHRKY 414 + IS + + Sbjct: 313 DIAARISAELRNEYV 327 >gi|290999945|ref|XP_002682540.1| predicted protein [Naegleria gruberi] gi|284096167|gb|EFC49796.1| predicted protein [Naegleria gruberi] Length = 502 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 74/224 (33%), Gaps = 45/224 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + I LV+D+SGSM + + K +K+ A K+A+ + + Sbjct: 70 KTPVNICLVLDISGSMDEPLKNR------------SKGSKLTACKSAIRELVTNFLTYKD 117 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + LI Y+ + + + T+ A+ A +L + Sbjct: 118 T-----IHLITYSDSPKTVFTEKNKESVNLNDIDKIS---TEGSTNIASALHSAVDLLHN 169 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN------VNTIKICD--KAKE 359 + P K I F +DG+ N ++N K+ D + K+ Sbjct: 170 S-----------------NAPGTKLIAFFSDGQCNVGETNLNIFGSGLLKKLKDYSEGKD 212 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + I I + + + + L + +Y + + F+ Sbjct: 213 DQIHISSYGVGSDYDELWLQAIARTGKGEYYYLEDETYAKDAFE 256 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 104/379 (27%), Gaps = 46/379 (12%) Query: 46 WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 ++ M A + A+ M + + A I A+ + ++ Sbjct: 67 ASQRQAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNG 126 Query: 106 LSGYSAVFYNTEIQNIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + + +T ++ S + L M +YDY L Sbjct: 127 IMVAGEIPVSTFSIDVDTGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYDYPLPA---- 182 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPIC 220 + P + E+ P + L + L G + Sbjct: 183 ---------KNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSG 233 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ L+ AL L + + ++ Y + Q + Sbjct: 234 SMASVDKLPLLQTALKLLTAQLSAQD------KVSIVVYAGAAGVV--LDGASGNDTQTL 285 Query: 281 T-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ + QAYQ+ + +I TDG Sbjct: 286 NYALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGIN----------------RVILATDG 329 Query: 340 ENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + N ++ + I + +K K++ I + T+ +L++ +Y ++ L Sbjct: 330 DFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT--L 387 Query: 399 IHVFQNISQLMVHRKYSVI 417 + + + +++ Sbjct: 388 NEARKVLVDELSSTLFTIA 406 >gi|300933821|ref|ZP_07149077.1| hypothetical protein CresD4_07088 [Corynebacterium resistens DSM 45100] Length = 676 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 67/213 (31%), Gaps = 49/213 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 + T++ A K A +D++ ++ MG++ Y + Sbjct: 59 SDSMMTKDAGGGGTRLDAAKKAATGLIDALPDSAN------MGMVVYGQQESNAPNNRAA 112 Query: 273 TEKVRQYV------------TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 K + + R + T ++ +A + L Sbjct: 113 GCKDVETISPVGPINKGELKDRISNFKAKGYTPIGNSLLKAAEEL--------------- 157 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNG 375 ++ I+ ++DG + +C+ AK+ + I T+ +A Sbjct: 158 -----GKEGERSIVLVSDGHDTCAPPP-----VCEVAKKLAGEGYNLTIHTVGFHADRKA 207 Query: 376 QRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ L+ +S + + NA L + + ++ Sbjct: 208 RKELECIAKTSGGQYLSAENASELSNSMKFLAT 240 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 58/199 (29%), Gaps = 39/199 (19%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 Q +++ + A ++ + + +GL+ + + + Sbjct: 112 SQTDTQSRFEIARQAARNVVEQ-------RSNDRIGLVVFRGEAYTLSPLTRDHTVLSLL 164 Query: 280 VTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + +I T A+ A L + + + +I LTD Sbjct: 165 LDNLSSRIIQDDGTAIGSALLVALNRLQASESEL------------------QMVILLTD 206 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQR---------LLKTCVSSPE 387 GENN + + T A ++ ++ + + L + + Sbjct: 207 GENNAGEVSPLTAAA--LAARRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGG 264 Query: 388 YHYNVVNADSLIHVFQNIS 406 ++ V N L +I+ Sbjct: 265 SYFTVNNKTELETTIASIA 283 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GLI Y + + Sbjct: 97 DISNSMMAEDVTPSRLEKSKMLVSNIVDKMTD-------DKIGLIVYAGEAYTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ S+ + TD A+ A + TS + K Sbjct: 150 YVSAKIFLETINPSMITTQGTDIKQAIDLAMKSFTS------------------NQDVSK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 I +TDGE+N + +++ A E IK+ + + + Sbjct: 192 AIFVITDGEDNEGGA----VEMAKAAAEKGIKVYVLGVGSPQ 229 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 35/102 (34%), Gaps = 20/102 (19%) Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA------------ 371 + +I LTDG N + +K + A++ IKI TI + A Sbjct: 7 RERPAESRLLILLTDGANTAGEI--EPLKAAELAQQQQIKIHTIGVGANEMLVPGLFSSR 64 Query: 372 ------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L K + ++ + + L ++ + Q Sbjct: 65 RVNPSADLDEDTLKKIASQTGGQYFRAQDTEQLQQIYALLDQ 106 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 74/210 (35%), Gaps = 37/210 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 ++ + A FL + V+ ++G++ + + + R Sbjct: 260 SGSMSSSNRLNRMNQAAKYFL-----MQIVENGSWVGMVHFDSTASIRSDLIQITGSNER 314 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + T +++A++++ K+ S I+ LT Sbjct: 315 DKLLGSLPTTASGGTSICSGIRRAFEVV----------------RKLYSHTDGSEIVLLT 358 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N T C D+ K++ I I++ S + + +++ ++ H+ + Sbjct: 359 DGEDN-------TAGACVDEVKQSGAIIHFIALGPSAD-KAVIEMSTATGGVHFYATDEA 410 Query: 397 S---LIHVFQNISQ---LMVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 411 ENNGLIDAFGALASGNTDISQQSLQLESKG 440 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 57/162 (35%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GLI Y + + Sbjct: 97 DISNSMMAEDVTPSRLEKSKMLVSNIVDKMTD-------DKIGLIVYAGEAYTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ S+ + TD A+ A + T + K Sbjct: 150 YVSAKMFLETINPSMITTQGTDIKQAIDLAMKSFTP------------------NQDVSK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 I +TDGE+N +++ A E IK+ + + + Sbjct: 192 AIFVITDGEDNEG----GVVEMAKAAAEKGIKVYVLGVGSPQ 229 >gi|2654431|gb|AAC01506.1| type XII collagen [Homo sapiens] Length = 517 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 80/259 (30%), Gaps = 41/259 (15%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI----ELVVDLSGSMHCAMNSDPEDVNS 216 + + + + + + ++ PI ++ + V + S + +D V+ Sbjct: 90 TLSVRDLSADTEYQISV-SAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKADMFLVDG 148 Query: 217 APICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + K + FL+ + V + L+ Y+ T K Sbjct: 149 SYSIGIANFVK-------VRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFT-K 200 Query: 276 VRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V + T++ AM + + K K +I Sbjct: 201 VEDIIEAINTFPYRGGSTNTGKAMTYVREKIF-------------VPSKGSRSNVPKVMI 247 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNV 392 +TDG KS+ K + + ++I + + + + + L+ S P + + V Sbjct: 248 LITDG-----KSSDAFRDPAIKLRNSDVEIFAVGVKDAVDSE--LEAIASPPAETHVFTV 300 Query: 393 VNADSLIHVFQNISQLMVH 411 + FQ IS + Sbjct: 301 EDF----DAFQRISFELTQ 315 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 69/203 (33%), Gaps = 35/203 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++ + A FL L ++ ++G++ + + + K R + Sbjct: 319 FNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTAYIKSNLIQIISSKERNNLLES 373 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T +K A+Q++ + + I+ LTDGE+N Sbjct: 374 LPTTANGGTSICAGIKSAFQVIGEIYPQIDGSE----------------IVLLTDGEDNT 417 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 + K++ I I++ S + Q +++ + H+ + LI Sbjct: 418 ------AKNCIGEVKQSGAIIHLIALGPSAD-QAVIEMSTITGGNHFFASDEAQNNGLID 470 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 471 AFGALASGNTDLSQQPLQLESKG 493 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 28/234 (11%), Positives = 71/234 (30%), Gaps = 52/234 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ERP + ++D+SGSM+ K+ LK + L +D + Sbjct: 248 AERPASNLVFLLDVSGSMN-------------------NARKLPLLKQGMKLLVDQLGEN 288 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + ++ Y + + G +K + T+ ++ AYQ Sbjct: 289 D------KVAIVVYAGAAGMVLNSTNGDDK-STIMEALDRLQAGGSTNGGQGIELAYQAA 341 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKI 364 T + + +I TDG+ N S + + + ++ + + Sbjct: 342 TENFIKGGVN----------------RVILCTDGDFNVGVTSTSDLVTMAADKAKSGVFL 385 Query: 365 VTISINASPNGQRLLKTCVS-SPEYHYNVVN--------ADSLIHVFQNISQLM 409 + + +++ + + + + + I++ + Sbjct: 386 SVMGFGTGNHNDAMMEELSGKANGNYAFIDTITEAKKVLVEQMSGTLTTIAKDV 439 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/239 (12%), Positives = 71/239 (29%), Gaps = 46/239 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPI-FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + S P L E + LV+D SGSM ++ Sbjct: 385 AGGWRRSLLESLFPVSLDPPSREERFDLALTLVIDRSGSMSELVDG-------------- 430 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 RT++ + A + + +I + + + + + + Sbjct: 431 LRTQLDLAREAA------FQASLGLSRQDQLSIIAFDSVADVILPLQPLPD-LATIEDAL 483 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ M A + + + R + +I LTDG + Sbjct: 484 SRLSAGGGTNIRSGMALAAETIATADARI------------------RHVILLTDGVSET 525 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ + + + + T++I + + + L + +Y V A++L + Sbjct: 526 EYAD-----LVANLRAQGVTVSTVAIGLNTDPE-LERVAQIGGGKYYVVRQAEALPQIL 578 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 66/221 (29%), Gaps = 34/221 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + +V+D+SGSM +TK+ K A + L + Sbjct: 414 RKPAHVLMVIDVSGSM-------------GADVPGTGQTKLELAKQAAINALPQLGPNDQ 460 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V ++ + + + + R + T + AY+ + Sbjct: 461 VGLWMFSTNQDGGKDYRELVPMGRNNRDLLK--KRIQGLIPGGGTGLYDTTRAAYRTVLE 518 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKI 364 ++ LTDG+N + S + + E +++ Sbjct: 519 RHSNDVINA----------------VVVLTDGKNEDDNSISLEDLLAELRTETGQETVRV 562 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI+ + + L + + Y+ S+ VF + Sbjct: 563 FTIAYGNDADLEVLRQISQVTDAAAYDSREPGSIDQVFTAV 603 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 68/223 (30%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDK-------MENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESIDPSLISKQGTAIGAAINLASRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + ++ A E I++ + + Sbjct: 192 AVIVITDGENHEGDA----VEAAKDAAEKGIQVNVLGVGMPEGAPIPAEGTNDYRRDRDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N Q + + + V N + IS+ + Sbjct: 248 NVIVTRLNEQMCQEIAKAGNGIYVRVDNTNGAQKA---ISREI 287 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 37/280 (13%), Positives = 80/280 (28%), Gaps = 57/280 (20%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 +L R + + + + + I + + + Sbjct: 38 GDIELVKQLSPSLS---NRRRFIKFLLMQSSIALMVLIIARPQIGNRISSTKNGKGIETV 94 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 D++++ + QD +++ K + L S D + +GLI + + Sbjct: 95 IALDISNSMLAQDVIPSRLDKSKLLIEDLLRSFD-------NDKVGLIVFAGDAFVQLPI 147 Query: 270 SWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + ++ SLI + TD A+ A + + Sbjct: 148 TSDFISAKMFLNDINPSLIGTQGTDIGKAINLAMHSFSP------------------TSK 189 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN------------------ 370 K II +TDGE+N + + KA+E I + I Sbjct: 190 AGKAIIIITDGEDNEGGA----EAMAKKAQEAGFHIYILGIGSTSGAEIPIGNGEKLKDK 245 Query: 371 -----ASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 S + + K + + +V N + + Sbjct: 246 RGNIVVSHLNEDMCKGIADAGKGIYIHVDNNSDAQQILKK 285 >gi|224048537|ref|XP_002190467.1| PREDICTED: similar to collagen, type XII, alpha 1 [Taeniopygia guttata] Length = 3122 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 42/366 (11%), Positives = 96/366 (26%), Gaps = 54/366 (14%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 K ++ SA A L A+ + S + + + + + + + I Sbjct: 2 KTALCSAV--AALCAATLLSSIEAEVNPPSDLNFTIIDEHNVQMSWKRPPDAIVGYRITV 59 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + ++ S +I D S Q Sbjct: 60 VPTNDGPTKEFTLSPSTTQTVISDLIPDVEYIVSIASYDDREESLPVFGQLTIQTGGPGI 119 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ + + +VD S S+ Sbjct: 120 PEEKKV---EAQLQRCSISAMTDLVFLVDGSWSVGRNNFG-------------------- 156 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDSL 287 L + + +E +G++ Y++ + + + R Sbjct: 157 ---YILDFMVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLIDAIKRIP--Y 211 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ Q ++ F K I +TDG+ + Sbjct: 212 KGGNTMTGEAIDYLVQNTFTESAG-------------ARKGFPKVAIVITDGKAQDDVEI 258 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + + +++ ++ I A+ + LK S P + +NV N D ++ + I Sbjct: 259 P-----ARELRNIGVEVFSLGIKAAD--AKELKLIASQPSLKHVFNVANFDGIVDIQNEI 311 Query: 406 SQLMVH 411 + Sbjct: 312 ILQVCS 317 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 73/240 (30%), Gaps = 31/240 (12%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 ++ PI ++E + + C+ D + + +K + Sbjct: 401 INVYAMKGLTPSEPITIMEKTQQVKVQVECSRGVDVKADVVFLVDGSYSIGIANFVK--V 458 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTD 293 FL+ + V + L+ Y+ + +V + T+ Sbjct: 459 RAFLEVLVKSFEISPRKVQISLVQYSRDPHMEFSLNR-YNRVEDIIQAINTFPYRGGSTN 517 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + AM + + K S + +I +TDG KS+ + Sbjct: 518 TGKAMTYVREKVFVTSKGS-------------RPNVPRVMILITDG-----KSSDAFKEP 559 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 K ++ ++I + + + L+ S P + Y V + FQ IS + Sbjct: 560 AIKLRDADVEIFAVGVKD--AVRTELEAIASPPADTHVYTVEDF----DAFQRISFELTQ 613 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ + + + Y+ + + + +K + Sbjct: 2339 DNFNKVVKFVFSTVGAFDLINPAGIQVSFVQYSDEAKSEFKLNTFDDKAQALGALQNIQY 2398 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2399 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RRGVPKVLVVVTDGRSQD---- 2441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 K + + + + + A + L K S + + V + F+ I Sbjct: 2442 -EVRKAATVIQHSGFSVFVVGV-ADVDYNELAKIASKPSERHVFIVDDF----DAFEKIQ 2495 Query: 407 QLMVH 411 +V Sbjct: 2496 DNLVT 2500 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/394 (9%), Positives = 89/394 (22%), Gaps = 39/394 (9%) Query: 39 FLIYVLDWHYKKNSMESANNAAIL--AGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 +I + + + +AA L A +S + + + + A+ Sbjct: 41 GVIQYSNQPQSEILLNDHQDAASLQQAISSINYLQGGTNTGKALRYLANNAFSGKNGARA 100 Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + G S+ N + +D + Sbjct: 101 GVSKVAIVVTDGRSSDDVVRPALNAGKEGIVLYAVGIGGSVDY--QELRDIASSDDKVVN 158 Query: 157 RLQFIE--HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 F + N +Q S P + L + + Sbjct: 159 VTDFSGLQSVGNTLPDQVCQSLCPNSDHRSDLQCRAAQPPLSGIFNLQCSRWSYTSSNVS 218 Query: 215 NSAPICQDKKRTKMAALK-------------NALLLFLD-SIDLLSHVKEDVYMGLIGYT 260 A C T + A F ++ +++ + + Y Sbjct: 219 KHAHCCYAGCFTPLDIAILLDGSDGVSSDDFEAEKSFAKLFLNEFDIGQDNSRVTVFQYG 278 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 T + + + A++ S + Sbjct: 279 TEPRQEFALDTYETDQDVQDAIADTEYMGGDRNLGQAIRYMATYGFSGRNG--------- 329 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 I +T GE S KA+ + I + I + + L Sbjct: 330 ----ARRSIPSVAIIITGGE-----SLDEVASAASKARRSGIILYAIGVG-NATVPAELA 379 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 ++ Y + +L + ++ + ++ Sbjct: 380 AIATTANTSYAAASFAALKDLRGALADEICTSEF 413 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 75/243 (30%), Gaps = 46/243 (18%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L + + + E P I L++D SGSM + Sbjct: 379 LGGYKDTSLEKVLPVYMDKRGKNEVPAISINLIIDKSGSMSA---------------EGG 423 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVT 281 +K+ K A + L +++E + +I + ++ + E +++ ++ Sbjct: 424 GVSKLTLAKEAA------MKALENLREVDEISVIAFDDTYDEVVPLQKVGDKEAIKELIS 477 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T PA++Q Y + + K I LTDG++ Sbjct: 478 GIQI---RGGTSIYPALEQGYNMQMQSSAKI------------------KHTILLTDGQD 516 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N T + +N I + T+++ N L + Y + + Sbjct: 517 GYGLDNYAT--LLQNFIDNNITLSTVAVGEGANAGLLNQLASIGKGRSYYTDIYTDIPRI 574 Query: 402 FQN 404 F Sbjct: 575 FAK 577 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 50/333 (15%), Positives = 114/333 (34%), Gaps = 69/333 (20%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 +FI + + + + + + N S ++ + + + + + Y Sbjct: 221 QFIPDKKQTASASIMYMQSLSSAMQFCNKSNHNIKATNMQNKQCNFRSTWEVIMNSSDYA 280 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 N + +S + R+ + LV+D+SGSM Sbjct: 281 SSFPI-----NSPPSAPAISLLQTHDRV---------VCLVLDVSGSM------------ 314 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTE 274 ++A LK A LFL L ++ ++G++ + + + I ++ Sbjct: 315 -------TTYDRIARLKQAAELFL-----LQIIETGSWVGIVTFNSYATRQIGLRQITSD 362 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 VR+ + + + T ++Q +Q+ + K + I+ Sbjct: 363 SVRESLKNYLPTSAGGGTIICSGVRQGFQV---------------KQKKYQTSTKGCEIV 407 Query: 335 FLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 LTDGE+N+ S C + + + I TI++ + + + + ++ Sbjct: 408 LLTDGEDNSVSS-------CFAEVQSSGSIIHTIALGPNAAKELEM-LADMTGGLKFSAT 459 Query: 394 ---NADSLIHVFQNISQL---MVHRKYSVILKG 420 +++ LI F IS + + + KG Sbjct: 460 DSLDSNGLIDAFSRISSESGDISQQSIQLESKG 492 >gi|332358821|gb|EGJ36643.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1056] Length = 434 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 68/238 (28%), Gaps = 29/238 (12%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V D SGSM + + + RT+M LK L +D + + +V ++ Sbjct: 199 VTFVFDTSGSMGYGLWNQK-------LEPTDSRTRMNILKTKANLLVDDLKEIGNVSVNL 251 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK-- 310 + E +E T+ ++ + S I T+ ++ L + Sbjct: 252 VRFSGDASYIQEDFVELDKDTDTIKTKIKALPTSWI---TNPGDGLRYGLVSLQRNPAQL 308 Query: 311 -------RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENF 361 + I + ++ T + AK + Sbjct: 309 KYVVLLTDGIPNAYTASPDGIGKYDLTANFPTNGKQIKADQPVSLTTEYVGQVAKTFGSG 368 Query: 362 I-KIVTISINASPNGQRLLKTCVSS-------PEYHYNVVNADSLIHVFQNISQLMVH 411 + +I I + + + + N +L F +I + + Sbjct: 369 VKRISVIGFSGNVKEIKDGQKIADQIKTVGKVDSTFVIATNEAALEQTFADIKKQIQQ 426 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 61.4 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 82/232 (35%), Gaps = 38/232 (16%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I I V+D S S+ + + L+ + F + + Sbjct: 168 IADIMFVLDDSSSVDDTAFRSALNWIIQVVSYFSSYIDSGDLRVGVYGFSN-----DDHR 222 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSD 308 + +GL +T+ +++ + ++ T + A+K+ ++ ++ Sbjct: 223 SGIRIGLRKWTS------------ATLKKQIGELLNVKSTGAGTYISHAIKETVKVFEAN 270 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 ++ K II LTDG +++ D A++ I +V++ Sbjct: 271 GRKGI----------------SKEIILLTDGGASDWW---LLKGEADTARDKGIVLVSVG 311 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + S N +LL + + + ++L V N+ + K +V L+G Sbjct: 312 VGTSVNNDQLLAI-AGNKSRVFQATDYNTLDEVVNNVVSTIDAIKITVNLEG 362 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/283 (10%), Positives = 72/283 (25%), Gaps = 70/283 (24%) Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + N +++ ++ + + L ++ + ++ ++ ++ + E Sbjct: 153 FSYVIFKNDMENPFTHEKIEENMNVNVEIVLDASGSMVKKIGDKTMMEIAKESIKQVLSE 212 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P V + H N+ + S + Sbjct: 213 MPTN---AKVGVRVFGHKGDNTASKKDESC----------------GANELI-------- 245 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V T +++ + L Sbjct: 246 ----------------YPIEDL-----NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK- 283 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIV 365 + + +TDG N ++I + K I + Sbjct: 284 ---------------ALDGEKTLNILYIITDGIETCG---GNPVEIAKQLKGENTNIVLG 325 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 I N N RLLK ++ Y+ +V +AD L I++ Sbjct: 326 IIGFNVDANQNRLLKQIADAAGGYYSSVNDADKLTGELYRINE 368 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/224 (12%), Positives = 70/224 (31%), Gaps = 31/224 (13%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LV+D SGSM + + + ++ + L++ S Sbjct: 405 RKPANVLLVIDRSGSMQQTVPGTGKSKGDLAKEAAAEALAEFRGQDQVGLWVFSAARRQG 464 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILT 306 ++ ++ + TE R + R + + T AY+ +T Sbjct: 465 ERDW------------QEVVPLGRMTEAHRSLLRERLLGLTLSGGTGLYNTTAAAYEKMT 512 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--KENFIKI 364 ++ ++ +TDG+N I +E +++ Sbjct: 513 GSRRGDAINA----------------VVVMTDGKNERPGGLDLDGLIAKLGARREESVRV 556 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI + L + ++ Y+ + +++ +F + Sbjct: 557 FTIGYGEDADQNVLRRIAEAADGAAYDSSDPNTIGDIFTEVISN 600 >gi|149918184|ref|ZP_01906676.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] gi|149820944|gb|EDM80351.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] Length = 522 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 74/230 (32%), Gaps = 45/230 (19%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + ++D+SGSMH + K+ L ++L + ++ + Sbjct: 152 PPRNLVFLLDVSGSMH-------------------DQDKLPLLTDSLRVLVNQLGERD-- 190 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ ++ Y + P+ G+++ + + T+ +++AY + Sbjct: 191 ----HVAIVVYAGASGVVLPPTRGSDRG-TILAAISELRAGGSTNGGEGIQKAYALAREH 245 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 +I TDG+ N + + ++ +E+ + + + Sbjct: 246 F----------------DPQGINRVILATDGDFNVGTTTESGLENLIERERESGVFLTVL 289 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 G R ++ +Y DSL + + ++ Sbjct: 290 GFGRGNLGDRTMEMLADKGNGNYA--YIDSLAEARKVLGTEAGSTLVTIA 337 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 39/152 (25%), Gaps = 25/152 (16%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-------------------- 57 ++ +I+FA+ ++ L + + + K ++SA Sbjct: 13 QRGAVTIMFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGAGSLVNSSLSTYD 72 Query: 58 -NAA---ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 +AA L A + N ++ + + A Sbjct: 73 WDAAEQKGLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVP-VTGDVPAVRATV 131 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 T QN + N + + I Sbjct: 132 ALTSTQNNGPLKLFFAPFLGINESNIPASAIA 163 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGGYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 60/227 (26%), Gaps = 45/227 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 R + L D+SGSM T++ A K A F+ Sbjct: 78 ATVPQPRREGTVVLAFDVSGSMAATD---------------IAPTRLEAAKAAARGFVQR 122 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 V +G++ + +P+ V + R T ++ A Sbjct: 123 ------QPAAVRIGIVAFGATGLVTQQPTSDRASVVAAIDRLS---PQGGTALGGGLQTA 173 Query: 302 YQILTSDK---KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + S ++ LTDGEN +++ D A Sbjct: 174 LGAIVGKPVVVPGSDPGGGPEPSGPDLGYHGSAAVVLLTDGENTAQPD---PLQVADIAS 230 Query: 359 ENFIKIVTISINASP---------------NGQRLLKTCVSSPEYHY 390 +K+ I + + + L + + ++ Sbjct: 231 TAGVKVYPIGLGSPAGTVLQIDGFQIATRLDEPLLQQIADRTDGRYF 277 >gi|225387166|ref|ZP_03756930.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] gi|225046714|gb|EEG56960.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] Length = 556 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/377 (11%), Positives = 116/377 (30%), Gaps = 61/377 (16%) Query: 55 SANNAAILAGASKMVSNLSRLGDR---FESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 SA A + A A S + + + + +I + ++ + Sbjct: 33 SATEAEVKAEAGSYASETMAAQSQWDGAVMEAEGPPLSHNTEEYNYIAENAFLAV-ANAP 91 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD---------VMTSYDYRLQFIE 162 + + + + + + N + + I ++ +I Sbjct: 92 LSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYDYPEPTEDEPFSVTTYIG 151 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 N K++ + ++ + + ++D+SGSM Sbjct: 152 DCPWN-ENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSM------------------- 191 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + K+ +K A LL +++ + ++ Y + ++ G EK +T Sbjct: 192 ESADKLGLVKRAFLLLTENLRPED------TVSIVTYASSDTVVLDGVSGEEKA-AIMTA 244 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + TD + ++ AY++ ++ +I TDG+ N Sbjct: 245 IENLTAGGSTDGSKGIETAYRLAEEHFQKDGNN----------------RVILATDGDLN 288 Query: 343 NF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIH 400 S + ++ K KE+ + + + ++ + + V + L+ Sbjct: 289 LGLTSEGDLTRLIQKKKESGVFLSVMGFGTGNIKDNKMEALADNGNGQYAYVDS---LME 345 Query: 401 VFQNISQLMVHRKYSVI 417 + + + + ++V Sbjct: 346 AKRVLVEELGGTLFTVA 362 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 52/160 (32%), Gaps = 27/160 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D+ +D Y +G+ + V+ P+ V T+ Sbjct: 53 EFTVDAAKEFDDRTKDSYFSAVGFASGVKLIQAPTQSLSTFNTAVNTVSPL--NGGTNIF 110 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ YQ L + +I +TDG I C+ Sbjct: 111 RGLRGCYQQL------------------KTKPMTDRVLILVTDG------FGGQPINYCN 146 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 K I +VT+ I S N LK C +S E++ NV + Sbjct: 147 FIKSKGILLVTVGIGTSINQN-FLKNCATSEEFYINVKDT 185 >gi|327274818|ref|XP_003222173.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1097 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 66/198 (33%), Gaps = 32/198 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMD 285 +K + +D ++ + +G+I ++ +V+ E ++ + Sbjct: 730 FDRIKRFVKTVID---AVTVNQATARVGIINFSHKVDVVSTLQQYPNKESLKGAIDVMQY 786 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ A I K +K I +TDG+ + Sbjct: 787 L--GEGTYTATAISNATNIF-----------------KSARPGVRKVAIVITDGQADRRD 827 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSL 398 N + A + I+I I + + + + S P+ + Y + + +L Sbjct: 828 PNT-LEVVVKDAHASNIEIFVIGVVQKSDPNFDSFRKEMDLIASDPDIEHVYQIDDFITL 886 Query: 399 IHVFQNISQLMVHRKYSV 416 + + + + +SV Sbjct: 887 QALENKVFKQICENDFSV 904 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 66/183 (36%), Gaps = 22/183 (12%) Query: 228 MAALKNALLLFLDSI---DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K + L D + ++ D+ + ++ +++ V + + Sbjct: 65 FDKQKEFVTLLSDKLFLMKPTRVLRYDIKLAIMQFSSSVRIDYPFDEWRSLPDFKLRVKE 124 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + I T S A+ A Q+ ++ ++S K ++ + DG ++ Sbjct: 125 MTYIGHGTYSYYAISNATQLFKTEGRKSSV----------------KVVVLMADGIDHPK 168 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQ 403 + I + A+ I +TI ++ + +LL +SP + + + + L + Sbjct: 169 SPD--VQAISEAARTFGISFITIGLSNVADKVKLLSISGNSPGTPVFILNDPNLLDKIRN 226 Query: 404 NIS 406 ++ Sbjct: 227 QLA 229 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 80/234 (34%), Gaps = 51/234 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +P+ I LV+D SGSM+ + + A KN + Sbjct: 108 SAKPVKDIVLVIDNSGSMNETD---------------PNQDRYTAAKNLINRM------- 145 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 D + +I + + ++ + + ++D L TD + A++ Sbjct: 146 ---DRDNRVSVIMFDHATTLLQPFTRVNNQETKDEIIAEIDGLATTDGGTDISLALEDTM 202 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + +I L+DG + V K+ I Sbjct: 203 SHIQESRDAGRSA----------------MVIMLSDGFSETDHDRVLAEY-----KQQQI 241 Query: 363 KIVTISIN-ASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + TI ++ +P+G +LL+ + +Y+V +A+ L VFQ I + R Sbjct: 242 AVNTIGLSLVNPDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQKIYDDVGDRSL 295 >gi|326670350|ref|XP_001332841.4| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 609 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++ + + ++ + E++R+ + D T Sbjct: 57 NEIYNFVEHLA-QKFISPQLRMSFIVFSDQGKILMQLTEDREQIRKGLKELQDVRPGGDT 115 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +++A + + II LTDGE + + Sbjct: 116 FMHEGIQRASEQIY--------------YGNTEGYRTASVIIALTDGELHE-NHFYYAER 160 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 ++++ + + + + L S ++ + V + ++L V +I + Sbjct: 161 EANRSRSLGASVYCVGVKD--FNETQLAKIADSKDHVFPVNDGFEALQGVIDSILKK 215 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 63/189 (33%), Gaps = 25/189 (13%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WG 272 +S+ + + + LK+A+ F +I ++ + +I+ + Sbjct: 409 DSSGSLRKYYQNEKDFLKSAISAF-------GVSVNGTRAAVITFSYHAQLSIKLNKYSN 461 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ V + L+ T A++ A + + + K Sbjct: 462 LNSFKEAVDNIV--LMGSTTRIDKALRLAQKEVFELENG-------------ARPGVAKI 506 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + LTDG + + N + I ++ + + I+ I I + + L E Y Sbjct: 507 LFLLTDGSQTQERGSENPVAIANELRSAGVTIIVIGITNAVDVSELF-DIAGGEENAYFA 565 Query: 393 VNADSLIHV 401 + + L V Sbjct: 566 DSFEKLKDV 574 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 62/189 (32%), Gaps = 25/189 (13%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WG 272 +S+ + + + LK+ + F +I ++ + +I+ + Sbjct: 17 DSSGSLRKYYQNEKDFLKSVISAF-------GVSFNGTRAAVITFSYHAQLSIKLNKYSN 69 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ V + L+ T A++ A + + + K Sbjct: 70 LNSFKEAVDNIV--LMGSTTRIDKALRLAQKEVFELENG-------------ARPGVAKI 114 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + LTDG + + N + I ++ + + I+ I I + N L E Y Sbjct: 115 LFLLTDGSQTQERGSENPVAIANELRSAGVTIIVIGITNAVNVSE-LSDIAGGEENAYAT 173 Query: 393 VNADSLIHV 401 + + L V Sbjct: 174 ESFEKLKDV 182 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/229 (10%), Positives = 75/229 (32%), Gaps = 59/229 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GL+ + + + Sbjct: 97 DISNSMLAEDITPSRLDRSKMMVENLVDHFT-------NDKIGLLVFAGDAFVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ + + TD A+ A T ++ K Sbjct: 150 YVSAKMFLSSIDPSMMATQGTDIARAIDMATHSFTQEEG------------------IGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA----------------SPNG 375 II +TDGE++ + ++ + AK+ +++ + + + G Sbjct: 192 AIIVITDGEDHEGGA----LESAEAAKKAGMRVYVLGVGSTQGAPIPIPGTGDYMKDNTG 247 Query: 376 QRLL---------KTCVSSPEYHYNVVN----ADSLIHVFQNISQLMVH 411 ++ + + + +V N D L + +++ Sbjct: 248 NTVMSALNEDMCRQVAQAGGGAYIHVENNSAAQDQLDNELSKLAKKETT 296 >gi|156523281|ref|NP_001096038.1| anthrax toxin receptor 2b [Danio rerio] gi|151335854|gb|ABS00409.1| capillary morphogenesis protein 2B [Danio rerio] Length = 487 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 58/174 (33%), Gaps = 20/174 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F++ + V + + I +++ E + + + + + T Sbjct: 57 IYGFVEQLTNR-FVSPKMRVSFIVFSSSAEIILPLTGDRVDIDSGLQQLSKIRPAGDTYM 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K+A + +TS II LTDG N IK Sbjct: 116 HEGLKKAIEQMTSQGA-----------------RASSIIIALTDG-KLEVFMNELAIKEA 157 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 D A++ ++ + + + +L + + + V +L ++ +I Q Sbjct: 158 DLARQYGARVYCVGV-KDFDANQLTEIADNKDQVFPVVDGFQALKNIVNSILQK 210 >gi|157819015|ref|NP_001100328.1| procollagen, type VII, alpha 1 [Rattus norvegicus] gi|149018489|gb|EDL77130.1| procollagen, type VII, alpha 1 (predicted) [Rattus norvegicus] Length = 2588 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ + Sbjct: 70 SGAANAQGVRFATVQYSDDPQTEFGLDTLGSGGDTIRAIRELSYKGGNTRTGAALLHVSD 129 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + K K + Sbjct: 130 RVFLPHL---------------TRPGIPKVCILITDGKSQD-----LVDTAAQKLKRQGV 169 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + IS+ + Sbjct: 170 KLFAVGI-KNADPEELKRIASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 77/226 (34%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + ++S + +GL+ + + + Sbjct: 97 DISNSMLAQDVQPSRLDKSKLMIENLINSF-------INDKIGLVVFAGEAYVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ + + + TD A++ + T K Sbjct: 150 YVSAKMFLSDITPNLISAQGTDIARAIRVSLSSFTQ------------------QKGVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE+N + ++ +AKE + + + + +GQ Sbjct: 192 AIILITDGEDNEGGA----LEAVKEAKEKGVNVFILGVGDSKGAPIPLGNGEYLKDNHGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + ++ N L + ++ + + Sbjct: 248 TVMTALNENMCKEIAQAGSGTYIHIDNTS-LAQ--EQLNNELSKLQ 290 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 48/160 (30%), Gaps = 25/160 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M +K A +D + + +I + E + + E ++ + Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP-NQSVEDLQGVKNKINRLT 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T MK + K+ I LTDGE N N Sbjct: 112 ASGGTCIDEGMKLGIKEAALGKENRVSQ-----------------IFLLTDGE-NEHGDN 153 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 +K+ A E I + T+ ++ N L + S+ Sbjct: 154 ERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGG 193 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 48/160 (30%), Gaps = 25/160 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M +K A +D + + +I + E + + E ++ + Sbjct: 59 METVKEAANYLVDGLGPDD------RLSVITFNHHAEVILP-NQSVEDLQGVKNKINRLT 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T MK + K+ I LTDGE N N Sbjct: 112 ASGGTCIDEGMKLGIKEAALGKENRVSQ-----------------IFLLTDGE-NEHGDN 153 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 +K+ A E I + T+ ++ N L + S+ Sbjct: 154 ERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGG 193 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV V Sbjct: 23 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAVEVKQAVLAMEY 79 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 80 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 129 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 130 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 182 Query: 406 SQLMV 410 + Sbjct: 183 RGSIC 187 >gi|115376470|ref|ZP_01463705.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310819725|ref|YP_003952083.1| hypothetical protein STAUR_2452 [Stigmatella aurantiaca DW4/3-1] gi|115366538|gb|EAU65538.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309392797|gb|ADO70256.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 218 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 62/198 (31%), Gaps = 20/198 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ AL A+ +++ S ++ +V++ +I + + ++ + Sbjct: 23 VSGSMGVDGKIQALNLAVREMIEAFQDESDLRAEVHVSVITFGGQSRVHLPLGRARDAAW 82 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 D T A A ++ +PS ++ I+ ++ Sbjct: 83 T------DLGANGGTPMGAAFDLARTMVEDRN-------------AVPSRAYRPTIVLVS 123 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG+ + + + + + ++I A + L Y A Sbjct: 124 DGQPTD-EWKQPLESLLKNERGGKAFRMALAIGADADHAVLQAFLADPEARVYRADEARQ 182 Query: 398 LIHVFQNISQLMVHRKYS 415 + FQ ++ + R S Sbjct: 183 IRKFFQLVTMSVSARSRS 200 >gi|145482457|ref|XP_001427251.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394331|emb|CAK59853.1| unnamed protein product [Paramecium tetraurelia] Length = 568 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/211 (12%), Positives = 68/211 (32%), Gaps = 44/211 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + VVD+SGSM + K++ +K++L + Sbjct: 128 KANIDLVCVVDVSGSM--------------------EGEKISLVKDSLRYIQKILSPND- 166 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ + T N+ + + +Q + + I T+ +K +++ Sbjct: 167 -----RIALVTFGTYSGINLPWTINKPENKQKIKDAIIGMKIRDSTNIADGVKLGLRMIK 221 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIV 365 K+++ T + LTDG+++N ++ + ++ + I Sbjct: 222 ERKQKNPVT----------------CMFVLTDGQDDNKGADERCQQAINEYQIQDTFVIN 265 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + + + + + N Sbjct: 266 SFGYGQDHDAKVMNNISNLKGGTFTFIENIA 296 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 60/181 (33%), Gaps = 45/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D ++ K + ++++ Sbjct: 91 DIVFAIDVSKSM---------------LAEDIAPNRLEKSKQLVTQIINNLAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ D L + T A++ A D++ Sbjct: 129 DRVGIIAYAGKAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAINEAIELAKTYYDDDQQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + ++ ++DGE+++ + + I+I TI + Sbjct: 189 TN------------------RVLVIISDGEDHSEAAANVAEEA----SNEGIRIFTIGVG 226 Query: 371 A 371 Sbjct: 227 D 227 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 59/160 (36%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + ++ +GLI + + + Sbjct: 98 DISNSMLAEDVAPSRLEKSKLLVENLMNKFSE-------DKIGLIVFAGDAFVQLPITSD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A++ + T + K Sbjct: 151 YVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFTP------------------NSKVGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 II +TDGE+N + ++ +A+ I++ + + + Sbjct: 193 AIILITDGEDNEGGA----EEMAKQAQSKGIRVFILGVGS 228 >gi|73980586|ref|XP_540098.2| PREDICTED: similar to matrilin 3 precursor [Canis familiaris] Length = 481 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + Sbjct: 89 SSRSVRPLEFTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKIEFHLQTYSDKQSL 145 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + P+ K I + Sbjct: 146 KQAVARITPLST--GTMSGLAIQTAMD----------EAFTEEAGARGPTSNIPKVAIIV 193 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 194 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKIIASEPLDEHVFYVET 246 Query: 395 ADSLIH 400 + Sbjct: 247 YGVIEK 252 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 58/177 (32%), Gaps = 25/177 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+AALK+A+ ++++ + ++ + V+ + ++ T+K + + Sbjct: 58 KLAALKDAVKRVIETLTPQDI------VAIVLFDDTVQTLVPATFATDKA-TLIAQVDAI 110 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + M L K ++ LTDG+ Sbjct: 111 EEAGGTAMSGGMAAGIVEL----------------RKNHDPGRVGAMLLLTDGQ-TWGDE 153 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + + + ++I + + A N L ++ + + + FQ Sbjct: 154 DRC-RALAQELARDGVRITALGLGAEWNEALLDDIAEATGGISDYIADPAQITTFFQ 209 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 74/213 (34%), Gaps = 29/213 (13%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIG 258 S S+ ++ I + K + F+ + L + +GLI Sbjct: 18 SQSLLENTCNNKRLDLVFIIDSSRSVRPYDFEK--VKEFILTILQFLDVSPDATRVGLIQ 75 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y + V+ +Q + R + ++ T + A++ A I S Sbjct: 76 YGSTVKHEFSLKTFR--RKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFS--------- 124 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +G + + + I+ +TDG + +I KA+ + I I I + + Sbjct: 125 -ESEGARPLNQNVPRIIMIVTDGRPQDPVG-----EIAAKARNSGILIFAIGVGR-VDMN 177 Query: 377 RLLKTCVSSP--EYHYNVVNADSLIH---VFQN 404 LK+ S P E+ + V N + VFQ Sbjct: 178 T-LKSIGSEPHEEHIFLVANFSQIETLTSVFQT 209 >gi|298705150|emb|CBJ28593.1| glycin-rich protein [Ectocarpus siliculosus] Length = 1238 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 50/178 (28%), Gaps = 28/178 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 K + S + +++ + + E +V D+ Sbjct: 98 WELSKEFAKNTVASFAQQNLFTNGGSASFAQFSSDASEGGTF-YSLEDFNAFVDA--DAK 154 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD + + ++L++ F+I TDG + Sbjct: 155 YSGGTDIIDGIAKGRELLSAS------------------PTTTSFMIVTTDGVAPD---- 192 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D A+ + + + + P+ + LL ++V N + L +I Sbjct: 193 --PQDEADAARAEGTILYAVGVGSGPSQENLLAI-GGDEANVFDVDNFEELDLALDDI 247 >gi|148697327|gb|EDL29274.1| procollagen, type XIV, alpha 1, isoform CRA_c [Mus musculus] Length = 1802 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 1052 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHIS 1110 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 1111 YKGGNTKTGKAIKHVRDTLFTSDSG-------------TRRGIPKVIVVITDGRSQDDVN 1157 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 KI + + + I I + + + + S + + V + F+ I Sbjct: 1158 -----KISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIE 1208 Query: 407 QLMVH 411 ++ Sbjct: 1209 DELIT 1213 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 59/194 (30%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 171 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 229 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 230 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 276 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E+ +++ I + + + L+ S P+ + YNV D + Sbjct: 277 SQDDIIPPS-----RNLRESGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 329 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 330 HTVVESLTRTVCSR 343 >gi|226423922|ref|NP_851794.3| collagen alpha-1(XIV) chain [Mus musculus] gi|148697325|gb|EDL29272.1| procollagen, type XIV, alpha 1, isoform CRA_a [Mus musculus] gi|219519338|gb|AAI45245.1| Collagen, type XIV, alpha 1 [Mus musculus] gi|223459902|gb|AAI38346.1| Collagen, type XIV, alpha 1 [Mus musculus] Length = 1794 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 1044 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHIS 1102 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 1103 YKGGNTKTGKAIKHVRDTLFTSDSG-------------TRRGIPKVIVVITDGRSQDDVN 1149 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 KI + + + I I + + + + S + + V + F+ I Sbjct: 1150 -----KISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIE 1200 Query: 407 QLMVH 411 ++ Sbjct: 1201 DELIT 1205 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 59/194 (30%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E+ +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLRESGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|148697326|gb|EDL29273.1| procollagen, type XIV, alpha 1, isoform CRA_b [Mus musculus] Length = 1093 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 395 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHIS 453 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 454 YKGGNTKTGKAIKHVRDTLFTSDSG-------------TRRGIPKVIVVITDGRSQDDVN 500 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 KI + + + I I + + + + S + + V + F+ I Sbjct: 501 -----KISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIE 551 Query: 407 QLMVH 411 ++ Sbjct: 552 DELIT 556 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 59/194 (30%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E+ +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLRESGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|146345398|sp|Q80X19|COEA1_MOUSE RecName: Full=Collagen alpha-1(XIV) chain; Flags: Precursor Length = 1797 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 1047 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHIS 1105 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 1106 YKGGNTKTGKAIKHVRDTLFTSDSG-------------TRRGIPKVIVVITDGRSQDDVN 1152 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 KI + + + I I + + + + S + + V + F+ I Sbjct: 1153 -----KISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIE 1203 Query: 407 QLMVH 411 ++ Sbjct: 1204 DELIT 1208 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 59/194 (30%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E+ +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLRESGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|30420885|gb|AAO64442.1| collagen type XIV [Mus musculus] Length = 1797 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 52/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 1047 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHIS 1105 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 1106 YKGGNTKTGKAIKHVRDTLFTSDSG-------------TRRGIPKVIVVITDGRSQDDVN 1152 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 KI + + + I I + + + + S + + V + F+ I Sbjct: 1153 -----KISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIE 1203 Query: 407 QLMVH 411 ++ Sbjct: 1204 DELIT 1208 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 59/194 (30%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E+ +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLRESGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 16/131 (12%), Positives = 31/131 (23%), Gaps = 9/131 (6%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAK 87 + ++ L + F I + K +++A + A LAGA I+ Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIAT--- 57 Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 LS T ++ + D + Sbjct: 58 ------LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTD 111 Query: 148 MDVMTSYDYRL 158 M + Sbjct: 112 MGISQYLSAVF 122 >gi|108760371|ref|YP_628476.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464251|gb|ABF89436.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 422 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 48/186 (25%), Gaps = 25/186 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+A + A + + + I Y T V E + Sbjct: 56 SGSMNGQKLADARRAATELVQRLKPED------RLAFIDYGTDVRVQPSRRMTEEAREEL 109 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +T T+ + A+ A L I L+DG Sbjct: 110 LTLISGLQDDGSTNISGALDAAANALRPHM----------------REYRVSRAILLSDG 153 Query: 340 ENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADS 397 + + + + + + I + + + + L++ + ++ Sbjct: 154 QPTTGIVSEPGLLDQVRQLRRDGITVSALGVGRD-YQETLMRGMAEQGGGFSGFIDDSAR 212 Query: 398 LIHVFQ 403 L VF Sbjct: 213 LAEVFS 218 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRGLDVGPNATRVGVIQYSSQVQSVFPL-GAFSRREDMERAIRALVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S +G + P + + +TDG + ++ Sbjct: 116 GLAIQYAMNVAFS----------VAEGARPPEARVPRIAVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + + + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFNLIQEF 207 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 25/189 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 404 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 460 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 461 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +AKE I + + + + L+ S P + + D F ++ Sbjct: 511 -----WARRAKEEGIVMYAVGVG--KAVEEELRQIASEPAELHVSYSPD-----FGTMTH 558 Query: 408 LMVHRKYSV 416 L+ + + S+ Sbjct: 559 LLENLRGSI 567 >gi|227533248|ref|ZP_03963297.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227189098|gb|EEI69165.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 596 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 31/418 (7%), Positives = 95/418 (22%), Gaps = 53/418 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + ++ + A + ++ G + Sbjct: 11 LFAVLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 66 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 67 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 126 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE----MGERPIFLIELVVD 198 + + ++ + + + V Sbjct: 127 GNTQQNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVRTGVKNFLTSIQNAGLGDYVNVGL 186 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S + + I L S + Sbjct: 187 IGFSSPGYIGGGNKTTGPGYIHVG----------------LGKAGNTSQQQAINSALSPT 230 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I G+ ++ + + K S N Sbjct: 231 FNGGTYTQIGLRQGS--------AMLNEDTSGNKKMMILLTDGVPTFSDKVKNSVVENGT 282 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSN------VNTIKICDKAKENFIKIVTISIN-- 370 G + + + + T+ KAK++ + + I Sbjct: 283 LYGTNFGTTRDEPGYTAELKNSYIDSSGDRIYDTWPATLGEAKKAKDSGNVVHALGIQLA 342 Query: 371 -----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + ++ ++ +SP+ + + +AD++ N ++ +V ++I Sbjct: 343 GDDGYWSEGYMSDEDVRQNMELIPTSPDLYEDADSADAVEAYLNNQAKDIVKNFNTII 400 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 46/134 (34%), Gaps = 4/134 (2%) Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D ++ + T + QA L + F + + I+ L++G + Sbjct: 140 DSINVAVGMTADRNRLDQAISGLRAWGATRLNDAVFASAGALAGAEGRGAIVLLSEGPDA 199 Query: 343 NFKSNVNT----IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + E+ + + + + L + ++ ++ V +A +L Sbjct: 200 DPSGAPLSVVDSEAALAAVVESGAPVYAVGLGPGADAALLRRLAEATGGAYFPVADAAAL 259 Query: 399 IHVFQNISQLMVHR 412 F +I+ + H+ Sbjct: 260 PATFSDIATRLRHQ 273 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 62/187 (33%), Gaps = 34/187 (18%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE---KVRQYVTRDMDSLILKPTDS 294 F + D++ + D+ + ++ + + + + K T Sbjct: 136 FKAAADMVQRMDSDMNIAVVTFHDQTNVLQPLTELSSQSVKDEVVKKLLQFPRTDGGTRI 195 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ L + ++ ++DG S+++ Sbjct: 196 DLALQAGLDQLQ------------------ANQMANSTVVLMSDG-----YSDLDVPAAL 232 Query: 355 DKAKENFIKIVTISIN-ASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNI------S 406 K+N + + T+ ++ +G LL K + ++NV +AD + +F I Sbjct: 233 APYKQNQVIVHTVGMSQIDADGTALLQKIAAETGGSYFNVEHADQMTGIFGQIYDMSRTD 292 Query: 407 QLMVHRK 413 + +V + Sbjct: 293 RNIVSER 299 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 48/381 (12%), Positives = 103/381 (27%), Gaps = 50/381 (13%) Query: 46 WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 ++ M A + A+ M + + A I A+ + ++ Sbjct: 82 ASQRQAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNG 141 Query: 106 LSGYSAVFYNTEIQNIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + + +T ++ S + RL M +YDY L Sbjct: 142 IMVAGEIPVSTFSIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPLPA---- 197 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPIC 220 + P + E+ P + L + L G + Sbjct: 198 ---------KNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLLKSQLGASNLVFLLDVSG 248 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL---IGYTTRVEKNIEPSWGTEKVR 277 K+ L+ AL L + V VY G + ++ Sbjct: 249 SMASTDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAGVVLDGASGNDTQTLTY------ 302 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T+ + QAYQ+ + +I T Sbjct: 303 ----ALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGIN----------------RVILAT 342 Query: 338 DGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ N ++ + + +K K + I + T+ +L++ +Y ++ Sbjct: 343 DGDFNVGVTDFDDLTALIEKEKAHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT- 401 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L + + + +++ Sbjct: 402 -LNEARKVLVDELSSTLFTIA 421 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 100/367 (27%), Gaps = 50/367 (13%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A LA + ++ I++ A + + + I+ + + +T Sbjct: 108 ASLAAKQRSAKHVINTHYVAAPIASDAWYGIKQPERNRFEKQIQNGIMVAGEIPISTFSI 167 Query: 120 NIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 ++ S + L M +YDY L + P Sbjct: 168 DVDTGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDYPLPN-------------KNAAP 214 Query: 179 ALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPICQDKKRTKMAALKNA 234 + E+ P + L + L G + K+ L+ A Sbjct: 215 FSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFLLDVSGSMASADKLPLLQTA 274 Query: 235 LLLFLDSIDLLSHVKEDVYMGL---IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 L + + V VY G + + ++ Sbjct: 275 LKMLTQQLSAQDKVSIVVYAGAAGVVLDGASGDDIQALTY----------ALEQLRAGGS 324 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNT 350 T+ + + QAYQ+ + +I TDG+ N + Sbjct: 325 TNGSQGILQAYQLAQKHFIQGGIN----------------RVILATDGDFNVGVTNFDQL 368 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I + +K K+ I + T+ +L++ HY ++ L + + + Sbjct: 369 ISLIEKEKQRGIGLTTLGFGMGNYNDQLMEQLADKGNGHYAYIDT--LNEARKVLVDELS 426 Query: 411 HRKYSVI 417 ++ Sbjct: 427 STLLTIA 433 >gi|316983245|pdb|3N2N|F Chain F, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983246|pdb|3N2N|A Chain A, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983247|pdb|3N2N|B Chain B, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983248|pdb|3N2N|C Chain C, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983249|pdb|3N2N|D Chain D, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983250|pdb|3N2N|E Chain E, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain Length = 185 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 25 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 83 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 84 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 128 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 129 EANRSRDLGAIVYAVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 183 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 57/165 (34%), Gaps = 24/165 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDST 295 +D ++ L +G++ Y+++V + V + + + T + Sbjct: 29 FMIDILNTLDIGLNSTRVGVVQYSSQVRSEFSLRS-HASLDSMVKAIQEMVPLAQGTMTG 87 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + + + ++ +TDG + ++ Sbjct: 88 LAIRYTMNVAFTAAEG-------------DRPKVPNVVVIVTDGRPQD-----RVAEVAA 129 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 +A+E ++I + + + L+ S P ++ + V + D + Sbjct: 130 EARERGMEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 172 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 15/120 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D L + +GL+ Y++RV S + K T + A+K Sbjct: 502 VDSLDVSPQGTRVGLVQYSSRVRTEFPLSMYHSAQEIKAAVMKVEYMEKGTMTGLALKH- 560 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +L + + R+ + + TDG + + S + KAKE Sbjct: 561 --MLENSFSEAEGARPARRH-------VPRIGLVFTDGRSQDDIS-----EHARKAKEAG 606 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 57/167 (34%), Gaps = 19/167 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D I L +G+I Y+++V+ + + + + T + Sbjct: 68 FMIDIIHNLDIGPNATRVGVIQYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTMTGL 127 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A++ + + ++G + + + +TDG + ++ + Sbjct: 128 AIQYVMNVAFT----------TQEGARPLHKKIPRVAVIVTDGRPQD-----RVTEVSAQ 172 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSS--PEYHYNVVNADSLIHV 401 A+ I+I + + + L+ S E+ + V + D + Sbjct: 173 ARAAGIEIYAVGVQRADMNS--LRAMASPALEEHVFLVESFDLIQQF 217 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 65/185 (35%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + +GL+ Y++RV + T Sbjct: 373 FELVKQFVNQIVDFL---DVSPHGTRVGLVQYSSRVRTEFPLNKFTTAADLKKAVQRVQY 429 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + K T + A+K +L +G + S + + TDG + + S Sbjct: 430 MEKGTMTGLALKH---MLEHSF-------TEAEGARPLSQNVPRIGLVFTDGRSQDDIS- 478 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 + ++KE I + + + + + L+ S P ++ + +++ ++ +N+ Sbjct: 479 ----EWARRSKEAGIIMFAVGVGKAVESE--LREIASEPVDKHFSYSADFNTMTNIVENL 532 Query: 406 SQLMV 410 + Sbjct: 533 KINIC 537 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 45/172 (26%), Gaps = 29/172 (16%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA------ 65 K I SEK + A+ +M +G + + +++A +AA L GA Sbjct: 5 SKLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTR 64 Query: 66 -----------------------SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 ++ S+ + L D + + I Sbjct: 65 PTDPTQLPTLQWEVARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTLPTPVP 124 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + ++ N M + S+ + + Sbjct: 125 GVDIPAVRVIASRSDGNNGGPVKNFFMQIFGKDYSQVSSRAAVAMLGFPYTV 176 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 27/69 (39%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 I+ + + + + G LI + + M+S ++AA L G ++ ++ Sbjct: 108 RGGAVLILIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTTT 167 Query: 78 RFESISNHA 86 R + A Sbjct: 168 RIAVVDRAA 176 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 404 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 460 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + G + L + + TDG + + S Sbjct: 461 MERGTMTGLALRHMVEHSFSEAQ----------GARPRDLNVPRVGLVFTDGRSQDDISV 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 511 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPDFSTMTHLLENL 563 Query: 406 SQLMV 410 + Sbjct: 564 KGSIC 568 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL-GAFSRREDMERAIRAVVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S+ +G + + ++ +TDG + ++ Sbjct: 116 GLAIQYAMNVAFSEA----------EGARPSEERVPRVLVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+T S P ++ + V + D + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRTMASPPLDQHVFLVESFDLIQEF 207 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/352 (10%), Positives = 91/352 (25%), Gaps = 60/352 (17%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + ++ + + + ++ + A +Q Sbjct: 271 GVRLIAPFDTQPANNQAVAFTFVSGPPRLLLLTTTPDQVTALVAAWQATGITVVVQEPSQ 330 Query: 124 SSRISMTHMANNR---LDSSNNTIFYNMDVMTSYDYRLQFIE----------HLLNQRYN 170 A + +D+ + + + + Sbjct: 331 VPADPRALRAFDTIALVDTPADEVPTALQRALTTY-VRDNGGGLLVIGGPRSFGAGGWRR 389 Query: 171 QKIVSFIPALLRIEM-GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + +P L + ERP + LV+D SGSM ++ RT++ Sbjct: 390 TLLEPILPVALDPPLREERPDLALVLVIDRSGSMRELVDDG--------------RTQLD 435 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + A+ + + LI + + + + + Sbjct: 436 LAREAVYQASRGLTQRD------QIALIAFDSIADTLLPLQPLPGLF-TIEDALSRLVAG 488 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ + A + + + + R + +I LTDG + ++ Sbjct: 489 GGTNIRSGIALAAETIATSQARI------------------RHVILLTDGVSETEYAD-- 528 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I + I+I + L + +Y V L V Sbjct: 529 ---LVADLRAQGITVSAIAIGLDTDPA-LERVAQIGGGKYYLVQRVPDLPQV 576 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 404 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 460 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + G + L + + TDG + + S Sbjct: 461 MERGTMTGLALRHMVEHSFSEAQ----------GARPRDLNVPRVGLVFTDGRSQDDISV 510 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 511 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPDFSTMTHLLENL 563 Query: 406 SQLMV 410 + Sbjct: 564 KGSIC 568 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 60/169 (35%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + + + T + Sbjct: 57 QFLVGLLRSLDVGLNATRVGVIQYSSQVQSVFPL-GAFSRREDMERAIRAVVPLAQGTMT 115 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S+ +G + + ++ +TDG + ++ Sbjct: 116 GLAIQYAMNVAFSEA----------EGARPSEERVPRVLVIVTDGRPQD-----RVAEVA 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+T S P ++ + V + D + Sbjct: 161 AQARARGIEIYAVGVQRADVGS--LRTMASPPLDQHVFLVESFDLIQEF 207 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 61/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 214 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 270 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + G + L + + TDG + + S Sbjct: 271 MERGTMTGLALRHMVEHSFSEAQ----------GARPRDLNVPRVGLVFTDGRSQDDISV 320 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + L+ S P + + ++ H+ +N+ Sbjct: 321 -----WAARAKEEGIVMYAVGVG--KAVEEELREIASEPSELHVSYSPDFSTMTHLLENL 373 Query: 406 SQLMV 410 + Sbjct: 374 KGSIC 378 >gi|301758388|ref|XP_002915048.1| PREDICTED: matrilin-3-like [Ailuropoda melanoleuca] Length = 466 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + Sbjct: 70 SSRSVRPLEFTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKTEFHLQTYSDKQSL 126 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + P+ K I + Sbjct: 127 KQAVARITPLST--GTMSGLAIQTAMD----------EAFTVEAGARGPTSNIPKVAIIV 174 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 175 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMIASEPLDEHVFYVET 227 Query: 395 ADSLIH 400 + Sbjct: 228 YGVIEK 233 >gi|281344227|gb|EFB19811.1| hypothetical protein PANDA_002988 [Ailuropoda melanoleuca] Length = 395 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + Sbjct: 17 SSRSVRPLEFTKVKTFVSQIIDTL---DIGAADTRVAVVNYASTVKTEFHLQTYSDKQSL 73 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + P+ K I + Sbjct: 74 KQAVARITPLST--GTMSGLAIQTAMD----------EAFTVEAGARGPTSNIPKVAIIV 121 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 122 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMIASEPLDEHVFYVET 174 Query: 395 ADSLIH 400 + Sbjct: 175 YGVIEK 180 >gi|47217757|emb|CAG05979.1| unnamed protein product [Tetraodon nigroviridis] Length = 1380 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 54/176 (30%), Gaps = 15/176 (8%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D + + ++ Y+ R + + T + Sbjct: 30 QWVANLVDSFDVAPDKTRVAVVRYSDRPTTEFNLARYRTLEDVKRAARNIRYLGGNTMTG 89 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + +++ + Q+ I LTDG + ++ ++ Sbjct: 90 DAISYTTSNIFTERNGARPIA----------RGIQRVAILLTDGRSQDY-----VLEPSK 134 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A + I++ + I + + + ++V + +++ + + + + Sbjct: 135 AAAKAGIRMFAVGIGEALKVELDEIAAEPKNAHVFHVTDFNAIDKIRGRLRKRLCE 190 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 58/198 (29%), Gaps = 35/198 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K A + + L G++ + + +E +K Sbjct: 428 KVELAKEAAIQATSILGPLD------MAGVVAFDDTAQWVVEFQAVKDKD-AIQDDIATI 480 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFK 345 T PA+ AY L + K II LTDG++ Sbjct: 481 RADGGTSIYPALALAYTALKDAHTKF------------------KHIILLTDGQSATTGD 522 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF--- 402 + ++ I + T+++ + L + +Y ++ +F Sbjct: 523 YYFLSRRMARA----GITMSTVAVGEGADTLLLEQLAAWGQGRYYFSDEISNIPRIFTKE 578 Query: 403 --QNISQLMVHRKYSVIL 418 + I +V + +L Sbjct: 579 TMKAIKSYLVEESFFPVL 596 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 70/244 (28%), Gaps = 34/244 (13%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++ I + VD D + D + + + Sbjct: 28 MIVLITTAGLGNSHTGASETGQFGVDYCQPGAGLDLMFVLDNSGTVNLDDTDSIRSSTVS 87 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + L G+I + T + E S + ++ D T Sbjct: 88 DYAENMLP----------GDRGGIISFNTEADMLQEMSDNRYDLLDALSALPD--PSGGT 135 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D + M+ A + K + ++ ++ +TDG + + V Sbjct: 136 DLSQGMRAANEQFVQTKGAN-----------------KQIMVLITDGADTINLAEVY--N 176 Query: 353 ICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +A+ N I I T+ + + + L + + V NA + V Q+I + Sbjct: 177 QVREARMNGITIFTLGLGSLATGLDEALLQDIADQTRGQYRQVPNATVIESVLQDIRSSL 236 Query: 410 VHRK 413 + Sbjct: 237 EGMR 240 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 11/155 (7%), Positives = 39/155 (25%), Gaps = 11/155 (7%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + A ++I + + L ++ + H + +++A ++A L Sbjct: 1 MRALLARFRADADGTMTMIVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAVQ 60 Query: 67 KMVSNLSRLGDRFESISNH--AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV-- 122 + + + + + + A + ++A + Sbjct: 61 VLPDSATAVSRGVSLVGQNLPPSYGTAAAATDVVVGTYDPGAKAFTAGGAQPNAVKVTAR 120 Query: 123 -------NSSRISMTHMANNRLDSSNNTIFYNMDV 150 L++ ++ Sbjct: 121 RSAALGNAIPVYFAWIFGYRSLEAKAESVAVAAGG 155 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 104/379 (27%), Gaps = 46/379 (12%) Query: 46 WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 ++ M A + A+ M + + A I A+ + ++ Sbjct: 82 ASQRQAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNG 141 Query: 106 LSGYSAVFYNTEIQNIVNSS-RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + + +T ++ S + L M +YDY L Sbjct: 142 IMVAGEIPVSTFSIDVDTGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYDYPLPA---- 197 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPIC 220 + P + E+ P + L + L G + Sbjct: 198 ---------KNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSG 248 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ L+ AL L + + ++ Y + Q + Sbjct: 249 SMASVDKLPLLQTALKLLTAQLSAQD------KVSIVVYAGAAGVV--LDGASGNDTQTL 300 Query: 281 T-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ + QAYQ+ + +I TDG Sbjct: 301 NYALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGIN----------------RVILATDG 344 Query: 340 ENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + N ++ + I + +K K++ I + T+ +L++ +Y ++ L Sbjct: 345 DFNVGVTDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT--L 402 Query: 399 IHVFQNISQLMVHRKYSVI 417 + + + +++ Sbjct: 403 NEARKVLVDELSSTLFTIA 421 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|221127586|ref|XP_002157796.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 449 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 16/168 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ + ++G + ++ E EK + D T + A++ A Sbjct: 61 INNFKIDEFHTHVGALVFSEIAEVQFRLDELFEKQDIFDRIDKIPYPGYRTATDDALRVA 120 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + S + +I + DG NV++ KE Sbjct: 121 NKYMFSLSGG-------------ARQGVPQVLILVNDGRCTVCSENVSSASA--PLKEKG 165 Query: 362 IKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 + I + I+ + + + LL + ++ + V D L + + Sbjct: 166 VSIFCVGISKTVDKKELLSIASEPAEDHFFYVETIDELPTFISKLHKK 213 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 48/167 (28%), Gaps = 17/167 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D S +++ + Y + + S + T A++ A Sbjct: 297 VDSFSISQQNARFAALVYGSNASVEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVA 356 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 L S + + + DG S+ + K+ Sbjct: 357 KSDLFSLQGKV-------------RSRRPMILYVFFDGTVTRSMSD--LESVVQPLKDYG 401 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +KI+ I + N +L K S ++ + L + +I + Sbjct: 402 VKIIAIGVGPEVNRYQLKKI--SEDNAIFSGKSFKELAPLLYSIVEQ 446 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 257 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 296 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 297 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 352 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 353 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 401 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 402 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 461 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 462 VEEVTQNNFRLLFKG 476 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|218672104|ref|ZP_03521773.1| von Willebrand factor type A [Rhizobium etli GR56] Length = 366 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 73/237 (30%), Gaps = 45/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I P + ++D+SGSM K+ LK+A L ++ Sbjct: 67 DIAPATAPHANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRLLVNR 107 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ Y +EP+ EK + ++ T ++ A Sbjct: 108 LKPDD------TVSIVTYAGNAGTVLEPTRVAEKSK-ILSAIDRLEAGGSTGGAEGIEAA 160 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKEN 360 Y + + ++ TDG+ N S+ + +I + +E Sbjct: 161 YDLAKKAFVKDGVN----------------RVMLATDGDFNVGPSSDEDLKRIIEDKREE 204 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + + L++T + + D+L + + + + + Sbjct: 205 GIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEEAGSTLFPIA 259 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|156742722|ref|YP_001432851.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234050|gb|ABU58833.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 452 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 31/265 (11%), Positives = 73/265 (27%), Gaps = 43/265 (16%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + ++ + ++ +LLR + + V Sbjct: 33 VQASVTGAAPSRPVNWALVADVSRSMRIPIVDETQF---RSLLRTGSAQEMLVDGVPVWQ 89 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 LSGS+ + +D + + + AL ++ +D + L+ Sbjct: 90 LSGSVPQEV-------------RDTASSALDYVVRALHTIVERLDHHD------RLALVV 130 Query: 259 YTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + I + + + R + T+ + A + +++ Sbjct: 131 FADHALLLIPGMVGADRVTLVRAIERLPGLNLGDGTNLADGIALALNQIRANRDGRCA-- 188 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 II LTDG + + + + D+A + I I TI + Sbjct: 189 --------------DRIILLTDGFT---RDSAACLALADQAADEHIAITTIGLGGEFQDD 231 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHV 401 L S + + ++ Sbjct: 232 LLTAIADRSGGHALFLKRVSAIPRA 256 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 67/190 (35%), Gaps = 26/190 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + + + + + L + +GL+ Y + V+ +K + + Sbjct: 59 SSRSVRPPDFEKVKEFLITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLKTYKKKMDIERA 118 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V R M T + A++ A I S +G + + + + +TDG Sbjct: 119 VKRMMHLAT--GTMTGLAIQYAMNIAFS----------ESEGARPLNQHVPRIAMIVTDG 166 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS 397 + + +I KA+ + I I + + LKT S P E+ + V N Sbjct: 167 RPQDPVA-----EIAAKARNSGILIFAVGVGRVDMS--TLKTIGSEPHTEHVFLVANFSQ 219 Query: 398 ---LIHVFQN 404 L VFQN Sbjct: 220 IETLTTVFQN 229 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 69/188 (36%), Gaps = 26/188 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +K + LDS+ ++ ++GLI Y+T + + + V++ V++ Sbjct: 587 FEIVKQFVNGILDSL---EISQKAAHVGLIQYSTHIRTEFTMAQYSSAKDVKKAVSQIKY 643 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T A +++ G + ++ + I TDG + Sbjct: 644 MGRGSMT------GLALKLMHEKSFSEVQ------GARPRAMGVPRVAIVFTDGRAQD-- 689 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 + KAK++ I I I + + + + L+ S+P ++ + ++ ++ + Sbjct: 690 ---EVSEYAKKAKQSGITIYAIGVGKAIDEE--LQEIASAPQEKHVIYAEDFSAMGYITE 744 Query: 404 NISQLMVH 411 + + Sbjct: 745 KLKSSICE 752 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 16/152 (10%), Positives = 43/152 (28%), Gaps = 21/152 (13%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + T + ++ A Sbjct: 64 SATRTEFTFTTHRNGTELVQAIRNLGYKGGNTRTGTGLRYAADNFF-------------- 109 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 G I K I +TDG++ + + K IK+ + I + + + L + Sbjct: 110 GPTIIRPNVPKVAILITDGKSQDDVDPPT-----QRLKSQGIKMFAVGI-KNADSRELTR 163 Query: 381 TCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + + ++ + V + L + +++ + Sbjct: 164 VASTPTEDFFFYVNDFRILGSLLPVVTRKVCT 195 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 57/185 (30%), Gaps = 23/185 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +KN L + SI L V + + R + S V Q + Sbjct: 1008 ETVKNFLSKVMSSIGQLGPDATQVGLSAYSFRARPWILLNRSSDLSTVLQQIRTIPFEEP 1067 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A+ A + + ++ L DG + + Sbjct: 1068 SG-TSIGAAIDFAKNYMFTTNYGRRNN-------------VPGTLVVLADGPSGDD---- 1109 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNIS 406 ++ K+ KI +++ + Q L+ VS SP + ++ +L + +++ Sbjct: 1110 -VLEPARVMKDAGFKI--LAVGTNGADQEQLRRIVSRQSPRNVFFTRDSGNLNSLVDSLT 1166 Query: 407 QLMVH 411 + Sbjct: 1167 GAICT 1171 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 66/225 (29%), Gaps = 62/225 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++ A K L L + +G I ++ + + Sbjct: 87 DTSRSMMTEDVYPNRLEAAKRTLENLLQGLK-------GDRIGFIPFSDSAYIQMPLTDD 139 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ + + T+ A++ A + + K Sbjct: 140 YSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN------------------K 181 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II L+DG + + KS K+N + + +I I Sbjct: 182 TIIVLSDGGDFDDKSL-------KFVKDNKMNVFSIGIGTDEGTIIPEYVNGKKVGFIKD 234 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQ 407 N L K S +Y V N D + F++ + Sbjct: 235 QKGSAVISKLNSDFLKKLSSESNGKYYEVNNLKDDTSNFFRDTAN 279 >gi|170741047|ref|YP_001769702.1| hypothetical protein M446_2843 [Methylobacterium sp. 4-46] gi|168195321|gb|ACA17268.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 439 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 38/154 (24%), Gaps = 8/154 (5%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 K +A ++I L + + G + H ++ ++ A + A LA A ++ Sbjct: 2 RSLKTFLADRAGAAALILGLCLPMLVAGSGAALEYARIHKRRTELQKAVDTAALAAAGEL 61 Query: 69 V-SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + + D A ++ T + Sbjct: 62 TIAGSDTYVRNLAVRTAYEAAGGTDPAVTRATARVQNRRGWVQVEIRET-------VRSL 114 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + ++ S L Sbjct: 115 FGRLLTMPTMELSAQATGELSGSTRVCLLTLDDS 148 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/342 (11%), Positives = 106/342 (30%), Gaps = 31/342 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G + ++ + S +A IDD ++ + A + ++ V Sbjct: 883 GVVQYSDGINIQFALSQYSSMAELKAAIDDIQQRKGGTMTGEALSRMAQVFVDTARSNVP 942 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 I +T + + + + Y ++ + + + F P + Sbjct: 943 WYLIIIT-DGKSEDPVAEPAEALRGEGVIIYAIGVKNANVMELKEIAKDKTFFTPEFDSL 1001 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 ++ +R VV S N + + + +K + ++ + Sbjct: 1002 KVIQRD------VVQDICSSETCKNRQADIIFLIDGSESISPNDFEKMKGFVKRMVNQAN 1055 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +++ +GL+ +++ ++ + + K + + T + A+ Sbjct: 1056 ---IGADEIQIGLLQFSSSPQEEFRLNQYSSKADIHRAISKVVQMNDGTRTGKALTFTLP 1112 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 S + +++I +TDG + + ++ I Sbjct: 1113 FFDSSRGG--------------RPNVHQYLIVITDGVAQDDVAIP-----AKALRDRNIV 1153 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I I + +LL+ + +Y N +SL ++ + I Sbjct: 1154 IFAIGVG-EAKSAQLLQITDDVQKVYYE-ENFESLQNLEKKI 1193 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 56/192 (29%), Gaps = 21/192 (10%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYV 280 R K + + ++ L+ + V +GL+ Y+ K+ +++ Sbjct: 641 FSRDKQWNFQQVIDFLKTTVSSLNVHPDGVRIGLVFYSEEPRLEFSLDTFQTPAKMLEHL 700 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T + A+ F + + + +T+G Sbjct: 701 DKLTYRRRSGRTKTGAALDF-----------LRNEVFVEERGSRSKQGVLQMAVVITEGF 749 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + + S + + I + + + L+ S P ++ ++ + L Sbjct: 750 SQDQLSEP-----ASLLRRAGVTIYAVG-THRASESKDLENIASYPPWKHVISLESFLQL 803 Query: 399 IHVFQNISQLMV 410 V I + Sbjct: 804 SVVGSKIKNQLC 815 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 53/184 (28%), Gaps = 18/184 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + I L + V +GL Y + + + K Sbjct: 249 QNFQKVKNFLYSVILGLDISSDQVRVGLAQYNDNIYPAFQLNQYPLKSVVLEQIQNLPYR 308 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ + +I +TDGE+N+ Sbjct: 309 TGDTNTGSALEF-----------IRMHYLTEAAGSRAKDSVPQIVILVTDGESND----- 352 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQ 407 + +K KE+ + + + + Q L K E +N+ N + L I Q Sbjct: 353 EVQEAANKLKEDGVVVYVVG-VNVQDVQELQKIASEPFEKFLFNIENFNILQDFSGGILQ 411 Query: 408 LMVH 411 + Sbjct: 412 TLCS 415 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 49/187 (26%), Gaps = 20/187 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +++ ++ + K+ + +GL Y S K Sbjct: 45 TQDIRSVRNFLYIMVNSFNVSKDSIRVGLAQYGDVPRSEFLLSTYPRKGDVLKHIQKLQP 104 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 A++ +F S + + ++ N + Sbjct: 105 KSWGHKMGLALQF-----------LLDHHFQATAGSRASQGVPQMAMVIS-----NSPAE 148 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNI 405 + K + + T+ + + + LK SSP + V N L Q + Sbjct: 149 DPVQEAAKALKRAGVLLYTVGVKDAVLAE--LKEIASSPAEKFTSFVPNFPDLGSHAQKL 206 Query: 406 SQLMVHR 412 Q + Sbjct: 207 RQQVCDT 213 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 63/172 (36%), Gaps = 26/172 (15%) Query: 233 NALLLFLDSIDLLSH-VKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLIL 289 ++ F+ + + ++ V++G++ ++ ++ + + ++ + + Sbjct: 215 TSMKTFITKVVNGTIIGEDSVHVGVVQFSNNPQEQFPLNRYFDQNELEEAIDGIEQL--T 272 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A+ + + +F++ +TDGE ++ + Sbjct: 273 GDTYTGKALSFISKYFDASNGG--------------RPDVPQFLVVITDGEAHDAVAVP- 317 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ + I +I + AS N +L + + + Y + D+L + Sbjct: 318 ----AKAIRDKGVTIFSIGV-ASVNTTQLWEIS-GTQDKVYVQRDFDALHSI 363 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 59/191 (30%), Gaps = 21/191 (10%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + T+ ++ L S+ L + V +G++ + + Sbjct: 20 NSGSTRAD-FEDVKKFILGSLQLFNIGPNRVRVGVVKVDRNPTLQFSLTEHKNRASFEAA 78 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S + ++ A+K + K Q+ +I +TD + Sbjct: 79 VRGISQPVGGSEKGKALKYVASLFNQAKASRPA-------------KVQEILIVITDKTS 125 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + ++ + + + I + + LLK + Y V N D+L + Sbjct: 126 QDDVGDP-----AEELRIQGVSVYAIGV-KDASQDELLKMTADETKDFY-VTNYDALNVL 178 Query: 402 FQNISQLMVHR 412 + I + + Sbjct: 179 KREIVTDICSQ 189 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 67/221 (30%), Gaps = 23/221 (10%) Query: 185 MGERPIFLIELVVDLS-----GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + F +D + S H + + + Q ++ ++ + + Sbjct: 351 VYVQRDFDALHSIDKNLQFKLCSSHPGGCPSTQLADVIFLVQCTRQIRLQDFEKIKSFLI 410 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ + G+I Y+ + S R+ + + SL Sbjct: 411 SVVNSTQIGDNLIRFGVIVYSDTP---SQFSLNQYNTRRQIAEAITSLKSP--------- 458 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A T+ T F + + +TDG + + N D+ Sbjct: 459 -AVSGYTARALAYSLTYFTEANGGRQKRGVPQMLFMITDG---DARDRENLRARADEFAA 514 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I + I + + + + L+ S ++V N D+L Sbjct: 515 KQINVYGIGVARAQDSE--LEIITKSKNKIFHVNNYDNLQD 553 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 15/102 (14%), Positives = 35/102 (34%), Gaps = 2/102 (1%) Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + +S ++ F+ N ++ ++ + + S+ + D+ Sbjct: 613 FRVGIAQFSSVYQKEFYMNEYKDADGEDKGSRKQSGVPQNLVLITDGVSSDRVNEAADQL 672 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + I + I I A + L SP+ + V N + L Sbjct: 673 RNLGINVFAIGIGAV--SLQQLSYIAGSPDRLFKVQNFNYLN 712 >gi|116494017|ref|YP_805751.1| von Willebrand factor domain-containing protein [Lactobacillus casei ATCC 334] gi|116104167|gb|ABJ69309.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei ATCC 334] Length = 909 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 31/418 (7%), Positives = 100/418 (23%), Gaps = 53/418 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + ++ + A + ++ G + Sbjct: 14 LFAVLLILMSMLTGLVTSGSSVVTATANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 70 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE----MGERPIFLIELVVD 198 + + ++ + + + V Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVRTGVKNFLTSIQNAGLGDYVNVGL 189 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S + + I L S + Sbjct: 190 IGFSSPGYIGGGNKTTGPGYIRVG----------------LGKAGNTSQQQAINSALSPT 233 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I G+ +++ + + K S N Sbjct: 234 FNGGTYTQIGLRQGS--------AMLNADTSGNKKMMILLTDGVPTFSDKVKNSVVENGT 285 Query: 319 RQGVKIPSLPFQKFII------FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 G + + ++ + + + T+ KAK++ ++ + I Sbjct: 286 LYGTNFGTTRDEPGYTAELKNPYIDSSGDYIYDTWPATLGEAKKAKDSGNEVHALGIQLA 345 Query: 371 -----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + ++ ++ +SP+ + + +AD++ N ++ +V ++I Sbjct: 346 GDDGYWSEGYMSDEDVRQNMELIPTSPDLYEDADSADAVEAYLNNQAKDIVKNFNTII 403 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/269 (14%), Positives = 85/269 (31%), Gaps = 62/269 (23%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F + + P + + ER ++ LV+D+SGSM Sbjct: 283 DFNNSMPMETPPAP-----PFFSLLRISER---IVCLVLDVSGSMA-------------- 320 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-R 277 ++ + A FL I ++ ++G++ + E + R Sbjct: 321 -----SYDRLDLMNRAAKHFLSQI-----IENRSWVGMVHFNHLANIKSELIQMNSNIER 370 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + + T +K A+Q+ + + T I+ L+ Sbjct: 371 NQLLQTLPTSADGGTSICSGIKAAFQVFKNGGYETDGTE----------------ILLLS 414 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DGE + D+ K++ + I++ S + + + + H + Sbjct: 415 DGE------DSTAKDCIDEVKDSGAIVHFIALGPSADLA-VTNMSILTGGKHMYASDEAQ 467 Query: 398 ---LIHVFQNISQL---MVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 468 NNGLIDAFVALASENVDVTQKSLQLESKG 496 >gi|294460157|gb|ADE75661.1| unknown [Picea sitchensis] Length = 350 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 7/58 (12%), Positives = 18/58 (31%), Gaps = 4/58 (6%) Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + T + + +S + + F +Q + SV+++ Sbjct: 3 IPVHTFGFGTDHDSASMHSISETSGGTFSFIETESIIQDAF---AQCIGGL-LSVVIQ 56 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 74/261 (28%), Gaps = 48/261 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPA-LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + E++ ++ + I+ P + ++D+SGSM Sbjct: 168 YTEYMDCPWNEDTKLALVSMNTQAIDFKSAPASNLVFLIDVSGSM--------------- 212 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ ++ AL + +++ V Y G + V ++ Sbjct: 213 ----FDDNKLPLVQQALTMLAENLTEKDRVSIVTYAG----SDEVVLQGVSGDDYHEISS 264 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ + ++ AY + K +I TD Sbjct: 265 AIEGL---EAYGSTNGSAGIETAYAL----------------AKKYFIKGGNNRVILCTD 305 Query: 339 GENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNAD 396 G+ N S K+ + K++ + + T + L+ + + + Sbjct: 306 GDLNVGLTSEGQLEKLITEKKDSGVFLSTFGVGYGNYKDNKLELLADKGNGNYAYIDSMF 365 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 + + + +V Sbjct: 366 E---AKKALVDELGANMVTVA 383 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 69/223 (30%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D ++ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDK-------MQNDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESIDPSLISKQGTAIGAAINLATRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + ++ A E I++ + + Sbjct: 192 AIIVITDGENHEGGA----VEAAKAATEKGIQVNVLGVGMPDGAPIPMEGTNDYRRDRDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + + + V N +S ISQ + Sbjct: 248 NVIVTRLNEEMCQEIAKAGNGIYVRVDNTNSAQKA---ISQEI 287 >gi|326677363|ref|XP_691404.5| PREDICTED: collagen alpha-1(XII) chain-like [Danio rerio] Length = 3085 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 52/174 (29%), Gaps = 30/174 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMK 299 ++ V + L+ Y+ T V + + + T++ AM Sbjct: 446 VNTFDIGPNKVQISLVQYSRDPYTEFYL--NTHHDLNAVVKAVRTFPYRGGSTNTGKAMT 503 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + + + I +TDG KS+ K + Sbjct: 504 YVRERIFIATRG-------------ARENVPRVTILITDG-----KSSDAFKDPAAKLRN 545 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ + P + Y V + FQ IS + Sbjct: 546 TDVEIFAVGVKDAVRSE--LEAIANPPAETHVYTVEDF----DAFQRISNELTQ 593 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 56/178 (31%), Gaps = 22/178 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ +G++ Y+T + +V + T + Sbjct: 140 NFIAALAGAFDLGEDKTRVGVVQYSTDTRTEFNLNQHFRRVDLLRAINNLPYKGGNTMTG 199 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + ++ F + + +TDG++ + Sbjct: 200 EALDYLLKNMFTEAAG-------------ARKGFPRVAVVITDGKSQDP-----VEGYAK 241 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 K K +++ T+ I + + LK S+P + Y V N D + V ++ + Sbjct: 242 KLKNAGVELFTLGIKEADEEE--LKQMSSTPYRTHVYTVPNFDMIKAVEKSFIAQVCS 297 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/263 (9%), Positives = 79/263 (30%), Gaps = 40/263 (15%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + + +S P + S + S Sbjct: 1135 SVAVFGMFDGGQSMPLAGEEKTTLSDAPDS----------------PPVKYSGNECKTSA 1178 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D+ ++ K + + + V +GL Y+ + + Sbjct: 1179 KADIVLLVDGSWSIG-RLN-FKTIRAFIGRMVGVFDIGPDKVQIGLAQYSGDPKTEWHLN 1236 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + T + A+ IL ++ + + + Sbjct: 1237 AHPTRASLLDAVANLPYKGGNTMTGMALNY---ILQNNFRPNVGM----------RPDSR 1283 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--Y 388 K + +TDG++ + + + +++ I++ I + + L++ + P+ + Sbjct: 1284 KIGVLVTDGKSQDEIVVNS-----QRLRDSGIELYAIGV--KNADENELRSIATDPDEIH 1336 Query: 389 HYNVVNADSLIHVFQNISQLMVH 411 YNV + L+ + ++++ + + Sbjct: 1337 MYNVNDFSFLLDIVDDLTENLCN 1359 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 46/166 (27%), Gaps = 26/166 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + Y+ + +K + T + A+K Y+ + + Sbjct: 2338 GPTGMQISFVQYSDDANTEFRLNTYKDKGTALSALKLIRYQGGNTKTGVALKHVYEKVIT 2397 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + K ++ +TDG + + K + + + Sbjct: 2398 VENGM-------------RRNVPKVVVAVTDGRSQDDVHKN-----AAKLQHAGYSVFVV 2439 Query: 368 SINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 + + L+ S P + + V + F I +V Sbjct: 2440 GVADVDFVE--LQNIASKPSERHVFVVDDF----DAFSTIQDNLVT 2479 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/235 (11%), Positives = 60/235 (25%), Gaps = 59/235 (25%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + I L VDLS SM P + L + Sbjct: 75 ETSVYRGRGVDIMLAVDLSESMAALDMPLP-----------------DRTVSRLEAVAQA 117 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +GL+ + +R + P+ + Q ++R + T A+ A Sbjct: 118 AARFAADRPGDRIGLVAFGSRAYVVLPPTDDRAALTQALSRLSVGAAGRRTAMGDAVGLA 177 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L + + ++ DG +N + ++ A Sbjct: 178 VKQLD------------------RAPGLARLVVVFGDGLSNAGEV--RPVEAAKAAAARG 217 Query: 362 IKIVTISINASP----------------------NGQRLLKTCVSSPEYHYNVVN 394 I + T+ + + L + S + + Sbjct: 218 IAVFTVGVGGDGPAPFLVNHPLLGQEIVRENAAVDTAALTELAALSGGAFFRAED 272 >gi|126331114|ref|XP_001371712.1| PREDICTED: similar to Anthrax toxin receptor 2 precursor (Capillary morphogenesis gene 2 protein) (CMG-2) [Monodelphis domestica] Length = 610 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 60/184 (32%), Gaps = 21/184 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + ++K+ + + + + T Sbjct: 185 IYNFVKQLTER-FVSPGMRLSFIVFSSQATIILPLTGDSKKITEGLKDLKEVQPVGETYI 243 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ A + + + L +I LTDG + K Sbjct: 244 HEGLRLANEQIKNA----------------GGLKTSSIVIALTDG-KLDKLVPKYAAKEA 286 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHRK 413 + ++ ++ + + + + L+ S + + V +L + +I + Sbjct: 287 NISRTLGARVYCVGV-LDFDQAQ-LENIADSKDQVFPVTGGFQALKGIINSILEKSCTEI 344 Query: 414 YSVI 417 + Sbjct: 345 LDLE 348 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 60/193 (31%), Gaps = 33/193 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ LK+AL + + + ++ Y + + G ++ Q + Sbjct: 226 KLPLLKSALRQLVTQMRAQD------RVAIVAYAGSAGLVLPSTPG-DRHAQILAAIDGL 278 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ ++ AY + I+ +DG+ N ++ Sbjct: 279 QASGSTNGGAGLELAYAEAAKGLVKDGVN----------------RIVLASDGDFNVGRT 322 Query: 347 NV-NTIKICDKAKENFIKIVTISINASPNGQRL-LKTCVSSPEYHYNVVN--------AD 396 ++ ++ I + T+ + + + ++ + ++ + + A Sbjct: 323 DLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVFAS 382 Query: 397 SLIHVFQNISQLM 409 L I++ + Sbjct: 383 ELSATLLTIAKDV 395 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 61/233 (26%), Gaps = 59/233 (25%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + I LVVDLS SM D+ T++ A+ +A Sbjct: 75 GGTTAYQGRGLDIMLVVDLSESMAAMDM----------RLADRTVTRLDAVADAA----- 119 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + +GL+ + +R + PS + + R K T + Sbjct: 120 --ARFAANHPGDRIGLVAFGSRAYAVMPPSADRAALTGALARLAVGAAGKRTAMGDGLGL 177 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + L+ + + DG +N + + A E Sbjct: 178 AVKRLSDA------------------PGLSRLAVVFGDGRSNAGEVS--PEDAAKAASER 217 Query: 361 FIKIVTISINASP----------------------NGQRLLKTCVSSPEYHYN 391 + + ++ + + L ++ +Y Sbjct: 218 GVTVYSVGVGGDEPAPFLVTHPLLGSQIVTEKAAVDATTLAAMAKATGGAYYR 270 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 76/252 (30%), Gaps = 33/252 (13%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + + + G P ++ D + + + K Sbjct: 30 GPIKDPNGFPAVGPTITVRPGPGPEPE-----RVTFQDCPVDVFFVLDTSESVALRVKPF 84 Query: 226 TKM-AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + +K+ F+D + + + G + Y+ V + + +R V Sbjct: 85 KTLVTQVKDFTKKFIDKLTSRYYRCDRNLVWNAGALHYSDEVIMISSLTRDMKTLRDDVE 144 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I K T + A+K+ + + K++I +TDG Sbjct: 145 --TVEYIGKGTHTDCAIKRGIEEVLIGGSHQKEN---------------KYLIVVTDGHP 187 Query: 342 NNFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY--HYNVVNADS 397 ++AK IK+ +++I + + + L S + ++ +A Sbjct: 188 LEGYKEPCGGLEDAANEAKHLGIKVFSVAI-SPNHLEPRLSVIASDASHRRNFTATSAAG 246 Query: 398 LIHVFQNISQLM 409 L + I + Sbjct: 247 LTD--EEIDNTI 256 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 57/192 (29%), Gaps = 18/192 (9%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + RT+ A + ++ V + + E G Sbjct: 597 DTSGSMNARVGGRTRFQVATTAADRAVGLLNSADRVALWSFSSETDQRRGKPYSEEIRLG 656 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + R + T ++ A++ L + Sbjct: 657 PYDRAAFTRRLTGLRVGGNTALYATVRAAHRRLLDN----------------HDPDRINA 700 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHY 390 ++ LTDG+N + N + D + +K+ ++ + + L + +S + Sbjct: 701 VVVLTDGKNEYPRDNDLDRLLADIELDPNRPVKVFCVAFDRESDLAALDRIAGASAGKAF 760 Query: 391 NVVNADSLIHVF 402 + + ++ F Sbjct: 761 DATDPATIDEAF 772 >gi|126322632|ref|XP_001381040.1| PREDICTED: similar to collagen, type XIV, alpha 1 [Monodelphis domestica] Length = 1892 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ L + + + ++ +T + K S Sbjct: 1193 DNFNKIINFLYNTVGALDKIGVDGTQVSIVQFTDDPRTEFMLNTYKTKDTLLEGIKNLSY 1252 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K L + + K I+ +TDG + + + Sbjct: 1253 KGGNTKTGKALKHVRDALFTAEGG-------------TRRGIPKVIVVITDGRSQDDVN- 1298 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 KI + + I I I + + + + S + + V + F+ I Sbjct: 1299 ----KISRELQLEGISIFAIGVADADYAELVSIGSQPSARHVFFVDDF----DAFKKIED 1350 Query: 408 LMVH 411 ++ Sbjct: 1351 ELIT 1354 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 52/173 (30%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +GL Y+ + + K T + A+ Sbjct: 332 VTAFNVGANKTRIGLAQYSGDPRIEWHLNTFSTKDAVIDAVRNLPYKGGNTLTGLALNY- 390 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + +++ Sbjct: 391 ------------IFENSFKPEAGARTGVSKIGILITDGKSQDDIIP-----ASKTLRDSG 433 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V + +++ + R Sbjct: 434 VELFAIGVKNADVAE--LQEIASEPDSTHVYNVAEFDLMHTVVEGLTKTVCTR 484 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 171 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 227 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 228 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 277 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 278 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 330 Query: 406 SQLMV 410 + Sbjct: 331 RGSIC 335 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 212 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 268 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 269 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 318 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 319 -----WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENL 371 Query: 406 SQLMV 410 + Sbjct: 372 RGSIC 376 >gi|45384318|ref|NP_990352.1| collagen alpha-1(XII) chain precursor [Gallus gallus] gi|2506307|sp|P13944|COCA1_CHICK RecName: Full=Collagen alpha-1(XII) chain; AltName: Full=Fibrochimerin; Flags: Precursor gi|222811|dbj|BAA00701.1| alpha 1 chain of type XII collagen [Gallus gallus] gi|2326442|emb|CAA43358.1| collagen type XII alpha 1 chain [Gallus gallus] Length = 3124 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 59/187 (31%), Gaps = 26/187 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT--EKVRQYVTRDMDS 286 + L + + +E +G++ Y++ + + + R Sbjct: 153 NNFRYILDFMVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIP-- 210 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+ + ++ F K I +TDG+ + Sbjct: 211 YKGGNTMTGEAIDYLVKNTFTESAG-------------ARKGFPKVAIVITDGKAQD--- 254 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQN 404 + + +++ ++ I A+ + LK S P + +NV N D ++ + Sbjct: 255 --EVEIPARELRNIGVEVFSLGIKAAD--AKELKLIASQPSLKHVFNVANFDGIVDIQNE 310 Query: 405 ISQLMVH 411 I + Sbjct: 311 IILQVCS 317 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 74/240 (30%), Gaps = 31/240 (12%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 ++ PI ++E + + C+ D + + +K + Sbjct: 401 INVYAMKGLTPSEPITIMEKTQQVKVQVECSRGVDVKADVVFLVDGSYSIGIANFVK--V 458 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTD 293 FL+ + V + L+ Y+ + +V+ + T+ Sbjct: 459 RAFLEVLVKSFEISPRKVQISLVQYSRDPHMEFSLNR-YNRVKDIIQAINTFPYRGGSTN 517 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + AM + + K S + +I +TDG KS+ + Sbjct: 518 TGKAMTYVREKVFVTSKGS-------------RPNVPRVMILITDG-----KSSDAFKEP 559 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 K ++ ++I + + + L+ S P + Y V + FQ IS + Sbjct: 560 AIKLRDADVEIFAVGVKD--AVRTELEAIASPPAETHVYTVEDF----DAFQRISFELTQ 613 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 54/185 (29%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + L+ Y+ + + + +K + Sbjct: 2341 DNFNKVVKFVFNTVGAFDLINPAGIQVSLVQYSDEAQSEFKLNTFDDKAQALGALQNVQY 2400 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2401 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RRGVPKVLVVVTDGRSQD---- 2443 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 K + + + + + A + L K S + + V + F+ I Sbjct: 2444 -EVRKAATVIQHSGFSVFVVGV-ADVDYNELAKIASKPSERHVFIVDDF----DAFEKIQ 2497 Query: 407 QLMVH 411 +V Sbjct: 2498 DNLVT 2502 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 19/183 (10%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ K+A+L LD + + + + + + + Sbjct: 452 TKLQRAKDAVLKALDHFTAEDEIGLAAFSQVGDGPLTPGVVSPVAPFKTNKEDLIAKLNE 511 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-- 343 + T A + F K F I+ L+DG+N+ Sbjct: 512 LKAVDATPLFEA----------------VSRFAGDQAKEYKDNFINAIVLLSDGKNDTTH 555 Query: 344 -FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + + + T++ + L + +S ++Y+ + + L V Sbjct: 556 PGDLGGLSEQLGHQNHSTPVLVFTLAYGPDADVPTLREIARASGAHYYDATDPNRLEEVL 615 Query: 403 QNI 405 + Sbjct: 616 GEL 618 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 31/283 (10%), Positives = 72/283 (25%), Gaps = 70/283 (24%) Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + N +++ ++ + + L ++ + ++ ++ ++ + E Sbjct: 153 FSYVIFKNDMENPFTHEKIEENMNVNVEIVLDASGSMVKKIGDKTMMEIAKESIKQVLSE 212 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 P V + H N+ + S + Sbjct: 213 MPAN---AKVGIRVFGHKGDNTASKKDESC----------------GANELI-------- 245 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + V T +++ + L Sbjct: 246 ----------------YPIGDL-----NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK- 283 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIV 365 + + +TDG N ++I + K I + Sbjct: 284 ---------------ALDGEKTLNILYIITDGIETCG---GNPVEIAKQLKGENTNIVLG 325 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQ 407 I N N RLLK ++ Y+ +V +AD L I++ Sbjct: 326 IIGFNVDANQNRLLKQIADAAGGYYSSVNDADKLTGELYRINE 368 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 63/173 (36%), Gaps = 30/173 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V + +T ++ T +K +Q + ++ + Sbjct: 346 VGMVTFDSVAEIRNNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQST 405 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N I C ++ K++ + I TI++ Sbjct: 406 SGSE----------------IILLTDGEDN-------EIHSCIEEVKQSGVIIHTIALG- 441 Query: 372 SPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQNISQ---LMVHRKYSVILKG 420 + L + + Y + + L + F IS + + + K Sbjct: 442 PSAAKELETLSDMTGGHRFYANKDINGLTNAFSRISSRSGNITQQTIQLESKA 494 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/268 (8%), Positives = 61/268 (22%), Gaps = 34/268 (12%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES--------------- 55 ++ + +++ +L + L + I + K+ ++ Sbjct: 1 MRRLLRDRSGQITVLASLLLPIGLGIAALAIDLSTLQLVKHRLKVAADAASLAAVAVLPD 60 Query: 56 ---ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 A + A L+ A+ + + + + AK F + +A Sbjct: 61 TTTALDRA-LSIAADNAGTGAGTVTAASDVRFGSYNSA---AKSFTPGATPANAVQVTAS 116 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 +V + + + D S + + + L Sbjct: 117 RNQAHGNPVVLA---FAKALGWSTPDISVSAVAVRFSPAYCF-LVLDPSASDALSVSGTG 172 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +S +++ + C + + Sbjct: 173 RLSVPNCGVQVNSTSATAATVGNNSTAQARSFCIVGG--------YSGTSFSPKPITKCA 224 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYT 260 A D + Y GL + Sbjct: 225 AAPDPLADIPEPAQPTAGCYYNGLNTGS 252 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 68/193 (35%), Gaps = 18/193 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ +L +A + +++ ++ + + +I + T VE + + + + + Sbjct: 25 SGSMSGAKIDSLYDATIDMIETFSAAQAKEQVIDVAIITFGTHVELHTKYTPVKDLQAKG 84 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T A++ A ++ PS ++ ++ ++DG Sbjct: 85 ICKFS---ASGLTPMGTALRMAKDMIEDKDVT-------------PSRIYRPAVVLVSDG 128 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N+ + I + + + ++I + + +L+ P +A + Sbjct: 129 APNDDWKSPMDKFI-NDGRSAKCQRFAVAIGNDAD-RSILERFTQDPNAVLFAEDAKDIS 186 Query: 400 HVFQNISQLMVHR 412 F+ IS + Sbjct: 187 EQFKTISMSISTM 199 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEGRLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 36/274 (13%), Positives = 82/274 (29%), Gaps = 49/274 (17%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 Y + + ++ IE ERP + ++D+SGSM Sbjct: 174 YARPTTRDVPFTTNIDVAKTPWNEDTYLMRIGLRGYDIERDERPPANLVFLMDVSGSM-- 231 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 K+ ++ AL + V ++ Y Sbjct: 232 -----------------GSPDKLPLVQTALSGLAGELGEQDRVS------IVVYAGAAGL 268 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 +EP+ T K+R + ++ AY I + Sbjct: 269 VLEPTNDTAKIRAALMSLSAGGST---AGGAGIQLAYNIAEDNFIEGGVN---------- 315 Query: 326 SLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +I TDG+ N S+ +++ +K ++ I + T+ + +++ + Sbjct: 316 ------RVILATDGDFNVGVSDRDALVEMVEKNRDRGITLTTLGFGTGNFNEAMMEQIAN 369 Query: 385 SP-EYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + +A + + +S M +++ Sbjct: 370 KGNGNYAYIDSA---LEAKKVLSDEMSSTLFTIA 400 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 68/232 (29%), Gaps = 51/232 (21%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D+SGSM A K+ L+ AL + + V Sbjct: 244 ASNLVFLLDVSGSMASAD-------------------KLPLLQTALKMLTQQLSAQDKVS 284 Query: 250 EDVYMGL---IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 VY G + + ++ T+ + + QAYQ+ Sbjct: 285 IVVYAGAAGVVLDGASGDDIQALTY----------ALEQLRAGGSTNGSQGILQAYQLAQ 334 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + +I TDG+ N + I + +K K+ I + Sbjct: 335 KHFIQGGIN----------------RVILATDGDFNVGVTNFDQLISLIEKEKQRGIGLT 378 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 T+ +L++ HY ++ L + + + ++ Sbjct: 379 TLGFGMDNYNDQLMEQLADKGNGHYAYIDT--LNEARKVLVDELSSTLLTIA 428 >gi|326918656|ref|XP_003205604.1| PREDICTED: anthrax toxin receptor 2-like, partial [Meleagris gallopavo] Length = 480 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 69/224 (30%), Gaps = 46/224 (20%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 F + V+D SGS+ + + VN + +++ Sbjct: 47 AFDMYFVLDKSGSVAQNWHEIFDFVNQLTERFVSPKMRLS-------------------- 86 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 I ++++ + + + EK+++ + + T +KQA + Sbjct: 87 ------FIVFSSQAQVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQANLQIEKQG 140 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 F II LTDG + + + K +++ ++ + + Sbjct: 141 ASRF----------------SSIIIALTDG-KLDGQIPLYAEKEAKTSRQLGARVYCVGV 183 Query: 370 NASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHR 412 Q L+ + E + V +L + ++ + Sbjct: 184 LD--FVQAQLERIADTKEQVFPVTGGFQALKGIINSVLKQSCTE 225 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 53/201 (26%), Gaps = 31/201 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--------EDVYMGLIGYTTRVE 264 D + + + +K + K + F S+ + VK + G + + Sbjct: 42 DASGSMAKRIDGVSKFNSAKKEISKFASSLPEGTQVKMSVFGSEGNNKNSGKVQSCESIR 101 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 E + ++ T A+ +A Sbjct: 102 NVYGFQSFNE--QSFLNSLNAIGPTGWTPIAKALNEAKSSFDQ----------------- 142 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCV 383 +K + LTDGE N IK + ++ I + I + +L Sbjct: 143 LDTKGEKVVYLLTDGEETCG---GNPIKTAKELHKDNITVNVIGFDYKEGYKGQLNAIAK 199 Query: 384 SSPEYHYNVVNADSLIHVFQN 404 ++ + +F Sbjct: 200 VGGGEYFPAYTQKDVEKIFTQ 220 >gi|198429401|ref|XP_002121222.1| PREDICTED: similar to integrin alpha 9 [Ciona intestinalis] Length = 1242 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 31/257 (12%), Positives = 70/257 (27%), Gaps = 28/257 (10%) Query: 158 LQFIEHLL----NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + ++ I + R+ G+ + + G + Sbjct: 99 FDKTGAMGISISMNTDDENIYVSVAYAQRLLEGDINTLYLSIACPTEGLTADIIF---VV 155 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S + D+ R + +K + F ID + + + +W Sbjct: 156 DESGSVDVDEYRDSLNWMKQVISSFRSYIDKGDVHVGVIGFSRLNNIDTKVRIRLQAWSY 215 Query: 274 EKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + ++ T A+ K Sbjct: 216 TSLTSQINNMVNVRSLNGLTYIGYAINLTITEFDD----------------HGRESVPKE 259 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +I LTDG N ++A+ N I V++ + + + +LL + + Sbjct: 260 MILLTDGAAT---KPENVKPAAERARANGIVTVSVGVGSRVDETQLLTI-AGNASRVFKA 315 Query: 393 VNADSLIHVFQNISQLM 409 N D+L V + + + Sbjct: 316 TNYDNLDSVVEGVKSTI 332 >gi|114586161|ref|XP_001172548.1| PREDICTED: hypothetical protein [Pan troglodytes] Length = 543 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 64/188 (34%), Gaps = 24/188 (12%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K + ++ + + + +GL+ +++ + + + KV Y Sbjct: 108 SPKDFEKMKRFVESMVNQ---SNIGTDGIQIGLLQFSSIPLEEFRLNQYSSKVDIYRATF 164 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + A+ F K Q+++I +TDG + Sbjct: 165 DVQQMRDGTRTGKALNFTL--------------PFFDSSKGGRPSVQQYLIVITDGVAQD 210 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 N I ++ I I I + +LL+ + + Y+ VN ++L ++ + Sbjct: 211 -----NVIIPAKALRDKNIIIFAIGVG-EAKKSQLLEI-TNDEDKVYHDVNFEALQNLEK 263 Query: 404 NISQLMVH 411 I + Sbjct: 264 EILSKVCD 271 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 9/90 (10%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + +++I +TDG KS+ + + + + N + I I I N L + Sbjct: 6 ADTSRINVARYLIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGI-REANIDELKEI 59 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + V D L + + + Q + Sbjct: 60 A---KDKIFFVYEFDLLKDIQKEVVQDICS 86 >gi|260769129|ref|ZP_05878062.1| putative hemolysin [Vibrio furnissii CIP 102972] gi|260614467|gb|EEX39653.1| putative hemolysin [Vibrio furnissii CIP 102972] Length = 1476 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 44/344 (12%), Positives = 97/344 (28%), Gaps = 34/344 (9%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ + LS + + + AL D K + S +V + + Sbjct: 748 VIVSGVPDDATLSVGTNLGDGRWSIPVSALDSDGKATVVVEAPTGTSPELSVTVGAQEVD 807 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + + + + N D + + + + Sbjct: 808 LNGEALDLPKYASTDIGSVGLPDSAPNGDNTVE-----------GGKGDDVLMGDIGGYV 856 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-TKMAALKNALLLFL 239 + ++ G I L++D SGSM + + +++ Q + ++M + +AL Sbjct: 857 VSVQPGVNY--NIALIIDTSGSMKFDLAGNDNGFSNSYQSQSQYNASRMKLVIDALTNLA 914 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + V +G + + + + ++Q +T D T+ A Sbjct: 915 TDLVNHDGVININLIG---FESSAHSALTLQLTADNLQQLLTEIQDMDAEGGTNYEAAFD 971 Query: 300 QAYQILT--SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 A + + T F G S TD + A Sbjct: 972 LASNWFSHQPTEGYENLTYFLTDGDPTFSNSGDNGAGNQTDYRDMYDALEAFKDLSSQSA 1031 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I I + RL+ ++ + SL V Sbjct: 1032 ------VHAIGIGSGITEGRLM---------FFDNTSDTSLQAV 1060 >gi|269969412|sp|C7G0B5|PIF_PINFU RecName: Full=PIF; Contains: RecName: Full=Pif97; Contains: RecName: Full=Pif80; AltName: Full=Aragonite-binding protein; Flags: Precursor gi|256252203|dbj|BAH97338.1| Pif177 [Pinctada fucata] Length = 1007 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 64/193 (33%), Gaps = 24/193 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D LK A+L+ + + + + +G++ Y + V +I + + Sbjct: 36 ASDDVSDQDFDKLKRAMLMMVRGLSIDD---NQIRLGMVTYGSEVCDSIPLQGDRLDLAR 92 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + P+ M +A ++ +S + + G I + Sbjct: 93 TIRYMK--KPTGPSKPFKGMGEARRMFSSRGRYNVPHITMNLGGDIVDTEVKD------- 143 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + DKA++ IK++ I + A + + ++ + Y + + D L Sbjct: 144 -----------LMDETDKARDEDIKVMAIGLGAKVDRDEI-ESIAYDRDQAYFMDDEDDL 191 Query: 399 IHVFQNISQLMVH 411 I + I + Sbjct: 192 IRKVKEIPDYLCK 204 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 64/252 (25%), Gaps = 50/252 (19%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 N+ Y + + + ++ IE+ E + V D S Sbjct: 347 NSTAYQLKRNIDTMLEYTCSG-MNSTCNLTEVDMRAR---AIELNEAGGLDVVFVFDASS 402 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVYMGLIGY 259 S+ KM + L ++ + S ++ I Y Sbjct: 403 SI-----------------------KMDDFRLGLDFSIELVKLLGTSWKPGGTHVAAITY 439 Query: 260 TTRVEKNIEPSWGTE-KVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 T + + + T S A+ + + Sbjct: 440 GTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVP---------- 489 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 QK + F+TDG +N S K K+ +I I + + Sbjct: 490 ------FTREGSQKALFFITDGHSNIGGS---PRKAAKILKDKGFQIYAIGVGKKVRRRE 540 Query: 378 LLKTCVSSPEYH 389 L++ + + Sbjct: 541 LMEIASEPEDEY 552 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 64/252 (25%), Gaps = 50/252 (19%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 N+ Y + + + ++ IE+ E + V D S Sbjct: 340 NSTAYQLKRNIDTMLEYTCSG-MNSTCNLTEVDMRAR---AIELNEAGGLDVVFVFDASS 395 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVYMGLIGY 259 S+ KM + L ++ + S ++ I Y Sbjct: 396 SI-----------------------KMDDFRLGLDFSIELVKLLGTSWKPGGTHVAAITY 432 Query: 260 TTRVEKNIEPSWGTE-KVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 T + + + T S A+ + + Sbjct: 433 GTESHLEFNLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQVVP---------- 482 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 QK + F+TDG +N S K K+ +I I + + Sbjct: 483 ------FTREGSQKALFFITDGHSNIGGS---PRKAAKILKDKGFQIYAIGVGKKVRRRE 533 Query: 378 LLKTCVSSPEYH 389 L++ + + Sbjct: 534 LMEIASEPEDEY 545 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 38/297 (12%), Positives = 86/297 (28%), Gaps = 25/297 (8%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + + + + + + FY + Sbjct: 270 VAIYPAEGTFWHDNPFIVMASTSADEQAAAERFYEFLLSEESQRAAMQYGFRPANPNVPL 329 Query: 173 IVSFIPALLRIEMGERPIFLIELV-VDLSGSMHCAMNSDPEDVNSAPICQDK-KRTKMAA 230 PA G + + + V ++ +N D+ + KM Sbjct: 330 TDPISPAFGADPQGVQTVLAVPSAEVIVAVKDSWKLNRKRADILLVVDTSGSMEGDKMTM 389 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K + FL I +GLI + ++ + + +E + Sbjct: 390 VKAGIETFLMRILPED------RLGLITFDSQARLVVPMAPLSENRIDLQIAVQEMRASG 443 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A+ A Q L + + I+ L+DG +N + + Sbjct: 444 RTALFDALDLARQTLEALPPAEDDRI--------------RAIVLLSDGADNASR--LTL 487 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++ + E+ I I ++ + + Q L S V ++ + +F+N+S+ Sbjct: 488 EEVRRQFDESGITIFPVAYGSDADRQVLDAIAEFSR-TIVVVGDSGDIAQIFENLSR 543 >gi|223694808|gb|ACN18090.1| von Willebrand factor type A [uncultured bacterium BLR5] Length = 347 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 56/194 (28%), Gaps = 17/194 (8%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-- 289 K A F+++I +I +T + + Q + R +L Sbjct: 112 KAAARSFIETI----IRSSKDQAAIIPFTDYAYLEQGLTPNVLAIYQALQRLEVALPSYV 167 Query: 290 KPTDSTPAMKQAYQILTS---DKKRSFFTNFFRQGVKIPSLPFQKF--IIFLTDGENNNF 344 + + + + G + P ++ II LTDG++ + Sbjct: 168 GSGRKISGISSGPGTIANPREGSTAIWDAVTVSAGEILTRSPGRRRRAIILLTDGQDTSS 227 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA----SPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + T DKA E I I I + L + + L Sbjct: 228 RVTRGT--AIDKALEAETVIYAIGIGDSKYEGIDKGALNNVAERTGGRAFFPKRGADLTS 285 Query: 401 VFQNISQLMVHRKY 414 VF I + + + Sbjct: 286 VFTEIEKELRSQYL 299 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGKKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 365 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 69/198 (34%), Gaps = 34/198 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K K+ NA+ F+D + V +GLI + V I P+ E + + Sbjct: 111 TSGSMKGDKIITAINAVKKFIDQ------TIDYVLIGLITFNDHVRIAIPPTSDQELLYK 164 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + ++ AY L + + IIF+TD Sbjct: 165 KLGEIK---AFGGTIYSKPLEIAYDWLVPFAEFNL----------------SPTIIFVTD 205 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP--------NGQRLLKTCVSSPEYHY 390 G + + + + A+ I I I I QRL + + Y Sbjct: 206 GLPYSQDAPLYREVVYKCAR-YNITIYPIFIETPGMSIYETMMAQQRLREIANITKGQFY 264 Query: 391 NVVNADSLIHVFQNISQL 408 NV +SLI++F+ +++ Sbjct: 265 NVKQTNSLINLFEKLAEK 282 >gi|118090156|ref|XP_420539.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 606 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 70/225 (31%), Gaps = 40/225 (17%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSHV 248 F + V+D SGS+ + + VN + T + + K L + Sbjct: 153 AFDMYFVLDKSGSVAQNWHEIFDFVNQLDGSGFVRLTERFVSPKMRLSFIV--------- 203 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++++ + + EK+++ + + T +KQA + Sbjct: 204 ----------FSSQAHVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQANMQIEKQ 253 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 F II LTDG + + + K ++E ++ + Sbjct: 254 GASRF----------------SSIIIALTDG-KLDGQIPLYAEKEAKISRELGARVYCVG 296 Query: 369 INASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHR 412 + Q L+ + E + V +L + ++ + Sbjct: 297 VLD--FVQAQLERIADTKEQVFPVTGGFQALKGIINSVLKQSCTE 339 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 66/225 (29%), Gaps = 62/225 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++ A K L L + +G I ++ + + Sbjct: 90 DTSRSMMTEDVYPNRLEAAKRTLENLLQGLK-------GDRIGFIPFSDSAYIQMPLTDD 142 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ + + T+ A++ A + + K Sbjct: 143 YSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN------------------K 184 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II L+DG + + KS K+N + + +I I Sbjct: 185 TIIVLSDGGDFDDKSL-------KFVKDNKMNVFSIGIGTDEGTIIPEYVNGKKVGFIKD 237 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQ 407 N L K S +Y V N D + F++ + Sbjct: 238 QKGSAVISKLNSDFLKKLSSESNGKYYEVNNLKDDTSNFFRDTAN 282 >gi|113682008|ref|NP_001038479.1| hypothetical protein LOC563353 [Danio rerio] gi|94732542|emb|CAK03688.1| novel collagen protein [Danio rerio] Length = 873 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 51/185 (27%), Gaps = 18/185 (9%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A ++ + ++ Y+ I S + Sbjct: 30 FETAKRWLINITKGFDVSSRHTQVAVVQYSDTPRLEIPLGKHQNSQELVEAVGSVSYLGG 89 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+K A + F P + + LTDG + + + Sbjct: 90 NTRTGRAIKFATD------------HVFGMPNHTSQSPRNRIAVVLTDGRSQDDVED--- 134 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLM 409 +A+ I + + + L+ + +V + +S+ ++ + Q + Sbjct: 135 --AAMEARAQNIVLFAVGVGNEITNSELVSMANKPASTYVLHVEDYNSIASIWDLMEQKL 192 Query: 410 VHRKY 414 Sbjct: 193 CEESV 197 >gi|296196207|ref|XP_002745750.1| PREDICTED: anthrax toxin receptor 2-like [Callithrix jacchus] Length = 663 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + +K+ + + + T Sbjct: 237 IYNFVQQLAER-FVSPEMRLSFIVFSSQASIILPLTGDRDKISKGLEDLKHVNPVGETYI 295 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 296 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 338 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 339 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 396 Query: 410 VHRKYSVILKG 420 + S + G Sbjct: 397 LELWPSSVCVG 407 >gi|291386646|ref|XP_002709870.1| PREDICTED: anthrax toxin receptor 1 [Oryctolagus cuniculus] Length = 564 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYFFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/250 (14%), Positives = 86/250 (34%), Gaps = 29/250 (11%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + +D SG++ C + + D+ TK L+N++ F ++ LS Sbjct: 711 NSSAKFYIDNSGTLKCRFQFVQNSCSPVYMKADEASTKSEVLQNSIARFAATLGYLSPGS 770 Query: 250 E--DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-------- 299 + + ++ + + T +V + ++ + + + + + Sbjct: 771 QIAMTRFSVDTFSNAECALLNWTNDTGEVTAAMNQEYGNPLAEGGRANQTLDGLRVYNYG 830 Query: 300 -----QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 Y+ + S + P +++I TDG++N+ + Sbjct: 831 ITGSTHTYRGIESYIENMTNGASGGYVPNAPQGNNSRYLIIFTDGKDNSGNLQKSMDDT- 889 Query: 355 DKAKENFIKIVTI-----SI---NASPNGQRLLKTC---VSSPEYHYNV--VNADSLIHV 401 D K N I+T+ + + + L + S +Y Y + + L+ V Sbjct: 890 DALKNNGYTIITVLMQSAGMTSEDVEHSTTFLKRLASSNASGEKYFYTAMYNDPEGLVKV 949 Query: 402 FQNISQLMVH 411 FQ+I+ + Sbjct: 950 FQDIAHEIAK 959 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 64/218 (29%), Gaps = 33/218 (15%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-------TKMAALKNA 234 RP ++ VVD SGSM + E + K +K+ + A Sbjct: 38 AEATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKIDLVIEA 97 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L L ++ + ++ + E + EK R + + T Sbjct: 98 LQNLL----SSPQLQPSDRLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGTQM 153 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M++ ++L + +I LTDG+ + Sbjct: 154 GAGMREGMRLLE-------------------REAGSRRLILLTDGQT---FDEPLVETVA 191 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + + I + I + N L + + ++V Sbjct: 192 AQLAQARIPVTAIGVGDEWNDDLLAEITDRTQGKPFHV 229 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 52/172 (30%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L + V +GL+ Y+ + + K + T + A++ Sbjct: 262 VSGLDVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEF- 320 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + ++ +TDGE S+ + D+ K + Sbjct: 321 ----------IRANSLTEMSGSRAKDGVPQIVVLVTDGE-----SSDEVQDVADQLKRDG 365 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 + + + IN + Q L K E + N L + + Q + Sbjct: 366 VFVFVVGINI-QDVQELQKIANEPFEEFLFTTENFSILQALSGTLLQALCST 416 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 45/172 (26%), Gaps = 18/172 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + L ++++ +GL Y+ S K A++ Sbjct: 59 ANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGNRMGQALQF- 117 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +F S + + ++ G + + + + Sbjct: 118 ----------ILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHI-----REPAEALRRAG 162 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVHR 412 I + I + + L + S + + V N L + Q + + Sbjct: 163 ILVYAIGV-KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCST 213 >gi|116625432|ref|YP_827588.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228594|gb|ABJ87303.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 61/189 (32%), Gaps = 31/189 (16%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ + ++ FL + LI ++ + + TE++ + + Sbjct: 106 NRLQDARQSVEQFLRTGSTGD------EYFLIRFSDEAKMLAPFTADTEEIARQLGSI-- 157 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++ A + +K ++ ++DG +NN + Sbjct: 158 -EAKGWTALNDSIVLAANQ------------------SRKARNHRKALLVISDGGDNNSR 198 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 V ++ +E +++ +SI Q L K C + V L + + + Sbjct: 199 YTVG--EMISILREADLRVYAVSIF--ERSQLLEKICEETGGRALWVRKLGDLPDIMERL 254 Query: 406 SQLMVHRKY 414 SQ M Sbjct: 255 SQEMRSEYI 263 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 21/202 (10%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID----- 243 P I +++D SGSM+C+ + + + A K+ A NAL F+ Sbjct: 111 PDAWIVVLIDFSGSMNCSQDLNTKCDAKAVAKGK---RKLDAAINALGTFIKLASERKGN 167 Query: 244 ------LLSHVKEDVYMGLIGYTTRVEKN----IEPSWGTEKVRQYVTRDMDSLILKPTD 293 ++ G Y +V K+ ++ D T+ Sbjct: 168 TYLSIVPFGVEGKNDKPGACDYYPKVTSETLDNFNLVQDV-KLTNFLGSLADKTPCATTN 226 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+K+ + +DK+ F+ P II L+DG +NN + Sbjct: 227 FYQALKETVKFFKNDKEGRFYPKDKEGKPLKPQPRLS--IILLSDGFDNNSNYQEVQKTL 284 Query: 354 CDKAKENFIKIVTISINASPNG 375 + I + T+ +P Sbjct: 285 ANLQNNKDIVVHTLGYGLTPQQ 306 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 20/198 (10%), Positives = 56/198 (28%), Gaps = 25/198 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T+ K +S ++ + + T + ++ Sbjct: 30 NSGSINDTEFDNFKEFAKKLAESFT---ISATYTHVAAVYFNTLANFGFNLKYDINVIKT 86 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + I T A+ + P + ++ LTD Sbjct: 87 AIDNLPN--IGGGTHIGKALTYTLDNVFKVA---------------PRQNVKNVLVVLTD 129 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G++++ +V + +++ + + A + L S P ++ ++V + Sbjct: 130 GKSHD---SVTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFS 186 Query: 397 SLIHVFQNISQLMVHRKY 414 + + + KY Sbjct: 187 QIESTTGAVEDEICKGKY 204 >gi|148656642|ref|YP_001276847.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568752|gb|ABQ90897.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 452 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/279 (11%), Positives = 76/279 (27%), Gaps = 46/279 (16%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 L ++ + ++ +LLR + + V Sbjct: 31 LTVRAAATGDAGVRPVNWALVADASRSMRIPIVDETQF---RSLLRNGSAQETLVDGVPV 87 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 LSGS+ + + + + + +AL ++ +D + L Sbjct: 88 WQLSGSVPQEV-------------RKAASSALDHVVHALHTVVERLDRND------RLSL 128 Query: 257 IGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + + I + + + R + T+ + A + +++ Sbjct: 129 VVFADHALLLIPGMVGSDRVTLVRAIERLPGLDLGDGTNLADGIALALNQIRANR----- 183 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 ++ LTDG + + + D+A + I I TI + Sbjct: 184 -----------DARRANRVLLLTDGFT---RDPAACLTLADQAADEHIAITTIGLGGEFQ 229 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 L S + A ++ IS + + Sbjct: 230 DDLLTGIADRSGGNALFLKRASAIPRA---ISAELESAR 265 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 76/211 (36%), Gaps = 37/211 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 + ++ LK A LFL L ++ + G++ + ++ ++ V Sbjct: 310 VSGSMRDFNRIHRLKQAAELFL-----LQIIETGSWAGIVVFNSKASTKAFLQQITSDSV 364 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ ++ + ++ T +++ +Q+ + I+ L Sbjct: 365 RQTLSDHLPTVADGGTSICSGIREGFQVFLQKYSSTEGCE----------------IVLL 408 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 TDGE+++ S C + + + I TI++ S + + + ++ Sbjct: 409 TDGEDSSVSS-------CFAEVERSGSTIHTIALGPSAAKELEM-LANMTGGLTFSATDS 460 Query: 394 -NADSLIHVFQNISQ---LMVHRKYSVILKG 420 +++ LI F IS + + + KG Sbjct: 461 LDSNGLIDAFSGISSGSGDISQQSIQLESKG 491 >gi|301613500|ref|XP_002936246.1| PREDICTED: collagen alpha-1(XII) chain-like [Xenopus (Silurana) tropicalis] Length = 3127 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 49/366 (13%), Positives = 110/366 (30%), Gaps = 43/366 (11%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A A K +++L L F + + + D+ + + ++E LS + E Sbjct: 278 AADAKELKQIASLPSLKHVFNVANFDSIVEVQDEIITQVCSGVEEQLSELVSGEEAVEPP 337 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMD------VMTSYDYRL-QFIEHLL---NQRY 169 + + + IS M S + Y + + L + Sbjct: 338 SDMTVTEISSKSMKITWRPSPSQITGYRIQLLPMLAGSKQHSLNLGPQTTSINVKDLSPD 397 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + ++ ++ PI +E + + C+ D + + Sbjct: 398 TEYQINL--YAMKGLTASEPISTLEKTQAVKIKVECSGGVDIKADVVLLVDGSYSIGVAN 455 Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-L 287 K + FL+ + V + L+ Y+ + + + + Sbjct: 456 FAK--VRAFLEVLVKSFDISPSKVQISLVQYSRDPFTEFTLNR-YDSIEDILKAVNTFPY 512 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ AM + + + K + +I +TDG KS+ Sbjct: 513 RGGSTNTGKAMTYVREKVFVEIKG-------------ARPNVPRVMILITDG-----KSS 554 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K + + ++I + + + + L+ S P + Y V + FQ I Sbjct: 555 DAFKDPAIKLRNSDVEIFAVGVKDAVRSE--LEAIASPPSDTHVYTVEDF----DAFQRI 608 Query: 406 SQLMVH 411 S + Sbjct: 609 SFELTQ 614 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ + ++ + V + L Y+ + + + K Sbjct: 1216 RTVRSFIARLVEVFDIGSDRVQIALAQYSGDPRSEWQLNAHSTKKSLMDAVANLPYKGGN 1275 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ IL ++ K +K + +TDG++ + Sbjct: 1276 TLTGMALNF---ILQNNFKAEAGM----------RAKSKKIGVLITDGKSQDDIV----- 1317 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K ++ I + I I + L+ S P+ Y +NV + L+++ +++ + Sbjct: 1318 APSKKLRDQGIDLFAIGI--KNADENELRQIGSDPDETYVFNVADFSLLVNIVDDLTTNL 1375 Query: 410 VH 411 + Sbjct: 1376 CN 1377 >gi|301065602|ref|YP_003787625.1| hypothetical protein LCAZH_0484 [Lactobacillus casei str. Zhang] gi|300438009|gb|ADK17775.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei str. Zhang] Length = 909 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/418 (7%), Positives = 101/418 (24%), Gaps = 53/418 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + ++ + A + ++ G + Sbjct: 14 LFAVLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 70 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE----MGERPIFLIELVVD 198 + + ++ + + + V Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVRTGVKNFLTSIQNAGLGDYVNVGL 189 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S + + I L +S + Sbjct: 190 IGFSSPGYIGGGNKTTGPGYIRVG----------------LGKAGNISQQQAINSALSPT 233 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I G+ +++ + + K S N Sbjct: 234 FNGGTYTQIGLRQGS--------AMLNTDTSGNKKMMILLTDGVPTFSDKVKNSVVENGT 285 Query: 319 RQGVKIPSLPFQKFII------FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 G + + ++ + + + T+ KAK++ ++ + I Sbjct: 286 LYGTNFGTTRDEPGYTAELKNPYIDSSGDYIYDTWPATLGEAKKAKDSGNEVHALGIQLA 345 Query: 371 -----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + ++ ++ +SP+ + + +AD++ N ++ +V ++I Sbjct: 346 GDDGYRSEGYMSDEDVRQNMELIPTSPDLYEDADSADAVEAYLNNQAKDIVKNFNTII 403 >gi|73695956|gb|AAZ80784.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 58/189 (30%), Gaps = 17/189 (8%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K A+ + + V + +I Y + I + T + Sbjct: 55 SSSSLRNRDFKTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDE 114 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + + K ++ + + + + + K I +TDG Sbjct: 115 VIEAIGKIPHK-----AGLRT------DTGRAIQYMHEAQLANGVVRPGVTKVSIVITDG 163 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + T + ++A+++ I + I + LL V N + L Sbjct: 164 ---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLS 219 Query: 400 HVFQNISQL 408 + ++++ Sbjct: 220 SIKESLAHQ 228 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/318 (9%), Positives = 79/318 (24%), Gaps = 70/318 (22%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 A + + ++ + + N +++ ++ Sbjct: 118 PALDENASEEEIQHLFNQFLYIVGYDYTPFETIDRFSYVIFKNDMENPFTHEKIEENMNV 177 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 + + L ++ + ++ ++ ++ + E P V + H N+ + Sbjct: 178 NVEIVLDASGSMVKKIGDKTMMEIAKESIKKVLSEMPAN---AKVGIRVFGHKGDNTASK 234 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 S + + + Sbjct: 235 KDESCGSNEL----------------------------------------IYPIGDL--- 251 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V T +++ + L + Sbjct: 252 --NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK----------------ALDGEKTLNI 293 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNGQRLLKTCV-SSPEYH 389 + +TDG N ++I + K I + I N N RLLK ++ Y+ Sbjct: 294 LYIITDGIETCG---GNPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYY 350 Query: 390 YNVVNADSLIHVFQNISQ 407 +V +A+ L I++ Sbjct: 351 SSVNDANKLTGELYRINE 368 >gi|300869050|ref|ZP_07113652.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300332961|emb|CBN58846.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 411 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 55/193 (28%), Gaps = 29/193 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMD 285 + +K A +D + + +I + + + + +++ + + Sbjct: 59 LDTVKQAAGRLVDRLKPGD------RLSVIAFDHKAKVIVPNQFIDDPGSIKKQIDKL-- 110 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T +K + + K + F LTDGEN + Sbjct: 111 -RSSGGTAIDEGLKLGIEEMGKGKSETVSQAF-----------------LLTDGENEHGD 152 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +N + A + + + ++ N L K ++ + + I F + Sbjct: 153 NNRCLK-LAKLAADYNMTLNSLGFGDDWNQDILEKIADAAGGTLAYIQRPEQAIDEFSRL 211 Query: 406 SQLMVHRKYSVIL 418 + + Sbjct: 212 FTRLQSVGLTNAY 224 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 80/243 (32%), Gaps = 49/243 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 ++ +Q++++ + + + R + L++D SGSM K Sbjct: 16 ASKSSSQRLIAISVSAIGETIDSRVPLNLCLILDHSGSM--------------------K 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 + +K A L +D + + ++ + R E + + ++Q + R Sbjct: 56 GQPVENVKRAAWLLVDKLRDQD------RLSIVVFNHRAEVLLSNQNVVDRDHIKQQINR 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T ++ + L ++ + F LTDGEN Sbjct: 110 LS---ANGGTSIDEGLRLGIEELAKGRRDTISQAF-----------------LLTDGENE 149 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + +N A + + + T+ + N L K + ++ + D + F Sbjct: 150 HGDNNRCLK-FAQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHPDQAVDKF 208 Query: 403 QNI 405 ++ Sbjct: 209 NSL 211 >gi|73695954|gb|AAZ80783.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 58/189 (30%), Gaps = 17/189 (8%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K A+ + + V + +I Y + I + T + Sbjct: 55 SSSSLRNRDFKTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDE 114 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + + K ++ + + + + + K I +TDG Sbjct: 115 VIEAIGRIPHK-----AGLRT------DTGRAIQYMHEAQLANGVVRPGVTKVSIVITDG 163 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N + T + ++A+++ I + I + LL V N + L Sbjct: 164 ---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLS 219 Query: 400 HVFQNISQL 408 + ++++ Sbjct: 220 SIKESLAHQ 228 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 66/226 (29%), Gaps = 59/226 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D + ++ K + + V +GL+ ++ + Sbjct: 96 DISNSMMVEDAQPNRLERAKREVADLIR-------VATGDRLGLVAFSGVAFTQCPLTLD 148 Query: 273 TEKVRQYVTRDMDSLIL---KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + ++ ++ + L+ + TD A++ Sbjct: 149 YQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMTAFDPKSSTD----------------- 191 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------- 373 K I+ +TDGE+N +K +KA + I+I + I Sbjct: 192 -KVILLITDGEDN----EEAGLKAAEKASDEGIRIFVLGIGDPAGGPVPSLDGSGFEKDA 246 Query: 374 ---------NGQRLLKTCVSSPEYHYNVVNAD-SLIH-VFQNISQL 408 + L + + D L F I + Sbjct: 247 GGKIILSKPDESTLQAIANETGGDYIRSEAGDFDLDQLYFNGIKKK 292 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 62/173 (35%), Gaps = 24/173 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+I Y+ +VEK + K + D + + T + A++ A + Sbjct: 836 RIGIINYSHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAANDMF------- 888 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K +K + +TDG+ + + + A ++ ++I I + Sbjct: 889 ----------KEARPGVKKVALVITDGQ-TDSRDKKKLADVVKDANDSNVEIFVIGVVKK 937 Query: 373 PNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 + + + + E+ Y + +L + +S+ + S +++ Sbjct: 938 DDPNFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKICEDFDSYLIQ 990 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 60/166 (36%), Gaps = 26/166 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + + I L+ + D+ + + +++ V+ + S +Q V Sbjct: 64 FDNQKDFVDSLSEKIFQLTPGRSLKYDIKLAALQFSSSVQIDPPLSSWKDLRTFKQRVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 124 LN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + I + A+ I +T+ ++ N + L+ P Sbjct: 166 PKSPD--VQSISEDARILGISFITVGLSTVVNEAK-LRLISGDPSN 208 >gi|325673441|ref|ZP_08153132.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] gi|325555462|gb|EGD25133.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] Length = 321 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/239 (7%), Positives = 58/239 (24%), Gaps = 16/239 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R+ + E+ ++ A+ ++ L + + + K +++ A A Sbjct: 1 MRWVRSRMSNDERGVVAVFVAILMVVLLGCAAISVDIGANYVVKRQLQN----GADAAAL 56 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S S + +A ++ + I + + Sbjct: 57 AVAQESSCKAGSSASSVSRLVQANVNSSSASSAAVIDGVKRKVTVTASAVGDDGLAGRRN 116 Query: 127 ISMTHMANNRLDSSNNTIF---YNMDVMTSYDYR-----LQFIEHLLNQRYNQKIVSFIP 178 + + +R + S + + + L + Sbjct: 117 VFAPVLGVDRSEISASATASCVFPLGGTAELPLTFHKCHFDESRSL--DVKILVAYNVTA 174 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--MAALKNAL 235 G + ++G +++ K + +NA+ Sbjct: 175 PRCNGTSGNAAPGNFGWLQGVNGRCPAKIDAAVYATPGDTGNNIPGPCKDTIKQFQNAV 233 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 18/162 (11%), Positives = 47/162 (29%), Gaps = 26/162 (16%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN------------N-- 58 K + SE+ +I+FAL++ L + + H + +++A + Sbjct: 11 KRLQSERGGAAILFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSLSDSG 70 Query: 59 ----------AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE--SL 106 +A L A + V+N + + D + + + + + Sbjct: 71 GNPYNWSAAGSAALDIARRNVANGAGIQDALIETGYWNIQDPSEGLRAPGTPGVPAAGDV 130 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 + + N + D +++ Sbjct: 131 AAVQVTITISRTLNNGPLRLFFAPVLGIAEQDVQGSSVAVIA 172 >gi|3273249|dbj|BAA31168.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273271|dbj|BAA31179.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 60/193 (31%), Gaps = 33/193 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ LK+AL + + + ++ Y + + G ++ Q + Sbjct: 226 KLPLLKSALRQLVTQMRAQD------RVAIVAYAGSAGLVLPSTPG-DRHAQILAAIDGL 278 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ ++ AY + I+ +DG+ N ++ Sbjct: 279 QASGSTNGGAGLELAYAEAAKGLVKDGVN----------------RIVLASDGDFNVGRT 322 Query: 347 NV-NTIKICDKAKENFIKIVTISINASPNGQRL-LKTCVSSPEYHYNVVN--------AD 396 ++ ++ I + T+ + + + ++ + ++ + + A Sbjct: 323 DLAQLKDYVGSQRKRGIALTTLGLGSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVFAS 382 Query: 397 SLIHVFQNISQLM 409 L I++ + Sbjct: 383 ELSATLLTIAKDV 395 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 71/200 (35%), Gaps = 15/200 (7%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 M A K A F + + ++ + +I + + + + +++ + + Sbjct: 199 MTAQKEAAKKFCED-TIGTNKNSNHKFAVITLDSGSKTLTDFTNDIDELDSAIAKTT--- 254 Query: 288 ILKPTDSTPAMKQAYQILTS-DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + A++ A ++L+ G L+D + + + Sbjct: 255 AYGSTNYSAALRNAAELLSKVSADAVRNIVLCSDGNPYGGEEKSTGKYTLSDYSDYEYAN 314 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL------LKTCVSSPEYHYNVVNADSLIH 400 I + K+ +I T+ S +G+ L LK S + V D L Sbjct: 315 AAY--DIAQEIKK-DYEIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNYAEVNKVDDLQK 371 Query: 401 VFQNIS-QLMVHRKYSVILK 419 VF +++ + ++K S + K Sbjct: 372 VFADVAGNAVSYKKVSSVTK 391 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 79/234 (33%), Gaps = 51/234 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +P+ I LV+D SGSM+ + + A KN + Sbjct: 108 SAKPVKDIVLVIDNSGSMNETD---------------PNQDRYTAAKNLINRM------- 145 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 D + ++ + + + + + ++D L TD + A++ Sbjct: 146 ---DRDNRVSVMVFDHATTLLQPFTRVKNQETKDEIIAEIDGLATNDGGTDISLALEDTM 202 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + +I L+DG + V K+ I Sbjct: 203 SHIQESRDAGRSA----------------MVIMLSDGFSETDHDRVLAEY-----KQQQI 241 Query: 363 KIVTISIN-ASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + TI ++ +P+G +LL+ + +Y+V +A+ L VFQ I + R Sbjct: 242 AVNTIGLSLVNPDGAQLLQTIAAETGGQYYDVQHAEDLSFVFQKIYDDVGDRSL 295 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 61/178 (34%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D +G++ Y+ R E + + + T++ Sbjct: 68 QWVANLVDTFEVGPGHTRVGVVRYSDRPTTAFELGHFSSREEVKAAARRITYHGGNTNTG 127 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + + + G + + F++ I LTDG + + + Sbjct: 128 DALRY----------ITSRSFSAQAGGRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-- 175 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + A+ + L S P+ + ++V + +++ + + + + Sbjct: 176 ---AAGIRIFAVGVGAALKEE--LDEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 228 >gi|306820467|ref|ZP_07454103.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551542|gb|EFM39497.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 538 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 70/241 (29%), Gaps = 49/241 (20%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++ E +E+V+D SGSM +N ++ M KN++ Sbjct: 179 PFTGEKVNENKQVNVEIVLDASGSMAKQING---------------QSMMNIAKNSITEV 223 Query: 239 LDSIDL--------LSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLIL 289 L + H + G + E + T + + ++ Sbjct: 224 LKHLPKNAKVGLRVFGHKGNNTDSGKTESCSANELIHPIETLNTSAISKALSSV---EAT 280 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T ++K + L+ K + +TDG Sbjct: 281 GWTSIADSIKNGGEDLS----------------KFKEEGAVNILYIVTDGIETCGGD--- 321 Query: 350 TIKICDKAKENFIKIV--TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNIS 406 I+ K + +V I N + +LK + ++ +A +L I+ Sbjct: 322 PIEAAQTLKNSGTNVVLGIIGFNVNATQDAVLKKIAEAGGGHYAIANDAGTLTSELYKIT 381 Query: 407 Q 407 + Sbjct: 382 E 382 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 72/210 (34%), Gaps = 35/210 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV- 276 ++ + A FL I ++ ++G++ ++++ E + Sbjct: 315 VSGSMSSSDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIVHELIQMNSDIE 369 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + + + + T +K A+Q+ + + ++ T I+ L Sbjct: 370 RNKLLQTLPTSAIGGTSICSGIKTAFQVFKNGEYQTDGTE----------------ILLL 413 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DGE + D+ K++ + I++ S + + + + H + Sbjct: 414 SDGE------DSTAKDCIDEVKDSGSIVHFIALGPSADLA-VTNMSILTGGNHKLATDEA 466 Query: 397 S---LIHVFQNISQL---MVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 467 QNNGLIDAFGALASENTDITQKSLQLESKG 496 >gi|227818616|ref|YP_002822587.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] gi|36958872|gb|AAQ87297.1| Hypothetical protein RNGR00523 [Sinorhizobium fredii NGR234] gi|227337615|gb|ACP21834.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] Length = 140 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 36/105 (34%), Gaps = 1/105 (0%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-A 65 + ++ S+ ++ FAL LLL+ +I Y +N M A + A Sbjct: 3 LSLFLRRAWRSQSGATAVEFALVCFPLLLLVLGVIEFGRAFYVRNDMSYAADVAAREVLI 62 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 K+ + + + S D R K+++ G Sbjct: 63 GKIARDAPDSEAQAKLASAVRDSFDSGDPARLEIAVTKQTVDGID 107 >gi|224080732|ref|XP_002192406.1| PREDICTED: anthrax toxin receptor 1 [Taeniopygia guttata] Length = 516 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 70/219 (31%), Gaps = 20/219 (9%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L D G DK + + N + F++ + + Sbjct: 19 AIFTLTCDGHGGKRENGGPACYGGFDLYFILDKSGSVLHH-WNEIYHFVEHLARK-FISP 76 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + M I ++TR + + E++RQ + L T ++A + + Sbjct: 77 QLRMSFIVFSTRGTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY---- 132 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + II LTDGE + + + +++++ + + + Sbjct: 133 ----------YENVHGYRTASVIIALTDGELHEDLFF-YSEREANRSRDLGATVYCVGVK 181 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 + L S ++ + V + +L + +I + Sbjct: 182 D--FNETQLARIADSRDHVFPVNDGFQALQGIIDSILKK 218 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/286 (9%), Positives = 75/286 (26%), Gaps = 46/286 (16%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + I +L + + + + + V Sbjct: 151 SSAPVAPLPTIPSPPNPTVQLTTPPAPLATITPEVLPVETSAPATLSVSTFTTTVSAVET 210 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + ++D S S+ ++ +K Sbjct: 211 ETDSSCLS----RPLDLVFIIDSSRSVR--------------------PSEFEKVKI--- 243 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDS 294 D +D L + + ++ Y + V+ +++ ++R T + Sbjct: 244 FLADMVDTLDVGADATRVAVVNYASTVKTEFLLKDHFNKPNLKKAISRIEPLAT--GTMT 301 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K A + G + K I +TDG + ++ Sbjct: 302 GLAIKTAVS----------EAFTEQSGARPRPRNIAKVAIIVTDGRPQDQ-----VEEVS 346 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 A+ + ++I + ++ + L V ++ + V + Sbjct: 347 AAARASGVEIYAVGVDRADMRSLQLMASVPLEDHVFYVETYGVIEK 392 >gi|218548279|ref|YP_002382070.1| hypothetical protein EFER_0898 [Escherichia fergusonii ATCC 35469] gi|218355820|emb|CAQ88433.1| conserved hypothetical protein; putative exported protein [Escherichia fergusonii ATCC 35469] Length = 543 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 79/253 (31%), Gaps = 45/253 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I I + P + ++D SGSM+ Sbjct: 157 PWNEKHTLLRLDIA-ANDIARSKLPPTNLVFLIDTSGSMN-------------------S 196 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K++L L ++ + + ++ Y + + G EK + Sbjct: 197 DERLPLIKSSLKLLVNELRDQD------RISIVTYAGSARLLLSSTSGAEK-NTILNAIA 249 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ + AY+ + + I+ TDG+ N Sbjct: 250 NLQAGGGTNGGAGVAMAYEQAQAGYIKGGVN----------------RILLATDGDFNIG 293 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + K +E+ I + T+ + + + ++ + +Y DSL + Sbjct: 294 DDPSSVEDLVKKQRESGITLSTLGVGDNNYNEAMMVKIADTGNGNY--SYLDSLSEAQKV 351 Query: 405 ISQLMVHRKYSVI 417 +S M +V Sbjct: 352 LSNEMNQTLVTVA 364 >gi|87199929|ref|YP_497186.1| hypothetical protein Saro_1912 [Novosphingobium aromaticivorans DSM 12444] gi|87135610|gb|ABD26352.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 435 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/278 (9%), Positives = 64/278 (23%), Gaps = 14/278 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L + F++ + + N +++ A + + GF + W+ K M+ A + A L+ Sbjct: 8 LPKSISLFRRLLRARAGNATLLMAFGAPALIATAGFAVDTAQWYLWKREMQYAADQAALS 67 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A +S ++ + + + +G + Sbjct: 68 AAYSKSKGISTTAYETHAVQEYNANLQLVTFSDTPTVSLAA-YNGGTNNSVVVRASATRE 126 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI-PALLR 182 + + + S + TS I+ Sbjct: 127 LAFSGIVLGKPTTVSVSAQATYTAGATYTSCIIATNATADGAITIGGSSILKSGCGIAAL 186 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 +D++ + D + N+ +K + D Sbjct: 187 SNSTNAIKVDGSPTIDVNYVLAAGGIDDWFNTNTD-----------DVVKEYVTSLADPF 235 Query: 243 DL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + GT Sbjct: 236 ASLTPPTNNTAATPYRCVRSGGVTQATLNPGTYTDITT 273 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 60.3 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 67/228 (29%), Gaps = 51/228 (22%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 L + L+ N K+ S + + R + I +D+S SM Sbjct: 32 ASGFKAALKVTILLLGMTFLIISLTNPKMGSKLKTIKREGV------DIVFALDISKSM- 84 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + +D ++ K + +D + +G+I Y Sbjct: 85 --------------LAEDIAPNRLEKSKQIISKIIDRLGS-------DRVGIIVYAGNSY 123 Query: 265 KNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + ++ D + + T A++ A Sbjct: 124 PLLPITTDHAAANMFLQNANPDMVSSQGTAINEALELAKTYYN----------------- 166 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +F+I L+DGE++ T ++ N +KI TI + Sbjct: 167 -NDEQTNRFLIILSDGEDHQ----EETKQVAQNLANNGVKIYTIGVGT 209 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 60.3 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 61/222 (27%), Gaps = 20/222 (9%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL-------FLDSIDLLSH 247 + ++G C + P + P D + + K + +DS + Sbjct: 15 FLGSVTGQDSCVQVASPTWAATCPCDSDGQYSFTMEGKTTVTANTVNVAVIIDSSGSVDD 74 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST----PAMKQAYQ 303 + D+ M + T + + S T + A Sbjct: 75 DEWDMSMAFAKDAVSSFADQNL--FTNGGSASIAQFSSSASEGGTFYSLEDFNAFVDGNT 132 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +S + + S F+I TDG+ + + D A++ Sbjct: 133 KYSSGGTDIIDGIAKGRELLKASPATTSFMIVTTDGQ------SSSPKAEADAARDEGTI 186 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + P + LL ++V D L I Sbjct: 187 VYAVGVGTGPTQEILL-DIGGEEANVFDVDGFDELDVALAGI 227 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 77/267 (28%), Gaps = 75/267 (28%) Query: 172 KIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ + ++ I +D+S SM + +D ++ Sbjct: 69 LSIALANPKIGTKIETVKREGVDIVFAIDVSKSM---------------LAEDVAPNRLE 113 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLI 288 K + + +G++ Y + + + ++ D L Sbjct: 114 KAKRIAFETISQLK-------GDRVGIVAYAASAYPQLALTTDHSAAKMFLQGMNTDMLS 166 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T A++ A + + + L+DGE++ + Sbjct: 167 SQGTAIQEAIRMASNYFDDKTPTA------------------RLLFILSDGEDH----EM 204 Query: 349 NTIKICDKAKENFIKIVTISINA------------------SPNGQ--------RLLKTC 382 +I +A+E + I TI I NG+ LL+ Sbjct: 205 GATEIASEAQEKGVHIYTIGIGTEKGAPIPIKEGGGQTYKRDSNGEVVITKLNSELLQQI 264 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 S+ Y N D+ I++++ Sbjct: 265 ASNAGGEYL--NGDNTQKTVSQINKVL 289 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/296 (9%), Positives = 69/296 (23%), Gaps = 42/296 (14%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + + + + + + N L + + + Sbjct: 48 DTIGVDKTAKRTPGCRTFEVNLNITGELQQAPVDVVLVLDRSGSMN------FVETPNSP 101 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 I E V+ +G ++ + + Sbjct: 102 TRLDYGKLAAINFAERVLGPNGIPGSRVSVVSFSGPAYATGVRNNPQRH----------- 150 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 G + +++ S V + R T++ + Sbjct: 151 -----------------YGQQNQATTDLDLSSDLRAVTDSINRIT---AFGGTNTEAGFE 190 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 Q ++ + + LP + + N I + Sbjct: 191 QGRSVIEGTTSNQNPNSNKVVIMLTDGLPTASNGNPYAETTDINHVHIQRAINAGKNIYQ 250 Query: 360 NFI-KIVTISI--NASPNGQRLLKTC--VSSPEYHYNVVNADSLIHVFQNISQLMV 410 N I + TI + + + L + + +Y +A L +F+ ISQ + Sbjct: 251 NDIADVFTIGLTTGMNATEKALADNILTQAQNKGYYPAPSATDLDAIFEEISQRLG 306 >gi|221109964|ref|XP_002168937.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 221 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 66/216 (30%), Gaps = 25/216 (11%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + A+ ++S+ + + + LK+ + F Sbjct: 11 DIFCFYPYCRTMFQPTCEAVVDIVFLLDSSASLRKYYQNEKDFLKSLISAF-------GV 63 Query: 248 VKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +I ++ + +I+ + ++ V + L+ T A++ A + + Sbjct: 64 SINSTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIV--LMGSTTRIDKALRLAQKEV 121 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K + LTDG + + N + I ++ + + I+ Sbjct: 122 FELENG-------------ARPGVAKILFLLTDGSQTQERGSENPVVIANELRSAGVTII 168 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I I + + L E Y + + L V Sbjct: 169 VIGITNAVDVSELF-DIAGGEENAYFADSFEKLKDV 203 >gi|332226724|ref|XP_003262542.1| PREDICTED: anthrax toxin receptor 1 isoform 2 [Nomascus leucogenys] Length = 333 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|297266186|ref|XP_002799329.1| PREDICTED: anthrax toxin receptor 1-like [Macaca mulatta] Length = 484 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|158257658|dbj|BAF84802.1| unnamed protein product [Homo sapiens] Length = 333 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|114577872|ref|XP_001136716.1| PREDICTED: anthrax toxin receptor 1 isoform 5 [Pan troglodytes] Length = 321 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|114577870|ref|XP_001136388.1| PREDICTED: tumor endothelial marker 8 isoform 1 [Pan troglodytes] Length = 326 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|114577874|ref|XP_001136473.1| PREDICTED: tumor endothelial marker 8 isoform 2 [Pan troglodytes] Length = 297 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|114577864|ref|XP_525774.2| PREDICTED: anthrax toxin receptor 1 isoform 6 [Pan troglodytes] Length = 564 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + E +GL+ +++RV Sbjct: 23 FELVKRFVNQIVDFL---DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEY 79 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A++ + S+ + + + + TDG + + S Sbjct: 80 MERGTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV 129 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 130 -----WAARAKEEGIAMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLRNL 182 Query: 406 SQLMV 410 + Sbjct: 183 RSSIC 187 >gi|62898107|dbj|BAD96993.1| tumor endothelial marker 8 isoform 3 precursor variant [Homo sapiens] Length = 317 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|16933551|ref|NP_444262.1| anthrax toxin receptor 1 isoform 2 precursor [Homo sapiens] gi|114577866|ref|XP_001136564.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Pan troglodytes] gi|332226726|ref|XP_003262543.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Nomascus leucogenys] gi|16566413|gb|AAL26496.1|AF421380_1 anthrax toxin receptor [Homo sapiens] Length = 368 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|16933553|ref|NP_060623.2| anthrax toxin receptor 1 isoform 3 precursor [Homo sapiens] gi|114577868|ref|XP_001136638.1| PREDICTED: anthrax toxin receptor 1 isoform 4 [Pan troglodytes] gi|15082333|gb|AAH12074.1| Anthrax toxin receptor 1 [Homo sapiens] gi|190690489|gb|ACE87019.1| anthrax toxin receptor 1 protein [synthetic construct] gi|190691863|gb|ACE87706.1| anthrax toxin receptor 1 protein [synthetic construct] gi|312150350|gb|ADQ31687.1| anthrax toxin receptor 1 [synthetic construct] Length = 333 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|62988680|gb|AAY24067.1| unknown [Homo sapiens] Length = 234 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|62870691|gb|AAY18344.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 4 [Homo sapiens] Length = 411 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|62870685|gb|AAY18341.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 1 [Homo sapiens] Length = 387 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|62870689|gb|AAY18343.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 3 [Homo sapiens] Length = 410 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|62870687|gb|AAY18342.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 2 [Homo sapiens] Length = 419 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|14149904|ref|NP_115584.1| anthrax toxin receptor 1 isoform 1 precursor [Homo sapiens] gi|332226722|ref|XP_003262541.1| PREDICTED: anthrax toxin receptor 1 isoform 1 [Nomascus leucogenys] gi|17366074|sp|Q9H6X2|ANTR1_HUMAN RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|14017381|gb|AAK52094.1| tumor endothelial marker 8 precursor [Homo sapiens] Length = 564 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 61/181 (33%), Gaps = 45/181 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I VD+S SM + +D +++ + + ++++ Sbjct: 91 DIVFAVDISKSM---------------LAEDIAPSRLEKSQQLVTQIINNLAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI Y + + + ++ D + + T A++ A + Sbjct: 129 DRIGLIAYAGSAVPQLPITTDYSSAKMFLQSMNTDLVSSQGTAIAEAIQLAESYYSE--- 185 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 K ++ ++DGE++ ++ + A EN I+I+TI + Sbjct: 186 ---------------DTEASKVLVIISDGEDHEGEALDY----AEAAAENGIRIITIGVG 226 Query: 371 A 371 Sbjct: 227 T 227 >gi|119774146|ref|YP_926886.1| type IV pilin biogenesis protein [Shewanella amazonensis SB2B] gi|119766646|gb|ABL99216.1| type IV pilin biogenesis protein, putative [Shewanella amazonensis SB2B] Length = 1148 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/289 (9%), Positives = 67/289 (23%), Gaps = 47/289 (16%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E + + + + RT++ K A+ Sbjct: 176 SSGSKEATKELAMSKAKNTGFGTGRVVTLYTKTYLTWYHSKKKQVNRTRIDIAKEAVTNV 235 Query: 239 LDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 L + + + Y T ++ + +D L T Sbjct: 236 LLTTPGVDFGLAIFNSNVYEGYDDGGRIIAGIKPATASNKKDLIEAVDLLKGTTWTPLCE 295 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSL--------------PFQKFIIFLTDGENN 342 + +AY+ + + + + + + +I+++TDGE Sbjct: 296 TLYEAYRYFSGGEVWFGDDDPTLKPYRDKDAIDANSRYKSPFKTCQNRAYIVYVTDGEPT 355 Query: 343 N------------FKSNVNTIKICDKAK-----------------ENFIKIVTISINASP 373 + T + + TI + Sbjct: 356 RDSNANQLVYDLTGGVDAYTSSPASYLSSLSSWMNTQDVNPNMTGKQSVSTYTIGFSQGA 415 Query: 374 NGQRLL--KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 L T +Y+ N D L + + ++ + S Sbjct: 416 ASAAGLLRHTAEKGGGKYYDATNVDDLQKSLMQVFKNILEKNASFTAPA 464 >gi|47214619|emb|CAG01460.1| unnamed protein product [Tetraodon nigroviridis] Length = 1723 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 73/240 (30%), Gaps = 31/240 (12%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 L+ + +++ + S+ C++ D + + K + Sbjct: 400 VSLYALKGLTPSEAVSVLQKTEPVKVSVECSLGVDVQADVVLLVDGSYSIGLANFAK--V 457 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTD 293 FL+ + + + V + L+ Y+ + VT T+ Sbjct: 458 RAFLEVLVNTFDIGPDKVQISLVQYSRDPHTEFYL-DSHHNLEAVVTALRTFPYRGGSTN 516 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + AM + + + + I +TDG KS+ + Sbjct: 517 TGRAMTYVRETVFQASRG-------------ARAHVPRVTILITDG-----KSSDAFQEP 558 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 + + ++I + + + + L+ ++P + Y V + FQ IS + Sbjct: 559 AANLRNSDVEIFAVGVKDAVRSE--LEAIANAPAETHVYTVEDF----DAFQRISTELTQ 612 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/288 (9%), Positives = 72/288 (25%), Gaps = 31/288 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 T D+ DY + +Q +P ++ R ++ L Sbjct: 42 PAATKTSISDLSADVDYVVSISAFAGSQES-------LPISGQLTHARRTCGILHLASSQ 94 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAA----LKNALLLFLDSIDLLSHVKEDVYMG 255 S + +A + K+ + ++ +G Sbjct: 95 SFCCFYPVLFPECSAGAAADLLFLVDGSWSVGKANFKHIRAFLSAAASAFQIGQDRTRVG 154 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST----PAMKQAYQIL------ 305 ++ Y + + + T + P Sbjct: 155 VVQYGSDARVEFRLDAHPSRPALLRAIGTLPYMGGDTRTGSRRSPGRCWLGSNYVVHPFS 214 Query: 306 -TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + F F K ++ +TD KS + + +++ Sbjct: 215 RSPGHALKFLLEKSLTEEAGARKDFPKVLVVVTD-----SKSVDPVENSAGRLRSAGVEV 269 Query: 365 VTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 + + + + +K S+P + Y+V ++ V + + + Sbjct: 270 FVLGVGQADEAE--MKQIASTPYRNHVYSVATFQTIKSVQRELISQLC 315 >gi|317122034|ref|YP_004102037.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] gi|315592014|gb|ADU51310.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] Length = 207 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 20/55 (36%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 ++ + F L + L +G + + ++ + A LA ++ + Sbjct: 47 QQGAVAAAFVLVLPVLLAAVGLGLDAGRLVVVRAHAQAVADLAGLAAVQEIDEDA 101 >gi|126304011|ref|XP_001381695.1| PREDICTED: similar to tumor endothelial marker 8 [Monodelphis domestica] Length = 564 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 61/177 (34%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR + + E++RQ + L T Sbjct: 60 NEIYYFVEHLAHK-FISPQLRMSFIVFSTRGSILMRLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + II LTDGE + + Sbjct: 119 YMHEGFQRASEQIY--------------YENMQGYRTASVIIALTDGELHEDLFF-YAER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 ++++E + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRELGATVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIDSILKK 218 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 71/205 (34%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L +++ + G++ + + E T+ R + Sbjct: 319 SGDRLNRLNQASKLFL-----LQIIEKGSWAGMVTFDSSATIQSELIQIETDAQRNSLIT 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPTVAGGGTSICSGLRTAFTVIKKKFSTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---L 398 TI C D+ K++ I T+++ S + L K + + L Sbjct: 417 -------TISTCFDEVKQSGAIIHTVALGPSADPG-LEKLAEMTGGMKTTATDNAQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R ++ KG Sbjct: 469 IDAFSALSSGNGAITQRSIQLVSKG 493 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 61/210 (29%), Gaps = 34/210 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM ++++A K A LD++ + Sbjct: 41 VELVLDVSGSMRTRDMGGE--------------SRISAAKQAFNDVLDAVPEEVQLGIRT 86 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R + + P D T A K A L Sbjct: 87 LGADYPGDDR--------------KVGCKDTKQLYPVGPLDRTEA-KTAVATLAPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISIN 370 + + I+ +TDGE+ + ++ I I T+ + Sbjct: 132 IGPALLGAADDLDGGDSTRRIVLITDGEDTCGPLDPC--EVARDIAARGIHLVIDTLGLV 189 Query: 371 ASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 ++ L ++ + V N D L Sbjct: 190 PDAKIRQQLTCIAEATGGTYTAVQNTDELS 219 >gi|191637338|ref|YP_001986504.1| hypothetical protein LCABL_05200 [Lactobacillus casei BL23] gi|190711640|emb|CAQ65646.1| Putative uncharacterized protein yvcC [Lactobacillus casei BL23] Length = 909 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/418 (7%), Positives = 99/418 (23%), Gaps = 53/418 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + ++ + A + ++ G + Sbjct: 14 LFAVLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 70 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE----MGERPIFLIELVVD 198 + + ++ + + + V Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVRTGVKNFLTSIQNAGLGDYVNVSL 189 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S + + I L S + Sbjct: 190 IGFSSPGYIGGGNKTTGPGYIRVG----------------LGKAGNTSQQQAINSALSPT 233 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I G+ +++ + + K S N Sbjct: 234 FNGGTYTQIGLRQGS--------AMLNADTSGNKKMMILLTDGVPTFSDKVKNSVVENGT 285 Query: 319 RQGVKIPSLPFQKFII------FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 G + + ++ + + + T+ KAK++ + + I Sbjct: 286 LYGTNFGTTRDEPGYTAELKNPYIDSSGDYIYDTWPATLGEAKKAKDSGNVVHALGIQLA 345 Query: 371 -----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + ++ ++ +SP+ + + +AD++ N ++ +V ++I Sbjct: 346 GDDGYWSEGYMSDEDVRQNMELIPTSPDLYEDADSADAVEAYLNNQAKDIVKNFNTII 403 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 68/248 (27%), Gaps = 26/248 (10%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV--VDLSGSMHCAMNSDPEDVNSA 217 ++ + + + L + + I+ + ++ + + V Sbjct: 69 YMTAMKYNLKDNISIQASSHTLMNQQNAALMITIKSNDILLINQRGQECVRQGVDLVCLI 128 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 + K+ ++ L L + + LI + +V + E V Sbjct: 129 DHSGSMQGEKIKLVRKTLKQMLTFLQPCD------RLCLIMFDCKVYRLTRLMRVTQENV 182 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +++ TD MK A IL K + I L Sbjct: 183 QKFRVAISSLQARGGTDIGNGMKMALSILK--------------HRKYKNPV--SAIFLL 226 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG + + V I +++ I T + + + Y V N Sbjct: 227 SDGVDEGAEERVRDDLIQYNIRDS-FTIKTFGFGRDCCPKIMSEIAHYKEGQFYFVPNLT 285 Query: 397 SLIHVFQN 404 ++ F Sbjct: 286 NIDECFAE 293 >gi|311268857|ref|XP_003132237.1| PREDICTED: collagen alpha-1(VII) chain-like [Sus scrofa] Length = 2945 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 20/167 (11%), Positives = 47/167 (28%), Gaps = 23/167 (13%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AYQILT 306 + V + Y+ S T + A+ A Q+ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFSLDTLGSGGDVIRAIRELSYKGGNTRTGAAILHVADQVFL 132 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 K I +TDG++ + + KE +K+ Sbjct: 133 PQL---------------ARPGVPKVCILITDGKSQD-----LVDAAAQRLKEQGVKLFA 172 Query: 367 ISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 + I + + + L + + ++ + V + L + +S+ + Sbjct: 173 VGI-KNADPEELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|326932668|ref|XP_003212436.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 499 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 63/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR + + E++RQ + L T Sbjct: 60 NEIYHFVEHLARK-FISPQLRMSFIVFSTRGTILMRLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIY--------------YENVHGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 ++++E + + + + L S ++ + V + ++L + +I + Sbjct: 164 EANRSRELGATVYCVGVKD--FNETQLARIADSKDHVFPVNDGFEALQGIIDSILKK 218 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 49/165 (29%), Gaps = 19/165 (11%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +G++ Y++ T + T + A++ A L S Sbjct: 102 NNTTQVGIVKYSSSANMVEMLQDLTSNKSDLIATINGLSASGSTATGTAIQVATAELLSS 161 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + K + L+DGE N +A I + T+ Sbjct: 162 RAIAG---------------HAKMEVVLSDGEYNVGID---PKIAAAQAHAQGITVHTVG 203 Query: 369 IN-ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + +T V+ NV N + L+ +F +V Sbjct: 204 VQLYGTGYTSMQQTAVAGGGIFTNVNNLNDLVALFSGTGGNLVGL 248 >gi|118101296|ref|XP_425758.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 552 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 63/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR + + E++RQ + L T Sbjct: 60 NEIYHFVEHLARK-FISPQLRMSFIVFSTRGTILMRLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIY--------------YENVHGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 ++++E + + + + L S ++ + V + ++L + +I + Sbjct: 164 EANRSRELGATVYCVGVKD--FNETQLARIADSKDHVFPVNDGFEALQGIIDSILKK 218 >gi|94482904|gb|ABF22495.1| anthrax toxin receptor 2 [Rattus norvegicus] Length = 487 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVHQLTER-FVSPEMRLSFIVFSSQATIILPLTGDRYKISKGLEDLKAVQPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + + L II LTDG + K Sbjct: 121 HEGLKLANEQIQNA----------------GGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 K++ + + + Q L+ S + + V +L + +I + Sbjct: 164 KKSRSLGASVYCVGVLD--FEQAQLERIADSKDQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + S + G Sbjct: 222 LELSPSSVCVG 232 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TAMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEAIQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 365 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TAMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEAIQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 365 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 50/138 (36%), Gaps = 2/138 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R Y + ++K ++ +S+++F+ + I V + K +++A + LAGA+ Sbjct: 12 MRQYPNRL-RNQKGLTLVVMTMSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGAT 70 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + GD +I K+ + I+ + +G + + S Sbjct: 71 VANRT-NEKGDTDTAIIESYKKVIESPGNDEIELTATDDGNGLNLLSIEYSDSPNSGFST 129 Query: 127 ISMTHMANNRLDSSNNTI 144 + + + I Sbjct: 130 TFPSSPDMVYVRVEVSEI 147 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 14/88 (15%), Positives = 33/88 (37%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + N +I+FAL L++ I + + ++ +M+S + A + A Sbjct: 6 RLSLAALRRAREQGGNVAILFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAA 65 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDD 93 + +R+ L ++ Sbjct: 66 GDLHKAETRVLTLLSENGFGDVLLLAEE 93 >gi|116748933|ref|YP_845620.1| hypothetical protein Sfum_1496 [Syntrophobacter fumaroxidans MPOB] gi|116697997|gb|ABK17185.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 427 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 79/266 (29%), Gaps = 6/266 (2%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + K ++ II AL+++ L F + + + +N +++A +A LA Sbjct: 1 MKKHAIFSKSRLSERDGATVIIVALALVMLLGFGAFAVDIGYLYVVRNELQNAADAGALA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV-FYNTEIQNIV 122 GA+ + +N ++ + ++ + + + Sbjct: 61 GAAALYNNDGTAVQPTANVIGQEAAMRNTAVRTAVEVTLNGNSGDVQRGHYSFATGTFTP 120 Query: 123 NSSRISMTHMANNRLDSSNNT-IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 N+S + ++ + + NT + V T Y+ ++ Sbjct: 121 NASLLPVSLWNVSTEELDANTDFINAVKVTTHRSAPAAPSFFSRIFGYDSFALAAEAVAY 180 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 G L L VD +M D E S I + A A Sbjct: 181 IGFAGT----LYPLNVDKPIAMCQESLLDAEGNYSCSIGRMINSGSGAGHNTAAWTNYSQ 236 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNI 267 ++ + + + I Sbjct: 237 PCETANANDMAQLLAGCGGSNPTPII 262 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 5/99 (5%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR---- 377 PS F LT N + T + AK I + TI + + Sbjct: 36 GSNPSSRFPPTNQDLTTPINIRNALDALTTQAYTNAKAAGISVYTIGFSTPSDSIDDKGL 95 Query: 378 -LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 LL C SS + +A++LI F I++ + + + Sbjct: 96 SLLSNCASSSSQAFVANDANTLISAFNQIAKSVGSLRLT 134 >gi|87308177|ref|ZP_01090319.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] gi|87289259|gb|EAQ81151.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] Length = 1032 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 35/253 (13%), Positives = 81/253 (32%), Gaps = 50/253 (19%) Query: 160 FIEHLLNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 N K+ P + P+ + LV+D SGSM Sbjct: 425 PSSFGAGGWANTKLEEASPVRFTIRDAKVVPVGALMLVLDKSGSMQ-------------- 470 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 KM + A L + ++ + G+IG+ ++ ++ + + Sbjct: 471 ------GEKMQMTQGAALAAIRAMGAAD------FAGVIGFDSQAQRIVPIRK-VDNPGM 517 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +V + T+ TP + ++ L + K +I L+D Sbjct: 518 FVAQVRKLSASGGTNMTPGVALGFRDLQNVDAGV------------------KHMIVLSD 559 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G+ N +I K+ + + +++ + + + + + Y V N ++ Sbjct: 560 GQ----TEPGNVAQIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAVNNPKAI 615 Query: 399 IHVFQNISQLMVH 411 +F ++ + Sbjct: 616 PRIFMREARRVAQ 628 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 58/204 (28%), Gaps = 56/204 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D K ++ K + FL + +GL+ ++ + Sbjct: 96 DVSPSMLVEDIKPDRLERAKREITDFLKVVK-------GDRVGLVAFSGAAYTQCPLTLD 148 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ +++ TD A+ A + + K Sbjct: 149 YGALMMFLNILHPNNIPHPGTDLGAAVLGAIKAFDPKSETD------------------K 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGE+N + + +A IKI + Sbjct: 191 VILLITDGEDNEKRG----LDAAREAVRKGIKIFVFGMGDPAGGPIPASNGKGGFVKDDK 246 Query: 374 --------NGQRLLKTCVSSPEYH 389 + L + + + Sbjct: 247 GELVLSKLDEAGLQEIAAVTGGEY 270 >gi|149727851|ref|XP_001493364.1| PREDICTED: anthrax toxin receptor 1 [Equus caballus] Length = 603 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 119 YMHEGFQRASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|3273257|dbj|BAA31172.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 44/527 (8%), Positives = 119/527 (22%), Gaps = 130/527 (24%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ I E ++++ L + L LI + + + + A+L+ ++ Sbjct: 7 SQARRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAEN 66 Query: 69 VSNLSRLGDRFESISN-------------------------HAKRALIDDAKRFIK---- 99 + + SN DD I Sbjct: 67 NNGRKDNDYKLSGSSNKENDSFDISSEVGKRDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN-----NRLDSSNNTIFYNMDV---M 151 + S+ + + + ++D ++ + + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIKHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 152 TSYDYRLQFIEHLLNQRYN---------------------------------QKIVSFIP 178 + + + Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E + ++ V+ + +H S + ++ ++ + L Sbjct: 247 FAMGAEHPKENKCVLPFVLQNN--LHEMSKSKIKQYLTSSHKSLRRTEFVDNFVALLDTE 304 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + ++ ++K + + E+ ++ R T ++ + Sbjct: 305 ATLNSIGKPNYDIIFPKSSICLEGLKKASQFWYTKEEKEKFRNRVDSLKANGGTLASSGL 364 Query: 299 KQAYQILTS--------DKKRSFFTNFFRQGVKI-------------------------- 324 A + S +++ G Sbjct: 365 LTASNQMLSEKSRSEELNQETKRVILVLSDGNDDMSNLNLADLERNSIPFTNFSRITKNL 424 Query: 325 ---------PSLPFQKFIIFLTDGENNNFKSNVNTIK-----ICDKAKEN--------FI 362 + K N N T +C +E Sbjct: 425 ILGKKEDLSSTTTSNKAYYNRHSTFNYNTYLTNKTKDISRKGMCSIIQEKLNTLNKDKNT 484 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQL 408 K+V + KTCV ++ N +SL++ F+ I + Sbjct: 485 KLVFVEFGYRSESADAWKTCV-GNGNYFYADNRESLLNSFKQAIGET 530 >gi|222635792|gb|EEE65924.1| hypothetical protein OsJ_21784 [Oryza sativa Japonica Group] Length = 578 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 75/224 (33%), Gaps = 27/224 (12%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + V+D+SGSM+ + + PE A T++ LK ++ + +D + Sbjct: 45 PIDVVAVLDVSGSMNDPVAASPESNLQA--------TRLDVLKASMKFIIRKLDDGDRLS 96 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + G + + R + + P +++A +IL + Sbjct: 97 IVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQAVVAVAL-RLCPELQEAVKILDERQ 155 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S F I+ LTDG++ + K + T ++ Sbjct: 156 GNSRNRVGF--------------ILLLTDGDDTTGFRWSRDVIHGAVGK---YPVHTFAL 198 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 A+ + + LL S + V + + L + ++ + K Sbjct: 199 GAAHDPEALLHIAQESRGTYSFVDDGN-LDKIAGALAVCLGGLK 241 >gi|194474004|ref|NP_001124020.1| collagen alpha-1(XIV) chain [Rattus norvegicus] gi|149066377|gb|EDM16250.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 1794 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 54/186 (29%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 1044 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDAYKTK-ETLLDAIRHIS 1102 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 1103 YKGGNTKTGKAIKHVRDTLFTADSG-------------TRRGIPKVIVVITDGRSQDDVN 1149 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNI 405 KI + + + I I + A + L++ S + + V + F+ I Sbjct: 1150 -----KISREMQADGYNIFAIGV-ADADYSELVRIGSKPSSRHVFFVDDF----DAFKKI 1199 Query: 406 SQLMVH 411 ++ Sbjct: 1200 EDELIT 1205 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 58/194 (29%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLREAGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|149066378|gb|EDM16251.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1127 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 54/186 (29%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 + ++ L + + ++ +T + K + S Sbjct: 395 DNFNKIINFLYSTVGALDKIGADGTQVAMVQFTDDPRTEFKLDAYKTK-ETLLDAIRHIS 453 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K L + K I+ +TDG + + + Sbjct: 454 YKGGNTKTGKAIKHVRDTLFTADSG-------------TRRGIPKVIVVITDGRSQDDVN 500 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNI 405 KI + + + I I + A + L++ S + + V + F+ I Sbjct: 501 -----KISREMQADGYNIFAIGV-ADADYSELVRIGSKPSSRHVFFVDDF----DAFKKI 550 Query: 406 SQLMVH 411 ++ Sbjct: 551 EDELIT 556 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 58/194 (29%), Gaps = 23/194 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + + + + E +GL Y+ + K Sbjct: 166 GSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVID 224 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ +F + K I +TDG+ Sbjct: 225 AVRSLPYKGGNTLTGLAL-------------NFIFENSFKPEAGSRSGVSKIGILITDGK 271 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + + +E +++ I + + + L+ S P+ + YNV D + Sbjct: 272 SQDDIIPPS-----RNLREAGVELFAIGVKNADLSE--LQEIASEPDSTHVYNVAEFDLM 324 Query: 399 IHVFQNISQLMVHR 412 V +++++ + R Sbjct: 325 HTVVESLTRTVCSR 338 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 54/161 (33%), Gaps = 35/161 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + + VD+SGSM D +++ ++ L FL+ ++ Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKG 133 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ ++ VR ++ + K T A+ A + L Sbjct: 134 DRVGLILFGSQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL------ 187 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + ++ +TDG NN + + T Sbjct: 188 ------------RMRPATSRALVLVTDGANNAGQIDPITAA 216 >gi|149419345|ref|XP_001517573.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 139 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 54/155 (34%), Gaps = 18/155 (11%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + M I ++TR ++ + E++RQ + L T ++A + + Sbjct: 2 QLRMSFIVFSTRGSTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY---- 57 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 II LTDGE + + + +++++ + + + Sbjct: 58 ----------HENWQGYRTASVIIALTDGELHE-NLFFHAEQEANRSRDFGATVYCVGVK 106 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQN 404 + L S ++ + V + ++L + + Sbjct: 107 D--FNETQLARIADSKDHVFPVNDGFEALQGIIDS 139 >gi|157819693|ref|NP_001101483.1| matrilin-3 [Rattus norvegicus] gi|149050910|gb|EDM03083.1| matrilin 3 (predicted) [Rattus norvegicus] Length = 463 Score = 59.9 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + + Sbjct: 85 SSRSVRPLEFTKVKTFVSRIIDTL---DIGATDTRVAVVNYASTVKIEFQLNTYSNKQAL 141 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + G + P+ K I + Sbjct: 142 KQAVARITPLST--GTMSGLAIQTAME----------EAFTVEAGARGPTSNIPKVAIIV 189 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P ++ + V Sbjct: 190 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASKPLEDHVFYVET 242 Query: 395 ADSLIH 400 + Sbjct: 243 YGVIEK 248 >gi|292655414|ref|YP_003535311.1| von Willebrand factor type A domain-containing protein [Haloferax volcanii DS2] gi|291372503|gb|ADE04730.1| von Willebrand factor type A domain protein [Haloferax volcanii DS2] Length = 818 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 48/161 (29%), Gaps = 25/161 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D L + ++ +G++G+ R TD + A Sbjct: 428 LDALDQLGDENRVGIVGFNYRAYDVAPLRPLGPNRESAADLIRRLESGGATDIAVGLDGA 487 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q L + II ++DG + + D+ + Sbjct: 488 AQQLGDRRGT---------------------IILISDG----HDRFQDAATLADQLGRDG 522 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++TI +PN + L +S + D L +F Sbjct: 523 VSVITIGTGPNPNERTLRAIARASGGNYLRADETDRLRILF 563 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 75/235 (31%), Gaps = 45/235 (19%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I L++D SGSM +T++ A K A F S Sbjct: 64 QAAAEAKDIPPTMLILDASGSMMARDAGG--------------QTRLDAAKEASKNFSRS 109 Query: 242 IDLLSHVKEDVYMGLIGYT------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + S + VY +G + + G + T Sbjct: 110 VSEESELGFMVYGTKVGNSPEEREAGCKDVTTLLPVGKGNAGKISGEVDKVNASGHTPMG 169 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 PA+KQA + L ++ + + I+ ++DGE+ V + Sbjct: 170 PALKQAAKELPNEGE--------------------RSIVLVSDGEDTCAPPPVC--DVAK 207 Query: 356 KAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I I T+ P ++ L+ + + + +A+SL + ++ Sbjct: 208 DLHKQGIDLTINTVGFLVDPAARKELQCIAEAGGGEYLDAQDAESLAESMKVLAT 262 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 71/253 (28%), Gaps = 56/253 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + V+D SGSM K+ + AL+ LD + Sbjct: 1 MPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRDQ- 39 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + R + K R + L T+ AM A Q+L S Sbjct: 40 FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLDSS 96 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTI 367 + II LTDG+ ++N +I +A + + Sbjct: 97 NQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCL 145 Query: 368 SINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS----------------Q 407 + L K + + H + +A L +Q ++ + Sbjct: 146 GFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVE 205 Query: 408 LMVHRKYSVILKG 420 + + ++ KG Sbjct: 206 EVTQNNFRLLFKG 218 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 7/98 (7%), Positives = 28/98 (28%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +++R + + N +++ L LL F + + ++ ++ + A + Sbjct: 1 MMARIAGRAVQALRDRSGNVAVMAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAV 60 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + + + + Sbjct: 61 TAVANISRANAAAALTMADNRQGNVQLIDRATLSTPMA 98 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 60/178 (33%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D +G++ Y+ R E + + T++ Sbjct: 68 QWVANLVDTFEVGPGHTRVGVVRYSDRPTTAFELGHFNSREEVKAAARRITYHGGNTNTG 127 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + + + G + + F++ I LTDG + + + Sbjct: 128 DALRY----------ITSRSFSAQAGGRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-- 175 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I+I + + A+ + L S P+ + ++V + +++ + + + + Sbjct: 176 ---AAGIRIFAVGVGAALKEE--LDEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCE 228 >gi|74205761|dbj|BAE23197.1| unnamed protein product [Mus musculus] Length = 751 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 48/163 (29%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + K + S T + A+K L + Sbjct: 23 DGTQVAMVQFTDDPRTEFKLDSYKTK-ETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTS 81 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K I+ +TDG + + + KI + + + I I Sbjct: 82 DSG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQADGFNIFAIG 123 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 124 VADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIEDELIT 162 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 86/275 (31%), Gaps = 55/275 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 +L + I + M I + +V+DLSGSM QD Sbjct: 69 MLLLLALAALTFAIARPQAVMMLPSHIETVVVVIDLSGSMRA---------------QDI 113 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + +++ A + A + LD++ V G++ PS + + R Sbjct: 114 QPSRIRAAQQAAKVLLDTLPAG------VSAGVVAMAGTAAVAQAPSRSKDAAATAIDRL 167 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKR------SFFTNFFRQGVK-------IPSLPFQ 330 T + A L + T ++ P Sbjct: 168 K---PQGGTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPDASHSGEAVTPGSYPS 224 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NG 375 I+ +DGE+N + ++ + +++ T+ + + + Sbjct: 225 GAIVLFSDGESNTGPAATQAAQLAAA---HGVRVYTVGVGTTDGVVLSVDGWSARVRLDE 281 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + ++ ++ + +A L V++ ++ + Sbjct: 282 KVLKEVANATGAEYFPLADAAQLKRVYRALNMRLT 316 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 59/179 (32%), Gaps = 21/179 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F++ + V + + I ++T+ + + + ++ + + + T Sbjct: 60 IYEFVEKLTER-FVSPRMRLSFIVFSTQAKIILPLTGDRYEITKGLKDLSSVIPAGETYM 118 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 K A + + VK II LTDG + V T K Sbjct: 119 HEGFKLANEQI----------------VKAGGKSTASVIIALTDG-KLADQIPVLTEKEA 161 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHR 412 + A+ ++ + + N +L + + PE + V L + +I + Sbjct: 162 NIARGRGARVYCVGV-LDFNFDQLKRIAAA-PENVFRVEGGFKDLGLIINSILEKSCTE 218 >gi|326681146|ref|XP_002665520.2| PREDICTED: collagen alpha-1(XXII) chain [Danio rerio] Length = 1623 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 58/178 (32%), Gaps = 19/178 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + + ++ Y+ R + + T + Sbjct: 59 QWVANLVESFDVGVDKTRVAVVRYSDRPTVEFNLARYKTLEEVKRAAGNIRYLGGNTKTG 118 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + R G + + QK I LTDG++ +F + Sbjct: 119 DAISFTTTNIFT----------ERAGARPAAKGIQKVAILLTDGQSQDFVLEPSVAAAA- 167 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 I++ + I + + L+ + P+ + ++V + D++ + + + + Sbjct: 168 ----AGIRLFAVGIGEALKEE--LEEIAAEPKSAHVFHVTDFDAIDKIRGRLRRRLCE 219 >gi|224049043|ref|XP_002191793.1| PREDICTED: similar to capillary morphogenesis protein 2 [Taeniopygia guttata] Length = 554 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 72/258 (27%), Gaps = 52/258 (20%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERP------IFLIELVVDLSGSMHCAMNSDPEDVN 215 + + +P L F + V+D SGS+ + VN Sbjct: 83 GSMAGAVPLLLLPLLLPPLPAGSAPAAGEPSCHGAFDMYFVLDKSGSVATNWREIFDFVN 142 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + +++ I ++T+ + + EK Sbjct: 143 QLTERFVSPKMRLS--------------------------FIVFSTQAHVIMPLTGDREK 176 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 +++ + + T +KQA + + F II Sbjct: 177 IKKGLKDLEEVKPAGETYIHEGLKQANEQIAKQGASRF----------------SSIIIA 220 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-N 394 LTDG + + + K K++E ++ + + L+ E + V Sbjct: 221 LTDG-KLDGQIPLYAEKEAKKSRELGARVYCVGVQD--FEPEQLERIADVKEQVFPVTGG 277 Query: 395 ADSLIHVFQNISQLMVHR 412 +L + ++ + Sbjct: 278 FQALKGIINSVLKQSCTE 295 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 74/202 (36%), Gaps = 22/202 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-TRVEKNIEPSWGTEKV 276 K+ A+ AL +D+ S + ++++ +I + ++++ + Sbjct: 20 TSGSMSVDGKIDAMNQALRDLIDTFSGESRLNAEIHLSVITFGGDGAKEHLPLTCAHT-- 77 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ D T A + A +++ + KIPS ++ I+ + Sbjct: 78 ---ISGFSDLQAHGMTPMGGAFRIAKELIEDKE-------------KIPSRAYRPVIVLV 121 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNA 395 +DG N+ + + + ++I + + +LK ++ PE + A Sbjct: 122 SDGYPNDDWE-AAFSSLRGSERAQKATRMAMAIGSDADEN-MLKDFINDPETPVFRANGA 179 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 +I F+ +S + R S Sbjct: 180 RDIIRFFRAVSMSVTSRSRSSA 201 >gi|3273247|dbj|BAA31167.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|296224421|ref|XP_002758053.1| PREDICTED: matrilin-3 isoform 2 [Callithrix jacchus] Length = 445 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 91 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 147 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 148 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTLEAGARGPSSNVPKVAIIV 195 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 196 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASEPLEEHVFYVET 248 Query: 395 ADSLIH 400 + Sbjct: 249 YGVIEK 254 >gi|296224419|ref|XP_002758052.1| PREDICTED: matrilin-3 isoform 1 [Callithrix jacchus] Length = 487 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 91 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 147 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 148 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTLEAGARGPSSNVPKVAIIV 195 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 196 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASEPLEEHVFYVET 248 Query: 395 ADSLIH 400 + Sbjct: 249 YGVIEK 254 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 57/208 (27%), Gaps = 31/208 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--------EDVYMGLIGYTTRVE 264 D + + + +K K ++ F D I S V+ + G + + Sbjct: 43 DASGSMAKRIDGVSKYNMAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQSCESIR 102 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + ++ T A++ A T Sbjct: 103 GVYGFQRF--DRQSFLNSLNGIGPTGWTPIAKALEDAKASFTG----------------- 143 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCV 383 K + LTDGE +K + ++ IK+ I + +L + Sbjct: 144 LHKLGSKSVFLLTDGEETCGGD---PVKTAKELRKQHIKVNVIGFDFKEGFNGQLHEIAK 200 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVH 411 + +Y + + +F + + Sbjct: 201 AGGGKYYEAHSQKDMNRIFTMAASSLAE 228 >gi|148681993|gb|EDL13940.1| mCG48880 [Mus musculus] Length = 249 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 62/173 (35%), Gaps = 24/173 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+I Y+ +VEK + K + D + + T + A++ A + Sbjct: 49 RIGIINYSHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAANDMF------- 101 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K +K + +TDG+ + + + A ++ ++I I + Sbjct: 102 ----------KEARPGVKKVALVITDGQ-TDSRDKKKLADVVKDANDSNVEIFVIGVVKK 150 Query: 373 PNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 + + + + E+ Y + +L + +S+ + S +++ Sbjct: 151 DDPNFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKICEDFDSYLIQ 203 >gi|329664416|ref|NP_001193157.1| anthrax toxin receptor 1 [Bos taurus] gi|297480483|ref|XP_002691486.1| PREDICTED: anthrax toxin receptor 1 [Bos taurus] gi|296482440|gb|DAA24555.1| anthrax toxin receptor 1 [Bos taurus] Length = 564 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++T+ ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTQGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|3273275|dbj|BAA31181.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273291|dbj|BAA31189.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 76/233 (32%), Gaps = 52/233 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + + + LV+D SGSM ++ A K+ + Sbjct: 109 GRKSVQDLVLVIDNSGSMQQTD---------------PDNERLTAAKSLIGQM------- 146 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSL---ILKPTDSTPAMKQA 301 D + ++ + + + + GT+ +Q V +DS+ + T+ A+ + Sbjct: 147 ---DGDKRVAIVSFESTAQLVQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEIGLALDET 203 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + +I L+DG + I + Sbjct: 204 IKEIE----------------TQGNAEKGSLVIMLSDGFSELDTQTALAPYIARQ----- 242 Query: 362 IKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 I I TI + A G LL+ S + + NV NA L F I + R Sbjct: 243 IPINTIGLKLAESEGIALLQNIASLTGGTYSNVANAQGLTQAFGKIYNKIGDR 295 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 63/191 (32%), Gaps = 24/191 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + +D E +G++ Y+ + E + + T++ Sbjct: 571 QWVANLVDTFEVGPERTRVGVVRYSDQPTTAFELGLFGSREAVKAAARHLAYHGGNTNTG 630 Query: 296 PAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + + G + F++ I L G + + + Sbjct: 631 DALRFITRHSFSP-----------QAGGRPGDRAFKQVAILLPAGRSQDLVLDAAAAA-- 677 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMV-- 410 I+I + + A+ + L+ S P+ + ++V + +++ + + + + Sbjct: 678 ---HRAGIRIFAVGVGAALKEE--LEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLCES 732 Query: 411 -HRKYSVILKG 420 S+ L+G Sbjct: 733 FSSALSLFLRG 743 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 61/173 (35%), Gaps = 24/173 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G+I Y+ +VEK + K + D + + T + A++ A + Sbjct: 836 RIGVINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMF------- 888 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 K +K + +TDG+ + + ++ + ++I I + Sbjct: 889 ----------KDARPGVKKVALVITDGQ-TDSRDKEKLTEVVKNVSDTNVEIFVIGVVKK 937 Query: 373 PNGQ-----RLLKTCVSSPEYHYNVVNADSLIHVFQN-ISQLMVHRKYSVILK 419 + + + + PE+ Y + +L + + Q + S +L+ Sbjct: 938 NDPNFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICEDFDSYLLQ 990 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 69/196 (35%), Gaps = 29/196 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATKLLKREGRKGSV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSL 398 + I D A+ + I +TI ++ N +L S + D + Sbjct: 166 PKNPD--VQSISDDARISGISFITIGLSTVVNEAKLRLISGDSSNEPILLLSDPTLVDKI 223 Query: 399 IHVFQNISQLMVHRKY 414 + + + RK Sbjct: 224 QDHLETLFEKKCERKI 239 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/226 (8%), Positives = 70/226 (30%), Gaps = 58/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +++ + + +I + + + Sbjct: 97 DISNSMLAEDVTPSRLEKSKKLISRLVETF-------NNDKVAMIVFAGEAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ A + T + + Sbjct: 150 YISAKMFLETITPSLISTQGTDIRGAIDLAMKSFTP------------------NEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + ++ +A + +++ + + + Sbjct: 192 AIVLITDGENHEGGA----VEAAQEAAKKGVRVFVLGVGSPDGSPIPTGNGSEFRRDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N Q + + + V N + + ++ + Sbjct: 248 NVIVTKLNEQMCQEIAKAGNGMYVRVDNT---NNAERALNAEINKL 290 >gi|220923693|ref|YP_002498995.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219948300|gb|ACL58692.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 130 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 40/131 (30%), Gaps = 3/131 (2%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ A + ++ FA+ M L+ + ++ + +N + A + A + Sbjct: 2 SALRRFCACRSGSTAVEFAMVGMIMLVTMLGIVELGRGLNVRNQLSQAADFGARAV---L 58 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 ++ I + A D + + + L + + S + Sbjct: 59 MNKTISDSGLEAVIRAAFQAASPDQLQVTVGAEVVNGLQFRTVSVSYPFVPLTWGFSTGT 118 Query: 129 MTHMANNRLDS 139 + + R Sbjct: 119 INLSVSRRTPV 129 >gi|317419330|emb|CBN81367.1| Integrin alpha-M [Dicentrarchus labrax] Length = 1058 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 66/202 (32%), Gaps = 32/202 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + QD + K +KN + F + + + Y ++ Sbjct: 68 DGSGSVASQDFTKMK-DFVKNLVNSFQGKDTKFAIAQFSNAPLVHYYFD--------TFD 118 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 R + R T + A++ + + S +K Sbjct: 119 INNWRTQIDRIRQL--TGGTYTAAAIEHVVNNVFDPSRGSRLN-------------VKKV 163 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSP--E 387 +I +TDGE+++ + A+ I I + ++ ++ L T S P Sbjct: 164 LIVITDGESHDRRDLPY---AASLAQGKNIVRFAIGVGGAFSNVAAKQELDTIASDPPAS 220 Query: 388 YHYNVVNADSLIHVFQNISQLM 409 + + V N +L + QN+ + Sbjct: 221 HVFRVDNFGALEQIRQNLQDKI 242 >gi|291393629|ref|XP_002713393.1| PREDICTED: alpha 1 type VII collagen [Oryctolagus cuniculus] Length = 2937 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 48/170 (28%), Gaps = 21/170 (12%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + Y+ S T + A++ Sbjct: 69 SGAASAQGVRFAAVQYSDDPRTEFGLDTLASGGEVIRAIRELSYKGGNTRTGAAIRHVAD 128 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + ++ K I +TDG++ + + K +K Sbjct: 129 HIFL--------------PQLTRPGIPKVCILITDGKSQDQVD-----AAAQRLKGQGVK 169 Query: 364 IVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I + + + L + + ++ + V + L + +S+ + Sbjct: 170 LFAVGI-KNADPEELKRVASQPTGDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEASQWRPSLVPASAENVNKARSFAVGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEEQLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEASQWRPSLVPASAENVNKARSFAVGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEEQLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEASQWRPSLVPASAENVNKARSFAVGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEEQLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|167526012|ref|XP_001747340.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774175|gb|EDQ87807.1| predicted protein [Monosiga brevicollis MX1] Length = 1632 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 74/216 (34%), Gaps = 31/216 (14%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + V+D SGS+ + + + +K + F Sbjct: 328 KSAIDLIFVLDGSGSIDRESYGGT--------PGNFQYKILDFVKQVVSYF-------DI 372 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +G+I +++ N + +K D + T + A+ Q + Sbjct: 373 SANATRVGVITFSSSAVINFNLNSFYDKSDMLDAIDNINYPASSTRISLALASVRQNMLK 432 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + G++ S K ++ LTDG+ ++ K+ + + +I Sbjct: 433 EYN----------GMRPESEGVPKVVVVLTDGQASSGYEPAY---EAALLKDMGVNMFSI 479 Query: 368 SINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 I +S + + L+ ++P + + + N D++ + Sbjct: 480 GIGSSIDTDQ-LEDMATAPLASHMHLLKNFDAIPDI 514 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 64/236 (27%), Gaps = 54/236 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D T+++A + A + + V +G++ + + P+ Sbjct: 93 DVSRSMEATDVAPTRISAAQQAARDLIVGLPAS------VRLGIVSFAGTATVVLRPTSN 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ------------ 320 + + + + T + + QA +L D F Sbjct: 147 R---QDMLDAIDRFQLQRGTATGSGLIQALAVLFPDDGIDLEAILFADEPVFSTRRAVPL 203 Query: 321 ----------------GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 P +I L+DG + A + +++ Sbjct: 204 DEAAAADAVRKREQATQSAQPGSYRHGAVILLSDGRRTVGPD---PVDAARMAAQRGVRV 260 Query: 365 VTISINA--------------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 T+ + L + ++ +A L V++ +S Sbjct: 261 YTVGFGTLGGNAPETSLSYYMQLDEPALRAVATITGAEYFQAGSAADLSQVYRQLS 316 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 66/183 (36%), Gaps = 16/183 (8%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ A K AL + ++ + +GL Y +V+ +P+ Q V Sbjct: 58 TKIEAAKRALTGVVGAL------PDTAQVGLRVYGAKVDGKGKPTPAACADTQLVHPIAT 111 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 K T + A+K + + + +K ++ I+ ++DGE + Sbjct: 112 LDKPKLTSTIAAIK--------ALGETPIAHSLTEALKDLGTSGKRNIVLVSDGEE-SCV 162 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN 404 + A ++I T+ + + L+ + +Y+ +A +L Sbjct: 163 PDPCPAITKLTAAGVDLQIDTVGFGVNTKARAQLQCIAAAGKGTYYDAKDASALTTSLSK 222 Query: 405 ISQ 407 +SQ Sbjct: 223 LSQ 225 >gi|296223654|ref|XP_002757711.1| PREDICTED: anthrax toxin receptor 1 [Callithrix jacchus] Length = 568 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 62 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 120 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 121 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 165 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 166 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 220 >gi|73970245|ref|XP_855427.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 555 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 51 NEIYYFVEQLAHK-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 109 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 110 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 154 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 155 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 209 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/260 (10%), Positives = 61/260 (23%), Gaps = 41/260 (15%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + R Q + L ++ E ++E ++ P Sbjct: 307 VIFVLDTSGSMAGTRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVT----EWDLRKPYK 362 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + E Sbjct: 363 GPDHYYFNSPP-----------EETTEDATAVPQNNESEVKFGPYDDILAYPVT-----E 406 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V++ + T+ A+ A + + I Sbjct: 407 QSVKRAKEFVAAMDVTSSTNINDALLLALK---------------NSQSVQSRVRLTPII 451 Query: 334 IFLTDGENNNFKSNV--NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-- 389 IFLTDGE ++ + ++ + I ++ + Q L K + + Sbjct: 452 IFLTDGEPTASVTDTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARK 511 Query: 390 -YNVVNAD-SLIHVFQNISQ 407 Y +A L F ++ Sbjct: 512 IYEAADATLQLKGFFDEVAS 531 >gi|324499530|gb|ADY39800.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 3675 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 61/180 (33%), Gaps = 17/180 (9%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K+ +L F+ +L ++ +GLI Y+ ++ + S +K + Sbjct: 1207 VFKHEVLRFVKEFVELFDIGLDNTRVGLIQYSDQIRHEFDLSQYRDKASVIQAISQVHYL 1266 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ R+G + S + I +TDG + NV Sbjct: 1267 TGLTRTGAAIQHMV----------MEGFSERRGARAESDDVARVAIVITDGR---SQDNV 1313 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I A+ + + I + L++ SP + V L +++ Q Sbjct: 1314 TEPAIA--ARRLHVNMFAIGV-TDHVLASELESIAGSPSRWFYVDRFKDLDTRLRSLIQK 1370 >gi|324112804|gb|EGC06780.1| von Willebrand protein type A [Escherichia fergusonii B253] Length = 530 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 79/253 (31%), Gaps = 45/253 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I I + P + ++D SGSM+ Sbjct: 144 PWNEKHTLLRLDIA-ANDIARSKLPPANLVFLIDTSGSMN-------------------S 183 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K+ L L ++ + + ++ Y + + G+EK + Sbjct: 184 DERLPLIKSLLKLLVNELRDQD------RISIVTYAGSARLLLSSTSGSEK-NTILNAIA 236 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ + AY+ + + I+ TDG+ N Sbjct: 237 NLQAGGGTNGGAGVAMAYEQAQAGYIKGGVN----------------RILLATDGDFNIG 280 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + K +E+ I + T+ + + + ++ + +Y DSL + Sbjct: 281 DDPSSVEDLVKKQRESGITLSTLGVGDNNYNEAMMVKIADTGNGNY--SYLDSLSEAQKV 338 Query: 405 ISQLMVHRKYSVI 417 +S M +V Sbjct: 339 LSNEMNQTLVTVA 351 >gi|309790583|ref|ZP_07685138.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] gi|308227385|gb|EFO81058.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] Length = 430 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/249 (12%), Positives = 71/249 (28%), Gaps = 48/249 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 +P + G I + LV+D SGSM + K+ ++A Sbjct: 79 AVPVADFVGAGGSAIRSV-LVIDRSGSM-------------------DEGNKIDGARDAA 118 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F+ + + LIG+ +V + + + R T Sbjct: 119 QAFVGMVRSDD------QVALIGFNDQVVVLEPFTDDQAILEAAIRRL---RADGGTALY 169 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ + +L + G L D ++ S+ + Sbjct: 170 DSIVEGVDLLRD-QPGRRALLVLTDGQDCRDL----------DSCPDDAGSSHTLAEAIA 218 Query: 356 KAKENFIKIVTISIN-------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 A + I + + + L + + + +A +L +++ I+ Sbjct: 219 YANAANQPVTLIGLGQRGSSGDDGIDERVLQRIATETRGSYAYSPDAAALTDLYREIAGG 278 Query: 409 MVHR-KYSV 416 + + + Sbjct: 279 LQSEYRLTY 287 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 80/262 (30%), Gaps = 51/262 (19%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 D TS + ++ +P + LV+D SGSM Sbjct: 258 DASTSLALVPDA----DDGDEGTFALTVVP---DPAPATANAKDVVLVLDRSGSMTGW-- 308 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 KM A + A +D++ + G++ + + + Sbjct: 309 ------------------KMVAARRAAARIVDTLTGGDRFAVLSFDGVVEQPEGLGEGLS 350 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + R V T+ ++QA +L+ + Sbjct: 351 EASDRNRYR-AVEHLARLEARGGTEMLAPLEQAVALLSESGRD----------------- 392 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + ++ +TDG+ + +I A+ +++ T+ I+ + N L + Sbjct: 393 --RVLVLVTDGQ--VGNEDQILERI--GARLAGVRVHTVGIDRAVNAGFLGRLAGLGSGR 446 Query: 389 HYNVVNADSLIHVFQNISQLMV 410 V + D L ++I + + Sbjct: 447 CELVESEDRLDEAMEHIHRRIG 468 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 62/186 (33%), Gaps = 28/186 (15%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +I+L+ + + ++ + + + + K+++ + + T A+ Sbjct: 211 ATINLVKASNPEDEVFVVNFNDDYYLDQDYTDSVAKLKEALEK---YETRGGTALYDAV- 266 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + +K + +TDGE++ N I +E Sbjct: 267 ----------------LASNAHLMKAPKLEKKVLFIVTDGEDDA-SLNTLEQTIRKVQQE 309 Query: 360 NFIKIVTISINASP------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 N I TI I + L + S+ + + D + + Q I+ + ++ Sbjct: 310 NGPTIYTIGILDETGGHKRRAQRALREMAESTGGVAFFPQSLDEVSRITQQIAHDIRNQ- 368 Query: 414 YSVILK 419 Y++ K Sbjct: 369 YTISYK 374 >gi|297673847|ref|XP_002814961.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pongo abelii] gi|261858584|dbj|BAI45814.1| anthrax toxin receptor 2 [synthetic construct] Length = 488 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|224809466|ref|NP_001139266.1| anthrax toxin receptor 2 isoform 2 [Homo sapiens] gi|306526289|sp|P58335|ANTR2_HUMAN RecName: Full=Anthrax toxin receptor 2; AltName: Full=Capillary morphogenesis gene 2 protein; Short=CMG-2; Flags: Precursor Length = 489 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|133777718|gb|AAI14375.1| ANTXR2 protein [Homo sapiens] Length = 446 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 20 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 78 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 79 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 121 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 122 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 179 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 180 LELQPSSVCVG 190 >gi|119626254|gb|EAX05849.1| anthrax toxin receptor 2, isoform CRA_b [Homo sapiens] Length = 322 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|62089412|dbj|BAD93150.1| anthrax toxin receptor 2 variant [Homo sapiens] Length = 502 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 76 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 134 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 135 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 177 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 178 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 235 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 236 LELQPSSVCVG 246 >gi|55733508|emb|CAH93432.1| hypothetical protein [Pongo abelii] Length = 435 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 9 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 67 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 68 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 110 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 111 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 168 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 169 LELQPSSVCVG 179 >gi|30013741|gb|AAP04016.1| capillary morphogenesis protein 2 [Homo sapiens] Length = 489 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|50513243|ref|NP_477520.2| anthrax toxin receptor 2 isoform 1 [Homo sapiens] gi|21750159|dbj|BAC03731.1| unnamed protein product [Homo sapiens] Length = 488 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|7022738|dbj|BAA91707.1| unnamed protein product [Homo sapiens] Length = 218 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 56/156 (35%), Gaps = 18/156 (11%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 M I ++TR ++ + E++RQ + L T ++A + + + + Sbjct: 1 MSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENR--- 57 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II LTDGE + + + +++++ + + + Sbjct: 58 -----------QGYRTASVIIALTDGELHEDLFF-YSEREANRSRDLGAIVYCVGVKD-- 103 Query: 374 NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 + L S ++ + V + +L + +I + Sbjct: 104 FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 139 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 72/237 (30%), Gaps = 45/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ + + ++D+SGSM+ R K+ LK +L + Sbjct: 259 EVDKSQLGASNLVFLLDVSGSMN-------------------SRDKLPLLKTSLKMLSQQ 299 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + HV VY G + V + + Q + T+ ++QA Sbjct: 300 LSEQDHVSIVVYAGA----SGVVLDGVKGNDIYAINQALNNLK---AGGSTNGGAGIQQA 352 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKEN 360 Y + + +I TDG+ N ++ + + ++ Sbjct: 353 YGLAQKHFIQGGVN----------------RVILATDGDFNVGTTDHQALMDLIASKRDQ 396 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + T+ L++ HY ++ L + + + ++ Sbjct: 397 GIALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDT--LNEARKVLVDELSSTLLTIA 451 >gi|254802534|sp|Q0PMD2|ANTR1_RAT RecName: Full=Anthrax toxin receptor 1; Flags: Precursor Length = 562 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 58 NEIYYFVEQLAHR-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 116 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 117 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 161 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 162 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 216 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 21/226 (9%), Positives = 70/226 (30%), Gaps = 58/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +++ + + +I + + + Sbjct: 97 DISNSMLAQDVTPSRLEKSKKLISRLVETF-------NNDKVAMIVFAGEAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ A + T + + Sbjct: 150 YISAKMFLETINPSLISTQGTDIAGAINLAMKSFTP------------------NEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + ++ +A + +++ + + + Sbjct: 192 AIVLITDGENHEGGA----VEAAQEAAKKGVRVFVLGVGSPDGAPIPAEGTNEFRRDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N Q + + + V N + + ++ + Sbjct: 248 NVIVTRLNEQMCQEIAKAGNGIYVRVDNT---NNAERALNAEISKL 290 >gi|148666786|gb|EDK99202.1| anthrax toxin receptor 1 [Mus musculus] Length = 533 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 56 NEIYYFVEQLAHR-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 114 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 115 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 159 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 160 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 214 >gi|113205504|ref|NP_001037714.1| anthrax toxin receptor 1 [Rattus norvegicus] gi|111052876|gb|ABH03702.1| anthrax toxin receptor [Rattus norvegicus] gi|124297147|gb|AAI31854.1| Anthrax toxin receptor 1 [Rattus norvegicus] Length = 576 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 72 NEIYYFVEQLAHR-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 130 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 131 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 175 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 176 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 230 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 474 GSSSMGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLEFGFDKYNSKADILSA 533 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 534 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 577 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + + + C K + I I + Q L+ + P ++ + V + D+L Sbjct: 578 YD-DVRIPAMAACQK----GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 630 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 631 KIAPRIIQNICTE 643 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 38/146 (26%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 286 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 343 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 344 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 385 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + Sbjct: 386 SRVARESGINVFFITVEGAAERDIQH 411 >gi|32189436|ref|NP_473382.1| anthrax toxin receptor 1 precursor [Mus musculus] gi|17366052|sp|Q9CZ52|ANTR1_MOUSE RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|15987505|gb|AAL11999.1|AF378762_1 tumor endothelial marker 8 precursor [Mus musculus] gi|63100398|gb|AAH94544.1| Anthrax toxin receptor 1 [Mus musculus] Length = 562 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 58 NEIYYFVEQLAHR-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDT 116 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + II LTDGE + + + Sbjct: 117 YMHEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSER 161 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 162 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 216 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 71/226 (31%), Gaps = 58/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + QD +++ K L +D D +GL+ + + + Sbjct: 97 DVSNSMLAQDVLPSRLENAKQILSKLVD-------DMNDDKVGLVVFAGDAYTQLPITVD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ + + + T A+ A + + K Sbjct: 150 YVSAKMFLSNISPELVPRQGTAIGSALDLAIKSFGAK------------------SEAGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + I A EN I + I + + Sbjct: 192 AIILITDGENHEDDA----IGAAKLAAENNIIVNVIGMGKTDGAPIPVPGTMSFRKDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N Q + ++ + + N + ++ I++ + Sbjct: 248 NVVVSKLNEQMCKEIALAGKGVYVHADNTNG---AYKVINKELDKL 290 >gi|149019069|gb|EDL77710.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] gi|149019070|gb|EDL77711.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] Length = 2827 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 51/173 (29%), Gaps = 28/173 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 V + L+ Y+ +V + + T++ AM Sbjct: 409 AKSFEISPNRVQISLVQYSRDPHTEFTLKE-FNRVEDIIKAINNFPYRGGSTNTGKAMTY 467 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + K K +I +TDG KS+ K + + Sbjct: 468 VREKIF-------------VPNKGSRSNVPKVMILITDG-----KSSDAFRDPAIKLRNS 509 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 ++I + + + + L+ S P + + V + FQ IS + Sbjct: 510 DVEIFAVGVKDAVRSE--LEAIASPPAETHVFTVEDF----DAFQRISFELTQ 556 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 2280 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 2339 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 2340 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 2382 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 2383 -EVKKAAFIIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 2435 Query: 406 SQLMVH 411 ++ Sbjct: 2436 EDNLIT 2441 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + +K Sbjct: 1158 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGN 1217 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 1218 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 1259 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 1260 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFESLSKIVDDLTINL 1317 Query: 410 VH 411 + Sbjct: 1318 CN 1319 >gi|327284423|ref|XP_003226937.1| PREDICTED: anthrax toxin receptor 2-like [Anolis carolinensis] Length = 441 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 54/179 (30%), Gaps = 25/179 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D V + + I ++ + + ++ + ++ + + + T Sbjct: 65 VKQLTDRF-----VSPRMRLSFIVFSMQAKVILQLTENRAQIERGLEELRNVKPGGETYM 119 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K+A + + + II LTDG K Sbjct: 120 HEGIKEANRQIETA----------------GGQRTNSIIIALTDG-KLEGLIPQYAEKQA 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 D ++ ++ + + Q L++ S + + V +L + +I + Sbjct: 163 DISRRLGARVYCVGVL--NFNQEQLESIADSRDQVFPVKEGFQALRGIINSILKQSCTE 219 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/226 (10%), Positives = 65/226 (28%), Gaps = 50/226 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + D ++ K+ +L + L+ + + Sbjct: 96 DVSNSMLVADVSPDRLTRAKSGILRISKGLRDG-------RQALLLFAGSPLVQCPMTTD 148 Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + + T A+ A ++ + + K Sbjct: 149 HAAFEALLGMVSTELVSDQGTAFDSALNLAMRLFERTEPPGDVKEVQGE----------K 198 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS------------------- 372 I+ L+DGEN++ + D K++ + + TI + Sbjct: 199 VIVLLSDGENHSGN----FRAVADALKQSGVSVFTIVLGKPLPAAIPLGQSSGVKKDAAG 254 Query: 373 ------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + + S ++ D+ V+ +++ + Sbjct: 255 KIVKTRSSPETMRRLAGDSGGTFFDASEDDA---VYDRVAERISTL 297 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 181 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 220 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 221 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 276 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 277 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 325 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 326 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 385 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 386 VEEVTQNNFRLLFKG 400 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 70/189 (37%), Gaps = 30/189 (15%) Query: 233 NALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLIL 289 N+ F+ + ID +S +G+I ++ +VE T K ++ V + + Sbjct: 792 NSTKTFMKTVIDEVSANHATTRIGIINFSHKVELVSSLETYTTKESLKSAVDKM--LYLG 849 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + T + A+K+A + + +K + +TDG+ +N + V+ Sbjct: 850 EGTYTASAIKKAISLFQA-----------------ARPAVRKVALVVTDGQADN-RDKVH 891 Query: 350 TIKICDKAKENFIKIVTISI---NASPNGQRL--LKTCVSSPE--YHYNVVNADSLIHVF 402 + +A I+I I I L + + P+ + Y + + +L + Sbjct: 892 LDLVVKEAHAANIEIFVIGIVQKTDPHYHNFLKEMHLIATDPDEEHFYQIEDFKTLSALA 951 Query: 403 QNISQLMVH 411 + + Sbjct: 952 DKLITKICE 960 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 56/163 (34%), Gaps = 25/163 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 KN +L D+I + VK +V + + +++ V + + + ++ + Sbjct: 63 FDLQKNFVLSLTDNIFQMKPVKSQNYNVKLAGMQFSSTVSIDHPFTAWKNVQNFKEKIRA 122 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + I + T S A+ A Q+ ++ + K +TDG ++ Sbjct: 123 LV--YIGQGTYSYYAISNATQLFKTEGRERSI----------------KVAFLMTDGVDH 164 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 N I A+ I TI ++ + L+ Sbjct: 165 PNSPN--VQGIATAARSLGIHFFTIGLSKKNVKEEKLRLISDD 205 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 96/265 (36%), Gaps = 59/265 (22%) Query: 165 LNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + R + + S +P + + + LV+D+SGSM+ Sbjct: 279 DDFRNSSVVNSLVPPFETTFELLQTQDRA-VSLVLDVSGSMNTN---------------- 321 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVT 281 ++ L+ A +FL + ++ +G++ + + + T RQ + Sbjct: 322 ---NRITNLRTAAEVFL-----IQIIEIGSRVGIVTFESSAYEKSPLLQITSVATRQRLV 373 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +++ + T +++ +I+T+ + + I+ LTDGE+ Sbjct: 374 QNLPTTAGGGTKICAGIEKGLEIITNAIGTT----------------YGSEIVLLTDGED 417 Query: 342 NNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNADS 397 + T+ +C +K KE+ I TI++ + L + + Y +V Sbjct: 418 S-------TMSLCREKVKESGAIIHTIALG-PSAAKELEEFSNITGGLQLYAVDVDVPSK 469 Query: 398 LIHVFQNI---SQLMVHRKYSVILK 419 L+ F I S + + + K Sbjct: 470 LVEAFSEITTGSGDISEQSIQLESK 494 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 62/226 (27%), Gaps = 30/226 (13%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E + + +++D SGSM ++ V+ + + ++L + + Sbjct: 24 AEKTETEAKAPANVAVLLDASGSMAKRIDG----VSKFNSAKKEISKFASSLPEGTQVKM 79 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + K + V + + ++ T A+ Sbjct: 80 SVFGSEGNNKNSGKVQSCEAIRNVYGFQSFNE-----QSFLNSLNTIGPTGWTPIAKALN 134 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A +K + LTDGE N IK + ++ Sbjct: 135 EAKSSFDQ-----------------LDAKGEKVVYLLTDGEETCG---GNPIKTAKELQK 174 Query: 360 NFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + I + +L ++ V + +F Sbjct: 175 ENITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPVYTQKDVEKIFTQ 220 >gi|54124354|gb|AAV29939.1| anthrax toxin receptor [Rattus norvegicus] Length = 245 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 61/175 (34%), Gaps = 19/175 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F++ + + + M I ++TR ++ + E++RQ + L T Sbjct: 1 IYYFVEQLAHR-FISPQLRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYM 59 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++A + + II LTDGE + + + Sbjct: 60 HEGFERASEQIY--------------YENSQGYRTASVIIALTDGELHEDLFF-YSEREA 104 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 105 NRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 157 >gi|332233322|ref|XP_003265851.1| PREDICTED: anthrax toxin receptor 2 [Nomascus leucogenys] Length = 488 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKHVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|311252475|ref|XP_003125114.1| PREDICTED: anthrax toxin receptor 1-like [Sus scrofa] Length = 564 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 64/177 (36%), Gaps = 19/177 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F++ + + + M I ++T+ ++ + E++RQ + L T Sbjct: 60 NEIYYFVEQLAHK-FISPQLRMSFIVFSTQGTTLMKLTEDREQIRQGLEELQKVLPGGDT 118 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A + + + + II LTDGE + + + Sbjct: 119 YMHEGFERASEQIYYENR--------------QGYRTASVIIALTDGELHEELFF-YSER 163 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 +++++ + + + + L S ++ + V + +L + +I + Sbjct: 164 EANRSRDLGAIVYCVGVKD--FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 218 >gi|329850448|ref|ZP_08265293.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] gi|328840763|gb|EGF90334.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] Length = 395 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/314 (11%), Positives = 83/314 (26%), Gaps = 23/314 (7%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 +I+ LS + G + + + ++ M++A +AA+L G + + Sbjct: 1 MIYGLSATLVIAAGGGALDLFNASNLRHDMQNALDAAVLTGVRASSQMGTSAS---NAFK 57 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + + ++ + S S S V + S M + N L S + Sbjct: 58 QNVADDMDGASQSYSSAVSSSSASSSSYVTTTLTGTASLESPTYFMKLLGLNDLTVSVKS 117 Query: 144 ----IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 Y L + V + + ++ P + L Sbjct: 118 VAQGTTTIAPAGKPCIYVLDPSGSQALLVNSGANVQALSCEIHVKSTGNPAAIFNSGSSL 177 Query: 200 SGSMHC--------AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI--DLLSHVK 249 + C + P V D + ++ + + ++ Sbjct: 178 NFKKLCVQGTNIIKNSVTVPNLVTGCAASGDPYAGTLPTPASSTCTYSNQNYSAATQNLT 237 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 VY G + + G YV ++ + T + Y + + Sbjct: 238 PGVYCGWFNFNNSSATV-NFAPGV-----YVIKNGGWNVNGGTWKGTGVTFYYADTSKIQ 291 Query: 310 KRSFFTNFFRQGVK 323 S + Sbjct: 292 FNSGISADLSAPSS 305 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 71/223 (31%), Gaps = 57/223 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + D K ++ K + D +D + +GL+ + + + Sbjct: 98 DVSKSMLAGDVKPDRLTRAKQLISKLADKLD-------NDRVGLVVFAGNAYLQMPLTID 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+T D + + T A++ A +++ K Sbjct: 151 YSAAKMYLTTVSPDMIPTQGTAIGQAIQVANDAFNKKERK------------------HK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I ++DGE++ + I A E+ + I TI I + Sbjct: 193 SLIIISDGEDH----DEAAISKARAAFEDGVVINTIGIGSPTGSPLPDPETGTYKKDKEG 248 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N L + + ++ N V +++Q + Sbjct: 249 NTVISKLNEDALKSIAAAGKGIYEHLDNNTE--EVVNSLTQKI 289 >gi|110833328|ref|YP_692187.1| pilin biogenesis-like protein [Alcanivorax borkumensis SK2] gi|110646439|emb|CAL15915.1| pilin biogenesis related protein [Alcanivorax borkumensis SK2] Length = 1009 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 41/121 (33%), Gaps = 8/121 (6%) Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN-------FKSNVNTIKICDKA-KE 359 D+K+ + + I+ TDGE N+ S I D ++ Sbjct: 267 DEKKIVDNGNYVSPRNTANECESNHIVLFTDGEANDVNLPCGGGSSYDCQRAISDYLDRD 326 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 IK + ++ N + + Y +A+SL F +I L+ S+ Sbjct: 327 FEIKTYNVGLHMEDNRADMETVSSDGADGTYTASDAESLASAFLDIFDLIEKESRSIAAP 386 Query: 420 G 420 G Sbjct: 387 G 387 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/299 (11%), Positives = 85/299 (28%), Gaps = 30/299 (10%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRY--------NQKIVSFIPALLRIEMGERPIFLI 193 ++ + V + + + +K I + ++ ++ Sbjct: 47 SSSAGQLGVKHNIMLFIDDSGSMKWIPGEKRPPRYYGEKSRLAITKSVLNKVLDK----- 101 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + S+ N+ D + ++ ++ A + + + Sbjct: 102 -YQDQFNWSLQTLHNNGRTDTADFKTPWKSVKDRVDGIEAANG--TPTTRRYYEIVSGIV 158 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-------ILKPTDSTPAMKQA----- 301 M + Y + + S G + Q R S D + A Sbjct: 159 MPSVEYRCQKAFVVLMSDGDANLSQACGRYSSSRNRSFVYPAGYAYDDYFGRQHAGACRD 218 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + D T K+ S+ F+ + + T K + ++ Sbjct: 219 IDGGSYDTFWDRNTGLSFFSRKLSSIDFKTSGLDKAGKSWQGDPKDPKTAKNPNGFRDQL 278 Query: 362 IKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 ++ TI + G+ L S +Y+++ + D L F I+ + +V++ Sbjct: 279 VQTFTIGFGRDISSAGRNYLTNGASRDDYYFSAESEDDLYRAFDTITDSIKDESQNVVI 337 >gi|298375542|ref|ZP_06985499.1| BatB protein [Bacteroides sp. 3_1_19] gi|298268042|gb|EFI09698.1| BatB protein [Bacteroides sp. 3_1_19] Length = 574 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 59/207 (28%), Gaps = 26/207 (12%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI------- 324 + +++ S++ + T A+ A + T D+ G Sbjct: 150 YISAKMFLSSINPSMVSTQGTAIGAAINLAARSFTPDETTDKAIILITDGENHEDDAIGA 209 Query: 325 --PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + G+ + K K+ + I N Q + Sbjct: 210 AKAAAEKGIHVNIVGMGDPKGSPIPIQGSNNYMKDKDGNVVI------TKLNEQMGQEIA 263 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + + N + + + + Sbjct: 264 AAGNGMYVRADNT---NSALKALQKEI 287 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 257 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 296 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + + R + L T+ AM A Q+L Sbjct: 297 Q-FNLIVFSTEATQWRPSLVPASAENVNEARSFAAGI---QALGGTNINEAMLVAVQLLD 352 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 353 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 401 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 402 CLGFGFDVSYAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 461 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 462 VEEITQNNFRLLFKG 476 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + + R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNEARSFAAGI---QALGGTNINEAMLVAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 365 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEITQNNFRLLFKG 488 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/255 (14%), Positives = 73/255 (28%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + + R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNEARSFAAGI---QALGGTNINEAMLVAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 S + + II LTDG+ ++N +I+ +A + Sbjct: 365 SSNQEERLPDGSVS-----------LIILLTDGDPTVGETNPRSIQKNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALENGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEITQNNFRLLFKG 488 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 69/186 (37%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 91 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 147 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V + T S A++ A + + G + PS K I + Sbjct: 148 KQAVGQITPLST--GTMSGLAIQTAMDEVFT----------AEAGARGPSSNIPKVAIIV 195 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 196 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMQS--LKMMASEPLEEHVFYVET 248 Query: 395 ADSLIH 400 + Sbjct: 249 YGVIEK 254 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 70/231 (30%), Gaps = 56/231 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D + ++ K A+ D + +GL+ + + + Sbjct: 97 DVSNSMLAEDFQPNRLERTKYAIDKLFDGLK-------QDRVGLVVFAGDAVVQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + R S++ + TD A+ A + P + Sbjct: 150 YRMAKAFARRISPSMVSVQGTDIGQALSLATMSFSE----------------KGDNPAGR 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG---------------- 375 I+ +TDGE ++ + + E I+I TI I Sbjct: 194 VIVLITDGEGHDSGAIEAAERA----AEQGIRIFTIGIGTPEGAPIQIGGEFIKDDKGEM 249 Query: 376 -------QRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ----LMVHRKY 414 L K ++ + N L + + I+ + ++ Sbjct: 250 VVSKLGEPLLEKIAQATDGAYIRSTNQSIGLDEIVRTINNMEKGDLAALRF 300 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 71/203 (34%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 321 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATVVNKLIQIKSSDERNTLMAGL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T +K A+Q++ + + ++ LTDGE+N Sbjct: 376 PTYPRGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 417 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + +++ + H+ V + LI Sbjct: 418 -----TASSCIDEVKQSGAIVHFIALGRDAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALTSGNTDLSQKSLQLESKG 494 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 67/203 (33%), Gaps = 47/203 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + ++ +GLI + + + Sbjct: 98 DISNSMLAEDVAPSRLGKSKLIVENLMNKFSE-------DKIGLIVFAGDAFVQLPITSD 150 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A++ + T + K Sbjct: 151 YVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFTP------------------NSKVGK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 II +TDGE+N + ++ +A+ I++ + + S+ Sbjct: 193 AIILITDGEDNEGGA----EEMAKQAQSKGIRVFILGVG-------------STEGATIP 235 Query: 392 VVNADSLIHVFQNISQLMVHRKY 414 + + L + +V + Sbjct: 236 MPDGSELKTS----NSEVVKTRL 254 >gi|290996921|ref|XP_002681030.1| predicted protein [Naegleria gruberi] gi|284094653|gb|EFC48286.1| predicted protein [Naegleria gruberi] Length = 353 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 69/205 (33%), Gaps = 29/205 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K++TKM + +D++ +E +G++ + + E + + + ++ + Sbjct: 2 SSKEKTKMQVANQVICEIIDNL------REFERLGIVLFDHKAETLLPLTIVQDLDKKSL 55 Query: 281 TRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T+ M++ + ++ II+LTD Sbjct: 56 KETVLKIEEQGSTNFEAGMQRGIDLFSTLDSSDLSN--------------SNRIIYLTDA 101 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 N ++ I + A I + I I N + + +++V + + Sbjct: 102 CPNVGGTDTLDI-LTKDANSGPYNIFSTFIGIGLDFNSDIVEELTRVRGCNYFSVRSTED 160 Query: 398 ----LIHVFQNISQLMV-HRKYSVI 417 L F I + + K S+ Sbjct: 161 FTKILNQDFNYIVTPICYNVKLSLE 185 >gi|148686422|gb|EDL18369.1| mCG2843 [Mus musculus] Length = 1122 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 74/234 (31%), Gaps = 27/234 (11%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V ++ +M C + L + D V Sbjct: 86 NTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVSPTFQVVN 145 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSD 308 +G++ Y V + + V + T + + A + ++ Sbjct: 146 SFAPVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEAFTE 205 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +K ++ +TDGE+++ I+ C+ + I+ +I+ Sbjct: 206 ARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIA 249 Query: 369 INASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVHR 412 I N L +K+ S + ++ +NV + +L+ + + + + + Sbjct: 250 ILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 303 >gi|257127006|ref|YP_003165120.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] gi|257050945|gb|ACV40129.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] Length = 509 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/282 (8%), Positives = 69/282 (24%), Gaps = 72/282 (25%) Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + ++ + + L + + ++ ++ + + E Sbjct: 135 FSYVIFKDDMANPFTKQKIEENMNVNLEIILDASGSMKQKISDKTMMEIAKESIEKVVSE 194 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + + + H Sbjct: 195 -----MPANTKV-----------------------------------------GLRVFGH 208 Query: 248 VKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ + E + +K++ + T +++ L Sbjct: 209 KGDNTASKKQESCSANELISPIETLDKDKLKSSLAPI---QPTGWTSIAKSIENGTNDLK 265 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKI 364 + + +TDG + N ++ K K I + Sbjct: 266 ----------------ALKGEKTLNILYIITDGIETC---DGNPVETAKKFKNENTDIVL 306 Query: 365 VTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 I N + ++LK ++ Y+ + +A L Q I Sbjct: 307 GIIGFNVDAHQNKVLKEIANAANGYYSSANDAAKLTEELQRI 348 >gi|260825786|ref|XP_002607847.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] gi|229293196|gb|EEN63857.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] Length = 187 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 56/183 (30%), Gaps = 26/183 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K +++ +G++ Y+T +K D S Sbjct: 29 FDKVKEFTENVVNAF---DISASSTRVGVVQYSTSNTLEFNLGDHADKPSTLAAIDSISY 85 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A++ A R K +I +TDG++ + ++ Sbjct: 86 QGGGTRTGSALEFA-----------------RLNAAWRGGSVPKVMIVVTDGKSGDSVAS 128 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + I + + + +LL+ + + + ++L I+Q Sbjct: 129 S-----ANDLASQGVDVYAIGVG-NYDATQLLEIAAGNQNNVIELTDFNALSAEINQIAQ 182 Query: 408 LMV 410 + Sbjct: 183 TVC 185 >gi|331003698|ref|ZP_08327192.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412081|gb|EGG91476.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] Length = 528 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/252 (13%), Positives = 84/252 (33%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K++ ++E E+ + ++D+SGSM K Sbjct: 145 NPDTKLMMIGMQAKKVEESEKKPSNLVFLIDVSGSMDEPD-------------------K 185 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +KNA LL + + + ++ Y + +E + G++ ++ ++ D Sbjct: 186 LPLVKNAFLLLCEELKEND------TISIVTYAGYDQVVLEGASGSDS-KEIMSAIEDLE 238 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + +K AY+I K +I TDG+ N S Sbjct: 239 AAGSTAGSDGIKTAYKIAKKYFKSDGNN----------------RVILATDGDLNVGITS 282 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 ++ K KE+ + + + ++ + + + + + + Sbjct: 283 EGKLTRLIKKEKESGVFLSVLGFGTENIKDNKMEALADNGNGNYSYIDSRFE---AKKVL 339 Query: 406 SQLMVHRKYSVI 417 S+ + ++V Sbjct: 340 SEELGANFFTVA 351 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 53/194 (27%), Gaps = 32/194 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQY 279 K+ + AL+ LD + LI +++ + + + Y Sbjct: 288 GRKIKQTREALIKILDDLSPHD------QFDLISFSSEATTWKPLLVPASTENVNEAKSY 341 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T T+ AM A Q+L + II LTDG Sbjct: 342 ATGI---QAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT-----------LIILLTDG 387 Query: 340 ENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVV 393 + + + + + + L K + + + + Sbjct: 388 DPTVGETNPSNIQKNVRKAINGQH-SLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSD 446 Query: 394 NADSLIHVFQNISQ 407 +A L +Q ++ Sbjct: 447 SALQLQDFYQEVAN 460 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 474 GSSSMGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLEFGFDKYNSKADILSA 533 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 534 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 577 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V + D+L Sbjct: 578 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 630 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 631 KIAPRIIQNICTE 643 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 38/146 (26%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 286 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 343 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 344 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 385 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + Sbjct: 386 SRVARESGINVFFITVEGAAERDIQH 411 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 74/247 (29%), Gaps = 51/247 (20%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++ P L + +R + IE+ VD+SGSM+ + T+ Sbjct: 63 AVLLCAGPRRLGEPIDKRKLTNIEICVDVSGSMN---------------NPFGRATRYDG 107 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL- 289 A+ F ++ GL + V + + + Sbjct: 108 AMEAVTAFT-------SYRQGDAFGLTFFGNEVLHWCPLTTDVSAINCATPFMRPGQLPP 160 Query: 290 --KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A++ L I + I+ +TDG++ +F +N Sbjct: 161 WFGGTLIAKALRACKAEL------------------IKRPEGDRMIVLITDGDSQDF-AN 201 Query: 348 VNTIKICDKAKENFIKIVTISINAS-------PNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ ++ K I + + I + + + + ++L Sbjct: 202 GADAEVAEELKAEGITVFAVVIGNDRQFQNPIIRNGSVQTVTARTGGESFEAGDPNALAT 261 Query: 401 VFQNISQ 407 VF+ I + Sbjct: 262 VFKRIDE 268 >gi|114594050|ref|XP_526573.2| PREDICTED: anthrax toxin receptor 2 isoform 6 [Pan troglodytes] Length = 489 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|114594046|ref|XP_001145091.1| PREDICTED: hypothetical protein isoform 4 [Pan troglodytes] gi|114594048|ref|XP_001145172.1| PREDICTED: anthrax toxin receptor 2 isoform 5 [Pan troglodytes] Length = 488 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|114594044|ref|XP_001144933.1| PREDICTED: hypothetical protein isoform 2 [Pan troglodytes] Length = 508 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 18/176 (10%), Positives = 59/176 (33%), Gaps = 7/176 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + FR ++ +++ ++ +S+ + + I KN +++A + A LA Sbjct: 1 MLAFRRSGRR--KAQQGLTLVVMTVSMAVIVGVAALSIDANHLMVSKNRLQNALDTAALA 58 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 GA+ + + + K + + + S + I+ N Sbjct: 59 GATVANRTYEEDDAKEAIVEAYNK-----VTSAAGNDELVLAASDDGTSLKSLTIEYSDN 113 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ ++ ++ N+++ L + + + + + + + Sbjct: 114 ANSGFSSNFPSSADYIYVRLQVSNVELSEYLAGLLGYSKSINSSTVAGPVFADRTS 169 >gi|324499923|gb|ADY39979.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 2123 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 63/179 (35%), Gaps = 21/179 (11%) Query: 229 AALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMD 285 + +L FL +L + + ++ Y+ R+ + + + V + R Sbjct: 461 DVFRGEVLRFLKEFVELFDISSDRTRVAVVQYSDRIRHEFDLNQYSSIQNVEDAIDRIQ- 519 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A++ R+G + S + I +TDG + + Sbjct: 520 -YMTGLTRTGAAIEH----------VRNEAFNERRGARPLSDKISRVTIVITDGRSQDNV 568 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S +A++ I++ + + + L+T + + +++V L ++ Sbjct: 569 SLP-----AQQARQQHIQLFAVGV-TNHVLDSELETIAGAKDRYFHVTGFKDLNARLRS 621 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 59/170 (34%), Gaps = 18/170 (10%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ + +I Y+T E + + ++++ + + + + T +++A + + Sbjct: 106 TNPNLRISIITYSTEAEVILPLTSDSKEINKSLLVLKNIVPQGLTHMQKGLRKANEQIRK 165 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 II LTDG K ++T++ KA+ + T+ Sbjct: 166 STLG--------------GRIVNSVIIALTDGLLL-LKPYLDTMEEAKKARRMGAIVYTV 210 Query: 368 SINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRKYSV 416 + ++ L P+ + V +L V ++ SV Sbjct: 211 GVFM--YSKQQLVNIAGDPDRCFGVDEGFSALEGVVDPLTSKSCTEILSV 258 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 257 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 296 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 297 Q-FNLIVFSTEASQWRPSLVPASAENVNKARSFAVGI---QALGGTNINDAMLMAVQLLD 352 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 353 SSNQEEQLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 401 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 402 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 461 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 462 VEEVTQNNFRLLFKG 476 >gi|58395788|gb|AAW72737.1| thrombospondin related adhesive protein [Plasmodium reichenowi] Length = 565 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 58/175 (33%), Gaps = 25/175 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + +K+A+ L + I L+ + +++ ++ + I K ++ Sbjct: 54 CSGSIRRHNWVKHAVPLAIKLIQQLNLNESAIHLYASIFSNNAREIISLHSDASKNKEKA 113 Query: 281 -----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + +L T+ + A+ Q + L I + ++ Sbjct: 114 LIIIKSLLNTNLPYGKTNLSDALLQVRKHLNDR---------------INRQNANQLVVI 158 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEY 388 LTDG N+ + + +K K + +KI I N R L C S Sbjct: 159 LTDGIPNSIQDS---LKESRKLNDLGVKIAVFGIGQGINVAFNRFLAGCHPSDGK 210 >gi|89100226|ref|ZP_01173093.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085076|gb|EAR64210.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 463 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/385 (10%), Positives = 115/385 (29%), Gaps = 54/385 (14%) Query: 28 LSVMSFLLLIGFLIYVLD-WHYKKNSMESANNAAILAGASKMVSNLSRLGDR-FESISNH 85 + +S ++ + K++ N+AA + + + + N Sbjct: 10 ILFLSIFIITACSKESGNNTEQKQSK---GNDAAAIGQVDESEDTENAEDAEDAKEQINL 66 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 A+ I + I ++ K + + E+Q ++ ++ Sbjct: 67 AENKKIPASLEEIISYPKGPFTAEDTQIKDPEVQQALS------------KVPELPEEAS 114 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 F + + R ++ E+ F +E+++D SGSM Sbjct: 115 EEELNDLFAYLYSLFRKEYRDPREAIVSLTVSGPESEGSSEEKGSFNVEIILDSSGSMAN 174 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 M +T+M K ++ F ++ ++ +GL Y + Sbjct: 175 KM---------------GSQTRMELAKASIKKFASALPEEAN------VGLRVYGHKGT- 212 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK-- 323 G++ ++ + + + A + Sbjct: 213 ------GSDADKKMSCASNELVYAPQPYIEAELNTALNKFKPAGWTPLAQSLMEAQKDLE 266 Query: 324 -IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASP-NGQRLL 379 + + ++DG + N ++ K++ + + I + + + Q+L Sbjct: 267 AYKGEKNKNIVYVVSDGIETC---DGNPVEAAASLKDSGVAPVVNIIGFDVNGKDQQQLE 323 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQN 404 + ++ + NV + + L + + Sbjct: 324 EVAQAAGGTYQNVKSQEQLDNELEK 348 >gi|325496693|gb|EGC94552.1| hypothetical protein ECD227_0790 [Escherichia fergusonii ECD227] Length = 496 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 80/253 (31%), Gaps = 45/253 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + I I + P + ++D SGSM+ Sbjct: 110 PWNEKHTLLRLDIA-ANDIARSKLPPANLVFLIDTSGSMN-------------------S 149 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K++L L ++ + + ++ Y + + G+EK + Sbjct: 150 DERLPLIKSSLKLLVNELRDQD------RISIVTYAGSARLLLSSTSGSEK-NTILNAIA 202 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ + AY+ + + I+ TDG+ N Sbjct: 203 NLQAGGGTNGGAGVAMAYEQAQAGYIKGGVN----------------RILLATDGDFNIG 246 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + K +E+ I + T+ + + + ++ + +Y DSL + Sbjct: 247 DDPASVEDLVKKQRESGITLSTLGVGDNNYNEEMMVKIADTGNGNY--SYLDSLSEAQKV 304 Query: 405 ISQLMVHRKYSVI 417 +S M +V Sbjct: 305 LSNEMNQTLVTVA 317 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 + + L +G+I Y+++V+ + D + + T + Sbjct: 55 QFLVGLLRGLKVGANATRVGVIQYSSQVQSVFPL-RAFSRREDMERAIRDLVPLAQGTMT 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + S +G + P + + +TDG + ++ Sbjct: 114 GLAIQYVMNVAFS----------VAEGARPPEERVPRVAVIVTDGRPQD-----RVAEVA 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 +A+ I+I + + + G L+ S P E+ + V + D + Sbjct: 159 AQARARGIEIYAVGVQRADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|326433400|gb|EGD78970.1| hypothetical protein PTSG_11807 [Salpingoeca sp. ATCC 50818] Length = 2673 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 62/181 (34%), Gaps = 20/181 (11%) Query: 230 ALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ +L F+ + + + D + + + + NI + + Sbjct: 680 TFQDRVLAFVSQVTTYFTIGEHDTRVAVATFASGATVNIRLNDHFDGDALRDAIADIPYP 739 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T ++ ++ Q + ++ G++ S + ++ LTDG N + + Sbjct: 740 QGQTYTSLGLRAVRQDILTEAN----------GMRPASEGVPRVLVVLTDG---NSQPSY 786 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD---SLIHVFQ 403 + + + + I + +S Q L+ S P+ + +N+ + ++ Sbjct: 787 DPATEASILHDQNVNVFAIGVGSSI-SQSQLEDIASDPDARHVFNLRSFSLIGDIVDAMS 845 Query: 404 N 404 Sbjct: 846 A 846 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 65/188 (34%), Gaps = 20/188 (10%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ L A+ LD+ ++ ++ + +I + +V+ ++ + + + Sbjct: 28 SGDKIDNLNKAVENMLDTFAQEEKMETEILVSVITFGGKVDLHVPFT------KASQVQW 81 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T A+K A ++ + PS ++ I+ ++DG+ + Sbjct: 82 HGLQVNGDTPMGTALKMAKAMIEDKE-------------TTPSRAYRPTIVLVSDGQPTD 128 Query: 344 FK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + ++I + L + + + NA L F Sbjct: 129 GNIWKQAMADFISEGRSSKCDRMAMAIGHDADETVLKRFIEGTAHDLFYAENAGQLHEFF 188 Query: 403 QNISQLMV 410 Q ++ + Sbjct: 189 QRVTMSVT 196 >gi|194223903|ref|XP_001494710.2| PREDICTED: integrin, alpha 1 [Equus caballus] Length = 1208 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 73/203 (35%), Gaps = 27/203 (13%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + K+ D ++ + + +G++ Y V + + V + Sbjct: 209 SNSIYPWKSVTDFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANK 268 Query: 285 DSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T + + A + ++ + +K ++ +TDGE+++ Sbjct: 269 IVQRGGRQTMTALGIDTARKEAFTEARG-------------ARHGVKKVMVIVTDGESHD 315 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVV 393 I+ C+ + I+ +I+I N L +K+ S + ++ +NV Sbjct: 316 NHQLNQVIQDCE---KQNIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVS 372 Query: 394 NADSLIHVFQNISQLMVHRKYSV 416 + +L+ + + + + + + +V Sbjct: 373 DELALVTIVEALGERIFALEATV 395 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/230 (11%), Positives = 71/230 (30%), Gaps = 47/230 (20%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D+SGSM K+ L+ AL + +D Sbjct: 232 ASNLVFLLDVSGSMASDD-------------------KLPLLQTALKMLTQQLDEQD--- 269 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ Y ++ + G ++ T+ ++ AYQ+ Sbjct: 270 ---KVSIVVYAGAAGVVLDGAAG-NDIKILTYALEQLTAGGSTNGAEGIQLAYQLAQKHF 325 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTIS 368 + +I TDG+ N +N + + + K++ I + T+ Sbjct: 326 VKGGIN----------------RVILATDGDFNVGTTNLDELVDLVEVQKKHGIGLTTLG 369 Query: 369 INASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 L++ + + + + + + + + + +++ Sbjct: 370 FGMGNYNDHLMEQLANKGNGQYAYIDSVNE---ARKVLVEQLGATLLTIV 416 >gi|307294186|ref|ZP_07574030.1| TadE family protein [Sphingobium chlorophenolicum L-1] gi|306880337|gb|EFN11554.1| TadE family protein [Sphingobium chlorophenolicum L-1] Length = 126 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 33/106 (31%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + I +E +I F L++ FL+L+ I + + ++ A + + Sbjct: 1 MMRLIRNECGAAAIEFVLALPPFLILLMGAIQLGVIACARTGLQHAVDEGARYASIYPTP 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +++ R S A + + + + Sbjct: 61 TDAQITARVTSREFGLDPAYSNVPTITHGTQYGVPYTEITMTYSRP 106 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 57/164 (34%), Gaps = 19/164 (11%) Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 FL + + + + G++ ++ E +I + D S + T Sbjct: 60 KDFLKKLANSFGISNKGSHAGVVTFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRI 119 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +A ++ + LTDG+ + + I Sbjct: 120 DLALAKALELFDIKNGARND--------------VPNLLFLLTDGK---QEPEMPLTHIS 162 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 D+ K+ I++ + I A N + L+ V +PE + V + D L Sbjct: 163 DEIKQKGIQLFAVGIGAGAN-KTELEKIVGNPENVFMVDDFDKL 205 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 56/171 (32%), Gaps = 21/171 (12%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 D + L DV +G+ Y++ + S + ++ + T + ++ Sbjct: 845 DLVSNLMSGDNDVRVGVAEYSSTYTQIVFPFSSSQSAIDSSLSSMI--QTAGATATGTSL 902 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 +A + + +I +TDGE ++ + D + Sbjct: 903 GEAADDI----------------GSTARSSAARVLILMTDGETSDGDEQNIDPSV-DALR 945 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I I + S + LL+ SS ++ +N + L I + Sbjct: 946 ALGVSITAIGVGNSASESELLQIAGSS-DHVFNNIAFVDLSSFINQIIGQI 995 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 56/173 (32%), Gaps = 19/173 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK+ + S+ + + + +++ + S+ + + Sbjct: 466 YQLLKDFTSALVRSLT---VSSTNARVAIELFSSEPQIETGFSYDESYLISVINSLPHLK 522 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ A Q + SD F S F I +TDG N+ + Sbjct: 523 L--GTATGEALRMARQDIFSDNDALF-----------RSFSVPAFAIVITDG--NSLEDA 567 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +E+ +++ + + + + L + V + L Sbjct: 568 SYVAEQARRLREHGVQVFALGVGSQITVSQ-LVDIAGDNARVFGVADFGVLNA 619 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 59/181 (32%), Gaps = 21/181 (11%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + ++ ++ + I + + + T + + ++L Sbjct: 1260 FQTSIQFLRALVNGADIESSGSRIAAITFCSEPTLLTDYVSTTSEALDALNTASNTLTCG 1319 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ + + + + ++ +I +TDGE + + Sbjct: 1320 -TATGAALDFVRENILT---------------DRSNSGARRVVIVITDGE--SQEDFSVV 1361 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + + I + + + L+ SS + + + D+L ++ +I QL+ Sbjct: 1362 QNAGARLQAEVDDVYAIGVGSGTDLAE-LRVIASSDDNTFQEASFDNLDNI--DILQLLC 1418 Query: 411 H 411 Sbjct: 1419 T 1419 >gi|198424353|ref|XP_002120419.1| PREDICTED: similar to mCG120740 [Ciona intestinalis] Length = 1650 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 71/193 (36%), Gaps = 29/193 (15%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KV 276 + +M ++ A+ F+D++ + + ++G++ + +R + T Sbjct: 381 ISTSMENYGRMGLMRQAVSNFIDTVPM------NTWVGIVVFASRANTLARLTEITSYDA 434 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + + + T + + ++L + RS + II L Sbjct: 435 RNILKTRLVNTTVVGTSIGSGIMKGLEVLETSGPRSLRGSGGS-------------IIIL 481 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG +N +TI ++ +E +++ TI++ ++ + L S + + Sbjct: 482 TDGLEHNNPKINDTI---ERVREFGVRVSTIALGSNV-AKDLEWLASVSNGRTHAASSGQ 537 Query: 397 -----SLIHVFQN 404 L F + Sbjct: 538 FGIDAELQEAFAS 550 >gi|149036652|gb|EDL91270.1| anthrax toxin receptor 1 [Rattus norvegicus] Length = 457 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 54/156 (34%), Gaps = 18/156 (11%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 M I ++TR ++ + E++RQ + L T ++A + + Sbjct: 1 MSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY------- 53 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II LTDGE + + + +++++ + + + Sbjct: 54 -------YENSQGYRTASVIIALTDGELHEDLFF-YSEREANRSRDLGAIVYCVGVKD-- 103 Query: 374 NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQL 408 + L S ++ + V + +L + +I + Sbjct: 104 FNETQLARIADSKDHVFPVNDGFQALQGIIHSILKK 139 >gi|239629502|ref|ZP_04672533.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239528188|gb|EEQ67189.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 909 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/418 (7%), Positives = 98/418 (23%), Gaps = 53/418 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +F + ++ +L G + ++ + A + ++ G + Sbjct: 14 LFVVLLILMSMLTGLVTSGSSVVTAAANIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 69 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 70 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 129 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE----MGERPIFLIELVVD 198 + + ++ + + + V Sbjct: 130 GNTQQNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVRTGVKNFLTSIQNAGLGDYVNVGL 189 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S + + I L S + Sbjct: 190 IGFSSPGYIGGGNKTTGPGYIRVG----------------LGKAGNTSQQQAINSALSPT 233 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I G+ +++ + + K S N Sbjct: 234 FNGGTYTQIGLRQGS--------AMLNADTSGNKKMMILLTDGVPTFSDKVKNSVVENGT 285 Query: 319 RQGVKIPSLPFQKFII------FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-- 370 G + + ++ + + + T+ KAK++ + + I Sbjct: 286 LYGTNFGTTRDEPGYTAELKNPYIDSSGDYIYDTWPATLGEAKKAKDSGNVVHALGIQLA 345 Query: 371 -----------ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + ++ ++ +SP+ + + +AD++ N ++ +V ++I Sbjct: 346 GDDGYWSEGYMSDEDVRQNMELIPTSPDLYEDADSADAVEAYLNNQAKDIVKNFNTII 403 >gi|242072180|ref|XP_002446026.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] gi|241937209|gb|EES10354.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] Length = 434 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 80/238 (33%), Gaps = 38/238 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ER + V+D+SGSM + N ++ K A+ + S+ + Sbjct: 8 ERAPVDVVAVLDVSGSMAWDYGNGTTVENH----------RLELAKEAMAKAIQSLGPAA 57 Query: 247 HVK-----EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST-PAMKQ 300 + ++ ++ V++ + + +Q V +D+L +K Sbjct: 58 AAVAAGGARRNRLAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMPLKI 117 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +IL K IIF++DG+++ F+ + + + K Sbjct: 118 AAKILDERKAEEKDRLA--------------IIIFVSDGQDHYFRDTDDMKETLTQHKLI 163 Query: 361 FIKIVTISINASPN------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 I ++ S + L ++ + ++ D + + +++ + Sbjct: 164 KYPIHAFGVSVSEQDSSGGGAKALRAMADATSGSYTSITQDDDVDTM--AVAEKLAQL 219 >gi|284989271|ref|YP_003407825.1| hypothetical protein Gobs_0677 [Geodermatophilus obscurus DSM 43160] gi|284062516|gb|ADB73454.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 146 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 36/128 (28%), Gaps = 6/128 (4%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 E+ + A+ V LL++G + + + A AG + + + L Sbjct: 13 ERGAIGVFLAVLVPGLLLIVGLAVDGGAKVAATQRANAIADEAARAGGQALDLS-AALTG 71 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + A A+ + +R + ++ ++ + + L Sbjct: 72 QVRVDPAAAVAAVQNYLERS-----GVQGAVTVVDGDTLQVSTTISEPTTFLGLIGITTL 126 Query: 138 DSSNNTIF 145 Sbjct: 127 TVEGTGTA 134 >gi|46581745|ref|YP_012553.1| hypothetical protein DVU3344 [Desulfovibrio vulgaris str. Hildenborough] gi|46451168|gb|AAS97813.1| hypothetical protein DVU_3344 [Desulfovibrio vulgaris str. Hildenborough] gi|311235373|gb|ADP88227.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 389 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 12/154 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 K+ + E+ N ++I ALS + L L + + + K++ M+ A + A LAGA + Sbjct: 2 SLLKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQAL 61 Query: 69 -VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY-----------NT 116 S+ + R ++I+ D + S +A+ Sbjct: 62 INSSGNTDMARTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDVHIRR 121 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 + + I ++ D + + Sbjct: 122 NAEAGNSVGLIFANLFGDSYSDLTVMARAEAVPA 155 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 62/226 (27%), Gaps = 30/226 (13%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E + + +++D SGSM ++ V+ + + ++L + + Sbjct: 24 AEKTETEAKAPANVAVLLDASGSMAKRIDG----VSKFNSAKKEISKFASSLPEGTQVKM 79 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + K + V + + ++ T A+ Sbjct: 80 SVFGSEGNNKNSGKVQSCEAIRNVYGFQSFNE-----QSFLNSLNTIGPTGWTPIAKALN 134 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A +K + LTDGE N IK + ++ Sbjct: 135 EAKSSFDQ-----------------LDAKGEKVVYLLTDGEETCG---GNPIKTAKELQK 174 Query: 360 NFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + I + +L ++ + +F Sbjct: 175 DNITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPAYTQKDVEKIFTQ 220 >gi|261855659|ref|YP_003262942.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836128|gb|ACX95895.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 625 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 40/114 (35%), Gaps = 8/114 (7%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN--------NNFKSNVNTIKI 353 + + S++ + + + I+ L+DG N K+ Sbjct: 107 AKKIRSNEPFTDIPAALAAAAATWNDGTDRNILLLSDGMVDISPEKAINVRAQEELLQKL 166 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + I++ TI+++ + + L + + +AD+L F I + Sbjct: 167 VPQLRAEHIRVHTIALSKDADSKLLSQIAADTGGIFVEADSADALQRAFLKIFE 220 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 76/238 (31%), Gaps = 49/238 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP I +V+D SGSM K+ K A + L + Sbjct: 147 ERPPVNIAIVLDRSGSM--------------------SGQKLEKAKQAAIEALRRLGQKD 186 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +I Y V+ + + V +R T + Q Sbjct: 187 ------MFSVIVYDHNVKTIVP-AQSARNVEWIESRIRGIGPGGNTALFGGVSQGAS--- 236 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--ICDKAKENFIKI 364 + K S + II L+DG N S+ + KE+ I + Sbjct: 237 -------------EVRKNLSNKYVHRIILLSDGLANVGPSSPEDLGRLGAALIKES-ISV 282 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQ--LMVHRKYSVILK 419 TI + N + + +S Y V ++ L +F + +V +K SVI++ Sbjct: 283 TTIGVGTDYNEDLMARLSQNSDGNTYFVESSRDLPKIFAAELGDVLNVVAKKVSVIIQ 340 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 72/210 (34%), Gaps = 35/210 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 ++ + A FL I ++ ++G++ ++++ E ++ Sbjct: 315 VSGSMTSYDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIVHELIQINSDIE 369 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + + + T +K A+Q+ + + ++ T I+ L Sbjct: 370 RNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE----------------ILLL 413 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DGE + D+ K++ + I++ S + + + + H + Sbjct: 414 SDGE------DSTAKDCIDEVKDSGSIVHFIALGPSADLA-VTNMSILTGGNHKLATDEA 466 Query: 397 S---LIHVFQNISQL---MVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 467 QNNGLIDAFGALASENADITQKSLQLESKG 496 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 51/168 (30%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + +D L + + L+ Y + V K + T + Sbjct: 226 FLSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 285 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + G + K I +TDG + ++ Sbjct: 286 AIKTAMDQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 330 Query: 357 AKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + + + LK S P ++ + V + Sbjct: 331 ARASGIEIYAVGV----DRAEMRSLKQMASQPLDDHVFYVETYGVIEK 374 >gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 698 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 66/236 (27%), Gaps = 31/236 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + IP + ++ DL + + + Sbjct: 266 YRLDDTTPARIELIPYKADRNATGTMMLVVTPAADLQ-----PITEGTDWTFVLDVSGSM 320 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+A L + + L ++ + + TR + E V Q++ R Sbjct: 321 DGHKIATLADGVSQTLGKLNSNDRFRIITFNQSAADLTRGFV----TATPEAVGQWINRV 376 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ ++ A + L D+ S I+ +TDG N Sbjct: 377 KTIAAGGSTNLFAGLETACRRLDDDRTTS--------------------IVLVTDGVANV 416 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + E +++ T I S N L + S + + + D + Sbjct: 417 GR--TEQREFLQLLTEYDVRLFTFVIGNSANRPLLDRLAKDSGGFAMQISDVDDIQ 470 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 26/192 (13%) Query: 221 QDKKRTKMAALKNALLLF-LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K + F ++ ++ +G++ Y+ R K Sbjct: 676 GSGSVNAANFVK--VKQFAVNVVNTFDVSLTATRVGVVQYSDRNTLVFNLGNKVNKPSTV 733 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T++ A++ RQ K II LTDG Sbjct: 734 SAINNIVYQSGGTNTGAALQY-----------------VRQYAAWRGGNVPKVIIVLTDG 776 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++++ S + +++ I + S + +LL+ + + N ++L Sbjct: 777 KSSDSVSGPS-----QNLVAAGVEVYAIGVG-SFDHGQLLQIANNKQNNVIELNNFNALA 830 Query: 400 HVFQNISQLMVH 411 IS + Sbjct: 831 TKIDMISTNVCS 842 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 73/223 (32%), Gaps = 46/223 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N P+ ++ + LV+D SGSM Sbjct: 497 NSPPSNLNDAQIEPSFDLVQASTGDECRVVLVLDTSGSM-------------------GT 537 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 ++ + +A F++ +D + +G++ +T + T+ R + Sbjct: 538 SNRIDKVNSAATAFVNLVDDG------ISIGIVTFTGSPTTRHALTQINTQADRDSLRDI 591 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T ++Q ++L + S I+ +TDG+++ Sbjct: 592 FQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG---------------IIVLMTDGQDSG 636 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 ++++ I ++ +++ T++I G+ L + Sbjct: 637 IQNHI----IRQTLQDMGVRVNTVAIGEDAYGELSL-IAQETG 674 >gi|291388471|ref|XP_002710799.1| PREDICTED: collagen, type XIV, alpha 1 [Oryctolagus cuniculus] Length = 1796 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 53/184 (28%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + ++ +T + + K S Sbjct: 1046 DNFNKIINFLYSTVGALDKIGTDGTQVAMVQFTDDPRTEFKLNAYETKETLLDAIKRISY 1105 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K L + + K I+ +TDG + + + Sbjct: 1106 KGGNTKTGKAIKHVRDTLFTAESG-------------TRRGIPKVIVVITDGRSQDDVN- 1151 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 KI + + + I + + + + + S + + V + F+ I Sbjct: 1152 ----KISREMQSDGYNIFAVGVADADYSELVNIGSKPSARHVFFVDDF----DAFKKIED 1203 Query: 408 LMVH 411 ++ Sbjct: 1204 ELIT 1207 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 55/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFNVGSEKTRVGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVPKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|84685162|ref|ZP_01013061.1| hypothetical protein 1099457000257_RB2654_09854 [Maritimibacter alkaliphilus HTCC2654] gi|84666894|gb|EAQ13365.1| hypothetical protein RB2654_09854 [Rhodobacterales bacterium HTCC2654] Length = 496 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 11/125 (8%), Positives = 34/125 (27%), Gaps = 7/125 (5%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + SR + E ++FAL + ++ ++ M+S + L Sbjct: 2 MRSRLASH----WRDEGGAVLVVFALMLAILSGIVALSYDFGRAAATQSEMQSFADNVAL 57 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A A ++ + + + + + + + Sbjct: 58 AAAGELDGGAD---ALTRAQAAAEQLIADSQTYGEGPGALGAEDFVLTFYAVRPDASGEI 114 Query: 123 NSSRI 127 +++ Sbjct: 115 DATTT 119 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 44/356 (12%), Positives = 90/356 (25%), Gaps = 57/356 (16%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A L + I +R + A + L + T + Sbjct: 125 AFLQAELLTTGAVLPTQTAGLIIRPGPQRPVAVIAADRAEGEAVVQLLLDQGMVDITLLN 184 Query: 120 NIVNSSRI-------SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE---HLLNQRY 169 + + S+ + L+ ++ Y L + Y Sbjct: 185 PAQTPTYLRDWAPYGSVVLVDTPALEIEPYAQRMLQRMVNEYGLGLLIMGGPQSFGPGGY 244 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + L E P + V+D SGSM ++ Sbjct: 245 FETPLEELSPLSARVPREAPEVTMVFVLDRSGSMQQ---------------AVGDSNRLG 289 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLI 288 KNA ++ L + +G+I + T + + + + R Sbjct: 290 VAKNA------TLSALELLNPQSQIGVIVFDTEETTVVPLSTLDIPAAQIALDRVD---T 340 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T P + AY+ L S K II +TDG + Sbjct: 341 GGGTAIYPGLVAAYRELQ------------------RSESPAKHIIVMTDGLSQPGDW-- 380 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I + + + ++I + + + ++L + Sbjct: 381 --EGILRQITADGTTVSAVAIGVGADTGAAENIARLGNGVAHISRDFEALPSILAQ 434 >gi|308081588|ref|NP_001183957.1| vitrin isoform 2 [Mus musculus] gi|26342052|dbj|BAC34688.1| unnamed protein product [Mus musculus] Length = 628 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 452 GSSSMGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLEFGFDKYNSKADILSA 511 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 512 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 555 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V + D+L Sbjct: 556 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 608 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 609 KIAPRIIQNICTE 621 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 38/146 (26%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 264 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 321 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 322 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 363 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + Sbjct: 364 SRVARESGINVFFITVEGAAERDIQH 389 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 63/183 (34%), Gaps = 25/183 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 KN ++ +D L+ Y+ + + + + Sbjct: 176 KNFVMTVMDQFKGTD-----TQFSLMQYSDDFKTHFTFNNFKNDPTSKNLVGPIEQLNGK 230 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + +++ + L + +K +I +TDG+ N Sbjct: 231 THTASGIRKVVRELFQEWNG-------------ARKDAKKILIVITDGQIQGDSLN--YR 275 Query: 352 KICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 + +A++ + I + +P+ ++ L+T S P E+ + V N D+L ++ + Sbjct: 276 DVIPEAEKEGVIRYAIGVGYAFNTPSARQELRTIASQPAQEHVFQVNNFDALKNIQNQLQ 335 Query: 407 QLM 409 + + Sbjct: 336 EKI 338 >gi|120601101|ref|YP_965501.1| hypothetical protein Dvul_0050 [Desulfovibrio vulgaris DP4] gi|120561330|gb|ABM27074.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 389 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 12/154 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 K+ + E+ N ++I ALS + L L + + + K++ M+ A + A LAGA + Sbjct: 2 SLLKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQAL 61 Query: 69 -VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY-----------NT 116 S+ + R ++I+ D + S +A+ Sbjct: 62 INSSGNTDMARTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDVHIRR 121 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 + + I ++ D + + Sbjct: 122 NAEAGNSVGLIFANLFGDSYSDLTVMARAEAVPA 155 >gi|156350148|ref|XP_001622163.1| hypothetical protein NEMVEDRAFT_v1g221080 [Nematostella vectensis] gi|156208611|gb|EDO30063.1| predicted protein [Nematostella vectensis] Length = 2040 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 62/191 (32%), Gaps = 29/191 (15%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L D + + ++ G++ Y T+ + N+ + D + Sbjct: 221 FRKLLQFVKDVMSSFPLSESGIHTGVVSYGTKAKTNLGFDVHFSQANFNSAIDGINFPGG 280 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 TD+ +++ + L K ++ +T G++ S Sbjct: 281 ATDTGNGIRKVMKSLFKTSK---------------RRSIPHILVLITAGKSTGDPSLN-- 323 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 ++ + + +++ + + + L SSP Y L F ++S L+ Sbjct: 324 ---AEELRASGVRLFCVGVG-GAYDRTQLDAIASSPSTTY------VLTAGFDDLSGLVP 373 Query: 411 HR--KYSVILK 419 + + + + Sbjct: 374 TLVSRITKVAQ 384 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 18/189 (9%), Positives = 55/189 (29%), Gaps = 22/189 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-- 286 K AL F ++GL+ + + K + + +D+ Sbjct: 425 ENFKQALSFFKTIYQAFWTPFGSTHVGLVVFGADSTMVFDF-DNKLKDKPAIDAAIDASV 483 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A++ ++ + + ++ ++ L + Sbjct: 484 FPSGQTNLGAALQ-----------QTKDLSLGYLFGSKHNDGHRRTLVVLA-----GSTA 527 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 + + + N + + + + + + L S+P V + + L + Q Sbjct: 528 EDDVFQGASDLRMNGVTVYCVGAGPNYDSAQ-LDGIASTPADSNVLTVASYNDLPGISQT 586 Query: 405 ISQLMVHRK 413 + + + Sbjct: 587 LINKIEEGR 595 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 58/170 (34%), Gaps = 18/170 (10%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ + +I Y+T E + + ++++ + + + T +++A + + Sbjct: 106 TNPNLRISIITYSTEAEVILPLTSDSKEINKSLLVLKSIVPQGLTHMQKGLRKANEQIRK 165 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 II LTDG K ++T++ KA+ + T+ Sbjct: 166 STLG--------------GRIVNSVIIALTDGLLL-LKPYLDTMEEAKKARRMGAIVYTV 210 Query: 368 SINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRKYSV 416 + ++ L P+ + V +L V ++ SV Sbjct: 211 GVFM--YSKQQLVNIAGDPDRCFGVDEGFSALEGVVDPLTSKSCTEILSV 258 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 61/196 (31%), Gaps = 30/196 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVK----EDVYMGLIGYTT--RVEKNIEPSWGT 273 + ++ ALK AL + L+ + LI +++ + E + T Sbjct: 330 SGSMRGARIDALKTALRGLSGADTTLTGRYATFANRERVTLIPFSSAPGAPRTTELTPAT 389 Query: 274 --EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++Q + T+ A++ AY+ + + + Sbjct: 390 RGAALKQLRAQVDALTPDGGTNIYGALQAAYEQARAAPAGRYTS---------------- 433 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 I+ +TDGE S + +K T+ + + + + Sbjct: 434 -IVLMTDGERTEGPSADQFRATYAALPERARQVKTFTVLFG-DSDATEMNRIATLTGGRT 491 Query: 390 YNVVNADSLIHVFQNI 405 ++ N L F++I Sbjct: 492 FDGQN--DLRAAFKDI 505 >gi|18042139|gb|AAL57848.1|AF454755_1 vitrin [Mus musculus] Length = 650 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 474 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLQFGFDKYNSKADILSA 533 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 534 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 577 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V + D+L Sbjct: 578 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 630 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 631 KIAPRIIQNICTE 643 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 40/146 (27%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 286 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 343 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 344 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 385 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + ++ Sbjct: 386 SRVARESGINVFFITVEGAAEREKQH 411 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 56/204 (27%), Gaps = 35/204 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEPS 270 D + + + KM K A+ F + ++V + G + E Sbjct: 139 DASGSMAARVSGGEKMQVAKEAVRSFTSQMPEGTNVSLIVYGHKGSNSKADQAESCKGIE 198 Query: 271 -------WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + ++ + T AM QA Q L + Sbjct: 199 EIVELGPYNESTLQSKLDPI---RATGWTPLAGAMNQAGQRLKETEG------------- 242 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK- 380 + I ++DG +K ++ I + I + + LK Sbjct: 243 ----QAENVIYVVSDGLETCGGD---PVKEAKSLNQSNIKATVNIIGFDVGNKEHQALKK 295 Query: 381 TCVSSPEYHYNVVNADSLIHVFQN 404 + +++ + L F+N Sbjct: 296 VAEAGGGKYFSATSKTELDLYFRN 319 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/260 (10%), Positives = 61/260 (23%), Gaps = 41/260 (15%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + R Q + L ++ E ++E ++ P Sbjct: 240 VIFVLDTSGSMAGTRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVT----EWDLRKPYK 295 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + E Sbjct: 296 GPDHYYFNSPP-----------EETTEDATAVPQNNESEVKFGPYDDILAYPVT-----E 339 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V++ + T+ A+ A + + I Sbjct: 340 QSVKRAKEFVAAMDVTSSTNINDALLLALK---------------NSQSVQSRVRLTPII 384 Query: 334 IFLTDGENNNFKSNV--NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-- 389 IFLTDGE ++ + ++ + I ++ + Q L K + + Sbjct: 385 IFLTDGEPTASVTDTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQNRGFARK 444 Query: 390 -YNVVNAD-SLIHVFQNISQ 407 Y +A L F ++ Sbjct: 445 IYEAADATLQLKGFFDEVAS 464 >gi|257093736|ref|YP_003167377.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046260|gb|ACV35448.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 452 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 47/182 (25%), Gaps = 24/182 (13%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ L+ + RV+ + P + T+ + Sbjct: 75 ADQLEPTDIASLVVFDDRVQTLVPPRP-VGDRQALHLALSRVHSGGSTNLHGGWQAGADG 133 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFI 362 L ++ +I L+DG N +C +A E + Sbjct: 134 LLPAAGQAALA----------------RVILLSDGNANVGEITDPAGIAALCAQAAERGV 177 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV-----HRKYSVI 417 T + + N +++ HY A L F + H + S+ Sbjct: 178 STSTYGLGSHFNEDLMVEMAKRGGGNHYYGDTAADLFEPFAAEFDFISALCARHVRLSLA 237 Query: 418 LK 419 Sbjct: 238 AA 239 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/251 (12%), Positives = 80/251 (31%), Gaps = 45/251 (17%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ K++ I P + ++D+SGSM K Sbjct: 42 NHDTKLMHVAIKGYDIAPATAPHANLVFLIDVSGSMDEPD-------------------K 82 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L ++ + + ++ Y +EP+ EK + ++ Sbjct: 83 LPLLKSAFRLLVNRLKADD------TVSIVTYAGNAGTVLEPTRVAEKSK-ILSAIDKLE 135 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY + + ++ TDG+ N S Sbjct: 136 AGGSTGGAEGIEAAYDLAKKAFVKDGVN----------------RVMLATDGDFNVGPSS 179 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +I ++ +++ I + + L++T + + D+L + + Sbjct: 180 DEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLV 237 Query: 407 QLMVHRKYSVI 417 + + + Sbjct: 238 EEAGSTLFPIA 248 >gi|182413803|ref|YP_001818869.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841017|gb|ACB75269.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 792 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/249 (10%), Positives = 75/249 (30%), Gaps = 45/249 (18%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++V + R + ++D+SGSM + K+ Sbjct: 397 QHRLVRIGLKAKDAAVSGRAAANLVFLLDVSGSM-------------------DQPNKLR 437 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L L + + ++ Y + + + R+ + + Sbjct: 438 LVQESMRLLLGRLQPED------RVAIVTYAGNSGLALPSTP-VARQREILDAIDELRAG 490 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNV 348 T+ ++ AY I ++ + +I TDG+ N S Sbjct: 491 GSTNGAMGLQLAYDIAKANFVANGVN----------------RVILCTDGDFNVGVTSEG 534 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +++ ++ ++ + + + +L+ Y ++ + + Q Sbjct: 535 ELVRLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSYGYIDTR--REAEKLLVQQ 592 Query: 409 MVHRKYSVI 417 + +V Sbjct: 593 VSGTLLTVA 601 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|320105085|ref|YP_004180676.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752367|gb|ADV64127.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 323 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 63/205 (30%), Gaps = 30/205 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + T+ A+ FLD + GL + + + Sbjct: 109 DVSGSMMSPFGDGTRYDMSMKAIDKFLD-------SRRGDAFGLTFFGNNYLHWVPLTSD 161 Query: 273 TEKVRQYVTRDMDSLIL---KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 ++ + T+ A+ + L ++ Sbjct: 162 VSAIKCAPPFMKPEVAPLWMSGTEIGKALLGCRRTLVERQEGD----------------- 204 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + II ++DG + + + ++ K + I + I I+ + ++ C + Sbjct: 205 -RAIILISDGASFDLGGGND-EEVARLLKRDGIVVYAIHIDETEIPDPIVTICSITGGDA 262 Query: 390 YNVVNADSLIHVFQNISQLMVHRKY 414 + + +L +F+ I M + Sbjct: 263 FAPDDPSALEAIFKRI-DQMTPTRL 286 >gi|3273293|dbj|BAA31190.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|189524674|ref|XP_684145.3| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 607 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 60/176 (34%), Gaps = 19/176 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F+D + + + M I ++T +E + +++R + L T Sbjct: 55 NEIYYFVDHLAHK-FISPQLRMSFIVFSTEGRILMELTEDRDQIRAGLEELQRVLPGGDT 113 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++A Q + II LTDGE + ++ + Sbjct: 114 FMHKGFQKASQQIYYGTG--------------DGYRTASVIIALTDGELRENEFDLAARE 159 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQ 407 ++ + + + + L T S ++ + V + ++L V +I + Sbjct: 160 AGRS-RQLGASVYCVGVKD--FNETQLATIADSKDHVFPVNDGFEALQGVIDSILK 212 >gi|3273269|dbj|BAA31178.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273273|dbj|BAA31180.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273295|dbj|BAA31191.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|311262926|ref|XP_003129419.1| PREDICTED: anthrax toxin receptor 2-like [Sus scrofa] Length = 241 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 54/185 (29%), Gaps = 23/185 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V + + I ++++ + + K+ + + + T Sbjct: 62 IYNFVHQLTER-FVSPQMRLSFIVFSSQATIILPLTGDRGKISEGLDNLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIE----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 ++ ++ + + Q L+ S E + V + I R+ Sbjct: 164 KISRSLGARVYCVGVLD--FEQAQLERIADSKEQVFPVTGGFQ---ALKGIINSSSSRER 218 Query: 415 SVILK 419 + + Sbjct: 219 GLAFQ 223 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/188 (10%), Positives = 55/188 (29%), Gaps = 12/188 (6%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + I ++++ FL + + + K +++A ++A LA A+ + ++ + Sbjct: 10 KKQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQA 69 Query: 76 GDRFESISNHAKRALI------DDAKRFIKNHIKESLSGYSAVFYNTEIQNI---VNSSR 126 E S A + + + ++ ++ VN S Sbjct: 70 AVSAEVTSTLNAMAGASGNHEFNFSTAVVSVEYSNDPQSFAGTTTFGADDDVYVRVNVSS 129 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL---NQRYNQKIVSFIPALLRI 183 + M + + + +S + I + + ++ Sbjct: 130 LDMDEFFIQMFGLAKDVSATAVAGPSSGLQVVNNIVPIGVCIGDGTSDNDINEDGYDDAT 189 Query: 184 EMGERPIF 191 +F Sbjct: 190 GEAITNVF 197 >gi|304441881|gb|ADM34176.1| martilin [Aplysia californica] Length = 348 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 43/113 (38%), Gaps = 7/113 (6%) Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 TS + + + + ++ + +TDG N + + T + +A+E+ I + Sbjct: 61 YTSTGEGIEYMATAQLASQFTRSWAERVGLVITDG---NSQESAKTKEAARQARESGITM 117 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL--MVHRKYS 415 I + + + L + V + D L ++ Q ++ + K + Sbjct: 118 FAIGVGNVKDQE--LVNIAGDASRVFKVDSYDELENIKQTLAHQTCIRQLKTT 168 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 69/239 (28%), Gaps = 63/239 (26%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I +V+D+S SM + + + K + Sbjct: 87 SISKKKMTYLSNGADIVIVLDISPSM--------------GAIEFSSKNRFEFAKELIKY 132 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F ++ +E+ +GL+ + + + + + + + + Sbjct: 133 F-------AYQRENDNIGLVAFAKEASLIVPLTIDRDFFSRKLDDIYIMDLGNGSALGLG 185 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L S ++ +I LTDG N+ + ++ + A Sbjct: 186 VSIALSHL------------------KHSEAPKRSVIVLTDGVVNSDEV--YKDQVINLA 225 Query: 358 KENFIKIVTISINASP----------------------NGQRLLKTCVSSPEYHYNVVN 394 + +KI +I I + + LL+ + Y+V + Sbjct: 226 QGLNVKIYSIGIGSGEELSVGFKLRSGKFYQGTLREVYDPSMLLEIASKTGGLFYSVSD 284 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|209527269|ref|ZP_03275780.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492336|gb|EDZ92680.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 414 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 45/160 (28%), Gaps = 25/160 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K A +D +++ + ++ + R + + + + Sbjct: 59 LETVKQAAKELIDRLNVGD------RISVVAFDHRAKVLVP-NQDLTDPDGIKKKIDGLR 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K + L K+ F LTDGEN + + Sbjct: 112 CSGGTAIDEGIKLGIEELGKGKQDRISQGF-----------------LLTDGENEHGDNK 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A E + I ++ N L K + Sbjct: 155 RCLK-LAKLATEYKLTINSLGFGDDWNQDILEKIADAGGG 193 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 63/240 (26%), Gaps = 45/240 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++IV + GER + +VD+SGSM Sbjct: 229 SANAGAGGRQIVHIGLQGYELPAGERRPLNLTFMVDVSGSMQ------------------ 270 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K+ + + L +D + + + Y + V + P+ G+EK++ Sbjct: 271 -SPDKLGLAQQTMNLIIDRLRPED------RVAVTYYASDVGTAVGPTPGSEKLK-LRCA 322 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T M AY+ + S I+ TDG+ N Sbjct: 323 VAALNAGGSTAGAQGMVNAYEQAEAAF----------------SPDKVNRILMFTDGDFN 366 Query: 343 NF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + I + ++T + D L Sbjct: 367 VGVTDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDARMQTIAQAGNGV--AAYVDDLDEA 424 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 254 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 293 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 294 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 349 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 350 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 398 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 399 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 458 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 459 VEEVTQNNFRLLFKG 473 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 72/254 (28%), Gaps = 46/254 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +V IP + RP + LV+D SGSM Sbjct: 20 PTTLDVLVKIIPPVPETRPQ-RPPLNLGLVIDRSGSMQ--------------------GA 58 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM + A ++ + + + + RVE + + +K + Sbjct: 59 KMEVARQAACFAVEQLLPSD------RLSVTIFDDRVECPVPSTLVRDKA-TIIRTIQGI 111 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + Q + + +I L+DG N ++ Sbjct: 112 HSRGSTALHDG----------------WVQGGIQVSQHLNPAHLNRVILLSDGLANVGET 155 Query: 347 NVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-N 404 N + I + + T+ I L S Y++ + L +FQ Sbjct: 156 NPDAIAQHVHGLAQRGVSTSTMGIGEDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQAE 215 Query: 405 ISQLMVHRKYSVIL 418 + LM +V L Sbjct: 216 LQGLMGTLGQTVSL 229 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 45/122 (36%), Gaps = 21/122 (17%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T++ A+++A Q+L K I+ +TDG+ + Sbjct: 24 GKTNTGGALERAQQMLAEG-----------------RPSVPKIILLITDGDATDK---ER 63 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +K K++ I I TI + + L + + ++ Y + DS+ + ++ + Sbjct: 64 LDAQIEKLKKSNILIYTIGVGDLIDRNELNRI-ATDEDFVYETRDFDSISKIKSSLLGRV 122 Query: 410 VH 411 Sbjct: 123 CK 124 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 24/165 (14%) Query: 235 LLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 F+ ++ DV +G++ Y+T V + G + + S+ D Sbjct: 159 AKNFVANVSSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGD 218 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A + ++++ + K +IF+TDG + Sbjct: 219 THTG--TALRYISTNNR--------------WREEVPKILIFVTDGTP---QDRAIVPAA 259 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 ++ ++I I + + LK S P + + AD Sbjct: 260 ARSLRDKGVRIFAIGVG--NAVESELKEIASEPYENHAIFIQGAD 302 >gi|283787709|ref|YP_003367574.1| tight adherence protein TadG [Citrobacter rodentium ICC168] gi|282951163|emb|CBG90854.1| putative tight adherence protein TadG [Citrobacter rodentium ICC168] Length = 647 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/233 (8%), Positives = 75/233 (32%), Gaps = 24/233 (10%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS---KMVSN 71 ++SE +I++A+ + L ++ + +K + A++ AILA ++ ++V + Sbjct: 19 LSSENGAITIVYAIIFPALLAMVALALDGALMINRKARLADASSEAILAISAVDNRLVDS 78 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ ++ + + A++ + + + + + H Sbjct: 79 VAIDNNKQIAKDFVNYYLPNNQAEQLKVVVTSFDRTIEKGYIDYKIAISATLPTLLPLGH 138 Query: 132 MA------------NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS---- 175 + + + + Y + + F + + + +Q V+ Sbjct: 139 LGFSAFDRSVTVGNFDNNSGNARKFVTVISDPADYVFVVDFSDSMNSSYIDQGRVTTRLA 198 Query: 176 ----FIPALLRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + ++ ++ + + + ++N+ N + Sbjct: 199 MLKQVVREVISGNKNPDSQFAIVPFDIGVPFRIKDSVNTTSSYANKENEGGGE 251 >gi|260818212|ref|XP_002604277.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] gi|229289603|gb|EEN60288.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] Length = 1119 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/246 (9%), Positives = 72/246 (29%), Gaps = 28/246 (11%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ + R I + + C + +D V Sbjct: 898 NNNQDRVFELADFNAIANIMNRIIQAACINIVFPTVEPCDVTTDLFFVLDGSGSVGLYN- 956 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 +K ++ + + + + D +G++ Y++ + +V Sbjct: 957 -FNTVKQFVVTLVSAFTIGLNDVNDTRVGVLQYSSSNTLGCNL-GDHPDLSSFVNAMNAM 1014 Query: 287 LI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + A++ A QI P + ++ +TDG ++ Sbjct: 1015 RYHYGPSTQTGAALQAAGQI-----------------AAWRPAPVPRIMVVVTDGMAHDS 1057 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + D+ + + I + + LL+ ++ + + + +++ + Sbjct: 1058 VVAPSQGLAADQ-----VNVFAIGVGNYVRSE-LLQIANNNQARVFELADFNAIRDNIND 1111 Query: 405 ISQLMV 410 I+ + Sbjct: 1112 IALAVC 1117 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 57/181 (31%), Gaps = 27/181 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS- 286 +K + + + D +G++ Y+ +V Sbjct: 768 FDIVKE---FVVAVVSGFTISLTDTRVGVLQYSDGSTLECNL-GDHPDWSSFVNSMNTMA 823 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A++ A L + + P+ + +I LTDG++ + Sbjct: 824 RQGGGTSTGAALEFA--RLIAAWR--------------PAPVVPRIMIVLTDGDSEDSVV 867 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + I + S N LL+ ++ + + + + +++ ++ I Sbjct: 868 TP-----AQALATEQVTVFAIGVG-SFNRSELLQITNNNQDRVFELADFNAIANIMNRII 921 Query: 407 Q 407 Q Sbjct: 922 Q 922 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 26/247 (10%), Positives = 54/247 (21%), Gaps = 16/247 (6%) Query: 55 SANNAAILAGASKMV----------SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 +A A L A + ++ A + A + E Sbjct: 833 AALEFARLIAAWRPAPVVPRIMIVLTDGDSEDSVVTPAQALATEQVTVFAIGVGSFNRSE 892 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L + N+ M + + DV T + L + Sbjct: 893 LLQITNNNQDRVFELADFNAIANIMNRIIQAACINIVFPTVEPCDVTTDLFFVLDGSGSV 952 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFL-----IELVVDLSGSMHCAMNSDPEDVNSAPI 219 +N + + +G + ++ + + + D +A Sbjct: 953 GLYNFNTVKQFVVTLVSAFTIGLNDVNDTRVGVLQYSSSNTLGCNLGDHPDLSSFVNAMN 1012 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 AL I M ++ + + PS G + Sbjct: 1013 AMRYHYGPSTQTGAALQA-AGQIAAWRPAPVPRIMVVVTDGMAHDSVVAPSQGLAADQVN 1071 Query: 280 VTRDMDS 286 V Sbjct: 1072 VFAIGVG 1078 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 50/158 (31%), Gaps = 3/158 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +GI ++ +I ++++ + + F I + + ++++ +AA LA A + + Sbjct: 8 RGIRKQRGLVVVIVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 SRLGDRFESISNHAK---RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + S K A + + N + + + + + Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVFPDSGYSSSPDGDRY 127 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 + N+LD + M V Sbjct: 128 VRVVINQLDLESFFFARVMGVTKRLTASAVAGPSPGGN 165 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 61/191 (31%), Gaps = 31/191 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDV----YMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + + A K + F+++++ S E +GL+ ++ N S V Q Sbjct: 59 GSALEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNSMLSP---VVEQAA 115 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ A++ A ++L K + +TDG+ Sbjct: 116 RAAEGLTAGGKSNQAEAIRAAVKLLDMKTPGE------------------KMLFLITDGQ 157 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP--EYHYNVVNADS 397 T +A++ + + I I A + L++ S P + + Sbjct: 158 T---PFRSQTDSAAAEARQAGVTVYCIGIAAPDGVNREALRSWASGPSDSHIIEIRELGE 214 Query: 398 LIHVFQNISQL 408 F+ + + Sbjct: 215 AQTAFERLMKN 225 >gi|288928459|ref|ZP_06422306.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331293|gb|EFC69877.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] Length = 554 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 72/228 (31%), Gaps = 57/228 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D+ +GL+ + + + Sbjct: 97 DISNSMMAQDVVPSRLEKSKLLIENLVDNFTH-------DRIGLVVFAGDAFVQLPITTD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ + + + K Sbjct: 150 YVSAKMFLQNIDPALIATQGTDIAKAINLSMRSFSQ------------------QKDIGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ A E I + + I + +GQ Sbjct: 192 AIIVITDGEDHEGGA----LEAAKAANERGIHVFILGIGSTKGSPIPTSEGGYLTDRSGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 +L + + + +V N + ++ + + + Sbjct: 248 TVLTALNESMCKQIAQAGNGTYIHVDNT---NDAQEKLNNELAKLQRA 292 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 334 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 393 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 394 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 430 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 431 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 484 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 38/117 (32%), Gaps = 6/117 (5%) Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A L + + + K I+ ++DGE +K+ Sbjct: 109 ADAITTAATELKPEGDTPIAYSLEKAAGDFTEAKGPKTILLVSDGEETCGGD---PVKVA 165 Query: 355 DKAKENFI--KIVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 +K + ++ I ++L + + +Y+ + +L + S+ Sbjct: 166 EKIASQGVDLRVHVIGFQVDDATRKQLTEIAKAGKGSYYDAQDGPALASRLKRASES 222 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|50355939|ref|NP_598499.1| anthrax toxin receptor 2 precursor [Mus musculus] gi|68052321|sp|Q6DFX2|ANTR2_MOUSE RecName: Full=Anthrax toxin receptor 2; Flags: Precursor gi|49901393|gb|AAH76595.1| Anthrax toxin receptor 2 [Mus musculus] gi|74141652|dbj|BAE38584.1| unnamed protein product [Mus musculus] gi|74217872|dbj|BAE41940.1| unnamed protein product [Mus musculus] Length = 487 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVHQLTER-FVSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + + L II LTDG + Sbjct: 121 HEGLKLANEQIQNA----------------GGLKASSIIIALTDG-KLDGLVPSYAENEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 K++ + + + Q L+ S + + V +L + +I + Sbjct: 164 KKSRSLGASVYCVGVLD--FEQAQLERIADSKDQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + S + G Sbjct: 222 LELSPSSVCVG 232 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 52/194 (26%), Gaps = 32/194 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQY 279 K+ + AL+ LD + LI +++ + + + Y Sbjct: 288 GRKIKQTREALIKILDDLSPHD------QFDLISFSSEATTWKPLLVPASTENVNEAKSY 341 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T T+ AM A Q+L + II LTDG Sbjct: 342 ATGI---QAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT-----------LIILLTDG 387 Query: 340 ENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVV 393 + + + + + L K + + + + Sbjct: 388 DPTVGETNPLNIQKNVRKAINGQH-SLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSD 446 Query: 394 NADSLIHVFQNISQ 407 +A L +Q ++ Sbjct: 447 SALQLQDFYQEVAN 460 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + +SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKDLNSLIDAFSRISST 479 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 52/194 (26%), Gaps = 32/194 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-----SWGTEKVRQY 279 K+ + AL+ LD + LI +++ + + + Y Sbjct: 288 GRKIKQTREALIKILDDLSPHD------QFDLISFSSEATTWKPLLVPASTENVNEAKSY 341 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T T+ AM A Q+L + II LTDG Sbjct: 342 ATGI---QAQGGTNINDAMLMAVQLLEKANQEELLPEGSIT-----------LIILLTDG 387 Query: 340 ENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVV 393 + + + + + L K + + + + Sbjct: 388 DPTVGETNPLNIQKNVRKAINGQH-SLFCLGFGFDVSYAFLEKMALENGGLARRIYEDSD 446 Query: 394 NADSLIHVFQNISQ 407 +A L +Q ++ Sbjct: 447 SALQLQDFYQEVAN 460 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 68/222 (30%), Gaps = 33/222 (14%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + +V+D+SGSM D R+K+ K A + L Sbjct: 409 RKPANVLMVIDVSGSM-------------GAGVPDTGRSKLDLAKQAAINALPQFGPHDK 455 Query: 248 VKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V ++ + + + + TR T AYQ + Sbjct: 456 VGLWMFSTKRDGEKDHLELAPLDTVDAAQRKTLRTRLDGLTPDGGTGLYDTALAAYQHVR 515 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG---ENNNFKSNVNTIKICDKAKENFIK 363 +IFLTDG +NN+ + ++ E ++ Sbjct: 516 DRHSGEAINA----------------VIFLTDGKNEDNNSLSLENLLPDLRAESAEESVR 559 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + TI+ + L + ++ Y+ + S+ VF + Sbjct: 560 MFTIAYGQDADLGVLKQISETTNAAAYDSRESGSIDQVFTAV 601 >gi|159900556|ref|YP_001546803.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893595|gb|ABX06675.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 423 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 58/182 (31%), Gaps = 27/182 (14%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + ++ +K A ++ ++ + ++ + R E + + + R Sbjct: 60 RGERLYQVKEACNNVVNQLNRQDY------FSVVSFNDRAEVVVP--CQRPNDKDQIKRA 111 Query: 284 MDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T+ M Q ++ ++ LTDG Sbjct: 112 IGMIEAKGGTEMATGMMMGLQEISRPMMSRGI----------------SRMVLLTDGRTY 155 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +S ++I +A+ I I + I N L + + NA +++VF Sbjct: 156 GDES--RCVEIARRAQSKGIGITALGIGDEWNEDLLETIASAENSRTEYITNAQQIVNVF 213 Query: 403 QN 404 + Sbjct: 214 SD 215 >gi|3273277|dbj|BAA31182.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273281|dbj|BAA31184.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273283|dbj|BAA31185.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 71/255 (27%), Gaps = 56/255 (21%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R + K R + L T+ AM A Q+L Sbjct: 309 Q-FNLIVFSTEATQWRPSLVPASAENVNKARSFAAGI---QALGGTNINDAMLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIV 365 S + II LTDG+ ++N +I +A + Sbjct: 365 SSNQEERLPEGSVS-----------LIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNIS--------------- 406 + + L K + + H + +A L +Q ++ Sbjct: 414 CLGFGFDVSYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNA 473 Query: 407 -QLMVHRKYSVILKG 420 + + + ++ KG Sbjct: 474 VEEVTQNNFRLLFKG 488 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 74/254 (29%), Gaps = 47/254 (18%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + + + + ++ + +S IP + Sbjct: 138 SVRIHVTSRNVTRLLKMAVSAGFDW---VMYDKKGYARMSVIPDACDTN------LDVVF 188 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGS+ + + AL ++ +G Sbjct: 189 VLDQSGSIGYYNH-----------------------QLALNFLSKVVEFFKIGANKTQVG 225 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI Y+T + + K T + + QA IL + Sbjct: 226 LITYSTHAYVQFDLNDYHSKSTILNRISRIYYTGGWTATALGLFQAGVILNPQQ------ 279 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 +G + S + +I LTDG +N + ++ + I++ T+ + Sbjct: 280 ---MRGARPISQGVPRVVILLTDGRSNRVPID----EVAPSLHDFGIQVYTVGVGNIYLP 332 Query: 376 QRLLKTCVSSPEYH 389 + LK S P+ + Sbjct: 333 E--LKFIASDPDPY 344 >gi|118346119|ref|XP_977009.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288305|gb|EAR86293.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 685 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 63/182 (34%), Gaps = 12/182 (6%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-R 282 K+ +K L ++ + + LI + + + + T + + + + Sbjct: 101 SENKLQNVKQYLCNLIEKANT------NSQFALISFGSSQKLIFNFTQVTHENLESIKGQ 154 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T+ A++ A+ I+ D++ + + LTDG++N Sbjct: 155 INNIISTGDTNIIQALEVAHNIIKQDQQLENQKEEQTKKR-----IVRYSAFLLTDGQDN 209 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + K K+ I + + L + Y + +S+ VF Sbjct: 210 MKEKAIFKFRENFKNKDMDYSINCLGFGIDHDPLLLGAITSYTGGKFYYIKPEESVFSVF 269 Query: 403 QN 404 Q+ Sbjct: 270 QD 271 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/178 (8%), Positives = 46/178 (25%), Gaps = 22/178 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + A +D + L +++ E IE + Sbjct: 519 IDSARRAAQALVDRLAPADD------FSLTTFSSDAEVVIEDGPVGPRRAAIRRAIEGLR 572 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ + Y + + ++ ++DG + Sbjct: 573 EGGGTNIGAGLSLGY---------------AQASRPGIPEDAVRVVLLVSDGRATSGLTH 617 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + A + I+ + + +GQ + +Y + + + + Sbjct: 618 SERLAWLALDAFQRGIQTSALGLGDDFDGQLMSAIASDGAGGYYYLRHPEQIAPALST 675 >gi|3273285|dbj|BAA31186.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273297|dbj|BAA31192.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273289|dbj|BAA31188.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273259|dbj|BAA31173.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273255|dbj|BAA31171.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 23/45 (51%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 +K I EK ++ +IFAL + L +I + V + KKN M Sbjct: 5 IRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRMYE 49 >gi|281423275|ref|ZP_06254188.1| BatB protein [Prevotella oris F0302] gi|281402611|gb|EFB33442.1| BatB protein [Prevotella oris F0302] Length = 342 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 67/226 (29%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GL+ + + + Sbjct: 97 DISNSMLAEDVTPSRLEKSKLLIENLVDHFT-------NDKVGLVVFAGDAFVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A++ + K Sbjct: 150 YVSAKMFLQNIKPSLIATQGTDLARAIELSQNSFMQ------------------RDNIGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ A + + + I GQ Sbjct: 192 AIIVITDGEDHEGGA----LEAAKAAHKKGSNVFILGIGDPKGTPIPDGEGGYMKDNRGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 248 TVMSALNEQMCQQIAQAGSGTYIHVDNT---NDAQEKLNAELSKLQ 290 >gi|332253805|ref|XP_003276022.1| PREDICTED: matrilin-3 [Nomascus leucogenys] Length = 488 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 67/188 (35%), Gaps = 30/188 (15%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 92 SSRSVRPPEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 148 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ V R T S A++ A G + PS K I + Sbjct: 149 RQAVGRITPLST--GTMSGLAIQTAMD----------EAFTVEAGARDPSSNIPKVAIIV 196 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNV 392 TDG + + ++ +A+ + I++ + + + L LK S P E+ + V Sbjct: 197 TDGRPQDQVN-----EVAARARASGIELYAVGV----DRADLKSLKMMASEPLEEHVFYV 247 Query: 393 VNADSLIH 400 + Sbjct: 248 ETYGVIEK 255 >gi|326916308|ref|XP_003204450.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 902 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 57/185 (30%), Gaps = 22/185 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + L + + +E +G++ Y++ + + Sbjct: 153 NNFRYILDFMVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIPYK 212 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + ++ F K I +TDG+ + Sbjct: 213 GGNTMTGEAIDYLVKNTFTESAG-------------ARKGFPKVAIVITDGKAQD----- 254 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ ++ I A+ + LK S P + +NV N D ++ + I Sbjct: 255 EVEIPARELRNIGVEVFSLGIKAAD--AKELKLIASQPSLKHVFNVANFDGIVDIQNEII 312 Query: 407 QLMVH 411 + Sbjct: 313 LQVCS 317 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 73/240 (30%), Gaps = 31/240 (12%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 ++ PI ++E + + C+ D + + +K + Sbjct: 401 INVYAMKGLTPSEPITIMEKTQQVKVQVECSRGVDVKADVVFLVDGSYSIGIANFVK--V 458 Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTD 293 FL+ + V + L+ Y+ + +V + T+ Sbjct: 459 RAFLEVLVKSFEISPRKVQISLVQYSRDPHMEFSLNR-YNRVEDIIQAINTFPYRGGSTN 517 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + AM + + K S + +I +TDG KS+ + Sbjct: 518 TGKAMTYVREKVFVTSKGS-------------RPNVPRVMILITDG-----KSSDAFKEP 559 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 K ++ ++I + + + L+ S P + Y V + FQ IS + Sbjct: 560 AIKLRDADVEIFAVGVKD--AVRTELEAIASPPAETHVYTVEDF----DAFQRISFELTQ 613 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 17/158 (10%), Positives = 51/158 (32%), Gaps = 3/158 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +GI ++ +I +++++ + + F I + + ++++ +AA LA A + + Sbjct: 8 RGIRKQRGLVVVIVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 73 SRLGDRFESISNHAK---RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + S K A + + + + + + + + Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVFPDSGYSSSPDGDRY 127 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 + N+LD + + V Sbjct: 128 VRVVINQLDLESFFFARALGVTKRLTASAVAGPSPGGN 165 >gi|74196449|dbj|BAE34363.1| unnamed protein product [Mus musculus] Length = 650 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 474 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLEFGFDKYNSKADILSA 533 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 534 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 577 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V + D+L Sbjct: 578 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 630 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 631 KIAPRIIQNICTE 643 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 40/146 (27%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 286 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITKR--GGLSN 343 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 344 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 385 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + ++ Sbjct: 386 SRVARESGINVFFITVEGAAEREKQH 411 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 119/358 (33%), Gaps = 50/358 (13%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + ASK L+ E+ + A+ ++A + +K++ + + + ++ + Sbjct: 88 SIASKPKPLLTAEQAIPEASNRENYSAIDENAFKRVKHNPLSTFAIDVDTASYSNLRRFL 147 Query: 123 NS-SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 N+ + L + N + + + + + ++V Sbjct: 148 NNGQLPPTNAIRIEELINYFNYDYPEPESDRPFSITTEISQ--APWNPTHQLVHIGIQGE 205 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ + + P + ++D+SGSM+ K+ LK+A + ++ Sbjct: 206 KMAIEDLPPSNLVFLLDVSGSMN-------------------TPNKLPLLKDAFRMLVNE 246 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + V ++ Y + P+ G EK + +T + T +K A Sbjct: 247 LREEDQVS------IVVYAGAAGVVLPPTPGNEKDK-ILTAIENLNAGGSTAGGAGIKLA 299 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKEN 360 Y++ + +S +I TDG+ N S+ +K+ ++ + Sbjct: 300 YKLAQDNFIKSGNN----------------RVILATDGDFNVGVSSDTELVKLIEQKRNK 343 Query: 361 FIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + + + ++ K + + N L + + + ++ Sbjct: 344 GVFLTVLGFGSGNLQDSKMEKIANKGNGNYAYIDN--EL-EAKKVLVNEIGATLLTIA 398 >gi|260890922|ref|ZP_05902185.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] gi|260859475|gb|EEX73975.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] Length = 516 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 75/237 (31%), Gaps = 33/237 (13%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + ++ E +E+++D SGSM + D + A +K ++M A Sbjct: 147 KDDMVNPFTKQKIEENMNVNLEIILDASGSMKQKI-GDKTMMEIAKESIEKVVSEMPANT 205 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKP 291 A L + K+ + E + +K++ + Sbjct: 206 KAGLRVFGHKGDNTASKKQES------CSANELISPIETLNKDKLKSSLAPI---QPTGW 256 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T +++ L + + +TDG + N + Sbjct: 257 TSIAKSIENGTNDLK----------------ALKGEKTLNILYIITDGIETC---DGNPV 297 Query: 352 KICDKAKENF--IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 + K K I + I N + ++LK ++ Y+ +V +A L Q I Sbjct: 298 ETAKKFKNENTDIVLGIIGFNVDAHQNKVLKEIANAANGYYSSVNDAAKLTEELQRI 354 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 54/173 (31%), Gaps = 23/173 (13%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ ++ +G++ Y+ R K + S T++ A+ Sbjct: 19 VNVVNTFDISPTATRVGVVQYSDRNSLVFNLGNKVNKPSTVSAINGISYQSGGTNTGAAL 78 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 K RQ K II LTDG++++ S + Sbjct: 79 KY-----------------VRQYAAWREGNVPKVIIVLTDGKSSDSVSGPS-----RDLV 116 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +++ I + + +LL+ + + N ++L IS + Sbjct: 117 AAGVEVYAIGVGK-FDHGQLLQIANNKQNNVIELNNFNALATKIDMISTNVCS 168 >gi|153873859|ref|ZP_02002297.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152069676|gb|EDN67702.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 367 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 51/168 (30%), Gaps = 27/168 (16%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + L+ Y +EP+ G + V+ T + + AY + + Sbjct: 40 DKVSLVVYAGAAGVVLEPTPGHQSVKIN-GALERLTAGGSTHGSAGIHLAYNLAEQAFIK 98 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISIN 370 + I+ TDG+ N + + ++ +++ I + T+ Sbjct: 99 NGIN----------------RILLATDGDFNVGTVDFEALKNLVEEKRKSGISLTTLGFG 142 Query: 371 ASPNGQRLLKTCV-SSPEYHYNVVN--------ADSLIHVFQNISQLM 409 +L++ + + + D + I++ + Sbjct: 143 RGNYNDQLMEQLADAGNGNYAYIDTLNEAQKVLVDEMSSTLNTIAKDV 190 >gi|148234427|ref|NP_001080470.1| matrilin 2 [Xenopus laevis] gi|28175657|gb|AAH45220.1| Matn2-prov protein [Xenopus laevis] Length = 589 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 64/188 (34%), Gaps = 22/188 (11%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + A + + + L + +GL+ Y + V+ K + + Sbjct: 59 SSRSVRPADFEKVKEFLITMLKFLDIGPDTTRVGLLQYGSTVKNEFSLKMY--KRKSDIE 116 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R + ++ T + + Y + +G + + + + +TDG Sbjct: 117 RAVKRMMHLATGTMTGLAIQYAM--------NIAFSEAEGARPLNQYVPRIAMIVTDGRP 168 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +I KA+ + I I I + LKT S P E+ + V N + Sbjct: 169 QDP-----VEEISAKARMSGILIFAIGVGRVDMS--TLKTIGSEPHSEHVFLVANFSQIE 221 Query: 400 H---VFQN 404 VFQN Sbjct: 222 TLTSVFQN 229 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/381 (10%), Positives = 91/381 (23%), Gaps = 63/381 (16%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSN-----LSRLGDRFESISNHAKRALIDDAKRFIK 99 ++ A AA ++ A + + + + N A F Sbjct: 93 RTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAI 152 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + S + + + M + + + S L Sbjct: 153 DVDTGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEA-----PFSVTTELA 207 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + ++ E + ++D+SGSM Sbjct: 208 PSP----YNDDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMA--------------- 248 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ L+ AL + + + ++ Y Q Sbjct: 249 ----SPDKLPLLQTALKMLTQQLGAQD------KVSIVVYAGAAGVV--LDGAAGNDSQT 296 Query: 280 VT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ ++ AYQ+ +I TD Sbjct: 297 LNYALEQLSAGGSTNGAQGIQLAYQLAKKHLVEGGIN----------------RVILATD 340 Query: 339 GENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNAD 396 G+ N +N I + K+ I + T+ L++ + + + Sbjct: 341 GDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDS-- 398 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L + + + + ++ Sbjct: 399 -LNEARKVLVEQLSATLLTIA 418 >gi|293356227|ref|XP_574569.3| PREDICTED: rCG40584-like [Rattus norvegicus] Length = 1182 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 63/227 (27%), Gaps = 29/227 (12%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDLLSHV 248 + + + I T K + + + Sbjct: 128 LDSSFRQNQNFPAAQQGCPRQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQ----- 182 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + D + + T + A+K L + Sbjct: 183 QSSTRFSLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTT 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 +K K +I +TDG + + + I + + A I I Sbjct: 243 QKG-------------ARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAI 286 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + ++ LK SSP ++V N D+L + + + + Sbjct: 287 GVGQAFYQAQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|298491707|ref|YP_003721884.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233625|gb|ADI64761.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 426 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 17/197 (8%), Positives = 42/197 (21%), Gaps = 20/197 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A+ L + ++ ++ + E I + Sbjct: 59 INTVIQAVEQLLAQLQPGD------HISIVAFAGTSEVIIPNQIVQDAESIKCQLHKRLK 112 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + L K + F Sbjct: 113 AGGGTIIAEGLSLGITELLKGTKGAVSQAFLLTDGHGDR-------GLKIWKWEMGPNDK 165 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +++ KA + + T N L K + + + F + + Sbjct: 166 KRCLELAQKATRVSLTLNTFGFGNDWNQDLLEKIADAGGGTLAYIERPQQAVDQFSRLLK 225 Query: 408 LM-------VHRKYSVI 417 + H S++ Sbjct: 226 RIQSVGLTNAHLLLSLV 242 >gi|293344414|ref|XP_001080404.2| PREDICTED: rCG40584-like [Rattus norvegicus] gi|149067645|gb|EDM17197.1| rCG40584, isoform CRA_b [Rattus norvegicus] Length = 1182 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 63/227 (27%), Gaps = 29/227 (12%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDLLSHV 248 + + + I T K + + + Sbjct: 128 LDSSFRQNQNFPAAQQGCPRQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQ----- 182 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + D + + T + A+K L + Sbjct: 183 QSSTRFSLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTT 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 +K K +I +TDG + + + I + + A I I Sbjct: 243 QKG-------------ARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAI 286 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + ++ LK SSP ++V N D+L + + + + Sbjct: 287 GVGQAFYQAQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|149067644|gb|EDM17196.1| rCG40584, isoform CRA_a [Rattus norvegicus] Length = 1084 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 63/227 (27%), Gaps = 29/227 (12%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDLLSHV 248 + + + I T K + + + Sbjct: 128 LDSSFRQNQNFPAAQQGCPRQDQDIVFLIDGSGSITYTDFEKMLAFVKAVMSQLQ----- 182 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + D + + T + A+K L + Sbjct: 183 QSSTRFSLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTT 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 +K K +I +TDG + + + I + + A I I Sbjct: 243 QKG-------------ARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAI 286 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + ++ LK SSP ++V N D+L + + + + Sbjct: 287 GVGQAFYQAQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|297673849|ref|XP_002814962.1| PREDICTED: anthrax toxin receptor 2 isoform 2 [Pongo abelii] Length = 411 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + I ++++ + + K+ + + + T +K A + + Sbjct: 1 MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQ----- 55 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K L II LTDG + K ++ + + + Sbjct: 56 -----------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEAKISRSLGASVYCVGVLD 103 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLMVHRKYSVILKG 420 Q L+ S E + V +L + +I ++ + S + G Sbjct: 104 --FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSVCVG 155 >gi|16550415|dbj|BAB70976.1| unnamed protein product [Homo sapiens] gi|119626255|gb|EAX05850.1| anthrax toxin receptor 2, isoform CRA_c [Homo sapiens] Length = 245 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + I ++++ + + K+ + + + T +K A + + Sbjct: 1 MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQ----- 55 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K L II LTDG + K ++ + + + Sbjct: 56 -----------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEAKISRSLGASVYCVGVLD 103 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLMVHRKYSVILKG 420 Q L+ S E + V +L + +I ++ + S + G Sbjct: 104 --FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSVCVG 155 >gi|79154032|gb|AAI07877.1| ANTXR2 protein [Homo sapiens] Length = 411 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + I ++++ + + K+ + + + T +K A + + Sbjct: 1 MRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQ----- 55 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K L II LTDG + K ++ + + + Sbjct: 56 -----------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEAKISRSLGASVYCVGVLD 103 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLMVHRKYSVILKG 420 Q L+ S E + V +L + +I ++ + S + G Sbjct: 104 --FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSVCVG 155 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 62/193 (32%), Gaps = 44/193 (22%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D + T++ K A+L ++ + E +GL+ + P+ Sbjct: 97 DSSKSMLASDLRPTRLERAKLAILDLVEQL-------ESDRIGLVAFAGSAFLQTPPTLD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R+ + D + +D A+++A + P K Sbjct: 150 YGAFRESLDATAPDMMSRGGSDLGVALREATKAF-------------------PVENNYK 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++ LTDGE+ + I KA + +K+ I + + Sbjct: 191 AVVLLTDGEDLGGHA----IDEAKKASKEGVKVFAIGLGTP-------------EGDYLR 233 Query: 392 VVNADSLIHVFQN 404 N + ++ Sbjct: 234 QTNDAGIEEYMRD 246 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 67/189 (35%), Gaps = 33/189 (17%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A F+ S+ +++ + L+ + V++ + + +K+ + R Sbjct: 110 REAAKKFMASL-----LRKQDQIDLMDFADDVDELVSFTSDVQKIDSGLGRIHHGDA--- 161 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ A Q L G S ++ ++ +TDGEN + Sbjct: 162 TALYDAVYLASQRL---------------GETPTSAGQRRVLVLITDGENTTHHGS--YD 204 Query: 352 KICDKAKENFIKIVT-ISINASPNG-------QRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++A+ I I + S + L++ + +Y V + L FQ Sbjct: 205 AALEQAQRAGAMIYALIIVPVSADAGRNTGGEHALIQLARDTGGKYYYVEDKHDLAPAFQ 264 Query: 404 NISQLMVHR 412 ++S + + Sbjct: 265 HVSDDLRTQ 273 >gi|299140485|ref|ZP_07033623.1| BatB protein [Prevotella oris C735] gi|298577451|gb|EFI49319.1| BatB protein [Prevotella oris C735] Length = 342 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 67/226 (29%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++ K + +D + +GL+ + + + Sbjct: 97 DISNSMLAEDVTPSRLEKSKLLIENLVDHFT-------NDKVGLVVFAGDAFVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A++ + K Sbjct: 150 YVSAKMFLQNIKPSLIATQGTDLARAIELSQNSFMQ------------------RDNIGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ A + + + I GQ Sbjct: 192 AIIVITDGEDHEGGA----LEAAKAAHKKGSNVFILGIGDPKGTPIPDGEGGYMKDNRGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 248 TVMSALNEQMCQQIAQAGSGTYIHVDNT---NDAQEKLNAELSKLQ 290 >gi|34525892|emb|CAE46626.1| trombospondin-related protein [Plasmodium falciparum] Length = 331 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 70/204 (34%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + I+ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLIVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 + I N R L C S Sbjct: 188 VGIGQGINVAFNRFLVGCHPSDGK 211 >gi|260808373|ref|XP_002598982.1| hypothetical protein BRAFLDRAFT_150456 [Branchiostoma floridae] gi|229284257|gb|EEN54994.1| hypothetical protein BRAFLDRAFT_150456 [Branchiostoma floridae] Length = 132 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 43/146 (29%), Gaps = 15/146 (10%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + T K + S T + A+ A + + F Sbjct: 2 VEFKLADHTNKASLATAINNVSYQSGGTQTGRALDAARTQMDWRQPPVPNVCFSLLQAA- 60 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 I +TDG + + N + ++N I + I + N L + Sbjct: 61 ---------IVVTDGMSGD-----NVQQPAKALRDNDISAYGVGIGPAINANELNEIAGG 106 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMV 410 + + + N D L + IS + Sbjct: 107 DAGHVFYIPNYDKLEKEMEKISNSVC 132 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 69/227 (30%), Gaps = 49/227 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + E+ LV+D SGSM K+AALK A Sbjct: 30 EAQASAVAEKAPLNFCLVLDRSGSMQ--------------------GAKLAALKEATRRV 69 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTP 296 +D++ V ++ + V+ + ++ + ++ V ++ T + Sbjct: 70 IDTLTPQDIVS------IVLFDDTVQTLVPATFATDRDALKAQVDAIEEA---GGTAMSG 120 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 M L K ++ LTDG+ + + + Sbjct: 121 GMAAGIVEL----------------RKHHDPGRVSAMLLLTDGQ-TWGDEDRC-RALAQE 162 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + +++ + + A N + L ++ + + + FQ Sbjct: 163 LARDHVRVTALGLGAEWNEKLLDDIADATGGLSDYIADPSQITTFFQ 209 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K S + Sbjct: 381 SNFRLMLKFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 474 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRVGAVQYTYEQRLEFGFDKYNSKADILSA 533 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 534 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMIIITDGRS 577 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V + D+L Sbjct: 578 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLY 630 Query: 400 HVFQNISQLMVHR 412 + I Q + Sbjct: 631 KIAPRIIQNICTE 643 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 40/146 (27%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 286 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 343 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 344 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 385 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + ++ Sbjct: 386 SRVARESGINVFFITVEGAAEREKQH 411 >gi|257387423|ref|YP_003177196.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257169730|gb|ACV47489.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 788 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/269 (9%), Positives = 69/269 (25%), Gaps = 35/269 (13%) Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 D + + + + + L + G+ Sbjct: 297 DVTTAASVPSALDDYAAVVVQDTSASDVGNATALQEFVVNGGGLVVAGGDNAYENGGYET 356 Query: 198 DLSGSM----HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 GSM ++ + ++ K L LD + + Sbjct: 357 SPIGSMLPVQVGNATGGESNIVVLVDVSSSAESGLSIQKAVALDVLDQLGDEN------R 410 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G++ + + E + + + TD ++ A ++L + Sbjct: 411 VGVVAFNHNAYRVSELRTLGQNRAETAEKIRQLESGGATDIAVGLQGADELLGDREGT-- 468 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II L+DG++ + ++ ++V++ + Sbjct: 469 -------------------IILLSDGQD----RLGPPAAVANQLGREGTRVVSVGVGKRV 505 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + S ++ + L +F Sbjct: 506 GVPTMRQIAGESGGSYFAADETERLRLLF 534 >gi|149632103|ref|XP_001514474.1| PREDICTED: similar to collagen type VI alpha 4 [Ornithorhynchus anatinus] Length = 1844 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 59/169 (34%), Gaps = 21/169 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +++ + E+V +G++ +++ + + T K I T + A+ Sbjct: 649 VNMSNIGPENVRIGVLQFSSSPREEFMLNKYTTKEDLSRAISDIKQIKAGTQTGQALTFT 708 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + +++I +TDGE + ++ Sbjct: 709 LPYFDTSR--------------WGRPTEPQYLIVITDGEAQDS-----VKGPAKALRDKG 749 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I I I + N +LL+ + + + + DSLI + + IS + Sbjct: 750 ISIFAIGV-LEANKTQLLEI-TGTEDQVFYENDFDSLIFLKKKISFKLC 796 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 46/163 (28%), Gaps = 26/163 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V G++ Y+ E V+ + A+ Q Sbjct: 475 GNNRVRFGVVQYSDSPHLEFEVGQYHSTVKLKEAIRGIKQLRGRDRIGEALNYMNQRFMD 534 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + K +I +T G + + + ++ I I I Sbjct: 535 NDRV-------------------KILILITAGNFQD-----EVAESAQELRQRGIVIYAI 570 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + Q L + + E V + D+L H+ + Q + Sbjct: 571 GV--KTDNQLKLISIAGTEENVLCVNDFDTLKHIKDEVVQDIC 611 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 39/143 (27%), Gaps = 16/143 (11%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +++ L ++V +GL+ Y+ K + A+K Sbjct: 260 ETVKSLVIGPDNVRIGLVLYSDEPRLVFSLETFQTKESIMSHLAKLPYRGGKPKTGAALK 319 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 F + G + Q+ + +T+G S K + + Sbjct: 320 F-----------LRENIFTQDGNRRYEKRVQRMAVVITEGF-----SEDRVSKPASQLRR 363 Query: 360 NFIKIVTISINASPNGQRLLKTC 382 + I + I L K Sbjct: 364 AGVTIYALGIQRGLERGNLEKMA 386 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 384 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRSIRYM 443 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 444 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIITDGQSYDD---- 484 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + +I Sbjct: 485 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDI 540 Query: 406 SQLMV 410 + + Sbjct: 541 IRGIC 545 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 63/197 (31%), Gaps = 23/197 (11%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +++ ++ T+MA + ++S++ +G++ + E Sbjct: 331 DYSASMYEKNGDITRMAL---SAKDLIESLNDT------HQVGVVEFHRPDEPPAILQDF 381 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T + K Y+ +S +Q + P K Sbjct: 382 TTYKNAAIEAVSQFSSGKI----------YRDFSSCW--DAVLKGLKQFPEKPDPDIFKT 429 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 ++FL+DG +N S + AKE + I + I + L + + + + Sbjct: 430 LVFLSDGFDN--SSFSTPGNVISLAKERDVHIYILGIGRGSEEEVLKNIALETGGTYVHA 487 Query: 393 VNADSLIHVFQNISQLM 409 N F+ + + Sbjct: 488 ENIAVFRERFKQTIKDI 504 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 38/115 (33%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ + +II A+ + +L + F I + Y + +++ + A +AGA + + S Sbjct: 14 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLA 73 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 I I+ S +SA N V Sbjct: 74 TGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQANAVQVQLSESV 128 >gi|86606733|ref|YP_475496.1| hypothetical protein CYA_2093 [Synechococcus sp. JA-3-3Ab] gi|86555275|gb|ABD00233.1| hypothetical protein CYA_2093 [Synechococcus sp. JA-3-3Ab] Length = 431 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 59/197 (29%), Gaps = 23/197 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + L + + L + + + + +R+ V R Sbjct: 214 LDQFRATPLDLGAVLRFDNRGAGFGATALGQPLRTAQLLQDFTSDKQLLRRGVLRAN--- 270 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFK 345 T A +A + L+ + F + ++ TDG +N Sbjct: 271 PGGNTALYDATVEAGRFLSDFRPTERFN---------------RRLVVFTDGIDNESTRS 315 Query: 346 SNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 N + ++ A+E + + + + N L + ++ ++ L F Sbjct: 316 INQASQELTTLARERGQKLTVYVVGLGVDLNLLELQRLAAATEGTFVLARFSEGLEAPFA 375 Query: 404 NIS-QLMVHRKYSVILK 419 N+ + + V ++ Sbjct: 376 NLFPAAIGEHRLQVRVQ 392 >gi|52695907|pdb|1TZN|AA Chain a, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695909|pdb|1TZN|BB Chain b, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695911|pdb|1TZN|CC Chain c, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695913|pdb|1TZN|DD Chain d, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695915|pdb|1TZN|EE Chain e, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695917|pdb|1TZN|FF Chain f, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695919|pdb|1TZN|GG Chain g, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695921|pdb|1TZN|HH Chain h, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695923|pdb|1TZN|II Chain i, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695925|pdb|1TZN|JJ Chain j, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695927|pdb|1TZN|KK Chain k, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695929|pdb|1TZN|LL Chain l, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695931|pdb|1TZN|MM Chain m, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695933|pdb|1TZN|OO Chain o, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor Length = 181 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 53/175 (30%), Gaps = 21/175 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 25 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 83 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 84 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 126 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQL 408 ++ + + + Q L+ S E + V +L + +I Sbjct: 127 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQ 179 >gi|48425689|pdb|1SHU|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 182 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 53/175 (30%), Gaps = 21/175 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 26 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 84 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 85 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 127 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQL 408 ++ + + + Q L+ S E + V +L + +I Sbjct: 128 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQ 180 >gi|48425688|pdb|1SHT|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 181 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 53/175 (30%), Gaps = 21/175 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 26 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 84 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 85 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 127 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQL 408 ++ + + + Q L+ S E + V +L + +I Sbjct: 128 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQ 180 >gi|261414285|gb|ACX83559.1| plant-derived anthrax toxin receptor-Fc fusion protein [synthetic construct] Length = 171 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 53/172 (30%), Gaps = 21/172 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 19 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 77 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 78 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 120 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNI 405 ++ + + + Q L+ S E + V +L + +I Sbjct: 121 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSI 170 >gi|58429477|gb|AAW78142.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429505|gb|AAW78156.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 581 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429455|gb|AAW78131.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D DV C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDVYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K K+ +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLKDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429527|gb|AAW78167.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 575 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273261|dbj|BAA31174.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|163848376|ref|YP_001636420.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526300|ref|YP_002570771.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669665|gb|ABY36031.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450179|gb|ACM54445.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 446 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/252 (9%), Positives = 62/252 (24%), Gaps = 44/252 (17%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + N E+ + Sbjct: 30 AAAGTTGLPLHMTIVADASRSMRIPILN----------------EQQFRDVVRGSGAHEV 73 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + R+ + +AL ++ +D +GLI + Sbjct: 74 LVDGVPVWQLNNPLSAEQRSRYRSPIDYTTHALHSLIERLDHND------RLGLIACASD 127 Query: 263 VEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 P ++ + R + + T+ ++ A + Sbjct: 128 AIVLASGIPGSRRAELVAAIARLPALRLGETTNLAQGLQLALAQFVA------------- 174 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + I+ +TDG + + + +A I + TI + S L + Sbjct: 175 ----ADDATVRRIVLITDGFTTDQ---TLCLTLAREAAARGISLSTIGLGGSFEEHLLTQ 227 Query: 381 TCVSSPEYHYNV 392 S V Sbjct: 228 LADLSGGRASFV 239 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 69/203 (33%), Gaps = 35/203 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++ + A FL L ++ ++G++ + + + K R + Sbjct: 319 FNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIKSNLIQIISSKERNNLLES 373 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T M+ A+Q++ + + I+ LTDGE+N+ Sbjct: 374 LPKAANGGTSICAGMRSAFQVIREVYPQIDGSE----------------IVLLTDGEDNS 417 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 D+ ++ I I++ S + Q +++ + H+ + LI Sbjct: 418 ------AKDCIDEVTQSGAIIHLIALGPSAD-QAVIEMSAMTGGNHFFASDEAQNNGLID 470 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 471 AFGALASGNTDLSQQPLQLESKG 493 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 69/275 (25%), Gaps = 52/275 (18%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + L + R Q + L E ++E Sbjct: 241 SEVEALPKQVIFVLDTSGSMDGNRIKQLKEAMNSI-LSELKKEDVFNIVEF--------- 290 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + N + +D + Sbjct: 291 ---------------------SSIVKVWNVDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQ 329 Query: 265 KNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + TD A++ +++ +K+ Sbjct: 330 VLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKENKEDA-------- 381 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLK 380 Q I+FLTDGE ++N I + I ++S + + L K Sbjct: 382 -----HQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGADREFLQK 436 Query: 381 TCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + + + Y +A L ++ IS +++ Sbjct: 437 ISLKNLGFARHIYEAADASLQLQEFYKQISSPLLN 471 >gi|254464606|ref|ZP_05078017.1| TadE-like protein [Rhodobacterales bacterium Y4I] gi|206685514|gb|EDZ45996.1| TadE-like protein [Rhodobacterales bacterium Y4I] Length = 178 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 34/78 (43%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 ++ R ++ + N ++ FA+ + +F++++ + + + + + +E A + + Sbjct: 1 MIRRAAHALRRFRQQQDGNATVEFAIVIPAFIMILMSTVELGMINLRHSQLERALDQTVR 60 Query: 63 AGASKMVSNLSRLGDRFE 80 ++S R E Sbjct: 61 TIRLSTGGDMSHDQLRDE 78 >gi|156409373|ref|XP_001642144.1| predicted protein [Nematostella vectensis] gi|156229285|gb|EDO50081.1| predicted protein [Nematostella vectensis] Length = 203 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 31/184 (16%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + + LL +G + +++R + T Sbjct: 27 RVINFIKTIVSLLPVSPRQTRIGAVLFSSRPYLMFNFQKYRTVRQVLAALQRIRYPRGGT 86 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A++ Y L + S ++ +I LTDG++ + + Sbjct: 87 KTGRALRYTYSRLFRSR----------------SRVRKQALIVLTDGKSQDSVG-----Q 125 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 K +++ I + + + L + S + F+N+ +++ Sbjct: 126 PAAFIKNQGVELFAIGVGRNYRRRDLNQIA--SRGNVFTAK--------FENLGRIIGAI 175 Query: 413 KYSV 416 K V Sbjct: 176 KSKV 179 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 20/156 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ ED +GL+ Y+ V ++ + + + D T + + Sbjct: 402 LRGQYYIGEDNMIGLVSYSNDVNIDLPIAKFDLNQRASFAGAVNDLQAGGGTATFDGIAV 461 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++ + + I L+DGE N + I + + Sbjct: 462 AMKMIQEQRA--------------ADPNIRPVIFVLSDGETNKGHPLNDIKGIVE---DT 504 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 I I TI + N L + N D Sbjct: 505 GIPIYTIG--YNANIPALQAISSINEAASINADTDD 538 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 69/225 (30%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D SGSM KM + A++ L + Sbjct: 273 NVVFVIDKSGSMA--------------------GRKMRQTREAMVQILGDLRPEDQFNLV 312 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + + S E+ +++ + T+ A+ A ++L ++ Sbjct: 313 IFDGHV-FQWMPALLQASSQNVEQAKKFTSLISAM---GATNINDAVLLAVKMLDDSNRK 368 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ + ++N I+ A + + Sbjct: 369 E-----------KLPPGSVSMVILLTDGDATDGETNPKKIQENVKAAIGGSYHLYCLGFG 417 Query: 371 ASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQLMVH 411 N L K + + + + + L +Q ++ ++ Sbjct: 418 FDVNYAFLEKLALENGGVARRIYEDSDSDLQLQDFYQEVANPLLT 462 >gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] Length = 699 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 59/194 (30%), Gaps = 44/194 (22%) Query: 249 KEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 ++ ++ + + V + + + + T+ +++ + Sbjct: 327 RQQDRFKVVLFNNGSVDLTSGFLTVSQANVANVLQQLDNYKVGGGTNLYAGLQKGLTGID 386 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +D+ II +TDG N + ++ +++ T Sbjct: 387 ADRPAG--------------------IILVTDGVANVGQ--TEKKTFLKLLNDHDVRLFT 424 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLI------------HVFQNIS-------- 406 + S N L + S + +V NAD ++ F++I Sbjct: 425 FIMGNSANRPLLKEMADVSNGFAMSVSNADDIVGHIMLATSKLTHQAFRDIDLDIDGVKV 484 Query: 407 QLMVHRKYSVILKG 420 + + K S + +G Sbjct: 485 KNITPEKISSLYRG 498 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 72/204 (35%), Gaps = 37/204 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRD 283 ++ L+ A FL L ++ + G++ + ++ T + VRQ ++ Sbjct: 316 FDRIYRLRQAGEQFL-----LQILETGSWAGIVVFNSQALTKTYLKQITGDSVRQTLSAY 370 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T+ +++ +Q+ + I+ LTDGE+ Sbjct: 371 LPTAAGGGTNICSGIREGFQVFLKKYPSTEGCE----------------IVLLTDGEDAG 414 Query: 344 FKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLI 399 S C + + + I TI++ S + + + ++ +++ LI Sbjct: 415 VSS-------CFAEVQRSGSIIHTIALGPSAAKELEM-LADMTGGLKFSATDSLDSNGLI 466 Query: 400 HVFQNISQ---LMVHRKYSVILKG 420 F IS + + + KG Sbjct: 467 DAFSGISSGSGDISQQSIQLESKG 490 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 38/115 (33%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ + +II A+ + +L + F I + Y + +++ + A +AGA + + S Sbjct: 20 RGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLA 79 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 I I+ S +SA N V Sbjct: 80 TGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQANAVQVQLSESV 134 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 58/201 (28%), Gaps = 29/201 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVR 277 + +K A + +D + + ++ + R + + E+++ Sbjct: 51 SGSMNGRPLEIVKQAAIRLVDRLKTGD------RLSVVAFDHRAKVLVPNQVIDNPEQIK 104 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + R T ++ + L KK + F LT Sbjct: 105 KQINRLA---ADGGTAIDEGLRLGIEELAKGKKETISQAF-----------------LLT 144 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DGEN + +N A + + T+ + N L K + + A+ Sbjct: 145 DGENEHGDNNRCLK-FAQLAAGYNLTLNTLGFGDNWNQDVLEKIADAGLGSLSYIQKAEQ 203 Query: 398 LIHVFQNISQLMVHRKYSVIL 418 + F + + + Sbjct: 204 AVDEFGRLFSRIQTVGLTNAY 224 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 69/275 (25%), Gaps = 52/275 (18%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + L + R Q + L E ++E Sbjct: 300 SEVEALPKQVIFVLDTSGSMDGNRIKQLKEAMNSI-LSELKKEDVFNIVEF--------- 349 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + N + +D + Sbjct: 350 ---------------------SSIVKVWNVDKVQVDYEVGEDPWPLYDSPEAPQKNKTNQ 388 Query: 265 KNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + TD A++ +++ +K+ Sbjct: 389 VLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKENKEDA-------- 440 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLK 380 Q I+FLTDGE ++N I + I ++S + + L K Sbjct: 441 -----HQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGADREFLQK 495 Query: 381 TCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + + + Y +A L ++ IS +++ Sbjct: 496 ISLKNLGFARHIYEAADASLQLQEFYKQISSPLLN 530 >gi|317493250|ref|ZP_07951672.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918643|gb|EFV39980.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 544 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 70/240 (29%), Gaps = 45/240 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +++ + I I P + ++D+SGSM Sbjct: 141 PWNAHSKLLRIAIK-ATDINATALPPANLVFLIDVSGSMSDED----------------- 182 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +KN+L L ++ + + ++ Y+ + + P+ G +K ++ Sbjct: 183 --KLPLVKNSLKLLVNKMRDQD------KISIVIYSGETKTVLPPTSGKDK-SDILSAIN 233 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + AYQ+ ++ II TDG+ N Sbjct: 234 QLSAGGSTAGGSGIDLAYQMAEKGFIKNGIN----------------RIILATDGDFNVG 277 Query: 345 -KSNVNTIKICDKAKENFIKIVTISINASPNGQR-LLKTCVSSPEYHYNVVNADSLIHVF 402 + K +N I + T+ ++ + + + V Sbjct: 278 ITDTQQLEEKIKKKSKNGINLTTLGFGQGNYNDSLMMHIADVGNGNYAYIDSMQEAQKVL 337 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 57/164 (34%), Gaps = 19/164 (11%) Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 FL + + + + G++ ++ E +I + D S + T Sbjct: 33 KDFLKKLANSFGISNKGSHAGVVTFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRI 92 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +A ++ + LTDG+ + + I Sbjct: 93 DLALAKALELFDIKNGARND--------------VPNLLFLLTDGK---QEPEMPLTHIS 135 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 D+ K+ I++ + I A N + L+ V +PE + V + D L Sbjct: 136 DEIKQKGIQLFAVGIGAGAN-KTELEKIVGNPENVFMVDDFDKL 178 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 42/401 (10%), Positives = 96/401 (23%), Gaps = 92/401 (22%) Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 + R + SA + S T+ + + ++ Sbjct: 2 ASASVAAFADDRQPAATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDES 61 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQR--------YNQKIVSFIPALLRIEMGERPIFLIELV 196 V L + + + + + + + Sbjct: 62 TEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAIS 121 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKR-----------------TKMAALKNALLLFL 239 + +S + T++ ALK+A+ FL Sbjct: 122 WGR-CTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFL 180 Query: 240 DSIDLLSH-----------------VKEDVYMGLIGYT------TRVEKNIEPSWGTEKV 276 D ++ + K +G Y + +W E + Sbjct: 181 DQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQTVHSLAWTPEDL 240 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ T + ++ A + L S QK +F Sbjct: 241 QKEQAAVNSLKAGGATRADFGLQHAVKQLNSG-----------------RPGAQKLTVFY 283 Query: 337 TDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPN------GQRLLKTCVSS 385 +DG + N IK + K + ++++I + + + S+ Sbjct: 284 SDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSN 343 Query: 386 --------------PEYHYNVVNA-DSLIHVFQNISQLMVH 411 +Y V+A L +F+ I ++ Sbjct: 344 YPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEIISIVTS 384 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 61/162 (37%), Gaps = 28/162 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D + +++ K L LD + +GL+ + + Sbjct: 111 DASKSMLARDVQPSRLERAKLELNTLLDELK-------GDRVGLVVFAGDAFIQSPLTSD 163 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V+ ++ + + ++ A+K A Q+L++ + + + Sbjct: 164 YSAVKLFLRAVDPEQMPQGGSNIGAALKLANQVLSNADRGAKE----------------R 207 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ L+DGE+ + T + K+ ++++ + + + Sbjct: 208 AVVLLSDGEDLFGEVGEAT----EALKDGGVQVLAVGVGSES 245 >gi|297668115|ref|XP_002812300.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-3-like [Pongo abelii] Length = 493 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 69/186 (37%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 97 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQTYTDKQSL 153 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + + G + PS K I + Sbjct: 154 KQAVGRITPLST--GTMSGLAIQTAMDEIFT----------VEAGARGPSSNIPKVAIIV 201 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 202 TDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMES--LKMMASEPLEEHVFYVET 254 Query: 395 ADSLIH 400 + Sbjct: 255 YGVIEK 260 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 58/190 (30%), Gaps = 24/190 (12%) Query: 222 DKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 D + A + + F ++ + +G++ Y+ +K Sbjct: 24 DGSGSVTDANFDKMKQFAKNVVNAFDISASSTRVGVVQYSDSNTLEFNLGDHADKPSTLA 83 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D T + A++ A R K +I +TDG+ Sbjct: 84 AIDSIVYQGGGTRTGSALEFA-----------------RVNAAWRGESVPKVMIVVTDGK 126 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + ++ + + + I + + +LL+ + + + ++L Sbjct: 127 SADSVTSS-----ANNLASQGVDVYAIGVG-NYRSTQLLEIAAGNQNNVIELTDFNALSA 180 Query: 401 VFQNISQLMV 410 + I+Q + Sbjct: 181 EIEQIAQAVC 190 >gi|160725|gb|AAA29777.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|239908149|ref|YP_002954890.1| hypothetical protein DMR_35130 [Desulfovibrio magneticus RS-1] gi|239798015|dbj|BAH77004.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 328 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/213 (10%), Positives = 55/213 (25%), Gaps = 27/213 (12%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + A + G M A++ P Sbjct: 43 VGPSFTLHLGGLLRPLGLVAAVLALAGPSLPPDAPTYAGRGVDMMFAVDLSPSMAAMDIP 102 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + T++AA+ A + + +GL+ + R + P+ + Q Sbjct: 103 AEGRTITRLAAVAEAAKTL-------ALSRPGDRIGLVAFGARAYLVVPPTTDRAALVQA 155 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + T A+ A + L K ++ DG Sbjct: 156 LASLDTGAAGRKTAMGDAVGLAAKRLDES------------------PGQAKAVVVFGDG 197 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +N +++ A + + + + + Sbjct: 198 RSNAGETDPVP--AAQAAVRHGVAVFAVGVGGD 228 >gi|160710|gb|AAA29770.1| thrombospondin related anonymous protein [Plasmodium falciparum] gi|33333554|gb|AAQ11893.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 72/234 (30%), Gaps = 51/234 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +P+ + LV+D SGSM D + + A KN + Sbjct: 108 SAKPVKDVVLVIDNSGSM---------------KDTDPNQDRYTAAKNLINRM------- 145 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDMDSLI--LKPTDSTPAMKQAY 302 D + +I + + +++ + ++D L TD + A++ Sbjct: 146 ---DRDNRVSVIVFDHATTLLQPFTRVKNQEIKDEIMAEIDGLATTDGGTDISLALEDTM 202 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + +I L+DG + V K+ I Sbjct: 203 SHIQESQDAGRSA----------------MVIMLSDGFSETDHDRVLADY-----KQQQI 241 Query: 363 KIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + TI ++ L + +Y+V NA L VFQ I + R Sbjct: 242 AVNTIGLSLVYKDGANLLQTIAAETGGQYYDVQNAADLSFVFQKIYDDVGDRSL 295 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 78/268 (29%), Gaps = 58/268 (21%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM------GERPIFLIELVVDLSGSM 203 + P I R I LVVD SGSM Sbjct: 329 ANVDVALASPLTAQFGVDPNQPRNSLATPPADVIVAAKNAWANNRKPANIMLVVDSSGSM 388 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 KM K + +FL+ + + +G+IG+++ Sbjct: 389 RDDD-------------------KMDQAKLGVEVFLNRLPSKDN------VGMIGFSSSP 423 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + +E + + + T A+ A Q L + K+ Sbjct: 424 AVLVPLATRSENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQPDRINA------- 476 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 I+ L+DG + + ++ ++ E+ I+I I+ A L + Sbjct: 477 ---------IVVLSDGADTASQLSI--DQMLGNFGESSIQIFPIAYGADAETSILQQIAD 525 Query: 384 SSPEYHYNVV----NADSLIHVFQNISQ 407 + + +F+N+S+ Sbjct: 526 -----FSRTELVQGSTGDIDKIFENLSR 548 >gi|294011133|ref|YP_003544593.1| tight adherence protein TadE [Sphingobium japonicum UT26S] gi|292674463|dbj|BAI95981.1| tight adherence protein TadE [Sphingobium japonicum UT26S] Length = 126 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 37/118 (31%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + ++ +I F L+V F++L+ + + + ++ A + + Sbjct: 3 RLARDQRGAAAIEFVLAVPPFIMLLMGALQLGIIACARTGLQHAVDEGARYASIFPRPTD 62 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + S A +D S + + + + IS++ Sbjct: 63 EEIIAKVRSREFGLDPAYTNDPTVVQGTQYGVSYREITMTYSRPLNFIVYQTPAISIS 120 >gi|33333556|gb|AAQ11894.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 64/186 (34%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K L +D++ + + ++ Y + V+ E + Sbjct: 158 SSRSVRPREFEKVKTFLSQVIDTL---DIGETATRVAVVNYASTVKVEFHLQTHSDKESL 214 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V+R T S A++ A + + G + P+ K ++ + Sbjct: 215 KQAVSRIAPLAT--GTMSGLAIRTAMDEVFT----------VEAGARAPAFNIPKVVVIV 262 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + +A+ + I+I + + + L+ S P + + V Sbjct: 263 TDGRPQDQ-----VQEAVAQAQASGIEIYAVGVGRADMQS--LRQLASEPVETHAFYVET 315 Query: 395 ADSLIH 400 + Sbjct: 316 YGVIEK 321 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 77/264 (29%), Gaps = 49/264 (18%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM------GERPIFLIELVVDLSGSM 203 + + V +P I R I LVVD SGSM Sbjct: 324 ANPNVPLTDPISPAFGVDPQGVQTVLAVPTAEVIVAIKNSWSLNRKRADIVLVVDTSGSM 383 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + K+ +K + FL I +GLI + + Sbjct: 384 --------------------EGDKLTMVKAGIETFLMRILPED------RLGLITFASAA 417 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + ++ T A+ Q+L + Sbjct: 418 RLVVPMAPLSDNRIALQDAVQAMRASGRTALFDALVLGKQVLEQ--------------LP 463 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + I+ L+DG +N+ +++ +I E+ I I ++ + Q L Sbjct: 464 PADDDRIRAIVLLSDGADNSSQAS--LDQIRTLFDESGISIFPVAYGNDADRQVLDAIAE 521 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 S V + + +F+N+S+ Sbjct: 522 FSR-TIVVVGDTGDIAQIFENLSR 544 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 50/179 (27%), Gaps = 26/179 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + + + +T + + K + T + Sbjct: 684 LEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTAT 743 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + + P + F++ +TDG++ + Sbjct: 744 GDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD-----VRGPA 783 Query: 355 DKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 A + I I ++ + A + LK S P + + L + ++ + + Sbjct: 784 AAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 839 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 61/169 (36%), Gaps = 27/169 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L+ +++ +G++ + + + + Q +T ++ T Sbjct: 329 NQAAELYLIQIIEKGSLVGMVTFDSYAQIQNNLIKITEDNTYQKITANLPQEASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 +K +Q ++ + + + II LTDGE+N I +C Sbjct: 389 NGLKAGFQAISQSNQSTLGSE----------------IILLTDGEDN-------QISLCF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVF 402 ++ +++ I TI++ + L + + + + + LI F Sbjct: 426 EEVRQSGAIIHTIALG-PSAEKELETLSNMTRGHRFYAHNDINGLIDAF 473 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 63/212 (29%), Gaps = 38/212 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D + A K A F+ + +G++ + PS Sbjct: 93 DVSLSMMAKDVAPNRFEAAKAAAKKFIQDL------PARFNVGVVAFAGSANVVATPSGD 146 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + T A+ + Q + S ++ Sbjct: 147 RAAAISSIDTLTLAKR---TAIGEAVFTSLQAIRSFDAQAGQDPP------------PAH 191 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--------------PNGQRL 378 I+ L+DG+N +S + D A+ + + TI+ N L Sbjct: 192 IVLLSDGDNTTGRS---VPEAIDAARAADVPVSTIAFGTPYGTVEIDGETTPVEVNKVTL 248 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + Y + D L V+ NI + Sbjct: 249 AGLAQGTNGKAYEAADNDQLSQVYANIGTSLG 280 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ + +E+ +GL+ + + + + + + Sbjct: 118 SSKNRLDFAKELIEYFV-------YQRENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDD 170 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S +K +I LTDG N Sbjct: 171 IYIMDLGNGSALGLGISIALSHL------------------KHSEAPKKSVIVLTDGVVN 212 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + ++ + A+ +KI ++ I + K Y L V+ Sbjct: 213 SDEV--YKDQVINLAQGLNVKIYSVGIGSDEELNVGFKL---RSGKFYQ----GVLKEVY 263 Query: 403 Q-----NISQLMVHRKYSV 416 IS YSV Sbjct: 264 DPSMLFEISNKTGGLFYSV 282 >gi|3273287|dbj|BAA31187.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|326789198|ref|YP_004307019.1| hypothetical protein Clole_0061 [Clostridium lentocellum DSM 5427] gi|326539962|gb|ADZ81821.1| Protein of unknown function DUF3520 [Clostridium lentocellum DSM 5427] Length = 670 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/363 (11%), Positives = 96/363 (26%), Gaps = 60/363 (16%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LA A +++ + ++ + + Y+ + I Sbjct: 42 LAPAESATADIEETAPIVFNSESYN-SFTENPFIKTTDETFSTFSIDVDIASYSNVRRFI 100 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH---LLNQRYNQKIVSFIP 178 N + + L I Y + F + + I Sbjct: 101 TNQELPPVDAIRTEEL------INYFNYSYPEPTDNIPFSLSQEMMPCPWNESSQLLLIG 154 Query: 179 ALLR-IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ++ E P + ++D+SGSM K+ LK + + Sbjct: 155 LQGKHLQPDEVPPSNLVFLLDVSGSM-------------------SDTNKLPLLKKSFNI 195 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTP 296 ++ + VY G G + + T Sbjct: 196 LTSNLKESDCISIVVYAGASG--------VVLDGVAGNDESLINEALESLEAGGSTAGAE 247 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICD 355 + AY++ + +I TDG+ N S + I+I + Sbjct: 248 GIAMAYELAEKHFIKDGNN----------------RVILATDGDFNVGPNSESDLIRIIE 291 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 K +E I + + + +++ + + + L + + + + + Sbjct: 292 KKREKGIFLSVLGLGMGNYKDDKMESLADHGNGNYAYIDS---LQEAKKVLGEQLTGTLF 348 Query: 415 SVI 417 ++ Sbjct: 349 TIA 351 >gi|258507439|ref|YP_003170190.1| pilus specific protein [Lactobacillus rhamnosus GG] gi|257147366|emb|CAR86339.1| Pilus specific protein, ancillary protein involved in mucus-adhesion, contains von Willebrand factor (VWF) domain [Lactobacillus rhamnosus GG] gi|259648793|dbj|BAI40955.1| putative cell surface protein [Lactobacillus rhamnosus GG] Length = 895 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 28/397 (7%), Positives = 102/397 (25%), Gaps = 34/397 (8%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA--NNAAILAGASKMVSNLSRLGDRFESI 82 +FA+ ++ +L G + +++ + A + ++ G + Sbjct: 11 LFAVLLILMSMLTGLVTSGSSVVTATDNIRPTYQTD----ANGTYPTNSWQVTGQQNVIN 66 Query: 83 SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNN 142 + D ++ + Y + + + Sbjct: 67 QRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAKETNTPGLYDVYLNVK 126 Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + R+ + ++ + + + Sbjct: 127 GNKQQNVKPVDIVLVVDMSGSMESNRWGTNRAGAVRTGVKN-----------FLTSIQNA 175 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + S+P K ++ + + G Sbjct: 176 GLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKAGNASQQQAINGALSPRFQGGTYTQIGL 235 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + + T ++ + D + + + + ++ S + Sbjct: 236 RQGSAMLNADTSGNKKMMILLTDGVPTFSNEVINSEWINGTLYGTNFGSSRDEPGNTARL 295 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL---- 378 + P I+ T T+ AK++ ++ + I + + + Sbjct: 296 RWPYTDSSGHYIYDTW---------PATLGEAKIAKDSGNEVHALGIQLADDDHYMTKEK 346 Query: 379 ----LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ +SP+ + + +AD++ N ++ ++ Sbjct: 347 IRQNMQLITNSPDLYEDADSADAVEAYLNNQAKDIIK 383 >gi|3273299|dbj|BAA31193.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNDFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 41/381 (10%), Positives = 93/381 (24%), Gaps = 63/381 (16%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSN-----LSRLGDRFESISNHAKRALIDDAKRFIK 99 ++ A AA ++ A + + + + N A F Sbjct: 93 RTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGETPVSTFAI 152 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + S + + + M + + + S L Sbjct: 153 DVDTGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNEA-----PFSVTTELA 207 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 Y+ ++ E + ++D+SGSM Sbjct: 208 PSP----YNYDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMA--------------- 248 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ L+ AL + + + ++ Y Q Sbjct: 249 ----SPDKLPLLQTALKMLTQQLGAQD------KVSIVVYAGAAGVV--LDGAAGNDSQT 296 Query: 280 VT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ ++ AYQ+ +IF TD Sbjct: 297 LNYALEQLSAGGSTNGAQGIQLAYQLAKKHLVEGGIN----------------RVIFATD 340 Query: 339 GENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNAD 396 G+ N +N I + K+ I + T+ L++ + + + Sbjct: 341 GDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDS-- 398 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L + + + + ++ Sbjct: 399 -LNEARKVLVEQLSATLLTIA 418 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 69/203 (33%), Gaps = 35/203 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++ + A FL L ++ ++G++ + + + K R + Sbjct: 320 FNRLNRMNQAAKHFL-----LQTIENGSWVGMVHFDSTANIKSNLIQIISSKERNNLLES 374 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T M+ A+Q++ + + I+ LTDGE+N+ Sbjct: 375 LPKAANGGTSICAGMRSAFQVIREVYPQIDGSE----------------IVLLTDGEDNS 418 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 D+ ++ I I++ S + Q +++ + H+ + LI Sbjct: 419 ------AKDCIDEVTQSGAIIHLIALGPSAD-QAVIEMSAMTGGNHFFASDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALASGNTDLSQQPLQLESKG 494 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 67/202 (33%), Gaps = 32/202 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG- 272 S + T++ + A L+L + ++++ +GL+ + + Sbjct: 314 DKSGSMSSGDPITRLTLMNQAAELYL-----IQILEKESLVGLVTFDSIAIVQNNLIRMI 368 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T +++ ++ +TS + + + Sbjct: 369 NDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE---------------- 412 Query: 333 IIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHY 390 I+ LTDGE++ I C ++ K + I TI++ + L + Y Sbjct: 413 IVLLTDGEDD-------QISSCFEEVKHSGAVIHTIALGPDA-ARELETLSDMTGGRRFY 464 Query: 391 NVVNADSLIHVFQNISQLMVHR 412 + LI F IS Sbjct: 465 ASEGINGLIDAFSGISSKSGSL 486 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 67/202 (33%), Gaps = 32/202 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG- 272 S + T++ + A L+L + ++++ +GL+ + + Sbjct: 314 DKSGSMSSGDPITRLTLMNQAAELYL-----IQILEKESLVGLVTFDSIAIVQNNLIRMI 368 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T +++ ++ +TS + + + Sbjct: 369 NDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE---------------- 412 Query: 333 IIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHY 390 I+ LTDGE++ I C ++ K + I TI++ + L + Y Sbjct: 413 IVLLTDGEDD-------QISSCFEEVKHSGAVIHTIALGPDA-ARELETLSDMTGGRRFY 464 Query: 391 NVVNADSLIHVFQNISQLMVHR 412 + LI F IS Sbjct: 465 ASEGINGLIDAFSGISSKSGSL 486 >gi|198420236|ref|XP_002121660.1| PREDICTED: similar to collagen type VI alpha 6 [Ciona intestinalis] Length = 1638 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 59/182 (32%), Gaps = 22/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++ S + + +G++ Y++RV I+ K Sbjct: 811 ETTKEWIGSFVREFEIGEYNTKIGVVQYSSRVRSEIDIGDYDSKADLLAAISSIEFAAGN 870 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ A++ + S + K +I LTDG + + Sbjct: 871 TNTGSALEYVRTVGFSGRHG-------------ARNGVPKVLIVLTDGNAQDGVLD---- 913 Query: 352 KICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K + + + I + G + ++SP Y+V N D++ + + + + Sbjct: 914 -AASKLHRDGVAVYAIGVGRPNMGQLNAVTSEPITSPN-IYHVRNFDAIQTIQSGLLRRV 971 Query: 410 VH 411 + Sbjct: 972 CN 973 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/170 (10%), Positives = 51/170 (30%), Gaps = 34/170 (20%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ + ++ Y++ + + V D+ + T + A++ A + Sbjct: 63 NFDISRDTTRVAVVQYSSYPRTEFDLDTYSSAVGVLRGIDLIQYMSGNTQTGLALRYAIE 122 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + S + K I L+DG + + + + + + I Sbjct: 123 SVFSRAREDSA----------------KVAIVLSDGRSQDQVN-----EAATSLRSSGIA 161 Query: 364 IVTISINASPNGQRLLK------TCVSSPEYHYNVVNADSLIHVFQNISQ 407 + I + +R+ + + + F++I Q Sbjct: 162 TFAVGIGDEMSHERMEELRQISVAAAEDQSSVFMAKD-------FRSIGQ 204 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/330 (8%), Positives = 72/330 (21%), Gaps = 39/330 (11%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 ++ D SA ++ ++ L Sbjct: 296 NMDYIMGDTHTGAALTYMLDEIYSSANGDRPDVPDLAIVMTD--GKAQEPDLVVEAANRV 353 Query: 146 YNMDVMTSYDYRLQFIE----HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 + V + + + ++ + + I I + + Sbjct: 354 HEAGVTVYTVGVADYSLEEIKVIASDPDKNYVIEALNFDIIELKRRGLIKSICTDAEQTC 413 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 P D + + +D + ++ +T Sbjct: 414 ---------PAATAELVFLIDGSTSIGFDNFEKLKRWLKSIVDAFQVGPHYTRVAVVQFT 464 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 R + + T + A++ + Sbjct: 465 NRPVLEFGLNDHSTTQATLQAIQRIRYRRGSTSTGRAIEFVMNEVF-------------- 510 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 K +I LTDG++ + + A E + T+ L+ Sbjct: 511 --THSRENVPKILIALTDGQSQDD-----VTQATASAAEAG--VHTLVFGIGNTRPGQLR 561 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 VS ++ + D + + + L+ Sbjct: 562 QLVSKEDHVFQAAGFDVIQKMQSKLVSLIC 591 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/315 (10%), Positives = 84/315 (26%), Gaps = 36/315 (11%) Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + V + N +I + D + + Sbjct: 496 TSTGRAIEFVMNEVFTHSRENVPKILIALTDGQSQDDVTQATASAAEAGV-HTLVFGIGN 554 Query: 163 HLLNQ-----RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 Q + + +M + + LI + + S D + + Sbjct: 555 TRPGQLRQLVSKEDHVFQAAGFDVIQKMQSKLVSLICVKAEPECSSQEIDLHFMVDGSGS 614 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++ ++ K KN + F K + L YT+ + + + K Sbjct: 615 IGNENFRKVKAWI-KNVVRSF-------DVGKYTTRVALTQYTSTINTEFDFKKFSTKRE 666 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 D T + A+ F + K ++ ++ Sbjct: 667 IDYAIDQMEFAGGATLTAQALVH-----------IRENGFTEESGA--RPGAPKVLVVIS 713 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNA 395 DG + + N K E+ + + I + + L+ S P + + Sbjct: 714 DGRSAD-----NIETPARKLHESGVYVFAIGVGNTWRSA--LEIIGSEPVVTHVQEGASY 766 Query: 396 DSLIHVFQNISQLMV 410 D++ + +++ + + Sbjct: 767 DAINNFRRDLVRNIC 781 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 58/172 (33%), Gaps = 18/172 (10%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + V +G+I Y+TR + NI +K L T + A++ Sbjct: 1010 NMTQKFQIGPDAVRVGMIQYSTRPKTNIAIGQYNDKESLQEAFGQVEWQLGDTYTARALR 1069 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + + +L K +I +TDG+ + + Sbjct: 1070 Y-------------VSKSYARATTRENLHATKLLIIITDGQP---QDRNEVKQAVRNLHS 1113 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-NISQLMV 410 +I I + S + + ++ + N +S +FQ +S+L+ Sbjct: 1114 EGWRIFAIGVGQSDISELGILASNPDSDHVFYANNFNS-TRIFQGRLSRLIC 1164 >gi|114594054|ref|XP_001145008.1| PREDICTED: hypothetical protein isoform 3 [Pan troglodytes] Length = 411 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 53/174 (30%), Gaps = 24/174 (13%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + I ++++ + + K+ + + + T +K A + + Sbjct: 1 MRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQ----- 55 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K L II LTDG + K ++ + + + Sbjct: 56 -----------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEAKISRSLGASVYCVGVLD 103 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLMVHRKYSVILKG 420 Q L+ S E + V +L + +I ++ + S + G Sbjct: 104 --FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSVCVG 155 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/265 (12%), Positives = 66/265 (24%), Gaps = 30/265 (11%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 V + + + +L ++ Q +PA + C Sbjct: 255 AVPKNVVFVIDTSASMLGKKMRQVRAGPLPAS-----RPGTPTFPPKSPRRAERCSCQSG 309 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + ++T K ALL L + + + + + Sbjct: 310 AGTSAKRDYFGPPCGRKT-----KEALLTILGDLRPAD-RFNFISFSSRIRVWQPGRLVP 363 Query: 269 PSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + VR + TD A++ +L + Sbjct: 364 ATP--SAVRDAKKFVVMLPTSGGGTDIDGAIQTGSSLLRD-----------HLSGRDAGP 410 Query: 328 PFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 IIFLTDG+ + I A + I TI + + + L + + + Sbjct: 411 NSVSLIIFLTDGQPTVGEVRPGAILGNARAAVRDKFCIFTIGMGDDVDYRLLERMALDNC 470 Query: 387 EYHYNVVNADS----LIHVFQNISQ 407 + L + I Sbjct: 471 GMMRRIPEEADASSMLKGFYDEIGT 495 >gi|332283431|ref|YP_004415342.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] gi|330427384|gb|AEC18718.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] Length = 585 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 22/205 (10%), Positives = 53/205 (25%), Gaps = 30/205 (14%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK--EDVYMGLIGYTTRVEKNIEP 269 D + + Q K+ + + + S +++ + + Sbjct: 31 YDASGSMWGQVDGVNKIVTARKVMGELVKSWPENTNLGLIAYGHRSAGSCSDIETMIEPQ 90 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + V T + ++KQA +L Sbjct: 91 RVDRDAFIKTVNAIT---PKGKTPISASLKQAADVLQYR-------------------DH 128 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI--VTISINASPNGQRLLK-TCVSSP 386 ++ ++DG + + + KE + + + G L ++ Sbjct: 129 NATVVLISDGLESCHGDPC---AVAAELKEKGVDFKAHVVGFDLDQEGNEALSCIAKNTG 185 Query: 387 EYHYNVVNADSLIHVFQNISQLMVH 411 NAD L Q + +V Sbjct: 186 GIFVPASNADELQDALQQVQAKVVQ 210 >gi|159898662|ref|YP_001544909.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159891701|gb|ABX04781.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 610 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 42/368 (11%), Positives = 102/368 (27%), Gaps = 65/368 (17%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKR-FIKNHIKESLSGYSAVFYNTEIQNI 121 A +V + N+A + + KN+ +T +I Sbjct: 127 AAGKPLVDTWELPTQPIDPNPNYAYEQDQEIFDSMYFKNYGTNPFVRTETDPLSTFAMDI 186 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMT-----------SYDYRLQFIEHLLNQRYN 170 ++S M N L +++ + + + N Sbjct: 187 DSASYSLMRSSINQGLLPPADSVRVEEYLNAFDYEYPQPEDGDFAIYSEVAPS-PFGGPN 245 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++V IE+ +R + V+D SGSM ++ Sbjct: 246 YELVQIGIQARSIEVADRKPAALTFVIDTSGSMAQD-------------------NRLEM 286 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +KNAL+ ++ + ++ + + + P+ G + +T Sbjct: 287 VKNALIYLAGQLEPDDS------LAIVAFNDGMRVVLNPTSGENQ-MDIITAINSLEPAG 339 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVN 349 T++ + Y+ + I+ +DG N+ Sbjct: 340 STNAEAGL---YKGFELAWQAFKPEGI-------------NRILLCSDGVANSGMTEPSQ 383 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY---HYNVVNA------DSLIH 400 + + + +++ T + LL+ + +++ + + L Sbjct: 384 LLATFQQYLDAGVQLSTYGVGMGNYNDILLEQLADKGDGNYAYFDSADEAQRLFGEQLTG 443 Query: 401 VFQNISQL 408 Q I + Sbjct: 444 SLQTIGRE 451 >gi|34481888|emb|CAE46492.1| trap [Plasmodium falciparum] Length = 331 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNTVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVMLTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLAGCHPSDGK 211 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/270 (13%), Positives = 80/270 (29%), Gaps = 61/270 (22%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + I + I + LV+DLSGSM QD + +++ A + Sbjct: 79 IFAIARPQAVMTLPSRIKTVILVIDLSGSMRA---------------QDVRPSRIRAAQQ 123 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 A + LD V +G++ PS + V + T Sbjct: 124 AARVLLD------AQPAGVSVGVVAMAGTAALAQAPSHSKDDVATAIEGLK---PQGGTA 174 Query: 294 STPAMKQAYQILTSDKKRS-------FFTNFFRQGVK------------IPSLPFQKFII 334 + A L + K P I+ Sbjct: 175 LGNGLLIALTTLLPQTTNDAERLMNGGDVAQPGKPGKAAPGELDNGEPVRPGSYASGAIV 234 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------------NGQRLL 379 +DGE+N+ ++ A +++ T+ + + + + L Sbjct: 235 LFSDGESNSGPG---AVQAAQLAATYGVRVYTVGVGTTEGVVLSADGWSARVRLDEKVLK 291 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + ++ ++ + + +L V++ ++ + Sbjct: 292 QVADTTGAEYFRLEDTAALKKVYRALNTRL 321 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 59/179 (32%), Gaps = 25/179 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 ++ L + + +G++G+ + + + TE R T+ ++ A Sbjct: 410 LNALGQLGDSTSVGVVGFNRQAYEVVGLEQLTENRDTTRQRIRQLRAGGGTNIANGLRGA 469 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++L + +I ++DG + ++ V + + Sbjct: 470 EEMLDGQRGT---------------------VILISDGVDARSRATV----VAESLGRRG 504 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 ++++T+ N L + S ++ D L +F + +++ G Sbjct: 505 VRVITVGAGQRVNEPLLEQIADISGGTYFQANETDRLRILFGGSGRQFDGEGLTIVDSG 563 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 89/381 (23%), Gaps = 68/381 (17%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSN-----LSRLGDRFESISNHAKRALIDDAKRFIK 99 ++ A A ++ A + + + + N A F Sbjct: 81 RTMSAESRAYIAQPTASISAAPALNGDWPGAVPPERNRFEKQVQNGIMVAGEIPVSTFAI 140 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + S + + + M + + S L Sbjct: 141 DVDTGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPGKNDA-----PFSVTTELA 195 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + ++ E + ++D+SGSM Sbjct: 196 PSP----YNDDMMLLRIGLKGYEQSKAELGASNLVFLLDVSGSMA--------------- 236 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ L+ AL + +D + ++ Y Q Sbjct: 237 ----SPDKLPLLQTALKMLTQQLDAQD------KVSIVVYAGAAGVV--LDGAAGNDTQT 284 Query: 280 VT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ ++ AYQ+ +I TD Sbjct: 285 LNYALEQLSAGGSTNGAQGIQLAYQLAQKHFVEGGIN----------------RVILATD 328 Query: 339 GENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNAD 396 G+ N +N I + K+ I + T+ L++ + + + + Sbjct: 329 GDFNVGTTNLDELIDLVSARKQQGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDSIN 388 Query: 397 S--------LIHVFQNISQLM 409 L I++ + Sbjct: 389 EARKVLVEHLSATLLTIAKEV 409 >gi|297463635|ref|XP_002702824.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] Length = 2933 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ A Sbjct: 69 SGAAGAQGVRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + + K + Sbjct: 129 RVFLPQL---------------ARPGVPKVCILITDGKSQD-----MVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|297488708|ref|XP_002697097.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] gi|296474920|gb|DAA17035.1| collagen, type VII, alpha 1 [Bos taurus] Length = 2932 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ + S T + A+ A Sbjct: 69 SGAAGAQGVRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + + K + Sbjct: 129 RVFLPQL---------------ARPGVPKVCILITDGKSQD-----MVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|170672288|gb|ACB29772.1| matrilin-3 alternative transcript [Homo sapiens] Length = 444 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 90 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 146 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 147 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTVEAGAREPSSNIPKVAIIV 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 195 TDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMAS--LKMMASEPLEEHVFYVET 247 Query: 395 ADSLIH 400 + Sbjct: 248 YGVIEK 253 >gi|158255148|dbj|BAF83545.1| unnamed protein product [Homo sapiens] Length = 486 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 90 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 146 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 147 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTVEAGAREPSSNIPKVAIIV 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 195 TDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMAS--LKMMASEPLEEHVFYVET 247 Query: 395 ADSLIH 400 + Sbjct: 248 YGVIEK 253 >gi|11321565|ref|NP_002372.1| matrilin-3 precursor [Homo sapiens] gi|14548113|sp|O15232|MATN3_HUMAN RecName: Full=Matrilin-3; Flags: Precursor gi|3647275|emb|CAA12110.1| matrilin-3 [Homo sapiens] gi|62630192|gb|AAX88937.1| unknown [Homo sapiens] gi|119621242|gb|EAX00837.1| matrilin 3 [Homo sapiens] gi|146218451|gb|AAI39908.1| Matrilin 3 [Homo sapiens] Length = 486 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 90 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 146 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 147 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTVEAGAREPSSNIPKVAIIV 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 195 TDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMAS--LKMMASEPLEEHVFYVET 247 Query: 395 ADSLIH 400 + Sbjct: 248 YGVIEK 253 >gi|149918084|ref|ZP_01906577.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] gi|149821089|gb|EDM80495.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] Length = 719 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/401 (8%), Positives = 87/401 (21%), Gaps = 53/401 (13%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A A A+ + L D + N + D A S + Sbjct: 160 ALERAGAEIAATKADHPEDLADPARAYINI---LVTDGAWTGTDGTTVMSPANQDPAITA 216 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY----RLQFIEHLLNQRYNQ 171 +E+ + + + N + + + + D ++ + Sbjct: 217 SELFDNHDVPTFVIALAGNPDAELAADETAAAGGTTAATDADTPALFDKALSMV---ADD 273 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS---DPEDVNSAPICQDKKRTKM 228 + + + L L+ P D A + + Sbjct: 274 LTDAVLGPACIGGQPRVMVLLDASSSMLNVDGGQTWGPQGQTPWDQARAALTGVGGPFDL 333 Query: 229 AALKNALLLF----LDSIDLLSHV------------KEDVYMGLIGYTTRVEKNI----- 267 AL L +++ L ++ E Sbjct: 334 DLGVGALEDLSLFGLAVFGDDEPGEEKIMAQYGPCLRDNFEWALDPESSCAEPTCSDPWG 393 Query: 268 --EPSWGTEKVRQY-----VTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFT 315 W + Q V + + + S A +++ +++ + Sbjct: 394 GPPIGWSFQDGSQTEPPGFVHATLSHMPRCEGSGACSGSGSATHLGLELIAANRSQYHLD 453 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I +TDG + ++ + + + + Sbjct: 454 GQAPDAEFPTHAQTPYVNILITDGAYAAYSTDAQVQAALEAMFNEGVTTHVVGFGEGADT 513 Query: 376 QRLL----KTCV---SSPEYHYNVVNADSLIHVFQNISQLM 409 + L Y+V L I+ + Sbjct: 514 PQALIELSAMAAWGSGGEGAPYHVDTQQELQSALAQIAASI 554 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 15/234 (6%), Positives = 51/234 (21%), Gaps = 23/234 (9%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 + + + + ++ + M Sbjct: 28 EAECSSKKPNVMLIVDYSTSMNEVWDADNQLTRWEVTVAAMQEATAPGSFLSQHTHLALM 87 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 A + P + + + ++ + + Sbjct: 88 RFAHDPAP--------GASGTMIAVDVDQTIVDGQAIDVEWDDADATYLPCNGQAIGDAL 139 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 P+ G ++ T + A+++A + + K Sbjct: 140 AAISPPAGG-------------AVFGHGTWTKGALERAGAEIAATKADHPEDLADPA-RA 185 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQ 376 ++ TDG +N + + + + I++ +P+ + Sbjct: 186 YINILVTDGAWTGTDGTTVMSPANQDPAITASELFDNHDVPTFVIALAGNPDAE 239 >gi|332665830|ref|YP_004448618.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332334644|gb|AEE51745.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 630 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 64/217 (29%), Gaps = 44/217 (20%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + P + +VD+SGSM A K+ ++ + L + + Sbjct: 252 TPVENLPAANLVFLVDVSGSMSAA-------------------NKLPLVQASYKLLAEQL 292 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ Y +E + G K + T + AY Sbjct: 293 RPQD------RVAIVVYAGAAGLVLESTTGNNKTKIK-EAIDKLQAGGSTAGGEGILLAY 345 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENF 361 + + + +I +DG+ N S+ ++I ++ +++ Sbjct: 346 KTAKENFIKGGNN----------------RVILASDGDFNVGVSSDGELVRIIEEERKSG 389 Query: 362 IKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 + + + ++ S H + N D Sbjct: 390 VYLTILGYGMGNYKDNKMQKLADSGNGNHAYIDNLDE 426 >gi|125586596|gb|EAZ27260.1| hypothetical protein OsJ_11197 [Oryza sativa Japonica Group] Length = 540 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 54/185 (29%), Gaps = 32/185 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTR 282 K+ +K ALL + + + + ++ ++ + + + R + Sbjct: 82 AGNKLDRMKAALLFVIRKLADVD------RLSIVTFSNDAARLCPLRFVAGDAARADLGA 135 Query: 283 DMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ + + + ++ ++DG+ Sbjct: 136 LVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAV---------------NVMLMSDGQQ 180 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 N + + + T + A + L S + V + +L Sbjct: 181 NRGDATRLDPG--------GVPVHTFGLGADHDPAVLQAIAGKSREGMFHYVADGVNLTA 232 Query: 401 VFQNI 405 F + Sbjct: 233 PFSQL 237 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K S + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 66/222 (29%), Gaps = 55/222 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + QD + +++ A K +L LD + E +G++ + + + Sbjct: 95 DGSRSMAAQDVRPSRIDASKRMVLALLDRL-------EGNQVGMLMFGSSSYVQFPLTSD 147 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R V L L TD + + + + + Sbjct: 148 LAAARSLVEPINPRGLSLGGTDVEEVITEGLRSFPIGQIEG------------------R 189 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDG +++ +S+ + +A + + I TI + Sbjct: 190 TMILITDGGDSDEQSDGEAVAAAREAAKMGLTIHTIGMATEAGGQIPIYDDLGNISYVED 249 Query: 374 ---------NGQRLLKTCVSSPEYHYNVVNAD--SLIHVFQN 404 N L + ++ +++ D L Sbjct: 250 QGQRVISKLNRPLLEQIASATGGTYFDGSTLDLNQLQTALDQ 291 >gi|291245103|ref|XP_002742431.1| PREDICTED: complement component factor B/C2-like [Saccoglossus kowalevskii] Length = 782 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 67/241 (27%), Gaps = 52/241 (21%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 R+ +G + D S S+ + + + + Sbjct: 283 GRLSVGGSLGLDLVFAFDCSSSIDPVDFNRGMIFSRHIVREFGVS--------------- 327 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + I + + + + + AMK Sbjct: 328 ------YEPGGTRVAAISFASVATLEFNLGDDIVNTTEKAIEQLQVIQASGG---GTAMK 378 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A++I+ F + V +K + +TDG++N+ I + ++ Sbjct: 379 EAFEIM------------FSEVVPRLRPESKKAMFIITDGKSNSGS----PISYAQRLRD 422 Query: 360 NF---IKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 +I I I + LK S P + + + + L + ++ K Sbjct: 423 RNEHAFEIFAIGIGNGVDRNE-LKKIASEPFTSHVFLIRQYEDLTTL-----TDIISEKR 476 Query: 415 S 415 + Sbjct: 477 T 477 >gi|213625177|gb|AAI69984.1| Complement factor B [Xenopus laevis] Length = 747 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/262 (12%), Positives = 86/262 (32%), Gaps = 48/262 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + + + + + I +V+D S S+ Sbjct: 211 FSSSMLENVDTTNLED--RSDRSVRILKDGLMNIFIVLDTSKSV---------------- 252 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEK 275 + + K+A +LF++ + +I Y ++ + S + Sbjct: 253 ----GQNRFDEAKSASILFIEKMSNYDIKPRY---CIISYASKAISVVSLRDPDSNNADA 305 Query: 276 VRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ Y+ L + + R+G K + I Sbjct: 306 VMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQE-----LAYEREGKKEDFMKIHNVI 360 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK----------IVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ + + I+ + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKLIKRFLDVGIRKDNPREEYLDVYVFGLGSDIDQPEINDLAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQ 403 + + +++ N D + F+ Sbjct: 421 KKEKEVHTFHLQNVDKMKEFFE 442 >gi|326435586|gb|EGD81156.1| hypothetical protein PTSG_11196 [Salpingoeca sp. ATCC 50818] Length = 1445 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/352 (9%), Positives = 92/352 (26%), Gaps = 35/352 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + V + ++ ++ + + L + + ++ Sbjct: 40 VSGTAVDYIDLGTAQYNALIDFEEFYLNSTGTSSPVTFASSTFRLMTEYLAEKLTALALD 99 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + ++ D+ R L N + S + Sbjct: 100 TPDTIVIVQGFAPRPTNPTRPPELTDIGRGLHMRYLNNGSLTNFLSDLANRSIEAGFDFV 159 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + ++ V +++ D + + + + + + F Sbjct: 160 AIPGQGSYVNVSVPNVNCGRSAVDLLFILDGSGSIGSSNFETMR--QFTATVTSF----- 212 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQA 301 + + L+ Y++ V + + S+ R + + T + A+ A Sbjct: 213 -FDVSPDTTRVALMVYSSSVTEIFDFSYVLSNTRDEIITTIRNTNYPGGGTRTGSALDYA 271 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + +G + S + I + DG++ + + + + + Sbjct: 272 RTNMF----------LTSRGARPSSAGVPRVAIVIIDGQSGDSVA-----QPAENLRNEN 316 Query: 362 IKIVTISI-NASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 + I I I A N L S P + F + + Sbjct: 317 VNIFAIGISGADVNE---LNMIASPPITNNVKFIDTFQ----AFSQLPAEIS 361 >gi|315613111|ref|ZP_07888021.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] gi|315314673|gb|EFU62715.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] Length = 863 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/301 (8%), Positives = 68/301 (22%), Gaps = 29/301 (9%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ---KIVSFIPALLRIEMGERPIFL 192 LD ++ + NQ + +S I AL G Sbjct: 62 SLDITSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKG 121 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + +M +++ + Q K + + + Sbjct: 122 LIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFR--PYWPYERMTKYYDGVSPW 179 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKR 311 + + T+ + A L + Sbjct: 180 DDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAGLTLA-NQLMGSARS 238 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----------- 360 + ++ + + + N + D+ N Sbjct: 239 NAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPT 298 Query: 361 --FIKIVTISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +I S N + + S P ++ + D L F++I+ ++ Sbjct: 299 LDG--FYSIKFRYSNNVDSITSLQYYMRQHNASIPNEIFSANDEDQLRDSFKDITDKILP 356 Query: 412 R 412 Sbjct: 357 L 357 >gi|293365338|ref|ZP_06612055.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307703880|ref|ZP_07640821.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] gi|291316788|gb|EFE57224.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307622715|gb|EFO01711.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] Length = 863 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/301 (8%), Positives = 68/301 (22%), Gaps = 29/301 (9%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ---KIVSFIPALLRIEMGERPIFL 192 LD ++ + NQ + +S I AL G Sbjct: 62 SLDITSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKG 121 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + +M +++ + Q K + + + Sbjct: 122 LIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFR--PYWPYERMTKYYDGVSPW 179 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKR 311 + + T+ + A L + Sbjct: 180 DDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAGLTLA-NQLMGSARS 238 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----------- 360 + ++ + + + N + D+ N Sbjct: 239 NAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPT 298 Query: 361 --FIKIVTISINASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +I S N + + S P ++ + D L F++I+ ++ Sbjct: 299 LDG--FYSIKFRYSNNVDSITSLQYYMRQHNASIPNEIFSANDEDQLRDSFKDITDKILP 356 Query: 412 R 412 Sbjct: 357 L 357 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K S + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 67/219 (30%), Gaps = 51/219 (23%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + V+D SGSM + P +K +D + Sbjct: 306 SNNVRPLELAFVLDSSGSMRWSD----------PDNIRIVGSKH---------LVDRLKE 346 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +I + + + + ++ + TD + +A + Sbjct: 347 VDRG------AVIDFDSTAQLLQSLTDNKAVIKSALDLID---ASGGTDIGDGVSKALEE 397 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + + S + ++ LTDG + + + + I++ Sbjct: 398 FANARSASDWA-----------------VVLLTDGSGSYNHALT------TELVQKNIRV 434 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + I++ + N + S+ + +V AD LI VF+ Sbjct: 435 LGITMGSGANQSLIRGISDSTYGIYQHVNTADELIEVFE 473 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 19/217 (8%), Positives = 47/217 (21%), Gaps = 14/217 (6%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L + + + L+ +I+ N Sbjct: 311 PLELAFVLDSSGSMRWSDPDNIRIVGSKHLVDRLKEVDRGAVIDFDSTAQLLQSLTDNKA 370 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK----- 265 I + AL F ++ + G Y + Sbjct: 371 VIKSALDLIDASGGTDIGDGVSKALEEFANARSASDWAVVLLTDGSGSYNHALTTELVQK 430 Query: 266 ---NIEPSWGTEKVRQYVTRDMDS--LILKPTDSTPAMKQAYQILT----SDKKRSFFTN 316 + + G+ + + DS I + ++ + + ++ + + Sbjct: 431 NIRVLGITMGSGANQSLIRGISDSTYGIYQHVNTADELIEVFERFSTISGDTLADTDQDG 490 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 F TD N + + I Sbjct: 491 LTDCQEIQGIRVANSTDTFFTDPNNPDTDGDGLLDGI 527 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/251 (13%), Positives = 69/251 (27%), Gaps = 49/251 (19%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + IV+ +L + + +V+D SGSM Sbjct: 10 SIRKALASAFIVALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGG------------ 57 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-------IGYTTRVEKNIEPSWGTEK 275 +T++ A K A ++ I + V Y G + + + Sbjct: 58 --QTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMGCQDVTILG-GPSRGN 114 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T A+ L I+ Sbjct: 115 ADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN----------------------IVL 152 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVS-SPEYHYNV 392 ++DG N +V ++ + ++ I I TI +N P + L+ + + Sbjct: 153 VSDGIANCTPPDVC--EVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADA 210 Query: 393 VNADSLIHVFQ 403 +A SL Sbjct: 211 SDAQSLTDALT 221 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 64/222 (28%), Gaps = 62/222 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++ A K L L + +G I ++ + + Sbjct: 90 DTSRSMMAEDVYPNRLEAAKRTLENLLQGLK-------GDRIGFIPFSDSAYIQMPLTDD 142 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ + + T+ A++ A + + K Sbjct: 143 YSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN------------------K 184 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II L+DG + + K K+N + + +I I Sbjct: 185 TIIILSDGGDFD-------EKSLKFVKDNKMNVFSIGIGTEEGTIIPEYVNGKKVGFIKD 237 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQN 404 N L K S +Y V N D + F++ Sbjct: 238 QNGSAVISKLNSDFLKKLSSESDGKYYEVNNLKDDSSNFFKD 279 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/251 (13%), Positives = 69/251 (27%), Gaps = 49/251 (19%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + IV+ +L + + +V+D SGSM Sbjct: 10 SIRKALASAFIVALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGG------------ 57 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-------IGYTTRVEKNIEPSWGTEK 275 +T++ A K A ++ I + V Y G + + + Sbjct: 58 --QTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMGCQDVTILG-GPSRGN 114 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T A+ L I+ Sbjct: 115 ADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN----------------------IVL 152 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVS-SPEYHYNV 392 ++DG N +V ++ + ++ I I TI +N P + L+ + + Sbjct: 153 VSDGIANCTPPDVC--EVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADA 210 Query: 393 VNADSLIHVFQ 403 +A SL Sbjct: 211 SDAQSLTDALT 221 >gi|308048514|ref|YP_003912080.1| type IV pilin biogenesis protein [Ferrimonas balearica DSM 9799] gi|307630704|gb|ADN75006.1| type IV pilin biogenesis protein, putative [Ferrimonas balearica DSM 9799] Length = 1183 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/283 (9%), Positives = 62/283 (21%), Gaps = 49/283 (17%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ER + + ++ +R ++ + Sbjct: 208 SPSASLSERERAADDARNSTQFGQNDVVTLFTENYLTYLHHHSGSVQRQRITIARETTNS 267 Query: 238 FLDSIDLLSH-----VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKP 291 +++ + + IG TE R V Sbjct: 268 LINTTTGVDFGLMVFNRNRNSGNTIGSDDGGRIVEGIREMTESNRADLVNTVSSLEATSD 327 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS-------------LPFQKFIIFLTD 338 T ++ +AY+ + S +++ +TD Sbjct: 328 TSLCESLFEAYRYFSGGAVLGGNKGGALLPAADDSVVSGDNYISPLSSCQANSYVLLMTD 387 Query: 339 GENNNFKSNVNTIK------ICDKAKE----------------------NFIKIVTISIN 370 G S I+ DK + T+ + Sbjct: 388 GSPYRDNSLNTLIESELGLTAADKHNGSHLPGVAEWMYQNDMNTSATGHQKVVTYTVGFS 447 Query: 371 ASP-NGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + LL+ +Y +A +L Q I ++ Sbjct: 448 QGAVDAAELLEQTAHRGGGLYYPANDAAALQSSLQQIVSEILS 490 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 70/204 (34%), Gaps = 29/204 (14%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 A + T+ ++A F+ + ++E M LI ++ V++ + + ++ Sbjct: 82 AIDTSESVITQFQTERDAAKRFVKQM-----LREQDEMDLISFSDTVDEIVPFTNDAGRM 136 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T A+ A Q LT K+ +K ++ + Sbjct: 137 NA---GIGNLHKGDATSLYDAIYLASQRLTEAKR---------------DATRRKILVIV 178 Query: 337 TDGENNN-FKSNVNTIKICDKAKENFIKIVTISINASPNGQR-----LLKTCVSSPEYHY 390 TDG N ++ ++A I+ + I A L++ + ++ Sbjct: 179 TDGGNTTKGMRYQQAVEAAERAGAAIYPIIMVPIEADAGRNTGGEHALIQMAQDTGGKYF 238 Query: 391 NVVNADSLIHVFQNISQLMVHRKY 414 V++ L F ++S + + Sbjct: 239 YVLDKHDLDKAFAHLSDDLRTQYL 262 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 56/185 (30%), Gaps = 23/185 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + N + + D V + I Y+T ++ + ++R Sbjct: 6 ISRSGSVNNNWMDIYNMVEDVVKKFD-----NPKVRISFITYSTDGHTLMKITSDKNEIR 60 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + + + T +++A + + + I+ LT Sbjct: 61 ENLAKLQNVVPSGATHMQEGLRKANEQIEQENAGEKKAPI--------------VILALT 106 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-AD 396 DG T ++++ + I + + L SP++ + V N Sbjct: 107 DG-TLLPFPFEETKMEAEESRRLGATVYCIGVKD--YRKDQLLDIADSPDHMFGVDNGFK 163 Query: 397 SLIHV 401 L ++ Sbjct: 164 GLQNI 168 >gi|218512349|ref|ZP_03509189.1| hypothetical protein Retl8_00989 [Rhizobium etli 8C-3] Length = 222 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 24/232 (10%), Positives = 59/232 (25%), Gaps = 18/232 (7%) Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 + + + + +L I Sbjct: 2 GSAVKGPATSYLNVYIVIDRSPSMLL-------------AATTSGQSTMYSGIGCQFACH 48 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 + N + + + +A+ LD ID E + +GL Sbjct: 49 TGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESDSNHERIKVGLYSLG 108 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLIL----KPTDSTPAMKQAYQILTSDKKRSFFTN 316 ++ + P+ T R+ ++ D L T A+ +I+ + + N Sbjct: 109 DTTKEVLAPTLDTSNARKRLSDDSYGLTSATSMNYTYFDVALAALQKIVGTGGDGTSSAN 168 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + + + + + N + +N C K + + Sbjct: 169 PLKLVLLLTDGVQSQRGWVVKNSSNLKKVAPLNP-DWCGYVKNKSATMAVLY 219 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 73/223 (32%), Gaps = 46/223 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N P+ ++ + LV+D SGSM Sbjct: 283 NSPPSNLNDAQIEPSFDLVQASTGDECRVVLVLDTSGSM-------------------GT 323 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 ++ + +A F++ +D + +G++ +T + T+ R + Sbjct: 324 SNRIDKVNSAATAFVNLVDDG------ISIGIVTFTGSPTTRHALTQINTQADRDSLRDI 377 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T ++Q ++L + S I+ +TDG+++ Sbjct: 378 FQLTASGGTCIGCGLEQGLEVLMAHPSGSADGG---------------IIVLMTDGQDSG 422 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 ++++ I ++ +++ T++I G+ L + Sbjct: 423 IQNHI----IRQTLQDMGVRVNTVAIGEDAYGELSL-IAQETG 460 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 64/210 (30%), Gaps = 35/210 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 ++ + A FL I ++ + G++ + + E ++ Sbjct: 314 VSGSMGSYDRLNRMNQAAKFFLQQI-----LESRSWAGMVHFHSSATVKSELIQINSDVE 368 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + + T ++ A+Q+ + ++ + I+ L Sbjct: 369 RNQLLETLPTSASGGTSICSGIRTAFQVFKNKGYQTGGND----------------ILLL 412 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DGE + D+ K++ + I++ Q + + + Sbjct: 413 SDGE------DSTAKDCLDEVKDSGAVVHFIALGK-AFDQSISNMANVTGGKQLFATDEA 465 Query: 397 S---LIHVFQNISQL---MVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 466 QNNGLIDAFGALASENADVTEKSLQLESKG 495 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 67/223 (30%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D ++ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDK-------MQNDKVGMIVFAGDAFTQLPITND 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGAAISLATRSFTP------------------QEGIGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + A E I++ + + Sbjct: 192 AIIVITDGENHEG----GVAEAAKTATEKGIQVNVLGVGMPDGAPIPVEGTNDYRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + + + V N +S ISQ M Sbjct: 248 NVIVTRLNEEMCQEIAKAGNGIYVRVDNTNSAQKA---ISQEM 287 >gi|3273251|dbj|BAA31169.1| thrombospondin-related protein [Plasmodium falciparum] Length = 574 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 50/158 (31%), Gaps = 20/158 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQ 300 ++ L ++ E+ +GL+ Y+ V N+ + Y + T + Sbjct: 402 LNSLQYINEENQIGLVSYSDDVTINVPIDTMNSTQKSYFTSAIKGLTPSGGTATYDGTLV 461 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++ + + I L+DG+ N I K Sbjct: 462 AVKMILDKM--------------KENPGARPVIFVLSDGQTNGGYEFERVEPI---IKAL 504 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I + TI + + + L K + + N D + Sbjct: 505 GITVNTIG--YNADLKELTKLSKINESVSIDASNDDVI 540 >gi|311253076|ref|XP_003125384.1| PREDICTED: matrilin-3-like [Sus scrofa] Length = 488 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 66/184 (35%), Gaps = 22/184 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + + +K + +D++ ED + ++ Y + V+ T +Q Sbjct: 92 SSRSVRPLEFTKVKTFVSRIIDNL---DIGAEDTRVAVVNYASTVKIEFHLQ--THSDKQ 146 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R + + T + + A Q + G + P+ K I +TD Sbjct: 147 ALKRAVARIAPLSTGTMSGL--AIQTAMDEAFTV------EAGARGPNSNIPKVAIIVTD 198 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + + ++ +A+ + I++ + ++ + L+ S P E+ + V Sbjct: 199 GRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LRLMASEPLDEHVFYVETYG 251 Query: 397 SLIH 400 + Sbjct: 252 VIEK 255 >gi|29788808|gb|AAP03354.1| hypothetical protein [Oryza sativa Japonica Group] gi|108708692|gb|ABF96487.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 540 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 54/185 (29%), Gaps = 32/185 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTR 282 K+ +K ALL + + + + ++ ++ + + + R + Sbjct: 82 AGNKLDRMKAALLFVIRKLADVD------RLSIVTFSNDAARLCPLRFVAGDAARADLGA 135 Query: 283 DMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ + + + ++ ++DG+ Sbjct: 136 LVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAV---------------NVMLMSDGQQ 180 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 N + + + T + A + L S + V + +L Sbjct: 181 NRGDATRLDPG--------GVPVHTFGLGADHDPAVLQAIAGKSREGMFHYVADGVNLTA 232 Query: 401 VFQNI 405 F + Sbjct: 233 PFSQL 237 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 67/204 (32%), Gaps = 21/204 (10%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-TRVEKNIEPS 270 + +S + +T++ L + + +D + + + + +I + V+ S Sbjct: 22 DTSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGAGGVKLQTPLS 81 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + T A++ + + P + Sbjct: 82 ------KIESIQFSPLGTGGNTPLGMALELTRDYIQNK-------------DTFPGKFYT 122 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 +++ ++DGE N+ I +K + + ++ I + + SP Sbjct: 123 PYVVMVSDGEPNDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDFSGSPNQV 182 Query: 390 YNVVNADSLIHVFQNISQLMVHRK 413 Y + SLI+ F+ I+ + + Sbjct: 183 YYANDVQSLINCFKAITASVTQGR 206 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ L + + L+ Y + V K + T + Sbjct: 226 FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 285 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + + G + K I +TDG + ++ Sbjct: 286 AIKTAMEQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 330 Query: 357 AKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + + + LK S P ++ + V + Sbjct: 331 ARASGIEIYAVGV----DRAEMRSLKQMASQPLDDHVFYVETYGVIEK 374 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ L + + L+ Y + V K + T + Sbjct: 226 FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 285 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + + G + K I +TDG + ++ Sbjct: 286 AIKTAMEQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 330 Query: 357 AKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + + + LK S P ++ + V + Sbjct: 331 ARASGIEIYAVGV----DRAEMRSLKQMASQPLDDHVFYVETYGVIEK 374 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 27/179 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + + +T ++ + T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE++ I C Sbjct: 389 RGLEAGFQAITSSDQSTSGSE----------------IVLLTDGEDD-------LISSCF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQLMVHR 412 + K + I TI++ + L + Y + +SL+ F IS + Sbjct: 426 EVVKHSGAVIHTIALG-PKAARELETLSDMTGGLRFYANKDVNSLMDAFSGISSASGNL 483 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 66/212 (31%), Gaps = 46/212 (21%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPI--FLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 L +++ + ++ I VD+S SM + Sbjct: 60 SLQLLGVAAIVIALVNPKAGTKLETVKREGVDIVFAVDVSKSM---------------LA 104 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +D +M K + ++ + +G+I Y + + + + ++ Sbjct: 105 EDIAPNRMEKAKRLVSEIINELAS-------DRIGIIAYAAQAYPQLPITTDFGAAKMFL 157 Query: 281 TRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D L + T + A++ A + + + + ++DG Sbjct: 158 QGMNTDMLSSQGTAISDAIELATTYYNDAAQTN------------------RVLFIVSDG 199 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 E++ S + KA E IKI TI + Sbjct: 200 EDH---SEGGAVNAVSKATEAGIKIFTIGVGT 228 >gi|114764813|ref|ZP_01443995.1| hypothetical protein 1100011001322_R2601_10474 [Pelagibaca bermudensis HTCC2601] gi|114542699|gb|EAU45722.1| hypothetical protein R2601_10474 [Roseovarius sp. HTCC2601] Length = 178 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 23/56 (41%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 +++R ++ SE + + FAL + + + I V + +E A + Sbjct: 1 MIARIAKTLRRFRKSEDGSAVVPFALWMPMMVGIALSTIEVGAMTLRHTQLERALD 56 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 75/258 (29%), Gaps = 56/258 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + L + + Sbjct: 174 RKKLNLALVLDISGSMGSSFDEYY--------------------------------YDRF 201 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + VN +D +++K+ A++ LD ++ +GL+ + T E Sbjct: 202 GNHVAVND---TEDAEKSKIEIAAAAIVALLDHLEDDD------RLGLVLFNTGAELAEP 252 Query: 269 PSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S K Q + D+ + T + M+ A ++ + +++ Sbjct: 253 VSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMATELYD-------------EFLEVNQS 299 Query: 328 PFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 ++ IIFLTD N+ +++ + + + + I I N + + Sbjct: 300 EYENRIIFLTDAMPNSGQTSEESLLGMIEANANKNVYTTFIGIGVDFNTELVEYITKIRG 359 Query: 387 EYHYNVVNADSLIHVFQN 404 +Y+V +A + Sbjct: 360 ANYYSVHSATQFKERMDD 377 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 61/191 (31%), Gaps = 30/191 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K++ L ++S +G++ +++ P T + ++ Sbjct: 313 SGSMAGEKLSMAVAGALALVESAAPED------RLGVVVFSSGHRVLFPPRPMTAQAKKE 366 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + SL T A ++A ++L +K ++ LTD Sbjct: 367 AESLLLSLRAGGGTVLGGAFREAVRLLQ------------------GVPGERKAVLVLTD 408 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + K I D A+ + +++ +++ + L + Y + L Sbjct: 409 GLIADAKEP-----ILDLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQAPSPREL 463 Query: 399 IHVFQNISQLM 409 +F Q + Sbjct: 464 PRLFLREGQEV 474 >gi|326916310|ref|XP_003204451.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 2040 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 53/185 (28%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + Y+ + + + +K + Sbjct: 1257 DNFNKVVKFVFNTVGAFDLINPAGIQVSFVQYSDEAKSEFKLNTFDDKAQALGALQNVQY 1316 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1317 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RRGVPKVLVVVTDGRSQD---- 1359 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 K + + + + + A + L K S + + V + F+ I Sbjct: 1360 -EVRKAATVIQHSGFSVFVVGV-ADVDYNELAKIASKPSERHVFIVDDF----DAFEKIQ 1413 Query: 407 QLMVH 411 +V Sbjct: 1414 DNLVT 1418 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 3/55 (5%) Query: 7 FRFYFKKGI---ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 R ++ +E F+I+ AL + + ++GF I + + +K ++S + Sbjct: 9 VRVQSRRLRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQSVAD 63 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 58/175 (33%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQATGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGISS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKHVSSLIDAFSRISST 479 >gi|326434685|gb|EGD80255.1| hypothetical protein PTSG_10931 [Salpingoeca sp. ATCC 50818] Length = 706 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/378 (8%), Positives = 95/378 (25%), Gaps = 41/378 (10%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A A+ L+ SN ++ + +A + S + Sbjct: 15 ATMASALSPGQTSPSNTDEADVSGVAVDYIDIGSAAYEALNNFDTFVGTSHGIVFDKESH 74 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI----------EHLL 165 + + ++ + + V TS L Sbjct: 75 KLMTQYMAQLLQALDNAIGEDIAV-VRGFQSRPQVPTSAPSLLDVGRALRLRYKADGSAS 133 Query: 166 NQRYNQKI----VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + ++ + + + + + S + Sbjct: 134 SDARLSTLAQACINAGFDFVAVPADTTTDNYVYVSTPKMDCGDAVADLLFILDGSGSVGS 193 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 +T + + + F + + + Y + + + + +Q + Sbjct: 194 GNFQTMLNFAQEVVSFF-------DVAPDKTRVAAMVYDSSNYRKFDFDYIQSVSKQQLI 246 Query: 282 RDMDS--LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D+ T++ A+ A + +G + S + I +TDG Sbjct: 247 NYFDTFAYPDGGTETGSALSFALSSMF----------VTSRGARDLSEGVPRVAIVITDG 296 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSL 398 ++ + S ++ + + + I + + L + + + Sbjct: 297 KSGDDVS-----APAQALRDAGVTLYAVGI-SGADVSELNQIASPPVEDNVVFIDTFSEF 350 Query: 399 IHVFQNISQLMVHRKYSV 416 + IS+ + SV Sbjct: 351 SALASKISRANCDQPASV 368 >gi|309789848|ref|ZP_07684427.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308228152|gb|EFO81801.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 420 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 78/242 (32%), Gaps = 56/242 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + R + V+D SGSM K K+ ++ A+ L +D Sbjct: 35 AVMAQVRTPVNVSFVLDRSGSM--------------------KGDKIDRVRQAISLAVDR 74 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +D L+ + R E I + T++ R T PA+++ Sbjct: 75 LDAQDIAS------LVIFDHRNEVLIPAAPVTDRRMIK-DRVSRIRDAGGTKIAPAVEKG 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + D+ + + ++ LTDG+ N + + D A Sbjct: 128 LREIEKDRSGAI-----------------RRLVLLTDGQTEN--EDECLRR-ADDAGRIG 167 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN---------ISQLMVHR 412 + I + + N L++ S + AD + FQN I +++ Sbjct: 168 VPITALGVGQDWNEDLLIEMANRSGGTADYIARADEITEYFQNTVQRAQNSAIQNSILNL 227 Query: 413 KY 414 + Sbjct: 228 RL 229 >gi|221117277|ref|XP_002154725.1| PREDICTED: similar to HyTSR1 protein, partial [Hydra magnipapillata] Length = 3382 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 65/196 (33%), Gaps = 31/196 (15%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----------PS-WGTEKVRQYVTRD 283 + + + K+ V+ Y T + +I + +K +++ + Sbjct: 3013 INFIIKLALMFPISKDQVHFAYFPYGTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKM 3072 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + + A A + + + +K ++ I TDG N Sbjct: 3073 IILESRGFSYTWSAAITARETIFTTQKGM-------------RSNVKRVAILFTDGVYNG 3119 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 K+ I++V I I + N L+ SS + +N + F Sbjct: 3120 KHDTQKEW---QYVKDQGIQVVAIGIGSPINISN-LELWASSKNFVFNATTYEE-ADAF- 3173 Query: 404 NISQLMVHRKYSVILK 419 IS ++++ S ++K Sbjct: 3174 -ISSVILNEPSSTVVK 3188 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 54/188 (28%), Gaps = 28/188 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----------PS-WGTEKVRQYVTRD 283 + + + K+ V+ Y T + +I + +K +++ + Sbjct: 2754 INFIIKLALMFPISKDQVHFAYFPYGTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKM 2813 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + + A A + + + +K ++ I TDG N Sbjct: 2814 IILESRGFSYTWSAAITARETIFTTQKGM-------------RSNVKRVAILFTDGVYNG 2860 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 K+ I+++ + I + N L+ S+ ++ Sbjct: 2861 KHDTQKEW---QYVKDQGIQVIAVGIGSPINTDN-LELWASNKNSVFSATTYQEADAFIS 2916 Query: 404 NISQLMVH 411 I + Sbjct: 2917 TIGDNVCK 2924 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 12/116 (10%), Positives = 39/116 (33%), Gaps = 7/116 (6%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRD 283 TK+A LK A+ + ++ + +I +++ + + ++ RQ + Sbjct: 256 GTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAV 309 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ +++ +++ +R+ + + D Sbjct: 310 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYTTNHPDPSYKDA 365 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 12/116 (10%), Positives = 39/116 (33%), Gaps = 7/116 (6%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRD 283 TK+A LK A+ + ++ + +I +++ + + ++ RQ + Sbjct: 257 GTKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAV 310 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T+ +++ +++ +R+ + + D Sbjct: 311 NSLVANGGTNIVDGLRKGAKVMEDRLERNSVASIILLSDGRDTYTTNHPDPSYKDA 366 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 61/191 (31%), Gaps = 30/191 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K++ L ++S +G++ +++ P T + ++ Sbjct: 313 SGSMAGEKLSMAVAGALALVESAAPED------RLGVVVFSSGHRVLFPPRPMTAQAKKE 366 Query: 280 VTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + SL T A ++A ++L +K ++ LTD Sbjct: 367 AESLLLSLRAGGGTVLGGAFREAVRLL------------------HGVPGERKAVLVLTD 408 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + K I D A+ + +++ +++ + L + Y + L Sbjct: 409 GLIADAKEP-----ILDLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQAPSPREL 463 Query: 399 IHVFQNISQLM 409 +F Q + Sbjct: 464 PRLFLREGQEV 474 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 69/262 (26%), Gaps = 68/262 (25%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + I LV DLS SM D+ T+ L Sbjct: 77 RFAHDRTETQASGVDIMLVFDLSWSMMVLDMGG----------HDETGTRFGIASAVLED 126 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 F++ + + +GLI ++ + + + + + R +I + T Sbjct: 127 FVNK-------RPNDRIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIRELGTAIGD 179 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A A + L K + II LTDG+NN + Sbjct: 180 ATAAATKRLQMSKDSK-----------------SRIIILLTDGDNNQGEIEPVPAAQLAA 222 Query: 357 AKENFIKIVTISINA-------------------------------SPNGQRLLKTCVSS 385 A KI TI + N L + + Sbjct: 223 AIGA--KIYTIGLGIEEPSHLPAFDVDTGKFKHGPGGELIPTIMLQPANYSVLGQMSRLA 280 Query: 386 PEYHYNVVNADSLIHVFQNISQ 407 Y N L +++ I + Sbjct: 281 HGKFYRATNRRDLENIYNEIDR 302 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 68/187 (36%), Gaps = 33/187 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +A K A F+ DL +GLI + ++ + + + + + + R Sbjct: 103 GSALAEAKQAAQEFVRKSDLA-----HTAIGLIEFGSKAKIISGLTQNAKHLYKAINRLK 157 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ T + AY L ++ +FII LTDG N+ Sbjct: 158 ---TNGSTNMTEGLTTAYLKLK-------------------NVDDPRFIILLTDGLPNHP 195 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 K NT +I + + I+++TI + L + A +++ F Sbjct: 196 K---NTQQIAQEICADGIELITIGTG-DADKTYLQSLACYDQNSFFA--KAGTMVSTFSR 249 Query: 405 ISQLMVH 411 I+Q++ Sbjct: 250 IAQVLTE 256 >gi|148655977|ref|YP_001276182.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568087|gb|ABQ90232.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 420 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 70/224 (31%), Gaps = 47/224 (20%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ R + V+D SGSM K K+ ++ A + ++ Sbjct: 34 GQVMTQVRAPVNVCFVIDRSGSM--------------------KGEKIDRVRRATIRAIE 73 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +D V ++ + R E I + + + + R T PA++ Sbjct: 74 MLDAQDVVS------VVIFDHRTEVLIPATP-VTRPAELIDRINRVRDSGGTRIAPAIEA 126 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + +I LTDG+ N ++ A + Sbjct: 127 GLREIE-----------------KGPPQMVRRLILLTDGQTENESD---CLRRATDAGQR 166 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + + N L++ S + + ++ FQ+ Sbjct: 167 NVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPEKIVEYFQS 210 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 65/207 (31%), Gaps = 48/207 (23%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D +++ K + F+ + D LS + L+G+ + E + Sbjct: 106 DVWPSRLERAKLEIEDFVKNTDNLS-------VALVGFAGTSFVASPFTQDMETFSYILN 158 Query: 282 RDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S+ L+ T A+ A + +K II +TDGE Sbjct: 159 ELNTKSVTLQGTRIADALVTAKNTFNVNIPG------------------KKSIILITDGE 200 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINA------------------SPNGQRLLKTC 382 ++ + + K+N I + T+ + + S + LK Sbjct: 201 DHAGYFDNIL----KELKDNDISVYTVGVGSELGATIRSDIGYSENTVISKRDDKTLKLI 256 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 S + + SL +F +I M Sbjct: 257 ADSTGGKSYISDNISLKSIFDDIKNNM 283 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/232 (11%), Positives = 69/232 (29%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K++ ++ E P + ++D SGSM+ K Sbjct: 166 NRDHKLMMVGIQGKELQQQETPPSNLVFLIDSSGSMN-------------------SYDK 206 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++A + + +D + ++ Y ++ G+ + + Sbjct: 207 LPLVQSAFSMLAEQLDKND------RISIVTYAGSSAVLLDGEKGSNTDE-ILEQLYSIT 259 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ +K AY++ + +I TDG+ N S Sbjct: 260 ASGSTNGEGGIKTAYELAEEHFIKGGNN----------------RVILATDGDLNVGASS 303 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 ++ + ++N I + + ++ + + + D Sbjct: 304 EEELTRLIETKRDNGIYLSVLGFGEGNYKDARMEALADNGNGNFSYIDSEDE 355 >gi|3273253|dbj|BAA31170.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ L DG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILIDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|116623354|ref|YP_825510.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226516|gb|ABJ85225.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 328 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 53/184 (28%), Gaps = 33/184 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +M A+ FL + L+ ++ + + ++ +T Sbjct: 128 NRMDKSVAAIQQFLRTNMAGD------EYFLVRFSDKPTMVTTFTHNPAEIASELTLIK- 180 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T A+ Q + K + + LTDG +NN + Sbjct: 181 --PLGWTALHDAIYLGTQQMRKAKNSR------------------RALFVLTDGGDNNSR 220 Query: 346 SNVNTIKICDKAKENFIKIVTIS-INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +K + E+ +++ I + L + + + L + Sbjct: 221 YTEAEVK--NFVVESDVRVYAIGLFERP---KFLERLAALTGGEALWAKSLKDLPDAIER 275 Query: 405 ISQL 408 IS+ Sbjct: 276 ISRE 279 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 69/235 (29%), Gaps = 49/235 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + V+D SGSM T + +K A +D + Sbjct: 35 GVSAPLNVCFVLDRSGSMM--------------------GTPLQTVKQAASRIVDRL--- 71 Query: 246 SHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +I + + E I + + +++ + T +K + Sbjct: 72 ---SNRDRISIIAFDHKAEVLISNELASDPQAIKRRINSL---RAGGGTCIDDGLKAGIE 125 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L S K+ ++ LTDGE N N IK+ D A + Sbjct: 126 QLASGKEGYISQ-----------------LLLLTDGE-NEHGDNSRAIKLADVAIGYNLT 167 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + T+ N L + + + +A+ I F + M + Sbjct: 168 VNTLGFGDHWNQDVLEQIADAGGGSLSYIEHAEEAIATFGRLFTRMQSVSLTNAF 222 >gi|114586712|ref|XP_001158576.1| PREDICTED: alpha 1 type VII collagen isoform 1 [Pan troglodytes] Length = 2944 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|114586714|ref|XP_516439.2| PREDICTED: alpha 1 type VII collagen isoform 2 [Pan troglodytes] Length = 2912 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|308068884|ref|YP_003870489.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858163|gb|ADM69951.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 432 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 68/234 (29%), Gaps = 54/234 (23%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + + + LV+D SGSM ++ A K+ + Sbjct: 112 GRKSVQDLVLVIDNSGSMQQTD---------------PDNERLTAAKSLIGQM------- 149 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTE-----KVRQYVTRDMDSLILKPTDSTPAMKQ 300 D + ++ + + + + V + + T+ A+ + Sbjct: 150 ---DGDKRVAIVSFDSTAQLVQPFTPIRTDAEKQAVYSKIDSMQTIM-SGGTEIRLALDE 205 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + +I L+DG + I + Sbjct: 206 TIKEIE----------------TQGNAEKGSLVIMLSDGFSELDTQTALAPYIARQ---- 245 Query: 361 FIKIVTISIN-ASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 I + TI + A +G LL+ + + NV NA L F I + R Sbjct: 246 -IPVNTIGLKLAESDGIALLQNIADLTGGTYSNVANAQGLTQAFGKIYNKIGDR 298 >gi|224004082|ref|XP_002295692.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209585724|gb|ACI64409.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1007 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 69/228 (30%), Gaps = 12/228 (5%) Query: 190 IFLIELVVDLSGSMHCAMNS---DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 F + +D+SGS+ +++ KK + + + D + Sbjct: 766 DFSLCFAIDMSGSVCNNGRGECLGCSPLDTCNSVGVKKTMCCSNFLSLMEFTKDVTTSFN 825 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V D ++ + T+V I + ++ + + + T+ + + + Sbjct: 826 EVATDHDFSVVHFATKVNTAIGLVSSRQALK-VLNQLI--YTGGMTNLASGIASGQETFS 882 Query: 307 SDKKRS-FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S S + + + ++ N + AK IV Sbjct: 883 STSGTSKRKNMILLITDGKLFIVYNGKRCSMYQPSLPSYNPEGNAHEAATNAKSKGTVIV 942 Query: 366 TISINASPNGQR---LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + A+ + + ++ S+ + V + L + I + Sbjct: 943 PVLMEAASDNAQSLMFMQKISSNGNVY--VSDFKGLNDLKDTILSEIS 988 >gi|156741667|ref|YP_001431796.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232995|gb|ABU57778.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 826 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 84/339 (24%), Gaps = 88/339 (25%) Query: 129 MTHMANNRLDSSNNTIFY-------NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 N + + +++ + L + + Sbjct: 374 TGAPPTNNFTVGGSVTYTPRYQTYPDLNRPVQFLLILDVSGSMSWTFDGR---------- 423 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +G N V+ + + ++ K L F+ Sbjct: 424 ---------------GVQNGQTVFCTNPSQGCVSVQTAWPNAQERRIYTAKQVLRSFVAQ 468 Query: 242 IDLLSHV--KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ID + + L+ ++ R+ + S + + + L T+ ++ Sbjct: 469 IDQDRQSGLRPYDTVRLVTFSGRLGSFVNSSGAVGDNNRALNDLTEVLPAGWTNDRATLE 528 Query: 300 QAYQI--------LTSDKKRSFFTNFFRQG--------VKIPSLPFQKFIIFLTDGENN- 342 A + F R + +++ +IF+TDG N Sbjct: 529 AAINSAGMVDGDPYMTAGATPSAVAFARASQVFANAPERAPNGMKYRRVVIFVTDGVANV 588 Query: 343 -------------------------------NFKSNVNTIKICDKAKENFIK-----IVT 366 + + + KE +I+ + Sbjct: 589 LRNGMQNNYGEGCQLGAENVGCQMGDPLPDGSLRPLNAMVAEAQALKEAYIRPSDGSVYV 648 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +++ + L S P+Y ++ L +F +I Sbjct: 649 VAL-SGTFEATGLNLVASQPDYVKRADRSEELQQIFDDI 686 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 12/162 (7%), Positives = 38/162 (23%), Gaps = 24/162 (14%) Query: 12 KKGIASE-KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + + ++ A + + + + + + ++ ++ NAA LAG ++++S Sbjct: 3 RRLLHHRTEGQNIVLLAGILALLVGMAALAVDLGVTYAEQRNIVRGTNAASLAGMNRLIS 62 Query: 71 NLSRLGDRFESISNHAKRALI-----------------------DDAKRFIKNHIKESLS 107 + + + + S Sbjct: 63 GGRDADVALAIYESLRSNGIQVTVPGESPQPGDRAFEALYLGSDGAPIPGACSRVGACGS 122 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 +I N + L Sbjct: 123 QRPQGVKYIQIDLKGNVDTYFARLFGQSTLPVGATAYASVGA 164 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 18/174 (10%), Positives = 49/174 (28%), Gaps = 12/174 (6%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +A +D + +G++ +++ + ++K Sbjct: 44 ADDFVSAKSFISRVVDAFDIAADFTRVGVVQFSSFFTEEFPLDRYSDKASLKQAIGNIPQ 103 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + ++ K G + S + + +TDG ++ Sbjct: 104 RGGGTLLGQVINYLVNTSFTEAK----------GARPLSDGIPRIAVLMTDGSAHD-NPT 152 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 D + + I +I + S N + L+ + + V + + Sbjct: 153 TVLAPAIDALRASGIIAFSIGVGPSVN-RDQLEAVAGDTDRVFLVGAYSVIDDI 205 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 58/175 (33%), Gaps = 27/175 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQATGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGISS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 + + I TI++ + L + Y + SLI F IS Sbjct: 426 EAVSRSGAIIHTIALG-PSAARELETLSDMTGGLRFYANKHVSSLIDAFSRISST 479 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 59/151 (39%), Gaps = 25/151 (16%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + + + T+ Q +T ++ + T +K +Q +T + + Sbjct: 401 VGMVTFESTAKIQNNLTKITDDDTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNT 460 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + I+ LTDGE+N I C ++ K++ I TI++ Sbjct: 461 SGSE----------------IVLLTDGEDNG-------IHSCFEEVKQSGAIIHTIALGP 497 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 S + + + ++ Y + + LI F Sbjct: 498 SAAKELEILSSMTGGYRFYANKDINGLIDAF 528 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 27/191 (14%) Query: 227 KMAALKNALLLFLDSIDL----LSHVKEDVYMGLIGYTTRV-----EKNIEPSWGTEKVR 277 ++AAL+ AL + D + + + +I + +V +P+ G+ ++ Sbjct: 381 RLAALQQALTGLTGADDSLSGRFARFRAREQVTIITFNDKVTATRQFTVSDPTPGSADLK 440 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T A+ AY + I+ +T Sbjct: 441 AISDYGAALRAGGNTAIYSALDAAYTTAAAGM--------------KADPSALTSIVLMT 486 Query: 338 DGENNNFKSNVNTIKICDK--AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DGENN + + + ++ + + L + S+ ++ Sbjct: 487 DGENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFG-DADRAALTQIATSTGGAVFDATAP 545 Query: 396 D-SLIHVFQNI 405 SL VF+ I Sbjct: 546 GVSLSDVFREI 556 >gi|3273265|dbj|BAA31176.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273301|dbj|BAA31194.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273267|dbj|BAA31177.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVKHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273303|dbj|BAA31195.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|89094360|ref|ZP_01167301.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] gi|89081419|gb|EAR60650.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] Length = 707 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/281 (12%), Positives = 81/281 (28%), Gaps = 54/281 (19%) Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-----LNQRYNQKIVSFI 177 +S+ + + L T+ ++ + + L + ++ Sbjct: 241 SSTDEEGQPINTDSLSGPATTLDQDILMYWRHQPNLPASVDMLAYKESGNNKGTYKLTLT 300 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + V+D+SGSM + K AAL + Sbjct: 301 PGTDLPAFNQ--GRDWVFVLDISGSM---------------------KGKFAALVEGVRE 337 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L ++ + + L R V + + + T+ Sbjct: 338 GLSNLSP----NDRFRIVLFNNQARSFTQGYLPADKTTVENTLNQLDQIQPGQGTNLYAG 393 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ L SD+ + I+ +TDG N + K + Sbjct: 394 LQTGINQLDSDRSTA--------------------IVLVTDGVANVGT--THKSKFLNLL 431 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 ++ +++ T + S N L + S + ++ N+D + Sbjct: 432 EQKDVRLFTFIMGNSANRPLLEEMTRVSNGFAMSISNSDDI 472 >gi|297279075|ref|XP_001109489.2| PREDICTED: calcium-activated chloride channel regulator 4-like [Macaca mulatta] Length = 931 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 71/204 (34%), Gaps = 37/204 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRD 283 ++ + A FL L V+ ++G++ + + + R + Sbjct: 320 YDRLNQMNKAAKYFL-----LQIVENGSWVGMVHFDSTATIINKPIQIISSDERNTLLAW 374 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T +K A+Q++ ++ S ++ LTDGE+ Sbjct: 375 LPTYASGGTSICSGIKSAFQVI----------------GELSSHLDGSEVVLLTDGED-- 416 Query: 344 FKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LI 399 T C D+ K + + I++ + + + +++ + H+ + LI Sbjct: 417 -----YTASSCIDEVKRSGAIVHFIALGTAAD-KAVIEMSKITGGRHFYASDKAQNNGLI 470 Query: 400 HVFQNISQ---LMVHRKYSVILKG 420 F +++ + + + KG Sbjct: 471 DAFGDLTSGNTELSQKSLQLESKG 494 >gi|260834336|ref|XP_002612167.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] gi|229297541|gb|EEN68176.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] Length = 954 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 51/176 (28%), Gaps = 26/176 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +D ++ +G++ Y+ + +K + T + Sbjct: 202 QFVVDVVNSFDVSPTATRVGVVQYSNKNTLMFNLGDKVDKPSTVNAINSIQYQGGGTYTG 261 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K RQ K + LTDGE+ + S Sbjct: 262 YALKY-----------------VRQKAAWRGGNVPKVAVVLTDGESYDSVSV-----AAQ 299 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + +++ + + G +L+ S+ + + + L I++ + Sbjct: 300 NLLSDGVEVFAVGV----AGFKLIAIANSNETNVIELNDFNDLTTKIGEIAKKVCS 351 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/220 (10%), Positives = 70/220 (31%), Gaps = 49/220 (22%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYV 280 +++ +K L ++++ +GL+ +++ + ++ + Q + Sbjct: 123 PPSRLEKIKYELSNIINTLKS-------DRIGLVIFSSSAFMHCPLTYDKGALNLFTQIL 175 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 ++ + TD ++ + K + + + K ++ +DGE Sbjct: 176 NTNLMPIGNAGTDFYAPLELVLKKYQEANKSN----------RKQQNEYAKVVVLFSDGE 225 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN-----------------------ASPNGQR 377 + I D+ K+N I++ T+ + + + Sbjct: 226 EFGDRYT----AIVDQYKQNNIRVFTVGVGSLQGGKIPTSLGFKKDKKGKVVLSKLSTTS 281 Query: 378 LLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRKYSV 416 L + + V + + + I Q + +K V Sbjct: 282 LQTIAEQTNGRFFEVSETKNEIPELINTI-QEIKGQKLDV 320 >gi|296227292|ref|XP_002759309.1| PREDICTED: collagen alpha-1(XIV) chain [Callithrix jacchus] Length = 1796 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 54/184 (29%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L+ + + ++ +T + + K S Sbjct: 1046 DNFNKIINFLYSTVGALNKIGTDGTQVAMVQFTDDPRTEFKLNAYKTKETLLDAIKRISY 1105 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K L + + K I+ +TDG + + + Sbjct: 1106 KGGNTKTGKAIKYVRDTLFTAESG-------------TRRGIPKVIVVITDGRSQDDVN- 1151 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 KI + + + I + + + + + S + + V + F+ I Sbjct: 1152 ----KISKEMQSDGYSIFAVGVADADYSELVSIGSKPSARHVFFVDDF----DAFKKIED 1203 Query: 408 LMVH 411 ++ Sbjct: 1204 ELIT 1207 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|258624850|ref|ZP_05719778.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582848|gb|EEW07669.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 128 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 33/93 (35%), Gaps = 20/93 (21%) Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA------------------SPN 374 +I L+DG N + ++ + AK+ I T+ + A + Sbjct: 1 MILLSDGSNTAGVLD--PLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLD 58 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L ++ ++ N L ++ I+Q Sbjct: 59 EKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 91 >gi|118387578|ref|XP_001026893.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89308660|gb|EAS06648.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1074 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 44/402 (10%), Positives = 99/402 (24%), Gaps = 65/402 (16%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + + K + +L ++ + + + + + K Sbjct: 167 MDIAQLKQVKVQ------SIVLHINGNILPSDNTPLYKLVQDGDVVVAQTMQYVFNSNKA 220 Query: 101 HIKESLSGYSAV---------------FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 + S + N + + S M + + N + Sbjct: 221 NQSMHSSVRQSQSPINRNNQQQLQNLKLLNKSTNSSLQSPHKKMQASTYDPIQQQANRLS 280 Query: 146 YNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPALLRIEMGERPIFL------------ 192 + L + +L+ + E I Sbjct: 281 NKSQGNLNQSPVLSPMNSQILDNSSFNPNFISTDHQMLENQLEINIQSTQNYIQLFEQSQ 340 Query: 193 ---IELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--RTKMAALKNALLLFLDSIDLLSH 247 + + ++ G+ P + K+ LK LL +D +D Sbjct: 341 QIPVMISLNTKGNFDAKAYQRPPIDLICVMDNSGSMHGEKINMLKETLLYLIDQLDEKD- 399 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GL+ + + V S T + D TD M +A++ + + Sbjct: 400 -----RLGLVLFNSEVTFRPMKSMDTTNKLKLKQYISDIRAQGGTDINLGMTEAFKFIKT 454 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K + T+ F + S + + L + N S Sbjct: 455 RKYCNPVTSVFLLSDGLDSKAQDRVAVTLKNMSINEQFSINC-----------------F 497 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + + V D+L +F I Q + Sbjct: 498 GFGRDHDPILMNQIKKIDQVDMFFV---DALGGLFSVIGQDV 536 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 12/141 (8%), Positives = 38/141 (26%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + S + + + ++++ + + W+ ++A ++A +GA ++ Sbjct: 1 MRNLLRSRQGSAAFATVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRLSC 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 ++ +S+ AK + G + I Sbjct: 61 TMAGAACGTQSVDYLAKEFAAQNGFCNSSPQDSTPYPGTQCAPSLPNRISRAVQIDIGTY 120 Query: 131 HMANNRLDSSNNTIFYNMDVM 151 + V Sbjct: 121 TAGTFTTPPAGTGNAVRARVS 141 >gi|160712|gb|AAA29771.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ T A+ + + L Sbjct: 86 IHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLEVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 62/184 (33%), Gaps = 30/184 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K++ L + S + +G++ +++ P TE+ ++ + S Sbjct: 308 KLSLAVAGALELVRSAAEEDY------LGVVLFSSTHRVLFPPRPMTEQGKKEAESLLLS 361 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A ++A ++L +K I+ L+DG ++ + Sbjct: 362 VRAGGGTVLGSAFREAVRLLQ------------------GVPVERKGILVLSDGLISDPQ 403 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + +++ +++ A + L +Y A L +F Sbjct: 404 DPILALAEA-----SGLEVSAMALGADADRAFLKVLAERGGGRYYQAATAQELPRLFLKE 458 Query: 406 SQLM 409 Q + Sbjct: 459 GQEV 462 >gi|218296567|ref|ZP_03497295.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] gi|218243109|gb|EED09641.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] Length = 706 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 56/184 (30%), Gaps = 30/184 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+A L + S + +G++ +++ P T + ++ + S Sbjct: 323 KLAMAVAGALELVRSAAPEDY------LGVVLFSSSPRVLFPPRPMTAQGKKEAESLLLS 376 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T A ++A ++L +K ++ L+DG + K Sbjct: 377 LRAGGGTVLGGAFREALRLLQDV------------------PVERKALLVLSDGIIFDPK 418 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I A +++ +++ + L Y L +F Sbjct: 419 EP-----ILALAATAGVEVSALALGPDADAAFLEALAQRGGGRFYRAATPKELPRLFLKE 473 Query: 406 SQLM 409 Q + Sbjct: 474 GQEV 477 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 39/292 (13%), Positives = 77/292 (26%), Gaps = 37/292 (12%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 +V + + + ++ + L + E + ++ Sbjct: 53 QPEVKHNIMLFIDDSGSMDEYIGSKTRIQITKDALSKVLEE-------HGSTFNWALQTL 105 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNA-LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 NS D A +K + + + M + Y + Sbjct: 106 NNSGGSDTPDAKGFTIPATEIANRVKRIKAKGGTPTTSRYFELVSQIVMPNVKYRCQKSY 165 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA-----------------MKQAYQILTSD 308 + S G + + ++AY Sbjct: 166 VVLMSDGDANLSCGLDNTGYKWPKSDFKYGDLSRYSDYYRRNISREEPKYREAYNYFGYR 225 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFII--------FLTDGENNNFKS-NVNTIKICDKAKE 359 + S + F TDG + KS N + D +K+ Sbjct: 226 QAGSCQKRELGAYDTSWDIDDGLAFFSKKLAEQDFKTDGTDAAGKSWNGDPKDPADYSKQ 285 Query: 360 NFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 ++ T+ P G+R L+ S PEY+Y +SL VF +I + + Sbjct: 286 L-VQTFTVGFGQGITPTGKRYLQLAASRPEYYYEADKPESLSKVFNDIVEQI 336 >gi|295841331|dbj|BAJ07080.1| von Willebrand factor, type A [uncultured bacterium] Length = 334 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/266 (11%), Positives = 72/266 (27%), Gaps = 40/266 (15%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + L L+ + + + Sbjct: 102 QGYRPADFSIPLDGGGSPLSAENGVD-----------PSEPKTALQVPNASVIQVVRDVW 150 Query: 207 MNSDPEDVNSAPICQDKKRT--KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + + + K+A + AL FL+ I + +GLI + T V+ Sbjct: 151 WYTKRHTNVYLVVDTSGSMSGEKLAQAQEALSAFLEQIK-----GDRERVGLIEFETSVK 205 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 ++ T A+ +A+Q L K+ Sbjct: 206 PVVQLDELGNNRAALDLAVQRMEAAGDTALLDAVYEAHQRL----------------RKL 249 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRLLKT 381 ++ +TDG+ NN + K+ + E+ + + I+ + L Sbjct: 250 GDEERINAVVVMTDGQENNSWIS--LRKLVPQLAEDWPVPVVVFAIAYGDDADIATLRAI 307 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQ 407 + + +++ +++ +S Sbjct: 308 TEPTGGQV-REGDPETIRDLYKILST 332 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 + + +G++ Y+ + + + ++Q ++ T Sbjct: 25 TFLNNIVGQFDIGPTATQVGVVQYSWFIRQECALNAHSSLASLQQAISNITVL--GLGTH 82 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ A + K ++ +TDG + + + + Sbjct: 83 TGAALTFARNTALTAANG-------------ARPGVPKIVVVMTDGASEDDVTLPS---- 125 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + IS++ S RLL+ SP+ + + D+L + +S + Sbjct: 126 -QNLRNDGVITFAISVSWSLPNDRLLQDIAGSPDRIFAATDFDALDGIKVTLSSQLCE 182 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 57/180 (31%), Gaps = 20/180 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F + D + + ++ Y + + + + R I T + Sbjct: 196 YDDFEKAKDFIHTTMSNCNFAIVQYGSSIRTELSLLDNEDGARSLQKVKQIKQIYNLTKT 255 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + S + K II L+DG+ + ++ Sbjct: 256 ASAINHVLTDIFIPENGSKNNSE-------------KIIIVLSDGKILG--DPMTLDEVL 300 Query: 355 DKAKENFIKIVTISINASP----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +K + + +I + + + + P +Y+V + +L + ++ + ++ Sbjct: 301 NKTQMKGVTRYSIGVGDGILKNKDAIKEMTQIA-DPGKYYSVSSYGALNDILSSLERGII 359 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 50/187 (26%), Gaps = 27/187 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 SA + + +K K A I ++ + +G++ + ++ Sbjct: 399 DQSASMGPETGLSKFTMAKEAA------IMATESLRAEDRIGVLAFDVSTRWVVDFQPVG 452 Query: 274 E--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + R + TD A++ + Sbjct: 453 TGLSLADIQRRISTLPLGGGTDIYNALQTGL------------------PELARQPGRVR 494 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + + ++A+ I + TI+I + L +Y Sbjct: 495 HAVLLTDGR-SFTDDRQAYQALIEEARSRNITLSTIAIGTDADIDLLQTLARWGAGRYYF 553 Query: 392 VVNADSL 398 + Sbjct: 554 AAEPGDI 560 >gi|149919193|ref|ZP_01907676.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] gi|149819907|gb|EDM79329.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] Length = 718 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/397 (8%), Positives = 78/397 (19%), Gaps = 59/397 (14%) Query: 56 ANNAAI--LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A + A +A +V +L + ++ L D + + Sbjct: 163 ALDRAQAEVAATRALVPDLPGEQ-----VRSYVAVVLTDGLWTGADGTTALAPADQDPAI 217 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM---------------------- 151 T + + + +T + +++ + + + + Sbjct: 218 TATALLDEADVPTYVITIDGDAQVEEAADALAFAGGTGAAFHGGTPGLLDGAIYGLNQEL 277 Query: 152 -------------TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 L LN + VD Sbjct: 278 LHDHALPGCSRQHPRVMVLLDASSSTLNDFGGTVPGTMGETAWDEIRAGLTAPDSLFDVD 337 Query: 199 LSGSMHCAMNSD----PEDVNSAPICQDKKRTKM--AALKNALLLFLDSIDLLSHVKEDV 252 + ++ D L AL + ++ Sbjct: 338 EGLAPQDMLHVGLTIFGHDDAPEEQRVLASYGPCMRDNLAWALDPEVSCVEPGCDDPWGG 397 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 Y T D T + ++ ++ D + Sbjct: 398 PTIAWTYQDGSAGPPGFDLQTNSHMPRCEGPADFCWGSGTFTHRGLET-VRLHQIDYAAA 456 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS---- 368 T+ + I +TDG + + S+ + I I Sbjct: 457 SQTDDALYPTAPDTPY---INILITDGMYSGYSSDAQVQSELEAMFNAGITTYVIGYGDL 513 Query: 369 INASPNGQRLLKTC---VSSPEYHYNVVNADSLIHVF 402 ++ + Y N L Sbjct: 514 FGFPDAVAQVEQMADWGSGGANDAYLAQNQVELEDAL 550 >gi|218782851|ref|YP_002434169.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218764235|gb|ACL06701.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 684 Score = 57.2 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 65/215 (30%), Gaps = 30/215 (13%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + + Q + + K+ K + ++ + D ++ G +E I Sbjct: 41 FDASGSMWGQIEGKAKITIAKEVMEGIVNDLPDDINVGLTAYGHRRKGDCDDLETLIPL- 99 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 G + ++ + T + T + ++ + Sbjct: 100 -GPIDKQAFIEKIKAINPKGKT--------------PMLRSIRLTAEAIKHLEDETT--- 141 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLK-TCVSSPE 387 I+ ++DG+ + + K I + + + + LK + Sbjct: 142 --ILLVSDGKETC---DPEPCAFVAELKALGINFVMHVVGFDVGGETEEELKCMAAAGDG 196 Query: 388 YHYNVVNADSLIHVFQN-ISQLMV-HRKYSVILKG 420 ++ NAD L I + + + K S +L G Sbjct: 197 EYFPASNADKLKEALATVIEKTVEKNLKVSALLNG 231 >gi|71988405|ref|NP_001022675.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|51587416|emb|CAH19087.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] gi|51591699|emb|CAH19103.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3183 Score = 57.2 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 KN +L F+ +L + +GLI Y+ ++ + ++ + Sbjct: 661 VFKNEVLRFVREFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGISETQYL 720 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ Q R+G + + I LTDG + + Sbjct: 721 TGLTRTGAAIQHMVQ----------EGFSERRGARPQQSDIARVAIILTDGRSQD----- 765 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N D A++ I I + L++ SP + V L ++ Sbjct: 766 NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 820 >gi|33333552|gb|AAQ11892.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 57.2 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + H + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYHEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|71988401|ref|NP_001022674.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|22096361|sp|P34576|MUA3_CAEEL RecName: Full=Transmembrane cell adhesion receptor mua-3; AltName: Full=Muscle attachment abnormal protein 3; Flags: Precursor gi|4761646|gb|AAD29428.1|AF139060_1 transmembrane cell adhesion receptor MUA-3 precursor [Caenorhabditis elegans] gi|14530487|emb|CAA83226.2| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] gi|14530561|emb|CAC42345.1| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3767 Score = 57.2 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 KN +L F+ +L + +GLI Y+ ++ + ++ + Sbjct: 1245 VFKNEVLRFVREFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGISETQYL 1304 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ Q R+G + + I LTDG + + Sbjct: 1305 TGLTRTGAAIQHMVQ----------EGFSERRGARPQQSDIARVAIILTDGRSQD----- 1349 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N D A++ I I + L++ SP + V L ++ Sbjct: 1350 NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 1404 >gi|241983030|emb|CAZ65768.1| antrax toxin receptor 2 [Cricetulus griseus] Length = 463 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 56 IYNFVHQLTER-FVSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYI 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + S L II LTDG + Sbjct: 115 HEGLKLANEQIQSA----------------GGLKTSSIIIALTDG-KLDGLVPSYAENEA 157 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 K++ + + + Q L+ S + + V +L + +I + Sbjct: 158 KKSRTLGASVYCVGVLD--FEQAQLERIADSKDQVFPVKGGFQALKGIINSILAQSCTEI 215 Query: 410 VHRKYSVILKG 420 + S + G Sbjct: 216 LELSPSSVCVG 226 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 62/179 (34%), Gaps = 27/179 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + + +T ++ + T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE++ I C Sbjct: 389 RGLEAGFQAITSSDQSTSGSE----------------IVLLTDGEDD-------LISSCF 425 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQLMVHR 412 + K + I TI++ + L + Y + +SL+ F IS + Sbjct: 426 EVVKHSGAVIHTIALG-PKAARELETLSDMTGGLRFYANKDVNSLMDAFSGISSASGNL 483 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 52/187 (27%), Gaps = 27/187 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIG----YTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 LK + L +G + ++ +R Sbjct: 321 LKQTKEALFTILHDLRPEDNFNIVGFSSRIKVWKDQLVPVTP-----NSIRDGKVYIHHM 375 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 T+ A++ ++L F I + I+FLTDG Sbjct: 376 SPSGGTNINGALQTGIRLLND----------FVAHNDIDARSVS-LIVFLTDGRPTVGEI 424 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIH 400 ++ + A++ + + TI I + + L L+ C + + A L Sbjct: 425 QTPKILNNTKEAARDR-VCLFTIGIGDDVDFKLLEKLSLENCGMTRRFQVEADAAAQLKG 483 Query: 401 VFQNISQ 407 + I Sbjct: 484 FYDEIGT 490 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/409 (9%), Positives = 103/409 (25%), Gaps = 58/409 (14%) Query: 40 LIYVLDWHYKK--------NSMESANNAAILAGASKMVSNLSRLGDRFE-SISNHAKRAL 90 K + + +A +A + + + + Sbjct: 105 AADSAQIMQDKEPQAAKSAAELRAEFDAGEIAALKSKSEDSAAALGMAKRAAPAAPGVVA 164 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 + S + ++ + + + + F Sbjct: 165 QGQLLAEPMAAVAPSPVPPTDGRAQLQLDPS-RERFANAAVNPIKSVAADPVSTFSADVD 223 Query: 151 MTSYDYR--LQFIEHLLNQRYNQKIVSFIPA---------LLRIEMGERPIFLIELVVDL 199 SY + + + + + + + D Sbjct: 224 SASYSFVRRSLTAGAMPDPQSVRVEEMINYFPYDWPGPEKADQPFKATVTVMPTPWNHD- 282 Query: 200 SGSMHCAMNSDPEDVNSAP----------ICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + MH A+ +AP + K+ LK+A L ++ + Sbjct: 283 TQLMHVAIKGYDIAPATAPHANLVFLIDVSGSMDEPDKLPLLKSAFRLLVNKLKADD--- 339 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ Y +EP+ EK + ++ T + AY + Sbjct: 340 ---TVSIVTYAGNAGTVLEPTRVAEKSK-ILSAIDRLEAGGSTGGAEGIAAAYDLAKKAF 395 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIVTIS 368 + ++ TDG+ N S + + +I ++ +++ I + + Sbjct: 396 VKDGVN----------------RVMLATDGDFNVGPSIDEDLKRIIEEKRKDGIFLTVLG 439 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 L++T + + D+L + + + + + Sbjct: 440 FGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEEAGSTLFPIA 486 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 53/202 (26%), Gaps = 28/202 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ LK A + + V ++ Y + P+ G + Sbjct: 224 TSGSMDDPAKLPLLKRAFAQLVPQLRAKDRVS------IVAYAGHAGLVLPPTPG-NRHG 276 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T+ ++ AY + I+ T Sbjct: 277 EILAALEGLHAAGSTNGGEGLRLAYAMARQGHVEGGVN----------------RILLAT 320 Query: 338 DGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNA 395 DG+ N + + + + I + T+ + + + + H + Sbjct: 321 DGDFNVGITDRNALLTLVADQRRSGIALSTLGFGSGNYNDAMAERLADAGNGQHLYIDT- 379 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 L + + Q M ++ Sbjct: 380 --LDEARRALVQQMQATLLTIA 399 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 39/115 (33%), Gaps = 4/115 (3%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + +EK ++ + M ++ IG I + +N + NAA + + ++ Sbjct: 10 RFLKNEKGTVTVYLVIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACSVLADYHTST 69 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + D +++K ++ S + + + S Sbjct: 70 KQDFGL----FTYKGANYDQDFAKYVKANLTFSADQNFNLLDYRYEGSKADVSNS 120 >gi|34481898|emb|CAE46497.1| trap [Plasmodium falciparum] Length = 331 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + H + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIADEIKYHEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I ++ ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENAKQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/286 (11%), Positives = 68/286 (23%), Gaps = 58/286 (20%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI--------PALLRIEMGERPIFLIEL 195 + + + K + + + + F + Sbjct: 65 ATNEERKPIDVVFVHDTSGSMKDSFGGVKKATSAENALKESLRFFNQNQQSKDKYFFVPF 124 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 D+S D ++ L L + L + Sbjct: 125 DSDVS----YKNYGDK---------------RIQPA-EGLSDILPMAEHLDFSEAYWVKK 164 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + T+ T +++ A + + + Sbjct: 165 YSWYYGYYWSQEIFDFSV----------------GGTNYTQSLEYALSKFSGMRDSKRYI 208 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-----------TIKICDKAKENFIKI 364 F G K T+G N N + +K N +K+ Sbjct: 209 IFLTDGEPTSLNHDNKQYTLYTNGTARAGNVNANYNDVQKFIHEKAVASAEKLGVNDVKM 268 Query: 365 VTISINASPNGQ-RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQL 408 +I+ +LL+ + + N +SL +VF +IS+ Sbjct: 269 YSIAFAEPGEVNYQLLENMSNKTGGRAIQA-NPNSLSNVFTDISKE 313 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 49/272 (18%), Positives = 87/272 (31%), Gaps = 76/272 (27%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + LV+D SGSM + ++TKM LK+ F+ Sbjct: 61 AKASSSAVGAADVVLVIDRSGSMGERYDG-------------SRQTKMEVLKDTAKDFIT 107 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD-SLILKPTDSTPAM 298 + S + + ++ Y + + NI + T++ Q + R +D T S + Sbjct: 108 QLSAQSPASQ---VSVVSYASDSKTNIGLTSLDTQENIQSLNRAIDKLWASGATRSDLGL 164 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC---- 354 + AY +L ++F+IFLTDGE N++ I Sbjct: 165 EDAYSVL-----------------GAADSGNKQFVIFLTDGEPNSYSGFDREIAARAEST 207 Query: 355 -------DKAKENF----------------------------IKIVTISINASPNGQRLL 379 D K + +I +I I S + QR+ Sbjct: 208 ASIIKGEDLIKRDGRIFGDYDGSLDDGSSHNPDWEFEGDPLSAEIFSIGILKSWSSQRVH 267 Query: 380 KTCVSSPEYHYNV--VNADSLIHVFQNISQLM 409 H A +L +F+ I+ + Sbjct: 268 DYLNYIDSQHSAALADTAQALQDIFEAITHQI 299 >gi|301767380|ref|XP_002919113.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VI) chain-like [Ailuropoda melanoleuca] Length = 1059 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 61/206 (29%), Gaps = 26/206 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K F+D++ + + G + Y+ VE + ++ V Sbjct: 60 DKVKAFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIISPLRPMPSDRDALKASVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF-----------QK 331 K T + A+K+ + L K Sbjct: 120 VK--YFGKGTYTDCAIKKGLEGLXXXXXXXXXXXXXASAGGRRPARSRGCQGGSHLKENK 177 Query: 332 FIIFLTDGENNNFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 ++I +TDG ++AK +K+ +++I RL + +++ + Sbjct: 178 YLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTY 235 Query: 390 ---YNVVNADSLIHVFQNISQLMVHR 412 + + + I+Q + Sbjct: 236 RRNFTAADWGQTRDAEEIITQTIDTL 261 >gi|284029817|ref|YP_003379748.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809110|gb|ADB30949.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 550 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 69/215 (32%), Gaps = 37/215 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D+SGSM + +T+M A L + + V+ Sbjct: 358 VIDVSGSMAEKVAG-------------SAKTRMQLTIEAADNGLKMFPDSAELGLWVFST 404 Query: 256 LIGYTTRVEKNI----EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 IG + + + + + G + + + + T A + + S Sbjct: 405 KIGPDSADFRQLVPIGKLTPGHRQRMIGQLKAQSARVGGGTGLYDTAIAAVRAVRSSYNS 464 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN--FKSNVNTIKICDKAKE--NFIKIVTI 367 ++ TDG+N++ S T++I + K+ ++I+ + Sbjct: 465 GAVN----------------TVLLFTDGKNDDPGSLSLDRTLQILNGLKDPARPVRIIAL 508 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + L + ++ Y N L VF Sbjct: 509 GIGPDADADELKRLAQATGGQAYVARNPTDLKDVF 543 >gi|238923487|ref|YP_002937003.1| hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] gi|238875162|gb|ACR74869.1| Hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] Length = 1082 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/395 (9%), Positives = 106/395 (26%), Gaps = 42/395 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 E+ S+ + ++ +L+ + V + K+ ESA + A+ + + ++L+ Sbjct: 6 FSKRERGAVSVFLVIILVPCMLVASIFVDVGRVYLSKSMAESAADMALNSLMTHYDADLN 65 Query: 74 R---LGDRFESISNHAKRALIDDAKRFIKNHIKES-LSGYSAVFYNTEIQNIVNSSRISM 129 + ++I ++ ++ + ++ F + S + + Sbjct: 66 DWYGMVASCQNIDEFYDASIKCYKNALKSQNLSKDEMNTLVGEFSAMIGADSKASDYLRV 125 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-----LLNQRYNQKIVSFIPALLRIE 184 T ++ + + F+++ + ++ IP + + Sbjct: 126 TDDGDDSTTIKAVDGANLANATMLKSQIVDFMKYRAPIAITQTAIDKIKNKSIPGIDDVL 185 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDV-----NSAPICQDKKRTKMAALKNALL--- 236 + L+E D + M N + + + ++A+ Sbjct: 186 KSDENKPLVEKKQDYCKADEKLMRDSYNTYKYLFDNYSYGNPQPSNSLLTGTRDAMQTAR 245 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--------SLI 288 ++ L GL+ + + ++ + Y + S Sbjct: 246 EQYRELNKLMITNLYNTSGLVVFNRAQVPLNQYNYTKSSTKCYKAKYDGKTYYVNCHSRK 305 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T LT K I + G N+ + Sbjct: 306 DDDTYYIDG-----TELTKQFDDISKKIKAFDKAKNDFTNAVNNSISYSSGVTNDIQYWK 360 Query: 349 NTIKICDKAKENFIKIVTISI-NASPNGQRLLKTC 382 + ++++ + L K Sbjct: 361 HAADA-----------YSVTVSGSDDYRTNLNKKA 384 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 49/162 (30%), Gaps = 11/162 (6%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILT 306 + G+ + V + E RQ + ++ T A+++A Q + Sbjct: 75 PDGSKAGIWTFGQSVNLLVPYRLVDESWRQQAAKSASAINSVALHTHIGAALEKAAQDVV 134 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + N + ++ + K + T Sbjct: 135 AGDDGFRRNLVLL---------TDGVVDIDPEAVVNIQERKRILTELLPQLKAAGYVVHT 185 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I+++ + + + K +++ +AD L+ F I Sbjct: 186 IALSQDADQELMKKLALTTDGVFAVAQSADELMQAFLTIFDQ 227 >gi|118384116|ref|XP_001025211.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89306978|gb|EAS04966.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 631 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/187 (10%), Positives = 65/187 (34%), Gaps = 26/187 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K ++ +L L ++ + LI + + + + + Sbjct: 151 DSGSMSGKKAQLVRKSLKYLLKIMNEND------RICLISFDSVEKILTPFLRNNLENKS 204 Query: 279 YVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + +++ T+ M+ ++ + K+++ T + L+ Sbjct: 205 ELKKAIKNIVGRGSTNIEAGMEAGLWMIKNRKEKNPIT----------------CMFLLS 248 Query: 338 DGENNNFKSNVNTIKICD--KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DG++++ + ++ K+ ++ FI + T A + ++ + +Y + + Sbjct: 249 DGQDDSPQVDLRVQKLIQSYDIQDTFI-VNTYGYGADHDATQMRNIAETHKGGYYYIEDV 307 Query: 396 DSLIHVF 402 + F Sbjct: 308 KKVSEWF 314 >gi|73954248|ref|XP_546328.2| PREDICTED: similar to integrin, alpha 1 precursor [Canis familiaris] Length = 1182 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 69/191 (36%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D ++ + + +G++ Y V + + V + Sbjct: 191 SVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANQIIQRGGRQ 250 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 251 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNHLLNKVI 297 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I S N L +K+ S + ++ +NV + +L+ + Sbjct: 298 QDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 354 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 355 VEALGERIFAL 365 >gi|257389158|ref|YP_003178931.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257171465|gb|ACV49224.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 393 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 49/156 (31%), Gaps = 17/156 (10%) Query: 250 EDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ Y+ ++ + T + + W + + + TD +K A + L+S Sbjct: 70 DEDYVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLSSS 129 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K ++ L+DG++N + + + + I+I + Sbjct: 130 A---------------TGPDTVKRLLLLSDGKDNERTPD-EFEGLAEAIDDAGIRIQSAG 173 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I N + + ++ + F Sbjct: 174 IGTDYNEATIRTLGTAGRGTWTHLEAPGDIEDFFGE 209 >gi|3273279|dbj|BAA31183.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 54/187 (28%), Gaps = 22/187 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT----EKVRQYV 280 R +KN + +D + ++G+I ++T +I V Sbjct: 122 RRSWEEIKNFVKSIVDM---CDISDQGTHVGIITFSTDPVIDIAFDKYKGVEMNAVNIKR 178 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D T A+ A + L + + QK + ++DG Sbjct: 179 DIDELRRKKGYTFIDKALTLADKSLFTQEAGM-------------REDSQKVAVLMSDGI 225 Query: 341 NN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + T + K +++ T+ I A + L+ + D L Sbjct: 226 QTKDRGPFTPTDIAANPLKMKGVQVYTVGIGADVDVFELMAVASGITSMFSD-RYLDDLK 284 Query: 400 HVFQNIS 406 IS Sbjct: 285 AELAYIS 291 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K K A ++ + G I + T +E + + + + + Sbjct: 422 KFDLAKEAAARSVELLKEED------TFGFIAFDTEAWTVVE-TEPIKNKDEVIETIRST 474 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + TD PA+ QAYQ L + K II LTDG++N+ Sbjct: 475 ALGGGTDIFPALNQAYQQLNEMDLKR------------------KHIILLTDGQSNDGPY 516 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +I ++ N + + T++I + L + Y V A ++ + Sbjct: 517 ----EEIIEEGLTNNVTLSTVAIGGDADTSLLEELAEIGTGRFYEVYEASAVPSILS 569 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 60/184 (32%), Gaps = 25/184 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ K AL F+D +V+ + ++T + + S K TR Sbjct: 400 NRLREAKTALGDFIDIFADQDNVQ------VTIFSTNATELSDLSPIGPKRADLHTRIDG 453 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + + Y + + + ++ LTDGE+ Sbjct: 454 LVADGETRLYSTIGEVYTDIQQQTEVQRI----------------RALVVLTDGEDTASS 497 Query: 346 SN--VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + +I IKI TI+ + N + L + + Y + ++ V+ Sbjct: 498 LSLEQLNEQIRQDESGTSIKIFTIAYGSDANQEVLQRIAEITGAKSYTG-DPATIRQVYH 556 Query: 404 NISQ 407 I+ Sbjct: 557 EIAT 560 >gi|218516298|ref|ZP_03513138.1| hypothetical protein Retl8_22869 [Rhizobium etli 8C-3] Length = 432 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 75/237 (31%), Gaps = 45/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I P + ++D+SGSM K+ LK+A L ++ Sbjct: 55 DIAPTTAPHANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRLLVNR 95 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ Y +EP+ EK + ++ T ++ A Sbjct: 96 LKADD------TVSIVTYAGNAGTVLEPTRVAEKPK-ILSAIDRLEAGGSTGGAEGIEAA 148 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKEN 360 Y + + ++ TDG+ N S+ + +I ++ +++ Sbjct: 149 YDLAKKAFVQDGVN----------------RVMLATDGDFNVGPSSDEDLKRIIEEKRKD 192 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + + L++T + + D+L + + + + + Sbjct: 193 GIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEQAGSTLFPIA 247 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 68/197 (34%), Gaps = 24/197 (12%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++ L+ L F+D + +G++ + ++ + Sbjct: 112 DYSKSMLAEDVYPSRSERLEAELTRFID-----ESGRRGDRVGVVIFA-GAARSFPVTSD 165 Query: 273 TEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT--------NFFRQGVK 323 + ++ D + T A+ ++ +L + ++ G Sbjct: 166 MGVLSLFLAHADPRTENPGGTAIGKALDKSIDLLVAVRRDDSGARADQVEGEGEDESGAP 225 Query: 324 IPSL---PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + + I+ LTDGE+ ++ +A++ I+I T+ I + + Sbjct: 226 EAAPALSEADQVIVLLTDGEDTVG----RPEEVAARAEQLGIRIYTVGIGSDSGEPIMRY 281 Query: 381 TCVSSPEYHYNVVNADS 397 P + +AD Sbjct: 282 DADGQPAGY--ATDADG 296 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 64/190 (33%), Gaps = 25/190 (13%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +T +KN + +D + L+ Y+ ++ + + D Sbjct: 168 QTDFKRMKNFVRAVMDR-----SKGTNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDP 222 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + ++ + L K +K II +TDGE + Sbjct: 223 IVQLNGLTFTATGIRTVVRELFHSKNG-------------ARKSARKIIIVITDGEK--Y 267 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVS--SPEYHYNVVNADSLI 399 K + + +A++ I I + + + LK S S ++ + V + +L Sbjct: 268 KDPLEYKDVIPEAEKANIIRYAIGVGDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALS 327 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 328 SIQKQLQEKI 337 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 64/190 (33%), Gaps = 25/190 (13%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +T +KN + +D + L+ Y+ ++ + + D Sbjct: 168 QTDFKRMKNFVRAVMDR-----SKGTNTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDP 222 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + ++ + L K +K II +TDGE + Sbjct: 223 IVQLNGLTFTATGIRTVVRELFHSKNG-------------ARKSARKIIIVITDGEK--Y 267 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVS--SPEYHYNVVNADSLI 399 K + + +A++ I I + + + LK S S ++ + V + +L Sbjct: 268 KDPLEYKDVIPEAEKANIIRYAIGVGDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALS 327 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 328 SIQKQLQEKI 337 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/388 (7%), Positives = 89/388 (22%), Gaps = 64/388 (16%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKR-----ALIDDAKRFIKNHIKESLSG 108 + +A+ + ++ + + A + + ++I+ +G Sbjct: 241 QYTTDASGTYPTASWQPTGNQNVRNHQGNKDGAAQWDGQTEWNGEPTNKTNSYIEYGGTG 300 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 A + + + + + + + + + + Sbjct: 301 SQADYAIRKYAKETTTPGLF-------DVYLNIRGNVQKEIAPLDLVLVVDWSGSMNDNN 353 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + + + I + M S + Sbjct: 354 R----IGEVQKGVDRFVDTLAESGI----TDNIHMGYVGYS-----SDGYKNDSVAMGPF 400 Query: 229 AALKNALLLFLDSIDLLSHVKE----DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++KNA+ S + D L + + + G V++ Sbjct: 401 DSVKNAIKTITPSSTTGGTFTQKALRDAGNMLATPNGHKKVIVLLTDGVPTFSYQVSKVQ 460 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 S + + +++ + + I Sbjct: 461 TETNG----SYYGTQFTNRQDQPGSTSRISNSYYAPDQRNTNKLINSTFIA--------- 507 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQR------------LLKTCVSSPEYHYN- 391 TI K+ I+I + I + + + + + +Y Sbjct: 508 -----TIGEAMALKQRGIEIHGLGIQLQSDTNAGLSKQEVENKMRQMVSADENGDLYYES 562 Query: 392 VVNADSLIHVFQ----NISQLMVHRKYS 415 +A + IS + + K + Sbjct: 563 ADHAPDISDYLARKAVQISGTVANGKVT 590 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 50/178 (28%), Gaps = 12/178 (6%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTT-RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 SI + I + + T V L T+ T Sbjct: 97 DQTGSISTTDPYNLRIEASKIFLNNLGTDDYTGLTSFTSSYTSVVK-----LHSGFTNKT 151 Query: 296 PAMKQAYQIL----TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 MK++ L + T K +I TDGENN Sbjct: 152 EQMKKSLDTLALNVSGGTPLYTSTIQSVTYTAQKGPTANKAVIVFTDGENNVTT--NTLE 209 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 KA + I + T+ ++ N L + + + +A LI F + L+ Sbjct: 210 DATAKAIQQKIPLFTVGLSTDVNVNVLAQMANETGGAFFYAKDAGQLISTFGTLGNLL 267 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 53/163 (32%), Gaps = 22/163 (13%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 E + ++ ++ ++ +K R S T + A++ + + Sbjct: 1302 GPEGTQVAIVQFSDEPRTEVQLKSYRKKERLLEAISSISYKGGNTKTGRAIQHMKDSVFT 1361 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ K ++ LTDG + + S K+ + ++ + I Sbjct: 1362 EEGG-------------ARTAVPKVLVLLTDGRSQDDVS-----KVSKELQKQGFVVFAI 1403 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + G+ + S + + V + F+ + + +V Sbjct: 1404 GFADADYGELVNVASKPSSTHVFFVDDL----DAFKEMEEELV 1442 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/176 (10%), Positives = 54/176 (30%), Gaps = 20/176 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +GL ++ T ++ V +L K ++ + Sbjct: 35 VKAFDVDLDRTRIGLAQFSGEPRIE--WHLNTHTTKEAVMEAARNLPYKGGNTLTGLALT 92 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K + LTDG++ + + ++ Sbjct: 93 F-----------ILENSFSPESGSRPGIPKIGVLLTDGKSQDDVIPP-----AQRLRDAG 136 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQLMVHRKYSV 416 +++ I + + + L S + + YNV + + + +++ + R ++ Sbjct: 137 VEVFAIGV-KNADEGELRAIASVSEDTHVYNVADFHLMADIVDVLTRTICERMETL 191 >gi|323484869|ref|ZP_08090225.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] gi|323401865|gb|EGA94207.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] Length = 2032 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/294 (9%), Positives = 68/294 (23%), Gaps = 31/294 (10%) Query: 136 RLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 L+ + ++I + + + + + + ++ + + Sbjct: 1060 TLEVTGDSIQTTIGGGTADIVFVIDKSSSMNDWDDSLDDYRWNKLESTVDRFINKLKITS 1119 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 +S + + + I + +A KN L L + Sbjct: 1120 PNSKISFIEYQSSDLGTYQETYDDIRKVAVADTESADKNLEGNKLKYFRTLQKWTAISEV 1179 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G Y + N + A+ + + D+ S Sbjct: 1180 GSKPYGSAPGYNQG----------------THSAGGYLGAERALDR-LKTYNPDEYNSNS 1222 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINAS 372 G N + I + K+ I T++ + Sbjct: 1223 KYIIYLADGTAGYYVNNNGTQAGSGSGGNANARRAAITQSGELKKKHPDATIYTVAFGSD 1282 Query: 373 PNGQ-RLLKTCVSSPEY----------HYNVVNADSLIHVFQNISQLMVHRKYS 415 + +K + Y+ N L F N++ + + Sbjct: 1283 SSANMNWMKPGAYNGNSDNPYNPNVTAFYSAANTKELEETFDNLAAQVGSSAVT 1336 >gi|159900699|ref|YP_001546946.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893738|gb|ABX06818.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 828 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 47/171 (27%), Gaps = 30/171 (17%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR--QYVTRDMDSLILKPTDSTPAMKQAY 302 + D +G++ + + + + + V+ + TD A++ Sbjct: 417 TDSLGADDTIGVLAFDDTNDWTVTFTKVGQGVQLSEIQNNIAGLSAGGGTDIYAALEVGM 476 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L + + + LTDG + S + I + I Sbjct: 477 GGLAQQTGKV------------------RHAVLLTDGRSGGESSYESLIA---PLRAQGI 515 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS-----LIHVFQNISQL 408 + TI+I + L ++ D L I++ Sbjct: 516 TLSTIAIGGDADTVLLESLAKLGAGRYHFASRPDDLPRLTLQEA--EIARE 564 >gi|326433447|gb|EGD79017.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 1763 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/289 (9%), Positives = 73/289 (25%), Gaps = 53/289 (18%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKI--VSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + + L LL ++ +I V + A + + ++ + Sbjct: 215 GATLDFAFLLDGSGSLLQSGWDSQIQFVKDVTAYFDVGRNRTHVGIVTFHGPRVSHLSYN 274 Query: 207 ----MNSDPEDVN---------------SAPICQDKKRT-KMAALKNALLLFLDSIDLLS 246 + C + + A +D++ Sbjct: 275 ESIPDCPLGTVWEPDGEGCICEVEGSQCTGSACGTGYYELEAGGCQGATDQNIDTLPDCE 334 Query: 247 HVKEDVYMGLIGYTTR------------VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + V++ + + + + + T Sbjct: 335 IRQFFPHGCADCTCNPLGQKYQPSDSYTVDQLFSMTQDHDDLAATLDSLV--FPDGATHM 392 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + ++Q + + + +I LTDG++N ++ Sbjct: 393 SAGLEQIRDTIFQLRNGMREYEQA----------IPRVLIVLTDGKSNPGF---EPHEVA 439 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHV 401 ++ + I I I + L+ S P + Y + + SL + Sbjct: 440 EQLRNRGIIIYAIGVGD--YYLPELEAMASEPMDRHVYELADPSSLFTI 486 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 35/324 (10%), Positives = 84/324 (25%), Gaps = 56/324 (17%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 + A ++ + L D S+ + + K + + Sbjct: 2637 TLDVAENELTGRVSAELVAPHDYDGSLDFDLSVSATSEEKVGADTETTTVSEPVTLSGSD 2696 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + N V+++ D V + + + + YN Sbjct: 2697 LVVGNNVDNTLEGHGGNDILIGDQGGYKTNVTPGVNYNIALVVDASGSMGDYVYNT---- 2752 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 D T+M ++ AL Sbjct: 2753 -------------------------------------DGTVMRNPDGSAMTRMDMMQEAL 2775 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 ++S+ +G + ++ + ++ V + +++ ++L TD Sbjct: 2776 TNLVESLVTHDGSINIKLIGFDDNIDVTFEALDITNSSDVVAELLSKIENNLPVGGGTDY 2835 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++A S S ++ FLTDGE N+ N + Sbjct: 2836 GVGFEEANNWYASSSIS--------------SNGYENMTFFLTDGEPNSGTLNNGLTEYN 2881 Query: 355 DKAKENFIKIVTISINASPNGQRL 378 + + K++ + + + L Sbjct: 2882 ELVSTHNAKVMAVGMGNDIDDSVL 2905 >gi|316973220|gb|EFV56840.1| putative transmembrane cell adhesion receptor mua-3 [Trichinella spiralis] Length = 3249 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 61/174 (35%), Gaps = 9/174 (5%) Query: 229 AALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 A KN +L FL +L + + +I Y+ ++ ++ K + Sbjct: 903 AVFKNEILRFLREFINLFTIGSNHTRLAIIQYSDQIRHELDFKEANSKAEVDEALNRVEY 962 Query: 288 ILKPTDSTPAMKQAYQILTSD--KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ ++I S + F +G + + I +TDG + + Sbjct: 963 LTGLTKTGDALTDMFKIGFSSTFATKFFTILSESRGARPIETGVHRVAIVITDGRSQDIV 1022 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 S ++AK++ + + + + + L S + + V L Sbjct: 1023 SFS-----ANEAKKSNVLMFAVGV-TDHVSEAELVEIAGSKDRVFLVKEFTDLN 1070 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 39/263 (14%), Positives = 80/263 (30%), Gaps = 54/263 (20%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + +QR VS I + + + L++D SGSM Sbjct: 14 VDASKS-GSQRLMATSVSAIGETIDSRV----PLNLCLILDHSGSM-------------- 54 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEK 275 K + +K A L +D + + ++ + R E + + Sbjct: 55 ------KGQPVENVKRAAWLLVDKLRDQD------RLSIVVFNHRAEVLLSNQNVVDRDH 102 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 ++Q + R T ++ + L +K + F Sbjct: 103 IKQQINRLS---ANGGTSIDEGLRLGIEELAKGRKDTISQAF-----------------L 142 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 LTDGEN + +N A + + + T+ + N L K + ++ + Sbjct: 143 LTDGENEHGDNNRCLK-FAQLAADYNLTVNTLGFGNNWNQHILEKISDAGLGSLSHIEHP 201 Query: 396 DSLIHVFQNISQLMVHRKYSVIL 418 D + F ++ M + Sbjct: 202 DQAMDKFDSLLTRMQTVGLTNAY 224 >gi|115488386|ref|NP_001066680.1| Os12g0431700 [Oryza sativa Japonica Group] gi|77554879|gb|ABA97675.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113649187|dbj|BAF29699.1| Os12g0431700 [Oryza sativa Japonica Group] gi|125536450|gb|EAY82938.1| hypothetical protein OsI_38156 [Oryza sativa Indica Group] gi|125579179|gb|EAZ20325.1| hypothetical protein OsJ_35934 [Oryza sativa Japonica Group] Length = 524 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 70/245 (28%), Gaps = 48/245 (19%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + + R + VVD+SGSM K+ Sbjct: 39 TADSVTAVVEIKATSSTAVREGLDLVAVVDVSGSMRGH--------------------KI 78 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSL 287 ++K AL + + + + ++ + + ++ + T+ R + + + Sbjct: 79 ESVKKALQFVIMKLTPVD------RLSIVTFESSAKRLTKLRAMTQDFRGELDGIVKSLI 132 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD + +L T I ++DG+ S Sbjct: 133 ANGGTDIKAGLDLGLAVLADRVFTESRTA---------------NIFLMSDGKLEGKTSG 177 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNIS 406 T + + + T + Q L +SP + V + +L F + Sbjct: 178 DPT-----QVNPGEVSVYTFGFGHGTDHQLLTDIAKNSPGGTYSTVPDGTNLSAPFATLL 232 Query: 407 QLMVH 411 +V Sbjct: 233 GGLVT 237 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 64/189 (33%), Gaps = 35/189 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAYVQSELVQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPTAAAGGTSICTGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---L 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAVIHTVALG-PSAAQELEQLSKMTGGFQTYASDQAQNNGL 468 Query: 399 IHVFQNISQ 407 I F +S Sbjct: 469 IDAFGALSS 477 >gi|169642421|gb|AAI60698.1| Unknown (protein for IMAGE:4964804) [Xenopus laevis] Length = 519 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/262 (11%), Positives = 86/262 (32%), Gaps = 48/262 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + + + + + I +V+D S S+ Sbjct: 211 FSSSMLENVDTTNLED--RSDRSVRILKDGLMNIFIVLDTSKSV---------------- 252 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEK 275 + + K+A +LF++ + +I Y ++ + S + Sbjct: 253 ----GQNRFDEAKSASILFIEKMSNYDIKPRY---CIISYASKAISVVSLRDPDSNNADA 305 Query: 276 VRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ Y+ L + + R+G K + I Sbjct: 306 VMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQE-----LAYEREGKKEDFMKIHNVI 360 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK----------IVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ + + I+ + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKLIKRFLDVGIRKDNPREEYLDVYVFGLGSDIDQPEINDLAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQ 403 + + +++ N + + F+ Sbjct: 421 KKEKEVHTFHLQNVEKMKEFFE 442 >gi|148230547|ref|NP_001081234.1| complement factor B [Xenopus laevis] gi|602779|dbj|BAA06179.1| complement factor B [Xenopus laevis] Length = 747 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/262 (11%), Positives = 86/262 (32%), Gaps = 48/262 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + + + + + I +V+D S S+ Sbjct: 211 FSSSMLENVDTTNLED--RSDRSVRILKDGLMNIFIVLDTSKSV---------------- 252 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEK 275 + + K+A +LF++ + +I Y ++ + S + Sbjct: 253 ----GQNRFDEAKSASILFIEKMSNYDIKPRY---CIISYASKAISVVSLRDPDSNNADA 305 Query: 276 VRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ Y+ L + + R+G K + I Sbjct: 306 VMEHLEEFQYDRHEDKQGTNTRAALHAIYEHLIEQE-----LAYEREGKKEDFMKIHNVI 360 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK----------IVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ + + I+ + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKLIKRFLDVGIRKDNPREEYLDVYVFGLGSDIDQPEINDLAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQ 403 + + +++ N + + F+ Sbjct: 421 KKEKEVHTFHLQNVEKMKEFFE 442 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ L + + L+ Y + V K + T + Sbjct: 91 FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 150 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + + G + K I +TDG + ++ Sbjct: 151 AIKTAMEQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 195 Query: 357 AKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + + + LK S P ++ + V + Sbjct: 196 ARASGIEIYAVGV----DRAEMRSLKQMASQPLDDHVFYVETYGVIEK 239 >gi|239908812|ref|YP_002955554.1| hypothetical protein DMR_41770 [Desulfovibrio magneticus RS-1] gi|239798679|dbj|BAH77668.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 439 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 71/241 (29%), Gaps = 46/241 (19%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + + P E ER + L +D SGSM Sbjct: 20 DNTLDVLVRIQAPNTPEGETKERTRLNLALAIDRSGSMA--------------------G 59 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + K +D + V LI Y + +E + + + R ++ Sbjct: 60 RPLEEAKRCASFVVDKLKNTDRVS------LIAYDSSIETRVP--SVKVEDKAIFHRAIE 111 Query: 286 SLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ + + ++ II L+DG+ N Sbjct: 112 GIDDGGCTNLHGGWLKGAEQISP----------------YIDPSTISRIILLSDGQANEG 155 Query: 345 KSNVNTI-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ I K C + + + T + ++ N ++ + Y AD L+ FQ Sbjct: 156 LTDEAEIFKQCRELADAGVTTSTYGLGSNFNETLMIGMAKNGQGNSYYGRTADDLMDPFQ 215 Query: 404 N 404 Sbjct: 216 E 216 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 28/272 (10%), Positives = 76/272 (27%), Gaps = 42/272 (15%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 L + + + + + + + + + + P I++V + N + E Sbjct: 91 EIVLVLDTSGSMKDSKIKKMKNAAMEFVNK--IKKIPNLDIDIVTYSTSGYTYLNNGNTE 148 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 + D + +++ +K D + ++ Sbjct: 149 E--------------------------DLLKIINSIKADGGTNTGEGLRKANYILDLEKN 182 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR----SFFTNFFRQGVKIPSLP 328 + + D + + + L K S + + + Sbjct: 183 KNADKS-IVFMSDGMPTYYSIIAKYYWWDFLKLFPYKSYYDEISNEMIYTDDHIFEINKR 241 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLKTC--- 382 + + NN KS I + K I +I + + G L+K Sbjct: 242 NLRVEGSGKEESNNVDKSTNYATTIGNIIKTKKYNIYSIGYALGDENSTGNMLMKKIHES 301 Query: 383 ---VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + + +++ VF NI ++ Sbjct: 302 MGGIVGEDGTFFMSDENAINEVFNNIGDKIIS 333 >gi|99078203|ref|YP_611461.1| von Willebrand factor, type A [Ruegeria sp. TM1040] gi|99035341|gb|ABF62199.1| von Willebrand factor type A [Ruegeria sp. TM1040] Length = 477 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 39/113 (34%), Gaps = 3/113 (2%) Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ QA ++ K + + S + +I ++DGE + + Sbjct: 94 QAISQAVTQISPKGKTPLSAAVMQAADALRSSEEKATVILISDGEETCGL-DPCAVGAEL 152 Query: 356 KAKENFIKIVTISINASPNGQRLLKTC--VSSPEYHYNVVNADSLIHVFQNIS 406 +A+ + I + + R C ++ ++ + +A L ++ Sbjct: 153 EARGVDFTLHAIGFGIADDAARAQLQCLAENTGGFYRDASSASELTAALAQVA 205 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 78/267 (29%), Gaps = 44/267 (16%) Query: 160 FIEHLLNQRYNQKIVSFIPAL-----LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + Y ++ FI + + + I S + + Sbjct: 50 NLGSIKLRDYGLNLLYFITYAFFYLAMTVMILTLAGPSISKKKTTYLSSGADIVIVLDIS 109 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 S + + ++ K + F+ + +E+ +GL+ + + + + Sbjct: 110 PSMGAIEFSSKNRLEFAKELIKYFV-------YQRENDNIGLVAFAKEASLIVPLTIDRD 162 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + + + + A L S ++ +I Sbjct: 163 FFSKKLDDIYIMDLGNGSALGLGISIALSHL------------------KHSEAPKRSVI 204 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDG N+ + ++ + A+ +KI +I I +S K Y Sbjct: 205 VLTDGVVNSDEV--YKDQVINLAQGLNVKIYSIGIGSSEELSVGFKL---RSGRFYQGT- 258 Query: 395 ADSLIHVFQ-----NISQLMVHRKYSV 416 L ++ IS YSV Sbjct: 259 ---LREIYDPSMLFEISGKTGGLFYSV 282 >gi|148665999|gb|EDK98415.1| matrilin 3 [Mus musculus] Length = 482 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + + Sbjct: 85 SSRSVRPLEFTKVKTFVSRIIDTL---DIGATDTRVAVVNYASTVKIEFQLNTYSDKQAL 141 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + G + P K I + Sbjct: 142 KQAVARITPLST--GTMSGLAIQTAME----------EAFTVEAGARGPMSNIPKVAIIV 189 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 190 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASKPLEEHVFYVET 242 Query: 395 ADSLIH 400 + Sbjct: 243 YGVIEK 248 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 60/171 (35%), Gaps = 26/171 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDS 294 + + + + ++ + + + + + K + + + ++ T + Sbjct: 393 QIANKLQQMPIRPNLTRVAIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFT 452 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+K+ + + K+ K ++F + S +T + Sbjct: 453 NQALKRMALLFEASKR---------------DNCKMKLLVF------TDGYSAEDTAEGI 491 Query: 355 DKAKENFIKIVTISINASPNGQ---RLLKTCVSSPEYHYNVVNADSLIHVF 402 + K I + T+ I+ N LK +SP ++++ + D+L+ F Sbjct: 492 EALKRQGITVYTVGISTDKNAGLNVSELKGMATSPSHYFDSSDFDNLLKHF 542 >gi|306518578|ref|NP_034900.4| matrilin-3 precursor [Mus musculus] gi|6918887|emb|CAB72265.1| matrilin-3 [Mus musculus] gi|26339344|dbj|BAC33343.1| unnamed protein product [Mus musculus] Length = 481 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + + Sbjct: 85 SSRSVRPLEFTKVKTFVSRIIDTL---DIGATDTRVAVVNYASTVKIEFQLNTYSDKQAL 141 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + G + P K I + Sbjct: 142 KQAVARITPLST--GTMSGLAIQTAME----------EAFTVEAGARGPMSNIPKVAIIV 189 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 190 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASKPLEEHVFYVET 242 Query: 395 ADSLIH 400 + Sbjct: 243 YGVIEK 248 >gi|14548114|sp|O35701|MATN3_MOUSE RecName: Full=Matrilin-3; Flags: Precursor gi|2342635|emb|CAA71532.1| matrilin-3 [Mus musculus] Length = 481 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + + Sbjct: 85 SSRSVRPLEFTKVKTFVSRIIDTL---DIGATDTRVAVVNYASTVKIEFQLNTYSDKQAL 141 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + G + P K I + Sbjct: 142 KQAVARITPLST--GTMSGLAIQTAME----------EAFTVEAGARGPMSNIPKVAIIV 189 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 190 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASKPLEEHVFYVET 242 Query: 395 ADSLIH 400 + Sbjct: 243 YGVIEK 248 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 73/210 (34%), Gaps = 37/210 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-R 277 ++ + A FL L V+ ++G++ + T + R Sbjct: 314 SGSMGGYDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDTTATIVNKLIQIKGSDER 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + L T +K A+Q++ + + ++ LT Sbjct: 369 NTLMAGLPTYALGGTSICSGIKYAFQVIGELHSQLDGSE----------------VVLLT 412 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N T C D+ K++ + I++ ++ + + +++ + H+ + Sbjct: 413 DGEDN-------TASSCIDEVKQSGAIVHFIALGSAAD-EAVIEMSNITGGSHFYASDEA 464 Query: 397 S---LIHVFQNISQ---LMVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 465 QNNGLIDAFGALTSGNTDLSQKSLQLESKG 494 >gi|313902415|ref|ZP_07835818.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] gi|313467346|gb|EFR62857.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] Length = 170 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 8/55 (14%), Positives = 19/55 (34%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + F L + L +G ++ + ++ + A LAG ++ Sbjct: 17 QGGAVGAAFLLLLPVILAALGLVLDGSRLVLTRAHAQAVADFASLAGVQEVDEEA 71 >gi|255522879|ref|NP_001157342.1| collagen alpha-1(XIV) chain [Equus caballus] Length = 1796 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + K + S T + A+K L ++ Sbjct: 1068 DGTQVAIVQFTDDPRTEFKLDAYKNK-ETLLDAIKHISYKGGNTKTGKAIKHVRDTLFTE 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I + Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISKEMQSDGYSIFAVG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 52/173 (30%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGARSGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV--KNADENELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 382 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 441 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 442 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIITDGQSYDD---- 482 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 483 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 538 Query: 406 SQLMV 410 + + Sbjct: 539 IRGIC 543 >gi|326674791|ref|XP_001922046.3| PREDICTED: collagen alpha-1(XIV) chain [Danio rerio] Length = 1852 Score = 56.8 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + L E + + ++ + + ++K S Sbjct: 1034 DNFQKIIRFLYSTTGALDVIGPEGTQVAIAQFSDDARTEFKLNSYSDKEALLDAVQRISY 1093 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K + + S+ K ++ LTDG + + + Sbjct: 1094 KGGNTKTGRAIKHVKEAIFSEDAGV-------------RRGIPKVLVVLTDGRSQDDVN- 1139 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 KI + + I I + G+ L S P + + V + F+ I Sbjct: 1140 ----KISKEMQMEGYIIFAIGFADADYGE--LVNIASKPSERHVFFVDDL----DAFKKI 1189 Query: 406 SQLMVH 411 + ++ Sbjct: 1190 EEQLIT 1195 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 24/173 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 ++ + +GL Y+ G + + T + A+ Sbjct: 185 VNAFDVGIDKTRIGLAQYSGDPRIEWHL-NGFSTKEAVIDAVKNLPYKGGNTLTGLALTY 243 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 +L + K K I +TDG++ + I + + Sbjct: 244 ---VLENSFK----------PESGARDNIPKIGILITDGKSQDD-----VISPAQTLRSS 285 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 +++ I + + LK S PE + YNV + + + + +++ + Sbjct: 286 GVELFAIGV--KNADENELKAIASEPEDTHVYNVADFSIMGTIVEGLTRTVCD 336 >gi|326435125|gb|EGD80695.1| hypothetical protein PTSG_01285 [Salpingoeca sp. ATCC 50818] Length = 1006 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 59/171 (34%), Gaps = 26/171 (15%) Query: 235 LLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILK- 290 + F+ ++D+L + V + L+ Y + + S+ V ++ + + Sbjct: 329 VRDFIASTVDMLPVGENTVRVALMTYHSSNMPQFDFDDSFDRATVVSAISSLVYDDSRQY 388 Query: 291 PTDSTPAMKQ-AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A+ A +L + + TDG + + + Sbjct: 389 GTATGSALNFFADNMLQASAGYRGG---------------PAIVYVFTDGASQDDVTPG- 432 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++VTI + A+ N L+ SSP + V + DSL Sbjct: 433 ----AAALQATGAQVVTIGVTAAVNEAE-LQEIASSPSDVFIVADFDSLTD 478 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 59/179 (32%), Gaps = 20/179 (11%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + K+ + D + ++K + + I Y+T+ +E + K+ ++R Sbjct: 62 RSASANDNWKDIYIFVGDVVKKFPNLK--MRVSFITYSTQGHTLMELTSDRNKIHNSLSR 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T+ K+A + + + II LT G Sbjct: 120 LKNIKPTGATNMHEGFKKANEQIEQENAGGNNAA--------------SLIIALTTG-PL 164 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIH 400 K+ T +KA+E K+ + + + L V + Y V SL Sbjct: 165 TPKALQETKSEAEKAREMGAKVYCVGVKD--YRKDQLDAIVGRKDQMYGVGNGFKSLQD 221 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 46/163 (28%), Gaps = 19/163 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ L + F D++ + ++ + + + Sbjct: 37 ISELNEGINCFYDAVRSDETALYAAEIAVVTFGGSAV----LKTDFSTLEHQPDS-PNFF 91 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T AM A +L + +Q +I+ +TDG+ N S Sbjct: 92 ANGGTPMGEAMNMALDLLEKR---------KGEYKASGVDYYQPWIVLMTDGKPNGDSSE 142 Query: 348 --VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + C+ K + I I I + L SP+ Sbjct: 143 YARAVQRTCEMIKNRKLTIFPIGIGEDADMNAL---AAFSPKR 182 >gi|47847422|dbj|BAD21383.1| mFLJ00114 protein [Mus musculus] Length = 1188 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 53/167 (31%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + D + T + A+K L + Sbjct: 203 RPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLDSVRQLRGYTYTASAIKHVITELFTT 262 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + K +I +TDG S + I + + A I I Sbjct: 263 QSG-------------ARQDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS---IIRYAI 306 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + ++ LK S P + ++V N D+L + + + + Sbjct: 307 GVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 353 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 56.4 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 55/178 (30%), Gaps = 26/178 (14%) Query: 248 VKEDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 M ++ + R + + T K+++ T + T A+ A + Sbjct: 106 SNTRDQMFVVHFNERARLGLPERTPFTGKIKELETAISSFDVGGSTALYDAILLAQSHIR 165 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + ++ +TDG +N+ K+ + D + + I Sbjct: 166 GGVYGR------------------RILLVITDGGDNSSKAT--LEEAVDAVAKAGVVIYA 205 Query: 367 ISINASPNGQR----LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 I I + + L + + + + + I+ + R+Y++ G Sbjct: 206 IGIYDPNDKDQNPKVLAHLAEVTGGEAFFPTALSDITRICEEIAADV-RRQYTIGFAG 262 >gi|325963534|ref|YP_004241440.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469621|gb|ADX73306.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 347 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 18/195 (9%), Positives = 55/195 (28%), Gaps = 26/195 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-- 289 +A +D L+ E +GL + + + + ++ + D+ Sbjct: 107 SSADAAVVDVFATLASEFEGERIGLTIFDSTAIQVFPLTDDYGYAQEQLKLARDAFTGVP 166 Query: 290 -------------KPTDSTPAMKQAYQILTSDK---KRSFFTNFFRQGVKIPSLPFQKFI 333 + + K ++ + G + + + Sbjct: 167 GSSGFLDGTWSGRGSSLIGDGLASCLNSFPHGKFEHRKPESSGDSGTGRNLTEHQRSRTV 226 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI-------SINASPNGQRLLKTCVSSP 386 + TD + + + + A++ +++ + G L S+ Sbjct: 227 VLATD-NFLSGEPILTLDEAASLARDRAVRVYALNPGDLDYGAGPGQPGGHLRAAAESTG 285 Query: 387 EYHYNVVNADSLIHV 401 +Y + N +++ V Sbjct: 286 GAYYALDNPEAVPGV 300 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ + +E+ +GL+ + + + + + + Sbjct: 118 SSKNRLEFAKELIEYFV-------YQRENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDD 170 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S +K +I LTDG N Sbjct: 171 IYIMDLGNGSALGLGISIALSHL------------------KHSEAPKKSVIVLTDGVVN 212 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + ++ + A+ +KI ++ I + K Y L V+ Sbjct: 213 SDEV--YKDQVINLAQGLNVKIYSVGIGSDEELNVGFKL---RSGKFYQ----GVLKEVY 263 Query: 403 Q-----NISQLMVHRKYSV 416 IS YSV Sbjct: 264 DPSMLFEISNKTGGLFYSV 282 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 40/206 (19%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTR 282 ++ + A LFL L V++ ++G++ + + E + + R + + Sbjct: 320 SDNRLNRMNQASRLFL-----LQTVEQGSWVGMVTFDSTAYVQSELTQLNSGADRDLLIK 374 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ T ++ A+ + I+ LTDGE+N Sbjct: 375 RLPTVASGGTSICSGLQAAFTSIKKKYPTDGAE-----------------IVLLTDGEDN 417 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYN--VVNADS 397 TI C D K + I T+++ + L + + + + + N + Sbjct: 418 -------TISSCFDLVKNSGAIIHTVALG-PSAAKELEQLSKMTGGLQTYSSDQIQN-NG 468 Query: 398 LIHVFQNISQ---LMVHRKYSVILKG 420 L+ F +S + + +G Sbjct: 469 LVDAFAALSSGNAAISQHSIQLESRG 494 >gi|332798629|ref|YP_004460128.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] gi|332696364|gb|AEE90821.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] Length = 185 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 58/192 (30%), Gaps = 25/192 (13%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK---M 68 KK + E N +IFA S++ IG + V + K+NSM++ IL + Sbjct: 6 KKFLQKEDGNILLIFAGSMVLIAFFIGICLDVSMIYVKRNSMQN-----ILQIIREERFT 60 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + R D + H + + + + + +++ ++ Sbjct: 61 YQDTIRYSDNPALTTYHIAYSAAAENGFDGIVTVYFHEEDPEPNYRSYQVRILLTDECPF 120 Query: 129 M--THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + + L E + S +P Sbjct: 121 YFGRIFGLDTVPLNVS---------------LDGGESYGEGSADVIWHSPLPVSNYNGAY 165 Query: 187 ERPIFLIELVVD 198 I +++V D Sbjct: 166 TGIIGDMQVVYD 177 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 66/219 (30%), Gaps = 34/219 (15%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 ++ ++LV+D+SGSM ++MAA K A LD+ Sbjct: 33 TADQDAPKVDLVLDVSGSMRAKD--------------IDGGSRMAAAKQAFNEVLDATPE 78 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + R ++ + P D T A K A Sbjct: 79 EVRLGIRTLGADYPGDNR--------------KEGCKDTAQLYPVGPLDRTEA-KTAVAT 123 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-- 362 L+ + + K I+ ++DGE+ + ++ + I Sbjct: 124 LSPTGWTPIGPALLKAADDLDGGNGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGL 181 Query: 363 KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 I T+ + R L ++ + +V + D L Sbjct: 182 TIDTLGLVPDVKLNRQLSCIAEATGGTYTSVEHRDQLTD 220 >gi|160720|gb|AAA29775.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 574 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/264 (10%), Positives = 67/264 (25%), Gaps = 70/264 (26%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 ++ + + L ++ + ++ ++ ++ + E P V + H Sbjct: 11 EENMNVNVEIVLDASGSMVKKIGDKTMMEIAKESIKKVLSEMPAN---AKVGIRVFGHKG 67 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 N+ + S + Sbjct: 68 DNTASKKDESC----------------GANELI------------------------YPI 87 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + V T +++ + L + Sbjct: 88 GDL-----NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK----------------ALDG 126 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNGQRLLKTCV- 383 + +TDG N ++I + K I + I N N RLLK Sbjct: 127 EKTLNILYIITDGIETCG---GNPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIAD 183 Query: 384 SSPEYHYNVVNADSLIHVFQNISQ 407 ++ Y+ +V +A+ L I++ Sbjct: 184 AAGGYYSSVNDANKLTGELYRINE 207 >gi|124513464|ref|XP_001350088.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] gi|160691|gb|AAA29767.1| sporozoite surface protein 2 [Plasmodium falciparum] gi|23615505|emb|CAD52497.1| Thrombospondin-related anonymous protein, TRAP [Plasmodium falciparum 3D7] Length = 574 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 72/226 (31%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D + +GL+ + + + Sbjct: 97 DISNSMMAQDVIPSRLEKSKLLIEDLVDHFT-------NDKVGLVVFAGDAFVQLPITND 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ + T + + Sbjct: 150 YVSAKMFLQNINPSLITTQGTDLARAISLSQSCFTQREH------------------IGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ +A + I + + + +GQ Sbjct: 192 AIIVITDGEDHEGGA----LEAAREAYKKGINVFILGVGTSKGAPIPDGNGGYLKDNSGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +L + + + +V N + +++ + + Sbjct: 248 TVLSTLNEQMCQQVAKAGNGVYIHVDNTS---DAQEKLNKELSKLQ 290 >gi|307304370|ref|ZP_07584121.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|306902572|gb|EFN33166.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 431 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 16/188 (8%), Positives = 48/188 (25%), Gaps = 6/188 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ + +E ++I A++ + +G + W+ +K ++ A + + A A Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAA 75 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + + G + ++ Sbjct: 76 VRHRAGDQQSALEAAARRVAGGSGFSPGGLTVSTA------PGSAGGSNKVTVELTETHP 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 R+ + + + + L V + Sbjct: 130 RMFSSVFGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNS 189 Query: 186 GERPIFLI 193 FL+ Sbjct: 190 NAADAFLM 197 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 55/166 (33%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 I+ L +GL+ Y + V++ + K + T + Sbjct: 25 FLSQVIESLDVGPNATRVGLVNYASSVKQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGL 84 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A SD +G + S K +I +TDG + +V+ Sbjct: 85 AIHFAITKALSDA----------EGGRPRSPDISKVVIVVTDGRPQDSVRDVSAR----- 129 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I++ I + + L+ S P E+ V + + Sbjct: 130 ARASGIELFAIGVGRVD--KATLRQIASEPQDEHVDYVESYRVIEK 173 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 439 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYM 498 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 499 SGGTATGDAIAFTVRNVF---------------GPIRDSPNKNFLVIVTDGQSYDD---- 539 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + L+ S P + + L + ++ Sbjct: 540 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LRDMASKPKESHAFFTREFTGLEPIVSDV 595 Query: 406 SQLMV 410 + + Sbjct: 596 IRGIC 600 >gi|34481890|emb|CAE46493.1| trap [Plasmodium falciparum] Length = 331 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|34481892|emb|CAE46494.1| trap [Plasmodium falciparum] Length = 331 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNIFSNNAREIIRLHSDASKNKEKALFIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|313902401|ref|ZP_07835804.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] gi|313467332|gb|EFR62843.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] Length = 308 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 5/39 (12%), Positives = 21/39 (53%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME 54 S++ +++ A+S+ L ++G ++ + ++ ++ Sbjct: 17 RSQQGGVAVLVAISLAVLLGMVGLVVDGGRLYAERTRLQ 55 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 64/223 (28%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPDRLSKAKQMLSKLTDGF-------SNDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ S++ + T A+ A + T + K Sbjct: 150 YVSAKMFLSSINPSMVSTQGTAIGAAINLAVRSFTPSETSD------------------K 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + +K I + + + Sbjct: 192 AIILITDGENHEDDAVKAAAAAAEK----GIHVNIVGMGDPKGSPIPVDGSNNYMKDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + + + N + + + + Sbjct: 248 NVVITKLNEEMCQEIAAAGHGTYVRADNT---NSALRALQKEI 287 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/363 (10%), Positives = 95/363 (26%), Gaps = 55/363 (15%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + + LA + V + + + + + + Sbjct: 49 DSADQVFLAASKSRVEVQESYVLPSSTPIIPMPNPPVSENRENYPKTPISPIRQVATDPV 108 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQ------ 167 +T ++ +S + N + ++I Y + Q Sbjct: 109 STFSTDVDTASYTNARRFLNQGMRPPADSIRVEEFINYFDYALPAPDTTNTPIQISTERT 168 Query: 168 ----RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 ++V R + P + ++D+SGSM+ Sbjct: 169 QTPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMN------------------- 209 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K+ ++ + L + + + + Y + +EP+ G +K Q Sbjct: 210 SPDKLPLMQRSFNLLVSQLRPQD------RVAIAVYAGQSGVVLEPTSGDQKA-QINQAI 262 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T + + AY + ++ I TDG+ N Sbjct: 263 NQLRAGGGTHGSAGIHLAYDLAQANYLPDGIN----------------RIFIGTDGDFNV 306 Query: 344 FK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHV 401 S + ++ +E + + + L++ + Y + + + Sbjct: 307 GTTSLTELKALIERKREAGVFLSVLGFGTGNYNDALMEELSNHGNGTAYYLDSYQEARKL 366 Query: 402 FQN 404 F Sbjct: 367 FAT 369 >gi|33333558|gb|AAQ11895.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|160718|gb|AAA29774.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|160723|gb|AAA29776.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ L ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYLNIFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/237 (12%), Positives = 74/237 (31%), Gaps = 45/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I P + ++D+SGSM K+ LK+A L ++ Sbjct: 267 DIAPATTPHANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRLLVNR 307 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ Y + P+ EK + ++ T ++ A Sbjct: 308 LKPDD------TVSIVTYAGNAGTVLTPTRVAEKSK-ILSAIDRLEAGGSTGGAEGIEAA 360 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKEN 360 Y + + ++ TDG+ N S+ + +I ++ +++ Sbjct: 361 YDLAKQGFVKDGVN----------------RVMLATDGDFNVGPSSDEDLKRIIEERRKD 404 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + + L++T + + D+L + + + + + Sbjct: 405 GIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEEAGSTLFPIA 459 >gi|145595544|ref|YP_001159841.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145304881|gb|ABP55463.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 316 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 63/234 (26%), Gaps = 53/234 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L VD+S SM ++AA + A F+ + + Sbjct: 87 ATVMLAVDVSLSMQADDV---------------APNRLAAAQEAAQQFVAEL------PD 125 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + P+ + V + + T + A+ Sbjct: 126 SYNLGLVSFAKAANVLVPPTKDRQAVLAAINGLALAE---STATGEAVFT---------- 172 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ L+DG + +S + + TI+ Sbjct: 173 --CLEAIRSVPADGAAGIPPARIVLLSDGYRTSGRSVEQAAAAAQA---ANVAVSTIAFG 227 Query: 371 ASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L ++ Y Y + L V+Q++ + Sbjct: 228 TDGGQVDIGGQRQRVPVDRLALADLAETTDGYFYEAASVSELKQVYQDMGSSIG 281 >gi|58429485|gb|AAW78146.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429473|gb|AAW78140.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429483|gb|AAW78145.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429501|gb|AAW78154.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429487|gb|AAW78147.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429453|gb|AAW78130.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|332215868|ref|XP_003257064.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VII) chain-like [Nomascus leucogenys] Length = 2944 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 47/171 (27%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAVSAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ K I +TDG++ + + K + Sbjct: 129 RVFLPQL---------------ARPGIPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 56.4 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/233 (12%), Positives = 66/233 (28%), Gaps = 41/233 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS------------ID 243 V S C + + T + + + + Sbjct: 103 SVGSSYRCACPEGRRLQADGKSCGRCGSGPTDLVLVIDGSKSVGAQQFELVKRWVGELVG 162 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L +GL+ +++RV K + + K T + A++ + Sbjct: 163 SLDVSPAGTRVGLVQFSSRVRTEFPLGRHGTKAEAEAAVRAVTPMDKGTMTGLALRHLVE 222 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 S+ + ++ + +TDG + + S +AK+ I Sbjct: 223 RGFSEAEG-------------ARPGSRRVGLLVTDGRSQDDVSP-----WAARAKDRGIV 264 Query: 364 IVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + + + L+ S P + + F +++L+ H K Sbjct: 265 MFAVGVG--KAVEEELREIASDPVERHVSYAAD-------FGALTRLLDHLKL 308 >gi|291569213|dbj|BAI91485.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 412 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 45/160 (28%), Gaps = 25/160 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K A +D +++ + ++ + R + + + + Sbjct: 59 LETVKQAAKELIDRLNVGD------RISVVAFDHRAKVLVP-NQDIADPDGIKKKIDGLR 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K + L K+ F LTDGEN + + Sbjct: 112 CSGGTAIDEGLKLGIEELGKGKQDRISQGF-----------------LLTDGENEHGDNK 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A E + I ++ N L K + Sbjct: 155 RCLK-LAKLATEYKLTINSLGFGNDWNQDILEKIADAGGG 193 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 75/258 (29%), Gaps = 56/258 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + L + + Sbjct: 948 RKKLNLALVLDISGSMGSSFDEYY--------------------------------YDRF 975 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + VN +D +++K+ A++ LD ++ +GL+ + T E Sbjct: 976 GNHVAVND---TEDAEKSKIEIAAAAIVALLDHLEDDD------RLGLVLFNTGAELAEP 1026 Query: 269 PSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S K Q + D+ + T + M+ A ++ + +++ Sbjct: 1027 VSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMATELYD-------------EFLEVNQS 1073 Query: 328 PFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 ++ IIFLTD N+ +++ + + + + I I N + + Sbjct: 1074 EYENRIIFLTDAMPNSGQTSEESLLGMIEANANKNVYTTFIGIGVDFNTELVEYITKIRG 1133 Query: 387 EYHYNVVNADSLIHVFQN 404 +Y+V +A + Sbjct: 1134 ANYYSVHSATQFKERMDD 1151 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 64/211 (30%), Gaps = 34/211 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM ++MAA K A LD++ + Sbjct: 41 VELVLDVSGSMRARDIDGK--------------SRMAAAKQAFNEVLDAVPEEVRLGIRT 86 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R ++ + P D T A K A L Sbjct: 87 LGADYPGDDR--------------KRGCKDTRQLYPVGPLDRTEA-KAAVATLAPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISIN 370 + + I+ +TDGE+ + ++ + I I T+ + Sbjct: 132 IGPALLGAAEDLEGGDATRRIVLITDGEDTCAPLDPC--EVAREIAAKGIHLVIDTLGLV 189 Query: 371 ASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + L+ ++ + +V + D L Sbjct: 190 PDAKTRTQLRCIAEATGGTYTSVQHTDELSD 220 >gi|156370019|ref|XP_001628270.1| predicted protein [Nematostella vectensis] gi|156215242|gb|EDO36207.1| predicted protein [Nematostella vectensis] Length = 1552 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 49/137 (35%), Gaps = 17/137 (12%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 +++ + L T A+K + + L ++K +K Sbjct: 12 NNVNIKRDIDEL--RLERGLTFIDKALKISAEKLFTEKNGMRLNR-------------KK 56 Query: 332 FIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + LTDG + K K+ +++ ++ I + + L+ S +Y + Sbjct: 57 VALVLTDGIQTKDKGPFTPLQKASQPMKDKGVEVYSLGIGSDIDVSELITF-ASGEKYVF 115 Query: 391 NVVNADSLIHVFQNISQ 407 N + D L +NI+Q Sbjct: 116 NAKSFDELQLQVENITQ 132 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 30/354 (8%), Positives = 84/354 (23%), Gaps = 39/354 (11%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + D+ ++ S ++ + + + + L+ N Sbjct: 238 AFSSNWDGIDSWDGNPANLTNSYIEHAGTNNPVDFALRKYARETETPGLYDVFLNVRGN- 296 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 N + + R + V + Sbjct: 297 -VQNPIKPVDIVLVVDMSGSMEGAREGAIKQGV--KSFLSSIENTAYAQYVNVGLV--GY 351 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-LFLDSIDLLSHVKEDVYMGLIGYTTR 262 + P+ ++A+ AL F+ +++ M Sbjct: 352 SSPGYISNSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQM-------- 403 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + D + + A + + +D + Sbjct: 404 ------LKEDASGNEKMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDSNSLDEPSFTSQ 457 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-------G 375 + T + T+ AK++ +I T+ I + Sbjct: 458 LWMMNGNNRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSS 517 Query: 376 QRLLKT---------CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 L SS + + +A+++ +N + +++ +++ I G Sbjct: 518 NTYLSQEEVRKRTSLIASS-GLYQDADSAENITDYLKNQADVVLS-RFNTITNG 569 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 48/381 (12%), Positives = 112/381 (29%), Gaps = 53/381 (13%) Query: 55 SANNAAI---LAGASKMVSNLSRLGDRFE-SISNHAKRALIDDAKRFIKNHIKESLSGYS 110 + + + L G + + S + + S K A+ ++ +++ ++LS + Sbjct: 391 QSLDVGVAGPLMGIIQYGDDPSTEFNLKTYANSKDLKNAIEKILQKGGLSNVGKALSFVN 450 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR------------- 157 F+ N + +++ + D + + + Sbjct: 451 KNFFEDSNGNRGAAPNVAIVMVDGWPTDKVEEASRLARESGINIFFVTIEGPDENEKQNV 510 Query: 158 --LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 F++ + + ++ ++ + + I L+ S C ++D V Sbjct: 511 VEANFVDKAVCRTNGYYSINVPSWFSLHKVVQPLVKRICDSDHLACSKTCLNSADIGFVI 570 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + L + D +G + YT + K Sbjct: 571 DGSSSVGTGN-----FRTVLQFVANISKEFEISDTDTRIGAVQYTYEQRLEFGFEKQSTK 625 Query: 276 --VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V + R T + A+ A++ L K +K + Sbjct: 626 QDVLNAIKRINYW--SGGTSTGAAINYAFEQLFIKSK----------------PNKRKIM 667 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYN 391 I +TDG + + A +N + ++ I + Q L+ + P+ + + Sbjct: 668 ILITDGRSYDD-----VQGPATAAHQNGVITYSVGI--AWAAQDELEAIATDPDKEHSFF 720 Query: 392 VVNADSLIHVFQNISQLMVHR 412 V DSL I Q + Sbjct: 721 VDEFDSLYRFVPRIIQNICTE 741 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 49/186 (26%), Gaps = 31/186 (16%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDS 294 D L MG+I Y ++ ++ + + ++ Sbjct: 385 FLKDVAQSLDVGVAGPLMGIIQYGDDPSTEFNLKTYANSKDLKNAIEKI--LQKGGLSNV 442 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + D + I + DG + + Sbjct: 443 GKALSFVNKNFFEDSNGNRGAA-------------PNVAIVMVDGWPTD-----KVEEAS 484 Query: 355 DKAKENFIKIVTISINASPNGQRLL-------KTCVSSPEYHYNVVNAD--SLIHVFQNI 405 A+E+ I I ++I ++ V +Y++ SL V Q + Sbjct: 485 RLARESGINIFFVTIEGPDENEKQNVVEANFVDKAVCRTNGYYSINVPSWFSLHKVVQPL 544 Query: 406 SQLMVH 411 + + Sbjct: 545 VKRICD 550 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT K + Sbjct: 524 FRTVLQFVANISKEFEVSDTDTRVGAVQYTYEQRLEFGFDQHRTKSDLLSAIKRVNYWSG 583 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A++ A + L K +K +I +TDG + + Sbjct: 584 GTSTGAAIRYALERLFEKSK----------------PNKRKLMIVITDGRSYDDVRIPAL 627 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQL 408 A + I I + Q L+ S P+ + + V D+L I Q Sbjct: 628 -----AAHRKGVITYAIGI--TWAAQDELEVMASDPDKDHAFFVDEFDNLYTFVPQIIQN 680 Query: 409 MVHR 412 + Sbjct: 681 ICTE 684 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 42/153 (27%), Gaps = 21/153 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + K L + ++D+ +G++ Y T + Sbjct: 320 RRFQIQKQFLKTMVQALDVGPAGP---LLGIVQYGDDPTTEFNLKTHTNSRDLKAAIEKI 376 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + ++ A+ + SD + + + DG + Sbjct: 377 TQKGGLSNVGRALSFVNKNFFSDANGNRGGAA-------------NVAVVMVDGWPTD-- 421 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + A+E+ I I I+I + ++ Sbjct: 422 ---RVEESSRLARESGINIFFITIEGAAESEKQ 451 >gi|307102430|gb|EFN50705.1| hypothetical protein CHLNCDRAFT_28788 [Chlorella variabilis] Length = 344 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 76/239 (31%), Gaps = 30/239 (12%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L N ++ A L+ RP + L++D+SGSM + S D Sbjct: 59 LRNASAARRRSLLQAASLQASDFRRPRLNLVLLLDVSGSMGESFMSYYYDAAGTQQNLTA 118 Query: 224 ---KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKV 276 TK+ K L LD + + ++ ++TR S + Sbjct: 119 EELNTTKIDVAKEVLSGVLDLLAPNDS------VAIVLFSTRACTPQPLSRVSCLDIPAL 172 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + + D T + + A L +G+ + I+ + Sbjct: 173 QAQIDK--DMHATSSTSLSAGLDLAIAELKK----------CSEGMSASLTDTENRIMVI 220 Query: 337 TDGENNNFKSNVNTIKICDKAK---ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 TD + N+ + T + + + ++ I I + N + +Y+V Sbjct: 221 TDQQPNSG--DYTTGGLAARLRKDADDGIFTTIIGVGLDLNSELAESISKVRGANYYSV 277 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 60/200 (30%), Gaps = 28/200 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K F+D ++ + + G + Y+ V+ + + Sbjct: 73 DNIKQFTTQFIDKLNERYYRCDRNLMWNAGALHYSDEVQLISGLTSMRTGRSGLKDQVSK 132 Query: 286 SLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + A+K+ + L K++I +TDG Sbjct: 133 VVSIGKGTYTDCAIKRGIEELLIGG---------------SHHKENKYMIVVTDGHPLEG 177 Query: 345 KSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD---- 396 ++AK IK+ +++I + + + L + ++ Sbjct: 178 YKEPCGGLEDAANEAKHQGIKVFSVAI-SPNHLESRLSVIATDQAYRRNFTATGPSLRAR 236 Query: 397 SLIHVFQNISQLMVHRKYSV 416 + + I ++ +V Sbjct: 237 DIDNTIDTIIDMIKSNSRAV 256 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 40/365 (10%), Positives = 102/365 (27%), Gaps = 42/365 (11%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ--N 120 A + N+ L + +++ A +A +F ++++ + + V + Sbjct: 685 ALGDANIDNIGALKEAVKNLRWIAGGTYTGEALQFTRDNLAQRFTSEKRVAIVITDGRSD 744 Query: 121 IVNSSRISMTHMANNRLDSS-NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 I+ N + + + QR N + Sbjct: 745 ILRDQTPLSALCGFNTQVVPLGIGDIFQGPPNPEQLSTIS-CGGVSIQRENFAELLDDTF 803 Query: 180 LLRIEMGERPIFLIE-LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 L + +S S + + S K +K F Sbjct: 804 LQSVVSEVCKDKKCPDYTCPISFSGPTDITLVVDSSTSVGSRNFNTTKKF--VKRLAERF 861 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ V + ++ Y+ R + + + V + I TD Sbjct: 862 ---LSAAKPTEDAVRVSVVQYSGRTQQKLEVPFEQNYTVIADSVDKMQ--FINDATDVNA 916 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + + +K ++ +DG + + + + Sbjct: 917 ALNYVTSLF----------------RRSSRSGAKKRMLIFSDGNSQGITQSAIERAV-QE 959 Query: 357 AKENFIKIVTISINASPN---------GQRLLKTCVSSPEYHYNVVNADSLIH--VFQNI 405 A++ I+I + + N G+ + + V + +SL+ +Q + Sbjct: 960 ARQAGIEIYVLVVGTQANEPNVRVLVTGKTAEYDVAFGERHLFRVPDYESLLRGVFYQTV 1019 Query: 406 SQLMV 410 S+ + Sbjct: 1020 SRKIS 1024 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 69/188 (36%), Gaps = 30/188 (15%) Query: 234 ALLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILK 290 + F+ + ID + +G+I ++ +VE T K ++ V + + + Sbjct: 707 STKTFMKTVIDEVLANHAKTRIGVINFSHKVELVSSLEKYTTKESLKSAVDKM--LYLGE 764 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+K+A + + +K + +TDG+ + + V+ Sbjct: 765 GTYTASAIKKAINLFQA-----------------ARPAVRKVAVVVTDGQADA-RDEVHL 806 Query: 351 IKICDKAKENFIKIVTISI--NASPNGQRLLKT---CVSSPE--YHYNVVNADSLIHVFQ 403 + +A I+I I I P+ LK + P+ + Y + + +L + Sbjct: 807 DMVVREAHAANIEIFVIGIVQETDPHYDNFLKEMHLIATDPDEEHFYRIEDFKTLSALTD 866 Query: 404 NISQLMVH 411 + + Sbjct: 867 KLITKICD 874 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 63/178 (35%), Gaps = 29/178 (16%) Query: 232 KNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDS 286 KN +L D+I + VK DV + + +++ V + + ++ + + Sbjct: 30 KNFVLSLTDNIFQMKPVKSQKYDVKLAGMQFSSTVSVDHPFIAWKNVQNFKEKIRALV-- 87 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 I + T S A+ A Q+ ++ + K +TDG ++ Sbjct: 88 YIGQGTYSYYAISNATQLFKTEGREGSI----------------KVAFLMTDGVDHPNSP 131 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV---VNADSLIHV 401 + I A+ I TI ++ + L+ S +V + + ++ V Sbjct: 132 S--VEGIATAARSLGIHFFTIGLSKKNVKEEKLRLI-SGDSSFKHVLCLDDENLIVDV 186 >gi|194221347|ref|XP_001494879.2| PREDICTED: similar to alpha 1 type VII collagen [Equus caballus] Length = 3065 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 47/172 (27%), Gaps = 25/172 (14%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIHAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + LK S + ++ + V + L + +S+ + Sbjct: 169 KLFAVGIKNADHEE--LKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 64/190 (33%), Gaps = 25/190 (13%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDM 284 + A+K AL + ++ L + V +GL+ Y+ R S + Q + Sbjct: 1068 RAEAVKRALERLVSALGPLGP--QAVQIGLLSYSHRPSPLFPLNSSHDPGVILQKIRSIP 1125 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + A+ A++ L + RQ ++ L D E Sbjct: 1126 YMDPSGN-NLGIAVVTAHRHLM-----APDAPGRRQH-------VPGVMVLLVD-EPLRG 1171 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + A+ +K++ + + + ++L + S+ + + V + SL Sbjct: 1172 DIFNPVRE----AQAAGLKVMMLGL-VGADPEQLRRLVPSTDPVQNFFAVNDGSSLDQAV 1226 Query: 403 QNISQLMVHR 412 +++ + Sbjct: 1227 SSLATALCQT 1236 >gi|86609942|ref|YP_478704.1| hypothetical protein CYB_2508 [Synechococcus sp. JA-2-3B'a(2-13)] gi|86558484|gb|ABD03441.1| hypothetical protein CYB_2508 [Synechococcus sp. JA-2-3B'a(2-13)] Length = 249 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 53/176 (30%), Gaps = 22/176 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A L + + L + + + E +R+ V R + Sbjct: 92 LDQFRPAPLDLGAILRFDNRGAGFGLTALGQPLRTAQLLQDFTGDKELLRRGVLRAV--- 148 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFK 345 T A +A + L+ + F + ++ TDG +N Sbjct: 149 PGGNTALYDATVEAGRFLSDFRPTERFN---------------RRLVVFTDGIDNESTRS 193 Query: 346 SNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N T ++ A+E + + + + N L + ++ AD L Sbjct: 194 INQATQELTTLARERGQKLTVYVVGLGVDLNLLELQRLAAATEGTFVLARFADQLS 249 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 66/217 (30%), Gaps = 50/217 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D +++ K + F+ + D L+ + L+G+ + Sbjct: 97 DLSRSMDADDVWPSRLERAKLEIEKFVKNTDNLA-------VSLVGFAGTSFIACPFTQD 149 Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + S+ L+ T A+ A +K Sbjct: 150 METFSYILDNLSTKSVTLQGTRIADALVTAKNTFN------------------VDAVSKK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGE++ + + K+ I + TI + S Sbjct: 192 SIVLITDGEDHGGYFDEVL----KELKDMNISVYTIGVGTSQGAAISTDLGVREKSVISK 247 Query: 374 -NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L S+ Y N L +F ++ + M Sbjct: 248 RDDNTLKLIADSTLGKSYIAENVS-LESIFDDMKKNM 283 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 51/153 (33%), Gaps = 29/153 (18%) Query: 254 MGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + + E + + + + T +K +Q + + Sbjct: 347 VGMVTFDSSAEIQNNLTKITDDNAYQNIIAKLPQ-FAGGGTSICNGLKAGFQAIVYSNQS 405 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + + II LTDGE+N + C ++AK + I +I++ Sbjct: 406 TSGSE----------------IILLTDGEDN-------QMSSCFEEAKASGAIIHSIALG 442 Query: 371 ASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + ++ + + LI F Sbjct: 443 -PSAAKELETLSNMTGGLRFSANKDINGLIDAF 474 >gi|301604824|ref|XP_002932067.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2881 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 61/195 (31%), Gaps = 20/195 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 D R++ AL++ + + ++ ++ + + Y+ N + + +K + Sbjct: 821 IDGTDDARSRFPALRSFIQRVVQNL---DVGQDKTRIAVAQYSDSARPNFQLNTHRDK-Q 876 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + F + +F+I LT Sbjct: 877 GVLDAIQRLTPIGGPSLNTG---------AALDFVTRNVFTKSAGSRSDEGVPQFLILLT 927 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G++ + K + I + L+T P++ ++V +A Sbjct: 928 TGKSRDDVGRP-----ATSLKGQGVIPFAIG--TNKADTTELQTISFVPDFTFSVSDASQ 980 Query: 398 LIHVFQNISQLMVHR 412 L V+QN + + Sbjct: 981 LTDVYQNFANKVSQL 995 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/356 (9%), Positives = 99/356 (27%), Gaps = 38/356 (10%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 K +++ A+L ++ + +++ AK + A I+ + + L Sbjct: 1271 KTNLQK---QALLNNIRRLRLKGGAPLNIGKAVEYVAKNQFVKSAGSRIEEGVPQYLVLL 1327 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + ++ + N ++ S Sbjct: 1328 TGGKSEDDVSRSAR-------LLHNAKVQSVAVASGTTDRKDIE-TIVTD---------- 1369 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + I + + + E ++++ + ++ + + Sbjct: 1370 TRLIFNIKEFRELPSVERKIYRSFEGEIEVTPTPAITTTISKKEADIVFLVDGSINLGRD 1429 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K L +D + ++ + + L Y + V T++ + Sbjct: 1430 NFKEVLQFVSGIVDAVFDEEDAIQIALAQYNSDVTDEFFLKDFTDRDQIMDAVTKAEYKG 1489 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 ++ A + L +F ++ S + +T G+ + Sbjct: 1490 G---RVASLGAAIRHLQDK-------HFVKEAGSRVSSGVPQIAFVITGGK---SVDDGQ 1536 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + +K+ I + A +G + K +P + V N L + + I Sbjct: 1537 SAAMA--LSNKGVKVFAIGVGA-IDGDEVAKIASDAPS-AFRVPNVQELSELNEQI 1588 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 56/195 (28%), Gaps = 28/195 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D ++ D + L N + L + + +G++ Y+ V+ + Sbjct: 37 DGSNNVGRSDFSYIR-DFLVNYVQ------RRLEVGRSRMQVGVVQYSDDVKTEFSLAQN 89 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K Q V + + + A+ F Sbjct: 90 PTKG-QLVEALKNLRFIGGEEANLGAALDY-----------VVENVFTTAAGSRREEGVP 137 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ +T G S+ + + + K N I T I L+ + + + Sbjct: 138 QSLMIITAG-----PSSDDFREATNALKLNSII--TFGIGVERADIAELQQIATDESFVF 190 Query: 391 NVVNADSLIHVFQNI 405 + L + +NI Sbjct: 191 STPQMQDLSTLERNI 205 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 43/381 (11%), Positives = 97/381 (25%), Gaps = 37/381 (9%) Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 N+ + N L + A + A T + + + Sbjct: 458 NVGDEVPYTFVVRNSGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVD 517 Query: 131 HMA--NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 N +S N ++ I+ L +++I + + Sbjct: 518 SEGHFTNTATASGTDDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVT 577 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS-- 246 + + ++ KM ALK+A+ FL ++ Sbjct: 578 GAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQ 637 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 E + +GL+ + + + + ++ T +K L Sbjct: 638 SGSELIRVGLVKFAGKESSKVG---NETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALR 694 Query: 307 SDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENN-----NFKSNVNTIKICDKAKE 359 + F ++ +IF TDG + + + K+ Sbjct: 695 HNGATRADLGFKHASTVMSGARTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKD 754 Query: 360 NFIKIVTISINASP-------NGQRLLKTCVSSP----------------EYHYNVVNAD 396 + + +I + + + VSS Y+ V N Sbjct: 755 SGATVYSIGVFDGANPSSIEEDQKNQFMNAVSSNYPHATAYDKLGTGSNAGYYKVVSNVS 814 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L +F+ I ++ + Sbjct: 815 DLKSIFEKIQTNETTDGFTNV 835 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 51/166 (30%), Gaps = 19/166 (11%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + +D L + + L+ Y + V K + T + Sbjct: 91 FLSEMVDSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 150 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + + G + K I +TDG + ++ Sbjct: 151 AIKTAMEQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 195 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + R LK S P ++ + V + Sbjct: 196 ARASGIEIYAVGV--DRAEVRSLKQMASQPLDDHVFYVETYGVIEK 239 >gi|295093844|emb|CBK82935.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 343 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/200 (10%), Positives = 54/200 (27%), Gaps = 23/200 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDS 294 LD + + G++ + T + E V+ + L + + Sbjct: 113 ENLLDKLKKTVDELQGERFGIVIFNTSPVLLTPLTDDYEYVKDQLDLIAQCLKSRNEVNL 172 Query: 295 TPAMKQAYQ------------ILTSDKKRSFFTNFFRQGVK----IPSLPFQKFIIFLTD 338 A Y ++ ++++ S K +IF TD Sbjct: 173 DDAFSSGYDWIYYQAYISSGTLIGNEQRGSSLIGDGLAAAAIDFSDADKERTKVVIFSTD 232 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVV--- 393 + + D N + + + N + + + Y Sbjct: 233 NDIQ-GTPVATLDEAADICVSNNVTVYGVGTKEMTPENKESMKNAVEKTGGKFYLEEESG 291 Query: 394 NADSLIHVFQNISQLMVHRK 413 + ++ + +S+ +V + Sbjct: 292 SFGEIVSSIEKLSKNLVKVR 311 >gi|170692557|ref|ZP_02883719.1| putative transmembrane protein [Burkholderia graminis C4D1M] gi|170142213|gb|EDT10379.1| putative transmembrane protein [Burkholderia graminis C4D1M] Length = 373 Score = 56.4 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 24/285 (8%), Positives = 68/285 (23%), Gaps = 7/285 (2%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K ++ ++ AL ++ L + I + + ++ +++A +AA +AGA ++ Sbjct: 5 KSRRKQEGAVAVTVALCMLFLLGIAALAIDIGNLLVARSQLQNAADAAAMAGAGCLMRRT 64 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 A + ++ ++ + Sbjct: 65 ECGNTTATQPDWITASAKASSFSTSTVTNKVQADYVKASTVATGYWNATGTPYGLESLPF 124 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 D + + D + F +L S + + G Sbjct: 125 TPGASDLPAVQVTIHKDGSNANGAVPVFFGSVLG--VQILKASAVATAVLSTPGNVGPGG 182 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +M + + + ++ + A + Sbjct: 183 L-----FPLAMSKCLYDNYWNAATSSPKLSPNNNTLPGTSVAQQQGQPYFFQIGSSYHYG 237 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +TT + S+ + + T Sbjct: 238 ACESGQWTTFNVNDNSASYARGLLTSGNSTTFSIGASPGTWIQTG 282 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 80/309 (25%), Gaps = 14/309 (4%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + S + D + + S+S + E +N + Sbjct: 604 NVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLTMKVPQEDAKNINLQIEAKATE 663 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 A + D N + + + + ++N + I+ + + + Sbjct: 664 ARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAGKDILIGDTGGTELNVQAGKNY 723 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LVVD SGSM A S +++ LK+AL DS+ Sbjct: 724 NIALVVDTSGSMKEASGSKT--------AWGTTISRIDLLKDALKNLADSLKGHDGKINV 775 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + + + +T+ T+ A + + Sbjct: 776 SIIDFDTNAKEPITFNDLTS--KNISDLITKIDALKAEGGTNYEDAFLKTTSWFDTQSVT 833 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIVTISI 369 + + F T N + + D K + I I Sbjct: 834 YGKAQGYENLTYFLTDGDPTFSNRDTK-NNGSTTEYSDMKDAVDAFKTLSGQSTVHAIGI 892 Query: 370 NASPNGQRL 378 N L Sbjct: 893 GNGINENYL 901 >gi|308501643|ref|XP_003113006.1| CRE-MUA-3 protein [Caenorhabditis remanei] gi|308265307|gb|EFP09260.1| CRE-MUA-3 protein [Caenorhabditis remanei] Length = 3860 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 KN +L F+ +L + +GLI Y+ ++ + ++ + Sbjct: 1246 VFKNEVLRFVSEFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQYL 1305 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ Q R+G + + I LTDG + + Sbjct: 1306 TGLTRTGAAIQHMVQ----------EGFSERRGARPQQSDIARVAIILTDGRSQD----- 1350 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N D A++ I I + L++ SP + V L ++ Sbjct: 1351 NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 1405 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 10/40 (25%), Positives = 16/40 (40%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK 50 ++ ++ NF I+ A+ L G I V D K Sbjct: 11 LRRMLSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSK 50 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 60/186 (32%), Gaps = 27/186 (14%) Query: 231 LKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K F+D + + + G + Y+ V + + +R V Sbjct: 90 VKEFTKKFIDKLTSRYYRCDRNLVWNAGALHYSDEVILINSLTRDMKTLRDNVE--TVEY 147 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I K T + A+K+ + + K++I +TDG Sbjct: 148 IGKGTHTDCAIKRGIEEVLIGGSHQKEN---------------KYLIVVTDGHPLEGYKE 192 Query: 348 VN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQ 403 ++AK IK+ +++I + + + L S + ++ +A L Sbjct: 193 PCGGLEDAANEAKHLGIKVFSVAI-SPNHLEPRLSVIASDASHRRNFTATSAVGLTD--D 249 Query: 404 NISQLM 409 I + Sbjct: 250 EIDNTI 255 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 68/221 (30%), Gaps = 37/221 (16%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM TK+ K+A+ + L+ + Sbjct: 294 DVIFVIDVSGSMF--------------------GTKIKQTKSAMHVILNDLHRDDSFNII 333 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + ++ + + + + YV + TD A+ A I + Sbjct: 334 TFSDVVHVWRPGQSIPATAQNKKSAKDYVNKI---EADGWTDINAALMAAASIFNQTSHK 390 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISIN 370 K S IIFLTDGE + + + KA I + ++ Sbjct: 391 ---------PEKETSTKKIPLIIFLTDGEATSGVLATSRILSNAQKAMGGTISLFCLAFG 441 Query: 371 ASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 + + + + + Y +A L + I+ Sbjct: 442 EDADYNLMRRLSLENRGIARRIYEYSDATLQLKGFYDEIAS 482 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/243 (11%), Positives = 71/243 (29%), Gaps = 73/243 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + +D+S SM + +D ++ K + ++ + Sbjct: 70 DVVFALDVSKSM---------------LAEDIAPNRLEKSKQLITQIVNELG-------G 107 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +G+IGY + + + ++ D + + T + A++ + +++ Sbjct: 108 DRVGIIGYAGSAFPQVPITTDFASTKLFLNSMDTDMVSSQGTAISEAVQMSTTYFDDEEQ 167 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 ++ + + ++DGE++ + A EN IK+ I + Sbjct: 168 KN------------------RVLFVISDGEDHEGN----FESAIENATENGIKVYAIGVG 205 Query: 371 A-------------------SPNGQ---------RLLKTCVSSPEYHYNVVNADSLIHVF 402 + Q L + N N +++ Sbjct: 206 TEKGGPIPIKRNGVLQSYKRDQDNQQVITRLGEPTLQAIADEGNGKYINGTNTQAVVDEV 265 Query: 403 QNI 405 + I Sbjct: 266 KAI 268 >gi|89100218|ref|ZP_01173085.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085068|gb|EAR64202.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 456 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/216 (11%), Positives = 66/216 (30%), Gaps = 34/216 (15%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 S D + + + +T+M K ++ F S+ +++ VY Sbjct: 143 GGSQESPASFNVEIVLDASGSMANKLGSKTRMELAKESIKEFASSLPEEANISLRVYGHK 202 Query: 257 IGYTTRV--------EKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + E P + ++ + + T ++ +A + L Sbjct: 203 GTGSDSDKKMSCSSNELVYPPQPYNEGELNSALDKFN---PAGWTPLAQSLIEAQKDL-- 257 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + + ++DG + N ++ K++ + + Sbjct: 258 --------------AQFEGQDNKNMVYVVSDGIETC---DGNPVEAAKDLKDSGVAPVVN 300 Query: 366 TISINA-SPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I + + Q+L + ++ + NV + L + Sbjct: 301 IIGFDVKGKDQQQLEEVAKAAGGTYQNVTSQQQLQN 336 >gi|301781320|ref|XP_002926070.1| PREDICTED: collagen alpha-1(VII) chain-like [Ailuropoda melanoleuca] Length = 2994 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/354 (8%), Positives = 83/354 (23%), Gaps = 39/354 (11%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + D+ ++ S ++ + + + + L+ Sbjct: 232 AFSSNWDGIDSWDGNPANLTNSYIEHAGTNNPVDFALRKYARETETPGLYDVFLNV--RG 289 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 N + + R + V + Sbjct: 290 NVQNPIKPVDIVLVVDMSGSMEGAREGAIKQGV--KSFLSSIENTAYAQYVNVGLV--GY 345 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALL-LFLDSIDLLSHVKEDVYMGLIGYTTR 262 + P+ ++A+ AL F+ +++ M Sbjct: 346 SSPGYISNSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQM-------- 397 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + D + + A + + +D + Sbjct: 398 ------LKEDASGNEKMIILMTDGVPTFSNKVSSAQLEGGVLYGTDFDSNSLDEPSFTSQ 451 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-------G 375 + T + T+ AK++ +I T+ I + Sbjct: 452 LWMMNGNNRTPAPYTVSGETINDTWAATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSS 511 Query: 376 QRLLKT---------CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 L SS + + +A+++ +N + +++ +++ I G Sbjct: 512 NTYLSQEEVRKRTSLIASS-GLYQDADSAENITDYLKNQADVVLS-RFNTITNG 563 >gi|281343115|gb|EFB18699.1| hypothetical protein PANDA_015680 [Ailuropoda melanoleuca] Length = 2904 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 41 SGAASAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 100 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 101 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 140 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 141 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 190 >gi|119585300|gb|EAW64896.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_d [Homo sapiens] Length = 2978 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|119585298|gb|EAW64894.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_b [Homo sapiens] Length = 2609 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|119585297|gb|EAW64893.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_a [Homo sapiens] Length = 2944 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|119585301|gb|EAW64897.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_e [Homo sapiens] Length = 2597 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|495866|gb|AAA58965.1| collagen type VII [Homo sapiens] Length = 2912 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|4502961|ref|NP_000085.1| collagen alpha-1(VII) chain precursor [Homo sapiens] gi|1345650|sp|Q02388|CO7A1_HUMAN RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor gi|987125|gb|AAA75438.1| alpha-1 type VII collagen [Homo sapiens] Length = 2944 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|116625802|ref|YP_827958.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228964|gb|ABJ87673.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 326 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 68/215 (31%), Gaps = 41/215 (19%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 H A + P V D + +M A+ L + + L+ ++ R Sbjct: 90 HFATDDAPASVGLVFDTSDSMQPRMNKAHEAVEALLKNANPAD------EFFLVQFSDRA 143 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +E++ + R I T A+ A + + S Sbjct: 144 RLVAGMTKDSEEISR---RAASMRIGGSTALLDAVAMAMEEMKSA--------------- 185 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---------- 373 +K ++ ++DG++N+ + VN +K +E + I I I Sbjct: 186 ---HYLRKVMVIISDGDDNSSRCPVNDLK--RIVREGDVTIYAIGITDDNVPLAYPQRDR 240 Query: 374 -NGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 G LL + + + V L + IS Sbjct: 241 LTGAALLNEIATQTGGRLFEVHKLKQLPEIAAKIS 275 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 59/199 (29%), Gaps = 46/199 (23%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I +V+D+S SM + D T++A K + + Sbjct: 86 ETVTSSEVSADIMVVLDVSRSM---------------LADDVAPTRLARAKAEVAELSSA 130 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQ 300 + +GL+ + R + R + S+ T+ A+++ Sbjct: 131 L-------RGHRIGLVAFAGRASVLAPLTPDYGFFRMILDGVDTKSVSRGGTEIGQALRK 183 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + K I+ +TDGE++ +A E Sbjct: 184 AVRSFDPGPGA-------------------KMILLITDGEDHGG----YAEDAAREALEA 220 Query: 361 FIKIVTISINASPNGQRLL 379 +++V I + Q L Sbjct: 221 GVRVVAIGFGSEQGSQITL 239 >gi|58429509|gb|AAW78158.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 62/186 (33%), Gaps = 25/186 (13%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + +K+A+ L + I L+ +++ + ++ ++ I Sbjct: 44 NDQVDLYLLMDCSGSIRRHNWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRL 103 Query: 270 SWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K ++ + +L T+ T A+ Q + L I Sbjct: 104 HSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLNDR---------------I 148 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTC 382 + ++ LTDG ++ + + +K K + +KI I N R L C Sbjct: 149 NRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAVFGIGQGINVAFNRFLVGC 205 Query: 383 VSSPEY 388 S Sbjct: 206 HPSDGK 211 >gi|58429493|gb|AAW78150.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 569 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 62/186 (33%), Gaps = 25/186 (13%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + +K+A+ L + I L+ +++ + ++ ++ I Sbjct: 44 NDQVDLYLLMDCSGSIRRHNWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRL 103 Query: 270 SWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K ++ + +L T+ T A+ Q + L I Sbjct: 104 HSDASKNKEKALIIIKSLLSTNLPYGKTNLTDALLQVRKHLNDR---------------I 148 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTC 382 + ++ LTDG ++ + + +K K + +KI I N R L C Sbjct: 149 NRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAVFGIGQGINVAFNRFLVGC 205 Query: 383 VSSPEY 388 S Sbjct: 206 HPSDGK 211 >gi|82703475|ref|YP_413041.1| hypothetical protein Nmul_A2358 [Nitrosospira multiformis ATCC 25196] gi|82411540|gb|ABB75649.1| putative membrane protein [Nitrosospira multiformis ATCC 25196] Length = 437 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 25/58 (43%), Gaps = 5/58 (8%) Query: 4 LSRFRFYFKKG-----IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 + R ++ EK +II ALS++ + G + + + K+ ++++ Sbjct: 5 MKRVSDRLRRNRNHPGCRQEKGVVAIIVALSLVVLVGFAGLALDLGKLYVAKSELQNS 62 >gi|268573054|ref|XP_002641504.1| C. briggsae CBR-MUA-3 protein [Caenorhabditis briggsae] gi|187031289|emb|CAP29350.1| CBR-MUA-3 protein [Caenorhabditis briggsae AF16] Length = 3770 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 KN +L F+ +L + +GLI Y+ ++ + ++ + Sbjct: 1252 VFKNEVLRFVREFVELFEIGRGKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQYL 1311 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ Q R+G + + I LTDG + + Sbjct: 1312 TGLTRTGAAIQHMVQ----------EGFSERRGARPQQNDIARVAIILTDGRSQD----- 1356 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N D A++ I I + L++ SP + V L ++ Sbjct: 1357 NVTGPADAARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 1411 >gi|312139254|ref|YP_004006590.1| lipoprotein [Rhodococcus equi 103S] gi|311888593|emb|CBH47905.1| putative lipoprotein [Rhodococcus equi 103S] Length = 314 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/229 (7%), Positives = 55/229 (24%), Gaps = 16/229 (6%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 E+ +++ A+ ++ L + + + K +++ A A + S Sbjct: 4 DERGVVAVLVAILMVVLLGCAAISVDIGANYVVKRQLQN----GADAAALAVAQESSCKA 59 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 S + +A ++ + I + + + + +R Sbjct: 60 GSSASSVSSLVQANVNSSSAASAAVIDGVKRKVTVTASAVGDDGLAGRRNVFAPVLGVDR 119 Query: 137 LDSSNNTIF---YNMDVMTSYDYR-----LQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + S + + + L + G Sbjct: 120 SEISASATASCVFPLGGTAELPLTFHKCHFDESRSL--DVKILVAYNVTAPRCNGTSGNA 177 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK--MAALKNAL 235 + +G +++ K + +NA+ Sbjct: 178 APGNFGWLQGANGRCPAKIDAAVYATPGDTGNNIPGPCKDTIKQFQNAV 226 >gi|45384490|ref|NP_990665.1| collagen alpha-1(XIV) chain precursor [Gallus gallus] gi|1705533|sp|P32018|COEA1_CHICK RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|288873|emb|CAA50064.1| collagen XIV [Gallus gallus] Length = 1888 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 54/184 (29%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + +I ++ + + K + Sbjct: 1056 DNFNKIISFLYSTVGALDKIGPDGTQVAIIQFSDDPRTEFKLNAYKTKETLLEAIQQIAY 1115 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K A ++L + + K ++ +TDG + + + Sbjct: 1116 KGGNTKTGKAIKHAREVLFTGEAGM-------------RKGIPKVLVVITDGRSQDDVN- 1161 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ + + + I + + + + S + + V + F I Sbjct: 1162 ----KVSREMQLDGFSFFAIGVADADYSELVNIGSKPSERHVFFVDDF----DAFTKIED 1213 Query: 408 LMVH 411 ++ Sbjct: 1214 ELIT 1217 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 24/174 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 185 VSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 IL + K K I +TDG++ + ++ Sbjct: 244 --ILENSFK----------PEAGARPGVSKIGILITDGKSQDDVIPP-----AKNLRDAG 286 Query: 362 IKIVTISI-NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ I + NA N LK S P+ + YNV + + + + + +++ + R Sbjct: 287 IELFAIGVKNADINE---LKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSR 337 >gi|288875|emb|CAA50063.1| collagen XIV [Gallus gallus] Length = 1857 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 54/184 (29%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + +I ++ + + K + Sbjct: 1056 DNFNKIISFLYSTVGALDKIGPDGTQVAIIQFSDDPRTEFKLNAYKTKETLLEAIQQIAY 1115 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K A ++L + + K ++ +TDG + + + Sbjct: 1116 KGGNTKTGKAIKHAREVLFTGEAGM-------------RKGIPKVLVVITDGRSQDDVN- 1161 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ + + + I + + + + S + + V + F I Sbjct: 1162 ----KVSREMQLDGFSFFAIGVADADYSELVNIGSKPSERHVFFVDDF----DAFTKIED 1213 Query: 408 LMVH 411 ++ Sbjct: 1214 ELIT 1217 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 24/174 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 185 VSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 IL + K K I +TDG++ + ++ Sbjct: 244 --ILENSFK----------PEAGARPGVSKIGILITDGKSQDDVIPP-----AKNLRDAG 286 Query: 362 IKIVTISI-NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ I + NA N LK S P+ + YNV + + + + + +++ + R Sbjct: 287 IELFAIGVKNADINE---LKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSR 337 >gi|46906212|emb|CAA46928.2| collagen type XIV/undulin [Gallus gallus] Length = 1148 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 54/184 (29%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + +I ++ + + K + Sbjct: 655 DNFNKIISFLYSTVGALDKIGPDGTQVAIIQFSDDPRTEFKLNAYKTKETLLEAIQQIAY 714 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K A ++L + + K ++ +TDG + + + Sbjct: 715 KGGNTKTGKAIKHAREVLFTGEAGM-------------RKGIPKVLVVITDGRSQDDVN- 760 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ + + + I + + + + S + + V + F I Sbjct: 761 ----KVSREMQLDGFSFFAIGVADADYSELVNIGSKPSERHVFFVDDF----DAFTKIED 812 Query: 408 LMVH 411 ++ Sbjct: 813 ELIT 816 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/251 (11%), Positives = 78/251 (31%), Gaps = 45/251 (17%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +++ I P + ++D+SGSM K Sbjct: 315 NRDTELMHVAIKGYDIAPATTPRANLVFLIDVSGSMDEPD-------------------K 355 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK+A L ++ + + ++ Y + P+ EK + ++ Sbjct: 356 LPLLKSAFRLMVNRLKADD------TVSIVTYAGNAGTVLAPTRVAEKSK-ILSAIDRLE 408 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY + + ++ TDG+ N S Sbjct: 409 PGGSTGGAEGIEAAYDLAKQGFVKDGVN----------------RVMLATDGDFNVGPSS 452 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +I ++ +++ I + + L++T + + D+L + + Sbjct: 453 DGDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLV 510 Query: 407 QLMVHRKYSVI 417 + + + Sbjct: 511 EEAGSTLFPIA 521 >gi|58429531|gb|AAW78169.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|301772356|ref|XP_002921600.1| PREDICTED: integrin alpha-1-like, partial [Ailuropoda melanoleuca] Length = 1160 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 69/191 (36%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D ++ + + +G++ Y V + + V + Sbjct: 169 SVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQ 228 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 229 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNHQLNKVI 275 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I S N L +K+ S + ++ +NV + +L+ + Sbjct: 276 QDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 332 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 333 VEALGERIFAL 343 >gi|296269297|ref|YP_003651929.1| vault protein inter-alpha-trypsin domain-containing protein [Thermobispora bispora DSM 43833] gi|296092084|gb|ADG88036.1| Vault protein inter-alpha-trypsin domain protein [Thermobispora bispora DSM 43833] Length = 796 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/259 (12%), Positives = 80/259 (30%), Gaps = 52/259 (20%) Query: 156 YRLQ-FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + +++ +P R R + LV+D SGSM Sbjct: 266 LVPDEPGDGAAEDGEGTLMLTVLPPDDREAA--RRPRDVVLVLDRSGSMAGW-------- 315 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWG 272 KM A + A +D++ + ++ + +E+ + Sbjct: 316 ------------KMVAARRAAARIVDTLTERD------RLAVLAFDNVIERAFPDGLTAA 357 Query: 273 TEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 T++ R + V T ++L ++ +G + Sbjct: 358 TDRARYRAVEFLARLEARGGT----------EMLAPLEEALTALAAAAEGGRDA------ 401 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++ +TDG+ + ++ + +++ + I+ + N L + V Sbjct: 402 VLVLVTDGQ--VGDEDRILERMASRI--GGVRVHAVGIDRAVNAAFLGRLAVLGAGRCEL 457 Query: 392 VVNADSLIHVFQNISQLMV 410 V + D L ++I + + Sbjct: 458 VESEDRLDEAMEHIHRRIG 476 >gi|281339017|gb|EFB14601.1| hypothetical protein PANDA_010505 [Ailuropoda melanoleuca] Length = 1153 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 69/191 (36%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D ++ + + +G++ Y V + + V + Sbjct: 170 SVTAFLNDLLERMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQ 229 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 230 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNHQLNKVI 276 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I S N L +K+ S + ++ +NV + +L+ + Sbjct: 277 QDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 333 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 334 VEALGERIFAL 344 >gi|218193011|gb|EEC75438.1| hypothetical protein OsI_11969 [Oryza sativa Indica Group] Length = 516 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 56/185 (30%), Gaps = 32/185 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTR 282 K+ +K ALL + +D + + ++ ++ + + + R + Sbjct: 82 AGDKLDRMKAALLFVIRKLDDVD------RLSIVTFSNDAARLCPLRFVAGDAARADLGA 135 Query: 283 DMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ + + + + ++ ++DG+ Sbjct: 136 LVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRSV---------------NVMLMSDGQQ 180 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 N + + + T + A + L S + V + +L Sbjct: 181 NRADATRLDPG--------GVPVHTFGLGADHDPAVLQAIAGKSREGMFHYVADGVNLTA 232 Query: 401 VFQNI 405 F + Sbjct: 233 PFSQL 237 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 57/152 (37%), Gaps = 27/152 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V + +T ++ T +K +Q + ++ + Sbjct: 346 VGMVTFDSVAEIRNNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQST 405 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N I C ++ K++ + I TI++ Sbjct: 406 SGSE----------------IILLTDGEDN-------EIHSCIEEVKQSGVIIHTIALG- 441 Query: 372 SPNGQRLLKTCVSSPEY-HYNVVNADSLIHVF 402 + L + + Y + + L + F Sbjct: 442 PSAAKELETLSDMTGGHRFYANKDINGLTNAF 473 >gi|301780322|ref|XP_002925578.1| PREDICTED: collagen alpha-1(XIV) chain-like [Ailuropoda melanoleuca] Length = 1796 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 48/163 (29%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKL-NAYNTKETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + N I + Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISGEMQSNGYNIFAVG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSSRHVFFVDDF----DAFKKIEDELIT 1207 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 53/173 (30%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 185 VTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGARAGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|281350861|gb|EFB26445.1| hypothetical protein PANDA_015099 [Ailuropoda melanoleuca] Length = 1741 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 48/163 (29%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + S T + A+K L + Sbjct: 1040 DGTQVAMVQFTDDPRTEFKL-NAYNTKETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTA 1098 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + N I + Sbjct: 1099 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISGEMQSNGYNIFAVG 1140 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1141 VADADYSELVSIGSKPSSRHVFFVDDF----DAFKKIEDELIT 1179 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 53/173 (30%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 157 VTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNY- 215 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 216 ------------IFENSFKPEAGARAGVSKIGILITDGKSQDDIIPPS-----RNLRESG 258 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + L+ S P+ + YNV D + V +++++ + R Sbjct: 259 VELFAIGV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 309 >gi|229489135|ref|ZP_04383001.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229324639|gb|EEN90394.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 551 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/338 (10%), Positives = 86/338 (25%), Gaps = 32/338 (9%) Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 ++ ++ + + + + A + ++ Sbjct: 233 DADGGTAVTTEQTFEAYLAAHADSRLTASVPSSGSVFMDYPMAGVGADSAAAVDAGKLLA 292 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ F D + + L+ + + Sbjct: 293 AALTSDTGRSVLAGLGFRGADQAPPVAGGVGKVGSLVPSDPTVVTKALARYEILSRPSRA 352 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 VVD+SGSM + +A R A + L F + + Sbjct: 353 LA-----VVDVSGSMDYMQDGVTRMAATAQAGDIAIRMFPANAQLGLWAFSIDLGEGTDY 407 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +E + + T + K+ + S++ T ++ AY+ + Sbjct: 408 RELEPVARMDATEGD------TDHRSKLLSRIDSLS-SIVGGGTGLYDSVLAAYRSMQQT 460 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKI 364 +I LTDG N++ S + +E + I Sbjct: 461 Y----------------DPASINSVILLTDGANDDPSSISLQELLDTLTREQDPTRPVPI 504 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +TI + + L + + + + VF Sbjct: 505 ITIGVTDDADTDVLEQISALTGGNSHFAPTPADIPKVF 542 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/343 (8%), Positives = 71/343 (20%), Gaps = 75/343 (21%) Query: 61 ILAGASKMVSNLSRLG-DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 LAGA + + + + R + + S Sbjct: 170 ALAGAGEFHLAENLAALQEAMATATGGSSTGAALPIRVSNATRQPQPAARSKELPAALPD 229 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 ++ + ++ + + + + Q +S Sbjct: 230 PATSTKVPLIALPPVAKMLLP-EPQVTTNAPALAVEIIFDASGSMAARLQGQTKLSLARR 288 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L + I + F Sbjct: 289 ALAAAVPGLENPSI------------------------------------LVGMRAYGFD 312 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAM 298 S++ E + + + + R D+L T ++ Sbjct: 313 QSLNKTPDAS----------CPNTELVLPFTANRQA--TAINRTADALSAYGYTPIADSL 360 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + II ++DGE + + Sbjct: 361 TLAGHDLL------------------AIDAQKHMIILISDGEETCG---GFPAAVAANLR 399 Query: 359 ENFIKI--VTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 I + I + Q+ ++ S +++ + D L Sbjct: 400 SLGIDLQTHVIGFDLDATAQQQMQAIASAGGGQYFDAADGDEL 442 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 48/182 (26%), Gaps = 23/182 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT---TRVEKNIEPSWGTEKVRQYVTR 282 K+ KN + L H+ + G I G + ++ Sbjct: 31 RKIDLAKNLFQGMSEQFALDPHMSLRFFAGGTSSNKMIDCQSSKIGLGVGAARSAAGISS 90 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +DS+ ++ R + + II ++DG+ Sbjct: 91 MIDSINAVG-----------------QQTPITYALERAQQDMEGWLGPRKIILISDGQET 133 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + I + TI I N +L ++ + N +L Sbjct: 134 CQQD---PESLAEAFSGAGITVDTIGIGPPGNFAQLGMIALAGAGEFHLAENLAALQEAM 190 Query: 403 QN 404 Sbjct: 191 AT 192 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 30/170 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMD 285 +K+ + +D + +E +G++ Y+ + V+ V R Sbjct: 818 YEVVKDFVNSLIDHVS---VSREATRVGVVLYSHVEVVVASLQQLYDQAAVKTAVRRMPY 874 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+++A Q+ + +K + LTDG +N Sbjct: 875 L--GEGTFTGSAIRRATQLFQA-----------------ARPGVRKVAVVLTDGLADNRD 915 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHY 390 + + A I+I + I + + Q + S P+ +Y Sbjct: 916 AVS-LKDAAEGAHSAGIEIFVVGIVNNSDSQYAEFKNEMNILASDPDENY 964 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 405 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYM 464 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 465 SGGTATGDAIAFTVRNVF---------------GPIRDSPNKNFLVIVTDGQSYDD---- 505 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + L+ S P + + L + ++ Sbjct: 506 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LRDMASKPKESHAFFTREFTGLEPIVSDV 561 Query: 406 SQLMV 410 + + Sbjct: 562 IRGIC 566 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTR 282 ++ L A LFL L V++ ++G++ + + R +T+ Sbjct: 319 IGDRLKRLTQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQSALRQIKGGTDRDALTK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ T ++ A+ ++ K I+ LTDGE+N Sbjct: 374 SLPTVASGGTSICSGLRSAFTVIRKKYKTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---L 398 TI C ++ K++ I T+++ L + + + L Sbjct: 417 -------TISSCFNEVKQSGAIIHTVALG-PSAAAELEELSKMTGGLQTYASDQAQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + ++ +G Sbjct: 469 IDAFGALSSGNGAVSQHSIQLVSRG 493 >gi|87306384|ref|ZP_01088531.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] gi|87290563|gb|EAQ82450.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] Length = 1030 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 7/108 (6%) Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S P+ K +I ++DG+ + + + IK+ Sbjct: 522 MPQFDPSLQMALRAFNQLPPNEVAVKHMIIISDGDPSPANPFTLS-----AIAKAGIKVT 576 Query: 366 TISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 T++I N L K ++ +Y V N +L ++Q ++ + Sbjct: 577 TVAIGTHGPANSLELKKIASATGGKYYEVTNPKALPRIYQREARRIAQ 624 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/341 (8%), Positives = 86/341 (25%), Gaps = 32/341 (9%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L S ++ + E++ + ++++ E+ G + Sbjct: 33 TLVAGSSGKPVVTAATQKLEAVPGFLNDFNLMTRIALVESNYGENNKGSLGGIWEITTSI 92 Query: 121 IVNSSRISMT-----------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFI-EHLLNQR 168 + + + +N + + +Q Sbjct: 93 QATLRGSLSSSVQTIVNEAGCILGQENTNLTNTDFSTPIRSAFAARIFIQQSVGSSAIPI 152 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + ++ + R + V+ D V Sbjct: 153 TLTQQATWWSTVYRPGANATKFIELVSQVENVTVGCATHKLDLWFVIDGSGSVGFSN--- 209 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +++L + +DV +G Y++ + + S Sbjct: 210 --FQDSLRFLASLTKRFTIGPDDVRVGFSVYSSTSTIHSHFNQHMNNSALEAEILGTSYT 267 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ +L + R G + S + ++ +TDG++ + Sbjct: 268 GGGTSTGRAIN---DVLNNGF-------VERNGARPASEGVPRILVVMTDGQSGDSVKTP 317 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + D K I + + I + + + + + + Sbjct: 318 S-----DNVKAAGITVFGVGIGSGIDIAEVNEIASNPDSRY 353 >gi|156371145|ref|XP_001628626.1| predicted protein [Nematostella vectensis] gi|156215607|gb|EDO36563.1| predicted protein [Nematostella vectensis] Length = 484 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 59/181 (32%), Gaps = 18/181 (9%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K + + S ++ ++G+I ++TR + + D Sbjct: 290 AKRFVKALIGSFK---VSQKGTHVGIIRFSTRAKVMFTFTEHFTHEDVNYAIDDIEYTEG 346 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A++ A L F +QG S K + +TDG + F + Sbjct: 347 GTKTELALRLARTEL-----------FSKQGGSRTSPLIFKLFVLMTDGRSEYFHAVARQ 395 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K + + ++ + I N + L+ SS VV+ L+ I + Sbjct: 396 ---AKMLKRSGVHVMAVGIGKYTNQRE-LEVIASSKSDVIGVVSFRDLMIRMNEIKDKLC 451 Query: 411 H 411 Sbjct: 452 E 452 >gi|81892746|sp|Q6Q473|CLCA4_MOUSE RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel regulator 6; Short=mClca6; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|50882459|gb|AAS86332.2| calcium activated chloride channel [Mus musculus] Length = 924 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 70/210 (33%), Gaps = 35/210 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 ++ + A FL I ++ ++G++ ++++ E ++ Sbjct: 314 VSGSMTSYDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIVHELIQINSDIE 368 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + + + T +K A+Q+ + + ++ T I+ L Sbjct: 369 RNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE----------------ILLL 412 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DGE + D+ K++ + I++ + + + H + Sbjct: 413 SDGE------DSTAKDCIDEVKDSGSIVHFIALG-PLADLAVTNMSILTGGNHKLATDEA 465 Query: 397 S---LIHVFQNISQL---MVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 466 QNNGLIDAFGALASENADITQKSLQLESKG 495 >gi|55378019|ref|YP_135869.1| hypothetical protein rrnAC1219 [Haloarcula marismortui ATCC 43049] gi|55230744|gb|AAV46163.1| unknown [Haloarcula marismortui ATCC 43049] Length = 788 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/273 (8%), Positives = 70/273 (25%), Gaps = 40/273 (14%) Query: 138 DSSNNTIFYNMDVMTS----YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 N + ++ + ++ + L + G+ Sbjct: 294 SLYNVSTASSVPADLDDYAAVVMQDTPSGNM-GNTTALQDFVINGGGLVVAGGDNAYENG 352 Query: 194 ELVVDLSGSM----HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 SM ++ + ++ K L LD + + Sbjct: 353 GYETSPVASMLPVRVGNATGGESNIVILVDVSGSAESGLSVQKAVALDVLDQLGDEN--- 409 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +G++ + + E + + + TD ++ A ++L + Sbjct: 410 ---QVGVVAFNQNAYRVSEMQALGQNRAETADKIRRLESGGATDIAVGLQGADELLDDRE 466 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 II L+DG++ + ++ ++V++ + Sbjct: 467 GT---------------------IILLSDGQD----RLGPPAAVANQLGREGTRVVSVGV 501 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + S ++ + L +F Sbjct: 502 GKRVGVATMRQIASESGGSYFAADETERLRLLF 534 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 76/245 (31%), Gaps = 45/245 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 NQ +Q+ +S + + + L++D SGSMH Sbjct: 16 ANQPSSQRQLSIAIRAITQSQDQSLPLNLCLILDHSGSMH-------------------- 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K A + ++ + + +I + R + + + + + ++ Sbjct: 56 GRPLETVKKAAMQLIERLKEGD------RICVIAFDHRAKVLVP-NQAIDNLNTIKSQIR 108 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T +K + + K + + LTDGE N Sbjct: 109 QLSADGGTAIDEGLKLGIEEVAKGKADAVSQ-----------------VFLLTDGE-NEH 150 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N +K+ A E+ + I T+ AS N L K S + + + F Sbjct: 151 GDNERCLKLAHFAVEHKLTINTLGFGASWNQDVLEKIADSGSGTLCYIEQPEQAVQEFGR 210 Query: 405 ISQLM 409 + + Sbjct: 211 LFNRI 215 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/351 (11%), Positives = 91/351 (25%), Gaps = 22/351 (6%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+ N ++ + S ++ + + LS ++ Sbjct: 87 ATPKRENKFKVSVNVAAQSKVTFNLTYEELLQRRLGSYELVLSIRPQQVVRHLKIDVRII 146 Query: 125 SRISMTHMANN--RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + N + I YR +E + + ++ Sbjct: 147 ETRDIVMLDNTYGSGELEGVEIARPSPNRAHIQYRPTDMEQMRMSPSGISGDFLVRYDVK 206 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ I ++ + V TKM K A+ L + Sbjct: 207 RDLSVGDIQIVNGYFVHYFAPSGLPVVPKNIVFIIDKSGSMGGTKMRQTKQAMNTILKDL 266 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + E + E R YV R ++ T+ A+ A Sbjct: 267 RDHDRFNVMPFSYSSTMWRPNEMVLATRENIESARTYVRRSIN--AGGGTNINQAIIDAA 324 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKEN 360 +L + S IIFLTDG + K + + + +E Sbjct: 325 DLLR-----------RVTDDQPNSPRSASLIIFLTDGLPSVGESKPRNIMVNVKNAIREQ 373 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + + + + L K + + + + A L + ++ Sbjct: 374 -VSLFCLGFGKDVDFPFLEKMALENRGLARRIYEDSDAALQLKGFYDEVAT 423 >gi|332970881|gb|EGK09858.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 442 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 68/253 (26%), Gaps = 43/253 (16%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 ++S P + E + S+ D + + + KM K Sbjct: 105 LMSLDPTIQVNEKTPDNKIDVP-------SIEQMNVEILLDASGSMAGRVDGGVKMDLAK 157 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKNIEP-SWGTEKVRQYVTRD 283 A+ F + + V VY E S+ + + Q + + Sbjct: 158 QAIRAFASDVPEGAQVSLRVYGHKGSNQKKDKAVSCQSNELVYPLKSYDSSQFEQSLNQF 217 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T A++ A + L + ++DG Sbjct: 218 K---PTGWTPLASAIQAAREDLKEWAGAR------------------NIVYVVSDGVETC 256 Query: 344 FKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 V K E+ I+ + I + GQ+ LK ++ + V + D L Sbjct: 257 GGDPVR---EAKKLGESGIEPMVKIIGFDVDDAGQQQLKKVAEAADGSYQTVTSGDDLKE 313 Query: 401 VFQNISQLMVHRK 413 + + + Sbjct: 314 YLKGEKESIKQEW 326 >gi|282879638|ref|ZP_06288369.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306586|gb|EFA98615.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 346 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 71/226 (31%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++A K + +D+ + +GL+ + + + Sbjct: 102 DISNSMLAEDVVPSRLAKSKLLVENLVDNFT-------NDKIGLVIFAGDAFIQLPITSD 154 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A++ + T + Sbjct: 155 YVSAKMFLQNTDPSLITTQGTDIARAIRLSMSSFTQ------------------QDKVGR 196 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + ++ A + I + + + NGQ Sbjct: 197 AIILITDGEDHEGGA----LEAAADANKKGINVFILGVGNTQGAPIPVAEGGYMKDENGQ 252 Query: 377 RLLK---------TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + +V N + ++ + + Sbjct: 253 TVMTALNEKMCQDIARAGKGTYIHVDNT---NDAQKKLNDELAKLQ 295 >gi|84498148|ref|ZP_00996945.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381648|gb|EAP97531.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 533 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 55/189 (29%), Gaps = 29/189 (15%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 R ++ +K++L L +++ + ++ Y T +EP+ + Sbjct: 193 TSGSMDIRERLGLVKSSLALLAENLRPDD------TIAIVTYQTDATPLLEPTPVRDTD- 245 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T+ + Y K ++ + Sbjct: 246 TILAAIDRLEAGGSTNLEAGLLLGYDQAREAYK----------------QGATNVVLLAS 289 Query: 338 DGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKTCVSSPEYHYN-VV 393 DG N ++ ++ ++N I +VT+ L++ + Y + Sbjct: 290 DGVANVGVTDGG--RLATAIRDNGRRGIHLVTVGYGMGNYSDHLMEQLADQGDGFYEYID 347 Query: 394 NADSLIHVF 402 + +F Sbjct: 348 TFEEARKLF 356 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/241 (11%), Positives = 73/241 (30%), Gaps = 34/241 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + FIP L + +F ++++ D + + D + Sbjct: 78 TSLKLFIPTTLLMLGLGGGLFQFLYGMNVNKEKAFKDIVIAIDTSGSMEQSDPNGERF-- 135 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRDMDS 286 A +D+++ + + + + + + ++V + Sbjct: 136 --KATSSLIDNLEG------NRRIAFMTFDDSPILQFDFMEATTKEQKEVVKAKIASYQQ 187 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + +AY+++ ++ S +I ++DG ++ + Sbjct: 188 NDDGQTGVRDMINEAYELIQNN-----------------SKNHSGSLIMISDGAPSDDSA 230 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + + + I I TI + N L+ + HY+ + + F Sbjct: 231 SNIPALVSNYVQN-NIPIYTIGMMYGDNSAEQYLIDIANLTGGQHYSTSDTTMIAGAFGQ 289 Query: 405 I 405 I Sbjct: 290 I 290 >gi|33333550|gb|AAQ11891.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENT 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429467|gb|AAW78137.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429537|gb|AAW78172.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429535|gb|AAW78171.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429461|gb|AAW78134.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/366 (9%), Positives = 81/366 (22%), Gaps = 46/366 (12%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM------------ESANNA---- 59 S + F ++ A +++ + + F I + K M + +A Sbjct: 18 QSRRGAFMVMAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITDAVQTT 77 Query: 60 -----------------------AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 A A K+ D + + Sbjct: 78 ADGIGGSDNVSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTFH 137 Query: 97 FI---KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + + + TE Q M+ + + I + Sbjct: 138 MVWGENAKPYNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTAAVTTSAIAFIEAR--D 195 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 L + + + S+ +E I+ + + S + Sbjct: 196 IVLVLDYSGSMSYDSEFDAMSSYRLGKSAVEANLDDIWETLVDSGATYSDSGKLKFPATG 255 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + L LD D ++K Y + + Sbjct: 256 YGRINSEVGTYISSTNDDYIYRALDLDEEDSSGNLKYPFPQEGKNYYGNLNGEPSGNTNK 315 Query: 274 EKVRQYVT--RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ R ++ + L +K + + + P ++ Sbjct: 316 NLWKNYIKWVRSDGTVNNYGYRKKYGYRTLMGYLIEQRKLNSQSEDLWRAPIYPFNAMKE 375 Query: 332 FIIFLT 337 + T Sbjct: 376 GVTLFT 381 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 58/224 (25%), Gaps = 40/224 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----------------SW 271 A+K + LF + + L ++GL+ Y + Sbjct: 370 FNAMKEGVTLFTEFLSGLE---FGDHIGLVTYDDTSRIQSVLNEQGISESVDLGDELITN 426 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK----------RSFFTNFFRQG 321 + S T +K A ++L S + N G Sbjct: 427 RYSDIDSIQRHKQASHYAPYTGMGYGIKDARELLQSHGRAGARPTILVMTDGNANRSPSG 486 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC----DKAKENFIKIVTISINASPNGQR 377 +P + D +++ +N + +A + + T+++ + Sbjct: 487 WSLPGSWDWDDVTDFDDDGQSDYSTNDRHKQYAFWQAVEAAKLGYTVHTMTVGEGADRSL 546 Query: 378 LLKTCVSSPEYHYNVVNADSLIH-------VFQNISQLMVHRKY 414 + + + ++ F I+ + K Sbjct: 547 MQAMAKACNGICIDAPGGATIEDMKSQLLTAFGKIAANVPPAKL 590 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/314 (7%), Positives = 68/314 (21%), Gaps = 28/314 (8%) Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIV 174 + V + + + N + + + + N + Sbjct: 41 AKASQSVIVKPQNSNAEGSIDFHLTPKGKATNANRDPIDVVFVFDKSGSMNDSGKNPQKF 100 Query: 175 SFIPALLRIEMG--------ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + D+ + + I + Sbjct: 101 QSAKDAMTAAVNFFKENAGPNDRFGFVPFDDDVETGKVVNFAPENNMASLNLINSNSNSL 160 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 N ++ + + + Y+ + + + ++ Sbjct: 161 SALGGTNYTQSLDAALGMFGNSTNNKYVLFMTDGEPTFSKVIERTTYNERACFIVCWDTG 220 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + QG + D Sbjct: 221 KKITG------------------NVVMNYEVYNQGFSVYHKTNGHTYKQDFDLAETKSAI 262 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQR-LLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + + K IK+ +I ++ L+ S+ +++ +F++I Sbjct: 263 KKHGLSMAQKLAMKDIKLFSIGFGSNTELDMGYLRELSSTTGVTARQATQENISSIFRDI 322 Query: 406 SQLMVHRKYSVILK 419 S M S +K Sbjct: 323 SADMDTPSISGEIK 336 >gi|123438167|ref|XP_001309871.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121891616|gb|EAX96941.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 957 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/330 (13%), Positives = 106/330 (32%), Gaps = 26/330 (7%) Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 R +S S + + ++ + ++ + ++ + L Sbjct: 326 RDKSKSYFLNHYDTLNHLTTYNLDFTDQYLLSNSKLFLPTSGDLQKAKEMTFQVFSVIDL 385 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + D + + ++ + + I++ G + I Sbjct: 386 NEKGRVCIFKTDDGMNLYLKPKYDKSSKDFEKIALIINP-------RTGSQEEIDIAKTA 438 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVYMGL 256 + S + + + + C + + LKNA + + GL Sbjct: 439 SIQNSENKGHSRNEVKQINIICCDLSRSMENSNKLKNAKKVISILAKNVFEFGIGTMWGL 498 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 I +++ V+ + + + V DS + T A+K A + +TS + Sbjct: 499 INFSSTVKTVLPLTAIASEFSMAVNE--DSELGDDTKLFEAIKVASETITSKSEYF---- 552 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNN--FKSNVNTIKICDKAKENFIKIVTISINASPN 374 K I+ +TDG +N+ +KS+ + K+ +N I + I I+ S + Sbjct: 553 ----------DNVYKRIVVVTDGIDNDNHYKSDESLQKLTKILTDNKIILDVIFIDESDS 602 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ + + + + VF + Sbjct: 603 RAAVMSQATGGLAFFFKGSEQNLMESVFSS 632 >gi|58429471|gb|AAW78139.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 61/186 (32%), Gaps = 25/186 (13%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + +K+A+ L + I L+ +++ + ++ + I Sbjct: 44 NDQVDLYLLMDCSGSIRRHNWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRL 103 Query: 270 SWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K ++ + +L T+ T A+ Q + L I Sbjct: 104 HSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDR---------------I 148 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTC 382 + ++ LTDG ++ + + +K K + +KI I N R L C Sbjct: 149 NRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGC 205 Query: 383 VSSPEY 388 S Sbjct: 206 HPSDGK 211 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/203 (10%), Positives = 61/203 (30%), Gaps = 27/203 (13%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + +C S N + T++ + A L + ++ G + + Sbjct: 446 AYYCLDGSGSMGDNDGWTGIEAAATQVFDPEQAAQNLLQT-HPQDVTTVAIFNGGVTGGS 504 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + +R D T+ + +A LT + Sbjct: 505 PWQVRG---NDGDALRDLARSVADYEPEGGTNMYACLLRATTELTGQQNGDR-------- 553 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 ++ ++ +TDG++ + + D + + +V I+ + ++L + Sbjct: 554 --------KRLVVLMTDGQSGAEQRD----DALDALRSADVPVVAIAFGRDADPRQLEEV 601 Query: 382 CVSSPEYHYNVVNADSLIHVFQN 404 ++ D L+ + Sbjct: 602 AKATNGTFVR---EDDLVAALRQ 621 >gi|260797293|ref|XP_002593638.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] gi|229278864|gb|EEN49649.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] Length = 388 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 55/187 (29%), Gaps = 29/187 (15%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F I +G+I Y+TR + + K D + T Sbjct: 27 VKQFTKKAISGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMRGGTL 86 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + K +I +TDG + + + Sbjct: 87 TGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVAAPALE-- 131 Query: 354 CDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI------S 406 A++ I + I + + L+ S+ V N + L ++ + + Sbjct: 132 ---AQQKGITVYAIGVSGYDADQ---LEQIASNNNTLAFVDNFNLLDNLRNTLLTGVCDA 185 Query: 407 QLMVHRK 413 + + + Sbjct: 186 KNICSTR 192 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 43/152 (28%), Gaps = 23/152 (15%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F I +G+I Y+TR + + K D + T Sbjct: 258 VKQFTKKAISGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMRGGTL 317 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + K +I +TDG + + + Sbjct: 318 TGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVAAPALE-- 362 Query: 354 CDKAKENFIKIVTISI-NASPNGQRLLKTCVS 384 A++ I + I + + L+ S Sbjct: 363 ---AQQKGITVYAIGVSGYDADQ---LEQIAS 388 >gi|58429491|gb|AAW78149.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3142286|gb|AAC18657.1| thrombospondin related adhesive protein [Plasmodium falciparum] Length = 562 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP-6] gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound [Methanosaeta concilii GP-6] Length = 726 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 66/240 (27%), Gaps = 49/240 (20%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + +++ E + I LV+D SGSM Sbjct: 248 SFKEEGQDGFFLLLAAPDVKVNEEEIVVKDIILVLDTSGSMQ------------------ 289 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 KM K A LD ++ L ++ + T + + Sbjct: 290 --GEKMDQAKEAARYVLDHLNPLD------RFAIVSFATTTRSFSPSLEPAAQADKGKDF 341 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + TD AM +A + + +IFLTDG Sbjct: 342 LDRLEAMGSTDINRAMIEAVGLAEEVRPT--------------------TLIFLTDGLPT 381 Query: 343 NFKSNV--NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +A + ++I + + + L + + + V + + Sbjct: 382 EGVTVTGAILDNVAREAPD-NVRIFSFGVGDDVDTDLLDQISMDNGGASTYVRPGEEIDE 440 >gi|256839783|ref|ZP_05545292.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256738713|gb|EEU52038.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 339 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 59/207 (28%), Gaps = 26/207 (12%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI------- 324 + +++ S++ + T A+ A + T D+ G Sbjct: 150 YISAKMFLSSINPSMVSTQGTAIGAAINLAARSFTPDETTDKAIILITDGENHEDDAIGA 209 Query: 325 --PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + G+ + K K+ + I N Q + Sbjct: 210 AKAAAEKGIHVNIVGMGDPKGSPIPIQGSNNYMKDKDGNVVI------TKLNEQMGQEIA 263 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + + N + + + + Sbjct: 264 AAGNGMYVRADNT---NSALKALQKEI 287 >gi|150007596|ref|YP_001302339.1| hypothetical protein BDI_0949 [Parabacteroides distasonis ATCC 8503] gi|255013875|ref|ZP_05286001.1| hypothetical protein B2_08202 [Bacteroides sp. 2_1_7] gi|149936020|gb|ABR42717.1| conserved hypothetical protein BatB [Parabacteroides distasonis ATCC 8503] Length = 339 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 59/207 (28%), Gaps = 26/207 (12%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI------- 324 + +++ S++ + T A+ A + T D+ G Sbjct: 150 YISAKMFLSSINPSMVSTQGTAIGAAINLAARSFTPDETTDKAIILITDGENHEDDAIGA 209 Query: 325 --PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + G+ + K K+ + I N Q + Sbjct: 210 AKAAAEKGIHVNIVGMGDPKGSPIPIQGSNNYMKDKDGNVVI------TKLNEQMGQEIA 263 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + + N + + + + Sbjct: 264 AAGNGMYVRADNT---NSALKALQKEI 287 >gi|3273263|dbj|BAA31175.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSYRRHNWVNHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/258 (10%), Positives = 64/258 (24%), Gaps = 37/258 (14%) Query: 196 VVDLSGS--MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V ++ S S + + ++T+M +++ D++ ++ + Sbjct: 148 VSGVNWSNITAHTYVSGNYRNWLSSSTETVRKTRMEIMQDVAKRLADTVTGVNIG--LMA 205 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 S + + + T + + A + K Sbjct: 206 FNQSQNGEGGRVLNNVSNVKDNASAFKAKVDGLYPSGQTPLSETLFGAMRYFQGGKPFLD 265 Query: 314 FTNFFRQGVKIPSLPFQ-------KFIIFLTDGENNNFK--------------SNVNTIK 352 + +I LTDG + S + Sbjct: 266 RNPVSGTVDGSNNYKSPIELECQANNVILLTDGAPTSDTNHNSFIGSAIGKTCSGNCLDE 325 Query: 353 ICDKAK----------ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I + IK T+ + L + +Y NA L F Sbjct: 326 IAGYMATNDMSAAFSGDQTIKTYTVGFSIDD--PLLGAAATAGGGEYYVANNAQQLADAF 383 Query: 403 QNISQLMVHRKYSVILKG 420 +I + ++ + + G Sbjct: 384 DDILRSVMDTSSTFVAPG 401 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 56/172 (32%), Gaps = 23/172 (13%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTP 296 + + L+ G +G+ V K + + + + + + D + Sbjct: 68 VTEVLSLMDDASSKDRFGFVGFNKEVTKELALTNNIVQAKSKLNEFGKNISPYMANDLSK 127 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 +++A LT S K I+ +T G N N + K+ K Sbjct: 128 GLEKAVDELTKK-----------------STSNDKVIVIMTVG---NSIYNEVSKKLAAK 167 Query: 357 AKENFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 A E I I TIS + L + + + + NA L V +S Sbjct: 168 AYEEDITIHTISFGDPLYADAPFLTEIAKLTGGNYTHSPNAAFLKDVLSKLS 219 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 44/381 (11%), Positives = 97/381 (25%), Gaps = 37/381 (9%) Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 N+ + N L + A + A T + + + Sbjct: 458 NVGDEVPYTFVVRNSGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVD 517 Query: 131 HMA--NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 N +S N ++ I+ L +++I + + Sbjct: 518 SEGHFTNTATASGTDDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVT 577 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS-- 246 + + ++ KM ALK+A+ FL ++ Sbjct: 578 GAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQ 637 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 E + +GL+ + + + + ++ T +K L Sbjct: 638 SGSELIRVGLVKFAGKESSKVG---NETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALR 694 Query: 307 SDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENN-----NFKSNVNTIKICDKAKE 359 + F ++ +IF TDG + + + K+ Sbjct: 695 HNGATRADLGFKHASTVMSGARTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKD 754 Query: 360 NFIKIVTISINASPN-------GQRLLKTCVSSP----------------EYHYNVVNAD 396 + + +I + N + VSS Y+ V N Sbjct: 755 SGATVYSIGVFDGANPSSIEENQKNQFMNAVSSNYPHATAYDKLGTGSNAGYYKVVSNVS 814 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 L +F+ I ++ + Sbjct: 815 DLKSIFEKIQTNETTDGFTNV 835 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 56/178 (31%), Gaps = 25/178 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K A + +D + + +I + + + + T+K + Sbjct: 62 TVKEAAIQLVDQLREGD------RLSVIAFDHKAKVIVPNQDVTDKAHIK-AQIDRLEAA 114 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T +K Q L S + + LTDGE N N Sbjct: 115 GGTCIDDGIKLGLQELASSPGKRAAQ-----------------VFMLTDGE-NEHGDNGR 156 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 ++I A E+ + + ++ + N L K ++ + + + F+ + Q Sbjct: 157 CLEIAAVAAEHGVTLNSLGFGENWNQDVLEKIADAANGSLAYIETPNQALTEFERLLQ 214 >gi|307318077|ref|ZP_07597513.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306896118|gb|EFN26868.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] Length = 431 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/188 (8%), Positives = 50/188 (26%), Gaps = 6/188 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ + +E ++I A++ + +G + W+ +K ++ A + + A A Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAA 75 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + + ++ G + ++ Sbjct: 76 VRHRAGDQQSALETAARRVAGGSGFSPGGLTV------STVPGSAGGSNKVTVELTETHP 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 R+ + + + + L V + Sbjct: 130 RMFSSVFGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNS 189 Query: 186 GERPIFLI 193 FL+ Sbjct: 190 NAADAFLM 197 >gi|194671641|ref|XP_591137.3| PREDICTED: matrilin 3 [Bos taurus] gi|297480578|ref|XP_002691564.1| PREDICTED: matrilin 3 [Bos taurus] gi|296482389|gb|DAA24504.1| matrilin 3 [Bos taurus] Length = 574 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 63/177 (35%), Gaps = 26/177 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMD 285 +K + +D++D+ D + ++ Y + V+ + +++ V R Sbjct: 187 FTKVKTFVSKIIDTLDIGPM---DTRVAVVNYASTVKIEFHLQTHSDKQSLKRAVARITP 243 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T S A++ A G + PS K I +TDG + Sbjct: 244 LST--GTMSGLAIQTAMD----------EAFTVEAGARGPSSNIPKVAIIVTDGRPQDQV 291 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + ++ +A+ + I++ + ++ + LK S P E+ + V + Sbjct: 292 N-----EVAARARASGIELYAVGVDRADMES--LKMMASEPLDEHVFYVETYGVIEK 341 >gi|13591884|ref|NP_112256.1| integrin alpha-1 precursor [Rattus norvegicus] gi|124941|sp|P18614|ITA1_RAT RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|56494|emb|CAA36384.1| unnamed protein product [Rattus norvegicus] gi|149059385|gb|EDM10392.1| integrin alpha 1, isoform CRA_b [Rattus norvegicus] Length = 1180 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + + D + + + +G++ Y V + + V + Sbjct: 188 SVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIGRQGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 295 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 351 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 352 VKALGERIFAL 362 >gi|260786375|ref|XP_002588233.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] gi|229273393|gb|EEN44244.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] Length = 1313 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 51/160 (31%), Gaps = 23/160 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + +GLI ++ + + T RQ + + S T ++ +L Sbjct: 340 DGSMVGLITFSDTAAVDHALTELTADSHRQSLITALPSSTYGSTSIGAGIQAGLSMLKPT 399 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + I+ +TDG+ N + + K + +VTI+ Sbjct: 400 GQG-------------------GTIVLMTDGQENTAPMIQDVWPSVLQQK---VTLVTIA 437 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I + +S Y+ +A L +F IS Sbjct: 438 IGEYADMSLEDLASQTSGLSFYDTEDASHLSEIFTAISSQ 477 >gi|153791389|ref|NP_001028400.2| integrin alpha-1 precursor [Mus musculus] gi|189442109|gb|AAI67237.1| Integrin alpha 1 [synthetic construct] Length = 1179 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 67/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 188 SVTAFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 295 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 351 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 352 VKALGERIFAL 362 >gi|163849593|ref|YP_001637636.1| TadE family protein [Methylobacterium extorquens PA1] gi|163661198|gb|ABY28565.1| TadE family protein [Methylobacterium extorquens PA1] Length = 135 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 13/141 (9%), Positives = 39/141 (27%), Gaps = 6/141 (4%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + ++ S+ FAL V + L+ + Y ++A +A A ++ + Sbjct: 1 MRSFRKDQRGVISVEFALVVPLLITLVFAAVEFGGILYT----QAAAESATNNVARQLAT 56 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 N ++++ + + + S ++ + ++ Sbjct: 57 NRITQAQAKKAVTPLL--PVWARVNVDVAVTASNATKPESNLYTVATTIPMAQAAPTGFL 114 Query: 131 HMANNRLDSSNNTIFYNMDVM 151 + S Sbjct: 115 KFIYVDRNISATVTQQQEPTS 135 >gi|123781093|sp|Q3V3R4|ITA1_MOUSE RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|74186862|dbj|BAE20498.1| unnamed protein product [Mus musculus] Length = 1179 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 67/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 188 SVTAFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 295 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 351 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 352 VKALGERIFAL 362 >gi|148685682|gb|EDL17629.1| mCG133494, isoform CRA_a [Mus musculus] gi|148685684|gb|EDL17631.1| mCG133494, isoform CRA_a [Mus musculus] Length = 828 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 52/167 (31%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + T + A+K L + Sbjct: 184 RPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLRGYTYTASAIKHVITELFTT 243 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + K +I +TDG S + I + + A I I Sbjct: 244 QSG-------------ARQDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS---IIRYAI 287 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + ++ LK S P + ++V N D+L + + + + Sbjct: 288 GVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|125570256|gb|EAZ11771.1| hypothetical protein OsJ_01640 [Oryza sativa Japonica Group] Length = 1034 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 67/232 (28%), Gaps = 43/232 (18%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V + ER + V+D+S +M L Sbjct: 34 VLVRVVAPPPAAASSERAPIDLVAVLDVSCC-----------------GGLGPVNRMDLL 76 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ +D + + + + + G ++ + V + Sbjct: 77 KKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMVQ--SSLTVTGE 134 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+K+A IL +G K FI+ ++DG++ Sbjct: 135 NKLSTALKKAATIL--------------EGRKDHDKKRPGFIVLISDGDD---------A 171 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVF 402 + + A + + N + + + +S + + + D L F Sbjct: 172 SVLNDAMNLNCSVHAFGFRDAHNARAMHRIANTSAGTYGILNDGHDGLADAF 223 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 64/206 (31%), Gaps = 35/206 (16%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 S ++ L+ A+ L + ++ + ++ + + V + + Sbjct: 532 SCGGGGGAPARRLDLLRKAMDLVIGNLGADD------RLAIVPFHSSVVDATGLLEMSVE 585 Query: 276 VRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R +R + SL T PA+ A +IL + + R G ++ Sbjct: 586 GRGVASRKVQSLAVAGGTKLFPALNAAVEILEAR---CWEAKRERVGA----------VV 632 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 ++DG++ I +A + + + + + + + + + Sbjct: 633 LISDGDD---------RTIFREAINPRYPVHAFGFRGAHDARAVHHVADHTSGVYGVLDD 683 Query: 395 ADS-LIHVFQNISQLMVHRKYSVILK 419 + F R+ + ++ Sbjct: 684 EHDRVTDAFAA-----CVRRVTSVVA 704 >gi|10946646|ref|NP_067309.1| integrin alpha-X precursor [Mus musculus] gi|48428495|sp|Q9QXH4|ITAX_MOUSE RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor gi|6684131|gb|AAF23492.1|AF211864_1 leukocyte adhesion glycoprotein p150,95 alpha integrin subunit [Mus musculus] gi|189442099|gb|AAI67225.1| Integrin alpha X [synthetic construct] Length = 1169 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 52/167 (31%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + T + A+K L + Sbjct: 184 RPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLRGYTYTASAIKHVITELFTT 243 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + K +I +TDG S + I + + A I I Sbjct: 244 QSG-------------ARQDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS---IIRYAI 287 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + ++ LK S P + ++V N D+L + + + + Sbjct: 288 GVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|74215118|dbj|BAE41792.1| unnamed protein product [Mus musculus] Length = 1169 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 52/167 (31%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + T + A+K L + Sbjct: 184 RPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLRGYTYTASAIKHVITELFTT 243 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + K +I +TDG S + I + + A I I Sbjct: 244 QSG-------------ARQDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS---IIRYAI 287 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + ++ LK S P + ++V N D+L + + + + Sbjct: 288 GVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|229491170|ref|ZP_04384998.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229321908|gb|EEN87701.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 614 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/241 (13%), Positives = 64/241 (26%), Gaps = 55/241 (22%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +F E + VVD SGSM + +A K+A Sbjct: 34 TFGLFSTPTAQAEETTSSVLFVVDTSGSMA--------------------GSPLAQAKDA 73 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + ++ + G + + + Q T + Sbjct: 74 LRAGIGALSSGQAAGLRSFAG--DCGNGGQLLVPVATD--NRDQLNNATNQLTAGGTTPT 129 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A L S + II ++DG++ + Sbjct: 130 PDALRAAAGDLPSTGD--------------------RTIILISDGQSTCGD----PCAVA 165 Query: 355 DKAKEN-FI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + K I ++ + NA + L ++ ++ N L IS + Sbjct: 166 TELKTQLGIDFRVHAVGFNAPDVAESELSCIANATGGRYFTATNTTELSDA---ISAAVT 222 Query: 411 H 411 Sbjct: 223 T 223 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 52/165 (31%), Gaps = 12/165 (7%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLI 288 + L + +GLI Y++R V Q + R I Sbjct: 20 FRRVLEFVKRLVSTFHVSPRRARIGLIVYSSRSYLVGGFRRYRNLRSVLQAIKRI--RYI 77 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + AMK A + L S + + K ++ +TDG + + + Sbjct: 78 RGGTYTGKAMKYALRKLFSRRAGYHHARVRL--FRSSRKGAAKILVMITDGISQDRVTTP 135 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 K+ + I ++ + ++L++ S P + Sbjct: 136 ALR-----LKKMGVVIFSVGVGKRYRLKQLMQI-ASRPRLVFTAP 174 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 76/220 (34%), Gaps = 51/220 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D +++ +K L F+++ + +G+I ++ ++ ++ Sbjct: 85 DLSKSMDAADVTPSRLEKVKFELNRFIEN-------ERANRIGIIIFSNDAYIHVPLTYD 137 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ D L T+ A++ AY L + K Sbjct: 138 AAALELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSA---------------DPTSRAK 182 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 ++ TDGEN S+ T + + + I + ++++ Sbjct: 183 MMVLFTDGEN----SSSCTNALFNNLRRFGIGVYSVAVGTKVGISIQENGKPLKDKNDKL 238 Query: 374 -----NGQRLLKTCVSSPEYHYNVVNA-DSLIHVFQNISQ 407 + L SS +Y + N+ + + + +I+Q Sbjct: 239 VISKLDENFLRGIANSSRGSYYELNNSKNDIQKLISDINQ 278 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 55/202 (27%), Gaps = 35/202 (17%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-------IGYTTRVE 264 ++ S +T++ A K A ++ I + V Y G + + Sbjct: 5 DNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMGCQDV 64 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + T A+ L Sbjct: 65 TILG-GPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN-------------- 109 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTC 382 I+ ++DG N +V ++ + ++ I I TI +N P + L+ Sbjct: 110 --------IVLVSDGIANCTPPDVC--EVAQELAQSGINLVINTIGLNVDPAAREELECI 159 Query: 383 VS-SPEYHYNVVNADSLIHVFQ 403 + + +A SL Sbjct: 160 AGVGGGTYADASDAQSLTDALT 181 >gi|304394408|ref|ZP_07376331.1| von Willebrand factor, type A [Ahrensia sp. R2A130] gi|303293848|gb|EFL88225.1| von Willebrand factor, type A [Ahrensia sp. R2A130] Length = 689 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 67/215 (31%), Gaps = 28/215 (13%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDV 252 + S M D + + Q + R K+ ++ L L ++ ++ Sbjct: 19 PANAQQTQSSRNVMV--VFDGSGSMWGQIEGRAKIEIARDVLSSVLGETTSNMTIGMIAY 76 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G + +E + P V + + R T + A+++A + L Sbjct: 77 GHRKKGQCSDIETVVAPGPAASTVPEMIARANAIKPKGKTPLSDAVRKAAESL------- 129 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISIN 370 + ++ +TDG N + + + +E+ + + Sbjct: 130 ------------RYTENEATVVLVTDGIETC---NADPCALATELEESGVDFTTHVVGFG 174 Query: 371 ASPN-GQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S + G+++ ++ + +AD L Sbjct: 175 LSKDEGRQVACLAANTGGKFISADDADELKAALDE 209 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 383 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYM 442 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 443 SGGTATGDAISFTVRNVF---------------GPIRDSPNKNFLVIVTDGQSYDD---- 483 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 484 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 539 Query: 406 SQLMV 410 + + Sbjct: 540 IRGIC 544 >gi|119494080|ref|ZP_01624623.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] gi|119452182|gb|EAW33385.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] Length = 608 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 72/254 (28%), Gaps = 50/254 (19%) Query: 156 YRLQFIEHLLNQRYNQK----IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + N ++ A + E R + LV+D SGSM Sbjct: 1 MQANYSLSYPLIATNTPATIDLILNFNAQTQAETSPRRPLNLSLVIDRSGSMA------- 53 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + A ++S+ V ++ Y + E + Sbjct: 54 -------------GQSLRYAIKAAQQLVESLTADDIVS------VVIYDDQPETI--LTP 92 Query: 272 GTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T + + + + T+ + + S + Sbjct: 93 QTVEDKAAICKQIGRIRAGGCTNLSGGWLMGCDCVKSRQTSDRLN--------------- 137 Query: 331 KFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 ++ LTDG+ N ++ I K E I T+ + N L+ ++ Sbjct: 138 -RVLLLTDGQANMGITDPKVITKTAQNQAETGIITTTLGFGSYFNEDLLISMADAAGGNF 196 Query: 390 YNVVNADSLIHVFQ 403 Y + + D + VF+ Sbjct: 197 YFIQSPDDVAQVFR 210 >gi|281417929|ref|ZP_06248949.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] gi|281409331|gb|EFB39589.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] Length = 1050 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 60/213 (28%), Gaps = 32/213 (15%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK------NIEPSWGTEKV 276 K T + +K+A F+D M +I Y+ ++ + Sbjct: 721 DKSTYLKLVKDAAKNFVDKFA-----GSKTKMAVIQYSDSANDNDFKKYDLSLPDKGAAL 775 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK------------RSFFTNFFRQGVKI 324 ++ + + ++ M++AY IL S + K Sbjct: 776 KETIDKIKPG-TSGLSNMGDGMRRAYHILNDPPPKGQISKYIVVITGSVPNRWTAVDNKK 834 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +N ++ S + I +V I + G L + Sbjct: 835 NEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEEDIGDVLEEIAAE 894 Query: 385 SPE--------YHYNVVNADSLIHVFQNISQLM 409 S ++Y N L+ + N++ + Sbjct: 895 SGAKPLEGTDRHYYKANNFLELLDILNNMTLKI 927 >gi|58429481|gb|AAW78144.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|326426687|gb|EGD72257.1| hypothetical protein PTSG_00277 [Salpingoeca sp. ATCC 50818] Length = 2847 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 69/198 (34%), Gaps = 27/198 (13%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 NS I ++ + +A L++ + +D S + + + + T Sbjct: 1257 DNSGSISEEDFQQSIAFLRDIVQTQIDLEGTGS------RVAAMTFCATQTLLTTFTGVT 1310 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + ++++ T + A+ A + +D + ++ + Sbjct: 1311 TDALNALDSAVNTVSCG-TATGAALDFAIANILNDN---------------SNPGARRVV 1354 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 I +TDG + + + D+ ++ I + + L+T +S + + Sbjct: 1355 IVVTDGA--SQEDSSFVTAASDRLHAEVDEVFAIGVG-PASNLAELQTIATSNDNAFIET 1411 Query: 394 NADSLIHVFQNISQLMVH 411 + D+L ++++Q + Sbjct: 1412 DFDALRD--RDLTQQLCS 1427 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + LK +D + + + + + ++ E +++ S+ + + Sbjct: 473 YSLLKQFAENVVDEL---VISESESRIAVEIFSDVQEVSLQFSYDKPTIVDTIRNLRHIQ 529 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A++ A + + S + +L + +TDG+ + + Sbjct: 530 R--GTATGAALELARRDIFSSSDPN-----------YRNLAVPAIAVVITDGQ--SQEEP 574 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + +E + ++ + I + LL S + V L+ F Sbjct: 575 AFVAEQALRLRELGVTVLALGIGDGTDPDELLAI-AGSADRVEQVEQFGDLVQEF 628 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 68/182 (37%), Gaps = 28/182 (15%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTRDM 284 ++ L ++ ++ + V +G++ +++ E ++ K+RQ + + + Sbjct: 340 NRIDKLGESVRAWIKN-----DVPTGSQLGMVMFSSTAHIVSELQVISDMKIRQEMMKKV 394 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A Q+L + I+ +TDG N+ Sbjct: 395 PKDLYSITCIGCGLDLAVQMLQE----------------KGNNKTGGIIVLVTDGRNSAG 438 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---SLIHV 401 +++ ++ + + I++VT++ + + + + + Y + + D +L Sbjct: 439 YLDISDVE--EDIVKAGIRVVTVAFGSEADSN-IERLADVTGGKSYYIKDGDSSEALQRA 495 Query: 402 FQ 403 F Sbjct: 496 FT 497 >gi|262381905|ref|ZP_06075043.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|301310438|ref|ZP_07216377.1| BatB protein [Bacteroides sp. 20_3] gi|262297082|gb|EEY85012.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|300832012|gb|EFK62643.1| BatB protein [Bacteroides sp. 20_3] Length = 339 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 59/207 (28%), Gaps = 26/207 (12%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPNRLDKAKQMLSRLTDGFT-------NDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI------- 324 + +++ S++ + T A+ A + T D+ G Sbjct: 150 YISAKMFLSSINPSMVSTQGTAIGAAINLAARSFTPDETTDKAIILITDGENHEDDAIGA 209 Query: 325 --PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + G+ + K K+ + I N Q + Sbjct: 210 AKAAAEKGIHVNIVGMGDPKGSPIPIQGSNNYMKDKDGNVVI------TKLNEQMGQEIA 263 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQLM 409 + + N + + + + Sbjct: 264 AAGNGMYVRADNT---NSALKALQKEI 287 >gi|58429519|gb|AAW78163.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 565 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429497|gb|AAW78152.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429469|gb|AAW78138.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 70/246 (28%), Gaps = 56/246 (22%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + P I LV+D SGSM ++K+ K++ Sbjct: 360 VLNDIATTPVPVPDQDIMLVIDRSGSM--------------SGDAGTGQSKIDEAKDSAS 405 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVT-------RDMDSLI 288 LF+ ++ + MGL+ ++T + + K Q + + Sbjct: 406 LFVQLVEASA----GHRMGLVSFSTSASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLIP 461 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + +A LT K I+ LTDG N Sbjct: 462 DGWTSIGDGIDKAQSELTGGAN-------------------PKTILLLTDGLQNT---PP 499 Query: 349 NTIKICDKAKENFIKIVTISINASPN-GQRLLKTC-VSSPEYHYNVVNADSLIH----VF 402 + + +I I + N LL S+ + + L F Sbjct: 500 MIETATNDIGD--TRIHAIGLGTEANLNGGLLSDLTQSTGGAYTRAGDGLELKKFFALAF 557 Query: 403 QNISQL 408 +I + Sbjct: 558 GDIFED 563 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 73/203 (35%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 321 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 376 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 417 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 418 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALTSGNTDLSQKSLQLESKG 494 >gi|156324784|ref|XP_001618484.1| hypothetical protein NEMVEDRAFT_v1g2784 [Nematostella vectensis] gi|156199073|gb|EDO26384.1| predicted protein [Nematostella vectensis] Length = 410 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 45/159 (28%), Gaps = 22/159 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + E ++GL+ Y+ + ++ + T + Sbjct: 257 NFMINIVKSFKISSERTHVGLVLYSFFTQLMFNFDKYSDSASIVKAINTTDYPKGGTRTG 316 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K A L RS K +I LTDG S+ Sbjct: 317 EALKMAKSQLFGASMRS----------------VPKVLIVLTDGR-----SSDKVEAPSK 355 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVV 393 K+ + I + + + L S ++ + Sbjct: 356 ALKDEGVVIFAVGVGDQIDPSELNVMASDSKSDHVFKAK 394 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 73/203 (35%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 321 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 376 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 417 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 418 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALTSGNTDLSQKSLQLESKG 494 >gi|160716|gb|AAA29773.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 568 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D DV C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDVYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRNLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 73/203 (35%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 322 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 376 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 377 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 418 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 419 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 472 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 473 AFGALTSGNTDLSQKSLQLESKG 495 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 71/224 (31%), Gaps = 47/224 (20%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 +I R + V+D SGSM K K+ ++ A + ++ Sbjct: 34 GQIMTQVRAPVNVCFVIDRSGSM--------------------KGEKIDRVRRATIRAIE 73 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +D V ++ + R E I + K + R T PA++ Sbjct: 74 MLDAQDVVS------VVIFDHRTEVLIPATP-VAKPAELADRVNRVRDSGGTRIAPAIEA 126 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + +I LTDG+ +S + ++ + A Sbjct: 127 GLREIDKGPSHMV-----------------RRLILLTDGQT---ESESDCLRRAEDAGRR 166 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I + + N L++ S + + ++ FQN Sbjct: 167 NVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPEKIVDYFQN 210 >gi|87199928|ref|YP_497185.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87135609|gb|ABD26351.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 153 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 15/96 (15%), Positives = 34/96 (35%), Gaps = 1/96 (1%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++RF + + A I A+ +L + V +++ ++SA A +A Sbjct: 1 MTRFIPFLSRLRACRDGAIMIEVAVLTPVLVLFGLGTVEVSSLVARRSELQSAL-AEAVA 59 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 A + D ES+ + + + + Sbjct: 60 IALASKPDTQSKIDTIESVISASTGVSTANIDTAVI 95 >gi|7766811|pdb|1CK4|A Chain A, Crystal Structure Of Rat A1b1 Integrin I-Domain. gi|7766812|pdb|1CK4|B Chain B, Crystal Structure Of Rat A1b1 Integrin I-Domain Length = 198 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + + D + + + +G++ Y V + + V + Sbjct: 22 SVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIGRQGGLQ 81 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 82 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 128 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 129 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 185 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 186 VKALGERIFAL 196 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 73/203 (35%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 321 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 376 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 417 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 418 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALTSGNTDLSQKSLQLESKG 494 >gi|7463254|pir||E70121 hypothetical protein BB0173 - Lyme disease spirochete Length = 340 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 66/219 (30%), Gaps = 51/219 (23%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 118 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 170 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 171 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 212 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + + I + +KI +I I +S + L++ Sbjct: 213 SDEIKDQVINLAQGL---NVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 269 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 270 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 308 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 73/203 (35%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 321 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 376 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 417 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 418 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 471 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 472 AFGALTSGNTDLSQKSLQLESKG 494 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 27/169 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L+ +++ +GL+ + + + + + Q +T ++ T Sbjct: 328 NQAAELYLIQIIEKGSLVGLVTFDSFAKIQSKLIKIIDDNTYQKITANLPQEADGGTSIC 387 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 +K +Q ++ + +F + II LTDGE+ I +C Sbjct: 388 RGLKAGFQAISQSNQSTFGSE----------------IILLTDGED-------YQISLCF 424 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVF 402 + K++ I TI++ S + + L + + + N + LI F Sbjct: 425 GEVKQSGTVIHTIALGPSADEE-LETLSNMTGGHRFYAHKNINGLIDAF 472 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 72/191 (37%), Gaps = 34/191 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 ++A LK A +FL L +++ ++G++ + + ++ +R+ +T + Sbjct: 317 RVARLKQAAEIFL-----LQIIEDGSWVGIVTFNSAATIKTGLQQITSDSIRRSLTGYLP 371 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++ +++ I+ LTDGE++ Sbjct: 372 VTAGGGTRICNGVEAGFKVFKQKYASEKGCE----------------IVLLTDGEDS--- 412 Query: 346 SNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIHV 401 TI C D+ K + I TI++ S + L + + ++ +++SLI Sbjct: 413 ----TISYCLDEVKRSGSIIHTIALGRSADPG-LEELADMTGGLKFSATDSLDSNSLIDA 467 Query: 402 FQNISQLMVHR 412 F IS + Sbjct: 468 FTGISSSDGNL 478 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 74/237 (31%), Gaps = 45/237 (18%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I P + ++D+SGSM K+ LK+A L + Sbjct: 322 DIAPATAPHANLVFLIDVSGSMDEPD-------------------KLPLLKSAFRLLVSK 362 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ Y +EP+ EK + ++ T ++ A Sbjct: 363 LKADD------TVSIVTYAGNAGTVLEPTRVAEKSK-ILSAIDRLEAGGSTGGAEGIEAA 415 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKEN 360 Y + + ++ TDG+ N S+ + +I ++ +++ Sbjct: 416 YNLAKQAFVKDGVN----------------RVMLATDGDFNVGPSSDEDLKRIIEEKRKD 459 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + + L++T + + D+L + + + + + Sbjct: 460 GIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEEAGSTLFPIA 514 >gi|332828898|gb|EGK01581.1| hypothetical protein HMPREF9455_02113 [Dysgonomonas gadei ATCC BAA-286] Length = 609 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/372 (11%), Positives = 107/372 (28%), Gaps = 55/372 (14%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A + A L +V+ + + S R ++ +F +N K + + F Sbjct: 99 QAADIAELREHKAIVAEVPVAKPFIMAESPRYDRQRNEEYSQFTENKFKMAREEPLSTFS 158 Query: 115 NTEIQNIVNSSRISM--THMANNRLD-----SSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 ++ R + + + + Y + Sbjct: 159 VDVDAASYSNMRRFINRGELPYKDAVRIEELINYFSYDYAEPAGNDPVRITTEVGSCPWN 218 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N+ + + P ++D+SGSM T+ Sbjct: 219 SQNRLVK-VGLKAKSLASDNLPASNFVFLIDVSGSM-------------------SGPTR 258 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ + ++ Y + + + + G K ++ + Sbjct: 259 LDLVKSSLKLLVNNLRKKD------RVAIVVYASSTGEVLPSTSGENK-QKIKEALNNLS 311 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+I + + II TDG+ N S+ Sbjct: 312 AGGSTAGGAGIQLAYKIAKQNFIKGGNN----------------RIILCTDGDFNVGVSS 355 Query: 348 VN-TIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNI 405 ++ + ++ + + + ++ + H + N L + + Sbjct: 356 NEGLQRLIENERKTGVFLSILGYGMGNYKDSKMQTLAQAGNGNHAYIDN---LQEANKVL 412 Query: 406 SQLMVHRKYSVI 417 Y+V Sbjct: 413 VNEFGATMYTVA 424 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 84/254 (33%), Gaps = 53/254 (20%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + +P ++ +R ++ LV+D+SGSM Sbjct: 267 VIMNSTDIKATPPQADSNLPVPTFTLLQSSDR---VVTLVLDVSGSMA------------ 311 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 ++ L A +F+ + V+E ++G++ ++T + + Sbjct: 312 -------SDGRIGRLYQAAEVFV-----MQIVEEGSHVGIVSFSTSTTVLSKLVQVIDDT 359 Query: 277 -RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R ++ + + T+ +++ ++ + + I+ Sbjct: 360 QRNHLKFLLPKTAVGGTNICAGIREGIKV----------------NNQHDGSSYGTEIVL 403 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 LTDGE+N S + I + I++ +PN L + + Sbjct: 404 LTDGEDNYNTSLCFP-----DISNSGIIVHFIALGPNPNPN-LETIVDMTGGLRFLATDK 457 Query: 394 -NADSLIHVFQNIS 406 +A LI F +++ Sbjct: 458 VDAQGLIDAFSSLT 471 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 64/217 (29%), Gaps = 27/217 (12%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 + + + D P+ +T++ A + S + + Sbjct: 411 IGIHRRGNTLAVYDTSGSMDLPVANSGGKTRLQIAVGAADAAIPLFAKDSRLGLWQFSTN 470 Query: 257 IGYTTRVEKNIEPSWGTEKV------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + T + + ++V V T A++ L++ + Sbjct: 471 LDGTKPYRELVPVGLMNDEVGTGTREEALVAAVNGLKAKGGTGLYATALAAFESLSAQYQ 530 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIV 365 ++ LTDG+N++ S++ ++ K + I+ Sbjct: 531 ----------------PDKPNQVVLLTDGQNDDPTSSMTLTQLIATLKAEYNPKAPVHII 574 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 TI A + L + ++ Y + +S+ V Sbjct: 575 TIGYGADADMDALRQISAATGSKTYPAQDPNSIFQVM 611 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|73974565|ref|XP_532319.2| PREDICTED: similar to collagen, type XIV, alpha 1 [Canis familiaris] Length = 1796 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 48/163 (29%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKL-NAYNTKETLLDAIKHISYKGGNTKTGKAIKHVRDSLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + N I + Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISGEMQSNGYSIFAVG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSSRHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 53/173 (30%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 185 VTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGARTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|15893764|ref|NP_347113.1| heat shock protein DnaK [Clostridium acetobutylicum ATCC 824] gi|15023331|gb|AAK78453.1|AE007561_14 DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum ATCC 824] gi|325507887|gb|ADZ19523.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum EA 2018] Length = 698 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 64/191 (33%), Gaps = 31/191 (16%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + A F+DSID S + LIG+ +V+ I + E++ + Sbjct: 521 SGSMRGKPLEEAIEASKTFVDSIDEGS-----FSLALIGFADKVKTLINLTEDREEIFRA 575 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + + T + +AY IL F ++ LTDG Sbjct: 576 IDGLKKADVGTST-MSEPFSEAYNILKDAYGDCF-------------------VVVLTDG 615 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + K + + C KE I+I I + L K + +L Sbjct: 616 QWYGKKDIMAEVNKC---KEYEIEIAAIGFG-NAKKDFLDKIATCEENSIF--TEVSNLK 669 Query: 400 HVFQNISQLMV 410 F I++++ Sbjct: 670 QSFSRIAKVIS 680 >gi|163848202|ref|YP_001636246.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526110|ref|YP_002570581.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669491|gb|ABY35857.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449989|gb|ACM54255.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 415 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/243 (12%), Positives = 65/243 (26%), Gaps = 49/243 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + Q + A+ + + V+D SGSM Sbjct: 16 IPSSTTPQVGYLLVEAVAPATLAPSLPLNLCFVLDRSGSMQ------------------- 56 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVT 281 K+ LK A ++ + ++ + V+ I + + V Sbjct: 57 -GAKLDNLKAATRRVIEQLQPTDIA------AIVIFDDTVQTLIPATPVGDRSALLAAVD 109 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 ++ T + M+ L ++ LTDG+ Sbjct: 110 SISEA---GGTAMSLGMQAGQVELQKHLGSDRL----------------SRMLLLTDGQT 150 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + ++IV + N Q L +S Y + + + Sbjct: 151 WGDEP--LCRDLARSLGQAGVQIVAFGLGTEWNEQLLDDIAAASNGYSDYIAAPEQIGTF 208 Query: 402 FQN 404 FQ Sbjct: 209 FQQ 211 >gi|327280282|ref|XP_003224881.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XIV) chain-like [Anolis carolinensis] Length = 1885 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 54/185 (29%), Gaps = 25/185 (13%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + + ++ + + K + Sbjct: 1053 DNFNKIIGFLYSTVGALDKIGPDGTQVAIAQFSDDPRTEFKLNAYRTKETLLEAIQQIAY 1112 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K Q+L + K ++ +TDG + + + Sbjct: 1113 KGGNTKTGKAIKHVQQVLFTIDSG-------------TRKGIPKVLVVITDGRSQDDVN- 1158 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNIS 406 K+ + + + I I + A + LL S + + V + F+ I Sbjct: 1159 ----KVSREMQLDGFSIFAIGV-ADADYSELLNIGSKPSERHVFFVDDF----DAFKKIE 1209 Query: 407 QLMVH 411 +V Sbjct: 1210 DELVT 1214 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 55/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + + K T + A+ Sbjct: 186 VAAFNVGSEKTRIGLAQYSGDPRIEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTF- 244 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 IL ++ K K I +TDG++ + K+ Sbjct: 245 --ILENNFKSEAGA----------RPGVPKIGILITDGKSQDDVIPP-----AKNLKDAG 287 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ I + + LK S P+ + YNV + + + + +++ + R Sbjct: 288 IELFAIGV--KNADETELKEIASEPDNTHVYNVADFSFMNSIVEGLTKTVCSR 338 >gi|160714|gb|AAA29772.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 565 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429459|gb|AAW78133.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429495|gb|AAW78151.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429503|gb|AAW78155.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429541|gb|AAW78174.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|330831165|ref|YP_004394117.1| TapY1 [Aeromonas veronii B565] gi|328806301|gb|AEB51500.1| TapY1 [Aeromonas veronii B565] Length = 1214 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/282 (9%), Positives = 63/282 (22%), Gaps = 68/282 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL----------D 240 + + S ++ + +T++ ++A+ + Sbjct: 199 SNVANALASGSSTQISLFTANYVRWYYGSTGVSTQTRLKVAQDAVKELVLATPAVDFGLA 258 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + G V + + G + + + T + + Sbjct: 259 VFNYNDGNEPGSRNGGRIIRNIVGNDAILTSGKSGAQDLIDKVNTLSSDTWTPLCETLYE 318 Query: 301 AYQIL--------------TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +Y+ T + + +II++TDGE N K+ Sbjct: 319 SYRYFGGLGVYFGDDDVNRTPKRDSNAEVTSGIYKTPYDKCSNNGYIIYMTDGEPTNDKA 378 Query: 347 -------------------------------NVNTIKICDKAKENFI----------KIV 365 + K N + Sbjct: 379 ADSLVANLISTLSTEDKVAYGGLVSYGSSGEKSYLAALAGYMKNNDVNTSAIGKQTVTTF 438 Query: 366 TISINASP---NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 T+ G L +T +Y +A L ++ Sbjct: 439 TVGFGDEAIKGAGALLAETARRGGGKYYPATDATQLNEALKS 480 >gi|300789363|ref|YP_003769654.1| hypothetical protein AMED_7539 [Amycolatopsis mediterranei U32] gi|299798877|gb|ADJ49252.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 526 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 77/260 (29%), Gaps = 25/260 (9%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 LL R + P E +++ + S + D +A + Sbjct: 278 PSGQLLGDRPRDTRTNAAPRPAGPPTPEAMYGVLQAWAGANLSARVQVLLDVSGSMAATV 337 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ- 278 + ++MA A L + + ++ + ++ + ++++ Sbjct: 338 PGTGR-SRMALTLEAATQGLGLFKPTTEIGLWLFSTKLDGPKDYKELLPMRPISQQLAAG 396 Query: 279 --YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T ++ AYQ + ++ L Sbjct: 397 GVATLQAVKPKPGGATGLYDSILAAYQNARQSWQLGRIN----------------VVVVL 440 Query: 337 TDGENNNFKSNVNTIKICDKAK----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 TDG N + S + + + + ++ I I + L + ++ + Sbjct: 441 TDGRNEDDDSIGLPGLLAELGRLQDPRKPLPVIGIGIGPDIDASELRQVSAATGGESFTT 500 Query: 393 VNADSLIHVF-QNISQLMVH 411 + + VF Q +S+LM Sbjct: 501 PDPRKISDVFYQALSKLMCQ 520 >gi|148688391|gb|EDL20338.1| mCG126758 [Mus musculus] Length = 223 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 57/185 (30%), Gaps = 26/185 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVHQLTER-FVSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + + L II LTDG + Sbjct: 121 HEGLKLANEQIQNA----------------GGLKASSIIIALTDG-KLDGLVPSYAENEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 K++ + + + Q L+ S + + V FQ + ++ Sbjct: 164 KKSRSLGASVYCVGVLD--FEQAQLERIADSKDQVFPVKGG------FQALKGIINSVSL 215 Query: 415 SVILK 419 + +LK Sbjct: 216 NHVLK 220 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 69/203 (33%), Gaps = 38/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDM 284 ++ L A LFL L +++ + G++ + + T+ R + + Sbjct: 321 DRLNRLNQASQLFL-----LQIIEKGSWTGMVTFDSSATIQSALIQIETDAQRNSLISRL 375 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T ++ A+ ++ I+ LTDGE++ Sbjct: 376 PTAAGGGTSICSGLRTAFTVIKKKFST-----------------HGSEIVLLTDGEDS-- 416 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 TI C D+ K++ I T+++ S + L K + + + LI Sbjct: 417 -----TISSCFDEVKQSGAIIHTVALGPSADPG-LEKLAEMTGGMKTSATDNAQNNGLID 470 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F +S + R + KG Sbjct: 471 AFSALSSGNGAITQRSIQLESKG 493 >gi|304320961|ref|YP_003854604.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] gi|303299863|gb|ADM09462.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] Length = 432 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 20/243 (8%), Positives = 59/243 (24%), Gaps = 18/243 (7%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + E+ + +++ L ++ L + G + H +E A + Sbjct: 12 RQTLARFCTEERGSIAVMMMLMLVPALGIGGLVFDGKRIHTAHLELEIVA---ESAAIAA 68 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS--- 124 + S + ++ + ++ E S T Sbjct: 69 ALQLPSESSAKSAAVDYAEQNLPPSSYGEVVRASDVEIGSYDEGTGTFTAGAGTSAVRVT 128 Query: 125 -----------SRISMTHMANNRLDSSNNTIFYNMDVMTS-YDYRLQFIEHLLNQRYNQK 172 I + + ++ S + I + + + + Sbjct: 129 AWRHEDRSNSLPTIFAGAIGRDSVNLSASAIAVSQSSSEDPICILVLGNKFYGLDLDLRS 188 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V +++ E + ++ S + S + + + Sbjct: 189 EVDIPDCGIQVNSDEYDAVQASSLATVNASFFNVVGSVLGSTSGLTPTPTEGADAVDDPY 248 Query: 233 NAL 235 +L Sbjct: 249 ASL 251 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 380 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 439 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 440 SGGTATGDAISFTVRNVF---------------GPMRDSPNKNFLVIVTDGQSYDD---- 480 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 481 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 536 Query: 406 SQLMV 410 + + Sbjct: 537 IRGIC 541 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 63/189 (33%), Gaps = 30/189 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 R+ MA ++ A+ FL S +DV +G++ + I P+ V+ V Sbjct: 101 RSGMATVRTAVKDFLASA------PKDVRIGVVSFGNTAGPEIAPTTARAAVQAVVD--- 151 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D T + QA ++L S + I+ L+DG+N Sbjct: 152 DLRADGNTALFSGVTQAVRMLGSTGD--------------------RSIVLLSDGKNTVG 191 Query: 345 KSNVNTIKICDKAKENFIKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + +++ + + + L + +A+ + FQ Sbjct: 192 DRASGLAAAGKALTASQVRVEVVRFTTGENDPEALAAFAKAGGGSVVQATDAEGVRTAFQ 251 Query: 404 NISQLMVHR 412 ++++ + Sbjct: 252 TAAKVLESQ 260 >gi|307944861|ref|ZP_07660199.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307772075|gb|EFO31298.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 862 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 13/130 (10%), Positives = 45/130 (34%), Gaps = 7/130 (5%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 RD+++L+ + S + + K + ++ ++ +DG Sbjct: 95 RDIETLVPMGSGSGADISTRINSIRPKGKTPLSQSVRLAAQELQYREEAATVVLFSDGIE 154 Query: 342 NNFKSNVNTIKICDKAKENFI--KIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADS 397 ++ + ++ + + I I + + ++L ++ +++ +A S Sbjct: 155 TC-NADPCL--LAEELERDGIDFTAHVIGFGIGSDADRKKLQCIAENTGGTYFDADDAGS 211 Query: 398 LIHVFQNISQ 407 L ++ Sbjct: 212 LKDALGQVTN 221 >gi|313238855|emb|CBY13854.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 33/356 (9%), Positives = 91/356 (25%), Gaps = 40/356 (11%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE-IQNIVNSSRISMT 130 + + K+ + D +F + +S ++ S + Sbjct: 7 ADGSSSVSQVSFDLMKKWIADFVDKFDVSQTATRVSVVQFTDNVSDRQVFRYYLSADFLL 66 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQ--FIEHLLNQRYNQKIVSFIPALLRIEMGE- 187 + + + D++ + + Sbjct: 67 IVLTDGVSQDEIEKA-ASDLIGDKVLVFSVGIGNSVDANELEKIAGLPEYVFKTANYNAL 125 Query: 188 RPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I S S +C + + + LK + ++ +++ Sbjct: 126 TGITDTLYNKLCSSLKSDNCISSVKQDLSFIVDSSSSITISDYQKLKTWMKSIIEKLEIG 185 Query: 246 SHVKEDVYMGLIGYTTRVE--KNIE-PSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQ 300 + + G + R + ++ ++ V +D + T A+ Sbjct: 186 DNASRVSILQFSGQSARPQGRWINPVLTFDRSTSKEAVIGAIDGMKKLNGDTCIGEALDY 245 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 Y+ + + + ++ +I +TDG+ N + + Sbjct: 246 FYRNMFTSQAGQ-------------RSDVEQRVIVMTDGKRNCPAEIAKP---AELIRAQ 289 Query: 361 FIKIVTISINAS---------PNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNIS 406 +I I I + Q L + + + + + N D L + + I Sbjct: 290 EAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEINNFDQL--ILKRIG 343 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/245 (10%), Positives = 73/245 (29%), Gaps = 52/245 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 K + + + + I VVD SGS+ + + Sbjct: 340 KRIGILGDVCQEAECPDIFADIVFVVDSSGSI--------------------GPKRFDYM 379 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDM----DS 286 KN + S + ++ YT + ++ + + Q + + + Sbjct: 380 KNWVKSIAASFK---VGENFARFSVVQYTKTAKTVVDFQTLDLSSISQKIDSMIYFQGRN 436 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+++A+ +L + ++ ++ LTDG S Sbjct: 437 GRGGKTFTGNALERAHTLLKESEPGR-----------------KRIVLLLTDG-----SS 474 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +++ + I + + + L + + + +++ Q + Sbjct: 475 DDEYGSVAKAIRDDKVDIFAVGVGR--ARKDELVEITADEQRVWQTRTFNNIGQFNQKLL 532 Query: 407 QLMVH 411 + Sbjct: 533 AEVCS 537 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 78/247 (31%), Gaps = 64/247 (25%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I + VDLS SM +++ +K L +D+ Sbjct: 79 DIMISVDLSASMDANDV---------------APSRLEKIKYELKNIVDAF-------NS 116 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 +GLI +++ ++ + ++ L+ TD A+ A++ LTS+ Sbjct: 117 DRIGLIIFSSEAFVQCPLTYDQNALNLFIETLNTGLVPGSSTDFGSALNMAHEKLTSEAA 176 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S K II ++DGE+ +T K ++ I++ ++ + Sbjct: 177 PSSQQK-------------SKIIILISDGED----FGDDTEGAVSKINDSGIRLFSLGVG 219 Query: 371 ASP-----------------------NGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS 406 + + L + ++ + N + + + I+ Sbjct: 220 TEQGSKIRTRRGYRKDKSGNDIVTKLDSRSLENLADQTDGEYFEINKNKNEVGRLISTIN 279 Query: 407 QLMVHRK 413 + K Sbjct: 280 AIEGELK 286 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 45/344 (13%), Positives = 92/344 (26%), Gaps = 34/344 (9%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + L + N K + +++S S + +E Q +N S Sbjct: 224 TSATGLNLLLTLLQDGEDNFTKFNSNIPYVAYTTQQMRDSASNGTLDAMLSEYQAYINDS 283 Query: 126 RI-----SMTHMANNRLDSSNNTIFYNMDVMTS-YDYRLQFIEHLLNQRYNQKIVSFIPA 179 + + + + + + Q K Sbjct: 284 NLVSMYDFIAFGVRHDNPVYICNKSGKSASELEAIQLVVDYCKSDDMQTIATKKGFNAND 343 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK-KRTKMAALKNALLLF 238 ++ L N C M LKN+L Sbjct: 344 D---YTATEEFSGAQVTQGLKTYKKTKDNGKDIIAVFVADCSGSMDGDPMNQLKNSLTN- 399 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPA 297 ++ ++ Y+GL+ Y++ V + + R Y + + T S A Sbjct: 400 -----GAQYINDNNYVGLVSYSSSVTVEVPIAQFDLNQRSYFQGSVNNLMASGGTASYDA 454 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A +++T K + + L+DG N S +I Sbjct: 455 VVVAMKMITD--------------AKAEHPDAKCMLFLLSDGYANVGYS---MDEITSAL 497 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +++ I + TI + L K + N + D + + Sbjct: 498 RQSNIPVYTIGYGGDADTDELTKLSGINEAASINADSDDIIYKI 541 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 15/115 (13%), Positives = 35/115 (30%), Gaps = 12/115 (10%) Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ Q +T K + + + ++ ++DG C Sbjct: 84 QALIDKVQSITPRGKTPISKTLSLAALSVGFFSGKSSVVLVSDGLET------CNADPCA 137 Query: 356 KAKENFIK-----IVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 +AK I + I + + + L+ + + + NA+ L + Sbjct: 138 QAKSLGIINPGFDVHVIGFDVTEEEFKSLQCIATETGGKFFRANNAEELKDALRQ 192 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 60/185 (32%), Gaps = 20/185 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + + S + L+ Y ++ + + Sbjct: 234 FQRAKDFIANMMTNFSEKCFECSFALVQYGRVIQTEFNLQDSQNMTASLAKVQNITQVRN 293 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A++ + + + S K I+ LTDG+ F+ +N Sbjct: 294 VTRTASAIQHVLDDIFTQRHGS-------------RKKASKVIVVLTDGDT--FEDPLNL 338 Query: 351 IKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + + K I+ I + + ++ LK S P+ + + V N +L + + Sbjct: 339 TVVINSPKMQGIERFAIGVGDAFKKHQTEQELKLIASDPDETHAFKVTNYSALDGLLSRL 398 Query: 406 SQLMV 410 Q ++ Sbjct: 399 QQSII 403 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 53/167 (31%), Gaps = 21/167 (12%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDST 295 I+ L +G++ Y + V+ K + T + Sbjct: 49 FLTQVIESLDVGPNATRVGVVNYASAVKHEFPLKAHRSKA-SLLQAVRRLEPLSTGTMTG 107 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A S+ +G + S K I +TDG + +V+ Sbjct: 108 LAIQFAISRAFSEV----------EGARPLSPALSKVAIVVTDGRPQDDVKDVSAR---- 153 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 A+E I++ I + + L+ S P E+ V + + Sbjct: 154 -AREAGIELFAIGVGRVD--KTTLRRIASEPLAEHVDYVESYSVIEK 197 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 383 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYM 442 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 443 SGGTATGDAIAFTVRNVF---------------GPIRDSPNKNFLVIVTDGQSYDD---- 483 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + L+ S P + + L + ++ Sbjct: 484 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LRDMASKPKESHAFFTREFTGLEPIVSDV 539 Query: 406 SQLMV 410 + + Sbjct: 540 IRGIC 544 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 62/180 (34%), Gaps = 24/180 (13%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ + + + + I Y++R + + T R + T Sbjct: 26 VKGFIRRLVESFHISRTSTRVAAIVYSSRPRVAFDFNRYTSARRAAHAVKRLRFLRGGTS 85 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A L R +K ++ +TDG KS+ + +K Sbjct: 86 TGRALRLASSRLFRRYGRKR----------------RKVLMLITDG-----KSSDDVLKP 124 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 K ++I + + S + L+ S P Y + SL + +++++ K Sbjct: 125 SKALKRKGVQIFAVGVGMSVSRNELI-LIASHPSQVYQA-SFTSLSAIVKSLARKTCESK 182 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 55/188 (29%), Gaps = 25/188 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + + + + + V Sbjct: 157 FTVMKGFVTNVIRRF--IERDAQFAIAQYSNDCDIHYNFNDLKLDDGTWESKVANIP--Y 212 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+++ L + +K ++ +TDGE+++ Sbjct: 213 HEGGTFTASAIQKLVNYLFT-------------PNGGTRPSAKKILVVITDGESHD---R 256 Query: 348 VNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVF 402 +A++N I I + N + L T S P+ + V + ++L ++ Sbjct: 257 NLLKDAASQAEKNSIVRFAIGVGKAFDYYNAREELNTIASDPDTDYVFKVTDFNALKNIL 316 Query: 403 QNISQLMV 410 Q + ++ Sbjct: 317 QKLEGNII 324 >gi|153874442|ref|ZP_02002664.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152069095|gb|EDN67337.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 478 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 63/192 (32%), Gaps = 27/192 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + TK+ ++ A + F+ L+H+ + + ++G+ R + + Sbjct: 51 TSGSMAEGTKLQEVQAAAIQFIQRRHGLTHLANN-KIAVVGFGGRAYLVANLTSDLMNLE 109 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q + + + T ++ A L++ Q+ I+ T Sbjct: 110 QPIQKL---RAVGGTPMDRGLQSAMNQLSAGSDSE-----------------QRSILLFT 149 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG+ +N + T+ K I+IV I+ LL + + + Sbjct: 150 DGKPDNQR---TTLNASQLVKNANIQIVAIA--TDDADIGLLTQVTGDAALVFP-TSVGN 203 Query: 398 LIHVFQNISQLM 409 FQ Q + Sbjct: 204 FDQAFQKAEQAI 215 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 28/218 (12%), Positives = 63/218 (28%), Gaps = 37/218 (16%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 AM+ +P V K+ + A F + L+ + + Sbjct: 81 QFAMDDEPVAVGLVFDTSGSMGEKLQRSRMAAREF------FHISNPEDEFFLVEFDSSP 134 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + + T + S T A+ A Sbjct: 135 RLVVPLTSDTGTIE---DHLTFSRSHGSTALLDAIFLAL------------------HEM 173 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-INASPNGQR----- 377 S +K ++ ++DG +N+ + + ++ KE+ + I +I + + Sbjct: 174 KHSKKNKKALLIISDGGDNHSRYSE--KEVSSVVKESDVLIYSIGVFGGGGSPEEAGGPG 231 Query: 378 -LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 L K + + +A L + + I + +R Sbjct: 232 LLSKVSEQTGGRLFEA-SAVELPDIAKKIGIELRNRYI 268 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 TK+ K AL L + H + I + ++ + + +R Sbjct: 314 SSASMVGTKLRQTKEALFTILQDLRPEDHFNIIGFSNRIK-VWQQDRLVPVTPN--NIRD 370 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ A++ ++L ++ IIFLTD Sbjct: 371 AKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNI-----------DARSVSLIIFLTD 419 Query: 339 GENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVV 393 G +S+ D ++ F + TI I + + L + + + + Sbjct: 420 GRPTVGETQSSKILSNTKDAIRDKFC-LFTIGIGNDVDYKLLERMALENCGMVRHFQEDE 478 Query: 394 NADS-LIHVFQNISQ 407 +A S L + I Sbjct: 479 DAASHLKGFYDEIGT 493 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 60/195 (30%), Gaps = 21/195 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 TK+ K AL L + H + I + ++ + + +R Sbjct: 318 SSASMVGTKLRQTKEALFTILQDLRPEDHFNIIGFSNRIK-VWQQDRLVPVTPN--NIRD 374 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ A++ ++L ++ IIFLTD Sbjct: 375 AKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNI-----------DARSVSLIIFLTD 423 Query: 339 GENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVV 393 G +S+ D ++ F + TI I + + L + + + + Sbjct: 424 GRPTVGETQSSKILSNTKDAIRDKFC-LFTIGIGNDVDYKLLERMALENCGMVRHFQEDE 482 Query: 394 NADS-LIHVFQNISQ 407 +A S L + I Sbjct: 483 DAASHLKGFYDEIGT 497 >gi|319783910|ref|YP_004143386.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169798|gb|ADV13336.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 393 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/257 (11%), Positives = 68/257 (26%), Gaps = 4/257 (1%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 S AN + IFAL++ + GF + W+Y +++ +AA AGA + +S Sbjct: 1 MRRSTGANVATIFALTLPVVVGAAGFGVETSYWYYNSLRLQATADAAAYAGALEQISGSD 60 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + A L + N S + + ++ + Sbjct: 61 KPTIVAAATQSAASNGLGSG--TIVVNTPPASGPNTAKKAVEVIVGQKLDR--MFTLIFT 116 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 +++ + D ++ L V + + Sbjct: 117 QDKVPEQARAVAVITDASSACMLALDPSASQAVLFSGSTSVKLTGCSVMSNSIASDAIKL 176 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + L + +C + A + L Sbjct: 177 QGAAGLQADCLISAGGVSLSNPVTTVCAAPITQALPAGDPFVDLPAPVAATPCLNDNKAT 236 Query: 254 MGLIGYTTRVEKNIEPS 270 +G Y + + + Sbjct: 237 LGPGTYCSGMNLKGNVT 253 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 60/178 (33%), Gaps = 25/178 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--DSTPAMK 299 ++ ++ ++ D + +I + + ++K R + ++D L +S M+ Sbjct: 70 LNFINGLRGDSQIAIILFDGGTSVAMPLVPVSQKTRDALVTNLDKLTYTGRFRNSAAGME 129 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-------SNVNTIK 352 +A L +K II LT+G S Sbjct: 130 RALYELK----------------NFGRPEAEKAIILLTNGPIETGDEKRDRDFSRWMQEY 173 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +A E IK+ I+ + + L ++ +Y A L F I +++ Sbjct: 174 LAHEAAEAGIKVFGIAFTEAADFHLLQILAHTTGGTYYRAPQAVDLQSAFNRIRKVIT 231 >gi|114576315|ref|XP_515709.2| PREDICTED: matrilin-3 [Pan troglodytes] Length = 486 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--V 276 + + + +K + +D++ D + ++ Y + V+ + T+K + Sbjct: 90 SSRSVRPLEFTKVKTFVSRIIDTL---DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSL 146 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A G + PS K I + Sbjct: 147 KQAVGRITPLST--GTMSGLAIQTAMD----------EAFTVEAGAREPSSNIPKVAIIV 194 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 195 TDGRPQDQVN-----EVVARAQASGIELYAVGVDRADMES--LKMMASEPLEEHVFYVET 247 Query: 395 ADSLIH 400 + Sbjct: 248 YGVIEK 253 >gi|116283392|gb|AAH14765.1| Itga1 protein [Mus musculus] Length = 685 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 67/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 188 SVTAFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 295 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 351 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 352 VKALGERIFAL 362 >gi|329901495|ref|ZP_08272833.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] gi|327549103|gb|EGF33703.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] Length = 1132 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/326 (9%), Positives = 77/326 (23%), Gaps = 24/326 (7%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 A +K++ + S N+ L +S Y TS Sbjct: 168 ASSTVKSNCTGIAGAGLYAALAVSGSAYMQIGAASDDANINDVLYASGQPGVYINYGNTS 227 Query: 154 YDYRLQFIEHLLNQRYNQKIVS--FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 L N +++ G + + + Sbjct: 228 PASPYPPSYSLANYNNGSVLITYPSSVPSANRATGPTNAGYVPYSPQVMYAQRGFGYGGS 287 Query: 212 EDVNSA----PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 + +S P+ +++ A+ FL + ++ + + + + Sbjct: 288 QSPSSGNVLVPMTSAGTVPTASSVTTAVNTFLPPLAPETNTLQTTEIKAV--AGQAPTAG 345 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + ++ + + + + + Sbjct: 346 LLAKAQSYLASVLSTANCAPKQYVVLISDGLPT-----QDLAGKYWPPLGSAAAAGYGLT 400 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN------GQRLLKT 381 K L N + +TI K K I I + A + L+ Sbjct: 401 ASFKADGSLN---VTNDTALTDTITNLAKLKAAGINTYVIGLGAGVDPSVNPQAAATLQA 457 Query: 382 CVSSPE--YHYNVVNADSLIHVFQNI 405 + +Y + +L+ +I Sbjct: 458 MAIAGGTANYYPASSPAALVDGLNSI 483 >gi|197124353|ref|YP_002136304.1| von Willebrand factor A [Anaeromyxobacter sp. K] gi|196174202|gb|ACG75175.1| von Willebrand factor type A [Anaeromyxobacter sp. K] Length = 480 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/255 (12%), Positives = 72/255 (28%), Gaps = 49/255 (19%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +VS + P+ +I V+D+SGSMH K Sbjct: 18 KDAHLVVSLVAPHGNARAERSPVCVIP-VLDVSGSMH--------------------GEK 56 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDS 286 + +++ +D + + G++ ++T VE P+ T+ + + Sbjct: 57 LHFATQSIMKLVDHLAPGD------FCGVVVFSTEVETLAAPTEMTQDRKDALKVALGRL 110 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ + V +I TDG N + Sbjct: 111 RPRHNTNLAGGLLAGLD---------------HAKVTKVPDGMPVRVILFTDGLANEGPA 155 Query: 347 NVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +C + N + + + L + + V + + + F Sbjct: 156 TS-PEGLCALLEANLGTASVSAFGYGDDADQELLRELSTLGRGNYAYVRSPEDALTAF-- 212 Query: 405 ISQLMVHRKYSVILK 419 ++ + + + Sbjct: 213 -ARELGGLLSTYAQR 226 >gi|118365082|ref|XP_001015762.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89297529|gb|EAR95517.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 755 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 80/244 (32%), Gaps = 27/244 (11%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + N+ + I L I+ G F+ + ++L + + + + Sbjct: 1 MSVNNQNKMKFNLISELKAIKPGNLRDFICGIELELENNNNNDARLPVDIICLIDNSGSM 60 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K ++ +L L ++ + L+ +++ + + ++ +Q + Sbjct: 61 AGKKAQLVRKSLKYLLKILEKGD------QISLVSFSSTAKTLCPLTQVNDENKQQIKSA 114 Query: 284 MDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T P K+ +IL S K++ T FI+ LTDGE Sbjct: 115 IKQINGQGGTFVIPGFKEVTKILNSRKEQREQT----------------FILLLTDGEFG 158 Query: 343 NFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + S I ++ I+ I T N + L + + + N + Sbjct: 159 DIDSGKVIQNINRLFTQSEIQKTPYIYTYGYGDDVNPEILQEIAQKFQGKYCLISNVQQV 218 Query: 399 IHVF 402 F Sbjct: 219 TDWF 222 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 27/152 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V Q +T + + T +K +Q + + + Sbjct: 347 VGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQST 406 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N S C + K + I TI++ Sbjct: 407 SGSE----------------IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALG- 442 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + Y + + L + F Sbjct: 443 PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 27/152 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V Q +T + + T +K +Q + + + Sbjct: 347 VGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQST 406 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N S C + K + I TI++ Sbjct: 407 SGSE----------------IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALG- 442 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + Y + + L + F Sbjct: 443 PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|74193348|dbj|BAE20643.1| unnamed protein product [Mus musculus] Length = 682 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 67/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 188 SVTAFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 295 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 351 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 352 VKALGERIFAL 362 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 65/216 (30%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 ++LV+D+SGSM T+MAA K A LD+ Sbjct: 36 EDAPKVDLVLDVSGSMRTRD--------------IDGGTRMAAAKQAFNEVLDATPEEVR 81 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R + + P D T A K A L+ Sbjct: 82 LGIRTLGADYPGDDR--------------KTGCKDTAQLYPVGPLDRTEA-KTAVATLSP 126 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + K I+ ++DGE+ + ++ + I I Sbjct: 127 TGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGLTID 184 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + ++ L ++ + +V + D L Sbjct: 185 TLGLVPNTKMRQQLSCIAEATGGTYTSVEHTDELTD 220 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 63/189 (33%), Gaps = 9/189 (4%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 L ++ E I L++D SGSM+ +D N +K Sbjct: 480 SGDEIEFTYKISAEDLQIEDINEEVKKDIVLILDTSGSMNFNFYNDSIPYN-------EK 532 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD 283 ++ +LK + F++ + +++ + Y+ + + + Y Sbjct: 533 DKRIYSLKQSAKQFINKFNNKDNIRIGIIPYSY-YSGYANNIKQLTEINDNNKKSYENYI 591 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T+ +++A ++L + S + ++ +K + + D Sbjct: 592 DNIKVEGATNQGDGIREAGKMLLNTDGNSKKYVILITDGEATAITIEKPNLIINDSAYYW 651 Query: 344 FKSNVNTIK 352 + ++ Sbjct: 652 ANNVLSFRN 660 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 383 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYM 442 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 443 SGGTATGDAIAFTVRNVF---------------GPIRDSPNKNFLVIVTDGQSYDD---- 483 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + L+ S P + + L + ++ Sbjct: 484 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LRDMASKPKESHAFFTREFTGLEPIVSDV 539 Query: 406 SQLMV 410 + + Sbjct: 540 IRGIC 544 >gi|332970145|gb|EGK09139.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 485 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 68/248 (27%), Gaps = 55/248 (22%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 +P + I +++D SGSM + Sbjct: 159 DGPGGIRALKVPESQEVN--------IMILLDSSGSMADKVKGGV--------------- 195 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--------TTRVEKNIEPSWGTEKVRQ 278 KM K A+ F ++ ++V VY +E+ + + Sbjct: 196 KMDLAKKAVKEFASNMPEGANVSLVVYGHKGSNAGADKKVSCESIEEIYPLAAYDG--KT 253 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T +MK A + L Q + ++D Sbjct: 254 FQGSLDKFQPTGWTPLAGSMKLAQEKL----------------ASHTGSNVQNIVYVVSD 297 Query: 339 GENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNA 395 G + + +K ++ E+ + I + GQ+ LK + + V + Sbjct: 298 GVETC---DGDPVKAAEELNESNMKAVVNIIGFDVDNEGQKALKEVADAGGGTYKTVGSK 354 Query: 396 DSLIHVFQ 403 L F+ Sbjct: 355 VGLQEYFE 362 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 50/186 (26%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + V+ ++ + + T+ A++ + Q+L + + Sbjct: 133 DRVGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEVMPQGGTNVGAALRLSRQVLENADR 192 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S + ++ LTDGE+ + + K++ ++++ + + Sbjct: 193 GSKE----------------RVVVLLTDGEDLVGDV----AEATEALKDSGVQVLAVGVG 232 Query: 371 ASP-----------------------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + L ++ Y ++ V Sbjct: 233 SESGEPIPVFDRRGAFVDYKKDAAGETVITRLDRAGLTAIAEATGGTFYFQPRGVAMSQV 292 Query: 402 FQNISQ 407 + I Q Sbjct: 293 VERIDQ 298 >gi|50950211|ref|NP_001002980.1| collagen alpha-1(VII) chain [Canis lupus familiaris] gi|33149359|gb|AAO64414.1| type VII collagen [Canis lupus familiaris] Length = 2936 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|3850207|gb|AAC72024.1| alpha-1 type VII collagen non-collagenous domain [Canis lupus familiaris] Length = 1253 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 46/171 (26%), Gaps = 23/171 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AY 302 + + V + Y+ S T + A+ A Sbjct: 69 SGAASAQGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVAD 128 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K I +TDG++ + + K + Sbjct: 129 HVFLPQL---------------ARPGVPKVCILITDGKSQD-----LVDTAAQRLKGQGV 168 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 K+ + I + + + L + + ++ + V + L + +S+ + Sbjct: 169 KLFAVGI-KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 32/192 (16%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWG 272 S + T++ + A L+L + ++++ +GL+ + + Sbjct: 314 DKSGSMRLGSPITRLTLMNQAAELYL-----IQIIEKESLVGLVTFDSTATIQTNLIRII 368 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T +K+ ++ +TS + + + Sbjct: 369 NDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------------- 412 Query: 333 IIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHY 390 I+ LTDGE+N S C + K + I TI++ + L + Y Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALG-PSAARELETLSDMTGGLRFY 464 Query: 391 NVVNADSLIHVF 402 + + LI F Sbjct: 465 AKEDVNGLIDAF 476 >gi|58429543|gb|AAW78175.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 60/186 (32%), Gaps = 25/186 (13%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + + +A+ L + I L+ +++ + ++ + I Sbjct: 44 NDQVDLYLLMDCSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRL 103 Query: 270 SWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 K ++ + +L T+ T A+ Q + L I Sbjct: 104 HSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDR---------------I 148 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTC 382 + ++ LTDG ++ + + +K K + +KI I N R L C Sbjct: 149 NRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGC 205 Query: 383 VSSPEY 388 S Sbjct: 206 HPSDGK 211 >gi|327272010|ref|XP_003220779.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Anolis carolinensis] Length = 919 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 62/222 (27%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 289 NILFVIDVSGSMW--------------------GLKMRQTVEAMKTILDDLRSNDQFSVL 328 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + R + TE ++Y+ T+ A+ +A IL Sbjct: 329 DFNHNV-RCWRDSLVQASNAQTEAAKKYIEGI---HPNGGTNINDALLRAIFILKEASN- 383 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + I+ ++DG+ K + ++ I + + I Sbjct: 384 ----------MGMLDPSSTSMIVLVSDGDPTVGELKLPTIQKNVKKNIQD-DISLFCLGI 432 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + V + + F +S Sbjct: 433 GFDVDYDFLKRLAQENNGMAHRVFGNQETSSQMRKFFNQVST 474 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 69/205 (33%), Gaps = 37/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTR 282 ++ + A FL + V+ ++G++ + + + R + Sbjct: 319 SHNRLNRMNQAAKQFL-----MQTVENGSWVGMVRFNSTATILNKLIQIISSNERNTLLE 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T +K A+Q++ + + I+ LTDGE+N Sbjct: 374 KLPTRAQGGTSICSGIKSAFQVIGELYSQLDGSE----------------IVLLTDGEDN 417 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---L 398 T C D+ K++ + I++ + + +++ + H + L Sbjct: 418 -------TASSCIDEVKQSGAIVHFIALGKDAD-KAVIEMSNITGGSHLYASDEAENNGL 469 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F ++ + + + KG Sbjct: 470 IDAFGALTSGNAGISQKSLQLESKG 494 >gi|223933146|ref|ZP_03625138.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|330833239|ref|YP_004402064.1| LPXTG-motif cell wall anchor domain-containing protein [Streptococcus suis ST3] gi|223898207|gb|EEF64576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|329307462|gb|AEB81878.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis ST3] Length = 997 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 83/307 (27%), Gaps = 28/307 (9%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI------ 183 + + ++ + + R+ ++ N + Sbjct: 402 SFSGFRSVTNATVSNSSVDQAHITQSKRIDYLGDGGNNPDTSVDDRIGTDTHELYRMYLD 461 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 G + F +VVD S SM MNS + + T L + Sbjct: 462 MTGTKEPFDTLIVVDRSTSMTDPMNSVDTQARYLAVYKALNGT------AGRQGLLSKLV 515 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQA 301 + +G GY + + + R + T+ T ++ A Sbjct: 516 GFHPENQVAIVGFQGYPGYPSGDQDSTVIANWGRSTSVALSNIQPPYNNGTNYTAGLRTA 575 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQ--KFIIFLTDGENNNFKSNVNTIKICDKA-K 358 +L ++ F ++ G N + + K Sbjct: 576 GVVLDQNQSSRKKVMIFISDGVPTFAFVNGVRYGNGTISGNNPYYTRDWTLNYFNSWIGK 635 Query: 359 ENFIKIVTISINA--------SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I T+ I++ S N L + ++ +V ++ +L Q I Sbjct: 636 YPKLPIYTLGISSEFGNSDNLSANPYVLNHMSSQTGGFYSHVADSQALERTLQKIVDD-- 693 Query: 411 HRKYSVI 417 K S++ Sbjct: 694 -TKLSLV 699 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 17/184 (9%), Positives = 58/184 (31%), Gaps = 19/184 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + +G++ Y++ ++ + + V + Sbjct: 17 ADEFEKVKTFLNSIVGHFDIGPTATQVGVVQYSSSPQQEFALNAHSSLVSLQQAITNIII 76 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I + T++ A+ A + + K ++ +TDG S+ Sbjct: 77 IGRGTNTGSALTFARDVALTAANG-------------ARPGLPKIVVTMTDGA-----SS 118 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + + + + I + + + ++ + S + + + D+L ++ +S Sbjct: 119 EDVLTPSQNLRNDGVITFAIGVTSRASDWQV-EEIAGSLDRVFTASDFDALDNIKVTLSS 177 Query: 408 LMVH 411 + Sbjct: 178 QLCE 181 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/106 (16%), Positives = 38/106 (35%), Gaps = 5/106 (4%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + F + ++ +I +TDGE+++ +T + + I I Sbjct: 1 AKALKHANALFTEEHGSRIKQNVKQMLIVITDGESHDHDQLNDT---ALELRNKGITIFA 57 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + Q+ L+ + V N D L VF + + M Sbjct: 58 VGVG--KANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 101 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 35/252 (13%) Query: 164 LLNQRYNQKIVSFIPALLRIE--MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + N V L + + ER V L + + + V++ Sbjct: 71 MAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSN----- 125 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + D ID + + +G+ + + + IE K + Sbjct: 126 -------SDFVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQ 178 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S A+K+ + + + ++ +T G+ Sbjct: 179 IQNVSKSGGFPRIDFALKKVSNMFN------------LHAGGRRNAGVPQTLVVITSGDP 226 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 ++ K+ I ++ + I + LL + E + D L Sbjct: 227 RYDVAD-----AVKTLKDLGICVLVLGIG-DVYKEHLLPI-TGNSEKIITFQDFDKLKNV 279 Query: 400 HVFQNISQLMVH 411 V + I + + Sbjct: 280 DVKKRIIREICQ 291 >gi|48734898|gb|AAH71224.1| Matrilin 3 [Mus musculus] Length = 481 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 66/186 (35%), Gaps = 26/186 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + + + +K + +D++ D + ++ Y + V+ + + + + Sbjct: 85 SSRSVRPLEFTKVKTFVSRIIDTLG---IGATDTRVAVVNYASTVKIEFQLNTYSDKQAL 141 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 +Q V R T S A++ A + G + P K I + Sbjct: 142 KQAVARITPLST--GTMSGLAIQTAME----------EAFTVEAGARGPMSNIPKVAIIV 189 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVN 394 TDG + + ++ +A+ + I++ + ++ + LK S P E+ + V Sbjct: 190 TDGRPQDQVN-----EVAARARASGIELYAVGVDRADMES--LKMMASKPLEEHVFYVET 242 Query: 395 ADSLIH 400 + Sbjct: 243 YGVIEK 248 >gi|66805993|ref|XP_636718.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] gi|60465117|gb|EAL63216.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] Length = 549 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 73/222 (32%), Gaps = 24/222 (10%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V+D+S SM + + + + + F +D + V Sbjct: 124 MIVLDVSCSMTAVAYNGSTKPGLLEMNRSEVSQAL---------FQTMLDKYVSFEVPVV 174 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 GL+ + +++ E S + Q + + + T A+ A + + Sbjct: 175 AGLVLFGQKIDTAFEISKNFDSFSQELGEVVANQ--GSTRLYEAIYHAANEIEKYRNN-- 230 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + + I LTDG++ SN+N + K I + I I Sbjct: 231 -------PKEKLAPDVCCRIFLLTDGQDT---SNINPYNVYQYLKPLNIILDAIPIGRDD 280 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 N L ++ + + + +F+ + L+ ++ S Sbjct: 281 NS-TLSTLTKATGGSCFMANSTQEGVELFEREALLIPTQRDS 321 >gi|224046671|ref|XP_002199336.1| PREDICTED: collagen, type XIV, alpha 1 (undulin) [Taeniopygia guttata] Length = 1883 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 53/184 (28%), Gaps = 23/184 (12%) Query: 229 AALKNALLLFLDSIDLLS-HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ L + + +I +T + + K + Sbjct: 1052 DNFNKIINFLYSTVGALDKIGPDGTQVAIIQFTDDPRTEFKLNAYKTKETLLEAIQHIAY 1111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+K A + L + + K ++ +TDG + + + Sbjct: 1112 KGGNTKTGKAIKHAREFLFTGEAGM-------------RRGIPKVLVVITDGRSQDDVN- 1157 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ + + + I + + + + S + + V + F I Sbjct: 1158 ----KVSREMQLDGFTFFAIGVADADYSELVNIGSKPSERHVFFVDDF----DAFTKIED 1209 Query: 408 LMVH 411 ++ Sbjct: 1210 ELIT 1213 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 24/174 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + K T + A+ Sbjct: 185 VSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 IL + K K I +TDG++ + ++ Sbjct: 244 --ILENSFK----------PEAGARPGVSKIGILITDGKSQDDVIPP-----AKNLRDAG 286 Query: 362 IKIVTISI-NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ I + NA N LK S P+ + YNV + + + + + +++ + R Sbjct: 287 IELFAIGVKNADINE---LKEIASEPDSTHVYNVADFNFMHTIVEGLTRTVCSR 337 >gi|193782688|ref|NP_436104.2| hypothetical protein SMa1579 [Sinorhizobium meliloti 1021] gi|193073144|gb|AAK65516.2| conserved hypothetical protein [Sinorhizobium meliloti 1021] Length = 431 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 16/188 (8%), Positives = 48/188 (25%), Gaps = 6/188 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + ++ + +E ++I A++ + +G + W+ +K ++ A + + A A Sbjct: 16 RRQRLARRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAA 75 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + + G + ++ Sbjct: 76 VRHRAGDQQSALETAARRVAGGSGFSPGGLTVSTA------PGSAGGSNKVTVELTETHP 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 R+ + + + + L V + Sbjct: 130 RMFSSVFGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCSVVSNS 189 Query: 186 GERPIFLI 193 FL+ Sbjct: 190 NAADAFLM 197 >gi|119512060|ref|ZP_01631154.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463286|gb|EAW44229.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 418 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 71/256 (27%), Gaps = 53/256 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 +QR +S I + + + L++D SGSMH Sbjct: 20 SSQRQLAVSISAIAEIQDRNI----PLNLCLILDHSGSMH-------------------- 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 + +K A + +D + + ++ + R + ++++ + Sbjct: 56 GLPLETVKQAAIGLVDKLKPGD------RLSVVAFDHRATVLVPNQTITNPGQIKKQINS 109 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T ++ + L KK + F LTDGE N Sbjct: 110 LT---ADGGTAIDEGLRLGIEELAKGKKETVSQAF-----------------LLTDGE-N 148 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 N +K A + + T+ N L K + ++ + F Sbjct: 149 EHGDNQRCLKFAQLATGYNLTLNTLGFGDKWNQDVLEKIADAGLGSLSHIQKPEQAADEF 208 Query: 403 QNISQLMVHRKYSVIL 418 + + + Sbjct: 209 NRLFSRVQTVGLTNAY 224 >gi|226308012|ref|YP_002767972.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] gi|226187129|dbj|BAH35233.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] Length = 551 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/352 (11%), Positives = 90/352 (25%), Gaps = 43/352 (12%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 +A+ + + + L +S + + + + S Sbjct: 230 QTVDASGGTAVTTEQTFEAYLAAHADSRLTASVPSSGSVFMDYPLASLGADSSAADDAGK 289 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++ R + + D + V L+ + Sbjct: 290 LLAAALASDTGRSVLAGLGFRGADQAPPVAGGVGKV-----------GSLVPSDPSVVTK 338 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + VVD+SGSM + +A R A + Sbjct: 339 ALARYEILSRPSRALA-----VVDVSGSMDYMQDGVTRMAATAQAGDIAIRMFPANAQLG 393 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L F + + +E + + T + K+ + S++ T Sbjct: 394 LWAFSVDLGDGTDYRELEPVARMDATEGG------TDHRSKLLSRIDNLS-SIVGGGTGL 446 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ AY+ + +I LTDG N++ + Sbjct: 447 YDSVLAAYRSMQQSY----------------DPASINSVILLTDGANDDPSGIALQELLD 490 Query: 355 DKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +E + I+TI + + L + + + + VF Sbjct: 491 TLTREQDPMRPVPIITIGVTDDADTDVLEQISTLTGGNSHFAPTPADIPKVF 542 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 65/192 (33%), Gaps = 29/192 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 170 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 224 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + +++ + L K +K +I +TDG+ Sbjct: 225 IVQLQ--GLTYTASGIQKVVKELFHSKNG-------------ARKSAKKILIVITDGQK- 268 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-----CVSSPEYHYNVVNADS 397 F+ + + +A++ I I + + L+ S ++ + V N + Sbjct: 269 -FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVA 327 Query: 398 LIHVFQNISQLM 409 L + + I + + Sbjct: 328 LRSIQRQIQEKI 339 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 65/192 (33%), Gaps = 29/192 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 170 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 224 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + +++ + L K +K +I +TDG+ Sbjct: 225 IVQLQ--GLTYTASGIQKVVKELFHSKNG-------------ARKSAKKILIVITDGQK- 268 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-----CVSSPEYHYNVVNADS 397 F+ + + +A++ I I + + L+ S ++ + V N + Sbjct: 269 -FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVA 327 Query: 398 LIHVFQNISQLM 409 L + + I + + Sbjct: 328 LRSIQRQIQEKI 339 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 65/192 (33%), Gaps = 29/192 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 170 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 224 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + +++ + L K +K +I +TDG+ Sbjct: 225 IVQLQ--GLTYTASGIQKVVKELFHSKNG-------------ARKSAKKILIVITDGQK- 268 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-----CVSSPEYHYNVVNADS 397 F+ + + +A++ I I + + L+ S ++ + V N + Sbjct: 269 -FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVA 327 Query: 398 LIHVFQNISQLM 409 L + + I + + Sbjct: 328 LRSIQRQIQEKI 339 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 65/192 (33%), Gaps = 29/192 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 165 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 219 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + +++ + L K +K +I +TDG+ Sbjct: 220 IVQLQ--GLTYTASGIQKVVKELFHSKNG-------------ARKSAKKILIVITDGQK- 263 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-----CVSSPEYHYNVVNADS 397 F+ + + +A++ I I + + L+ S ++ + V N + Sbjct: 264 -FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVA 322 Query: 398 LIHVFQNISQLM 409 L + + I + + Sbjct: 323 LRSIQRQIQEKI 334 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 65/192 (33%), Gaps = 29/192 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 172 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 226 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + +++ + L K +K +I +TDG+ Sbjct: 227 IVQLQ--GLTYTASGIQKVVKELFHSKNG-------------ARKSAKKILIVITDGQK- 270 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-----CVSSPEYHYNVVNADS 397 F+ + + +A++ I I + + L+ S ++ + V N + Sbjct: 271 -FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVA 329 Query: 398 LIHVFQNISQLM 409 L + + I + + Sbjct: 330 LRSIQRQIQEKI 341 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 70/248 (28%), Gaps = 29/248 (11%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLS----GSMHCAMNSDPEDVNSAPICQDKKRT 226 + +V+ + + +R ++ + + V + Sbjct: 293 RPVVTSVRPAAPLSAAKRRELPFPGTRSVADGLLAAYENELRRPSRTVYVLDTSGSMEGD 352 Query: 227 KMAALKNALLLFL--DSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVT 281 ++ LK AL D ++ + L+ + + V+ P K + Sbjct: 353 RLDRLKAALTQLAGADGAATGERFRDREEVTLMPFGSEVKAVRTHTVPEDDPGKALAAIR 412 Query: 282 RDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D T +++ AY+ L F I+ +TDGE Sbjct: 413 ADAKALTADGETAIFSSLRAAYRHLAERASALGDDRFTS-------------IVLMTDGE 459 Query: 341 NNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADS 397 N S + + + + I S G+ L+ S + ++ Sbjct: 460 NTAGDSADDFESFYRRLPGAQRTTPVFPILFGDSDRGE--LENIASLTGGRLFDATKGS- 516 Query: 398 LIHVFQNI 405 L F+ I Sbjct: 517 LDQAFEEI 524 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 69/195 (35%), Gaps = 35/195 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVR 277 ++ L A LFL L +++ ++G++ + + + E R Sbjct: 314 SGSMSSDNRLNRLNQAGKLFL-----LQTIEQGSWVGMVTFDSAAQVRSELRQIKSGTDR 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T+ + ++ T ++ A+ ++ I+ LT Sbjct: 369 EALTKSLPTVPSGGTSICSGLRVAFSVIKKKYPTDGSE-----------------IVLLT 411 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA- 395 DGE+N TIK+C D+ +++ I T+++ L + + + Sbjct: 412 DGEDN-------TIKVCFDEVRQSGAIIHTVALG-PSAALELEELSKMTGGLQTYASDQV 463 Query: 396 --DSLIHVFQNISQL 408 + LI F +S Sbjct: 464 QNNGLIDAFGALSSE 478 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 22/190 (11%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + + + + L+ + + ++ + +I+ + ++ Sbjct: 54 SGSVGRYNFEEVEIAFVENLLSQLTISPQASRVAVVSFDDVARTHIDYIKYPKNKCSFLR 113 Query: 282 RD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 I + T++ A + A ++L P ++ +I LTDG Sbjct: 114 ELKTVKYIGEWTNTEDAFRLAQELLRP------------PSAFKNERPVKQVVILLTDGR 161 Query: 341 NNNFKSNVNTIKICDKAKE-NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 +K + K +I +I I + N Q+ L+ C + ++ Y N Sbjct: 162 PTRGGD---PVKRANNLKSVYNAEIFSIGIGGNLNKQQ-LEDCATDAQHLYLSPNFVD-- 215 Query: 400 HVFQNISQLM 409 F+++++ + Sbjct: 216 --FKDLAKRI 223 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 67/183 (36%), Gaps = 26/183 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRD---MDSLILK 290 +D + + G+I Y T + I S V + + D D Sbjct: 205 LVDELASFDMT---IQFGIISYATVPKVIIPIYDENSDNDAHVFEVIENDLKYSDHKDKT 261 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A+++ Y +++ K+ + + II LTDG+ N +T Sbjct: 262 GTNIKTALEEVYNMMSFQKETYKNESVWNS--------IHHIIILLTDGKANIGGRPADT 313 Query: 351 IKICDKA------KENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVF 402 IK ++ +E+++ + T I + L + + + + +A+ + VF Sbjct: 314 IKHIEEFLDIKKKREDYLDVYTFGIGPDVDMADLSEIASKKDGESHVFRMESANEMKTVF 373 Query: 403 QNI 405 Q I Sbjct: 374 QKI 376 >gi|218189604|gb|EEC72031.1| hypothetical protein OsI_04925 [Oryza sativa Indica Group] Length = 1128 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 67/232 (28%), Gaps = 43/232 (18%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V + ER + V+D+S +M L Sbjct: 34 VLVRVVAPPPAAASSERAPIDLVAVLDVSCC-----------------GGLGPVNRMDLL 76 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ +D + + + + + G ++ + V + Sbjct: 77 KKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMVQ--SSLTVTGE 134 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+K+A IL +G K FI+ ++DG++ Sbjct: 135 NKLSTALKKAATIL--------------EGRKDHDKKRPGFIVLISDGDD---------A 171 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVF 402 + + A + + N + + + +S + + + D L F Sbjct: 172 SVLNDAMNLNCSVHAFGFRDAHNARAMHRIANTSAGTYGILNDGHDGLADAF 223 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 26/235 (11%), Positives = 73/235 (31%), Gaps = 35/235 (14%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R L+ + V + + ++ S ++ L+ A+ L + ++ Sbjct: 597 PRLPVLVRVAVPATAARRAPVDLVTLLDISCGGGGGAPARRLDLLRKAMDLVIGNLGADD 656 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQIL 305 + ++ + + V + + R +R + SL T PA+ A +IL Sbjct: 657 ------RLAIVPFHSSVVDATGLLEMSVEGRGVASRKVQSLAVAGGTKLFPALNAAVEIL 710 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + R G ++ ++DG++ I +A + Sbjct: 711 EAR---CWEAKRERVGA----------VVLISDGDD---------RTIFREAINPRYPVH 748 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADS-LIHVFQNISQLMVHRKYSVILK 419 + + + + + + + + + F R+ + ++ Sbjct: 749 AFGFRGAHDARAVHHVADHTSGVYGVLDDEHDRVTDAFAA-----CVRRVTSVVA 798 >gi|156364713|ref|XP_001626490.1| predicted protein [Nematostella vectensis] gi|156213368|gb|EDO34390.1| predicted protein [Nematostella vectensis] Length = 1043 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/176 (10%), Positives = 53/176 (30%), Gaps = 22/176 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW------GTEKVRQYVTRDMDSLI 288 L D +D ++ + + ++T V+ + + V++ + S Sbjct: 113 LKFTADLMDYYGISEDGTRISVATFSTDVDIVLSFNEFSGVEMNAASVKRGILGAKQSR- 171 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+K A + L + + +K + +T G + Sbjct: 172 GPGLRIDKALKAADKDLFNRRFGM-------------REDQKKVCLLVTSGAQTKDQGPS 218 Query: 349 NTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + I + + + L+ S+ ++ Y + + + V + Sbjct: 219 TQLGTVTALLSARGVDIYAVGVGDGVDSSE-LRNIASTEDFIYTASSFEEINKVLE 273 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 22/254 (8%), Positives = 61/254 (24%), Gaps = 27/254 (10%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + F +++Q + + S + + + + M++ Sbjct: 433 VDFS-SVVSQPSRKFLFSSAAYDGFTSVRHDMASTLCNACFVPIDVAFIMDN-------- 483 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K +K + + S + Y K E++ Sbjct: 484 --SESVSSEKYDFVKRFVKDVIMSYADAENSANFAIGQYAKYFQTGTKRFRNFRSMEELD 541 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + ++ A K +F+I L Sbjct: 542 EVINSLRQMSSSAERNVGAGLRGAANEFFQVKNGM-------------RQGLPRFLIVLA 588 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + S+ + ++ + + L+ S P + V Sbjct: 589 SANPSA--SSEAIESAVVNLDKENVRRIAVGFTEDATPG-FLRMLASDPSLMFRVDEPKK 645 Query: 398 LIHVFQNISQLMVH 411 L V ++ ++ Sbjct: 646 LDKVMMELTPMLCQ 659 >gi|34481896|emb|CAE46496.1| trap [Plasmodium falciparum] Length = 331 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNIFSNNAREIIRLHSDASKNKEKALFIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|311253435|ref|XP_001924360.2| PREDICTED: collagen alpha-1(XIV) chain [Sus scrofa] Length = 1795 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V+ + D+ K+ + + LD I + + ++ +T + + Sbjct: 1036 VDGSWSIGDENFNKITNFLYSTVGALDKIGA-----DGTQVAMVQFTDDPRTEFKLNTYK 1090 Query: 274 EKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K + S T + A+K L + + K Sbjct: 1091 TK-ETLLDAIKHISYKGGNTKTGKAIKHVRDNLFTAESGI-------------RRGIPKV 1136 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 I+ +TDG + + + KI + + + I + + + + + S + + V Sbjct: 1137 IVVITDGRSQDDVN-----KISKEMQLDGYSIFAVGVADADYSELVSIGSKPSARHVFFV 1191 Query: 393 VNADSLIHVFQNISQLMVH 411 + F+ I ++ Sbjct: 1192 DDF----DAFKKIEDELIT 1206 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 59/181 (32%), Gaps = 23/181 (12%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + FL+++ + E +GL Y+ + + K T Sbjct: 176 VRSFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTL 235 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + K I +TDG++ + + Sbjct: 236 TGLALNY-------------IFENSFKPEAGARTGVSKIGILITDGKSQDDVIPPS---- 278 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVH 411 +E+ +++ I + + L+ S P+ + YNV D + V +++++ + Sbjct: 279 -RNLRESGVELFAIGV--KNADENELREIASEPDNTHVYNVAEFDLMHTVVESLTRTVCS 335 Query: 412 R 412 R Sbjct: 336 R 336 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 44/174 (25%), Gaps = 24/174 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K S + ++ + ++V S I Sbjct: 90 KEFAKNVAASFAAKNLFTNGGTASYASFSDAASDGGTFGS-EAEFNEFVDNA--SWIEGD 146 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ + + ++L + F+I +TDG+ N Sbjct: 147 TNIEAGLSKGRELLANGTSTR-----------------TSFLILITDGDWNRGGDPQI-- 187 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D A++ + + + + LL ++ N L + I Sbjct: 188 -EADAARDEGTIVYAVGVGPDVSEATLLSI-GGDLTNVFDASNFTELDNTLDEI 239 >gi|156383823|ref|XP_001633032.1| predicted protein [Nematostella vectensis] gi|156220096|gb|EDO40969.1| predicted protein [Nematostella vectensis] Length = 182 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 60/195 (30%), Gaps = 33/195 (16%) Query: 222 DKKRTKMAAL-KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 D K K+ F D+I + + ++T + IE + Sbjct: 15 DTDFGKFQMFIKDLAEEFKDAISEGD-----TEVAAVLFSTIPKTKIEFDLDDYDHINDI 69 Query: 281 TRDMDS---LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +D+ T + A+ + + P + ++ LT Sbjct: 70 KAAVDAFSHQHGGQTRTGEALTFTLEEVFKKA---------------PRPSVKNVLVVLT 114 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNA 395 DG+ N +++ +++ I + + + LK S P+ + ++V + Sbjct: 115 DGKAQG-----NVTGPAQDVRDHGVEVFAIGVG-PHSNEAQLKDIASDPDDKHVFHVTDY 168 Query: 396 DSLIHVFQNISQLMV 410 L + I + Sbjct: 169 K-LEDITGPILDGIC 182 >gi|327266508|ref|XP_003218047.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 767 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/255 (10%), Positives = 77/255 (30%), Gaps = 22/255 (8%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + V +G + D + + K K+A Sbjct: 220 VSSKFISSLTETAESSDSDKNVSTTGKRKIKIEKDGSLNIYIVLDASRSIKK-DQFKHAQ 278 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL------ 289 + + I+ +S +I + T V++ + + +V ++ + Sbjct: 279 NMSIKLIEKISSYDISPRYAVITFATEVKELVRTTDDQSTDASWVIEKLEGMKYTEHKQK 338 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ + Y ++ + + V S + I+ L+DG+ N + Sbjct: 339 PGTNIQKGLSSVYSMMITQQAAERRRGLNPPPV---SEKTRHVIVLLSDGDYNMGGDPIR 395 Query: 350 TIKICDKA----------KENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADS 397 I+ + +E+F+ + ++ + + + K S + + + + Sbjct: 396 VIRQIREFLNIGRNRTHPREDFLDVYVFAVGGTVVMENVNKIASQKSGERHAFKIKDYSD 455 Query: 398 LIHVFQNISQLMVHR 412 L F+ + Sbjct: 456 LQLAFEEMIDESETL 470 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 57/173 (32%), Gaps = 22/173 (12%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D + +V + ++ Y + + + ++ + I T + A+ Sbjct: 370 DFIYNVMSNCNFAIVQYGSLIRTELLLLDNEDRAGSLLKVKQIKQIYNLTKTASAINH-- 427 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 K K II L+DGE + + ++ +K + + Sbjct: 428 -----------VLTDIFIPEKGSKNNTAKIIIVLSDGEIL--EDPMTLDEVLNKPQMKGV 474 Query: 363 KIVTISINA-----SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +I + + +++ P +Y+V + +L + ++ + ++ Sbjct: 475 TRYSIGVGDGILKKPNAVKEMMQIA--DPGKYYSVSSYGALNDILSSLEREII 525 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 23/218 (10%), Positives = 65/218 (29%), Gaps = 25/218 (11%) Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 E + K+ + + ++ D ++ Y Sbjct: 147 QGGRINQGPGTEIAFVLDGSGSIQDDDFQKAKDFIYNVMSNVWKTC---FDCNFAIVQYG 203 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + + ++V + I T + A+ + + S + Sbjct: 204 SLIRTELSLLDNEDRVGSLLKVKQIKQIYNLTKTASAINHVLTDIFIPENGSKDNSA--- 260 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS-----PNG 375 K II L+DG+ + ++ +K + + +I + Sbjct: 261 ----------KIIIVLSDGKILG--DPMTLDEVLNKPQMKGVTRYSIGVGDGILKNLDAT 308 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +++ P +YNV + +L + ++ + ++ K Sbjct: 309 EEMMQIA--DPGKYYNVSSYRALNDIVSSLERGIIGTK 344 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 50/383 (13%), Positives = 110/383 (28%), Gaps = 53/383 (13%) Query: 53 MESANN---AAILAGASKMVSNLSRLGDRFE-SISNHAKRALIDDAKRFIKNHIKESLSG 108 + A + A L G + + S + + S K A+ ++ +++ +LS Sbjct: 269 ISQALDIGPAGPLLGIVQFGDDPSMEFNLKTHANSRDLKAAIEKIPQKGGLSNVGRALSF 328 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + F++ N + +++ + D + + + + Sbjct: 329 VTKNFFSNANGNRGGAPNVAIVMVDGWPTDKVEEASRLARESGINIFFI-----TIEGAV 383 Query: 169 YNQKIVSFIPALLRIEMGERPIFL-----------------IELVVDLSGSMHCAMNSDP 211 N+K P + + R F ++ V D + + Sbjct: 384 ENEKQNVIEPNFVDKAVCRRNGFYSLNVLSWFSLHKIVQPLVKRVCDTNRLACSKTCLNS 443 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D+ T + L + D +G + YT Sbjct: 444 ADIGFVIDGSSSVGTG--NFRTLLQFVANLSKEFEISDTDTRIGAVQYTYEQRLEFGFDK 501 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + K + T + A+ A + L K +K Sbjct: 502 YSTKQDILNAIKRVNYWSGGTSTGAAINYALEHLF----------------KKSKPNKRK 545 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYH 389 +I +TDG + + +V + A +N + +I I + Q L+ + + ++ Sbjct: 546 LMILITDGRSYD---DVRIPAMA--AHQNGVITYSIGI--AWAAQDELEVIATHPTKDHS 598 Query: 390 YNVVNADSLIHVFQNISQLMVHR 412 + V D L I Q + Sbjct: 599 FFVDEFDDLYKSVPKIIQNICTE 621 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 31/135 (22%), Gaps = 18/135 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D L +G++ + + ++ Sbjct: 264 QFLKDISQALDIGPAGPLLGIVQFGDDPSMEFNLKTHANSRDLKAAIEKIPQKGGLSNVG 323 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + S+ + I + DG + + Sbjct: 324 RALSFVTKNFFSNANGNRGGA-------------PNVAIVMVDGWPTD-----KVEEASR 365 Query: 356 KAKENFIKIVTISIN 370 A+E+ I I I+I Sbjct: 366 LARESGINIFFITIE 380 >gi|99031847|pdb|2B2X|A Chain A, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab gi|99031850|pdb|2B2X|B Chain B, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab Length = 223 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + + D + + + +G++ Y V + + V + Sbjct: 40 SVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIVQRGGRQ 99 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 100 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 146 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 147 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 203 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 204 VKALGERIFAL 214 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/393 (8%), Positives = 81/393 (20%), Gaps = 60/393 (15%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + + + +++ +M S Sbjct: 271 VLVVDWSGSMNENNR------IGEVQKGVNRFVDTLADSGITNNININMGYVGYS----- 319 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 320 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 369 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF-TNFFRQGVKIPSLPFQKFI 333 ++ + D + + ++ F+ T F + + S + Sbjct: 370 GHKKVIVLLTDGVPT--------FSYTVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGS 421 Query: 334 IFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINASPNGQRLLKT--------- 381 D N N + N TI K+ I+I + I + + L Sbjct: 422 YNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMRE 481 Query: 382 ---CVSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 482 MVSADENGDLYYESADYAPDISDYLAKKAVQIS 514 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 70/258 (27%), Gaps = 54/258 (20%) Query: 185 MGERPIFLIELVVDLSGSMH--------CAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 RP V+D SGSM+ + +TK+ + +L Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDS 294 + + L+ + + + T ++ + + + T Sbjct: 95 GLV----SSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNF--SGGTRM 148 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M++A IL + + TDG+ + + Sbjct: 149 ALGMEEALNIL------------------KNCDLSSRRTLIFTDGQTFDESD---CRDLA 187 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV------VNADSL------IHVF 402 + E I I + + N LL + +NV + +F Sbjct: 188 TQFAEAGIPITALGVG-EYNEDLLLYLSDRTGGRVFNVVETQTHTGTTDIPISELPNTIF 246 Query: 403 QNISQL----MVHRKYSV 416 + + Q + + K ++ Sbjct: 247 EEVQQAQSEVINNLKLNI 264 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ L + + L+ Y + V K + T + Sbjct: 91 FLSEMVNSLDIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGM 150 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A + + + G + K I +TDG + ++ Sbjct: 151 AIKTAMEQVFT----------ENAGARPLKKGIGKVAIIVTDGRPQD-----KVEEVSAA 195 Query: 357 AKENFIKIVTISINASPNGQRL--LKTCVSSP--EYHYNVVNADSLIH 400 A+ + I+I + + + + LK S P ++ + V + Sbjct: 196 ARASGIEIYAVGV----DRAEMRSLKQMASQPLDDHVFYVETYGVIEK 239 >gi|332214177|ref|XP_003256207.1| PREDICTED: collagen alpha-1(XIV) chain [Nomascus leucogenys] Length = 1796 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 65/216 (30%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 ++LV+D+SGSM T+MAA K A LD+ Sbjct: 36 EDAPKVDLVLDVSGSMRTRD--------------IDGGTRMAAAKQAFNEVLDATPEEVQ 81 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R + + P D T A K A L+ Sbjct: 82 LGIRTLGADYPGDDR--------------KTGCKDTAQLYPVGPLDRTEA-KTAVATLSP 126 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + K I+ ++DGE+ + ++ + I I Sbjct: 127 TGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGLTID 184 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + +R L ++ + +V + D L Sbjct: 185 TLGLVPNTKMRRQLSCIAEATGGTYTSVEHTDELTD 220 >gi|119913152|ref|XP_616068.3| PREDICTED: integrin, alpha 1 [Bos taurus] gi|297487671|ref|XP_002696385.1| PREDICTED: integrin, alpha 1 [Bos taurus] gi|296475808|gb|DAA17923.1| integrin, alpha 1 [Bos taurus] Length = 1195 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 70/195 (35%), Gaps = 27/195 (13%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V Sbjct: 204 SVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQ 263 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 264 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNHRLNKVI 310 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I S N L +K+ S + ++ +NV + +L+ + Sbjct: 311 QDCE---DESIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 367 Query: 402 FQNISQLMVHRKYSV 416 + + + + + +V Sbjct: 368 VEALGERIFALEATV 382 >gi|119612406|gb|EAW92000.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_c [Homo sapiens] Length = 849 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 137 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 195 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 196 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 237 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 238 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 276 >gi|119612404|gb|EAW91998.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_a [Homo sapiens] Length = 865 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 137 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 195 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 196 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 237 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 238 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 276 >gi|114621486|ref|XP_001143977.1| PREDICTED: collagen, type XIV, alpha 1 isoform 2 [Pan troglodytes] Length = 1685 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 973 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1031 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1032 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1073 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1074 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1112 >gi|114621482|ref|XP_001144037.1| PREDICTED: collagen, type XIV, alpha 1 isoform 3 [Pan troglodytes] Length = 1780 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|114621484|ref|XP_001143906.1| PREDICTED: collagen, type XIV, alpha 1 isoform 1 [Pan troglodytes] Length = 1800 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|114621480|ref|XP_519927.2| PREDICTED: collagen alpha-1(XIV) chain isoform 4 [Pan troglodytes] Length = 1796 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|15779150|gb|AAH14640.1| COL14A1 protein [Homo sapiens] Length = 759 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 31 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 89 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 90 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 131 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 132 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 170 >gi|55743096|ref|NP_066933.1| collagen alpha-1(XIV) chain precursor [Homo sapiens] gi|125987815|sp|Q05707|COEA1_HUMAN RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|187954653|gb|AAI40894.1| Collagen, type XIV, alpha 1 [Homo sapiens] Length = 1796 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 27/152 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V Q +T + + T +K +Q + + + Sbjct: 347 VGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQST 406 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N S C + K + I TI++ Sbjct: 407 SGSE----------------IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALG- 442 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + Y + + L + F Sbjct: 443 PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|2065167|emb|CAA72402.1| collagen type XIV [Homo sapiens] Length = 755 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 43 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTA 101 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 102 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 143 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 144 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 182 >gi|291395335|ref|XP_002714014.1| PREDICTED: integrin alpha 2 [Oryctolagus cuniculus] Length = 1207 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/285 (14%), Positives = 79/285 (27%), Gaps = 49/285 (17%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 LQ + + + +S L +G + DVN Sbjct: 118 LNLQTSTSMPSVAEMKTNMSLGL-TLTRNVGTGGFLTCGPLWAQQCGNQYYTTGICSDVN 176 Query: 216 S-------APICQDKKRTKMAAL-----------KNALLLFLDSI-DLLSHVKEDVYMGL 256 S + + + +A+ FL+ L +GL Sbjct: 177 SDFELLSSFSPAVQTCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQVGL 236 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFF 314 I Y + K +T + T++ A++ A Sbjct: 237 IQYANNPRVVFNLNTFKTKD-GMITATSQTYQYGGDLTNTFKAIQFARD----------- 284 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 F K ++ +TDGE+++ I C+ + I I++ N Sbjct: 285 --FAYSAASGGRPGATKVMVVVTDGESHDGSMLRAVIDQCN---NDNILRFGIAVLGYLN 339 Query: 375 GQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 340 RNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 384 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 61/237 (25%), Gaps = 47/237 (19%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V+ + + + +V+D S SM Q K+ K Sbjct: 16 SVTSTMHAQENAVPSAKVGKVMIVLDGSNSMW---------------GQVDGEAKITIAK 60 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + + + + D + VY + + G + + T Sbjct: 61 DVMTDLISNWDDAVDLGLMVYGHRRKGDCSDIEVVAL-PGKVNRPALIDKVQSISPRGKT 119 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + A + + ++ ++DG Sbjct: 120 PISKTLLLAATSV-------------------GYFSGKSSVVLVSDGLETCDADP----- 155 Query: 353 ICDKAKENFIK-----IVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQ 403 C +AK I + I + + + L+ + + + NA+ L + Sbjct: 156 -CAQAKALGIINPGFDVHVIGFDVTEEEFKSLQCIATETGGKFFRANNAEELKDALR 211 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 53/186 (28%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T Sbjct: 190 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGT---- 244 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + K +K +I +TDGE+++ I + Sbjct: 245 ---------ETRTAYGIEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEQVIDASE- 294 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++ I +++ N + L K S P+ + +NV + +L + + Sbjct: 295 --KDNITRYAVAVLGYYNRRGINPEAFLSEIKYIASDPDDKHFFNVTDEAALKDIVDALG 352 Query: 407 QLMVHR 412 + Sbjct: 353 DRIFSL 358 >gi|282900972|ref|ZP_06308905.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] gi|281194063|gb|EFA69027.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] Length = 464 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/239 (12%), Positives = 72/239 (30%), Gaps = 34/239 (14%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + QK+ + ++ P V+D SGSM+ + + + Sbjct: 14 FMPADTEGQKLFVMLKLRPLKDIATSLPPTTFTFVIDTSGSMYEVVAGESKPTGVTYEQD 73 Query: 222 DKKR-------TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 K+ +K+ + +LL ++ +K+ + ++ + I + TE Sbjct: 74 GKEYREVTGGKSKIDIVIESLLALVN----SGRLKQQDRIAIVQFNDSASSIIGLTSATE 129 Query: 275 KVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T + T +++A+ IL+ + K + Sbjct: 130 IKKTKKTAINNLRNFSGGTRMGLGLRRAFDILSEQEMTV------------------KRV 171 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + TDG+ I + I + + + N L + + V Sbjct: 172 LLFTDGQT---FDEDQCQSIANDFATRNIPVTALGVGEEFNEDLLSHLSDCTGGKLFYV 227 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/225 (12%), Positives = 71/225 (31%), Gaps = 56/225 (24%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 +D++ + + +D K ++ ++ + FL+S+ +GL+ ++ + + + Sbjct: 95 QDISLSMLAEDVKPNRLVRSRHEISAFLESLT-------GDRVGLVAFSGEAQVMVPLTL 147 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V+ + ++ T+ A+++ + + S Sbjct: 148 DYGTVQMVLRELNPGWLMPGTNLESAIRKGMTLFKNSGGAS----------------QHS 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------SPNG------- 375 +I ++DGE + + E IKI TI I + NG Sbjct: 192 VMILMSDGEELEAAAVNAAKEA----AEFGIKIYTIGIGSREGVPIPLKDKNGGSVYKKD 247 Query: 376 ------------QRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQ 407 L + + ++ + L V I+ Sbjct: 248 MQGNIVTTRLEEGTLQEIANVTGALYFYASPGEFQLQKVLTEIAT 292 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/360 (10%), Positives = 89/360 (24%), Gaps = 32/360 (8%) Query: 53 MESANNA--AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 + +A A LA A++ + L ++ RA G + Sbjct: 50 LGNAAEAKRETLAEAAEPLKAARELAAPTRDMAGGKGRAGAGGLLAPAPAATAAPGEGAA 109 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 N + L + + + + + Sbjct: 110 RAAAPEAAGLDPNGRFATTYRPGGGHLAAFESAVARGIVPPDGRELVSDVGARYAPSIDV 169 Query: 171 QKIVSFIPALL--RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 K + R + + V S + A + + M Sbjct: 170 PKGSALALRTDLERTRVPPGGGPVHVRVALRSTAQAPAARPHLSVHLVLDVSGSMQGEPM 229 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE---KNIEPSWGTEKVRQYVTRDMD 285 A ++A +D +D L+ +++ + + ++ + + Sbjct: 230 ARARDAARALVDRLDANDD------FSLVTFSSGADVRIDDGPVGPRRAAIKATIDGIRE 283 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF- 344 T+ + Y + K ++ L+DG N Sbjct: 284 ---GGGTNIGQGLALGY---------------AQAARPGIPEGAVKVVLLLSDGRANAGI 325 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S+ ++ A + ++ T + A +G + +Y + + D + Sbjct: 326 TSSERLSRLALDAFQGGVQTSTFGLGADYDGALMSAIASDGAGGYYYLRDPDQIAPALAT 385 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 49/181 (27%), Gaps = 10/181 (5%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMK 299 L + GLI ++ + + V Y++ SL T+ A+ Sbjct: 121 WAKKLMNDIPGDRFGLITFSRIANIECPLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALD 180 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A + +++ S G + + N S + + E Sbjct: 181 HAQKQFKENERDSRVVVLLSDGETDGNKWRESLEALQKKKIPVNVISLGDPKREGLVLNE 240 Query: 360 NFIKI------VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 I +S+ + L + + + D L I L+ + Sbjct: 241 KGHPIRNSKGDYVMSL---SDTSTLKQIADETGGTYIPWDPEDPLNSGHSTIESLIRELE 297 Query: 414 Y 414 + Sbjct: 298 F 298 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 74/219 (33%), Gaps = 48/219 (21%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + LV+D+SGSM ++ + A+ L + + S Sbjct: 50 DVRLVIDVSGSMKRNDPNNLR-------------------QPAVDLLVQLLPEGSRAGVW 90 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + +P T + + + T+ A+++A Sbjct: 91 TFGKWVNMLVPHRDVTDPWRATAQAKASEINSVGLF----TNIGEALEKA---------- 136 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDK----AKENFIK 363 F+K II LTDG + KS + I D+ KE + Sbjct: 137 -------TFEGADGGAEFRKSIILLTDGMVDIDKSPEQNKREWRRIADEVIPRLKEAGVT 189 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + TI+++A+ + L K +++ +AD L+ +F Sbjct: 190 VHTIALSANADTNLLNKISLATGGMAEVAHSADDLMRIF 228 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 82/330 (24%), Gaps = 51/330 (15%) Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 D + ++I+ G A F + N+ + + + Sbjct: 282 NGDPENLTNSYIEYGGVGDEADFALRKFAKETNTPGLF-------DVYLNVRGNVQRQID 334 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + ++ + + + + I+ V Sbjct: 335 PIDVVLVVDWSGSMNEMGR----ITEVKKGVDRFLNQIEGSGIQDSV-------YMGYVG 383 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + + + K + +K + + GL + Sbjct: 384 YSSDGNNYQNKTCQLGKFSEVKETIRTMTP---ETAAGGTFTQRGLRQAGDMLSTQ---- 436 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 ++ + D + + ++ T + T F + + Sbjct: 437 ---NGHKKVIVLLTDGVPTYSYHVS-------KVHTQADGSYYGTAFSLSQDQPMNTSHL 486 Query: 331 KFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISIN---------ASPNGQRL 378 F +D N+ N TI KE I+I + I + + Sbjct: 487 YNGYFASDQYGNSKWINNTFVATIGEAMALKERGIEIHGLGIQLQGDQTAGYTKADVENK 546 Query: 379 LKTCVSSPE----YHYNVVNADSLIHVFQN 404 ++ VS+ E Y+ + A + + Sbjct: 547 MRQMVSADEEGHLYYESANEAADIADYLEK 576 >gi|226137|prf||1411304A thrombospondin related protein TRAP Length = 559 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYSEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|136153|sp|P16893|TRAP_PLAFA RecName: Full=Thrombospondin-related anonymous protein; Flags: Precursor gi|9978|emb|CAA31440.1| unnamed protein product [Plasmodium falciparum] Length = 559 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYSEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|227661|prf||1708291A thrombospondin related protein Length = 559 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYSEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 355 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 414 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ +TDG++ + Sbjct: 415 SGGTATGDAISFTVRNVF---------------GPVRDSPNKNFLVIVTDGQSYDD---- 455 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 456 -VRGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 511 Query: 406 SQLMV 410 + + Sbjct: 512 IRGIC 516 >gi|218438801|ref|YP_002377130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218171529|gb|ACK70262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 573 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 48/153 (31%), Gaps = 21/153 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 Y+GL+ Y R + + + + ++++ + T M L Sbjct: 430 NPGNYVGLVTYGDRAAEVVPLGLFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLME 489 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 KK + F +++ LTDG+ N + ++ + + +++ I Sbjct: 490 QKKNNPDGRF--------------YLLLLTDGQANMGVTFDEVKEVIEY---SGVRVYPI 532 Query: 368 SINASPNGQRLLKTCVSSPEYHY--NVVNADSL 398 + N + L N + L Sbjct: 533 AYG-DVNQEELEAIASLRESTVKKGTPENVEDL 564 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 64/223 (28%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +++ K L D + +GLI + + + Sbjct: 97 DVSNSMLAEDVSPDRLSKAKQMLSKLTDGF-------SNDKVGLIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ S++ + T A+ A + T + K Sbjct: 150 YVSAKMFLSSINPSMVSTQGTAIGAAINLAMRSFTPSETSD------------------K 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + +K I + + + Sbjct: 192 AIILITDGENHEDDAVKAAAAAAEK----GIHVNIVGMGDPKGSPIPVDGSNNYMKDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + + + N + + + + Sbjct: 248 NVVITKLNEEMCQEIAAAGHGTYVRADNT---NSALRALQKEI 287 >gi|91788413|ref|YP_549365.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] gi|91697638|gb|ABE44467.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] Length = 533 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 13/162 (8%), Positives = 44/162 (27%), Gaps = 21/162 (12%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES------------- 55 SE+ +IIF L+V+ + G ++ + + K +++ Sbjct: 2 KTTSGYRKSERGAVAIIFGLTVVVLFAMGGVVLDLGHLYIAKAELQNAADAAALAGAKDL 61 Query: 56 -----ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 +AA+ A+ + + ++ + + + + + Sbjct: 62 NETTPGIDAAV---ATAQTISAKNKYNFSTDVTLALANIEFGPSPDGPWSSVATARAAPQ 118 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 + + + M + + + + Sbjct: 119 GMTFIKVDTGLKVLGTYLMRVAGVDTVSTFGLAVAGRFVNNV 160 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT T K Sbjct: 467 FRTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSG 526 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + Sbjct: 527 GTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGRSYDDVRIPAM 570 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A + I + + Q L+ + P ++ + V D+L I Q Sbjct: 571 V-----AHHKGVITYAIGV--AWAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQN 623 Query: 409 MVHR 412 + Sbjct: 624 ICTE 627 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 29/329 (8%), Positives = 68/329 (20%), Gaps = 47/329 (14%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA+ +G + H + + + +S T Sbjct: 88 AAVHSGVLDNSGGKILVRKVAGQSGYHGSYSNGVWSLSLPRWRESFVVSEGKPQKGVTYP 147 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + SS T + T + + ++ Sbjct: 148 SALTYSSVPGTTAQPVTLMQVPGTTAVEATHTAL-----PKPSPSAGFTTSSLRLQPVGQ 202 Query: 179 ALLRIEMGER---------PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT--- 226 + + + S+ DP D + Sbjct: 203 RSRELGEEPDLWKPGSVLLDAGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSSIGK 262 Query: 227 ---KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVT 281 ++ D L MG++ Y + ++ + Sbjct: 263 RRFRIQK-----QFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRDLKTAIE 317 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + ++ A+ + S + + + DG Sbjct: 318 KITQR--GGLSNVGRAISFVTKNFFSKSNGNRGGA-------------PNVAVVMVDGWP 362 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + A+E+ I I I+I Sbjct: 363 TD-----KVEEASRLARESGINIFFITIE 386 >gi|282900951|ref|ZP_06308884.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] gi|281194042|gb|EFA69006.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] Length = 575 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 54/197 (27%), Gaps = 23/197 (11%) Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 D D + NA+ L + +GL+ Y + Sbjct: 390 WKTQKDTGKTVYLMAVIDTSGSMYGGPLNAVKDGLRIASQQINPGNY--VGLVTYGDQPV 447 Query: 265 KNIEPSWGTE-KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 ++ + + + ++++ + T M A L KK + F Sbjct: 448 NLVKLAPFDDLQHKRFLAAIDNLQADGATAMYDGMMVALSELVQQKKTNPNGKF------ 501 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 +++ LTDG+ N + +I + +++ I+ N L Sbjct: 502 --------YLLLLTDGQTNQGFNFEEVKEIIQY---SGVRVYPIAYG-EVNEAELNAIAA 549 Query: 384 SSPEYHY--NVVNADSL 398 N L Sbjct: 550 LRESTVKKGTPENVQEL 566 >gi|110632964|ref|YP_673172.1| TadE-like [Mesorhizobium sp. BNC1] gi|110283948|gb|ABG62007.1| TadE-like protein [Chelativorans sp. BNC1] Length = 140 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 32/87 (36%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 F++ + S ++ FA+ M LL+ ++ + +N + A + A Sbjct: 6 RLFRRFLESGSGAAAVEFAIVCMPLLLICLGIVEFGRAFFVRNDLSYAADVAARKVLIGQ 65 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAK 95 + + D + + A + DA Sbjct: 66 IPAGAPSSDAASGLETAVREAFVGDAS 92 >gi|323529406|ref|YP_004231558.1| hypothetical protein BC1001_5117 [Burkholderia sp. CCGE1001] gi|323386408|gb|ADX58498.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 353 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/277 (8%), Positives = 70/277 (25%), Gaps = 12/277 (4%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + ++ +++ AL + + + + + + +N +++A +AA LAGA Sbjct: 4 RKSHRAARERQRGAVAVMTALCLTALVGITALAVDLGRAWVVRNELQNAADAAALAGAGS 63 Query: 68 MVSNLSRLG-DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + N + + + A + + ++ + + Sbjct: 64 LGPNYKSPNWTQAAAKAQSAITLNKTEGVALVTAQVQTGYWNVKGTPAGMQALPVPAPGA 123 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 ++ S + L + ++S + Sbjct: 124 YDR---PAVQVTVSRAAGQNGGPLSLVLAPVLGITT-MPISATAVAVISAPGYAGPGALF 179 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I S + D + + K + A + Sbjct: 180 PQAISQCLY----SAYWNATTGQPVNDPATGQPQEFKIGSAYNYPGCAAGMGSGQWTSFQ 235 Query: 247 HVKEDVYMGLIGYTTRVEK---NIEPSWGTEKVRQYV 280 +DV + +W ++ + Sbjct: 236 LNVQDVTSIRNLIANGNPTPMQIGDMTWIQSGLKNSI 272 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 27/152 (17%) Query: 254 MGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + E + + V Q +T + + T +K +Q + + + Sbjct: 347 VGMVTFDSVAEIQNHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQST 406 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINA 371 + II LTDGE+N S C + K + I TI++ Sbjct: 407 SGSE----------------IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALG- 442 Query: 372 SPNGQRLLKTCVSSPEYHYNVV-NADSLIHVF 402 + L + Y + + L + F Sbjct: 443 PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 61/192 (31%), Gaps = 27/192 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K A + ++ + + ++ + R + + + Q + Sbjct: 59 LETVKQAAVGLIERLQPDD------RLSIVAFDHRAKVLVRNQPM-GNLDQIKRKINRLG 111 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +K + L K+ + + LTDGEN + + Sbjct: 112 ADGGTAIDEGLKLGVKELIKAKQDTVSQ-----------------VFLLTDGENEHGNNE 154 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQNIS 406 IK+ + A EN + I ++ A+ N L K +++ Y +L F + Sbjct: 155 SC-IKLAELAAENNLTINSLGFGANWNQDILEKIADIATGSLSYIEEPEQALSE-FARLF 212 Query: 407 QLMVHRKYSVIL 418 M + Sbjct: 213 NRMQSVGLTNAY 224 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 35/83 (42%), Gaps = 3/83 (3%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 S + +I +TDG NN ++ T + + A E + I + + S N L S+ Sbjct: 19 STDASRIMIVVTDGYNNQSGASSAT--VINNAIEENVIIYCVGVG-SVNSTVLKNISEST 75 Query: 386 PEYHYNVVNADSLIHVFQNISQL 408 +Y + L +F+NI Sbjct: 76 GGCYYYINQFSQLNGIFENIISE 98 >gi|269958194|ref|YP_003327982.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] gi|269306875|gb|ACZ32424.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] Length = 145 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 11/145 (7%), Positives = 34/145 (23%), Gaps = 7/145 (4%) Query: 7 FRFYFKKGIAS-EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R ++ E+ + S A++ ++ L +G + + A + A A Sbjct: 1 MRRLRERLARDPERGSASAWAAVTTIAMFLFVGIAVDFGG--QLHAQ-QQARDVATQAA- 56 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + + + Sbjct: 57 --RAGGQQINAPQAIRGQGVTAQPGDAYSAAASYLAGSGVSGSVQVAGARVIVDTSATYN 114 Query: 126 RISMTHMANNRLDSSNNTIFYNMDV 150 ++ + N L ++ + Sbjct: 115 TKFLSIIGINTLPATGHAEARIARA 139 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/292 (8%), Positives = 66/292 (22%), Gaps = 14/292 (4%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + ++ +I AL ++ L I + + +N ++++ +AA +AGA ++ Sbjct: 5 HTGRSGKKQEGAVAITVALCMVILLGFAALAIDIGNLLIARNELQNSADAAAMAGAGCLI 64 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + A + + S ++ + Sbjct: 65 RRTACSNTSASQPDWPTADATASAFSTSATTNQVQGTSVQTSTVATGYWNTTGTPYGLES 124 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 D + D + F+ + R S + + G Sbjct: 125 LPFTPGANDLPAVQVTIRKDGSNANGAVPIFLGRIFGARI--LKASAVATAVLSTPGSVG 182 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + +S M+ + A ++ I H Sbjct: 183 PGGL-FPLAISQCMYTNYWDSSSQSPKIAPNSGVVPGFSWPNQIAGQPYIFQIGSSYHYG 241 Query: 250 EDVYMGLIGY----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + G T Sbjct: 242 TCSSGQWTTFDVNDNSAGYAKTMLTNGNPNTLTI-------GASPGTWIQTG 286 >gi|149050644|gb|EDM02817.1| similar to vitrin (predicted) [Rattus norvegicus] Length = 427 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 53/193 (27%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 251 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRIGAVQYTYEQRLEFGFDKYNSKADVLSA 310 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 311 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMILITDGRS 354 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V D+L Sbjct: 355 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLY 407 Query: 400 HVFQNISQLMVHR 412 I + + Sbjct: 408 KFVPRIIRNICTE 420 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 63/194 (32%), Gaps = 35/194 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVR 277 ++ L A LFL L V++ ++G++ + + E + R Sbjct: 314 SGSMTVGGRLKRLNQAGKLFL-----LQTVEQGAWVGMVAFDSAAYVKSELVQINSAAER 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + R + + T ++ A+ ++ I+ LT Sbjct: 369 DALARSLPTAASGGTSICSGLRSAFTVIKKKYPTDGSE-----------------IVLLT 411 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N TI C + K+N I T+++ + L + + + Sbjct: 412 DGEDN-------TISACFPEVKQNGAIIHTVALG-PSAAKELEELSQMTGGLQTYASDQA 463 Query: 397 S---LIHVFQNISQ 407 LI F +S Sbjct: 464 ENNGLIDAFGALSS 477 >gi|304320960|ref|YP_003854603.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] gi|303299862|gb|ADM09461.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] Length = 433 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 13/208 (6%), Positives = 48/208 (23%), Gaps = 19/208 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ ++ + +++ A+ ++ + L ++ ++ A + + Sbjct: 17 LQRLRGDQRGSIAVLMAVMLVPSVGLGALIVDGSRMRTAHLEIQIVA---EAAALAAAQN 73 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFI---------KNHIKESLSGYSAVFYNTEIQNI 121 S R + + + + S Sbjct: 74 LPSVDDAREAATDYAEANLDPTKYGNVVRSTDVEFGTYDDSNGTFSVGGTTAVRVTAGRT 133 Query: 122 VNSSRISMTHMA------NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + S T + L +S + L F + V Sbjct: 134 EDRSNAFSTLFGGVIGRPSVDLTASAIAVAETSGGNPICILVLGF-GYYGLDMDGDINVD 192 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSM 203 +++ + + ++ + Sbjct: 193 IPDCGIQVNSDDDDAMNSKDDSYVNAAY 220 >gi|170743966|ref|YP_001772621.1| hypothetical protein M446_5903 [Methylobacterium sp. 4-46] gi|168198240|gb|ACA20187.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 417 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/297 (9%), Positives = 63/297 (21%), Gaps = 14/297 (4%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R F A ++ SII ++ + I + +Y K ++ +AA L Sbjct: 1 MRQRFVAFAAHKQGGASIIGLFGMLIAVGFAAVAIDSGNLYYSKLKLQKIADAAALGAVM 60 Query: 67 KMVSNLSRLGDRFE-----------SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 + ++ S + + ++S A + V Sbjct: 61 ALPTSSSVMAAALDLVSKNTPVGFGTVSTSADIQIGVYDPSSKTFTPSAIGQNAVQVTTR 120 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---LNQRYNQK 172 + + +D + +++ L + Sbjct: 121 RSSAYGNAVLTYVAGILGVSSVDMAASSVAVKYGGACVMVLEPASAGSLQTKGSSALQTN 180 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + I + P + P D + Sbjct: 181 CPIQVNSSSATAARTGGSSSITASQICVVGNYSGTGFSPWPKINCPSLVDPLANVPEPAQ 240 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + SI Y G + + V + + Sbjct: 241 PVCTVNNPSISSGVFPTNCTYSGTVSLSGNVTLQSGLYYFKSANISVTGSTSITGSG 297 >gi|54293612|ref|YP_126027.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] gi|53753444|emb|CAH14899.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] Length = 1169 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/250 (10%), Positives = 70/250 (28%), Gaps = 21/250 (8%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P++ + + + + + + + ++ Sbjct: 292 PSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVNN 351 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +++ L H+K + + + R ++ T S Sbjct: 352 AINAF--LPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTV-------GTTSGNC 402 Query: 298 MKQAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKI 353 ++ Y IL SD + + + F DG N N ++ + I Sbjct: 403 PQKQYIILISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNNTNSQALSDAINE 462 Query: 354 CDKAKENFIKIVTISINASPN------GQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K + + I I + A + L+ + +Y + ++L+ +I Sbjct: 463 IKALKNDGVLIFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPEALVSSLNSI 522 Query: 406 SQLMVHRKYS 415 + + +S Sbjct: 523 LSNIQNGSFS 532 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/367 (11%), Positives = 94/367 (25%), Gaps = 55/367 (14%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + L+ G++ + D ++I+ G A F + N+ Sbjct: 275 TDENSNVLNHQGNKDAGETWDGITTWNGDPANLTHSYIEYGGVGDEADFALRKFAKETNT 334 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + + + ++ + + Sbjct: 335 PGLF-------DVYLNVRGNVQRQIDPIDVVLVVDWSGSMNEMGR----IAEVKKGVDRF 383 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + I S M S CQ K + +K + Sbjct: 384 LNQIEGSGI----QDSVYMGYVGYSSDGSNYQNKTCQLG---KFSEVKETIRSMTP---E 433 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + GL + ++ + D + + ++ Sbjct: 434 TAAGGTFTQRGLRQAGDMLSTQ-------NGHKKVIVLLTDGVPTYSYHVS-------KV 479 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENF 361 T + T F + + F F D +NN N TI KE Sbjct: 480 HTQADGSYYGTAFSLTQDQPMNTSFLYNGYFAFDQQNNYKWINNTFIATIGEAMALKERG 539 Query: 362 IKIVTISIN---------ASPNGQRLLKTCVSSPE----YHYNVVNADSLIHVFQN---- 404 I+I + I + ++ ++ VS+ E Y+ + A + + Sbjct: 540 IEIHGLGIQLQGDQTAGYTKADVEKKMRQMVSADEDGHLYYESANEAADIADYLEKKALH 599 Query: 405 ISQLMVH 411 I+ + Sbjct: 600 IAATVTD 606 >gi|315639746|ref|ZP_07894885.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] gi|315484523|gb|EFU74980.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] Length = 1228 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/327 (6%), Positives = 69/327 (21%), Gaps = 33/327 (10%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 D + + ++ S + + + + Sbjct: 298 DGVTTWNGDASNKTNSYIEYGGTGGNANFAIRKYAKETSTPGLYDVYLNVRGNEIKDIKP 357 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + + R + I + ++ + + Sbjct: 358 IDIVLVIDMSGSMEPSRAPAVRQGIKDFF-------KYIQDAGVSQYVNVGFVGYSSINQ 410 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + + N + + + + G + Sbjct: 411 SNYVGLIQEGIQPISNTTHTNNINTKLNRTFIGGTFTQLGIKTGADM----------LTS 460 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + D + + + + S + P + Sbjct: 461 YNSTNEKMMILLTDGVPTYSYKVNESTTINGTLYGTKFNTSLDNPGDTSKLNYPYIDNSN 520 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-------S 384 I T T+ K+ ++ + I P+ L + V + Sbjct: 521 HKINDT---------FAATLGQAKITKDGGTQLHVLGIQLGPDKNYLTQNQVRAYTSLMA 571 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMVH 411 S ++ + AD + +N + +V Sbjct: 572 SDGHYLDANTADDIKTYLENQAANVVS 598 >gi|167624203|ref|YP_001674497.1| hypothetical protein Shal_2279 [Shewanella halifaxensis HAW-EB4] gi|167354225|gb|ABZ76838.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4] Length = 446 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/378 (9%), Positives = 92/378 (24%), Gaps = 38/378 (10%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN------------NAAIL 62 I + ++F + + + + + K +++A A L Sbjct: 15 IGRQSGAILVMFTIGIFAVIAFAALALDGGHMLLSKGRLQNAVDAAALNAAKELQEGATL 74 Query: 63 AGASKMV-----------SNLSRLGDRFESISNHAKRALIDDAKRFIK--NHIKESLSGY 109 A + N S + + + + Sbjct: 75 LEAREAAYTILLQNLSFTENGELNTSVSLSSPDFNNTQVTPRLQVEFSELPDPFNPILAE 134 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + + ++N+ S+ ++ + + +S + L + Sbjct: 135 GSEYVRVRVENVKLSNFLADILNFDKEIRASAVAGRSQDLACVNKILPLLVCGKEGSTAE 194 Query: 170 NQKIVSFIPALLRIEM------GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + ++ L+++ G LI L D GS S + Sbjct: 195 DNYGLADGLHLMKVGAGQPSANGSGNFQLISLDGDRGGSDLRDAFSGNYSPDQCVSEGTV 254 Query: 224 KRTKM-----AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKV 276 TK A + F D ++ +++ I T VE+ + Sbjct: 255 ATTKTGGTVGPAAQGMNTRFGDWGPAGTNSEDNPRDFNICEGTPVTVEEVQAMDDDENLL 314 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + + P A + + + + + ++ + Sbjct: 315 FDENGKPVMQTAIAPDSIAGAYRWSDYDAANHSDSPQNCETTDDPNAMELRRELEVVVGI 374 Query: 337 TDGENNNFKSNVNTIKIC 354 D + N C Sbjct: 375 CDPDANGTYDITILGTAC 392 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/195 (9%), Positives = 52/195 (26%), Gaps = 28/195 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 ++ L +++ +G++ + + + + Sbjct: 307 ISGSMSSNNRIEKLGQVASIYILLTADDDD-----ELGMVVFNDQPSTRSQMVTISESTR 361 Query: 277 RQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + T + +A +L + + II Sbjct: 362 LDLLELIPTRDDIGDATGIGSGLSEAIDVLENG----------------GNDAAGGCIIL 405 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 ++DGE N + + + + TI++ + + + Y N Sbjct: 406 VSDGEENRSPYIDDVQS---TIVDKGVCVHTIALGVDASHNMEQLPLATDGKSFYYSENP 462 Query: 396 --DSLIHVFQNISQL 408 ++L F I++ Sbjct: 463 YSNALNEAFITIAKQ 477 >gi|288925757|ref|ZP_06419688.1| BatB protein [Prevotella buccae D17] gi|288337412|gb|EFC75767.1| BatB protein [Prevotella buccae D17] Length = 342 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 64/210 (30%), Gaps = 54/210 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D + +GLI + + + Sbjct: 97 DISNSMLAQDVAPSRLDKSKLLVESLVDRFT-------NDKIGLIVFAGDAYVQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A+ T + Sbjct: 150 YVSAKMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQA------------------DKIGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + + ++ + + + + +GQ Sbjct: 192 AIIVITDGEDHEGGAVEAAAEA----RKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADS 397 ++ + + + +V N Sbjct: 248 TVMTALNEQMCREVAQAGSGKYIHVDNTGD 277 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 32/192 (16%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWG 272 S + T++ + A L+L + ++++ +GL+ + + Sbjct: 314 DKSGSMRLGSPITRLTLMNQAAELYL-----IQIIEKESLVGLVTFDSTATIQTNLIRII 368 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T +K+ ++ +TS + + + Sbjct: 369 NDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------------- 412 Query: 333 IIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHY 390 I+ LTDGE+N S C + K + I TI++ + L + Y Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALG-PSAARELETLSDMTGGLRFY 464 Query: 391 NVVNADSLIHVF 402 + + LI F Sbjct: 465 AKEDVNGLIDAF 476 >gi|74214868|dbj|BAE33446.1| unnamed protein product [Mus musculus] Length = 1169 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 52/167 (31%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + T + A+K L + Sbjct: 184 RPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLGSVRQLRGYTYTASAIKHVITELFTT 243 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTI 367 + K +I +TDG S + I + + A I I Sbjct: 244 QSG-------------ARQDATKVLIVITDGRKQGDNLSYDSVIPMAEAAS---IIRYAI 287 Query: 368 SIN---ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + ++ LK S P + ++V N D+L + + + + Sbjct: 288 GVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDTLKDIENQLKEKI 334 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/262 (7%), Positives = 59/262 (22%), Gaps = 11/262 (4%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++ A + + G + W K ++ ++ A Sbjct: 2 FGGSMGRLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQADS----AAL 57 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +++ +S ++ R + + + Y+ + Sbjct: 58 AGAYAVAQGFSASDSATSDINRLSLITLSQTPTIENAPTTGSYAGNNKAVRVVLQTTKEL 117 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI-------PA 179 + + L+ V+ A Sbjct: 118 PFSKILGIGAPVILGEATAAIVGSGDYCVVSLEKSSATGITLQGNATVNLGCGMATNSRA 177 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + G V + G + + + I + A L Sbjct: 178 SNAVYAGGSSTVTATPVAAVGGLTSSSNYATGTTLLPYSIPVQDPYASLPTPTAANLTGC 237 Query: 240 DSIDLLSHVKEDVYMGLIGYTT 261 ++++ + ++ G Sbjct: 238 SNLNVQPNNSRNITAGCYKNID 259 >gi|126334034|ref|XP_001370526.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 1247 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 69/204 (33%), Gaps = 34/204 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG--TEKVRQYVTRDMD 285 +K+ + ++ + L+ Y+ R++ + + + V + Sbjct: 219 FQQMKSFVRAVINQFKET-----NTLFSLMQYSNRLKIHFTFADFQRSTNWGNLVNPILQ 273 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+++ L + K +I +TDGE ++ Sbjct: 274 L--RGLTYTATAIRKVVTELFQSRNG-------------ARKNATKILIVITDGEK--YQ 316 Query: 346 SNVNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + + +A++ I I + + + Q L S P E+ + V N +L Sbjct: 317 DKLQYKDVIPEAEQAGIIRYAIGVGDAFEYASAQEELNIIASQPAKEHVFQVDNFSALKT 376 Query: 401 VFQNISQLM-----VHRKYSVILK 419 + +++ + + K S+ + Sbjct: 377 IQEDLQEKIFSIEGTQSKSSISFR 400 >gi|311273682|ref|XP_003133985.1| PREDICTED: integrin alpha-1-like, partial [Sus scrofa] Length = 506 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 68/191 (35%), Gaps = 27/191 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 210 SVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANEIVQRGGRQ 269 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 270 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNHRLNKVI 316 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I S N L +K+ S + ++ +NV + +L+ + Sbjct: 317 QDCE---DESIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 373 Query: 402 FQNISQLMVHR 412 + + + + Sbjct: 374 VEALGERIFAL 384 >gi|189220466|ref|YP_001941106.1| hypothetical protein Minf_2455 [Methylacidiphilum infernorum V4] gi|189187324|gb|ACD84509.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 340 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 70/221 (31%), Gaps = 54/221 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D +++ K F+ + +GL+ +T S Sbjct: 97 DVSKSMLAEDTVPSRLERAKLLATNFISKL-------HGERVGLVAFTKNAFIEAPLSTD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + + ++ D T+ + +A Q +S + +K Sbjct: 150 YELLEEILSELSPDDFPNGGTNFAAMLDEALQFFSSSGRS------------------KK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I L+DGE++ + K+ I +++I I +S Sbjct: 192 MLILLSDGEDHGGGWQQRLVDF----KKESIPVLSIGIGSSNGAVIRNSNGSLYKDYNGE 247 Query: 374 ------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 N L S+ + + V +++++ Sbjct: 248 PIVSIFNPAALELIAHSTGGLYIQADKYFDITTVVESLAKN 288 >gi|317122048|ref|YP_004102051.1| hypothetical protein Tmar_1211 [Thermaerobacter marianensis DSM 12885] gi|315592028|gb|ADU51324.1| Protein of unknown function DUF2134, membrane [Thermaerobacter marianensis DSM 12885] Length = 344 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 7/122 (5%), Positives = 33/122 (27%), Gaps = 12/122 (9%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ N +++ A+ + + + + G ++ + ++ ++ + G Sbjct: 55 QEGNVAVLVAICLAALMAMAGLVVDGGRLYTERARLQ------------AAADVAALAGA 102 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 A R + D ++ + ++ + + Sbjct: 103 TELPEDPAAARQVALDYLARNGVPVQHAQVEVDGPRTRIHVRASAAVPLGLARVLGPQEV 162 Query: 138 DS 139 Sbjct: 163 TV 164 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 52/147 (35%), Gaps = 18/147 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 I+ ++ DV +GL+ Y+T V N+ + + +V T + A+ Sbjct: 416 INGSKYISSDVSIGLVSYSTDVNINLPIAKFDLNQRSLFVGAVESLAAGGNTATFDAIIV 475 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L +K + + + + L+DG N S + + K Sbjct: 476 ATKMLKEEKAK--------------NPNAKLMLFVLSDGVTNYGHSLNDIKDM---MKTF 518 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPE 387 I I TI NA+ L + Sbjct: 519 GIPIYTIGYNANIKALETLSQINEAAN 545 >gi|58429499|gb|AAW78153.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRNNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|56696618|ref|YP_166979.1| hypothetical protein SPO1741 [Ruegeria pomeroyi DSS-3] gi|56678355|gb|AAV95021.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 177 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 6/55 (10%), Positives = 20/55 (36%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 R + ++ E N ++ F + + + ++ + + +E A + Sbjct: 3 RRLSNFLRRFRRGEDGNATVEFVIVFPAVIGVMLAGVELTFMTLNHAMLERAVDV 57 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 40/108 (37%), Gaps = 1/108 (0%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV-SN 71 K S + N S++ AL + + +I I + ++S +AA LA A +M S+ Sbjct: 5 KFCKSTQGNVSVMAALFIGLLIAVIAGTIDISQKSSLNRELQSVVDAAALAAAREMAVSS 64 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 + + + S + A + + + S+ T Sbjct: 65 ADQTRVQSVASSYVDANWTGEQATTHAVLDVSKGIITVSSTAPKTIAS 112 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 55/161 (34%), Gaps = 18/161 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQ 300 + ++ ++ Y+GL+ Y+ V + + R Y + + T S A+ Sbjct: 399 TNGAQYINDNNYVGLVSYSNSVTIEVPIAQFDLNQRSYFQGAVNNLIASGGTASYDAVVV 458 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +++T K + + L+DG NN S +I + + Sbjct: 459 AVKMITE--------------AKAQHPDAKCMLFLLSDGYANNGYS---MDEITSALRTS 501 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + TI + L + + N + D + + Sbjct: 502 GIPVYTIGYGDDADTGELARLSGINEAASINADSDDIIYKI 542 >gi|307726382|ref|YP_003909595.1| hypothetical protein BC1003_4370 [Burkholderia sp. CCGE1003] gi|307586907|gb|ADN60304.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 350 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 17/183 (9%), Positives = 54/183 (29%), Gaps = 5/183 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ ++ +++ AL + + + + +N +++A +AA LAGA + N + Sbjct: 11 LSRQRGAVAVMTALCLTVLIGTTALAVDLGRAWVVRNELQNAADAAALAGAGSLGPNYAS 70 Query: 75 LG-DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + A + + ++ + + + + Sbjct: 71 PNWTQAQTKAQSAITLNRTEGTSLLTAQVQTGYWNVTGTPAGMQALPVPSPGAYD---KP 127 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 ++ S + L + ++S + I L Sbjct: 128 AVQVTVSRAAGQNGGPLSLVLAPVLGIST-MPISATAVAVISAPGYAGPGALFPTAISLC 186 Query: 194 ELV 196 Sbjct: 187 LYT 189 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A + P + +VD+SGSM K Sbjct: 205 NTDTLLLRIGVAGRDVPTAALPPANLVFLVDVSGSM-------------------GAPDK 245 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 246 LPLLQSSLKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTR-IVEAIDSLQ 298 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 299 SGGGTAGASGIELAYKAAQQAYLRGGIN----------------RILLATDGDFNVGVTD 342 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 343 FDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHE 402 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 403 LGSTLATIARDV 414 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 11/147 (7%), Positives = 44/147 (29%), Gaps = 17/147 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS---- 70 E+ +F+++ A+ ++ + G + V + + ++ ++ + A AG ++ + Sbjct: 21 RRRERGSFAVMAAVFLVVIAAIFGV-LDVGNTYLQRRDLQQIADMAAAAGVQRVDNLCVQ 79 Query: 71 NLSRLGDRFESISNHAKRALI------------DDAKRFIKNHIKESLSGYSAVFYNTEI 118 + + + + + A + + S ++ Sbjct: 80 APTSATNSATVNGLNTSQGDTIAVTCGRWDPTVNPAPSYYLANTNTSGDPNRLQLNAVQV 139 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIF 145 + + + S+ Sbjct: 140 NVTRQVRHMFVGPLQTVHATSTARATA 166 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 65/194 (33%), Gaps = 35/194 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVR 277 ++ L A LFL L V++ ++G++ + + E R Sbjct: 314 SGSMSNGDRLKRLNQAGKLFL-----LQIVEQGSWVGMVTFDSAAHVQSELVQINGATER 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T+ + ++ T ++ A+ ++ I+ LT Sbjct: 369 DALTKSLPTVASGGTSICSGLRSAFAVIRKKFSTDGSE-----------------IVLLT 411 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N TI C ++ K++ I T+++ + L + + + Sbjct: 412 DGEDN-------TISSCFNEVKQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQA 463 Query: 397 S---LIHVFQNISQ 407 LI F +S Sbjct: 464 QNNGLIDAFGALSS 477 >gi|297300028|ref|XP_002805526.1| PREDICTED: collagen alpha-1(XIV) chain-like [Macaca mulatta] Length = 1717 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 50/163 (30%), Gaps = 24/163 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 + + ++ +T + + K + S T + A+K L + Sbjct: 1068 DGTQVAMVQFTDDPRTEFKLNAYKTK-ETLLDAIKHISYKGGNTKTGKAIKYVRDSLFTA 1126 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ +TDG + + + KI + + + I I Sbjct: 1127 ESG-------------TRRGIPKVIVVITDGRSQDDVN-----KISREMQLDGYSIFAIG 1168 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + S + + V + F+ I ++ Sbjct: 1169 VADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELIT 1207 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSR 337 >gi|297292637|ref|XP_001090691.2| PREDICTED: anthrax toxin receptor 2 [Macaca mulatta] Length = 488 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 55/191 (28%), Gaps = 25/191 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K + + I LTDG + K Sbjct: 121 HEGLKLVIEKIIYS----------------GDEKXXXXXIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNIS----QLM 409 ++ + + + Q L+ S E + V +L + +I + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEI 221 Query: 410 VHRKYSVILKG 420 + + S + G Sbjct: 222 LELQPSSVCVG 232 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 65/216 (30%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + L++D+SGSM +++MAA K A LD+ Sbjct: 29 QSAPKVNLLLDVSGSMRAKD--------------IDGQSRMAAAKQAFNEVLDATPEEVE 74 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R ++ + P + T A K A L Sbjct: 75 LGIRTLGANYPGDDR--------------KEGCKDTAQLYPVGPLNRTEA-KTAVATLAP 119 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + K I+ ++DGE+ + ++ + I I Sbjct: 120 TGWTPIGPALLKAADDLDGGDGSKRIVLISDGEDTCAPLDPC--EVAREIAARGIGLTID 177 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + R L ++ + +V + D L Sbjct: 178 TLGLVPNAKLSRQLSCIAEATGGTYTSVEHQDELTD 213 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 62/174 (35%), Gaps = 22/174 (12%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDST 295 S++L+ + Y+G++ ++ R P TE+ R+ + S T Sbjct: 357 AVTGSLELIRSARPQDYIGVVVFSDRPRWLFRPRPMTEQGRKEAESLLLSTQAGGGTMIR 416 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A +A + L K +I LTDG + + Sbjct: 417 RAYLEALEALEQV------------------PTESKQVIALTDGLAADVTPD--LFDAAR 456 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +A IK T++I A +G+ L + + +++V + L F +Q + Sbjct: 457 EA-SPRIKTNTVAIGADADGRFLRELAQAGDGTYWDVPRPEDLPRFFLEEAQRV 509 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPRYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQIEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|126173282|ref|YP_001049431.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125996487|gb|ABN60562.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 642 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/251 (12%), Positives = 71/251 (28%), Gaps = 36/251 (14%) Query: 175 SFIPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPED--VNSAPICQDKKRTKMAA 230 + P + E+ P + L + L G + K+ Sbjct: 199 NAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSMASADKLPL 258 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGL---IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 L+ AL L + V VY G + ++ Sbjct: 259 LQTALKLLTAQLSAQDKVSIVVYAGAAGVVLDGVSGNDTQTLTY----------ALEQLS 308 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + QAYQ+ + +I TDG+ N ++ Sbjct: 309 AGGSINGGQGITQAYQLAKKHFIPNGIN----------------RVILATDGDFNVGVTD 352 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I + +K K++ I + T+ +L++ +Y ++ L + + Sbjct: 353 FDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT--LNEARKVLV 410 Query: 407 QLMVHRKYSVI 417 + +++ Sbjct: 411 DELSSTLFTIA 421 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 61/184 (33%), Gaps = 28/184 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + +D S E +G++ Y+ + Sbjct: 775 FEVIKDFVNTLIDR---TSVSPEVTRVGIVLYSNINLLVTNIQDRLTRDEVKAAVRRMPY 831 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I + T + +++A ++ +K + +TDG+ + + Sbjct: 832 IGEGTYTGSGIRKANEMF-----------------AFARPGVRKVAMVITDGQ-TDHRDT 873 Query: 348 VNTIKICDKAKENFIKIVTISI--NAS---PNGQRLLKTCVS--SPEYHYNVVNADSLIH 400 V +A + I + I + + + ++ LK+ S + E+ ++V + L Sbjct: 874 VKLEDAVREAHLDNITMFAIGVVNQSDPIYDDFKQELKSIASPPTEEHMFSVEDFRMLHG 933 Query: 401 VFQN 404 F+ Sbjct: 934 AFEE 937 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 66/185 (35%), Gaps = 28/185 (15%) Query: 232 KNALLLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLI 288 K + F + + + LI Y++ V N + + ++ + D S I Sbjct: 65 KEFVRSFSRRLMEMQVSGWHLRTRLALIYYSSSVHINQHF-NDWQDLDVFLDQLEDASYI 123 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T ST A+ A Q+ + + + +TDG ++ ++ Sbjct: 124 GQGTYSTYAISNATQLFIRETSGQSV----------------RVSLLMTDGSDHPRNPDI 167 Query: 349 NTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 T + +AK + IKI I ++ N +L S + +++ + L + Sbjct: 168 MT--VVAEAKSHNIKIFAIGLSMRAMDSNSAKLRAVASSPAQQYFHSLTDRGLEE---RL 222 Query: 406 SQLMV 410 Q + Sbjct: 223 LQQIC 227 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 65/194 (33%), Gaps = 35/194 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVR 277 ++ L A LFL L V++ ++G++ + + E R Sbjct: 314 SGSMSNGDRLKRLNQAGKLFL-----LQIVEQGSWVGMVTFDSAAHVQSELVQINGATER 368 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T+ + ++ T ++ A+ ++ I+ LT Sbjct: 369 DALTKSLPTVASGGTSICSGLRSAFAVIRKKFSTDGSE-----------------IVLLT 411 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N TI C ++ K++ I T+++ + L + + + Sbjct: 412 DGEDN-------TISSCFNEVKQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQA 463 Query: 397 S---LIHVFQNISQ 407 LI F +S Sbjct: 464 QNNGLIDAFGALSS 477 >gi|58429529|gb|AAW78168.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRNLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 71/259 (27%), Gaps = 46/259 (17%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L ++ Q+ Q + L I D + + + Sbjct: 357 VIFVLDVSGSMVGQKLPQVKEAMGQILSEI-----------HSEDFFTLILFSDFAQVWT 405 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +N+ + K++ K + LD++ + Sbjct: 406 INATQETSNHWDEKVSNWKTNNNISLDTLGEN------------------RFVFPAT--E 445 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V+ D T+ A+ +A K + + I Sbjct: 446 QNVQYAKKFIQDLQSESSTNMEDALNKA----------HLIAKLGETRFKDGANTPKPII 495 Query: 334 IFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH--- 389 +FLTDGE IK E I ++ + L K +++ + Sbjct: 496 VFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSLGFGEGADIDFLKKLSLNNTGFARVI 555 Query: 390 YNVVNAD-SLIHVFQNISQ 407 Y +A L + ++ IS Sbjct: 556 YEASDASLQLRNFYKEISS 574 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 28/336 (8%), Positives = 87/336 (25%), Gaps = 26/336 (7%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ ++ + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLTGKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLN--QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + + N + + + E I ++ V ++S + Sbjct: 191 PIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVN 250 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + ++ V G + Sbjct: 251 LTTFSTTGSYKQAAFSQ--------LDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL 302 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +++ + +YV D + + ++ + F Sbjct: 303 QKQHAQL--------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELSSN 354 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIVTISI-NASP---NGQR 377 S + + T + ++++ KA + ++ I GQ Sbjct: 355 VPASFNYGRLGYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFSGLEHEIAYGQN 414 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 L + + N ++L F +I + + Sbjct: 415 LTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 31/247 (12%), Positives = 68/247 (27%), Gaps = 27/247 (10%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLS----GSMHCAMNSDPEDVNSAPICQDKKRT 226 + +V+ + R+ +R + ++ S + V Sbjct: 308 RPVVASVRPAARLSTAKRRELPFPGSLAVADGLLASYENDLRRPSRTVYVLDTSGSMNGD 367 Query: 227 KMAALKNALLLFL--DSIDLLSHVKEDVYMGLIGYTTRV----EKNIEPSWGTEKVRQYV 280 ++A LK AL D ++ + L+ + +RV + + Sbjct: 368 RLAQLKRALGQLAGSDVSPTGDRFRDREEVTLMPFGSRVKGVRTHTVPGERPAPVLAAIR 427 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T +++ AY L + F I+ +TDGE Sbjct: 428 ADAEALTADGDTAIFSSLQAAYDHLAQRRSALGDDRFTS-------------IVLMTDGE 474 Query: 341 NNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 N + + + + E + I + +L + ++ L Sbjct: 475 NTTGATASDFDAYYRRLRGPERTAPVFPIVFG-DSDRSQLQSIATLTGGRLFDATKGS-L 532 Query: 399 IHVFQNI 405 F+ I Sbjct: 533 DGAFEEI 539 >gi|15418999|gb|AAK77222.1| capillary morphogenesis protein-2 [Homo sapiens] gi|119626253|gb|EAX05848.1| anthrax toxin receptor 2, isoform CRA_a [Homo sapiens] Length = 386 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 52/171 (30%), Gaps = 21/171 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQN 404 ++ + + + Q L+ S E + V +L + + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINS 212 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 45/181 (24%), Gaps = 29/181 (16%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRV---EKNIEPSWGTEKVRQYVTRDMDSLILK 290 F+ + + V G I + T + + V R Sbjct: 265 TTADFIGKTGNANDPIDLVIQGAINFPTNYVSNNPSTPLTPNYPTHSSKVGRRNLPES-- 322 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 K Y L++ V P P + F + N Sbjct: 323 --------KFDYSNLSARITFDGVAGAL---VYEPRFPHPYYYYFPCNAAIN-------- 363 Query: 351 IKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 AK + I TI + LK + + + +L F NI+Q Sbjct: 364 --EAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTA-TPANLAAAFDNIAQT 420 Query: 409 M 409 + Sbjct: 421 I 421 >gi|51247575|pdb|1T6B|Y Chain Y, Crystal Structure Of B. Anthracis Protective Antigen Complexed With Human Anthrax Toxin Receptor Length = 189 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 52/171 (30%), Gaps = 21/171 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 33 IYNFVQQLAER-FVSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYI 91 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 92 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 134 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQN 404 ++ + + + Q L+ S E + V +L + + Sbjct: 135 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINS 183 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 46/456 (10%), Positives = 124/456 (27%), Gaps = 80/456 (17%) Query: 24 IIFALSVMSFLL-LIGFLIYVLDWHYK------------KNSMESANNAAILAGASKMVS 70 ++ A+ +MS + +IG + + + + SM+ + I S + Sbjct: 13 MVMAIMLMSMIAVIIGIIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTIGKATSVFIL 71 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ E + S + ++ + K+ + + + Sbjct: 72 DDSKFKGNKEGLTKEWSYIGLSSDGKKVLNYVWDKSKQDWKVSELGTKSLYDIKLDLEFK 131 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + + N + + + ++ K + Sbjct: 132 TEGAYQDNRLISYNLTGKYPDSNNKLSIDTAISALNTKQVFSKVAKGKKGIALAYRNDPI 191 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ + I V D SGSM +N + + +++M+ LK+ + + + Sbjct: 192 EGQMNVA-ISFVFDKSGSMSWDLNGNNTNY-------WGPKSRMSILKDKATIMMRDLKD 243 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ I + + E GT + + T+ ++ Sbjct: 244 IGNVSVNLVSFSILGSYVQKDFSELDKGTTTIEASINAL---QTGGVTNPGDGLRYGMMS 300 Query: 305 LTSDKKRSFFTNFFRQG-----------VKIPSLPFQKFI-----------------IFL 336 L + + + G + Q + + Sbjct: 301 LQNHSAQLKYVVLLTDGIPNAYTVDTNDTSWRNRNVQPYYNRWRETVGELVTFNNGPYDV 360 Query: 337 TDGENNNFKSNVNTIKICDKAKENFI---------------KIVTISI-NAS---PNGQR 377 T + + ++ I ++ I G+ Sbjct: 361 TTNLTTDQNRVSYDSYSNEALRKKAIEYAGKVSQTFGAGVKRVNVIGFSGVPSEIAYGED 420 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 L ++ S + + +L F +I + + Sbjct: 421 LTRSIGSGGMEAKYVPAADEAALQQTFSDIKKQIQQ 456 >gi|269128332|ref|YP_003301702.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268313290|gb|ACY99664.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 149 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 16/144 (11%), Positives = 37/144 (25%), Gaps = 3/144 (2%) Query: 4 LSRFRFYFKKGIAS-EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + R ++ ++ ++ L S LLL G L+ + + A Sbjct: 1 MKVLRRALRRAGTDGDRGTIAMYTVLLTPSVLLLAGLLVDGGLAIHARQRAADMAEQAAR 60 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 AGA+++ ++ L E + A I Sbjct: 61 AGANEI--DVQTLRATGEPRIDAAAACASAYDLLDAYGQEVADRDCLPGAEEVRITVTIR 118 Query: 123 NSSRISMTHMANNRLDSSNNTIFY 146 ++ ++ + Sbjct: 119 VRPQLLAIVPGLGHFTMTSTASAH 142 >gi|30749469|pdb|1MHP|A Chain A, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody gi|30749472|pdb|1MHP|B Chain B, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody Length = 192 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 68/188 (36%), Gaps = 27/188 (14%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + + D + + + +G++ Y V + + V + Sbjct: 20 SVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIVQRGGRQ 79 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 80 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 126 Query: 352 KICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHV 401 + C+ + I+ +I+I N L +K+ S + ++ +NV + +L+ + Sbjct: 127 QDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTI 183 Query: 402 FQNISQLM 409 + + + + Sbjct: 184 VKALGERI 191 >gi|58429533|gb|AAW78170.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429547|gb|AAW78177.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|323345326|ref|ZP_08085549.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] gi|323093440|gb|EFZ36018.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] Length = 340 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 69/226 (30%), Gaps = 57/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D +++A K + +D+ + +GLI + + + Sbjct: 97 DISNSMMAEDVTPSRLAKSKLLVENLVDNFT-------NDKIGLIVFAGDAFVQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ A + T K Sbjct: 150 YVSAKMFLQNIDPSLIATQGTDIAGAINLASKSFTQ------------------QDKVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + + I + I + +GQ Sbjct: 192 AIIVITDGEDHEGGAIEAAKAA----RAKGYNIFILGIGSTNGAPIPMANGGYLQDASGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + + +V N + ++ + + Sbjct: 248 TVMTKLNEQMCKEIAQAGNGTYIHVDNTS---DAQEKLNDELTKLQ 290 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 51/176 (28%), Gaps = 23/176 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D + + ++ +I Y+T + Sbjct: 195 FEKMKTFVKNVVDFFN---IGPKGTHVAVITYSTWAQVEFNLKAHHSSKAALKNAVNAIY 251 Query: 288 -ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+ A + F P K + LTDG +N Sbjct: 252 YRSGWTYTADALDLAGR-----------NIFQVANGMRPDKGIPKIAVLLTDGYSNG--- 297 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 N + + + + + + I +R L + P ++ + + N + L Sbjct: 298 -NNPLGPANDLRAAGVNVFCVGIG--NYYERELNDIATDPDKDHVFKLENFNDLNS 350 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 70/204 (34%), Gaps = 38/204 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++ L A LFL L +++ + G++ + + T+ R + Sbjct: 320 GDRLNRLNQASQLFL-----LQIIEKGSWTGMVTFDSSATIQSALIQIETDAQRNSLISR 374 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T ++ A+ ++ + I+ LTDGE++ Sbjct: 375 LPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE-----------------IVLLTDGEDS- 416 Query: 344 FKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LI 399 TI C D+ K++ I T+++ S + L + + + + LI Sbjct: 417 ------TISSCFDEVKQSGAIIHTVALGPSADPG-LEELAKMTGGMKTSATDNAQNNGLI 469 Query: 400 HVFQNISQL---MVHRKYSVILKG 420 F +S + R + KG Sbjct: 470 DAFSALSSENGAITQRSIQLDSKG 493 >gi|159900724|ref|YP_001546971.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893763|gb|ABX06843.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 415 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 25/242 (10%), Positives = 64/242 (26%), Gaps = 49/242 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 ++ + P LV+D SGSM Sbjct: 18 SGTSQVNYVLIQAKPHHVPTVQAAPPLNFCLVLDRSGSMA-------------------- 57 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTR 282 K+ L+ A+ + ++ + V ++ + +E + + ++ + Sbjct: 58 GDKIQHLREAVREIVANLRPIDAVS------IVLFDDTLEVLVPARLADDLPALQNAIES 111 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T + ++ L + ++ LTDG+ Sbjct: 112 IGEQ---GGTAMSLGLQAGLAELQKFQAADRV----------------GRVLLLTDGQ-T 151 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + + I + + N L +S + + + F Sbjct: 152 WGDEDTC-RDLAKQIGDLGVSITALGLGTEWNEALLDDLATASNGESDYIADPSQISKYF 210 Query: 403 QN 404 Q Sbjct: 211 QQ 212 >gi|319652226|ref|ZP_08006344.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] gi|317396049|gb|EFV76769.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] Length = 463 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 56/193 (29%), Gaps = 17/193 (8%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +S+ + Q + KM K+A+ F +I + + L+ Y + + Sbjct: 162 DASSSMLLQAGGKMKMDIAKSAVKSFAQTIG------QSSEVSLVVYGHKGSEA------ 209 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ ++ + S +A S + Sbjct: 210 -DADKEISCSGVEEVYPMGKYSKKEFHEAVDSFESKGWTPLAGAIQKAAEMSSGYDGSTT 268 Query: 333 IIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHY 390 I ++DG + + + N + I I + LK + + Sbjct: 269 IYIVSDGAETCDGDPVSASKNLVKNNSSNSVNI--IGFGVDGKAENQLKAVAEAGNGEYL 326 Query: 391 NVVNADSLIHVFQ 403 N D L + Q Sbjct: 327 KADNPDELKNTIQ 339 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 49/362 (13%), Positives = 106/362 (29%), Gaps = 50/362 (13%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A AS + + ++ + A + + R + ++ Sbjct: 236 AQQMAASDDQLFATYTSEPAKASAFPTSEAALAEHNRVNPQNKMVAVIPAEGTPS--FEY 293 Query: 120 NIVNSSRISMTHMANNRLDS------SNNTIFYNMDVMTSYDYRLQ-FIEHLLNQRYNQK 172 ++N S L + + T+ T+ L + + Sbjct: 294 TLINVSTNPAKAQGIEALRAFLRSEEAAKTLASFGLRSTAVPVTLPTPPGSIGEIKPGST 353 Query: 173 IVSF-IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + A + + F + V D+SGSM+ T++ Sbjct: 354 PNAAQVTAAIDVWQAATTSFQLLSVFDVSGSMNEK---------------VGNTTRVRIT 398 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTRDMDS-L 287 + A + L+++ + + V+ G ++ + S ++ R S Sbjct: 399 QEAAGIALNALPRSTKLGLWVFSSDKGGGRDYKELVPLGLLSDDAQRARMAAAAASLSKE 458 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + AY + + ++ LTDG+N + Sbjct: 459 VDGWTGLYDTIWAAYSKVKASY----------------DPQRVNAVVILTDGKNEDPGGG 502 Query: 348 VNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ ++ K K+ I I TI I + + L K SS +Y N + V Sbjct: 503 LSLEQLLAKIKDATDPKRPIAITTIGIGPGVDAESLRKISRSSYSDYYGAENPADMTTVL 562 Query: 403 QN 404 Sbjct: 563 AK 564 >gi|325695689|gb|EGD37588.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK150] Length = 460 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 47/452 (10%), Positives = 122/452 (26%), Gaps = 77/452 (17%) Query: 24 IIFALSVMSFLL-LIGFLIYVL-------------DWHYKKNSMESANNAAILAGASKMV 69 +I A+ +MS + +IG + + + SM+ + I S + Sbjct: 14 MIMAIMLMSMIAVIIGIIFNTMFSGRELVEREASIQAEM-RTSMQY-VDRTIGKATSIFI 71 Query: 70 SNLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ--NIVN 123 + S+ + + S + ++ N K++ + ++ Sbjct: 72 LDDSKYQGSEKGLTKEWSYIGLSEDGKKVRNYVWNKSKQNWDITDLGTKSLYNMKLDLEF 131 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + + + + ++ K I Sbjct: 132 KADDLYKDNRLISYNLTGKYPDTSNKLSIDTAISALNSKQVFSKVAKGKKGIAIAYRNDP 191 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 G+ + I V D SGSM M + + + ++M LK + + + Sbjct: 192 IEGQMNVS-ISFVFDKSGSMAWDMLGKEVEETN-------RPSRMKILKEKSIAMMKDLK 243 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + +V ++ + E E GT + + + + T+ ++ Sbjct: 244 SIGNVSVNLVAFSTLGSYVQEDFSELDKGTTTIETSINKLDE---GGYTNPGDGLRYGMV 300 Query: 304 ILTSDKKRSFFTNFFRQGVKIPS---------------LPFQKFIIFLTDGENNN-FKSN 347 L + + + GV + + + + Sbjct: 301 SLQKNPAQLKYVVLLTDGVPNAFTAKTNDVSSRNRIKVYYPNGYYWIRFNNSTYDLTTNF 360 Query: 348 VNTIKIC-------DKAKENFI---------------KIVTISI-NASPN---GQRLLKT 381 + + ++ I ++ I + GQ L K Sbjct: 361 TSNEDYAAYDPYSQETLRKQSIEYSGKVSQTFGAGIKRVNVIGFSGREEDVRYGQDLTKA 420 Query: 382 CVS--SPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + N + L F +I + + Sbjct: 421 IKEGKTDAEYTSAENEEKLQATFSDIKKQIQQ 452 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 60/208 (28%), Gaps = 37/208 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + + KM A K A+ +LD I + + E + S G Sbjct: 159 DASGSMAGKVNGQVKMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCG 215 Query: 273 TEKV---------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +V Q+ + T A++ Sbjct: 216 SSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNDDFKE---------------- 259 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKT 381 + ++DGE + ++ + I + + Q+ LK Sbjct: 260 YTGEENLNVVYIVSDGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKN 316 Query: 382 -CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + V NAD L +Q +++ Sbjct: 317 TAEAGKGNYATVSNADEL---YQTLNKE 341 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICRGLRSAFTVIKKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISECFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/358 (8%), Positives = 81/358 (22%), Gaps = 49/358 (13%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ S + D ++I+ +G A + + + + Sbjct: 192 NTNVLDHQGNKNGSNQWDGINSWDGDPNDRTHSYIEYGGTGNQADYAIRKFAKETTTPGL 251 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPALLRIE-M 185 + + + + + N R + + + + Sbjct: 252 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNNNDRIGEVKIGVDRFVDTLADS 304 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + V +S+ + ++ + + +K+ + + Sbjct: 305 GITDKINMGYVG---------YSSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFT 355 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 D L + + + G V R ++ + + Sbjct: 356 QKGLRDAGDMLSVPNGHKKVIVLLTDGVPTFSYKVQRVRAQ----SSNDYYGTQFSNTQD 411 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + + T GE KE I+I Sbjct: 412 QPG-NTSRIARSYYAPDQNNQSRRIDSTFIATIGE-------------AMALKERGIEIH 457 Query: 366 TISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 458 GLGIQLQSDSAAGLSKAEVESRMRKMVSADEKGDLYYESADHATDISEYLAKKAVQIS 515 >gi|256261598|gb|ACU65921.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 52/166 (31%), Gaps = 21/166 (12%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ R +++ + + T + A++ L S Sbjct: 183 RPSSQFSLVQFSDRFQEHFTFKDFATSSDPLNLLNSVWQLGGWTFTASAIRFVTDRLLSA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K +I +TDGE I + A I I Sbjct: 243 AYG-------------ARKDASKILIVITDGEKTEKVDYKEVIPRAEAA---GIIRYAIG 286 Query: 369 INASPNGQRLLK---TCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + ++ + L+ S+P + + V N D+L + + + + + Sbjct: 287 VGSAFQYRNSLQELIDIASTPSKEHVFQVENFDALRDIQKQLKEKI 332 >gi|221315959|ref|ZP_03597764.1| hypothetical protein BsubsN3_19772 [Bacillus subtilis subsp. subtilis str. NCIB 3610] Length = 235 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/208 (9%), Positives = 55/208 (26%), Gaps = 33/208 (15%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH--------VKEDVYMGLIGYTTRV 263 D + + + + K+ K ++ F + + ++ + G + Sbjct: 47 FDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFGHAGNNKLSGKALSCSTT 106 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + + T A+ + + Sbjct: 107 ETIYGLHPYEGSL--FDNSLSELKPTGWTPIAKALADTRKEFEAF--------------- 149 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT 381 + + +TDGE +K + + + + I N G +K Sbjct: 150 --DADGKNVVYLITDGEETCGGD---PAAEIEKLRASNVDTIVNIIGFNFDVKGNEEMKQ 204 Query: 382 CV-SSPEYHYNVVNADSLIHVFQNISQL 408 + + + +AD ++ +Q Sbjct: 205 AAVAGGGEYISANSADEFEQAWEKEAQK 232 >gi|50913505|ref|YP_059477.1| collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|19108443|gb|AAL11466.1| FctX [Streptococcus pyogenes] gi|50902579|gb|AAT86294.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|198417345|gb|ACH87890.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417347|gb|ACH87891.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1036 Score = 54.9 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 31/316 (9%), Positives = 75/316 (23%), Gaps = 55/316 (17%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + I ++ R + G+ Sbjct: 433 MTGKKNPLDILVVVDKSGSMQEG---------IGSVQRYRYYAQRWDDYYSQWVYHGTFD 483 Query: 205 CAMNSDPEDVNSAPICQ-------DKKRTKMAALKNA---LLLFLDSIDLLSHVK----- 249 + + + A+KN+ + L ++ Sbjct: 484 YSSYQGESFNRGQIHYRYRGIVSVSDGIRRDDAVKNSLLGVNGLLQRFVNINPENKLSVI 543 Query: 250 ----------------EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + G ++ E G + +L T+ Sbjct: 544 GFQGSADYHAGKWYPDQSPRGGFYQPNLNNSRDAELLKGWSTNSLLDPNTLTALHNNGTN 603 Query: 294 STPAMKQAYQIL----TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 A+ +A +IL +++ + ++ NN +S Sbjct: 604 YHAALLKAKEILNEVKDDGRRKIMIFISDGVPTFYFGEDGYRSGNGSSNDRNNVTRSQEG 663 Query: 350 TIKICDKAKEN--FIKIVTISINASPNGQR------LLKTCVSSPEYHYNVVNADSLIHV 401 + D+ K + I ++ ++ N +LK S E++Y + + L Sbjct: 664 SKLAIDEFKARYPNLSIYSLGVSKDINSDTASSSPVVLKYL-SGEEHYYGITDTAELEKT 722 Query: 402 FQNISQL--MVHRKYS 415 I + + S Sbjct: 723 LNKIVEDSKLSQLGIS 738 >gi|10048261|gb|AAG12328.1|AF249739_1 sporozoite surface protein 2 [Plasmodium falciparum] Length = 559 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 68/204 (33%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNIFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|290986713|ref|XP_002676068.1| predicted protein [Naegleria gruberi] gi|284089668|gb|EFC43324.1| predicted protein [Naegleria gruberi] Length = 413 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 64/201 (31%), Gaps = 29/201 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM + +D++ +G++ + + E + + + ++ + + Sbjct: 70 KMKVANKVICEIIDNLKDFE------RLGIVLFDDKAETFLPLTIVQDLEKKSLKERVMK 123 Query: 287 LI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T+ MK+ + ++ II+LTD N Sbjct: 124 ITEKGSTNFEAGMKRGIDLFSTMDSSDLSN--------------SNRIIYLTDACPNVGG 169 Query: 346 SNVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS----LI 399 ++ + A I I I N + + + +++V +++ L Sbjct: 170 TDS-LDVLTKDANSGPYGILSTFIGIGLDFNSEIVEELTKVRGCNYFSVKSSEEFKKILN 228 Query: 400 HVFQNISQLMV-HRKYSVILK 419 F I + + K ++ K Sbjct: 229 EDFNYIVTPICYNVKLTLESK 249 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 51/153 (33%), Gaps = 29/153 (18%) Query: 254 MGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ + + + + + + T ++ +Q + + Sbjct: 347 TGMVTFESSATIQNYLTEITDHNAYEKILANLPQA-AGGGTSICSGLRAGFQAIIHSNQN 405 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + + I+ LTDGE++N I +C ++ K++ I TI++ Sbjct: 406 TSGSE----------------IVLLTDGEDDN-------ISLCFEEVKKSGSVIHTIALG 442 Query: 371 ASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVF 402 + L + + Y + + LI F Sbjct: 443 -PSAAKELEILSNMTGGHRFYANKDINGLIDAF 474 >gi|327262912|ref|XP_003216267.1| PREDICTED: integrin alpha-1-like [Anolis carolinensis] Length = 1166 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 72/195 (36%), Gaps = 28/195 (14%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ FL+S+ ++ + +G++ Y V + T V Sbjct: 231 YPWESVTEFLNSLLQNMNIGPQQTQVGIVQYGENVTHEFNLNTYTTVEEVLVAAKKIGQR 290 Query: 289 LKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ + A + ++ + QK ++ +TDGE+++ Sbjct: 291 GGTRTNTALGIDTARKEAFTEARG-------------ARRGVQKVMVVVTDGESHDNYRL 337 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADS 397 I+ C+ + I+ +I+I N L +K+ S + ++ +NV + + Sbjct: 338 GEVIQDCE---DENIQRFSIAILGHYNRGNLSTEKLVEEIKSIASEPTEKHFFNVSDELA 394 Query: 398 LIHVFQNISQLMVHR 412 L+ + + + + + Sbjct: 395 LLTIVEALGERIFAL 409 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 69/225 (30%), Gaps = 33/225 (14%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + LVVD+SGSM + ++K+ K A + L Sbjct: 406 RKKANVLLVVDVSGSMGDEVKGTG-------------KSKIDLAKQAAIDSLGQFVPRDQ 452 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILT 306 V + + ++ + ++ + +R T + AY+ L Sbjct: 453 VGLWQFATHLDGDKDYQELLPVQPLGSNGKETLASRLSGLTPQSGTGLYDSSLAAYEYLK 512 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---KSNVNTIKICDKAKENFIK 363 ++ LTDG N + + ++ + ++ Sbjct: 513 ----------------AHLDPSAINAVVVLTDGRNEDPGGVDLDHLVPQLRPEGNAESVR 556 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + TI+ + L + ++ Y+ DS+ VF ++ Sbjct: 557 LFTIAYGGDADQNVLKQIAEATAGSEYDSSKPDSINQVFTSVISN 601 >gi|296331310|ref|ZP_06873782.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676296|ref|YP_003867968.1| hypothetical protein BSUW23_18120 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151425|gb|EFG92302.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414540|gb|ADM39659.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 281 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 52/201 (25%), Gaps = 33/201 (16%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH--------VKEDVYMGLIGYTTRV 263 D + + I + K+ K ++ F + + ++ + G + Sbjct: 93 FDGSGSMIQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFGHAGNNKLSGKALSCSTT 152 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + + T A+ + + Sbjct: 153 ETIYGLHPYEGSL--FDNSLSEIKPTGWTPIAKALSDTRKEFEAF--------------- 195 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT 381 + + +TDGE +K +E+ + + I N G +K Sbjct: 196 --DADGKNVVYLITDGEETCGGD---PAAEIEKLRESNVDTIVNIIGFNFDIKGNEEMKQ 250 Query: 382 CV-SSPEYHYNVVNADSLIHV 401 + + + +AD Sbjct: 251 AAVAGGGEYISANSADEFEQA 271 >gi|55962354|emb|CAI11851.1| novel protein (zgc:56119) [Danio rerio] gi|56207241|emb|CAI21014.1| novel protein (zgc:56119) [Danio rerio] Length = 946 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 67/222 (30%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + + + Sbjct: 305 NIVFVIDVSGSMW--------------------GLKMKQTVEAMKAILDDLSIDDYFSII 344 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + S ++ ++Y+ T+ A+ +A Q+L Sbjct: 345 DFNHNV-RCWSEDLVQASSIQVDEAKKYIQNIK---PNGGTNINEALLRAIQML------ 394 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + II ++DG+ K + + + KE + ++ I Sbjct: 395 -----IKASHHGLIDPRSVSMIILVSDGDPTVGEIKLSTIQKNVKLRMKEE-FSLFSLGI 448 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNADS-LIHVFQNISQ 407 + L + + + Y NA L + +S Sbjct: 449 GFDVDFDFLERIAMDNRGIAQRIYANQNAAEQLKTFYSQVSS 490 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/336 (8%), Positives = 87/336 (25%), Gaps = 26/336 (7%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ ++ + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLTGKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLN--QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + + N + + + E I ++ V ++S + Sbjct: 191 PIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVN 250 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + ++ V G + Sbjct: 251 LTTFSTTGSYKQAAFSQ--------LDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL 302 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +++ + +YV D + + ++ + F Sbjct: 303 QKQHAQL--------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELSSN 354 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIVTISI-NASP---NGQR 377 S + + T + ++++ KA + ++ I GQ Sbjct: 355 VPASFNYGRLGYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFSGLEHEIAYGQN 414 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 L + + N ++L F +I + + Sbjct: 415 LTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 44/359 (12%), Positives = 107/359 (29%), Gaps = 49/359 (13%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHY-------------KKNSMESANNAAILAGASKMVS 70 +I A+ +MS + LI +I+ + + SM+ + + S + Sbjct: 14 MIMAIMLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFIL 72 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ + + S A ++ N K+ + + + Sbjct: 73 DDSKFKGSKQGLTREWSYIGLSADGKKVLNYVWNKEKQDWDVSELGTKSLYNMKLDLEFK 132 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + + N + + + ++ K I Sbjct: 133 TEGAYQDNRLISYNLTGKYPDTNSKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPI 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ I + V D+SGSM A+N N+ ++M L++ + ++ + Sbjct: 193 QGQMNIA-VSFVFDISGSMKGALNGANPTSNN--------PSRMDILRDKAEIMINELQS 243 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ + + + +++ + T+ ++ Sbjct: 244 VGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKS--DGGVTNPGDGLRYGMVS 301 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K+++ LTDG N + N + K I+ Sbjct: 302 LQK------------------QHAQLKYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 64/210 (30%), Gaps = 34/210 (16%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++M+A K A LD++ + Sbjct: 41 VELVLDVSGSMRTRD--------------IDGQSRMSAAKQAFNDVLDAVPEEVQLGIRT 86 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R + + P D T A K A L Sbjct: 87 LGANYPGDDR--------------KVGCKDTKQLYPVGPLDRTEA-KTAVATLAPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISIN 370 + + I+ +TDGE+ + ++ I I T+ + Sbjct: 132 IGPALLGAADDLDGGDATRRIVLITDGEDTCGPLDPC--EVARDIAARGIHLVIDTLGLV 189 Query: 371 ASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 + ++ L ++ + V +AD L Sbjct: 190 PNAKIRQQLTCIAEATGGTYTAVQHADELS 219 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 72/252 (28%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K++ I+M E+ + ++D SGSM+ K Sbjct: 192 NKDTKLMMVGLNTAAIDMSEKKASNLVFLIDTSGSMYEE-------------------NK 232 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + A + +++D + Y G + + + Sbjct: 233 LPLAQKAFKMLAENLDENDRISIVTYAG-------SDTVVLNGVAGSEAYTICEALDSLE 285 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ + + AY+I + +I TDG+ N S Sbjct: 286 ASGSTNGSAGLITAYEIAEQQFIKDGNN----------------RVILATDGDLNVGLTS 329 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + + + + K++ I + + + L+ + + + + + Sbjct: 330 ESDLVGLITEEKDSGIFLSVLGFGSDNLKDNKLEALADHGNGNYSYLDSVYE---AKKVL 386 Query: 406 SQLMVHRKYSVI 417 M Y+V Sbjct: 387 VDEMGGTLYTVA 398 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 32/192 (16%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWG 272 S + T++ + A L+L + ++++ +GL+ + + Sbjct: 314 DKSGSMRLGSPITRLTLMNQAAELYL-----IQIIEKESLVGLVTFDSTATIQTNLIRII 368 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + T +K+ ++ +TS + + + Sbjct: 369 NDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE---------------- 412 Query: 333 IIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHY 390 I+ LTDGE+N S C + K + I TI++ + L + Y Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALG-PSAARELETLSDMTGGLRFY 464 Query: 391 NVVNADSLIHVF 402 + + LI F Sbjct: 465 AKEDVNGLIDAF 476 >gi|51891532|ref|YP_074223.1| hypothetical protein STH394 [Symbiobacterium thermophilum IAM 14863] gi|51855221|dbj|BAD39379.1| hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 252 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 23/58 (39%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + ++ + +F +L IGF + V + + +A AA LAG + Sbjct: 1 MNRFRTEQRGSTGALFVFIWAVAVLAIGFALDVGRVFVLREQLRTAEEAAALAGVRQA 58 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 70/258 (27%), Gaps = 54/258 (20%) Query: 185 MGERPIFLIELVVDLSGSMH--------CAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 RP V+D SGSM+ + +TK+ + +L Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDS 294 + + L+ + + + T ++ + + + T Sbjct: 95 RLV----SSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNF--SGGTRM 148 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 M++A IL + + TDG+ + + Sbjct: 149 ALGMEEALNIL------------------KNCDLSSRRTLIFTDGQTFDESD---CRDLA 187 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV------VNADSL------IHVF 402 + E I I + + N LL + +NV + +F Sbjct: 188 TQFAEAGIPITALGVG-EYNEDLLLYLSDRTGGRVFNVVETQTHTGTTDIPISELPNTIF 246 Query: 403 QNISQL----MVHRKYSV 416 + + Q + + K ++ Sbjct: 247 EEVQQAQSEVINNLKLNI 264 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 60/197 (30%), Gaps = 26/197 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 ++ +L++AL+ + L+ + + ++ + T + E Sbjct: 351 TSGSMAGARIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTGPGAPRTFTVPEE 410 Query: 275 KVRQYVTRDMD----SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + + T ++ +AYQ+L Sbjct: 411 NPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVLEPLMA--------------ADPDRF 456 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEY 388 I+ +TDGEN N S + + + + T+ + L + + Sbjct: 457 TSIVLMTDGENANGSSLPDFLTSLASLPPAMKQVPVFTVLFGEG-SSDELTQVATRTGGK 515 Query: 389 HYNVVNADSLIHVFQNI 405 ++ N L VFQ I Sbjct: 516 VFDARNV-QLSRVFQEI 531 >gi|224823582|ref|ZP_03696691.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] gi|224604037|gb|EEG10211.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] Length = 995 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/310 (10%), Positives = 67/310 (21%), Gaps = 68/310 (21%) Query: 147 NMDVMTSYDYRLQFIEHLLN-------QRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ V + + Q+ + + + + + D+ Sbjct: 43 SVKVTPNILLFFDNSGSMAWDVYGNTGQKRERHCNWRGCWYGDVLDPLQGETRLSIAKDV 102 Query: 200 SGSM--HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + SM +D + N + + + V + Sbjct: 103 TKSMIDAYPDMRWGLFSFDPSSSKDYVNDREDLRYNRAGRLVAEVGGGKDHLDRVKSAID 162 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + N + + T M AY + S + Sbjct: 163 SLG--ADTNTPLAEAY---------------YEMTRYYRGMSSAYYKPVNPASGSRGSGR 205 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFK----------------SNVNTIKICDKAKE-- 359 + V+ F++ +TDGE + +A Sbjct: 206 YASPVQYR--CQANFVLMMTDGEPTEDNEFPLSGDPLFAGLSLGKDNPLPSFAGRAAAGD 263 Query: 360 --------------------NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 IK TI L KT + ++ N + L Sbjct: 264 LFTSGSDGEGQGWDAETFARQSIKTYTIGFTVDNT--LLQKTAMQGGGRYFTASNREQLK 321 Query: 400 HVFQNISQLM 409 F + + Sbjct: 322 ASFTSAMDDI 331 >gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] Length = 670 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/281 (11%), Positives = 76/281 (27%), Gaps = 59/281 (20%) Query: 141 NNTIFYNMDVMTSYDYR--LQFIEHLLN------QRYNQKIVSFIPALLRIEMGERPIFL 192 ++ + + R L + + + + + G R Sbjct: 262 GSSFSDKVAMDQDVVLRWQLDPVASASGAVFSEEYKGEHYALVMLRTPDEMTSGPRMPRE 321 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D SGSM +M K AL ++ + Sbjct: 322 VVFVIDTSGSMA--------------------GQRMYHAKQALSQAVERLSPDDRFN--- 358 Query: 253 YMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ + + + S V+Q + T PA++ A + Sbjct: 359 ---VVEFNNQHSRLFSSMRSASAINVKQALNWVGRLQGGGGTMMLPAVEDALSV------ 409 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + +I +TD N + ++ + ++ T+ I Sbjct: 410 -------------RSDPAYLRQVILITDASVGNEAEILRVVER----QRKGARLFTVGIG 452 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 SPN L K + + + + Q + + + Sbjct: 453 VSPNSYLLRKAAQVGQGDYVYIASGQEVKARMQRLFAKLEN 493 >gi|58429479|gb|AAW78143.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429539|gb|AAW78173.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLSDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|330918891|ref|XP_003298384.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] gi|311328422|gb|EFQ93524.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] Length = 1367 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 61/183 (33%), Gaps = 23/183 (12%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 +S+ Q T++ LK +++ + ++GL+ ++T+ + + E Sbjct: 946 DSSASKQKSHMTRLDVLKQMFDAYINRLLA---YNFHSHVGLVTFSTKALVAQKITNAVE 1002 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R + T ++ A + ++ + II Sbjct: 1003 NFRHKLNNLK---ASGDTAIWDSIALAQDQIQQYAEQY--------------PGSKLRII 1045 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 ++DGE+N + + + + + I + + + + + L C S Y + Sbjct: 1046 CISDGEDNKSQ--NTAVDLASRLIRDDITVDSFCL-DDHSNKELQTLCSLSGGYSFAPKT 1102 Query: 395 ADS 397 D Sbjct: 1103 LDE 1105 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 65/245 (26%), Gaps = 33/245 (13%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + LV+D SGSM + D + A N Sbjct: 388 DFEITGTPPEKPVDVILVIDTSGSMGTRIPGDSKAPLYYAKL---------AAINFANSI 438 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D + G GY + I + + + + T+ Sbjct: 439 IDENPDSRVGVIEFSGGYYGYASDASTVINLTNNKANLASSINGLTTHNM---TNIQAGF 495 Query: 299 KQAYQILT------SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + AY ++ K F V I + ++ T + Sbjct: 496 RLAYNKISAISSTRDSVKSVVFLTDGVANVSIGNWSSSNPVVHNTHTIAAYTEGQSLYSY 555 Query: 353 ICDKAKENFIKIVTIS-INASPN------GQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 I + TI A N + L+ V E +Y +A L V++ Sbjct: 556 I-------NGNLFTIGLFGAISNSSVKSIARDTLQKAVYDDLEKYYEASSAVDLGPVYET 608 Query: 405 ISQLM 409 ISQ + Sbjct: 609 ISQKL 613 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 12/102 (11%), Positives = 31/102 (30%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + N + + AL FL + F + ++ ++S + A +AGA+ Sbjct: 1 MRGLVSRFLRARGGNLATMAALVSPIFLAVAAFSVDTSSLFLERRQLQSMADFAAVAGAA 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + + + + + Sbjct: 61 SLSQANDAVLRQLRANGLDPVLMTGAYDPSVVNGKTDNKTRV 102 >gi|125973772|ref|YP_001037682.1| carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] gi|125713997|gb|ABN52489.1| Carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] Length = 1050 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 60/213 (28%), Gaps = 32/213 (15%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK------NIEPSWGTEKV 276 K T + +K+A F+D M +I Y+ ++ + Sbjct: 721 DKSTYLKLVKDAAKNFVDKFA-----GSKTKMAVIQYSDSANDNDFKKYDLSLPDKGAAL 775 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK------------RSFFTNFFRQGVKI 324 ++ + + ++ M++AY IL S + K Sbjct: 776 KETIDKIKPG-TSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGSVPNRWTAVDNKK 834 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +N ++ S + I +V I + G L + Sbjct: 835 NEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEEDIGDVLEEIAAE 894 Query: 385 SPE--------YHYNVVNADSLIHVFQNISQLM 409 S ++Y N L+ + N++ + Sbjct: 895 SGAKPLEGTDRHYYKANNFLELLDILNNMTLKI 927 >gi|58429521|gb|AAW78164.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429511|gb|AAW78159.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|293347920|ref|XP_001064219.2| PREDICTED: vitrin-like [Rattus norvegicus] Length = 648 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 53/193 (27%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 472 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRIGAVQYTYEQRLEFGFDKYNSKADVLSA 531 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 532 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMILITDGRS 575 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V D+L Sbjct: 576 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLY 628 Query: 400 HVFQNISQLMVHR 412 I + + Sbjct: 629 KFVPRIIRNICTE 641 >gi|291223809|ref|XP_002731900.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 992 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 32/190 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD 285 ++ L+ A F+ + + E Y+G++ ++ ++ + R+ + R + Sbjct: 337 RLTKLRQAASAFIRNT-----ICEGSYLGIVEFSEFAQELAPLTLVNGSDSREGLIRRLP 391 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + + ++L+++ + P I+ ++DG N Sbjct: 392 HSVGGWTSIGAGIMKGIEVLSTNGQN----------------PEGGLIMAISDGGENRAP 435 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIHVF 402 + ++ D E+ + I TI+ + + L + + D L F Sbjct: 436 TLSEALQAVD---ESGVTIDTIAYSEQADEN-LASLAARTGGMSFFYSGDDDSTVLEDAF 491 Query: 403 QNISQLMVHR 412 + + R Sbjct: 492 ---ATSITSR 498 >gi|198426242|ref|XP_002124410.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 402 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 53/179 (29%), Gaps = 30/179 (16%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-----EPSWGTEKVRQYVTRDMDSLIL 289 + + + + + +G Y V+ + + + Q + Sbjct: 231 IEFVVSMLTQFVVNESSLRVGAFRYNRAVDSDTQILLNGFTNDKTGLVQAIQDIP--YRG 288 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A++ A +L + +TDG + + + Sbjct: 289 SGTRTGNAIRHAKDVL-------------LLPENGNRPNVTDLVFVVTDGRSQDAVA--- 332 Query: 350 TIKICDKAKENFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQN 404 ++ + + I++ ++ ++L SP+ + V D L VF + Sbjct: 333 --EVARELRATGAVTFVIAVIIQGSTIERDQML-DIAGSPDRLFEVTGGFDDLDSVFAD 388 >gi|62733745|gb|AAX95854.1| hypothetical protein LOC_Os11g11610 [Oryza sativa Japonica Group] gi|77549354|gb|ABA92151.1| hypothetical protein LOC_Os11g11610 [Oryza sativa Japonica Group] Length = 182 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 4/53 (7%), Positives = 14/53 (26%), Gaps = 3/53 (5%) Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + T + + + + N ++ F +Q + + Sbjct: 89 PVHTFGFGKDHDAVAMHTIAEVTGGTFSFIENEAAIQDGF---AQCIAPLRRR 138 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/217 (11%), Positives = 60/217 (27%), Gaps = 54/217 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + D K +++ K L+ + +GLI ++ S Sbjct: 99 DLSRSMLAPDVKPSRLDRAKLLTQSLLEKL-------SGERVGLIVFSGTAFLQSPLSSD 151 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E +R+++ L T+ + A + + Sbjct: 152 YEILREFLPALDPTFLPEGGTNYDALINTALTAFGATGAAD------------------R 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 F+I L+DGE + + K I+++ + + + Sbjct: 194 FLIILSDGEATEDDWRSHV----AELKNRGIRVIALGVGTTAGAMIPDGAGGLVKDERGA 249 Query: 374 ------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 L + ++ + + L V Sbjct: 250 VVLSKLESNTLRELASATSGVYEDASTWVDLAAVLAA 286 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/373 (10%), Positives = 87/373 (23%), Gaps = 51/373 (13%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + +LA S + S R + + Sbjct: 200 QADRTTEQPLLAMPSSANTATSAKHVRPALDVKMQVNIDAGFELTSLDSLYHPIKQSNVG 259 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ---- 167 Y+ + R + N FY + Sbjct: 260 NHYSVNFAGKQIADRDFVLQWQANVGAVPKAATFYQTGKTH----LADNSDERSETAQRQ 315 Query: 168 -----RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 +V +P + + + LV+D SGSM Sbjct: 316 PNPVDNNMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSM------------------- 356 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + K AL L + + + + + S + + + Sbjct: 357 -SGQSITQAKQALQFALAGLRDIDSFNIIEFNSDVTMLS----ATPLSANSRNIGKANRF 411 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ A++ A + + +IF+TDG Sbjct: 412 IQSLDADGGTEMRSALQTAL----------VDSVQQDSDQTDAHSEMLRQVIFMTDGAVG 461 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 N I D+ ++ ++ T+ I ++PN + + + N + Sbjct: 462 NEHELYQLIN--DQLGDS--RLFTVGIGSAPNSDFMRRAATMGRGTFTYIGNESEVQQKI 517 Query: 403 QNISQLMVHRKYS 415 + + + + Sbjct: 518 EQLLNKIEQPVLT 530 >gi|323693762|ref|ZP_08107958.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] gi|323502183|gb|EGB18049.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] Length = 1560 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/300 (9%), Positives = 70/300 (23%), Gaps = 41/300 (13%) Query: 136 RLDSSNNTI-FYNMDVMTSYDYRLQFIEHLL--NQRYNQKIVSFIPALLRIEMG----ER 188 L+ + ++I + + + + + + + + Sbjct: 586 TLEVTGDSIQTTIGGGTADIVFVIDKSSSMNQWDYDLGGNRWTILKETVDRFINKLKITS 645 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I + S + + D + + A KN L L Sbjct: 646 PNSKISFIEYQSSDLTNYTRTGTYDEIRNVAMANTE----NADKNLEGYSLKYFRTLQRW 701 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +G Y + N + A+ + + + Sbjct: 702 TAISEVGSKPYGSAPGGNQG----------------THSAGGYLGAERALDR-LKKYNPE 744 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVT 366 + S + G N + I + K+ I T Sbjct: 745 EYNSNVKYVIYLADGTAGFYVDSYGYRDGAGSGGNTNARRAAITQSGELKKKHPDATIYT 804 Query: 367 ISINASPNGQ-RLLKTCVSSPE----------YHYNVVNADSLIHVFQNISQLMVHRKYS 415 ++ + + +K + Y+ N + L F ++++ + + Sbjct: 805 VAFGSDSSANMNWMKPGAYNGNITNPYNPNVTAFYSATNTEELEKTFDSLAEQVGSSAVT 864 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQIEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 71/259 (27%), Gaps = 46/259 (17%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L ++ Q+ Q + L I D + + + Sbjct: 357 VIFVLDVSGSMVGQKLPQVKEAMGQILSEI-----------HSEDFFTLILFSDFAQVWT 405 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +N+ + K++ K + LD++ + Sbjct: 406 INATQETSNHWDEKVSNWKTNNNISLDTLGEN------------------RFVFPAT--E 445 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V+ D T+ A+ +A K + + I Sbjct: 446 QNVQYAKKFIQDLQSESSTNMEDALNKA----------HLIAKLGETRFKDGANTPKPII 495 Query: 334 IFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH--- 389 +FLTDGE IK E I ++ + L K +++ + Sbjct: 496 VFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSLGFGEGADIDFLKKLSLNNTGFARVI 555 Query: 390 YNVVNAD-SLIHVFQNISQ 407 Y +A L + ++ IS Sbjct: 556 YEASDASLQLRNFYKEISS 574 >gi|260810969|ref|XP_002600195.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] gi|229285481|gb|EEN56207.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] Length = 323 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 67/187 (35%), Gaps = 31/187 (16%) Query: 228 MAALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYV---- 280 K ++ LD I + + L+ + + K + + ++++ + Sbjct: 132 FQKAKASIATVLDYICPGIGLYSPYHQVALMTFHSTPTKQFDFNDHGSYAELKEAILAVP 191 Query: 281 -----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T ++ TD+ A+ A + + + + ++ Sbjct: 192 YEVLMTNFVNKQGGPRTDTHEALDYARTTMFTSRTGL-------------RPGSLREVLL 238 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVV 393 LTDG+ N + T++ ++ + + I + + I + + L+ VS PE + +++ Sbjct: 239 LTDGQPN---EDDLTVQAAERLRNSGITVFALGIADGVDNEH-LEQLVSDPEYKHIFHLN 294 Query: 394 NADSLIH 400 + L Sbjct: 295 TFEDLAD 301 >gi|90021002|ref|YP_526829.1| hypothetical protein Sde_1355 [Saccharophagus degradans 2-40] gi|89950602|gb|ABD80617.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 787 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 77/259 (29%), Gaps = 52/259 (20%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 L Q + ++ + LV+D+SGSM ++ Sbjct: 11 LVLMLGLAASAGAQPELNLAQASQEIESKLADRSGKPADVRLVIDVSGSMKRNDPANLR- 69 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + A+ L + + E G+ + V + Sbjct: 70 ------------------QPAVDLLMQLL------PEGSKAGVWTFGKWVNMLVPHQVVD 105 Query: 274 EKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E+ R + + T+ A+++A L + S + K Sbjct: 106 EQWRSLGRAKASEINSVGLYTNIGEALEKAAYDLDA-----------------ASDEYAK 148 Query: 332 FIIFLTDG--------ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 II LTDG + N + ++ K K I T++++ + + L K + Sbjct: 149 HIILLTDGMVDIDKQPDKNTQEWRRIVDEVLPKLKAAGYTIHTVALSDNADNNLLKKLSL 208 Query: 384 SSPEYHYNVVNADSLIHVF 402 + AD L+ +F Sbjct: 209 QTDGIASVAHTADDLMKIF 227 >gi|58429463|gb|AAW78135.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D DV C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDVYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|271968449|ref|YP_003342645.1| von Willebrand factor type A domain-containing protein [Streptosporangium roseum DSM 43021] gi|270511624|gb|ACZ89902.1| von Willebrand factor type A domain protein [Streptosporangium roseum DSM 43021] Length = 490 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/256 (13%), Positives = 78/256 (30%), Gaps = 49/256 (19%) Query: 160 FIEHLLNQR---YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 F H+ R ++ + E R + VVD+SGSM Sbjct: 114 FTVHMDGARMPENGTALIRVGLQTRKAEPEARRPANLTFVVDVSGSM------------- 160 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + ++ ++ AL +D + V ++ ++T+ + + T + Sbjct: 161 ------GEPGRLDLVREALHKLVDQLGPGDQVS------IVAFSTQARLVLSMTPATGRD 208 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 Q + T+ + Y + + +I L Sbjct: 209 -QLHAAIDRLGVEDSTNLETGLTAGYA----------------EAARAFRPAATNRVILL 251 Query: 337 TDGENNNFKSN--VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +DG N + ++ + A I ++ + + Q + + + V + Sbjct: 252 SDGLANTGDTTWQGILDRVAESA-GRQITLLCVGVGRDYGDQLMEQLADNGDGAAVYVSS 310 Query: 395 ADSLIHVF-QNISQLM 409 AD VF + ++ + Sbjct: 311 ADDARKVFVEQLATNL 326 >gi|58429545|gb|AAW78176.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-----DSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALSIIKSLLSTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429507|gb|AAW78157.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRNLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSNGK 211 >gi|290957869|ref|YP_003489051.1| hypothetical protein SCAB_34031 [Streptomyces scabiei 87.22] gi|260647395|emb|CBG70500.1| putative membrane protein [Streptomyces scabiei 87.22] Length = 534 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 45/361 (12%), Positives = 105/361 (29%), Gaps = 37/361 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK------NHIKESLSGYSAV 112 A+ L+GA +++ A + L + ++ + L+ S + Sbjct: 193 ASGLSGAQSALTDADVARATPRLKRFFAGQKLTSGSSGWLATAYDRRGDVDALLNYESVL 252 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 ++ I + + L S+ + ++ +T + + + Sbjct: 253 KSRPDLTVIRPRDGVVTADYPLSSLASTGTDVRDDVRRLTDALRTPDVQRLITERTLRRP 312 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KM 228 +V+ +P ++ R ++ + A +D + D + ++ Sbjct: 313 VVASVPPAAGLDTTRRRELPFPGSRSVAVGLLDAYENDLRRPSRTVYVLDTSGSMEGDRL 372 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 LK AL V + + VR +V R D Sbjct: 373 DRLKTALTELTGDFRDREEV----------------TLMPFGSDVKSVRTHVVRPADPKA 416 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ--GVKIPSLPFQKFIIFLTDGENNNFKS 346 ++ + L++ + + +T+ R + I+ +TDGEN S Sbjct: 417 G-----LDGIRADTRKLSAAGETAIYTSLRRAYEHLGAVDRDTFTSIVLMTDGENTEGAS 471 Query: 347 NVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + I + I + L + ++ SL F+ Sbjct: 472 PADFDDFYGRLPDAARHIPVFPILFG-DSDRDELEHIAEVTGGRLFDA-TRGSLDGAFEE 529 Query: 405 I 405 I Sbjct: 530 I 530 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 16/161 (9%), Positives = 51/161 (31%), Gaps = 11/161 (6%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R R +G + ++ + +++ A+ VM ++++G I V + ++++ +++ + A +A Sbjct: 1 MTRQRDAGWRGKSRQRGSVAVMAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIA 59 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI----------KNHIKESLSGYSAVF 113 M S+ + A D + + Sbjct: 60 SVESMTDQCSQQNSPAMMAAQSNALANGFDYRASGQTLSIECGRWDTSATPYFNSTFTPL 119 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + I + + ++ + Sbjct: 120 NAVSVSVTQQVPYIFLGRFFGKSGSTGATVAAFSTAKAINI 160 >gi|149437043|ref|XP_001515962.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 948 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 313 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDQFSVI 352 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + + T ++Y+ + T+ A+ +A IL Sbjct: 353 DFNHNV-RSWKDNLVPATDLMTTDAKKYIEKI---QPNGGTNINEALLRAIFILREASN- 407 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K V + ++ I + ++ I Sbjct: 408 ----------LGMLDPNSVSLIILVSDGDPTVGELKPTVIQKNVKKNMRD-NISLFSLGI 456 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 457 GFDVDYDFLERLSRENHGMAQRIYGNQDTSSQLKQFYNQVST 498 >gi|88857994|ref|ZP_01132636.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] gi|88819611|gb|EAR29424.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] Length = 974 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/285 (11%), Positives = 66/285 (23%), Gaps = 66/285 (23%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + +V D SGSM ++ + N +++ + A+ ++ Sbjct: 48 NVQTTEKPRVMIVFDTSGSMESSVATGEYCYNRKGNQVYCADSRIKVAQAAMKALVEQNT 107 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + G GY + +T+ T + ++Y Sbjct: 108 DIEFGLMRFNGGSGGYI--------LAGLGSSKEAILTKIESLRAGGSTPLAETLYESYL 159 Query: 304 ILTSDKKRSFFT-----NFFRQGVKIPSLPFQK---------------FIIFLTDGENNN 343 LT S K +I +TDG+ Sbjct: 160 YLTGGGVNYAKNIADRDKDIEDNSSYDSPFKPKKDDSGLDILRCDNSINMIIMTDGDPTE 219 Query: 344 ---------------------FKSNVNTIKICDKAKENF-----------IKIVTISINA 371 KS + + TI + Sbjct: 220 DGGQNGNIKSLYNSKYGAYPKSKSGSYLNSLAKYMLNVDLFPTTSGVTDIARTFTIGFGS 279 Query: 372 S--PNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +G LLK S + A+ L + + + + Sbjct: 280 GMSDDGLDLLKQTASDGGGEYLLASTAEQLTEALK---KTITKIR 321 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/317 (11%), Positives = 81/317 (25%), Gaps = 43/317 (13%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + + + + + + N Sbjct: 1925 NQDGQDLAPGSVTGSDPSATSETASGTLVGSVSGATGAVSFTLVGNATGAYGQLLLHPDG 1984 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA------LLRI 183 ++ ++ + + S + Q + L N +VS + R Sbjct: 1985 SYTYTLTSPANTTPHANDGPNVLSESFTYQATDSLGNSTTGSLVVSIVDDVPKAVASERS 2044 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +V+D+SGSM +++ K A+ LD D Sbjct: 2045 VTAVEIDSNLLIVLDVSGSMADDS-------------GVPGLSRLDLAKQAISALLDKYD 2091 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L VK + L+ +++ + ++ T+ A+ A Sbjct: 2092 DLGDVK----VQLVTFSSSATDQTSVWVDVATAKSLLSSLS---ADGGTNYDAAVATAKT 2144 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC--DKAKENF 361 + + Q F +DG+ N+ + N Sbjct: 2145 AFVTSGQL---------------TGAQNIGYFFSDGKPNSGLETGTADEAAWKAFLDANG 2189 Query: 362 IKIVTISINASPNGQRL 378 IK I + + L Sbjct: 2190 IKNYAIGLGDGVSNDYL 2206 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/386 (10%), Positives = 109/386 (28%), Gaps = 51/386 (13%) Query: 39 FLIYVL--DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKR 96 I +K + + A L A + S + E++ A Sbjct: 217 GAIVYSSERKQRQKIKLGEHKDMASLMKAVDNLPFFSGITATGEALK-FAATHTEGRRDN 275 Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 N++ + + E + + A + + S Sbjct: 276 LTLNYVVLTDGY---SYDIIESGARLLREVPNSVVYAVTIGEIYLRKELELITGNKSNVM 332 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL-------VVDLSGSMHCAMNS 209 + ++I + ++ + + S+ N Sbjct: 333 I----GSMSYGTVVKRIKNCEARARAQQLRDENPVELVHPGEFLSDAFSHRQSVQTNENI 388 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--------VYMGLIGYTT 261 ++ + + + + + + + DS L + + ++ + Sbjct: 389 KKDEPAKDSVTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQTQLQIANKLFQVAIVQFAG 448 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + + K + + + ++ T + A+K+ + + K+ Sbjct: 449 KSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKR--------- 499 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ--- 376 K ++F + S +T + + K I + T+ I+ N Sbjct: 500 ------DNCKMKLLVF------TDGYSAEDTAEGIEALKRQGITVYTVGISTDKNAGLNV 547 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVF 402 LK +SP ++++ + D+L+ F Sbjct: 548 SELKGMATSPSHYFDSSDFDNLLKHF 573 >gi|256004503|ref|ZP_05429482.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|255991508|gb|EEU01611.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|316940035|gb|ADU74069.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 1313] Length = 1050 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 60/213 (28%), Gaps = 32/213 (15%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK------NIEPSWGTEKV 276 K T + +K+A F+D M +I Y+ ++ + Sbjct: 721 DKSTYLKLVKDAAKNFVDKFA-----GSKTKMAVIQYSDSANDNDFKKYDLSLPDKGAAL 775 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK------------RSFFTNFFRQGVKI 324 ++ + + ++ M++AY IL S + K Sbjct: 776 KETIDKIKPG-TSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGSVPNRWTAVDNKK 834 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 +N ++ S + I +V I + G L + Sbjct: 835 NEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEEDIGDVLEEIAAE 894 Query: 385 SPE--------YHYNVVNADSLIHVFQNISQLM 409 S ++Y N L+ + N++ + Sbjct: 895 SGAKPLEGTDRHYYKANNFLELLDILNNMTLKI 927 >gi|145540134|ref|XP_001455757.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423565|emb|CAK88360.1| unnamed protein product [Paramecium tetraurelia] Length = 522 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 53/191 (27%), Gaps = 25/191 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-Q 278 KM +K +L L + + LI + + + T++ + Sbjct: 121 SGSMSGEKMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYRLTRLMRATQENMYK 174 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ TD AMK A IL + ++ + I L+D Sbjct: 175 FLIAIDTIEANGATDIGNAMKMALSILKHRRFKNPIAS----------------IFLLSD 218 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE+ V KE I T + + + Y + + Sbjct: 219 GEDEGAAGRVWNDIQSKNIKEP-FTINTFGFGRDCCPKIMSEIAHFKEGQFYYISEISKI 277 Query: 399 IH-VFQNISQL 408 F+ + Sbjct: 278 DECFFEALGGE 288 >gi|116670917|ref|YP_831850.1| von Willebrand factor, type A [Arthrobacter sp. FB24] gi|116611026|gb|ABK03750.1| von Willebrand factor, type A [Arthrobacter sp. FB24] Length = 340 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 12/195 (6%), Positives = 55/195 (28%), Gaps = 19/195 (9%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-- 289 +A ++ L+ + +GL + + + + + V+ + + Sbjct: 107 SSADAAVVEVFARLAAGFDGERLGLTVFDSSAVQVFPLTDDYDVVQGQLEAARKAFDGAP 166 Query: 290 ------KPTDSTPA---MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + + + + + ++ TD Sbjct: 167 GSAAFLDGTWNGAGSSLIGDGLASCVQGFPSNGGDTGTGEQAGSGREERSRSVVLATD-N 225 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTIS---INASPNGQ----RLLKTCVSSPEYHYNVV 393 + + + AK+ +++ ++ + + +L + +Y + Sbjct: 226 FISGEPIFTLQEAAALAKKQDVRVYALNPGDFDYGTDPDQPGVQLRTAVEGTGGAYYPLD 285 Query: 394 NADSLIHVFQNISQL 408 + +++ + + + Sbjct: 286 SPEAVGEIIRRVQST 300 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|260813586|ref|XP_002601498.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] gi|229286795|gb|EEN57510.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] Length = 384 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 54/181 (29%), Gaps = 23/181 (12%) Query: 235 LLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F +I +G+I Y+TR + + K D + T Sbjct: 27 VKQFTKKTISGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTL 86 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + K +I +TDG + + + Sbjct: 87 TGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVAAPALE-- 131 Query: 354 CDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 A++ I + I + + L+ S+ V N + L ++ + + Sbjct: 132 ---AQQKGITVYAIGVSGYDADQ---LEQIASNNNTLAFVDNFNLLDNLRNTLLTGVCDA 185 Query: 413 K 413 K Sbjct: 186 K 186 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 44/152 (28%), Gaps = 23/152 (15%) Query: 235 LLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F +I +G+I Y+TR + + K D + T Sbjct: 254 VKQFTKKTISGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTL 313 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + K +I +TDG + + + Sbjct: 314 TGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVAAPALE-- 358 Query: 354 CDKAKENFIKIVTISI-NASPNGQRLLKTCVS 384 A++ I + I + + L+ S Sbjct: 359 ---AQQKGITVYAIGVSGYDADQ---LEQIAS 384 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 43/181 (23%), Gaps = 29/181 (16%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRV---EKNIEPSWGTEKVRQYVTRDMDSLILK 290 F+ + + V G I + T + + V R Sbjct: 265 TTADFIGETGNANDPIDLVIQGAINFPTNYVSNNPSTPLTPNYPTHSSKVGRRNLPES-- 322 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 K Y L++ + P + + Sbjct: 323 --------KFDYSNLSARITFDGVAGALVYEPRFPHPYY-------------YYFPCNAA 361 Query: 351 IKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I AK + I TI + LK + + + +L F NI+Q Sbjct: 362 INEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTA-TPANLAAAFDNIAQT 420 Query: 409 M 409 + Sbjct: 421 I 421 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 60/186 (32%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y + V+ V T++ Sbjct: 160 FLIEILTKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYKSVKDVVEAASHIEQRGGTETRT 218 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 219 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIRQ 262 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 263 SEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 322 Query: 407 QLMVHR 412 + Sbjct: 323 DRIFSL 328 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 61/191 (31%), Gaps = 33/191 (17%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A F+ ++ ++E L+ ++ V + + + ++ + Sbjct: 97 KEAGKKFVRAL-----LREQDEFDLMDFSDTVREVVSFTNDK---KRIENGLNELRKGDA 148 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ A Q L ++ ++ +TDG+N Sbjct: 149 TAVYDAVYLASQRLGETNAGGG---------------RRRVLVLITDGDNTVHGV--GYD 191 Query: 352 KICDKAKENFIKIV---TISINASPNGQR-----LLKTCVSSPEYHYNVVNADSLIHVFQ 403 + ++A+ + + + I A L++ + +Y V + L V+ Sbjct: 192 QAVEQAQRAGVMVYALIVVPIEADAGRNTGGEHALIQMATDTGGNYYYVNDPRDLAKVYA 251 Query: 404 NISQLMVHRKY 414 +S + + Sbjct: 252 KVSDDLRTQYV 262 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y + V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDAVHEFHL-NDYKSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLKKVIRQ 286 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++ + + +++ N + L K S P+ + +NV + +L + + Sbjct: 287 SERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|262198293|ref|YP_003269502.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262081640|gb|ACY17609.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 419 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 62/226 (27%), Gaps = 44/226 (19%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 E R + LV+D S SM ++A+ A Sbjct: 23 EAQATESSARMPVNLALVIDRSSSMRGP--------------------RLASAIVAARQV 62 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ +D + +I + S E + T+ Sbjct: 63 VEQLDERD------RLSVIAFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAG 116 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDK 356 MK+ + + S R ++ LTDG+ + N + K Sbjct: 117 MKKGAEAVRSGFVRG----------------ALSRLVLLTDGQPSLGITDNDRLCALAQK 160 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + I T+ + + + L S + + +A + F Sbjct: 161 EADRGVTITTMGLGQGFDDELLADLAHSGRGGFHYLASAADIPGAF 206 >gi|114594052|ref|XP_001144775.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pan troglodytes] Length = 386 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 52/171 (30%), Gaps = 21/171 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ + V ++ + I ++++ + + K+ + + + T Sbjct: 62 IYNFVQQLAER-FVSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYI 120 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K A + + K L II LTDG + K Sbjct: 121 HEGLKLANEQIQ----------------KAGGLKTSSIIIALTDG-KLDGLVPSYAEKEA 163 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQN 404 ++ + + + Q L+ S E + V +L + + Sbjct: 164 KISRSLGASVYCVGVLD--FEQAQLERIADSKEQVFPVKGGFQALKGIINS 212 >gi|90419742|ref|ZP_01227651.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335783|gb|EAS49531.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 448 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/265 (13%), Positives = 79/265 (29%), Gaps = 16/265 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 AS + N + I AL + ++G ++ + H + ++SA ++A LA A Sbjct: 1 MTQRLNAFAASTQGNVATIVALLAPLLIFIMGSVVNLSTAHSAHSRLQSAADSAALAAAR 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ SR + A L + A+ + S +A + V ++ Sbjct: 61 ELYMANSRPEVLKSVAYSFAMTNLGNQAEGVSIDVKIGGRSDATAAESAISTEVTVTLNK 120 Query: 127 ISMTHMANNRLD-----SSNNTIFYNMDV--------MTSYDYRLQFIEHLLNQRYNQKI 173 T + L + + + + Q + + Sbjct: 121 DFGTSLPMPDLTGAIGALTASATARIAGGGRICMIGLTGEGKRAIDISGNAQVQAADCAV 180 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 S + + + EL G N +P + P D +++ + Sbjct: 181 YSNSIDPSGLSVTKVAKLSSELACSSGGYDGIESNYEPLPLTDCPAVSDPLSSRIPPV-- 238 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIG 258 +S+ + ++ G Sbjct: 239 -PTKCDHKNQKISNNNQILWPGTYC 262 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/336 (8%), Positives = 86/336 (25%), Gaps = 26/336 (7%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ + + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLTGKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLN--QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + + + + N + + + E I ++ V ++S + Sbjct: 191 PIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVN 250 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + ++ V G + Sbjct: 251 LTTFSTTGSYKQAAFSQ--------LDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL 302 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +++ + +YV D + + ++ + F Sbjct: 303 QKQHAQL--------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELSSN 354 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIVTISI-NASP---NGQR 377 S + + T + ++++ KA + ++ I GQ Sbjct: 355 VPASFNYGRLGYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFSGLEHEIAYGQN 414 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 L + + N ++L F +I + + Sbjct: 415 LTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 41/359 (11%), Positives = 109/359 (30%), Gaps = 49/359 (13%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHY-------------KKNSMESANNAAILAGASKMVS 70 ++ A+ +MS + LI +I+ + + SM+ + + S + Sbjct: 14 MVMAIMLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFIL 72 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHI------KESLSGYSAVFYNTEIQNIVNS 124 + S+ + ++ + + + N++ +SG ++ Sbjct: 73 DDSKFKGSKQGLTREWSYIGLSADGKKVMNYVWNKQKQDWDVSGLGTKSLYNMKLDLEFK 132 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + N + + + ++ K I Sbjct: 133 TEGAYQDNRLISYNLTGKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPI 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ I + V D+SGSM A+N N+ ++M L++ + ++ + Sbjct: 193 QGQMNIA-VSFVFDISGSMKGALNGANPTSNN--------PSRMDILRDKAEIMINELQS 243 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ + + + +++ + T+ ++ Sbjct: 244 VGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKS--DGGVTNPGDGLRYGMVS 301 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L K+++ LTDG N + N + K I+ Sbjct: 302 LQK------------------QHAQLKYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|293359740|ref|XP_233802.5| PREDICTED: vitrin [Rattus norvegicus] Length = 648 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 53/193 (27%), Gaps = 25/193 (12%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT K Sbjct: 472 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRIGAVQYTYEQRLEFGFDKYNSKADVLSA 531 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A++ A + L K +K +I +TDG + Sbjct: 532 IRRVGYWSGGTSTGAAIQYALEQLF----------------KKSKPNKRKVMILITDGRS 575 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + +V + K + I I + Q L+ + P ++ + V D+L Sbjct: 576 YD---DVRIPAMAAYQK--GVITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLY 628 Query: 400 HVFQNISQLMVHR 412 I + + Sbjct: 629 KFVPRIIRNICTE 641 >gi|119510959|ref|ZP_01630081.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464398|gb|EAW45313.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 464 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 67/234 (28%), Gaps = 33/234 (14%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFL-IELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + QK+ + ++ P V+D SGSM+ + D Sbjct: 15 FMPAETAGQKLFLMLKLRPTKDVAAIPPSTTFTFVIDTSGSMYEIVAGDTTPTGKTYTVD 74 Query: 222 DKKRT-------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 K+ T K+ + +LL + + + ++ + + I + TE Sbjct: 75 GKEYTQVTGGKSKIDIVSESLLALIR----SGRLGASDRIAIVQFDDTASQIIGLTSATE 130 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + T +++A ++L + + + + Sbjct: 131 INKLEDAIAQLRTFSGGTRMGLGLRRALEMLNNQQMTV------------------RRTL 172 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 TDG+ + I N I I + + N L S+ Sbjct: 173 LFTDGQT---FDDDQCRAIASDFATNNIPITALGVGEDFNEDLLTHLSDSTGGT 223 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 54/187 (28%), Gaps = 27/187 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 SA + + +K K A I ++++ +G++ + ++ Sbjct: 399 DQSASMGPETGISKFTMAKEAA------IMATESLRQEDRIGVLAFDVSTRWVVDFQPVG 452 Query: 274 E--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + R + TD A+++ + Sbjct: 453 VGLSLADVQRRISTLPLGGGTDIYNALQEGL------------------PALAQQPGRVR 494 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + LTDG + + ++A+ I + TI+I + L + ++ Sbjct: 495 HAVLLTDGR-SFTDDRQAYRMLLEEARSQNITLSTIAIGTDADINLLQELARWGAGRYHY 553 Query: 392 VVNADSL 398 + + Sbjct: 554 AAEPNDI 560 >gi|17231851|ref|NP_488399.1| hypothetical protein alr4359 [Nostoc sp. PCC 7120] gi|17133495|dbj|BAB76058.1| alr4359 [Nostoc sp. PCC 7120] Length = 418 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 22/199 (11%), Positives = 54/199 (27%), Gaps = 25/199 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +K A + +D + + ++ + R + + + + Q Sbjct: 51 SGSMNGRPLEIVKQAAIRLVDRLKTGD------RLSVVAFDHRAKVLVP-NQVIDNPEQI 103 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T ++ + L KK + F LTDG Sbjct: 104 KKQISRLAADGGTAIDEGLRLGIEELAKGKKETISQAF-----------------LLTDG 146 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 EN + ++ A + + T+ + N L K + + + + Sbjct: 147 ENEHGDNSRCLK-FAQLAAGYSLTLNTLGFGDNWNQDILEKIADAGLGSLSYIQKPEQAV 205 Query: 400 HVFQNISQLMVHRKYSVIL 418 F + + + Sbjct: 206 DEFGRLFSRIQTVGLTNAY 224 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 79/255 (30%), Gaps = 48/255 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 NQ + I+M E P+ + ++D+SGSM Sbjct: 185 PWNSDNQLMF-VGMKTGEIDMEEAPVSNLVFLIDVSGSM-------------------SS 224 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 R K+ L+ + +DS+ ++ + ++ Y+ + + + K + Sbjct: 225 RNKLPLLQKSFDELVDSL------PDEGTISIVTYSGEEKVVLSGEPMSNK-KGIKKAID 277 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ M++AY+I K +I TDG+ N Sbjct: 278 KLHANGCTNGQAGMQKAYEI----------------AQKYFIEGGNNRVIMATDGDLNVG 321 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVF 402 S+ + K K+ + + + ++T + + + + V Sbjct: 322 ISDLDDLEKFITDKKDEGVFLSILGFGEGNYKDDKMQTLADCGNGNYSYIDSLEEGKKV- 380 Query: 403 QNISQLMVHRKYSVI 417 + M Y+V Sbjct: 381 --LVDEMSSTLYTVA 393 >gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia nassauensis DSM 44728] Length = 831 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 41/294 (13%), Positives = 89/294 (30%), Gaps = 58/294 (19%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQF--IEHLLNQ----------RYNQKIVSFI 177 + + + I V + R + + ++ I Sbjct: 234 AVTVDETGEVTRVRIEPGERVNRDFILRFDYGESGDVAGSLLTAPDENEPTSGTFQLTAI 293 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P R + +++D SGSM KM A + A Sbjct: 294 PPSDLPRARPR---DVVVLLDRSGSMGGW--------------------KMVAARRAAAR 330 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTP 296 +D++ + + ++ N S GT++ R + V + TD Sbjct: 331 IVDTLSSADRFAVRCFDTAMTSPEGLDPN-GLSAGTDRNRFRAVEHLAGTETRGGTDILK 389 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + A +LT+ +K + II +TDG+ + ++ + Sbjct: 390 PLSTAVDLLTAGEKGR-----------------DRVIILVTDGQ--VGNEDQILRELTGR 430 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +++ + I+ + N L + + V + D L +I + +V Sbjct: 431 L--SGMRVHVVGIDKAVNAGFLHRLALVGRGRCELVESEDRLDEATAHIHRRIV 482 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 31/359 (8%), Positives = 79/359 (22%), Gaps = 37/359 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 S + + + F + ++ ++ S+ Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYT-RQTETSTAKLDFSLSFSQPEY 167 Query: 130 THMANNRLD-------SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI---VSFIPA 179 + + + + + + + + + Sbjct: 168 AKPPDGDAQGRLDVTLIPQGGVPAPVRPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNY 227 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----KNAL 235 LI D+ + +V S + ++ A +A Sbjct: 228 FKTNYHPNDRFALIPFSDDVKATSVVP-FGSKSNVISQLDAILDEGNRLTANGGTNYSAA 286 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---------WGTEKVRQYVTRDMDS 286 L S K+ + G T + + EK+ + Sbjct: 287 LSLAQSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGL 346 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + K Y L + + + + GE K Sbjct: 347 YSDGKMTSISFTDKDGYDRLFYNNHIDYVNGWLFSND-------NGYSFTYAWGE---GK 396 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + + + N I + +I + L+ ++ +L +FQ Sbjct: 397 AYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMDYLRALSTTAGGEVRQGTPQNLTELFQ 455 >gi|150024242|ref|YP_001295068.1| BatB protein [Flavobacterium psychrophilum JIP02/86] gi|149770783|emb|CAL42248.1| BatB protein [Flavobacterium psychrophilum JIP02/86] Length = 346 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 63/183 (34%), Gaps = 45/183 (24%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM +C+D ++ K + ++++ Sbjct: 90 DIVFAIDVSKSM---------------LCEDIAPNRLEKSKQVVSQIINNLGS------- 127 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + ++ D + + ++ A+K + + Sbjct: 128 DRIGIIAYAGSAFPVLPITTDYNVAKMFLQSMTSDMVSSQGSNLDEAIKLSAKYFE---- 183 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 S K +I LTDGE+++ + + K+ +KI+TI I Sbjct: 184 --------------GSPNTSKLMIMLTDGEDHSEGAESAAEEA----KKIGMKIITIGIG 225 Query: 371 ASP 373 + Sbjct: 226 TTA 228 >gi|58429489|gb|AAW78148.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 66/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ + I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|16080726|ref|NP_391554.1| hypothetical protein BSU36730 [Bacillus subtilis subsp. subtilis str. 168] gi|221311632|ref|ZP_03593479.1| hypothetical protein Bsubs1_19856 [Bacillus subtilis subsp. subtilis str. 168] gi|221320871|ref|ZP_03602165.1| hypothetical protein BsubsJ_19720 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325156|ref|ZP_03606450.1| hypothetical protein BsubsS_19886 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313222|ref|YP_004205509.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] gi|8928526|sp|P70961|YWMD_BACSU RecName: Full=Uncharacterized protein ywmD; Flags: Precursor gi|1648854|emb|CAB03681.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636198|emb|CAB15690.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019496|gb|ADV94482.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] Length = 224 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 20/208 (9%), Positives = 55/208 (26%), Gaps = 33/208 (15%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH--------VKEDVYMGLIGYTTRV 263 D + + + + K+ K ++ F + + ++ + G + Sbjct: 36 FDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFGHAGNNKLSGKALSCSTT 95 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + + T A+ + + Sbjct: 96 ETIYGLHPYEGSL--FDNSLSELKPTGWTPIAKALADTRKEFEAF--------------- 138 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT 381 + + +TDGE +K + + + + I N G +K Sbjct: 139 --DADGKNVVYLITDGEETCGGD---PAAEIEKLRASNVDTIVNIIGFNFDVKGNEEMKQ 193 Query: 382 CV-SSPEYHYNVVNADSLIHVFQNISQL 408 + + + +AD ++ +Q Sbjct: 194 AAVAGGGEYISANSADEFEQAWEKEAQK 221 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 57/169 (33%), Gaps = 27/169 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDST 295 + L+ +++ +G++ + + + + + +T ++ T Sbjct: 330 NQAAELYLIQIIEKGSLVGMVTFESLATILNYLTDIIGDNAYEKITANLPREASGGTSIC 389 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q + + + + II LTDGE++ I C Sbjct: 390 SGLRAGFQAIIHSNQSTSGSE----------------IILLTDGEDD-------QISSCF 426 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 ++ K++ I TI++ + L + Y + + L F Sbjct: 427 EEVKQSGAVIHTIALG-PSAARELETLSNMTGGYRFYANKDINGLTDAF 474 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 54.5 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 19/242 (7%), Positives = 56/242 (23%), Gaps = 27/242 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA-------------NNAA 60 + ++ A+ ++ FL + I V + +N +++A + Sbjct: 8 LKKDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESG 67 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK----ESLSGYSAVFYNT 116 A + + ++ + + A + Sbjct: 68 --ASVNPNANVIANNTAIENISEDVPVEVNYNAAANTGDVQRGHWSFAARQFTPNGSLTA 125 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + + N + + + + Q N + +++ Sbjct: 126 IDVGNYTTEDLDNPDPNINGGLINAVKVVVRRQDRPASSFFAQIFGF-ENFGITAEAIAY 184 Query: 177 IPALLRIEMGERP-------IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 I I E +++ + + +M +NS S Sbjct: 185 IGFSGTINPAELDQPIAICEESILDDNGNYNCNMGRMLNSGSNLNTSNTGGWTNFSQPCD 244 Query: 230 AL 231 Sbjct: 245 TA 246 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/358 (8%), Positives = 80/358 (22%), Gaps = 49/358 (13%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ S + D ++I+ +G A + + + + Sbjct: 214 NTNVLDHQGNKNGSNQWDGINSWDGDPNDRTHSYIEYGGTGNQADYAIRKFAKETTTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPALLRIE-M 185 + + + + + + R + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNDRIGEVKIGVDRFVDTLSDS 326 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + V +SD + ++ + + +K+ + + Sbjct: 327 GITDKINMGYVG---------YSSDGYNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFT 377 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 D L + + + G V R + + + Sbjct: 378 QKGLRDAGDMLSVPNGHKKVIVLLTDGVPTFSYKVQRVHAQPS----NDYYGTQFSNTQD 433 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S + + T GE KE I+I Sbjct: 434 QPG-NTSRIARSYYAPDQNNQSRRIDSTFIATIGE-------------AMALKERGIEIH 479 Query: 366 TISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 480 GLGIQLQSDPAAGLSKAEVESRMRQMVSADEKGDLYYESADHATDISEYLAKKAVQIS 537 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A + P + +VD+SGSM K Sbjct: 226 NTDTLLLRIGVAGREVPTAALPAANLVFLVDVSGSM-------------------GAPDK 266 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 267 LPLLQSSLKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTR-IVEAIDSLQ 319 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 320 SGGGTAGASGIELAYKAAQQAYLRGGIN----------------RILLATDGDFNVGVTD 363 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 364 FDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHE 423 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 424 LGSTLATIARDV 435 >gi|17554334|ref|NP_498645.1| MUscle Positioning family member (mup-4) [Caenorhabditis elegans] gi|15789306|gb|AAA28092.5| Muscle positioning protein 4 [Caenorhabditis elegans] Length = 2104 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/177 (10%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 230 ALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSL 287 + +L FL + + ++ Y+ ++ + Sbjct: 452 VFQTEVLRFLAEFTELFDIAPQKTRVSVVQYSDQIRHEFGL-DNYSDRKSLQNAIRNIEY 510 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ S+++ + + I +TDG + + + Sbjct: 511 LTGLTRTGAAIEHVANEAFSERRGARPVG-----------QVSRVAIVITDGRSQDNVTR 559 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + D A+ I++ + + + L+ S + ++V + L ++ Sbjct: 560 PS-----DNARRQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNTRLRS 610 >gi|14579227|gb|AAK69172.1|AF289202_1 transmembrane matrix receptor MUP-4 [Caenorhabditis elegans] Length = 2104 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/177 (10%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 230 ALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSL 287 + +L FL + + ++ Y+ ++ + Sbjct: 452 VFQTEVLRFLAEFTELFDIAPQKTRVSVVQYSDQIRHEFGL-DNYSDRKSLQNAIRNIEY 510 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ S+++ + + I +TDG + + + Sbjct: 511 LTGLTRTGAAIEHVANEAFSERRGARPVG-----------QVSRVAIVITDGRSQDNVTR 559 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + D A+ I++ + + + L+ S + ++V + L ++ Sbjct: 560 PS-----DNARRQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNTRLRS 610 >gi|160727|gb|AAA29778.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 562 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ + ++ ++ I K ++ + +L T+ T A+ Q + L Sbjct: 86 IHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 68/195 (34%), Gaps = 25/195 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A + A F++++ + L+ + R+ + + + DSL Sbjct: 91 LDAERGASYRFIETV----LRPNKDQVFLMQFDFRIFMRQPLT-------NSLRQLSDSL 139 Query: 288 ILKPTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + ++ +L + + + K +I LTDGE ++ Sbjct: 140 PYVDTPTFNQLRAQSGGGTLLYDAVVTASQEVMLNRTGR-------KALILLTDGE--DY 190 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQR-LLKTCVSSPEYHYNVVNADSLIHVFQ 403 S+ + + A+ I +I +G+R L + + + V + +F Sbjct: 191 GSDASVGDAIEAAQRADTLIYSILFADQGDGRRPLQRMSKETGGSFFEVSKKQDIDQIFT 250 Query: 404 NISQLMVHRKYSVIL 418 I + + +YS+ Sbjct: 251 AIQEEL-RSQYSLAY 264 >gi|333030668|ref|ZP_08458729.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741265|gb|EGJ71747.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 342 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/228 (11%), Positives = 73/228 (32%), Gaps = 60/228 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++A K + +D + + +GLI + + + Sbjct: 97 DISNSMLAQDVSPSRLAKSKLLISKLVDEL-------HNDKVGLILFAGDAFTQLPITND 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + +++ + + + T A+ A + TS + Sbjct: 150 FVSAKMFLSSISPNLIERQGTSIGKAVDLATRSFTS------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGE++ + ++ +A + I++ + + + Sbjct: 192 TIILITDGEDHEPGA----LEAVKRAVDAGIQVNVMGVGSPDGAPIPVTERNGSEYRKDN 247 Query: 374 ---------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N + + + + V N ++ I+ + Sbjct: 248 EGNVVVTKLNEEMAQEIAKAGNGLYVRVDNTNAAQKA---ITSEINKL 292 >gi|309777426|ref|ZP_07672384.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] gi|308914822|gb|EFP60604.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] Length = 365 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 8/44 (18%), Positives = 21/44 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 + E + +I + ++ L++ G + V + K ++ SA + Sbjct: 1 MRKEHGSVTIFLTILLVPMLIVAGVFVDVGRYQLSKAAVTSAAD 44 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 56/187 (29%), Gaps = 27/187 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + + +G++ Y V V V T++ Sbjct: 186 SFLISILQKFYIAPGQIQVGVLQYGETVVHEFYL-NNYRSVTDVVEAAKRIEQRGGTETR 244 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + +K +I +TDGE+++ I+ + Sbjct: 245 TALG-------------IEKAVTEAFQRGGRKGAKKVMIVITDGESHDSPDLQRVIESSE 291 Query: 356 KAKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNI 405 ++ I +++ N + L K S P+ + +NV + +L + + Sbjct: 292 ---KDNITRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDAL 348 Query: 406 SQLMVHR 412 + + Sbjct: 349 GERIFSL 355 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 371 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 430 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 431 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 471 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 472 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 527 Query: 406 SQLMV 410 + + Sbjct: 528 IRGIC 532 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/252 (12%), Positives = 74/252 (29%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++ A + P + +VD+SGSM K Sbjct: 205 NTDTLLLRIGVAGREVPTAALPAANLVFLVDVSGSM-------------------GAPDK 245 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+++L L + + + L+ Y + P+ G ++ R V Sbjct: 246 LPLLQSSLKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTR-IVEAIDSLQ 298 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ AY+ R I+ TDG+ N ++ Sbjct: 299 SGGGTAGASGIELAYKAAQQAYLRGGIN----------------RILLATDGDFNVGVTD 342 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN---------ADS 397 + + + + + + T+ L++ + + Y ++ Sbjct: 343 FDQLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHE 402 Query: 398 LIHVFQNISQLM 409 L I++ + Sbjct: 403 LGSTLATIARDV 414 >gi|119386037|ref|YP_917092.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376632|gb|ABL71396.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 855 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 48/181 (26%), Gaps = 11/181 (6%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 ++ +D I+ ++ ++ V + + ++G + Q + Sbjct: 34 ASGSMWGQIDGINKITIARDVVGDIVSDFPADQNLGF-VTYGHRERGQCADIETLVEPAP 92 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A + L + +I ++DG N + Sbjct: 93 GTAAEIAGI--VEGLNPRGMTPMTDAVVTAAQALRHTEQAATVILVSDGIETC---NPDP 147 Query: 351 IKICDKAKENFI--KIVTISINASPNGQRLLK---TCVSSPEYHYNVVNADSLIHVFQNI 405 +E + I + LL+ + NA L + + Sbjct: 148 CAAARALEEAGVDFTAHVIGFDVRGEADALLQMQCIAEETGGRFLTADNAQELNEALREV 207 Query: 406 S 406 + Sbjct: 208 T 208 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 63/192 (32%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + + ++ V Sbjct: 1127 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDDVEIIQGLTRMPGDRDTLKSRVDA 1186 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 1187 IK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 1229 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 1230 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 1287 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 1288 SRDAEEVISQTI 1299 >gi|153008054|ref|YP_001369269.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] gi|151559942|gb|ABS13440.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] Length = 187 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 16/140 (11%), Positives = 42/140 (30%), Gaps = 8/140 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ ++ ++ FAL ++ FL++I I + + + +A A ++ + Sbjct: 17 LRRFARAQHGVAAVEFALLIVPFLIIIFATIEIGMSFVARQVISNATET----VARRLQT 72 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R E + ++ +L Y I Sbjct: 73 GQIRGAQISEGTLRTELCQQMQFMVAQGCPNLSLNLGTYEGFDKVP----IDQILDGEGK 128 Query: 131 HMANNRLDSSNNTIFYNMDV 150 + +S + ++V Sbjct: 129 LTRTGIIGTSGTSTINQLNV 148 >gi|126728411|ref|ZP_01744227.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] gi|126711376|gb|EBA10426.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] Length = 219 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 53/189 (28%), Gaps = 34/189 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 ++ + A+ L I +L + V G G T ++ + + ++ Sbjct: 22 RIVQARQAMRQVLPDIAVLRRLG-LVIYGPGGDRTCRNVDLRLTPQWQADAPIISEIEGL 80 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T T ++ A + L ++ +TDG+ + Sbjct: 81 RPAGGTALTDGVRLAAETLDYRN-------------------VPGAVVLVTDGKETCGGT 121 Query: 347 NVNTIKICDKAKEN-FIKIVTISINA----------SPNGQRLLK-TCVSSPEYHYNVVN 394 + A+E + + I G+ + + + + + Sbjct: 122 PCQL--AAEFAREAPGLTVHVIGFKVRGDHWDWSTPDAPGESVARCLADDTGGQYLSAET 179 Query: 395 ADSLIHVFQ 403 D L+ + Sbjct: 180 VDELVGALR 188 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 64/201 (31%), Gaps = 30/201 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ +K L +D + + +G++ Y +RV+ + K + Sbjct: 45 SSRSVRPSEFEQVKVFLAKVIDGLS---VGPDATRVGVVNYASRVKNEVSLKS--HKTKA 99 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T + + + + F S K I +TD Sbjct: 100 ALVKAVSKIEPLSTGTMTGLAIQFAM---------NVAFSEAEGGRKSPDISKVAIIVTD 150 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + N I +A+E I+I I + L+ S P ++ V + Sbjct: 151 GRPQD-----NIRDIAARAREAGIEIFAIGVGRVD--MTTLRQMASEPLEDHVDYVESYS 203 Query: 397 SL-------IHVFQNISQLMV 410 + F + + Sbjct: 204 LIEKLTKKFQEAFCAVVSDLC 224 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 71/203 (34%), Gaps = 37/203 (18%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 84 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATVVNKLIQIKSSDERNTLMAGL 138 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T +K A+Q++ + + ++ LTDGE+N Sbjct: 139 PTYPRGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 180 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + +++ + H+ V + LI Sbjct: 181 -----TASSCIDEVKQSGAIVHFIALGRDAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 234 Query: 401 VFQNISQ---LMVHRKYSVILKG 420 F ++ + + + KG Sbjct: 235 AFGALTSGNTDLSQKSLQLESKG 257 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 48/156 (30%), Gaps = 23/156 (14%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 D + +I + + + ++ + R T+ A+ AYQ +T Sbjct: 506 SSDDLIEVIAFDSAPTRYVKMQPARNRSR-IAGEIARIQPGGGTEIFSALDAAYQDMT-- 562 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +K +I LTDG+ + + I + T+ Sbjct: 563 ----------------VTQARKKHVILLTDGKASTGGIRDLVSAMI----AESITVTTVG 602 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + Q L + V + ++L +F Sbjct: 603 LGNDLDEQLLKMIADVGGGRFHAVPDPNNLPRIFTK 638 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/288 (12%), Positives = 79/288 (27%), Gaps = 90/288 (31%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I I LV+D SGSM + + K A F+D + Sbjct: 81 VQNNKILDIVLVMDTSGSM--------------------EGKSLENAKKAANNFVDKLL- 119 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +V +G++ + + E + ++ + T + +++A + Sbjct: 120 --PQNNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLK---ADGGTYTQQGLEKAATV 174 Query: 305 LT-------------------------------------------SDKKRSFFTNFFRQG 321 L ++ + F G Sbjct: 175 LNGAPAEHKKVMVVIGDGEPTYANGEHPNFDKGGFYRIYNPATKKEGYEQWYGNAFKWLG 234 Query: 322 VKIPSLPFQKFIIFLTDGENN------------------NFKSNVNTIKICDKAKENFIK 363 S + + G N + T+K D K + Sbjct: 235 KGYNSAHRNYLVKLINGGFGNGTKERKGWGHVSWPFNLMDDYFENATLKAADTIKN-NTE 293 Query: 364 IVTISINASPN--GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I+T+ I+ N + ++ +S +Y A L + +I++ + Sbjct: 294 IITVGIDIENNDLAKSIMNKLATSGKYLKAGAVAGELDKILDDIAEKL 341 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 50/450 (11%), Positives = 125/450 (27%), Gaps = 71/450 (15%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHY-------------KKNSMESANNAAILAGASKMVS 70 +I A+ +MS + LI +I+ + + SM+ + + S + Sbjct: 14 MIMAIMLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFIL 72 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ + + S ++ N K+ + + + Sbjct: 73 DDSKFKGSKQGLSREWSYIGLSEDGKKVLNYVWNKQKQDWDVSELGTKSLYNMKLDLEFK 132 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + S N + + + ++ K I Sbjct: 133 TEGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPI 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ I + V D SGSM+ + + + ++M L+ ++ + + Sbjct: 193 QGQMNIA-VSFVFDTSGSMNWDLQGRN-------VEKTGNESRMDILRKKSVIMIKDLAE 244 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ + GT + +T+ + T+ ++ Sbjct: 245 IGNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKPENLNPDGVTNPGDGLRYGMIS 304 Query: 305 L-----------------------TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 L S + QG + + + T G Sbjct: 305 LQSQPAQLKYIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDLSPTLGYE 364 Query: 342 NN------FKSNVNTIK-----ICDKAKENFI---KIVTISI-NASP---NGQRLLKTCV 383 + + + T + + +K+ + ++ I GQ L Sbjct: 365 YSRLGYDLYSRDSITRENSIAYAGEVSKKFGLGIKRVNVIGFSGVDHEIAYGQSLTDRIG 424 Query: 384 SSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + + N ++L F +I + + Sbjct: 425 EGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|327290735|ref|XP_003230077.1| PREDICTED: collagen alpha-1(XIV) chain-like, partial [Anolis carolinensis] Length = 562 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 55/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + E +GL Y+ + + K T + A+ Sbjct: 79 VAAFNVGSEKTRIGLAQYSGDPRIEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTF- 137 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 IL ++ K K I +TDG++ + K+ Sbjct: 138 --ILENNFKSEAGA----------RPGVPKIGILITDGKSQDDVIPP-----AKNLKDAG 180 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ I + + LK S P+ + YNV + + + + +++ + R Sbjct: 181 IELFAIGV--KNADETELKEIASEPDNTHVYNVADFSFMNSIVEGLTKTVCSR 231 >gi|316956996|gb|EFV46978.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 328 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 229 AALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 A KN +L FL +L + + +I Y+ ++ ++ K + Sbjct: 51 AVFKNEILRFLREFINLFTIGSNHTRLAIIQYSDQIRHELDFKEANSKAEVDEALNRVEY 110 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A+ ++I S + + + I +TDG + + S Sbjct: 111 LTGLTKTGDALTDMFKIGFSKSRGARPIE----------TGVHRVAIVITDGRSQDIVSF 160 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++AK++ + + + + + L S + + V L Sbjct: 161 S-----ANEAKKSNVLMFAVGV-TDHVSEAELVEIAGSKDRVFLVKEFTDLN 206 >gi|254443293|ref|ZP_05056769.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257601|gb|EDY81909.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 632 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/307 (8%), Positives = 82/307 (26%), Gaps = 30/307 (9%) Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH--- 163 + + T + + + + + + + + + Sbjct: 28 GPSTGGYRATSTLSATRIRTKLGATVGGAQDIRYLRNLIDEGIIPSPASFTAEGLFSEHD 87 Query: 164 --LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD-LSGSMHCAMNSDPEDVNSAPIC 220 + + E +P I + +SG P ++ + Sbjct: 88 LPIGGDAKEGWLFDIASQATSFESAAQPKVDILAQLGFVSGIDATTFKPAPLNLVAVVDK 147 Query: 221 QDK-KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-Q 278 + ++ +L + + + ++ Y + ++EP+ + + R Q Sbjct: 148 SGSMSGDPLELVRKSLRQVVSQLGSDD------QLSIVLYGSSTHIHLEPTKTSTENRDQ 201 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T ++ YQ+ + ++ TD Sbjct: 202 IIASIDRIQSHGSTAMEAGLELGYQVARQSADAFVG---------------KTRVMLFTD 246 Query: 339 GENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 N +++ + + + ++ I + TI + + K + + +S Sbjct: 247 ERPNVGRTDATGFMAMAESGSKSDIGLTTIGVGVHFGAELAEKISSVRGGNLFFFDDDES 306 Query: 398 LIHVFQN 404 + F+ Sbjct: 307 METTFRK 313 >gi|225872657|ref|YP_002754114.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793405|gb|ACO33495.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 410 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 63/202 (31%), Gaps = 34/202 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + K+ A+ A L + + + ++ ++ + + + K+ Sbjct: 189 VDNSGSMQNKLNAVDKAALDLVRASNPDDEA------FIVNFSDQAYLDQGFTSSIAKLE 242 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q + T A+ + L+ D + ++ ++ +T Sbjct: 243 Q---GLAHTEARGGTALYDAIVASADELSKDARH-----------------PKQVLLVVT 282 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL-------KTCVSSPEYHY 390 DGE++ N I + +I I + +G + + Y Sbjct: 283 DGEDDASTMN-LQQAIQRVQALHGPEIYAIGLLYDDSGDEAHRARKALEQLTEQTGGLAY 341 Query: 391 NVVNADSLIHVFQNISQLMVHR 412 + +++ V +++ + ++ Sbjct: 342 FPRSLENVDEVAAEVAKDIRNQ 363 >gi|309792307|ref|ZP_07686777.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225622|gb|EFO79380.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 423 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/261 (12%), Positives = 70/261 (26%), Gaps = 56/261 (21%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHC 205 + S V + + + + LV+D S SM Sbjct: 6 QLRSTLSRSLLPDIT------EEQLVYVLLEVSAHGVPTNISKLPLNLCLVIDRSSSMR- 58 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 ++ +K+A +D + + + L+ + R E Sbjct: 59 -------------------GERLQQVKDAANRIVDMLTDDDY------LSLVTFNDRAEV 93 Query: 266 NIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 I +++ ++ T+ M A Q +GV Sbjct: 94 VIPAQRPQHRPDLKRMISGI---EAAGGTEMATGMALALQ--------ETQRPLMGRGVS 142 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 I+ LTDG + ++I +A+ + + + I N L Sbjct: 143 R--------ILLLTDGRTYGDEG--RCVEIARRAQGRGVGLTALGIGGEWNEDLLETMSA 192 Query: 384 SSPEYHYNVVNADSLIHVFQN 404 + +A + +F + Sbjct: 193 RENSRTQYITSAAEITQIFTD 213 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L ++ + + +T + T K + Sbjct: 523 SNFRLMLDFVARVVETFEISDIGTKVAAVQFTYDQRTEFSFTDYTTKENVLAVIRQIRYM 582 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + P + F++ LTDG++ + Sbjct: 583 SGGTATGDAVAFTVRNVF---------------GPLRDSPNKNFLVVLTDGQSYDD---- 623 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I + +I + A + LK S P + + L + ++ Sbjct: 624 -VRGPAAAAHKAGITVFSIGMAWAPLDD---LKDMASEPKESHTFFTREFAGLEQIVTDV 679 Query: 406 SQLMV 410 + + Sbjct: 680 IRGIC 684 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/251 (12%), Positives = 69/251 (27%), Gaps = 45/251 (17%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++ A R GERP + VVD+SGSM Sbjct: 137 FSVTLDGARAGSDGWSLVRVGLATRAADRT--GERPPAALTFVVDISGSMAE-------- 186 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 ++ +K +L L D + + L+ ++ E + + Sbjct: 187 -----------PGRLDLVKESLGLLADELRDDDS------IALVTFSDEAETRLPMTRVG 229 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 E + T+ ++ Y + + Sbjct: 230 EARGRVREVVNSLATTSSTNVEAGVRTGYDVAVDG----------------HRKDATNRV 273 Query: 334 IFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + L+D N ++ +I ++ K I + + + + + + Sbjct: 274 VLLSDALANTGATEAGAILERIEEERKAYGITLFGVGVGSDYGDAFMERLADRGDGQTTY 333 Query: 392 VVNADSLIHVF 402 V + VF Sbjct: 334 VSTSAQARKVF 344 >gi|315608293|ref|ZP_07883283.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] gi|315250074|gb|EFU30073.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] Length = 342 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 64/210 (30%), Gaps = 54/210 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +D + +GLI + + + Sbjct: 97 DISNSMLAQDVAPSRLDKSKLLVESLVDRFT-------NDKIGLIVFAGDAYVQLPITSD 149 Query: 273 TEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ SLI + TD A+ T + Sbjct: 150 YVSAKMFLQNIDPSLIQTQGTDIAQAINLGLHSFTQA------------------DKIGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------------SPNGQ 376 II +TDGE++ + + ++ + + + + +GQ Sbjct: 192 AIIVITDGEDHEGGAVEAAAEA----RKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQ 247 Query: 377 RLL---------KTCVSSPEYHYNVVNADS 397 ++ + + + +V N Sbjct: 248 TVMTALNEQMCREVAQAGSGKYIHVDNTGD 277 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIRQ 286 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 287 SEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|198421591|ref|XP_002123589.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 401 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 51/196 (26%), Gaps = 30/196 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-----------TRVEKNIEPSWGTEKVR 277 + + + +G+I Y+ + ++ + K Sbjct: 215 DNFEIVKNWTIKVANSFDISDGYTQVGVIQYSHYWATEPLDKQSYIKTEVPLGKYRNKQE 274 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 SL T + A+ + + K +I LT Sbjct: 275 FSAAVRNISLHEYTTYTAHALNKTVFDFQQSSR-------------WNRPKTSKVLILLT 321 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC--VSSPEYHYNVVNA 395 DG + + + + + I + + N + L + + E Y N Sbjct: 322 DGLSTDKQ---LLPSSANYVRSLNITTFAVGVG-EANEKELQEIANGQGTNERVYYTSNF 377 Query: 396 DSLIHVFQNISQLMVH 411 L + + +++ Sbjct: 378 AGLNKIVSQLRSAILN 393 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V+ V T++ Sbjct: 161 FLINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 219 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 220 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIRQ 263 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 264 SEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 323 Query: 407 QLMVHR 412 + Sbjct: 324 DRIFSL 329 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIRQ 286 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 287 SEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGIVQYGEDAVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIRQ 286 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 287 SEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 65/185 (35%), Gaps = 20/185 (10%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ A K AL + ++ + +GL Y V+ +P+ Q + Sbjct: 60 TKIEAAKKALTGVVGAL------PDTAQVGLRVYGATVDGKGKPTPAACADTQLIHPIAA 113 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 K T + A+K + + + +K ++ I+ ++DGE + Sbjct: 114 LDKTKLTTTIAAIK--------ALGETPIAHSLTEALKDLGTSGKRNIVLVSDGEESCVP 165 Query: 346 SNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVF 402 I K + +I T+ + + L+ + +Y+ +A +L Sbjct: 166 DPCP---IVKKLTAAGVDLQIDTVGFGVNAKARTQLQCIADAGKGTYYDAKDAGALATSL 222 Query: 403 QNISQ 407 +SQ Sbjct: 223 NKLSQ 227 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT T K Sbjct: 489 FRTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + Sbjct: 549 GTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGRSYDDVRIPAM 592 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A + I + + Q L+ + P ++ + V D+L I Q Sbjct: 593 V-----AHHKGVITYAIGV--AWAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 24/272 (8%), Positives = 63/272 (23%), Gaps = 38/272 (13%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT--IFYNMDVMTSYDYRLQFIEHLLN 166 + + + + M +++++ T+ RLQ + Sbjct: 165 TTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALPKPSPSAGFTTSSLRLQPVGQ--- 221 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + + + S+ DP D + Sbjct: 222 RSRELDLWCTTTYTNSQNSPQANPGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSS 281 Query: 227 ------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQ 278 ++ D L MG++ Y + ++ Sbjct: 282 IGKRRFRIQK-----QFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRDLKT 336 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ A+ + S + + + D Sbjct: 337 AIEKITQR--GGLSNVGRAISFVTKNFFSKSNGNRGGA-------------PNVAVVMVD 381 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 G + + A+E+ I I I+I Sbjct: 382 GWPTD-----KVEEASRLARESGINIFFITIE 408 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 63/208 (30%), Gaps = 37/208 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + + KM A K A+ +LD I + + + E + S G Sbjct: 159 DASGSMAGKVNGQVKMEAAKKAIYNYLDKI---PNNANVMLRVYGHKGSNNENDKSLSCG 215 Query: 273 TEKV---------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +V Q+ + T A++ + + Sbjct: 216 SSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIE----------------SVNNDFKE 259 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKT 381 + ++DGE + ++ + I + + Q+ LK Sbjct: 260 YTGEENLNVVYIVSDGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKN 316 Query: 382 -CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + V NA+ L +Q +++ Sbjct: 317 TAEAGKGNYATVSNAEEL---YQTLNKE 341 >gi|326434435|gb|EGD80005.1| hypothetical protein PTSG_10281 [Salpingoeca sp. ATCC 50818] Length = 736 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 102/371 (27%), Gaps = 43/371 (11%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A + A + + + A D+ + + +++ + L V Sbjct: 35 SDTTEAEVCGVAVDYIDTGTAQYNDLVVYETSGNVAFQDETFKRMTDYMAQLLLSLDVVS 94 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 T + + R + A N LD Y+ ++ Sbjct: 95 TETIVVVDGFTPRPEVAVRAPNLLDVG---------RALRLRYQADGSAATASRLATLAQ 145 Query: 174 VSFIPALLRIEMGERPIFL--IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + + + V + A + S + +T ++ Sbjct: 146 ACVDAGFDFVSVPNDIGSDNYVHVSVPKMTCGNAAADLLFILDGSGSVGSSNFQTMLSFT 205 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----PSWGTEKVRQYVTRDMDSL 287 + F + + ++ Y + + + +++ ++ Sbjct: 206 RTVATFF-------DVSADTTRIAVMVYASYNYLIFDFNYILTHTKDELLDAISAIN--Y 256 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ A ++ + +GV+ S + + +TDG + + + Sbjct: 257 PYGGTRTGGALDYARTVMFT----------ADRGVRPSSEGIPRVAMVITDGASADDVA- 305 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNI 405 + ++ + + I I A N L + + + + + I Sbjct: 306 ----APAQQLRDEGVTLYAIGI-AGANENELNEIASPPITSNVVF-ISTFSEFGTLAAAI 359 Query: 406 SQLMVHRKYSV 416 S+ R +V Sbjct: 360 SRANCDRAATV 370 >gi|303241521|ref|ZP_07328022.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] gi|302590939|gb|EFL60686.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] Length = 1061 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 16/155 (10%), Positives = 44/155 (28%), Gaps = 17/155 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI---------- 61 + ++ A+ + L+L G L+ + ++S AA Sbjct: 1 MNLFKKNEGAITVYLAIILSVMLILTGVLVDGARARTAEAQVQSTTEAAANSLLANYNNI 60 Query: 62 ------LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN-HIKESLSGYSAVFY 114 L S+ S++ + N + A ++ + + G + Sbjct: 61 LKEWFGLMSLSENNSDVLEEELMYYLNRNLMTELGAEKANLSDESWNYVKKFLGSDNKYN 120 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 + ++ + ++ L S ++ Sbjct: 121 DVNFIDMYDYRIEEVSACPMYNLSESAVLRSQIVE 155 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLRTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|296220039|ref|XP_002756140.1| PREDICTED: integrin alpha-X [Callithrix jacchus] Length = 1078 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 61/191 (31%), Gaps = 27/191 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 A +K+ + + + L+ ++++ + + E Sbjct: 164 YNNFAMMKSFVRAVMSHFQ-----RPSTQFSLMQFSSKFKTHFTFEEFRESSNPLSLLAS 218 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A+++ + L K +I +TDG+ Sbjct: 219 VDQLGGYTHTATAIQKVVKELFLASNG-------------ARREATKILIVITDGKKEGD 265 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + I + DKA I I + + + N + L S P + + V + D+L Sbjct: 266 WLDYDDVIPMADKA---GIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHIFKVEDFDAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KDIQNQLKEKI 333 >gi|148749338|gb|ABR09545.1| CD11c protein [Callithrix jacchus] Length = 1161 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 61/191 (31%), Gaps = 27/191 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 A +K+ + + + L+ ++++ + + E Sbjct: 164 YNNFAMMKSFVRAVMSHFQ-----RPSTQFSLMQFSSKFKTHFTFEEFRESSNPLSLLAS 218 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A+++ + L K +I +TDG+ Sbjct: 219 VDQLGGYTHTATAIQKVVKELFLASNG-------------ARREATKILIVITDGKKEGD 265 Query: 345 KSN-VNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSL 398 + + I + DKA I I + + + N + L S P + + V + D+L Sbjct: 266 WLDYDDVIPMADKA---GIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHIFKVEDFDAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KDIQNQLKEKI 333 >gi|327270788|ref|XP_003220170.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 930 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 69/195 (35%), Gaps = 34/195 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 Q K +++ L A LFL L ++E ++G++ + ++ ++K Sbjct: 318 ASAQMGKDNRLSRLIRAAKLFL-----LQIIEEGSWVGIVTFNSKANIQAGLQKVFSDKE 372 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ +T + + +D + A+Q+ + + I+ L Sbjct: 373 RESLTSHLPTTASGDSDICEGVTTAFQVFSRKLTSTEGCE----------------IVLL 416 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 T+GE + C K + I I TI+ + L K + + Sbjct: 417 TNGEGLDLSP-------CLSKIQSQEIIIHTIAFGSKA-SNELEKLADMTGGKTFYATDS 468 Query: 394 -NADSLIHVFQNISQ 407 +++ LI F IS Sbjct: 469 LDSNGLIDAFGGISS 483 >gi|307247518|ref|ZP_07529563.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306856021|gb|EFM88179.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 510 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/489 (7%), Positives = 109/489 (22%), Gaps = 118/489 (24%) Query: 38 GFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH------------ 85 + + + + A+L+ ++ S + SN Sbjct: 14 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 73 Query: 86 -AKRALIDDAKRFIKNHI----------------KESLSGYSAVFYNTEIQNIVNSSRIS 128 + + S+ + + Sbjct: 74 KRDSQMATKFVKAFLPQTNEEKMHLTPVCKTINNTNGKGHTSSSEVTCTVSGTIEHKSWF 133 Query: 129 MTHMANNRL-----DSSNNTIFYNMDV---MTSYDYRLQFIEHLL--------------- 165 + + + ++ + + + + Sbjct: 134 PLKVGTVEVIPHEVNVASKSKAFKKNTFNIPIDLMVVADLSGSMRYDITNKYETNNETSK 193 Query: 166 -NQRYNQKIVSFIPALLRIEMGERPIFLI-ELVVD---LSGSMHCAMNSDPEDVNSAPIC 220 + I LL + + + + + S + D P+ Sbjct: 194 LGILKDVLIELAEKTLLSEDANQHNRIYVTPFALGAEIDASSCALPYHWDKGKDPDTPLK 253 Query: 221 QDKKRTK------------MAAL--KNALLLFLDSIDLLSHVKEDVYMGLIG-------- 258 +K + K LD+I + K G+ Sbjct: 254 IKNILSKKEGNNQQSRAEFIDNFVYKMNTQATLDNIGEKQNYKVTFPKGVFCLKNMKNKN 313 Query: 259 --------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ----ILT 306 + + + ++K T +K+ + +L+ Sbjct: 314 HGWFSRKDKSDFTGYVRSLRADGATLASSGVLVAANNMIKGGSRTKELKEQTKRVILVLS 373 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIF------------LTDGENN------NFKSNV 348 + + + + +I+ ++ G + + Sbjct: 374 DGNDEIVKGDPNSKVPFLNYTRITENLIYGRQEEFLSGKKKVSFGNTTIETYLTDTQPKE 433 Query: 349 NTIKICDKAKEN--------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 T +C+ ++ KIV + + + K CV + +Y+ + SL++ Sbjct: 434 VTNGMCNVIRDKLDKLNNDKNTKIVFVEFGYASKSKEAWKHCVEGDKNYYSANDKASLLN 493 Query: 401 VFQN-ISQL 408 F+ I + Sbjct: 494 SFKQAIGET 502 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 68/243 (27%), Gaps = 48/243 (19%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ P + +VD+SGSM+ K Sbjct: 172 NPRSLLLRVGIQAADPAKQALPPANLVFLVDVSGSMN-------------------SPDK 212 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+NAL LF+ + + L+ Y + +EP+ G K + Sbjct: 213 LPLLQNALKLFVAQLRPQD------RVALVTYASGTRVVLEPTAGDRKA-AITAAIDGLV 265 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY++ I+ TDG+ N Sbjct: 266 PGGATAGAAGIDLAYRMAEQGFVEHGIN----------------RILLATDGDFNVGITR 309 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN-----VVNADSLIHV 401 + +++ I + T+ +L++ + + Y L+ Sbjct: 310 FETLKDRVAERRKSGIALSTLGFGGGNYNDQLMEQLADAGDGAYRYIDSLAEAQKVLVDE 369 Query: 402 FQN 404 F + Sbjct: 370 FTS 372 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 64/182 (35%), Gaps = 35/182 (19%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V++ ++G++ + + E + R + + + ++ T ++ A+ ++ Sbjct: 339 VEQGSWVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVIK 398 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIV 365 I+ LTDGE+N TI C D K++ I Sbjct: 399 KKYPTDGSE-----------------IVLLTDGEDN-------TISSCFDLVKQSGAIIH 434 Query: 366 TISINASPNGQRLLKTCVSSPE--YHYN--VVNADSLIHVFQNISQ---LMVHRKYSVIL 418 T+++ + L + + + + V N + L+ F +S + + Sbjct: 435 TVALG-PAAAKELEQLSKMTGGLQTYSSDQVQN-NGLVDAFAALSSGNAAIAQHSIQLES 492 Query: 419 KG 420 +G Sbjct: 493 RG 494 >gi|86137907|ref|ZP_01056483.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] gi|85825499|gb|EAQ45698.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] Length = 181 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 8/51 (15%), Positives = 24/51 (47%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++ A ++ FA+ + FL+ + + + +++ +E A + A+ Sbjct: 10 LRRFRKDTDATVTVEFAILMPLFLMFLFSTVELGMLILRQSQLERALDIAV 60 >gi|159038794|ref|YP_001538047.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157917629|gb|ABV99056.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 316 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 65/234 (27%), Gaps = 53/234 (22%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + L +D+S SM ++AA + A F++ + Sbjct: 87 ATVMLAIDVSLSMQADDV---------------SPNRLAAAQEAAQQFVEELPAS----- 126 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ + + P+ + V V + + T + A+ Sbjct: 127 -YNLGLVSFAKAANVLVPPTKDRQAVVTAVDGLVLAE---STATGEAVFT---------- 172 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + I+ L+DG + +S A TI+ Sbjct: 173 --CLEAIRSVPADGAAGIPPARIVLLSDGYRTSGRSVEEAAAAAQAANVAV---STIAFG 227 Query: 371 ASP--------------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + ++ + Y + + L V+Q++ + Sbjct: 228 TDGGQVDIGGQLQRVPVDRFALAELAATTEGHFYEAASVNELKQVYQDMGSSIG 281 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 66/216 (30%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + ++LV+D+SGSM +++MAA K A LD+ Sbjct: 27 QDAPKVDLVLDVSGSMRARD--------------IDGQSRMAAAKQAFNEVLDATPEEVE 72 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R + + P D T A K A LT Sbjct: 73 LGIRTLGADYPGDDR--------------KTGCKDTAQLYPVGPLDRTEA-KTAVATLTP 117 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + K I+ ++DGE+ + ++ + I I Sbjct: 118 TGWTPIGPALLKAAGDLDGGNGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGLTID 175 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + R L ++ + +V + D L Sbjct: 176 TLGLVPTAKLSRQLSCIAEATGGTYTSVEHQDELTD 211 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 51/145 (35%), Gaps = 6/145 (4%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ S++ AL + + + + V Y + ++ + A L+GA ++ S + Sbjct: 13 QRGTVSLMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQLKSADDQATT 72 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------TEIQNIVNSSRISMTH 131 +S+ A + + + N +++G + +S+R Sbjct: 73 NANVLSSVTSAAAQNGYTKAVANDCTTAVAGAADGVRTCLGLWDPANPANGDSTRHFDPG 132 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDY 156 + ++ + + V + + Sbjct: 133 YPATTVSANAVRVQATLTVPLLFMF 157 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLRTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 70/221 (31%), Gaps = 31/221 (14%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 C D V A L N L F + + +G++ Y +V Sbjct: 105 RCETYMDIVIVLDGSNSIYPWYEVQAFLINILQKFYIGPGQIQFLWFTGQVGVLQYGEKV 164 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + S + V + V R T++ + A + Sbjct: 165 VHEFKLS-DYKSVEEVVKRARSINQRGGEETNTALGINVA---------------CSQAF 208 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NA----SPNGQ 376 +K +I +TDGE+++ IK C+ ++ I I++ N + Sbjct: 209 KHGGRRGAKKVMIVITDGESHDSADLQQVIKDCE---KDGITRYAIAVLGYYNRRGINPE 265 Query: 377 RLL---KTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 L K S P+ + +NV + +L + + + + Sbjct: 266 AFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 306 >gi|315126093|ref|YP_004068096.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] gi|315014607|gb|ADT67945.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] Length = 1057 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/320 (10%), Positives = 73/320 (22%), Gaps = 74/320 (23%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + + G + S + Sbjct: 34 VQTNEKPRVIMIFDTSGSMAWDVND-------GDACYKRSGNSYYEVDCFESKDSYKRNE 86 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 N P C D +++ +NA+ ++ D + GL+ + Sbjct: 87 QCYKRTSYYNYEPTCSD---SRLRVAQNAITQLVNDNDDIE-------FGLMRFNGSNGG 136 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY----------------QILTSDK 309 + + + + + + T T + +AY + T++ Sbjct: 137 YV-LARVGAEKSFLLNKVEELPASGSTPLTETLWEAYLYITGQNVYYGGNTNERDFTAES 195 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK---------------------SNV 348 + + F + II +TDG+ N SN Sbjct: 196 SGIYKSPFAPVTGEPLRCDNSINIILMTDGDPTNDSNRNNDIYNLHNSYFSGNPPAVSNS 255 Query: 349 NTIKICDKA------------KENFI----KIVTISIN---ASPNGQRLLKTCVSSPEYH 389 + K + ++ I + L +T + Sbjct: 256 YLAALAKIIHGTDDVKVDLYSKTKNVLDIGRVFPIGFGTGMSDSGKNLLTETADYGGGQY 315 Query: 390 YNVVNADSLIHVFQNISQLM 409 + A L +N + Sbjct: 316 LHANTAAQLSDALKNTISRI 335 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 67/189 (35%), Gaps = 35/189 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V++ ++G++ + + + E + R +T+ Sbjct: 319 TGDRLKRLNQAGKLFL-----LQIVEQGSWVGMVTFDSAAQVQSELIQINSGTERDALTK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ T ++ A+ ++ I+ LTDGE+N Sbjct: 374 SLPTVATGGTSICSGLRSAFAVIKKKYPTDGAE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---L 398 TI C ++ K++ I T+++ + L + + + L Sbjct: 417 -------TISSCFNEVKQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQAQNNGL 468 Query: 399 IHVFQNISQ 407 I F +S Sbjct: 469 IDAFGALSS 477 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|118591412|ref|ZP_01548810.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118436084|gb|EAV42727.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 657 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/226 (11%), Positives = 62/226 (27%), Gaps = 42/226 (18%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + V+D S SM ++ K ++A F+ + Sbjct: 16 ARAQTDTSPDLLFVLDSSNSMWGQIDG---------------TAKAEIARSAFEGFVAGL 60 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + Y + + P ++ + V T T ++QA Sbjct: 61 PDGTRAGVMAYGHRRKADCGDVETLVPVSDLDRAK-LVESVKALTPRGKTPITETLRQAA 119 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++L + + +I ++DG + + + + Sbjct: 120 ELLAQNDR-------------------PGRLILISDGIETCGGDPC---ALAEALASSGV 157 Query: 363 KI--VTISINASPNGQRLLKTCVS--SPEYHYNVVNADSLIHVFQN 404 I + + + C++ + ++N +AD L + Sbjct: 158 DFKAHVIGFDIASKADQAKIACIAHLTGGTYWNARDADGLNEALKE 203 >gi|294011131|ref|YP_003544591.1| hypothetical protein SJA_C1-11450 [Sphingobium japonicum UT26S] gi|292674461|dbj|BAI95979.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 418 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 45/184 (24%), Gaps = 4/184 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + + N ++ A S+ + G + W K ++ ++A LAGA Sbjct: 2 FGRAVGRLLRDRTGNVLMMAAASMPLLVGAAGLATDTVQWTLWKRQIQRQADSAALAGAY 61 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + IS A AL ++ + ++ Sbjct: 62 AVAQGFNASDSATADISRMALVALTQTPTIENAPTSG----PFAGNAQAVRVVLQTSAEL 117 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + L+ + V+ ++ Sbjct: 118 PFSKILGVKAPVIYGEATAAVVGSGDYCVVSLEKTSTVGITLQGNATVNLGCGIVTNSRA 177 Query: 187 ERPI 190 + Sbjct: 178 SNAV 181 >gi|324503617|gb|ADY41568.1| Cuticlin-1 [Ascaris suum] Length = 786 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 55/164 (33%), Gaps = 17/164 (10%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + + ++ Y++ + K + + I T + A+ +A Q+ Sbjct: 67 EIGPDRTQIAVMQYSSYTRVEFGFTANPNKEKLRLALQKIRHISGTTRTGKALDKALQVF 126 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 G ++ + + ++DG +++ + + + ++I+ Sbjct: 127 KHG---------ETSGARVNQEDVAQIAVVVSDGHSHDD-----PVPAAQRLRRAGVQIL 172 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV---FQNIS 406 T+ I A N L+ N+ + SL F+ I+ Sbjct: 173 TLGIGAHINMGELIDITGDETLAFQNLTSQASLDKFVSQFRKIA 216 >gi|256261600|gb|ACU65922.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 21/166 (12%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ R +++ + + T + A++ L S Sbjct: 183 RPSSQFSLVQFSDRFQEHFTFKDFATSSDPLNLLNSVWQLGGWTFTASAIRFVTDRLLSA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K +I +TDGE I + A + I Sbjct: 243 AYG-------------ARKDASKILIVITDGEKTEKVDYKEVIPRAEAA---GVIRYAIG 286 Query: 369 INASPNGQRLLK---TCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + ++ + L+ S+P + + V N D+L + + + + + Sbjct: 287 VGSAFQYRNSLQELIDIASTPSKEHVFQVENFDALRDIQKQLKEKI 332 >gi|47207527|emb|CAF87062.1| unnamed protein product [Tetraodon nigroviridis] Length = 409 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 18/213 (8%), Positives = 51/213 (23%), Gaps = 32/213 (15%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + V + K ++ +G++ Y+ Sbjct: 2 CAGCSTAANDLVYVLDGSWSVGVSDFDTAKQ---WLINITSQFDISSHYTQVGVVQYSDA 58 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPT-------DSTPAMKQAYQILTSDKKRSFFT 315 I + S + T PA+K A + S + Sbjct: 59 PRLEIPLGKHQGQDELIRAIQSISYLGGNTQARRRSERHLPAIKFAVDHVFSSSQ----- 113 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 S + + +TDG++ + + + A+ + + + + + Sbjct: 114 --------RASQVKNRIAVVVTDGKSQDDVVDASME-----ARTQGVTVFAVGVGSEITT 160 Query: 376 QRLLKTCVSSPEYH-YNVVN---ADSLIHVFQN 404 L+ + + + + Sbjct: 161 SELIAIANKPSSTYVLYAEDYTTIHHIRDAMEQ 193 >gi|327265885|ref|XP_003217738.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 636 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 30/288 (10%), Positives = 77/288 (26%), Gaps = 21/288 (7%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + N+ L + + S+ + ++ F+ Sbjct: 213 SFITNDLLPVIHKSFSGK-KGHVSFKPTIDQQRSCVDCPTTLLNGDFVITYDVNRESPGD 271 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++ + + V + K+ + ALL L+ I ++ Sbjct: 272 LQIVNGYFVHFFAPKNISHLPKNIVFIIDVSISMSGRKLQQTREALLKILEDIKEDDYL- 330 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + + ++ +YV + + T+ + ++L Sbjct: 331 NFVLFGDDVHKWKDTLIKATPENLDEASRYVQQIDIA---GWTNLNGGLMAGIEMLNEAH 387 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTIS 368 K II LTDG + + I A + + + Sbjct: 388 KNRSLPERSAS-----------LIIMLTDGRPTKGERDTQVILSNVRNAIQGKYPLYNLG 436 Query: 369 INASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQLMVHR 412 + L K + + + +A L + ++ ++ Sbjct: 437 FGYDLDYGSLEKMAAENNGLARRIYEDSDSALQLQGFYDEVANPLLTE 484 >gi|306829469|ref|ZP_07462659.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] gi|304428555|gb|EFM31645.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] Length = 861 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 31/307 (10%), Positives = 77/307 (25%), Gaps = 40/307 (13%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN---QRYNQKIVSFIPALLRIEMGERPIFL 192 LD ++ + + ++ + AL G R Sbjct: 62 SLDITSKLGTETQSEPLDVVLVADLSGSMNKRDVPSSTGRTITRLDALKNTLKGTRDRQG 121 Query: 193 IELVVDLSGSMHCAM--NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + + + + +M D A + K + + Sbjct: 122 LIDTILSNSNNRLSMVGFGGKIDNKFAEQSWNSYYRKWEWGYRY----------WPYEER 171 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---------MDSLILKPTDSTPAMKQA 301 + + + + + + V++ +S I T+ + ++ A Sbjct: 172 TAFYDGVSPWDDADTILNWNNDASGSKTAVSKMRIAGGQSIGTESGIGTGTNISAGIRIA 231 Query: 302 YQILTSDKKRS----FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDK 356 Q++ S + + + + + + DG + N + Sbjct: 232 NQLIDSARPNAKKVVIVLSDGFANMYYNDSGYTVYNYNNQDGSETAPEWFWNNLDVSINN 291 Query: 357 AKENFIK----IVTISINASPNGQR---LLKTC----VSSPEYHYNVVNADSLIHVFQNI 405 + +I S N L S P + N D L F++I Sbjct: 292 LAYSLAPKLDGFYSIKFRYSNNVDSITSLQYYIRYHNSSIPNEILSANNEDQLRDSFKDI 351 Query: 406 SQLMVHR 412 + ++ Sbjct: 352 TDKILPL 358 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 66/183 (36%), Gaps = 23/183 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + L E +GL+ Y + V+ S + K + T + Sbjct: 75 TFIVNLVQFLEVGPEATRVGLLQYGSVVQPEFSLSTFSTKAEVEQAVRNMKHLATGTMTG 134 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ A + ++ + + + + +TDG + ++ Sbjct: 135 LAIQYAAETSFTEADGARPAHLH----------IPRIAVVVTDGRPQD-----RVEEVAA 179 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADS---LIHVFQN-ISQLM 409 +A++ I+I I + + LKT S P E+ + V + L+ VFQ+ + + M Sbjct: 180 QARQAGIQIFAIGVGRVD--MKTLKTIGSEPHSEHVHLVASFSQMETLVSVFQSKLCREM 237 Query: 410 VHR 412 Sbjct: 238 CEL 240 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 64/191 (33%), Gaps = 16/191 (8%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTD 293 + ++ L ++GL+ Y+T V S T +RQ V+R + + + Sbjct: 428 QFVISIVESLDVSPTGAHVGLLQYSTNVRTEFTLSQHTSAQGIRQAVSRMQ--YMGRGSM 485 Query: 294 STPAMKQAYQILTSDKKRSFFTNF-------FRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + A+++ +Q S ++ + + S +K G + F Sbjct: 486 TGSALRRMFQSSFSAEEGARPNVPRVSVVFTDGRSQDDASEWAKKAKNSGIPGSFSYFGG 545 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQN 404 + C + I + + ++ L+ S PE + Y + + + Sbjct: 546 TGRFLS-CSFLLVLGVTIYAVGVG--KAIEQELREIASEPEEKHLYYAQEFKDVGEITEK 602 Query: 405 ISQLMVHRKYS 415 + M K S Sbjct: 603 LKSRMCTDKPS 613 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 73/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + R N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A+ + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYASETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 68/192 (35%), Gaps = 34/192 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDM 284 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 34 DRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGL 88 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L T +K A+Q++ + + ++ LTDGE+N Sbjct: 89 PTYPLGGTSICSGIKYAFQVIGELHSQLDGSE----------------VLLLTDGEDN-- 130 Query: 345 KSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIH 400 T C D+ K++ + I++ + + + +++ + H+ V + LI Sbjct: 131 -----TASSCIDEVKQSGAIVHFIALGRAAD-EAVIEMSKITGGSHFYVSDEAQNNGLID 184 Query: 401 VFQNISQLMVHR 412 F ++ Sbjct: 185 AFGALTSGNTDL 196 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 365 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 424 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 425 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 465 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 466 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 521 Query: 406 SQLMV 410 + + Sbjct: 522 IRGIC 526 >gi|77552209|gb|ABA95006.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 605 Score = 54.1 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 67/232 (28%), Gaps = 43/232 (18%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V + ER + V+D+S +M L Sbjct: 34 VLVRVVAPPPAAASSERAPIDLVAVLDVSCC-----------------GGLGPVNRMDLL 76 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K A+ +D + + + + + G ++ + V + Sbjct: 77 KKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMVQ--SSLTVTGE 134 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+K+A IL +G K FI+ ++DG++ Sbjct: 135 NKLSTALKKAATIL--------------EGRKDHDKKRPGFIVLISDGDD---------A 171 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVF 402 + + A + + N + + + +S + + + D L F Sbjct: 172 SVLNDAMNLNCSVHAFGFRDAHNARAMHRIANTSAGTYGILNDGHDGLADAF 223 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 63/218 (28%), Gaps = 34/218 (15%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM ++MAA K A +D++ + Sbjct: 41 VELVLDVSGSMRARDVDG--------------MSRMAAAKQAFNEVIDAVPDEVRLGIRT 86 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R + + D T A K A L Sbjct: 87 LGATYPGQDR--------------KTGCLDSKQLYPVGRVDRTEA-KTAVATLRPTGWTP 131 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISIN 370 + + I+ +TDGE++ + + + + + T+ + Sbjct: 132 IGLALRGASKDLAGGDATRRIVLITDGEDSCGQPDPC--DVARELAAQGTHLVVDTLGLT 189 Query: 371 ASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQ 407 + L ++ + V + D L + + + Sbjct: 190 LDSKVRDQLSCIAEATGGTYTAVQHTDQLSTRIKQLVR 227 >gi|270208337|ref|YP_003329114.1| Bee1 [Enterococcus faecalis] gi|72388797|gb|AAZ68037.1| Bee1 [Enterococcus faecalis] Length = 1083 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/351 (10%), Positives = 95/351 (27%), Gaps = 16/351 (4%) Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 +N D + I N+ + D +I E + ++ ++ N N + Sbjct: 256 DTNQYTKSDSSDMIKNYNYGSAGDSDSNVEIQNILEGNLNFDNGYHAYDVGNNQNINLKK 315 Query: 129 MTHMANN----RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + +N ++ + + + N + + S + Sbjct: 316 IVIPTDNPTQFQIQLDIIGGSLKTRKNVDVAFVVDKSGSMANNDRWKNLKSALNTFAHGL 375 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + P ++L + S+ P + P + +D+ Sbjct: 376 LDDNPGGSVQLGIAGFSSVQETTTYFPYYKINVPYGKVGNFGGTQGNYTGFTTSVDAFLN 435 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQ 303 + + E G + + R+ + + L T Y Sbjct: 436 HALLNESPSGGTPTFLGLDAGLELLTNSQYNGRKDAQKVLIILTDGLPTW---GPTTTYT 492 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--- 360 +S ++S T ++ + +++ +G + K+ Sbjct: 493 SSSSGIRQSGMTRSSDNNGRVETFKATNTTLYIGNGSETGPDISTVRSNTISHGKKRTEQ 552 Query: 361 --FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I I + +L P+ Y+ + L +V I + + Sbjct: 553 NPEIDRYAIGFGV-TDVNDILD--ALGPQGKYSATSHTDLNYVLNQIKKKI 600 >gi|313235671|emb|CBY11123.1| unnamed protein product [Oikopleura dioica] Length = 362 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 58/194 (29%), Gaps = 27/194 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + +K A +L V D ++G++ + + Sbjct: 118 ISTSMGIGKRWVNVKKAATQYL------EAVPHDAHVGIVLFHRVAYVKVPLKQIENADS 171 Query: 278 Q--YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R + T A+K + +L + S II Sbjct: 172 SDFLKNRLEKVELGVGTSIASALKVSMNVLQTS-------------GPKDSRDSGGNIIL 218 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVN 394 L+DG+ ++ + +N + + TI+ + + + + ++ + N Sbjct: 219 LSDGQE--SHDPRIDDELIQELIDNKVTVNTIAFGTDASQKLEEVSSRTKGSSYFSDPEN 276 Query: 395 ADS-LIHVFQNISQ 407 +S L F I++ Sbjct: 277 PNSILQDAF--IAE 288 >gi|291242484|ref|XP_002741138.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 765 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/257 (11%), Positives = 87/257 (33%), Gaps = 49/257 (19%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + + I P + G R + LV+D SGSM Sbjct: 250 VMLNHTDFKDGNNPTRIIDDTTPNFIIKRSGVRR---VVLVLDTSGSMD----------- 295 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ L + F+++ +++ ++G++G+++ + + Sbjct: 296 ---------GDRIQRLHQSATYFIET-----RIEDGSFVGIVGFSSYAVIHSGITEIKYG 341 Query: 276 VRQ-YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + ++ + T ++ A Q+L S + ++ Sbjct: 342 FQRGEIASNVPQVASGATSIGDGLRVALQVLQDGNVTSEGAS----------------LL 385 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +TDG N + + + + ++ +++ TI+ L + ++ ++ V + Sbjct: 386 LITDGIENTY---PLLMNVMQEVYDSGVRVDTIAY-TEAAQSTLQELSDNTGGLYFYVPD 441 Query: 395 ADSLIHVFQNISQLMVH 411 D+ +++ + Sbjct: 442 NDTSTAFIDSLAATISE 458 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 50/176 (28%), Gaps = 23/176 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K A+ ++ + L+ Y+ V + T R + +DS+ Sbjct: 113 AKAAVKGLVEGLRSQD------RFSLVTYSNSVNGGDGLHYLTADKRNSLNWMVDSIP-- 164 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + +I ++DG+ N ++ N Sbjct: 165 -------------AGGGTNLGGGLEKGVGVLRAYGAPDRMGKVILISDGQANQGVTDPNQ 211 Query: 351 IKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +++ + T+ I N Q + +Y + N + VFQ Sbjct: 212 LAAMAALRDDGLVYSVTTVGIGQDFNEQLMATVADGGRGRYYYLENPGDFLAVFQE 267 >gi|51870478|emb|CAA63617.1| Thrombospondin related anonymous protein [Plasmodium cynomolgi] Length = 559 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +K+A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVKHAVPLAMKLIQQLNLNENA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ ++ I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYPNIFSNNAKEIIRLHSDASKNKEKALIIIKSLLSTNLPYARTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG + + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDCIQDS---LKESRKLNDRGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|282897675|ref|ZP_06305674.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197354|gb|EFA72251.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 464 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 74/238 (31%), Gaps = 33/238 (13%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + QK+ + ++ P V+D SGSM+ + D ED Sbjct: 15 FMLADKAGQKLFVMLKLRPTKDIATNLPPTSFTFVIDTSGSMYEVVAGDVEDTGVTYQQD 74 Query: 222 DKKR-------TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 K+ +K+ + +LL ++S +K+ + ++ + + I + TE Sbjct: 75 GKEYKQVTGGKSKIDIVIESLLRLVNSGK----LKQQDRVSIVQFDDSASQIIGLTSATE 130 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + T +++A+ IL+ + K + Sbjct: 131 TKQIETAIKKLRDFSGGTRMGLGLRRAFDILSEQEMTV------------------KRAL 172 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 TDG+ I + I I + + N L + + V Sbjct: 173 LFTDGQT---FDEDQCQSIANHFATRNIPITALGVGEEFNEDLLTHLSDYTGGKLFYV 227 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 45/456 (9%), Positives = 122/456 (26%), Gaps = 80/456 (17%) Query: 24 IIFALSVMSFLL-LIGFLIYVLDWHYK------------KNSMESANNAAILAGASKMVS 70 ++ A+ +MS + +IG + + + + SM+ + I S + Sbjct: 13 MVMAIMLMSMIAVIIGIIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTIGKATSVFIL 71 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ E + S + ++ + K+ + + + Sbjct: 72 DDSKFKGNKEGLTKEWSYIGLSSDGKKVLNYVWDKSKQDWKVSELGTKSLYDIKLDLEFK 131 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + + N + + + ++ K + Sbjct: 132 TEGAYQDNRLISYNLTGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIALAYRNDPI 191 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ + I V D SGSM +N + + +++M+ L++ + + + Sbjct: 192 EGQMNVA-ISFVFDKSGSMSWDLNGNNTNY-------WGPKSRMSILQDKATIMMRDLKD 243 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ I + + E GT + + T+ ++ Sbjct: 244 IGNVSVNLVSFSILGSYVQKDFSELDKGTTTIEASINAL---QTGGVTNPGDGLRYGMMS 300 Query: 305 LT----------------------SDKKRSFFTNFFRQGVKIPSLPFQKFIIF------L 336 L S+ + + F + Sbjct: 301 LQNHSAQLKYVVLLTDGIPNAYTVDTNDTSWRNRNVHPYYNRWRETVGELVTFNNGPYDV 360 Query: 337 TDGENNNFKSNVNTIKICDKAKENFI---------------KIVTISI-NAS---PNGQR 377 T + + ++ I ++ I G+ Sbjct: 361 TTNLTTDQNRVSYDSYSNEALRKKAIEYAGKVSQTFGAGVKRVNVIGFSGVPSEIAYGED 420 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 L ++ S + + +L F +I + + Sbjct: 421 LTRSIGSGGMEAKYVPAADEAALQQTFSDIKKQIQQ 456 >gi|260814492|ref|XP_002601949.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] gi|229287252|gb|EEN57961.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] Length = 1774 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 55/194 (28%), Gaps = 30/194 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 + +K + + + D +G++ Y+ R Sbjct: 1148 SGSVSVSDFDTVKQ---FVVAVVSAFTIGLADTRVGVLQYSDRNTLGCNLGDHPDEASFV 1204 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + AM+ A Q P K +I LT Sbjct: 1205 SSINTMTRQ--GGGTSTGAAMEFARQ-----------------NAAWRPAPVPKIMIVLT 1245 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++++ D+ + + I + S + LL+ + P + + + + Sbjct: 1246 DGKSSDSVVAAAHALAADQ-----VTVFAIGVG-SFDHSELLEITNNKPSQVFELADFNV 1299 Query: 398 LIHVFQNISQLMVH 411 L I + + Sbjct: 1300 LAQSINRIVDGVCN 1313 >gi|304312981|ref|YP_003812579.1| hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] gi|301798714|emb|CBL46947.1| Hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] Length = 979 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 60/164 (36%), Gaps = 27/164 (16%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILT 306 K MG+ + V++ + + K +Q R+ + + T A+ A Q Sbjct: 120 KPGSRMGIWTFGQHVDRLVPLATVDAKWKQAAAREANRISSSSLYTAIGDALDAAIQ--- 176 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--------NTIKICDKAK 358 G P +++ ++ L+DG + K+ ++ + Sbjct: 177 --------------GDLKPDPAWERSVVLLSDGMVDISKNPADNQREQQRIFQEVVPRLV 222 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K+ ++++ + + L + ++ + +AD L+HVF Sbjct: 223 AGGYKVHAVALSEQADIEFLKRLAEATKGHFSIAHSADQLMHVF 266 >gi|256397309|ref|YP_003118873.1| hypothetical protein Caci_8209 [Catenulispora acidiphila DSM 44928] gi|256363535|gb|ACU77032.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 177 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 14/158 (8%), Positives = 41/158 (25%), Gaps = 2/158 (1%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R ++ + ++ + ++ + +LL+ F++ V + + A A Sbjct: 13 RRSLRRALRDDRGSLAMAVVIWAPVVVLLMAFVVDVGLLISDRTQASDYADQAARRVAQD 72 Query: 68 MVSN--LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + K + D + + + L T Q N + Sbjct: 73 IDQGWLKTHNVRGPNGEDPGIKVNVDPDTGDCVPDAEQYLLDNQITDTTITSCQVTGNPT 132 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + + + + D + Sbjct: 133 EVDLYVNPRITVTLQMQYKPLFVGFALKGDSTVTGTGS 170 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRLAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L ++ ++G++ + + E + R +T+ Sbjct: 319 TGNRLNRLNQAGQLFL-----LQIIELRSWVGMVTFDSAAHVQSELIQINSGSDRDTLTK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPTAASGGTSICSGLRLAFTVIKKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ + L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAARELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQL---MVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGVVSERSIQLESKG 493 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 62/180 (34%), Gaps = 30/180 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 M KNA ++ + L + + + N + + K+ + Sbjct: 106 YMPLAKNAAAAIVNKLPL-----DISECAVTSFNDVSFINTDFTRDRFKL---LQSIQTL 157 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + TD ++ K+ +K +IFLTDG Sbjct: 158 VPAGGTDYNKGFIKSNAGGLDILKKGL---------------HEKVLIFLTDG-----YG 197 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIHVFQNI 405 +VN +I +AK K+ I++ S + L + ++ +Y V++ + V+ +I Sbjct: 198 DVNPTEIIQQAKSIGAKVYVITLGMSA-PEELKRIVTATNGSYYENVISEQEINAVYMSI 256 >gi|1353722|gb|AAB01767.1| unknown [Naegleria fowleri] Length = 357 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 12/128 (9%), Positives = 39/128 (30%), Gaps = 15/128 (11%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ + + + +++ + T+ + + ++ Sbjct: 18 TNLSGGLFEGLRLIKQRTTCNEITSILLFTDGLANEGITNTSEIVSKMNTTIH------- 70 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ ++ I T + + L + +Y + N D + F N+ + Sbjct: 71 ---EEIRKQ-ITCFTFGFGSDTDANMLTSIAQAGNGLYYFLNNVDDIPKAFGNV---IGG 123 Query: 412 RKYSVILK 419 SV+ + Sbjct: 124 L-VSVVAQ 130 >gi|260813588|ref|XP_002601499.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] gi|229286796|gb|EEN57511.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] Length = 375 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 49/168 (29%), Gaps = 23/168 (13%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F I +G+I Y+TR + + K D + T Sbjct: 27 VKQFTKKAISGFDISPSGTQVGVIQYSTRTRQEFSLNSFLTKETLSSAIDEVQYMRGGTL 86 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + K +I +TDG + + + Sbjct: 87 TGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVAAPALE-- 131 Query: 354 CDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 A++ I + I + + L+ S+ V N + L + Sbjct: 132 ---AQQKGITVYAIGVSGYDADQ---LEQIASNNNTLAFVDNFNLLDN 173 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/340 (10%), Positives = 75/340 (22%), Gaps = 31/340 (9%) Query: 47 HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 K ++ SA + M ++ + Sbjct: 65 FLTKETLSSAIDE-----VQYMRGGTLTGKAIRYVTKYGFGKSDGARPGVPKVVIVVTDG 119 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 Y AV + ++ + N L L N Sbjct: 120 VSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIASNNNTLAFVDNFNLLDN---LRN 176 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 I F L +S M + + D + Sbjct: 177 TLLTGVCDGDDCTCTCICCRSPECFGCRLSTSVSIDMCFFTATLCRNPLDIIFLLDGSGS 236 Query: 227 -KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + + +I +G+I Y+TR + + K D Sbjct: 237 VGASNFEKVKQFTKKTISGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSAAIDEV 296 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A++ + K +I +TDG + + Sbjct: 297 QYMRGGTLTGKAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAV 343 Query: 346 SNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVS 384 + A++ I + I + + L+ S Sbjct: 344 AAPALE-----AQQKGITVYAIGVSGYDVDQ---LEQIAS 375 >gi|148555259|ref|YP_001262841.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] gi|148500449|gb|ABQ68703.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] Length = 625 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/279 (9%), Positives = 59/279 (21%), Gaps = 3/279 (1%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + ++I ALS+ + + F + + + + + + A LAGA N + Sbjct: 16 LARATRGAVAVIGALSLTVLVGMGAFAVEISRGYAADTANQRIADMAALAGALAYNVNSN 75 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + L A + S T I S Sbjct: 76 PSEMTATAKAVVVAQGLPASAATVALVTDSAT-SKQLVQVSVTTSVPIALGRVFSSALAY 134 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG-ERPIFL 192 + S T L ++ + G P Sbjct: 135 DVTATGSATTTATTTTAPPCIA-ALSSTPTYGITLSGGVSITSPGCAVNTNAGVTVPWGT 193 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 ++ + + + A L + + Sbjct: 194 TITAKQVNAGKGIDNPGKGITTSPTANDIVQNKASAATDWMKDDSTLKGLLCKVNQLSGY 253 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 R + T+ ++ Sbjct: 254 SDPDYADGNRSCTTPLVTPATQTSASTEDYILNYSPAAN 292 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 73/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + ME A A LA + + N D Sbjct: 202 AEARMEPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 66/216 (30%), Gaps = 32/216 (14%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV-------KEDVYM 254 + + + + +M LK++LL +D + + Sbjct: 322 AFAHELRNPGNTALVLDTSGSMEGERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQI 381 Query: 255 GLIGYTTRVEKNIEPSWGTEK---VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 LI Y++ ++ +K ++ R + T + A+ A+ + + Sbjct: 382 KLIPYSSEPQQPTRARVDKDKPATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGD 441 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISI 369 ++ +TDGE ++ + E+ I + I Sbjct: 442 IG------------------TVVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILY 483 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N Q + + + ++ +N D L F+ I Sbjct: 484 G-EANIQEMEELAQLTGGKTFDALNGD-LAAAFEEI 517 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|332262934|ref|XP_003280513.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Nomascus leucogenys] Length = 1164 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 59/202 (29%), Gaps = 31/202 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S I T M ++ + F + L+ ++ R E + Sbjct: 157 DGSGSISSGNFATMMNFVRAVISQF---------QRPSTQFSLMQFSNRFETHFTFEKFR 207 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T + A+++ L K + Sbjct: 208 RSSNPLSLLASVRQLGGLTYTATAIQKVVHQLFHASYG-------------ARRDATKIL 254 Query: 334 IFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE-- 387 I +TDG+ + + I + D A I I + + + N + L S P Sbjct: 255 IVITDGKKEGDSLDYKHVIPMADAA---GIIRYAIGVGSAFQNTNSWKELNDIASKPSQE 311 Query: 388 YHYNVVNADSLIHVFQNISQLM 409 + + V + D+L + + + + Sbjct: 312 HIFKVEDFDALKDIQNQLKEKI 333 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 67/221 (30%), Gaps = 44/221 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + LV+D SGSM K++ +K A L L+++ Sbjct: 46 ATRAPLDLALVIDRSGSM--------------------SGDKLSDVKTAALELLETLQPE 85 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + L+ Y++ V ++ + R+ + T P + +A + Sbjct: 86 D------TITLVSYSSDVSMHLMRTRADDAGQREARRALLALQARGGTALGPGLFRALEA 139 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIK 363 L ++ +DG N + A + Sbjct: 140 LE----------------GASDRTRMSHLMLFSDGIANAGEVRPSVLGARAAGAFGAGVS 183 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + T+ + N + + ++ + +++++ + + Sbjct: 184 VSTMGVGVDYNEDLMTRLADQGGGRYHFIQDSEAIASILDD 224 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/122 (18%), Positives = 43/122 (35%), Gaps = 24/122 (19%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD + A + K I+ ++DG S Sbjct: 768 GATDIFAGIDIALTKFS------------------NDTKTAKAIVVVSDG----KTSKSK 805 Query: 350 TIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 IK + AK+ +KI T+S+ + N L++ + +Y+ ++ L VFQ + Sbjct: 806 MIKAINDAKKQGVKIYTVSMGKKSQINDATLMQLSTETGGAYYHALDNLQLHQVFQKLID 865 Query: 408 LM 409 + Sbjct: 866 AI 867 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 190 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 248 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ + Sbjct: 249 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSE- 294 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 295 --KDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 352 Query: 407 QLMVHR 412 + Sbjct: 353 DRIFSL 358 >gi|311251228|ref|XP_003124501.1| PREDICTED: integrin alpha-D-like [Sus scrofa] Length = 230 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 64/190 (33%), Gaps = 25/190 (13%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +KN + +D S + L+ Y+ ++ + S D Sbjct: 28 QNDFKQMKNFVRAVMDQFMDTS-----ILFSLMQYSNLLKTHFTFSQFQTHRSPQSLVDP 82 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + ++ + L K +K ++ +TDG+ + Sbjct: 83 IVQLKGLTYTATGIQTVVKELFHSKNG-------------ARSSAKKILLVITDGQK--Y 127 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVS--SPEYHYNVVNADSLI 399 K + + +A++ + I + + ++ L S S ++ + V N +L Sbjct: 128 KDPLEYEDVIPQAEKANVIRYAIGVGDAFQEHSAKQELSIIGSLPSKDHVFKVDNFAALS 187 Query: 400 HVFQNISQLM 409 + + + + + Sbjct: 188 SIQKKLQEKI 197 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 425 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 484 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 485 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 525 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 526 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 581 Query: 406 SQLMV 410 + + Sbjct: 582 IRGIC 586 >gi|260807874|ref|XP_002598733.1| hypothetical protein BRAFLDRAFT_230653 [Branchiostoma floridae] gi|229284007|gb|EEN54745.1| hypothetical protein BRAFLDRAFT_230653 [Branchiostoma floridae] Length = 794 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 62/190 (32%), Gaps = 29/190 (15%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 + ++ L F+ + V + ++G++ ++TR + R Sbjct: 304 SSGSMRGERIQKLNQVAQHFIRNT-----VADGSWLGIVDFSTRATTAHALIQVSDDSAR 358 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + L T + + Q+L ++ + + ++ ++ Sbjct: 359 DQLAAAVPNSTLGWTCIGCGLLEGIQVLEANGRNA----------------AGGILLVIS 402 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN---VVN 394 DGE N + T + + + + TI+ + + + L + + N Sbjct: 403 DGEEN--QHPNITEAMPTVL-DKGVIVDTIAYSDAADAN-LESLAARTGGMAFFYPEGDN 458 Query: 395 ADSLIHVFQN 404 + +L F Sbjct: 459 STALNDAFTA 468 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 65/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKIGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGEAINLAVRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + E I++ + + Sbjct: 192 AIVVITDGENHEGGAVEAAKVA----AEKGIQVSVLGVGMPDGAPIPIEGTNDYRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + V N +S I+Q + Sbjct: 248 NVIVTRLNEAMCQEIAKEGKGIYVRVDNTNSAQKA---INQEV 287 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 400 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 459 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 460 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 500 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 501 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 556 Query: 406 SQLMV 410 + + Sbjct: 557 IRGIC 561 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 40/356 (11%), Positives = 91/356 (25%), Gaps = 51/356 (14%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + L+ G++ + D ++I+ G A F + N+ Sbjct: 271 TDENSNVLNHQGNKDAGETWDGITTWNGDPANLTHSYIEYGGVGDEADFALRKFAKETNT 330 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + + + + ++ + + Sbjct: 331 PGLF-------DVYLNVRGNVQRQIDPIDVVLVVDWSGSMNEMGR----IAEVKKGVDRF 379 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + I S M S CQ K + +K + Sbjct: 380 LNQIEGSGI----QDSVYMGYVGYSSDGSNYQNKTCQLG---KFSEVKETIRSMTP---E 429 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + GL + ++ + D + + ++ Sbjct: 430 TAAGGTFTQRGLRQAGDMLSTQ-------NGHKKVIVLLTDGVPTYSYHVS-------KV 475 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENF 361 T + T F + + F F D +NN N TI KE Sbjct: 476 HTQADGSYYGTVFSLTQDQPMNTSFLYNGYFAFDQQNNYKWINNTFIATIGEAMALKERG 535 Query: 362 IKIVTISIN---------ASPNGQRLLKTCVSSPE----YHYNVVNADSLIHVFQN 404 I+I + I + ++ ++ VS+ E Y+ + A + + Sbjct: 536 IEIHGLGIQLQGDQTAGYTKADVEKKMRQMVSADEDGHLYYESANEAADIADYLEK 591 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 33/393 (8%), Positives = 80/393 (20%), Gaps = 62/393 (15%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF-TNFFRQGVKIPSLPFQKFI 333 ++ + D + + ++ F+ T F + + S + Sbjct: 368 GHKKVIVLLTDGVPT--------FSYTVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGS 419 Query: 334 IFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINASPNGQRLLKT--------- 381 D N N + N TI K+ I+I + I + + L Sbjct: 420 YNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMRE 479 Query: 382 ---CVSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 480 MVSADENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|119716456|ref|YP_923421.1| TadE family protein [Nocardioides sp. JS614] gi|119537117|gb|ABL81734.1| TadE family protein [Nocardioides sp. JS614] Length = 150 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 49/143 (34%), Gaps = 5/143 (3%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + +R R + E+ + S+ FA + ++ ++L+G + + + + SA A Sbjct: 2 MTARIRSLTTRRSRDERGSISVWFATAALAMIILVGMAVDLGGKVHTQQQARSAAAQAAR 61 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 GA ++ + + G+ AK A +D + + + Sbjct: 62 TGAQEVQGSTAVRGEDLRVDLTAAKTAAMDYLRA-----AGVEGTVRVVDGDTLIVTTTD 116 Query: 123 NSSRISMTHMANNRLDSSNNTIF 145 + + + + + + Sbjct: 117 TYTSKFLGIIGLDTMQVTGEASA 139 >gi|297265788|ref|XP_001107747.2| PREDICTED: vitrin-like isoform 3 [Macaca mulatta] Length = 693 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 526 FRTVLQFVTNLTKEFEISDTDTRVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSG 585 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 586 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 626 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 627 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 682 Query: 409 MVHR 412 + Sbjct: 683 ICTE 686 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 51/182 (28%), Gaps = 31/182 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 L MG++ Y + + ++ + + ++ A+ Sbjct: 335 AQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAIS 392 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S + ++ + DG + + A+E Sbjct: 393 FVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD-----KVEEASRLARE 434 Query: 360 NFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I I I++ + ++ C ++ Y ++V + L Q + + + Sbjct: 435 SGINIFFITVEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVC 494 Query: 411 HR 412 Sbjct: 495 DT 496 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 33/393 (8%), Positives = 80/393 (20%), Gaps = 62/393 (15%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF-TNFFRQGVKIPSLPFQKFI 333 ++ + D + + ++ F+ T F + + S + Sbjct: 368 GHKKVIVLLTDGVPT--------FSYTVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGS 419 Query: 334 IFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINASPNGQRLLKT--------- 381 D N N + N TI K+ I+I + I + + L Sbjct: 420 YNAPDQNNINKRINSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMRE 479 Query: 382 ---CVSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 480 MVSADENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQNELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 66/185 (35%), Gaps = 20/185 (10%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ A K AL + ++ + +GL Y V+ +P+ Q +T Sbjct: 57 TKIEAAKKALTGVVGAL------PDTAQVGLRVYGATVDGKGKPTPAACADTQLITPI-- 108 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T PA+ A + + + + + +K ++ I+ ++DGE + Sbjct: 109 -----GTIDKPALTSAISAINALGETPI-AHSLTEALKDLGPTGKRNIVLVSDGEESCTP 162 Query: 346 SNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVF 402 K + +I T+ + + L+ + +Y+ +A +L Sbjct: 163 DPCP---AVKKLTAGGVDLQIDTVGFGVNAKARSQLQCIADAGKGTYYDAKDAPALAASL 219 Query: 403 QNISQ 407 +SQ Sbjct: 220 SKLSQ 224 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 46/134 (34%), Gaps = 16/134 (11%) Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E R + +D T+ +++ ++I + + Sbjct: 221 REAWRSEMHALVDTLQPGGGTNIYEGLERGFEIAKEARVNHPDRA--------------Q 266 Query: 332 FIIFLTDGENNNFKSNVNTI-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 +I L+DG ++ +I + + E + + T+ + AS N + + Y Sbjct: 267 RVILLSDGLATEGITDSASIIALSEAFIEGGMGLTTVGVGASFNVELMRGLAERGAGNFY 326 Query: 391 NVVNADSLIHVFQN 404 V + +++ VF Sbjct: 327 FVEDPEAVREVFTE 340 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|146338945|ref|YP_001203993.1| hypothetical protein BRADO1888 [Bradyrhizobium sp. ORS278] gi|146191751|emb|CAL75756.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 408 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/343 (9%), Positives = 85/343 (24%), Gaps = 23/343 (6%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ S+IFA++++ ++ IG + + ++ A ++A+LAG + S Sbjct: 17 FWRAQDGQISMIFAIAIIPIVIAIGTAVDFSKSSDTRTQLQKAVDSAVLAGVVQ-PSGQQ 75 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + R + F N + +T M Sbjct: 76 VSTANAIFRGDFGGRFGTAASASFTANSDGS-----------LTGSASATVNTSFLTVMG 124 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + L S + S L + + Sbjct: 125 TSSLGISASATAKPGAQSKSPVCILLVSTLNSQSLLVNSGAQLNAPTCEVHVLSTQNPAA 184 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM------AALKNALLLFLDSIDLLSH 247 L+ C S + + + + + + Sbjct: 185 LFNATLNVKRICIKGSTIIKNGGVTPPAETSCAAISDPFAGSLPTVTVGSCTTNNKVYDP 244 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + G+ +T + + Y+ + + T + + + ++ Sbjct: 245 GSVTLSPGVYCGSTNFNGSGTLTLNPGL---YIIKGTMTFNSGWTVTGNGVT--FYLVDQ 299 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + +F N + + +++ G + T Sbjct: 300 NATLTFNGNVNATLAAPTTGTYANILMYEPTGLSTTNLPINGT 342 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 51/200 (25%), Gaps = 27/200 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + +K A + +D + + ++ + R + + + + + Sbjct: 59 SGSMTGKPIKTVKEAAMRLVDGLGASD------RLSVVAFDHRAKVIVPNQP-VDDIERV 111 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T MK + + K I LTDG Sbjct: 112 KQAIERLKPEGGTSIDEGMKLGIKEVALGKDDRVSQ-----------------IFLLTDG 154 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSL 398 E N N +K+ A E I + T+ N L + Y +L Sbjct: 155 E-NEHGDNERCLKLAQVAAEYNITVNTLGFGNHWNQDVLESIADAVGGTLCYIEQPEQAL 213 Query: 399 IHVFQNISQLMVHRKYSVIL 418 F + + + Sbjct: 214 TE-FSRLFTRIQSVGLTNAY 232 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Ailuropoda melanoleuca] Length = 891 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 75/226 (33%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL LD + ++ Sbjct: 284 SVVFVIDVSGSMH--------------------GRKMEQTKDALLKILDDMKEEDYLNII 323 Query: 252 VYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + + ++ R +V D T+ + +A +L ++ Sbjct: 324 LFSGDVTIWRDSLVQATP--ENIQEARTFVKNIHDQ---GMTNINDGLMRAINMLNKARE 378 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 +I LTDG+ N +S I+ A + + Sbjct: 379 EHRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGF 427 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 428 GNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 473 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca] Length = 854 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 75/226 (33%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL LD + ++ Sbjct: 249 SVVFVIDVSGSMH--------------------GRKMEQTKDALLKILDDMKEEDYLNII 288 Query: 252 VYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + + ++ R +V D T+ + +A +L ++ Sbjct: 289 LFSGDVTIWRDSLVQATP--ENIQEARTFVKNIHDQ---GMTNINDGLMRAINMLNKARE 343 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 +I LTDG+ N +S I+ A + + Sbjct: 344 EHRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGF 392 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 393 GNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 438 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 184 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 242 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 243 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 ---DNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 346 Query: 407 QLMVHR 412 + Sbjct: 347 DRIFSL 352 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|126658524|ref|ZP_01729672.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] gi|126620266|gb|EAZ90987.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] Length = 610 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 48/176 (27%), Gaps = 24/176 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ A +D + + +I Y E + ++ + Sbjct: 57 LSNAIQATEKLVDFLSPDD------LLSVIIYDDVAEVIVP-HQAVTNKQEIKAKIKKIR 109 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T+ + Q S ++ LTDG N + Sbjct: 110 ARGCTNLSGGWLLG----------------CSQVKSHLSTDKLNRVLLLTDGLANIGERK 153 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +K + + I T+ ++ N L+ ++ Y + + D VF Sbjct: 154 PEILLKTAAEKAQQGIVTTTLGFGSNFNEDLLIGMADAAGGNFYFIQSPDDSADVF 209 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQNELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPAAQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQHGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|47220812|emb|CAG00019.1| unnamed protein product [Tetraodon nigroviridis] Length = 1557 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 64/200 (32%), Gaps = 31/200 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V+ + D+ K+ ++ + LD I + + + ++ + S + Sbjct: 896 VDGSWSIGDENFMKITRFLHSTVGSLDLIGT-----DGTQVAIAQFSDDARTEFQLSSHS 950 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K S T + A+K + + S + K + Sbjct: 951 NKEALLEAIQKISYKGGNTKTGRAIKHVKESIFSLEAG-------------ARRGVPKVL 997 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYN 391 + LTDG + + + K+ + + + I I + G+ L S P + + Sbjct: 998 VVLTDGRSQDDVN-----KVSKEMQMDGYIIFAIGFADADYGE--LVNIASKPSDRHVFF 1050 Query: 392 VVNADSLIHVFQNISQLMVH 411 V + + I + ++ Sbjct: 1051 VDDL----DAVKKIEEQLIT 1066 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 50/163 (30%), Gaps = 20/163 (12%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 V++GL Y+ + + K T + A+ Sbjct: 82 CVFLGLAQYSGDPRIEWHLNAYSTKDAVIDAVRNLPYKGGNTLTGLAL------------ 129 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +F + K I +TDG++ + + + I++ I + Sbjct: 130 -NFILENCFKPESGSREGLPKIGILITDGKSQDDVVPP-----AESLRNAGIELFAIGV- 182 Query: 371 ASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQLMVHR 412 + + L + + YNV + + + + +++ + + Sbjct: 183 KNADENELQSIASPPEDTHVYNVADFSVMNSIVEALTRTVCEQ 225 >gi|322437225|ref|YP_004219437.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321164952|gb|ADW70657.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 528 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 21/220 (9%), Positives = 54/220 (24%), Gaps = 21/220 (9%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME---------SANNAA--- 60 + E + ++I A +++ L + + V Y+K ++ S + Sbjct: 74 RIFRDEDGSATLIAAFGMVAILGFLALAVDVGQLRYQKRGLQKAADAAALASVLEMSYCD 133 Query: 61 -ILAGASKMVSNLSRLGDRFESISNHAKRALIDDA--KRFIKNHIKESLSGYSAVFYNTE 117 LA + S + + + N + + NH + + T Sbjct: 134 GTLACGVMQTAAKSAMVENGLTPDNIVTACGTVPSTGLTLMINHGPCAKGASDPNYGKTS 193 Query: 118 IQNIVNS---SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 ++ I + + + L Q + Sbjct: 194 SVEVLVMQAQPTIFAKVLGLSTGTVGARAEASTTGGTNCI-FALDPTGS--GALTVQGLA 250 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 S + L V+ + + + + Sbjct: 251 SITSPCGIMVESSSNSALSCAVLGVITASAINVVGGVSNF 290 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 81/250 (32%), Gaps = 50/250 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + ++V A ERP + ++D+SGSM A K+ Sbjct: 195 DTRLVRIGLAGYEAPKAERPAANLVFLLDVSGSMSSAD-------------------KLP 235 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K A+ + + V ++ Y +EP+ + ++ + + Sbjct: 236 LVKTAMKTLVGQLTPKDRVS------IVVYAGAAGLVLEPTSDSREIMAALDQLQAGGST 289 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV- 348 ++ AY++ + K +I TDG+ N S+ Sbjct: 290 ---AGGAGLELAYKVAEASKVDGI-----------------NRVILATDGDFNVGLSDND 329 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQ 407 ++ + ++N I + + + L++ + + +A I + + + Sbjct: 330 KLLEYVEDKRKNGIAMSVLGFGRGNINEALMEQIADKGNGNYGYIDSA---IEARKVLGE 386 Query: 408 LMVHRKYSVI 417 + Y++ Sbjct: 387 QLGATLYTIA 396 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 66/216 (30%), Gaps = 32/216 (14%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV-------KEDVYM 254 + + + + +M LK++LL +D + + Sbjct: 301 AFAHELRNPGNTALVLDTSGSMEGERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQI 360 Query: 255 GLIGYTTRVEKNIEPSWGTEK---VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 LI Y++ ++ +K ++ R + T + A+ A+ + + Sbjct: 361 KLIPYSSEPQQPTRARVDKDKPATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGD 420 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISI 369 ++ +TDGE ++ + E+ I + I Sbjct: 421 IG------------------TVVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILY 462 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N Q + + + ++ +N D L F+ I Sbjct: 463 G-EANIQEMEELAQLTGGKTFDALNGD-LAAAFEEI 496 >gi|149928031|ref|ZP_01916279.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] gi|149823219|gb|EDM82455.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] Length = 543 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 45/157 (28%), Gaps = 10/157 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIG-FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 S++ +SI+ ++ L +G + ++K ++ + L A +M + Sbjct: 8 RQSQRGAYSILTVFVLIMSLGALGVLAV--GHMAWEKTRLQGVADLVALTAARQMSNGPE 65 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + +++N + + I + T IV + + Sbjct: 66 FAEAQAIALAN-----GVSEDDDLTIECIIDDAPTNDCDNSITSRVTIVRPVNGLLVFLP 120 Query: 134 NNRLDSSNNTI-FYNMDVMTSYDYR-LQFIEHLLNQR 168 N + + S + + L Sbjct: 121 NREVTVLAEATVAPTVVGSVSSGLVSVDTNQSALLNG 157 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 53.7 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 11/147 (7%), Positives = 40/147 (27%), Gaps = 10/147 (6%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS----N 71 A ++ ++ AL++ L+ I ++ + ++ ++ + A L A++ + Sbjct: 11 ARQRGAIGLMAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATRGGNCGAGA 70 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKES------LSGYSAVFYNTEIQNIVNSS 125 + + + N+ + + +A + Sbjct: 71 TANAYAQASVVRNNFPIPSAGRTLAVACGTLNLDASNLRVFAVNAASTEAIRVVVSHTVP 130 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMT 152 + + + N Sbjct: 131 QSFAGAIGGLFGGAGRNATINLSATAV 157 >gi|307609426|emb|CBW98915.1| hypothetical protein LPW_07021 [Legionella pneumophila 130b] Length = 1169 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/250 (10%), Positives = 70/250 (28%), Gaps = 21/250 (8%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P++ + + + + + + + ++ Sbjct: 292 PSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVNN 351 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +++ L H+K + + + R ++ T S Sbjct: 352 AINAF--LPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTV-------GTTSGNC 402 Query: 298 MKQAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKI 353 ++ Y IL SD + + + F DG + N ++ + I Sbjct: 403 PQKQYIILISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINE 462 Query: 354 CDKAKENFIKIVTISINASPN------GQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K + + I I + A + L+ + +Y + ++L+ +I Sbjct: 463 IKALKNDGVLIFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPETLVSSLNSI 522 Query: 406 SQLMVHRKYS 415 + + +S Sbjct: 523 LSNIQNGSFS 532 >gi|296194626|ref|XP_002745014.1| PREDICTED: integrin alpha-2 [Callithrix jacchus] Length = 1208 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/282 (14%), Positives = 90/282 (31%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 126 TSTSIPNVTEIKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISSDFQL 185 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A+ + P ++ +C + A+KN L F+ + + LI Sbjct: 186 SSSFSPAVQTCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVALIQ 242 Query: 259 YTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 Y ++ T++ T T++ A++ A + S Sbjct: 243 YANNPRVVFNLNTYRTKEEMTLATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 296 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I+ C+ + I I++ N Sbjct: 297 -------GRPSATKVMVVVTDGESHDGSMLKAVIEQCNH---DNILRFGIAVLGYLNRNA 346 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 347 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 388 >gi|224534421|ref|ZP_03674999.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] gi|224514523|gb|EEF84839.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] Length = 333 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + + + + Sbjct: 110 SSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + ++ + A+ +KI +I I +S + L++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFGVEFKLRSGKFYQGSLREVYDPSMLVE 262 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 263 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/338 (12%), Positives = 83/338 (24%), Gaps = 65/338 (19%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + ++ E + ++ + H ++ T + Sbjct: 163 VYPLKTDGKATRTLEEFSVSLTIKSRHAVQNVYSPTHAVNTVRKSDKEVSVTFERKQALL 222 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + D + + Y + Sbjct: 223 DKDFQLFYGHGDKDIGLSPLVYKPIQTEDGYFMFLISP------------------QVEA 264 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 +R + LV+D S SM KM K A+ L + Sbjct: 265 EKKRVARDLVLVLDTSSSMSDI--------------------KMQQAKKAVKFCLSQLQP 304 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 G++ ++T V K + T+ + T PA+ A Sbjct: 305 ED------RFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGLKTSGGTAIWPALNDAL 358 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF- 361 + + ++F TDG+ ++N + I AK Sbjct: 359 AM------------------RSSDPSRPFTMVFFTDGQPTVDETNADKIVKNVLAKNTGN 400 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 +I T + N L + S+ V A+ + Sbjct: 401 TRIFTFGVGDDVNAAMLDQLADSTRAVSTYVREAEDIE 438 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/339 (8%), Positives = 88/339 (25%), Gaps = 28/339 (8%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ + + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIALAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-SGSM 203 + + + + + + + G I + Sbjct: 191 PIQGQMNIAVSFVFDTSGSMNWDLQGRNV---------EKTGNESRMDILRKKSVIMIKD 241 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + + ++ + + L N + +I ++ D G+ + Sbjct: 242 LAEIGNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPD---GVTNPGDGL 298 Query: 264 -EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI-LTSDKKRSFFTNFFRQG 321 I +++ Y+ D + + A+ ++ L+ R F N Sbjct: 299 RYGMISLQSQPAQLK-YIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDL 357 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI---KIVTISI-NASP---N 374 + + L ++ + + +K+ + ++ I Sbjct: 358 SPTLGYEYSRLGYDL--YSRDSITRENSIAYAGEVSKKFGLGIKRVNVIGFSGVDHEIAY 415 Query: 375 GQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 GQ L + + N ++L F +I + + Sbjct: 416 GQSLTDRIGEGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 62/188 (32%), Gaps = 30/188 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 ++ + + +G++ Y RV V + V + T Sbjct: 170 DFLINVLQKFYIGPGQIQVGVVQYGERVVNEFRL-DDFRTVDEVVAAAKNIDQRGGEETR 228 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A +K +I +TDGE+++ + + Sbjct: 229 TALGINVARTQAFKHG---------------GRPDAKKVMIVITDGESHD---SPDLKAA 270 Query: 354 CDKAKENFIKIVTISI-NA----SPNGQRLLKTC---VSSPE-YHYNVVNADSLIHVFQN 404 ++++++ I + I++ N + L+ + P+ + ++V + +L + Sbjct: 271 VEESEKDNITLYGIAVLGYYNRRGINPEAFLREIKFIATDPDEHFFSVTDESALKDIVDA 330 Query: 405 ISQLMVHR 412 + + + Sbjct: 331 LGEKIFSL 338 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/359 (8%), Positives = 78/359 (21%), Gaps = 37/359 (10%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 S + + + F + ++ ++ S+ Sbjct: 109 SAPGTTATYEIVVDAYRVLGNGQEEVYFSFPQPPYEYT-RQTETSTAKLDFSLSFSQPEY 167 Query: 130 THMANNRLD-------SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI---VSFIPA 179 + + + + + + + + + Sbjct: 168 AKPPDGDAQGRLDVTLIPQGGVPAPVRPPIDVVFVMDVSGSMTTMKLQSAKSALQAAVNY 227 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----KNAL 235 LI D+ + +V S + ++ A +A Sbjct: 228 FKTNYHPNDRFALIPFSDDVKATSVVP-FGSKSNVISQLDAILDEGNRLTANGGTNYSAA 286 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---------WGTEKVRQYVTRDMDS 286 L S K+ + G T + + EK+ + Sbjct: 287 LSLAQSYFNDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGL 346 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T K Y L + + + + GE K Sbjct: 347 YSDGKMTSIIFTDKDGYDRLFYNNHIDYVNGWLFSND-------NGYSFTYAWGE---GK 396 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + + + N I + +I + L+ ++ +L +FQ Sbjct: 397 AYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMDYLRALSTTAGGEVRQGTPQNLTELFQ 455 >gi|163750668|ref|ZP_02157905.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161329663|gb|EDQ00654.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 1180 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/298 (9%), Positives = 68/298 (22%), Gaps = 55/298 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + ++ + FL + S +++ + + Sbjct: 186 SSSPVPYTYITSSASSGDRAAALDKAFLTNFGLGKSVTLYTERYVNWY----HGRKPKEW 241 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGTEKVRQ 278 +T+M K + + + + Y T + + + Sbjct: 242 KTRMEIAKRVMEDVVVTTPSVDFGLAVFNYNTSNYNTTYRNDGGRIISGIQRLNAIEKKS 301 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS------------ 326 V + T + +AY + + S Sbjct: 302 LVQTINNLSAETWTPLCETLYEAYLYFGAKAPLYALEAGIASPPRDTSIEREGKYLSPFK 361 Query: 327 ---LPFQKFIIFLTDGENNNFKSNVNTIKIC----------------------------D 355 + +I+++TDG N + T+ Sbjct: 362 ESQCSNRSYIVYITDGAPTNDSAANYTVSQLPGYDKNNIVNGSYLPALTSIMNNEDINTK 421 Query: 356 KAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A + F+ TI + L K +Y +A L +I ++ Sbjct: 422 LAGDQFVSTFTIGFSDGATSAAPILKKAAELGGGDYYAAKDAIKLQSALSDIFSQILE 479 >gi|32477946|ref|NP_870940.1| hypothetical protein RB13238 [Rhodopirellula baltica SH 1] gi|32448503|emb|CAD78018.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 164 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 36/123 (29%), Gaps = 5/123 (4%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS-KMVSNLSRLG 76 + + FA+ + +L + ++ + ++ + A A MV + Sbjct: 16 RQGAALVEFAVVLPVIMLFLTAMVEISRILM----LQHTADTAAYEAARCAMVPGATVTE 71 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 +E+ + L + A I E + + N S + + Sbjct: 72 AEWEAYALIEAAGLTNTAVTVTPAEITEETAFITVRVEVPANDNSWMLSSQFTDVVVASE 131 Query: 137 LDS 139 + Sbjct: 132 VTL 134 >gi|163815330|ref|ZP_02206705.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] gi|158449304|gb|EDP26299.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] Length = 348 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/195 (10%), Positives = 50/195 (25%), Gaps = 27/195 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL------ 289 L+ + G++ + T + E +++ + SL + Sbjct: 114 ENLLEELKKTVDELHGERFGIVIFNTSPVLLSPLTDDYEFIKEQLDMIDKSLKVRNSEDD 173 Query: 290 ---KPTD--------STPAMKQAYQILTSDKKRSFFTNFFRQGVKIP----SLPFQKFII 334 TD + ++ ++++ S K +I Sbjct: 174 SDLYSTDLSSMYDWLYYEGYITSGTLVGNEQRGSSLIGDGLAAAACDFSDKDKTRTKIMI 233 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNV 392 F TD + K N + + + N + + K S+ + Sbjct: 234 FSTDNDIQ-GNPVATLDDAASICKNNGVTVYGVGTKEMTDENRESMKKAVESTGGQFFME 292 Query: 393 V---NADSLIHVFQN 404 D ++ + Sbjct: 293 EESGTFDQIVTAIEK 307 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 232 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYM 291 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 292 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 332 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 333 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 388 Query: 406 SQLMV 410 + + Sbjct: 389 IRGIC 393 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 86/303 (28%), Gaps = 73/303 (24%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 T Q + + D S + + Y D + L + + Sbjct: 233 AVVTIEQKDCVPTNDFRLFFDTAKTDLSASVLTYRPDKSEDGYFLLLASPPVEEVGDVKT 292 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + I VVD SGSM K+ K Sbjct: 293 KKTVI-----------------FVVDRSGSM--------------------SGEKIEQAK 315 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILK 290 A L++++ +I Y + VE + + + + Sbjct: 316 EAAKFVLNNLNEGD------LFNIIAYDSDVESFEPELQKLDDKTREKALGFVDNLYAGG 369 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A+ +A +L DK+ +++FLTDG + + N Sbjct: 370 STNIDGALAKAMGMLKDDKR-------------------PSYMLFLTDGLPTHGEQNE-- 408 Query: 351 IKICDKAKENF---IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI----HVFQ 403 KI D AK+ ++++ + N + L + V + + ++ Sbjct: 409 AKIVDNAKQKNDVRARVISFGVGYDVNSRLLDRLSRECFGQSEYVRPNEDIETHVAKLYN 468 Query: 404 NIS 406 IS Sbjct: 469 KIS 471 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|268325023|emb|CBH38611.1| hypothetical protein, containing PKD domain [uncultured archaeon] Length = 1152 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 58/236 (24%), Gaps = 33/236 (13%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 +I G + V+D +GSM + + ++ + F Sbjct: 300 QIFYGTGLSLDLIFVIDTTGSMGDDIANVKASASTIVNEIEAIIPDYQVAVVDYRDF--- 356 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT-DSTPAMKQ 300 V G + + S + + A+ Sbjct: 357 -----PVDPYGGDGDYPFND----VLPFSTDKAAIISAIQGLTLGWGGDWEESVYSALMH 407 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV----NTIKICDK 356 + G K II + D ++ + ++ I + Sbjct: 408 SIDA-------------GSLGGWRGEDQALKAIILMGDAPPHDPEPFTGYILTSVAIAAE 454 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + I TI I + + + + NA+ ++ + + R Sbjct: 455 LADP-VHIYTIQIGGPV--GKFAELASQTGGEVFTAENAEEVVDAILEAIEEITKR 507 >gi|3982895|gb|AAC83698.1| complement factor Bf-2 [Oncorhynchus mykiss gairdneri] Length = 749 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 59/193 (30%), Gaps = 19/193 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +NA+ + + + + L V ++ K + V D+D+ Sbjct: 255 FNKARNAVKKLITKVSSFAVSPNYEILFLASDVFEVVNILDFLGEKRKTLEDVLADLDNF 314 Query: 288 ILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + A++ + +Q + +IF TDG N Sbjct: 315 NYGDRQNVGTNLNLAFKTILERM------AIQKQRNETLFKEVHHVLIFFTDGAFNMGGR 368 Query: 347 NVNTIKICDKA--------KENFIKIVTISIN---ASPNGQRLLKTCVSSPEYHYNVVNA 395 +T+ + ++ ++ I + + Q L+ + ++++ + + Sbjct: 369 PDDTVAKVREMVYMNQKEERDKYLDIYVFGVGREIFDEDIQPLVTK-RDNEDHYFKLKDG 427 Query: 396 DSLIHVFQNISQL 408 L F I Sbjct: 428 TELEETFDKIIDE 440 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 61/186 (32%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V++ V T++ Sbjct: 526 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKEVVEAASHIEQRGGTETRT 584 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ + + K+ + Sbjct: 585 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHD---SPDLEKVIED 628 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 629 SEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 688 Query: 407 QLMVHR 412 + Sbjct: 689 DRIFSL 694 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 75/235 (31%), Gaps = 35/235 (14%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R IE+V+D+SGSM ++ E ++A Q + + K+A S+D Sbjct: 352 AQRSGVDIEIVLDVSGSMA----TESEVQDAAGNVQRHGFSTLDVCKHAARCVACSLDDT 407 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQI 304 +GL+ Y + + + T V ++ L T+ ++ Sbjct: 408 ------CRLGLVAYDAQARVVVGLARVTPAHVAKVHAALEKLAPGTSTNLWGGLELGVDE 461 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENF-- 361 L + + ++ LTDG NN + + ++ Sbjct: 462 LVGGAGDNA-----------------RAVLLLTDGVPNNSPPEGEVAALRAKRLTKDGSE 504 Query: 362 -IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + + LL V +A + F ++ + + S Sbjct: 505 TVAVFAAGFGYALRSDLLLSLAREGGGLFSFVPDAGMVGTSFNHL---VASLRSS 556 >gi|298711243|emb|CBJ26488.1| similar to collagen, partial [Ectocarpus siliculosus] Length = 356 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 48/178 (26%), Gaps = 28/178 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 K + S + + + + + E +V D Sbjct: 91 WELSKEFAKNTVASFAEQNLFTNGGSASFAQFASDASEGGTF-YSLEDFNAFVDA--DVK 147 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD + + ++L++ F+I TDG + Sbjct: 148 HSGGTDIIDGIAKGRELLSAS------------------PAATSFMIVTTDGAAPD---- 185 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D A+ + + + + P+ + LL ++V N + L +I Sbjct: 186 --PQDEADAARAEGTILYAVGVGSGPSQENLLAI-GGDEANVFDVDNFEELDLALDDI 240 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 71/218 (32%), Gaps = 44/218 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + ++D+SGSM K+ LK+A L ++ + Sbjct: 328 PASNLTFLIDVSGSMSSH-------------------NKLPLLKSAFKLLVNQLREKD-- 366 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + ++ Y +EP+ G K + ++ + T +K AY++ + Sbjct: 367 ----KVSIVVYAGAAGVVLEPTSGNNK-EKIISALNNLQSGGSTAGGAGIKLAYKLAEKN 421 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTI 367 K+ +I TDG+ N S+ + + ++ +++ + + + Sbjct: 422 FKKKGNN----------------RVILATDGDFNVGASSDNDMKTLIEEKRKSGVFLSVL 465 Query: 368 SINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN 404 L+T H + N VF Sbjct: 466 GFGYGNYKDSKLETLADKGNGNHAYIDNMQEAQKVFGK 503 >gi|146303120|ref|YP_001190436.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145701370|gb|ABP94512.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 394 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/258 (9%), Positives = 65/258 (25%), Gaps = 67/258 (25%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 V L + N V E+ + ++ + ++ Sbjct: 35 VPLDLFIVLDVSGSM-GIIDNPPEVDDSLIAGTAEVDGHVVRYLKDDIGVN--------- 84 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ A+ L++ D + +I ++ V Sbjct: 85 ----------------NRLEVALEAIRNLLENADTS------TRVTIITFSDHVNVLCR- 121 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + T A+K+A ++ Sbjct: 122 ---RVTPSTALEHLEEIVPDGNTALYSAVKKAISLIDE---------------------H 157 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVS-SP 386 ++ +TDG + + K+ + + I I ++L++ S Sbjct: 158 PARVLLITDGYPTDVEDETEYSKL-------EVPRFSQFIPIGVGEYNAKILRSLADLSN 210 Query: 387 EYHYNVVNADSLIHVFQN 404 Y+V + + + + Sbjct: 211 GRFYHVNDVSEISRIMEE 228 >gi|3182939|sp|Q91145|COCA1_NOTVI RecName: Full=Collagen alpha-1(XII) chain gi|632648|gb|AAA80217.1| type XII collagen alpha-1 chain [Notophthalmus viridescens] Length = 929 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 57/182 (31%), Gaps = 20/182 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K +++ + V + + Y+ + + K Sbjct: 649 FKIVRNFISRVVEVFDIGSDRVQIAVSQYSGDPRTEWQLNTHKTKKSLMDAVANLPYKGG 708 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T++ A+K F + +K I LTDG++ + + Sbjct: 709 NTNTGSALK-------------FILENNFRPGVGMREKARKIAILLTDGKSQDDIVAPSK 755 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLM 409 + I++ + I + + L + E + YNV + L ++ ++++ + Sbjct: 756 RYA-----DEGIELYAVGI-KNADENELKEIASDPDELYMYNVADFSLLTNIVNDLTENV 809 Query: 410 VH 411 + Sbjct: 810 CN 811 >gi|257785062|ref|YP_003180279.1| Cna B domain-containing protein [Atopobium parvulum DSM 20469] gi|257473569|gb|ACV51688.1| Cna B domain protein [Atopobium parvulum DSM 20469] Length = 863 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 60/210 (28%), Gaps = 20/210 (9%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNIEP 269 ++ +S ++ A K + +I ++ K+ V M L+ + + Sbjct: 94 DNSSSMTAQTGGGEMRLDAAKRVVNQLSSTILGINRNAQKDVVEMALLSFNEKPNLECGW 153 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + ++ T+ A+++A + T +P+ Sbjct: 154 TADLNEFQRATNNMGFHT---GTNWESALERAKVLADQKAANGNPTYVIFVTDGLPTQDR 210 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE-- 387 ++ NN + + ++ + R L + Sbjct: 211 NGWVR-------NNQIGYEHALDEARAIGSAGYHFYSVYMYGGHAYLRQLTNYAYTGNPF 263 Query: 388 ------YHYNVVNADSLIHVFQNISQLMVH 411 Y+Y N + F+ I+ ++ Sbjct: 264 GNPGGTYYYEANNTAQMEQAFKEIASVITK 293 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/321 (10%), Positives = 90/321 (28%), Gaps = 42/321 (13%) Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 G E Q + + ++ + S ++ + Sbjct: 19 AGSGGVEIGPNGIETQGDADLPEMPKLGNGRAFFSTTADPGASAGLTPISAVRQIGYRFS 78 Query: 164 LLNQRYNQKIVSFIPAL--LRIEMGERPIFLIELV------VDLSGSMHCAMNSDPEDVN 215 + + + F+P + G+ + I + Sbjct: 79 PNFKSDGKVEILFVPLAPGGAVLDGKVRVSFIVTSNLRFEISGSAAISAVRPAPGGRPTQ 138 Query: 216 SAPICQDKKRTKMAALKN----ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + K A F++++ + + G+I + T VE+ Sbjct: 139 VVIDIDGSSSMRRSDPKRERVRAAKRFVETLSRVDKRNQF---GVIEFNTTVEERAPM-- 193 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ + T ++ ++ L K +++ Sbjct: 194 -GSGMKATSDAIQAVDAVGSTALYTSLIRSIDALEGSGKTG----------------YRR 236 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ------RLLKTCVSS 385 I+ LTDG++ + ++AK ++I +S+ + + + + + + Sbjct: 237 AILVLTDGKDTASSH--GVATVINRAKAAKVRIYVVSLGGAGDQKGLGYVGPMQRLTTET 294 Query: 386 PEYHYNVVNADSLIHVFQNIS 406 +V AD L+ F I+ Sbjct: 295 GGVFTHVDRADDLVARFDAIA 315 >gi|58429515|gb|AAW78161.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 554 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429457|gb|AAW78132.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429523|gb|AAW78165.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429475|gb|AAW78141.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429513|gb|AAW78160.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|21359974|ref|NP_444506.2| vitrin isoform 1 [Homo sapiens] gi|62702118|gb|AAF19243.2|AC007363_1 unknown [Homo sapiens] gi|16552271|dbj|BAB71279.1| unnamed protein product [Homo sapiens] gi|119620821|gb|EAX00416.1| vitrin, isoform CRA_a [Homo sapiens] Length = 693 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 526 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 585 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 586 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 626 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 627 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 682 Query: 409 MVHR 412 + Sbjct: 683 ICTE 686 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 37/141 (26%), Gaps = 18/141 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D L MG++ Y + T + + ++ A Sbjct: 331 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRA 390 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + S + ++ + DG + + A Sbjct: 391 ISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASRLA 432 Query: 358 KENFIKIVTISINASPNGQRL 378 +E+ I I I+I + ++ Sbjct: 433 RESGINIFFITIEGAAENEKQ 453 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT T K Sbjct: 428 FRTVLQFVANLSKEFDISDTDTRVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSG 487 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + Sbjct: 488 GTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGRSYDDVRIPAM 531 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A + I + + Q L+ + P ++ + V D+L I Q Sbjct: 532 V-----AHHKGVITYAIGV--AWAAQEELEIIATHPARDHAFFVDEFDNLYKSVPKIIQN 584 Query: 409 MVHR 412 + Sbjct: 585 ICTE 588 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 24/272 (8%), Positives = 63/272 (23%), Gaps = 38/272 (13%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSS--NNTIFYNMDVMTSYDYRLQFIEHLLN 166 + + + + M ++++ + T+ RLQ + Sbjct: 104 TTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALSKPSPSAGFTTSSLRLQPVGQ--- 160 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + + + + S+ DP D + Sbjct: 161 RSRELDLWCTTTYTNSQNSPQANPGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSS 220 Query: 227 ------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQ 278 ++ D L MG++ Y + ++ Sbjct: 221 IGKRRFRIQK-----QFLADVAQALDIGPAGPLMGVVQYGDNPATQFNLKTHMNSRDLKT 275 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ A+ + S + + + D Sbjct: 276 AIEKITQR--GGLSNVGRAISFVTKNFFSKSNGNRGGA-------------PNVAVVMVD 320 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 G + + A+E+ I I I+I Sbjct: 321 GWPTD-----KVEEASRLARESGINIFFITIE 347 >gi|312080770|ref|XP_003142742.1| hypothetical protein LOAG_07160 [Loa loa] gi|307762094|gb|EFO21328.1| hypothetical protein LOAG_07160 [Loa loa] Length = 634 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 63/181 (34%), Gaps = 20/181 (11%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLIL 289 K L D I+L + ++ +I Y+ + I+ + + +++Q + Sbjct: 464 KEVLRFVKDFIELFDVSPQQTHVAVIQYSDVIRHEIDLNQYSSMEQLKQAINGIEYL--T 521 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A++ R+G + + +I +TDG + + + Sbjct: 522 GLTRTGAAIQHVTD----------EGFSERRGARPIGNGVPRILIVITDGRSQDDVTI-- 569 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 A+ I++ + + + L+ S + ++V + L ++ Q + Sbjct: 570 ---AVRNARMKQIQLFAVGV-TNHALDSELEVISGSTKRTFHVNAFEDLNARLRSAIQKV 625 Query: 410 V 410 Sbjct: 626 T 626 >gi|217978821|ref|YP_002362968.1| hypothetical protein Msil_2684 [Methylocella silvestris BL2] gi|217504197|gb|ACK51606.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 429 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 43/140 (30%), Gaps = 1/140 (0%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ +++ L++ + +I + YKK M+S ++A L GA+ + S Sbjct: 20 RRFRYDRGGGVALMIGLALPVIIGMIALGTEISFLLYKKFQMQSVADSAALGGAAALQSG 79 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 G IS+ N+ + S + E+ + ++ Sbjct: 80 HPAPGIEARGISSFLGFVDGAAGVTVTVNNPPATGSA-ANNASAVEVIISQPQTLSMVSL 138 Query: 132 MANNRLDSSNNTIFYNMDVM 151 + + Sbjct: 139 FVSGLFTVGARAVATRGTTS 158 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDSLKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 46/146 (31%), Gaps = 18/146 (12%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 N + + + T+ P + + +IL + + Sbjct: 23 TNAKLCHDVPTATDLLGHLHPVIGGGGTNMAPGLFISREIL-------------ERPIFP 69 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + + ++ L+DG T +I + K+ IVT++ + L+ S Sbjct: 70 SQIYLRPVVVVLSDGLT---SHPAKTSEIATQLKK-DADIVTVAFGDDADEPYLISLATS 125 Query: 385 SPEYHYNVVNADSLIHVFQNISQLMV 410 S + Y+ L F ++ + Sbjct: 126 SE-HFYHCRTGTDLRAFFASVGTTLS 150 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 26/63 (41%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 I+ + + + + G LI + + M+S ++AA L G ++ + R + Sbjct: 2 ILVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGADQTTTRIAVVD 61 Query: 84 NHA 86 A Sbjct: 62 RAA 64 >gi|297265790|ref|XP_001107629.2| PREDICTED: vitrin-like isoform 1 [Macaca mulatta] Length = 678 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 511 FRTVLQFVTNLTKEFEISDTDTRVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSG 570 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 571 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 611 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 612 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 667 Query: 409 MVHR 412 + Sbjct: 668 ICTE 671 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 51/182 (28%), Gaps = 31/182 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 L MG++ Y + + ++ + + ++ A+ Sbjct: 320 AQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAIS 377 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S + ++ + DG + + A+E Sbjct: 378 FVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD-----KVEEASRLARE 419 Query: 360 NFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I I I++ + ++ C ++ Y ++V + L Q + + + Sbjct: 420 SGINIFFITVEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVC 479 Query: 411 HR 412 Sbjct: 480 DT 481 >gi|126316412|ref|XP_001380737.1| PREDICTED: similar to integrin, alpha 1 [Monodelphis domestica] Length = 1183 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 70/195 (35%), Gaps = 28/195 (14%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ FL+S+ + + +G++ Y V + T + + Sbjct: 189 YPWTSVTDFLNSLLGKMDIGPKQTQVGIVQYGENVTHEFNLNKYTTTEEVLIAANQIVQR 248 Query: 289 LK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + + A + + + +K ++ +TDGE+++ Sbjct: 249 QGRQTMTALGIDTARKEAFTKARG-------------ARSGVKKVMVIVTDGESHDNHRL 295 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADS 397 I+ C+ + I+ +I+I N L +K+ S + ++ +NV + + Sbjct: 296 NEVIQDCE---DEDIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELA 352 Query: 398 LIHVFQNISQLMVHR 412 L+ + + + + + Sbjct: 353 LVTIAEVLGERIFAL 367 >gi|58429525|gb|AAW78166.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRNNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG N+ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPNSIQDS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|297265794|ref|XP_001107688.2| PREDICTED: vitrin-like isoform 2 [Macaca mulatta] Length = 656 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 489 FRTVLQFVTNLTKEFEISDTDTRVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 549 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 589 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 590 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 51/182 (28%), Gaps = 31/182 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 L MG++ Y + + ++ + + ++ A+ Sbjct: 298 AQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAIS 355 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S + ++ + DG + + A+E Sbjct: 356 FVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD-----KVEEASRLARE 397 Query: 360 NFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + I I I++ + ++ C ++ Y ++V + L Q + + + Sbjct: 398 SGINIFFITVEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVC 457 Query: 411 HR 412 Sbjct: 458 DT 459 >gi|289679370|ref|ZP_06500260.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae FF5] Length = 120 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 51/154 (33%), Gaps = 35/154 (22%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VD+SGSM D +++ ++ L FL+ ++ +G Sbjct: 2 AVDVSGSMDYPDMQWKSDE----------VSRLVLVQQLLGDFLE-------GRKGDRVG 44 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + T+ ++ VR ++ + K T A+ A + L Sbjct: 45 LILFGTQAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL---------- 94 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + ++ +TDG NN + + Sbjct: 95 --------RMRPATSRALVLVTDGANNAGQIDPI 120 >gi|220912876|ref|YP_002488185.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859754|gb|ACL40096.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 333 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 56/192 (29%), Gaps = 16/192 (8%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-- 289 +A +D L+ + +GL + + + + R + D+ Sbjct: 103 SSADAAVVDVFAELAREFDGERIGLTIFDSTGSQVFPLTDDYGYARDQLLLARDAFDGKP 162 Query: 290 ------KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T L S T P++ + ++ TD + Sbjct: 163 GSSGFLDGTWGGRGSSLIGDGLASCLNGFPRTQDSDAAADNPAVKRSRSVVLATD-NYVS 221 Query: 344 FKSNVNTIKICDKAKENFIKIVTI-------SINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + + AK+ +++ + +A G +L S+ +Y + N Sbjct: 222 GNPILTLPQASALAKDRTVRVYALNPGDLDYGSDAGQPGAQLRVAAESTGGSYYALDNPG 281 Query: 397 SLIHVFQNISQL 408 ++ + + + Sbjct: 282 AVPGIVAAVQET 293 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 59/177 (33%), Gaps = 20/177 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 D + + +G++ Y E I + T++ + + T + Sbjct: 56 KWVSDLVATFDIGPDYTRVGVVVYAEEPEMAIALNQYTDRDSLIQAVGNITYLNGNTRTG 115 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + S + F + + I LTDG + N + Sbjct: 116 KAIRFMNEESFSIANGARDIEFG----------YNRLAIVLTDGRAQDNVFNPSLE---- 161 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMV 410 A+ N I++ + + S L S P+ + V + ++ + + + + ++ Sbjct: 162 -AQNNGIQLYAVGV--STAVVEELNEIASDPDSRHVMQVDDFQAIERI-RELLRQII 214 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 381 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKEDVLAVIRNIRYM 440 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 441 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 481 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 482 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 537 Query: 406 SQLMV 410 + + Sbjct: 538 IRGIC 542 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 70/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 110 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + + + ++ + A+ +KI +I I +S + L++ Sbjct: 205 SDEIDK--DQVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 262 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 263 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 52/162 (32%), Gaps = 25/162 (15%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S+ L + ++ +G++G+ R E R TD ++ Sbjct: 396 SLSALKQLGDENEVGIVGFNHRTYSVAERQPLGPNREALADRIRRLQAGGATDIAGGLRG 455 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L D +I ++DG + +I + + Sbjct: 456 AGKMLGDD---------------------PGTVILISDGHDRV----EESISYAKQLRSE 490 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +I+ I +PN + L +S ++ + L +F Sbjct: 491 GKRIIAIGAGKNPNEKNLRTIARASGGSYFRATETNRLNILF 532 >gi|162449478|ref|YP_001611845.1| hypothetical protein sce1208 [Sorangium cellulosum 'So ce 56'] gi|161160060|emb|CAN91365.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 607 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 51/202 (25%), Gaps = 28/202 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-Q 278 K+ A + ++ ++ + +VE I + T R + Sbjct: 47 SGSMSGEKLRLALEAARQAIRTLQPGD------RFSVVTFDHQVEVPIPSTDATPGARLR 100 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + TD + + ++ LTD Sbjct: 101 AEAALDTVIARGNTDLGGG----------------WLRGCAEVGAHLPEDAIGRVLLLTD 144 Query: 339 GENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 G+ N+ S + + TI + N L + Y AD Sbjct: 145 GQANHGITSPDELTSRARSQRLRRVTTSTIGLGEGFNEFLLGRLSEEGGGNFYFAARADE 204 Query: 398 LIHVFQNISQLMVHRKYSVILK 419 L + + + SV+ + Sbjct: 205 LPGF---VGREIGEV-LSVVAR 222 >gi|317127857|ref|YP_004094139.1| von Willebrand factor A [Bacillus cellulosilyticus DSM 2522] gi|315472805|gb|ADU29408.1| von Willebrand factor type A [Bacillus cellulosilyticus DSM 2522] Length = 282 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 70/192 (36%), Gaps = 36/192 (18%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSL 287 K A ++ + + ++ + +VE IEP T K +++Y+ R Sbjct: 62 YCKEACNFVINQLTDKDI------LSVVVFDDQVETIIEPQKVTHKDLLKEYIQRI---E 112 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + + Q Q + + ++ + +I L+DG+ N ++ Sbjct: 113 TRGITNLSGGLIQGCQHVLKQEVKN----------------YVNRVILLSDGQANAGITD 156 Query: 348 VN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-- 404 +K+ D + + I T+ ++ + + L S + + +++ +F+ Sbjct: 157 KEALVKLADDYQSAGLVISTLGVSEHFDEELLEGVADSGRGNFHFINEVENIPSIFEQEL 216 Query: 405 ------ISQLMV 410 I Q + Sbjct: 217 DGLLNVIGQNIT 228 >gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata lugens] Length = 315 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 31/263 (11%), Positives = 79/263 (30%), Gaps = 55/263 (20%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++ Q + + L + + + SD Sbjct: 61 VVFVLDISGSMFGEKIKQLKDAMLKILSDLNPQD--------------HFSIVLFSDNAY 106 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V + K A+ +D + ++ + ++ + ++ + Sbjct: 107 V-------------WSKAKTAV--MKKILDEGFYNLDNETLAIL--DDHRNEILQATPDN 149 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K + + T+ +++ +++ K+ T Q + Sbjct: 150 VKTAKEFVELI--KPTTSTNIIDGLRKGLKLVKEGKETLDTTKEPSQP----------IM 197 Query: 334 IFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE- 387 FLTDGE N ++ T + ++ K I +++ + L K ++ Sbjct: 198 FFLTDGEPNVDLTDPVEIVNETSSLNEQLK---TPIYSLAFGQGADITFLKKLSKANHGF 254 Query: 388 --YHYNVVNAD-SLIHVFQNISQ 407 Y +A L + ++ IS Sbjct: 255 ARNIYEGSDATLQLNNFYKEISS 277 >gi|169344506|ref|ZP_02865475.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens C str. JGS1495] gi|169297426|gb|EDS79535.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens C str. JGS1495] Length = 929 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 28/276 (10%), Positives = 68/276 (24%), Gaps = 25/276 (9%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR--IEMGERPIFLIELVVD--LSGSMHCAMN 208 + L + KI + I ++ L+ H Sbjct: 141 DVVFLLDNSNSMRTNNRVGKIKTQIKRVMDNLNNDDNTRYALVTYASTILDGRYYHLIDK 200 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 S D + + N ++ + + + + Y+G + + + Sbjct: 201 SIGNDQYENYKGYTSNQCYLNFTSNIQEIYNKIPENVPYERGHDYVGGTFTQEGLLEAKK 260 Query: 269 PSWGTEKVRQYVTRDMDSLI---------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + + D + +T AY + + + Sbjct: 261 LLKNSTADEKIIIHLTDGVPTYSFLIEEFGGNKPTTFNYNTAYNGIGVRGLGVSYFFDSK 320 Query: 320 QGVKI--PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNG 375 + T+ + + + T + K IKI TI + + + Sbjct: 321 YNKPYIFSREMVYSALNRSTNQREEIWNNGIATALEAENIKRENPYIKIYTIGVELTKDV 380 Query: 376 QRLLKTCVSSPEYHYNVV---NADSLIHVFQNISQL 408 + + +YN N + + IS Sbjct: 381 SKW-----NDANKYYNAEGTLNLPEIKDLLAKISSS 411 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/235 (11%), Positives = 71/235 (30%), Gaps = 42/235 (17%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ++LV+D+SGSM R++++ + A Sbjct: 3 ASADDTAEPVTEPPRVQLVLDVSGSMRATD--------------IDGRSRISVAQQAFGE 48 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 +D++ + + V ++ G + T Sbjct: 49 VVDALPDETQLGIRVLGATYRGEDKQQGCLDTQQIVPVGPVNRERAKAAVATLRPTGFTP 108 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+++A + L + I+ +TDGE+ + ++ Sbjct: 109 VGLALREAAKDLGGG-------------------TTARRIVLITDGEDTCAPPDPC--QV 147 Query: 354 CDKAKENFIKIVTISINASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + +V ++ +P+ ++LL ++ + +A+ L + + Sbjct: 148 ARELAAQGTTLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAATSAEDLTGRLKQL 202 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 73/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + TE + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYTEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 ISKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|327262910|ref|XP_003216266.1| PREDICTED: integrin alpha-2-like [Anolis carolinensis] Length = 1302 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 34/201 (16%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +A+ FL L +GLI Y + + T K V ++ Sbjct: 307 YPWDAVKAFLKKFVQGLDIGPTKTQVGLIQYGNNPREVFNLNTYTNK-EDAVQAMSETYQ 365 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A + S + K ++ +TDGE+++ Sbjct: 366 NGGEYTNTFRAIEYARRYAFSKESG-------------GRPSASKVMVVVTDGESHDGSK 412 Query: 347 NVNTIKICDKAKENFIKIVTISI-NA----SPNGQRLLKTCVS-----SPEYHYNVVNAD 396 I C+ E+ I I++ + + L+K + +Y +NV + Sbjct: 413 LQEVIAKCN---EDNITRFGIAVLGYLIRNELDTKNLIKEIKGIASLPTSKYFFNVSSEA 469 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 +L+ + + R +S+ Sbjct: 470 ALLEE----AGTLGERLFSIE 486 >gi|83312059|ref|YP_422323.1| hypothetical protein amb2960 [Magnetospirillum magneticum AMB-1] gi|82946900|dbj|BAE51764.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1171 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 19/195 (9%), Positives = 58/195 (29%), Gaps = 30/195 (15%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + + + A FL + D + ++G+ + + Sbjct: 70 DHSSSMGAAPGSPLQMMLRAAGNFLRQLSP------DSRVAVVGFNQVPSVHCTLAATPA 123 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + R + T A+ QA ++L K ++ Sbjct: 124 QARSALQAIS---PGGATSIAAALNQAVELL-----------------AHGRPGMDKVVV 163 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +DG+++ + ++++ + L + ++++ Sbjct: 164 LCSDGQDDIA---EIADALARLKAIPSVRVLAVGFGDEVIHATFLAMVADRQD-YFHLTR 219 Query: 395 ADSLIHVFQNISQLM 409 A + VFQ +++ + Sbjct: 220 ARDMDDVFQRLAKEV 234 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 190 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 248 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 249 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 295 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 296 ---DNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 352 Query: 407 QLMVHR 412 + Sbjct: 353 DRIFSL 358 >gi|119620823|gb|EAX00418.1| vitrin, isoform CRA_c [Homo sapiens] Length = 700 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 533 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 592 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 593 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 633 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 634 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 689 Query: 409 MVHR 412 + Sbjct: 690 ICTE 693 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 37/141 (26%), Gaps = 18/141 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D L MG++ Y + T + + ++ A Sbjct: 338 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRA 397 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + S + ++ + DG + + A Sbjct: 398 ISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASRLA 439 Query: 358 KENFIKIVTISINASPNGQRL 378 +E+ I I I+I + ++ Sbjct: 440 RESGINIFFITIEGAAENEKQ 460 >gi|295789109|ref|NP_001171442.1| vitrin isoform 4 [Homo sapiens] gi|119620822|gb|EAX00417.1| vitrin, isoform CRA_b [Homo sapiens] Length = 656 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 489 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 549 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 589 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 590 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/276 (9%), Positives = 66/276 (23%), Gaps = 30/276 (10%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + I M +A + + T + + + + ++ Sbjct: 165 TTKAYQRPPIPGTTAQPVTLMQLLAV-TVAVATPTTLPRPSPSAASTTSIPRPQSVGHRS 223 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-- 226 + S L+ + S+ DP D + Sbjct: 224 QEMDLWSTATYTSSQNRPRADPGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIG 283 Query: 227 ----KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 ++ L D L MG++ Y + T Sbjct: 284 KRRFRIQK-----QLLADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAI 338 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + ++ A+ + S + ++ + DG Sbjct: 339 EKITQRGGLSNVGRAISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPT 385 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + A+E+ I I I+I + ++ Sbjct: 386 D-----KVEEASRLARESGINIFFITIEGAAENEKQ 416 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T++ Sbjct: 179 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVEAASHIEQRGGTETRT 237 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K +I +TDGE+++ I+ ++ Sbjct: 238 AFG-------------IEFARSEAFQKGGRKGAKKVMIVITDGESHDSPDLEKVIQQSER 284 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 285 ---DNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 341 Query: 407 QLMVHR 412 + Sbjct: 342 DRIFSL 347 >gi|12850100|dbj|BAB28591.1| unnamed protein product [Mus musculus] Length = 431 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 46/141 (32%), Gaps = 18/141 (12%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + E++RQ + L T ++A + + Sbjct: 3 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY--------------YENSQGYR 48 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 II LTDGE + + + +++++ + + + + L S ++ Sbjct: 49 TASVIIALTDGELHEDLFF-YSEREANRSRDLGAIVYCVGVKD--FNETQLARIADSKDH 105 Query: 389 HYNVVN-ADSLIHVFQNISQL 408 + V + +L + +I + Sbjct: 106 VFPVNDGFQALQGIIHSILKK 126 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 70/259 (27%), Gaps = 68/259 (26%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 RP I V+D SGSM TK+ +K L + + Sbjct: 40 RPTLDIVAVIDKSGSMA--------------------GTKLELVKKTLETLVAQLRACD- 78 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQIL- 305 + L+ Y T V ++ S +K R T+ ++ + T+ + + + IL Sbjct: 79 -----RLALVTYDTEVTLDLALSPMDDKGRSKATQVVNGIRDGSSTNLSGGLLEGLNILR 133 Query: 306 ---TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-- 360 T ++ G+ + + II ++ + + Sbjct: 134 NRPTDSRREVSSVLLLTDGLANVGISTTEGIIAAMGAVAAAAAPESSSSSLATTLRRASL 193 Query: 361 -----------------------------------FIKIVTISINASPNGQRLLKTCVSS 385 + T + + L + Sbjct: 194 SNFFRRKSPTPEQQQQQQKTESDSPVARPPTNLVEGCTVYTFGYGSDADSNMLKAISDAG 253 Query: 386 PEYHYNVVNADSLIHVFQN 404 +Y + N D++ F + Sbjct: 254 SGTYYFIENKDTVATSFGD 272 >gi|301109920|ref|XP_002904040.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096166|gb|EEY54218.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2146 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 51/224 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 V+D SGSM+ +D MAA K + + L Sbjct: 1904 HVFVLDCSGSMNGQPWNDL----------------MAAWKEYVYNRIADGATLD------ 1941 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + ++ + + + VT T+ ++ A ++L+ Sbjct: 1942 LVSVVTFDNSAQIVY-----EARSITTVTNARIQYRGGGTNYAAGLRSANEVLS------ 1990 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK----ENFIKIVTIS 368 ++ F+ I+F +DG + ++ + N ++ + Sbjct: 1991 ----------RVNFDMFKPAIVFFSDGHP---CDPLQGEELATHIRGCYERNGLQAFAVG 2037 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 S N L + +++V+ + L F +IS + R Sbjct: 2038 FG-SINLNMLERVAEKLGGTYHHVLTGNELKATFFSISASLSTR 2080 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 57/191 (29%), Gaps = 33/191 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKV 276 ++ A+ LF+ + ++ ++G++ +++ T+ Sbjct: 311 VSGSMSSSNRIGRQLQAVELFV-----VQIIENGAHVGIVKFSSSASVVSSLVKINTQAQ 365 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R + + T+ ++ + + I+ L Sbjct: 366 RDQLKSLIPRTAGGGTNICAGIRAGIAL----------------NKNFDGSSYGTEIVLL 409 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN- 394 TDGE + +C ++ I I++ + L + +N ++ Sbjct: 410 TDGE------DNLDTSLCFKDITDSGAIIHVIALG-PNAAKELETIANMTGGLRFNALDK 462 Query: 395 --ADSLIHVFQ 403 A+ LI F Sbjct: 463 VEANELIDAFS 473 >gi|194227183|ref|XP_001916967.1| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Equus caballus] Length = 946 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTEDQFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATTTQVADAKKYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQNIQD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAHRIYGNQDTSSQLKKFYNQVST 495 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 47/132 (35%), Gaps = 18/132 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY-VTRDMDSLILKPTDSTPAMKQ 300 ++ ++ +D +GL+ Y+ V N+ S R Y V T + + Sbjct: 401 LEGQKYIGKDNLIGLVSYSDDVYINLPISRFDLNNRSYFVGAVGGLQAGGATATFDGIAV 460 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + I L+DGE N S + K+ + E+ Sbjct: 461 AMKMLEEQLA--------------LDPKLKPKIFVLSDGETNRGHSLNDIRKLVE---ES 503 Query: 361 FIKIVTISINAS 372 I I TI NA Sbjct: 504 GIPIYTIGYNAD 515 >gi|271964702|ref|YP_003338898.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] gi|270507877|gb|ACZ86155.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] Length = 514 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 47/189 (24%), Gaps = 29/189 (15%) Query: 227 KMAALKNALLLFLDS----IDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQ 278 ++ AL+ AL+ + S + + +I + + Q Sbjct: 341 RIEALRQALVTLTGADTSASGTFSRFRSRENVIMIPFGGSAGLPQPFILPERDPQPALAQ 400 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 T ++ AY I+ +TD Sbjct: 401 IRAYAERLRAAGGTAIYDGLRAAY-----------------GQAGDAGRDHYTSIVLMTD 443 Query: 339 GENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 GEN + S + ++ + S + + + + ++ Sbjct: 444 GENTDGSSYEDFEAYYRSLPEARRQVRTFVVLFGES-DADEMERIATLTRGAVFDARTGS 502 Query: 397 SLIHVFQNI 405 L F+ I Sbjct: 503 -LASAFKEI 510 >gi|260797295|ref|XP_002593639.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] gi|229278865|gb|EEN49650.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] Length = 373 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 49/162 (30%), Gaps = 22/162 (13%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +I +G+I Y+TR + + K D + T + A++ Sbjct: 33 KTISGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSAAIDEVQYMRGGTLTGKAIR 92 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + K +I +TDG + + + +A++ Sbjct: 93 YVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVA-----APALEAQQ 134 Query: 360 NFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I + I + + L+ S+ V N + L + Sbjct: 135 KGITVYAIGVSGYDADQ---LEQIASNNNTLAFVDNFNLLDN 173 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/330 (11%), Positives = 80/330 (24%), Gaps = 32/330 (9%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRA--LIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 L+ A V + ++I K D A+ + + G S Sbjct: 70 TLSAAIDEVQYMRGGTLTGKAIRYVTKYGFGKSDGARPGVPKVVIVVTDGVSYDAVAAPA 129 Query: 119 --QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + ++ + N L + + V Sbjct: 130 LEAQQKGITVYAIGVSGYDADQLEQIASNNNTLAFVDNFNLLDNL-----RNTLLTGVCD 184 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNAL 235 I F L +S M + + D + + + Sbjct: 185 DCTCTCICCRSPECFGCRLSTSVSIDMCFFPATLCRNPLDIIFLLDGSGSVGASNFEKVK 244 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +I +G+I Y+TR + + K D + T + Sbjct: 245 QFTKKTISGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTG 304 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + K +I +TDG + + + Sbjct: 305 KAIRYVTKYGFGKSDG-------------ARPGVPKVVIVVTDGVSYDAVA-----APAL 346 Query: 356 KAKENFIKIVTISI-NASPNGQRLLKTCVS 384 +A++ I + I + + L+ S Sbjct: 347 EAQQKGITVYAIGVSGYDADQ---LEQIAS 373 >gi|16416477|gb|AAL18263.1| vitrin [Homo sapiens] Length = 656 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 489 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 549 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 589 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 590 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 25/276 (9%), Positives = 64/276 (23%), Gaps = 30/276 (10%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + I M +A + + T + + + + ++ Sbjct: 165 TTKAYQRPPIPGTTAQPVTLMQLLAV-TVAVATPTTLPRPSPSAASTTSIPRPQSVGHRS 223 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT-- 226 + S L+ + S+ DP D + Sbjct: 224 QEMDLWSTATYTSSQNRPRADPGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIG 283 Query: 227 ----KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 ++ L D L MG++ Y + T Sbjct: 284 KRRFRIQK-----QLLADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAI 338 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + ++ + + S + ++ + DG Sbjct: 339 EKITQRGGLSNVGRTISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPT 385 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + A+ + I I I+I + ++ Sbjct: 386 D-----KVEEASRLARVSGINIFFITIEGAAENEKQ 416 >gi|295789104|ref|NP_001171440.1| vitrin isoform 2 [Homo sapiens] gi|74739159|sp|Q6UXI7|VITRN_HUMAN RecName: Full=Vitrin; Flags: Precursor gi|37181801|gb|AAQ88704.1| VIT [Homo sapiens] gi|119620824|gb|EAX00419.1| vitrin, isoform CRA_d [Homo sapiens] Length = 678 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 511 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 570 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 571 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 611 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 612 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 667 Query: 409 MVHR 412 + Sbjct: 668 ICTE 671 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 37/141 (26%), Gaps = 18/141 (12%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D L MG++ Y + T + + ++ A Sbjct: 316 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRA 375 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + S + ++ + DG + + A Sbjct: 376 ISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASRLA 417 Query: 358 KENFIKIVTISINASPNGQRL 378 +E+ I I I+I + ++ Sbjct: 418 RESGINIFFITIEGAAENEKQ 438 >gi|196232430|ref|ZP_03131283.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196223502|gb|EDY18019.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 879 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 21/204 (10%), Positives = 56/204 (27%), Gaps = 16/204 (7%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + +TK++ + ++++ + G++ T+ + + Sbjct: 428 DRSGSMTAAVAGQTKISLADQGAVFAMNALQPKDY------FGVVAVDTKPHTVVPLAPI 481 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + K + +M +A+Q L R F + Sbjct: 482 SAKGAAEQKILSITAGGGGIYIYTSMVEAFQQLRDIPARVKHLLLFSDAADAEEKAAGE- 540 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHY 390 ++DG S + + I + + + L + Y Sbjct: 541 ---MSDGIRTGGNS----LDLASAMLAAKITTSVVGLGTEQDKDTPFLRQLAERGSGRFY 593 Query: 391 NVVNADSLIHVFQNISQLMVHRKY 414 +A +L +F + + Sbjct: 594 LTDDATTLPQIFSTETMKVAQSSL 617 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 12 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 70 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 71 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 130 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 131 YFFLGPSRTVSATSTAKATNVD 152 >gi|148555258|ref|YP_001262840.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] gi|148500448|gb|ABQ68702.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] Length = 166 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 17/149 (11%), Positives = 46/149 (30%), Gaps = 1/149 (0%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-AILAGA 65 R + + +I FAL + F ++ + + W+ ++ ++SA A++A Sbjct: 2 IRALLHRLRDDRRGVATIEFALLSVLFFFVMTAGLDIAMWYQQRLRLDSAVEQGAMIAFN 61 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 S+ + S +G + + + + + + + Sbjct: 62 SRASVDQSAIGTYVAAAAKLSSAPTVSISCNGNATCANSGRTCACISGGAPTYSALGCDK 121 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSY 154 S + + S + V + Sbjct: 122 TCSDGSLPGYYMRISATATASTLLVPAAM 150 >gi|150377243|ref|YP_001313838.1| hypothetical protein Smed_5128 [Sinorhizobium medicae WSM419] gi|150031790|gb|ABR63905.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 436 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 19/205 (9%), Positives = 52/205 (25%), Gaps = 6/205 (2%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 SR++ ++ + +E ++I A++ + +G W+ +K ++ A + + A Sbjct: 15 SRWQKPLRRFLKAEGGAVAVIAAVAFPVLVGAMGLGAETGYWYLEKRKLQHAADVSAYAA 74 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A + + + + S + N ++ Sbjct: 75 AVRHRAGDQQSALEAAARRVAGGSGFSPGDLTLSTA------SAAAGGSNNVTVELTETH 128 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 R+ + + + + L V + Sbjct: 129 PRLFSSVFGTGTITIKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVQLSGCSVVSN 188 Query: 185 MGERPIFLIELVVDLSGSMHCAMNS 209 FL+ L + Sbjct: 189 SSASDAFLMRNGSALMSTDCVYTVG 213 >gi|313238339|emb|CBY13421.1| unnamed protein product [Oikopleura dioica] Length = 501 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 60/194 (30%), Gaps = 33/194 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIE-PSWGTEKVRQYVTRDM 284 LK + ++ +++ + + G + R + ++ ++ V + Sbjct: 243 YQKLKTWMKSIIEKLEIGDNASRVSILQFSGQSARPQGRWINPVLTFDRSTSKEAVIGAI 302 Query: 285 DSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D + T A+ Y+ + + + ++ +I +TDG+ N Sbjct: 303 DGMKKLNGDTCIGEALDYFYRNMFTSQAGQ-------------RSDVEQRVIVMTDGKRN 349 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINAS---------PNGQRLLKTCVSSPEYH-YNV 392 + + +I I I + Q L + + + + + Sbjct: 350 CPAEIAKP---AELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEI 406 Query: 393 VNADSLIHVFQNIS 406 N D L + + I Sbjct: 407 NNFDQL--ILKRIG 418 >gi|261404225|ref|YP_003240466.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261280688|gb|ACX62659.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 562 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 51/161 (31%), Gaps = 20/161 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ D +G + Y+T V N+ + + +V T + + Sbjct: 413 LTGQKYLGRDNSIGFVSYSTDVTINLPIGKYDLNQQSMFVGAINSLEASGNTATFDGIVV 472 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + + + I L+DGE N S + + K Sbjct: 473 AMKMLQDEMA--------------ANPDVKPLIFVLSDGETNVGHSLDDIRGLIQAFK-- 516 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + TI + + Q L + N D + + Sbjct: 517 -IPVYTIG--YNADIQALQSISSINEAASINADTDDVVYKI 554 >gi|268575272|ref|XP_002642615.1| C. briggsae CBR-MUP-4 protein [Caenorhabditis briggsae] gi|187032080|emb|CAP28849.1| CBR-MUP-4 protein [Caenorhabditis briggsae AF16] Length = 2101 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 17/177 (9%), Positives = 55/177 (31%), Gaps = 20/177 (11%) Query: 230 ALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSL 287 + +L FL + + ++ Y+ ++ + Sbjct: 452 VFQTEVLRFLAEFTELFDIAPQKTRVSVVQYSDQIRHEFGL-DNYSDRKSLQNAIRNIEY 510 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ S+++ + + I +TDG + + + Sbjct: 511 LTGLTRTGAAIEHVANEAFSERRGARPVG-----------QVSRVAIVITDGRSQDNVTR 559 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + A + I++ + + + L+ S + ++V + L ++ Sbjct: 560 PS-----NNAHKQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNTRLRS 610 >gi|329928399|ref|ZP_08282269.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937835|gb|EGG34241.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 562 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 51/161 (31%), Gaps = 20/161 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ D +G + Y+T V N+ + + +V T + + Sbjct: 413 LTGQKYLGRDNSIGFVSYSTDVTINLPIGKYDLNQQSMFVGAINSLEASGNTATFDGIVV 472 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + + + I L+DGE N S + + K Sbjct: 473 AMKMLQDEMA--------------ANPDVKPLIFVLSDGETNVGHSLDDIRGLIQAFK-- 516 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + TI + + Q L + N D + + Sbjct: 517 -IPVYTIG--YNADIQALQSISSINEAASINADTDDVVYKI 554 >gi|126737457|ref|ZP_01753192.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] gi|126722042|gb|EBA18745.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] Length = 479 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 40/122 (32%), Gaps = 5/122 (4%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ +A +T K + + + +I ++DGE + + Sbjct: 88 PAAGTR--EAISKAVDAITPKGKTPLSAAVIQAAEGLHLSEEKATVILISDGEETCGR-D 144 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC--VSSPEYHYNVVNADSLIHVFQNI 405 I +A + I + + R C ++ + + A+ L ++ Sbjct: 145 PCAIGAELEAAGVDFTLHAIGFGIADDAARAQLQCLAENTGGVYLDAKGAEGLSAALSHV 204 Query: 406 SQ 407 ++ Sbjct: 205 TK 206 >gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus] gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus norvegicus] Length = 933 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 65/226 (28%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMAKNVLFVIDKSGSMA--------------------GKKIQQTREALIKILKDLSTQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V+ G + + E + + V T+ A+ A ++L Sbjct: 309 QFNIIVFSGEA----NQWEQLLVQATEENLNRAVDYASKIPAQGGTNINKAVLSAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKIV 365 + + II LTDGE ++N I+ + N + Sbjct: 365 KSNQAELLPSKSVS-----------LIILLTDGEPTVGETNPKIIQKNTQEAINGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L +Q ++ Sbjct: 414 CLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFYQEVAN 459 >gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] Length = 932 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 65/226 (28%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 268 PTMAKNVLFVIDKSGSMA--------------------GKKIQQTREALIKILKDLSTQD 307 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V+ G + + E + + V T+ A+ A ++L Sbjct: 308 QFNIIVFSGEA----NQWEQLLVQATEENLNRAVDYASKIPAQGGTNINKAVLSAVELLD 363 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKIV 365 + + II LTDGE ++N I+ + N + Sbjct: 364 KSNQAELLPSKSVS-----------LIILLTDGEPTVGETNPKIIQKNTQEAINGRYSLF 412 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L +Q ++ Sbjct: 413 CLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFYQEVAN 458 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/309 (11%), Positives = 79/309 (25%), Gaps = 22/309 (7%) Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 +N+ + S T +A D+ V+ + L + N Sbjct: 154 SEIEAFNSVVVEDGGSPSYPATALAYRTDDTPGPQTKAGKKVVAAITLVLDESGSMANNM 213 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 ++ S P+ + I + + SG + + P ++ Sbjct: 214 SGGRVTSSNPSRISILRNRAKALIDQF----SGLGNIYVGIIPYSDDAYISGTKSFVLAN 269 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDS 286 N + +DS+ M + Y T+ K+ S + + + Sbjct: 270 GTNVNTIKNKIDSLTAQGMTNTGDAMRVSYYATKQFKDSPNSIDNTLPTDTKVIPYMILL 329 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 PT + T + + + + D E+ N Sbjct: 330 SDGDPT--------VFSATTREWVGWWIFGHYEYSGYKQDSGYVISDRISDDSESKNTSQ 381 Query: 345 -KSNVNTIKICDKA---KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S + I + IK I + + +++ + L Sbjct: 382 WSSMGYSEYIGQNLVVGGDLEIKSHVIGFSNDATNAQ--TIANYINAEYHSANSDVELEA 439 Query: 401 VFQNISQLM 409 VF I + Sbjct: 440 VFDEIGGTI 448 >gi|258647262|ref|ZP_05734731.1| BatB protein [Prevotella tannerae ATCC 51259] gi|260852911|gb|EEX72780.1| BatB protein [Prevotella tannerae ATCC 51259] Length = 339 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 67/231 (29%), Gaps = 57/231 (24%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + QD ++ K + +D ++ + L + + + Sbjct: 97 DVSNSMLAQDVSPNRLERAKLLVSTLIDR-------MQNDKIALGVFAGEAYPQLPITGD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ ++ + T+ A+ A + T K Sbjct: 150 YGAAKLFLNSITPGMVTLQGTNLAAAINLADKSFTDK------------------KRVGK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL------------ 379 II +TDGE++ + + + K+ + I Q L Sbjct: 192 AIIIITDGEDHQGGAEEAAAAAAKEGR----KVYILGIGNPGGAQIPLGNDYLKDNNGQV 247 Query: 380 -----------KTCVSSPEYHYNVVNA----DSLIHVFQNISQLMVHRKYS 415 + + + +V N+ + L + Q H Y+ Sbjct: 248 VVTKLNEQMCREVAKAGDGLYLHVDNSNAAQEQLQAALGQLQQASEHTDYT 298 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/326 (9%), Positives = 76/326 (23%), Gaps = 20/326 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINL 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ + + S I + S I Sbjct: 76 SVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTT 135 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFL 192 +R N I V + + IP + P+ Sbjct: 136 SRTGIVNWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAG 195 Query: 193 IELVVDL-------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + +G+ + +S + A + + Sbjct: 196 ATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPG 255 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DS 294 D + + Y + + + D + + T Sbjct: 256 DKAATTNAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASY 315 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQ 320 + S + R+ Sbjct: 316 QGDLITGINTGGSYNPSYYAAGADRR 341 >gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] Length = 802 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/241 (14%), Positives = 69/241 (28%), Gaps = 54/241 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + V+D SGSM + K K AL L ++ Sbjct: 292 ETAIPKDVIFVLDRSGSM--------------------EGVKFQQAKQALEYVLSRLNPQ 331 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 L+ ++ +VE G E + Q T+ A+ A Q + Sbjct: 332 DRFN------LLSFSNQVEVFAPEMEGVEAIPQAQKWVAGLSAAGGTNIHRALLDAIQFV 385 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN--VNTIKICDKAKENFIK 363 ++IFLTDG ++ +A ++ Sbjct: 386 --------------------RSQRPTYLIFLTDGLPTVGITDREQILDDFARQAP-RGLR 424 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV----FQNISQLMVHRKYSVILK 419 + + + L + ++ V + L F+ IS ++ ++ ++ Sbjct: 425 LFVFGVGYDVDTFLLDELALAHHGLSLYVRPEEDLNQAVAGFFEKISTPVLTD-LTLTVE 483 Query: 420 G 420 G Sbjct: 484 G 484 >gi|301627727|ref|XP_002943021.1| PREDICTED: complement factor B-like, partial [Xenopus (Silurana) tropicalis] Length = 705 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/267 (13%), Positives = 87/267 (32%), Gaps = 48/267 (17%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + ++++ + + I +V+D S S+ Sbjct: 211 FSSSMLENVDTTNLE--GRSDRKVQILKDGLMNIFIVLDTSKSV---------------- 252 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + K K A +LF++ + + +I Y + + K Sbjct: 253 ----GKDKFNEAKEASILFIEKVSSYDIKPQY---CIISYASEAIPVVSLRDQDSKNADA 305 Query: 280 VTRDMDSLILK------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ YQ L + N +KI + I Sbjct: 306 VIEHLENFVYDSHADKQGTNTRAALHSIYQQLIEQELVYKNNNNKESFMKIHN-----VI 360 Query: 334 IFLTDGENNNFKSNVNTIKIC----------DKAKENFIKIVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ D +E ++ + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKVIRRFLNVGTSKDDLREEYLDVYVFGLGSDIDQPEINDLAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQNISQL 408 + + +++ N D + F+ + Sbjct: 421 KKDKEVHTFHLENVDKMKEFFELMIDE 447 >gi|291486252|dbj|BAI87327.1| putative exported protein [Bacillus subtilis subsp. natto BEST195] Length = 224 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 20/208 (9%), Positives = 55/208 (26%), Gaps = 33/208 (15%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH--------VKEDVYMGLIGYTTRV 263 D + + + + K+ K ++ F + + ++ + G + Sbjct: 36 FDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFGHAGNNKLSGKALSCSTT 95 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + + T A+ + + Sbjct: 96 ETIYGLHPYEGSL--FDNSLSELKPTGWTPIAKALADTRKEFEAF--------------- 138 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT 381 + + +TDGE +K + + + + I N G +K Sbjct: 139 --DADGKNVVYLITDGEETCGGD---PAAEIEKLRASNVDTIVNIIGFNFDVKGNEEMKQ 193 Query: 382 CVSSP-EYHYNVVNADSLIHVFQNISQL 408 + + + +AD ++ +Q Sbjct: 194 AAVAGCGEYISANSADEFEQAWEKEAQK 221 >gi|216264497|ref|ZP_03436489.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221217546|ref|ZP_03589016.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224532807|ref|ZP_03673422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224534090|ref|ZP_03674673.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225548563|ref|ZP_03769611.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] gi|225549807|ref|ZP_03770771.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|215980970|gb|EEC21777.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221192609|gb|EEE18826.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224512196|gb|EEF82582.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224512789|gb|EEF83157.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225369615|gb|EEG99064.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|225370826|gb|EEH00261.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] Length = 333 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 110 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + ++ + A+ +KI +I I +S + L++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 262 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 263 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|332707319|ref|ZP_08427370.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] gi|332353913|gb|EGJ33402.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] Length = 464 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 68/216 (31%), Gaps = 33/216 (15%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR-------TKMAALKNALLLFLDS 241 P V+D SGSM+ + D + TK+ + +LL + Sbjct: 43 PSTSFAFVIDTSGSMYEVVEGDTKPTGRVYTQDGNDYEEVIGGKTKIDIVIESLLNLVR- 101 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + +I + + + + TE + + T M+Q Sbjct: 102 ---SNQLGGSDRIAIIQFDDQASTIVGLTPATETSQLEAGIEKLRNYSGGTCMGEGMEQT 158 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +L+ + + TDG+ ++ + EN Sbjct: 159 LTMLS------------------GQTMTSRHALIFTDGQA---FDEEECRELAKQFSENG 197 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 I I + + N L+ ++ + +++V+ D+ Sbjct: 198 IPITALGVG-DYNEDLLINLSDTTAGHLHHIVSGDA 232 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 62/184 (33%), Gaps = 28/184 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + + +G+I Y+ +VE + + K + D Sbjct: 814 FQIIKNFVKTLTDQVA---LDLDTARIGIINYSHKVEMVAHLTQFSSKDDFKLAVDNMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A+ A + + +K + +TDG+ + + Sbjct: 871 LGEGTYTATALHAANHMFEA-----------------ARPGVKKVALVITDGQ-TDTRDE 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSLIH 400 N ++ A + ++I I + + + + + P+ + Y + +L Sbjct: 913 KNLTEVVKNASDASVEIFVIGVVKRNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITLQD 972 Query: 401 VFQN 404 + Sbjct: 973 TLKQ 976 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 61/165 (36%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D + L+ V+ D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKVFQLTPVRSLKYDIKLAALQFSSSVQIDSPFSSWKDLQTFKQRVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLQREGRKDGV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ I +TI ++ N +L S Sbjct: 166 PKNPD--VQSISEDARTAGILFITIGLSTVVNEAKLHLISGDSSS 208 >gi|90407194|ref|ZP_01215382.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] gi|90311770|gb|EAS39867.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] Length = 1116 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 12/62 (19%), Positives = 22/62 (35%), Gaps = 4/62 (6%) Query: 355 DKAKENFIKIVTISINA---SPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLMV 410 A+E I TI + LLK ++ NA +L F+ + ++ Sbjct: 377 KLAEEQNITTYTIGFGDITSDTDATDLLKKTAELGGGKYFPASNASALQKAFKKMIISIL 436 Query: 411 HR 412 + Sbjct: 437 NE 438 >gi|327542166|gb|EGF28659.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1014 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 46/117 (39%), Gaps = 8/117 (6%) Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-KFIIFLTDGENNNFKSNVNTIKICD 355 AM A +T F + K +I ++DG+ ++V Sbjct: 508 AMLAAVGRMTPGDMPEFDPAMRMAVTGLVRTDASVKHLIIISDGDPGPPSNSVI-----Q 562 Query: 356 KAKENFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K+N I I T+++ + + +RL ++ +Y V + +L +FQ ++ + Sbjct: 563 AFKDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRALPGIFQREARRVT 619 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 11 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 69 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 70 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 129 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 130 YFFLGPSRTVSATSTAKATNVD 151 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 59/189 (31%), Gaps = 31/189 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 ++ + + +G++ Y +V + V V T+ Sbjct: 187 DFLINILRKFYVGPGQIQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETN 245 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A + +K +I +TDGE+++ I+ Sbjct: 246 TALGINVARS---------------EAFKQGGRRGAKKVMIVITDGESHDSADLQQVIEE 290 Query: 354 CDKAKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQ 403 + ++ I I++ N + L K S P+ + +NV + +L + Sbjct: 291 SE---KDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVD 347 Query: 404 NISQLMVHR 412 + + + Sbjct: 348 ALGERIFSL 356 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 59/189 (31%), Gaps = 31/189 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 ++ + + +G++ Y +V + V V T+ Sbjct: 165 DFLINILRKFYVGPGQIQVGVLQYGEKVVSEFQL-NDFRSVEDVVKAARKIGQRGGEETN 223 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + A + +K +I +TDGE+++ I+ Sbjct: 224 TALGINVARS---------------EAFKQGGRRGAKKVMIVITDGESHDSADLQQVIEE 268 Query: 354 CDKAKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQ 403 + ++ I I++ N + L K S P+ + +NV + +L + Sbjct: 269 SE---KDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVD 325 Query: 404 NISQLMVHR 412 + + + Sbjct: 326 ALGERIFSL 334 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 192 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 251 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 252 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 300 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 301 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 353 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 354 FTQKALRDAGSMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 396 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 397 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 456 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 457 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 515 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 47/162 (29%), Gaps = 16/162 (9%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + L + LF D++ + ++ + S + Sbjct: 27 TSGSMEGDSINELNEGVRLFYDAVRSDETALYAAEISVVTFGGHASCQAGFS-----TLE 81 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T AM A +L + +Q +I+ +TD Sbjct: 82 HQPDAPQFYADGGTPMGEAMNMALDMLEKR---------KSEYKASGVDYYQPWIVLMTD 132 Query: 339 GENNNFKS--NVNTIKICDKAKENFIKIVTISINASPNGQRL 378 G N ++ + + + CD + + I I I + L Sbjct: 133 GMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVL 174 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 11/102 (10%), Positives = 33/102 (32%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + + N + + AL FL + F + ++ +++ + A +AGA+ Sbjct: 1 MRGFVSRFLGARGGNLATMAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAA 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + + + ++ + Sbjct: 61 SLSQANEAVLRQLQANGVDPVLMTDGYDPSIVNGKADNKTRV 102 >gi|229490509|ref|ZP_04384348.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229322588|gb|EEN88370.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 684 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 76/250 (30%), Gaps = 48/250 (19%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + +V+DLSGSM+ + K+ K +L Sbjct: 30 PTTPPTRAVSAAPAALLMVMDLSGSMNDNDANGK--------------NKLTGAKQSLSR 75 Query: 238 FL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDST 295 + D+ + + Y G K + + L T + Sbjct: 76 IVGDTASSSTPLGLWTYPTAGSNCDPGSFLAGADGGVRKDTDTLMAQVSGLKADGGTPTG 135 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 PA++ + L ++ ++ ++DGE+N ++ CD Sbjct: 136 PALRASVDSLKANGIT------------------TATVVLISDGESNCGQAP------CD 171 Query: 356 KAKE-----NFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 AK+ + + + S G+ L+ S + + ++ + D + + + + Sbjct: 172 TAKQIVAEGFDVTVEALGFQLSGQGRTELECIASTTGGRYSDIADVDEMQKRLKELM--I 229 Query: 410 VHRKYSVILK 419 ++ Sbjct: 230 PELALTISAP 239 >gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora acidiphila DSM 44928] Length = 1033 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 59/185 (31%), Gaps = 22/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM A + A +D++ + + + + + + R V Sbjct: 333 KMTAARRAAARIVDTLTAEDRFAVLTFDDQMETPDGLPTGLSEATDRHRFR-AVQHLATV 391 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ P +++A +L+ + + +I +TDG+ N Sbjct: 392 DARGGTEMEPPLRRAATLLSD-----------------DNPDRDRVLILITDGQVGNEDR 434 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + T+ K I++ T+ I+ + N L + + V + D L I Sbjct: 435 LLTTLSP----KLTHIRVHTVGIDTAVNAAFLQRLSTLGGGHCELVESEDRLDDAMDAIH 490 Query: 407 QLMVH 411 + Sbjct: 491 HRIAT 495 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/286 (9%), Positives = 71/286 (24%), Gaps = 32/286 (11%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 +++ V L + N + + L ++ + Sbjct: 80 TASGDSSSSTVTTAVPADIVLVLDKSGSMKNSNRDTNAKNAAT-ALASKLLTAANAALPA 138 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + ++ + F S + G Sbjct: 139 EQQVQMAVVTFSDRA----------------------RTTSQFTTSPGAIGTAVSAWPNG 176 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQ-AYQILTSDKKRSF 313 + ++ + S G V++++ D + T + K + Sbjct: 177 GTNWEDALKTANDLSSGRSGVQKHIVFLSDGNPTFRITSYSGCFKWSGFGQGWESHPEYG 236 Query: 314 FTNFFRQGVKIPSLPFQ-KFIIFLTDGE-----NNNFKSNVNTIKICDKAKENFIKIVTI 367 K +Q K +DG +++ + + + + + Sbjct: 237 TQAECEANNKWLDQSYQWKTDPDGSDGNIHGEGDDDSYGYNYAAALSEANERGNAALYVV 296 Query: 368 SINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++A N L K + Y+ +A++L F+ I + Sbjct: 297 KVSADANKMSDLAKEANAVTGKEYDGTSAENLTKAFEQIYNTITTT 342 >gi|195941904|ref|ZP_03087286.1| hypothetical protein Bbur8_03396 [Borrelia burgdorferi 80a] gi|312149118|gb|ADQ29189.1| von Willebrand factor type A domain protein [Borrelia burgdorferi N40] Length = 333 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 110 SSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + ++ + A+ +KI +I I +S + L++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 262 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 263 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 214 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 322 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 323 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 375 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 376 FTQKALRDAGSMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 418 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 419 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 478 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 479 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 537 >gi|218249312|ref|YP_002374701.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223889245|ref|ZP_03623833.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226320920|ref|ZP_03796471.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|226321491|ref|ZP_03797017.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|218164500|gb|ACK74561.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223885278|gb|EEF56380.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226232680|gb|EEH31433.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|226233692|gb|EEH32422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|312147800|gb|ADQ30459.1| von Willebrand factor type A domain protein [Borrelia burgdorferi JD1] Length = 333 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 110 SSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + ++ + A+ +KI +I I +S + L++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 262 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 263 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPATQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 196 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 255 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 256 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 315 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 316 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 362 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 363 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 412 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 413 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 472 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 473 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 528 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 529 DENGDLYYESADYAPDISDYLAKKAVQIS 557 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 47/359 (13%), Positives = 96/359 (26%), Gaps = 71/359 (19%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 ++V + + S + A + + + T + Sbjct: 252 AVDQLVQYQNNAPTAAYTTSILKESASKGIIDAMVMEEQTYINTPELKNYVYTPAGVRHD 311 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL--------------NQRY 169 + + L+++ + + + + Sbjct: 312 HPVYAFPWTDEDELEAAKMF--------VDFCLTSD-SQSMATERGFNLHEDYVSEDYGM 362 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + V D+SGSM+ T++ Sbjct: 363 TGSDFLSAQKVWKTNKNGTRPTIAVFVTDISGSMN--------------------GTRIK 402 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLI 288 +LKN+LL + ID S+ +GL+ Y+ +V N+ + K R Y + D + Sbjct: 403 SLKNSLLSTMQYIDSSSY------IGLVSYSDKVYINLPIAQFDNKQRAYFSGAVKDLDV 456 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ Q+L K N + L+DG N Sbjct: 457 GGQTATYDAVLVGMQMLMEKSKEVPDANMM--------------LFVLSDGAQNAGFELK 502 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNAD---SLIHVFQ 403 I I + TI + + L + + D L ++F Sbjct: 503 RITPIVGGL---GISVYTIGYEMTDSDKEDLKALSEINEAVCIDADTEDIVNELRNLFN 558 >gi|38637070|dbj|BAD03327.1| histone deacetylase HD2-like protein [Oryza sativa Japonica Group] gi|40253820|dbj|BAD05756.1| histone deacetylase HD2-like protein [Oryza sativa Japonica Group] Length = 169 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 3/40 (7%), Positives = 10/40 (25%) Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + T + + + + N ++ F Sbjct: 68 PVHTFGFGKDHDAVAMHTIAEVTGGTFSFIENEAAIQDGF 107 >gi|198415894|ref|XP_002125076.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Ciona intestinalis] Length = 863 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 73/254 (28%), Gaps = 45/254 (17%) Query: 163 HLLNQRYNQKIVSF--IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 +++ Y Q++V F + E + + + D + Sbjct: 137 SMMSHSYVQQVVEFCHSDPSDPVN-QHNTEADNEQNAKCNLRSTWDVITSTSDFSGGSNP 195 Query: 221 QDKKRTKMA---ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 + T+ ++ + F+ + + +G+ + + + ++ S + + Sbjct: 196 PNPTLTRPDRMLMMRQSASTFI-----TTSLPIGDCVGITSFQSTSQLLLQISNITSSED 250 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R +TR + T +K A + D + II L Sbjct: 251 RANLTRMIPISAGGTTCIGCGIKTAIDEMERDNPGGVCGD----------------IIVL 294 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG N + I + N + L + + + Y + D Sbjct: 295 TDGMENEEP--------------KNCTVHAIFFTTAHNRE-LAELAIETGGQWYFAQDRD 339 Query: 397 S--LIHVFQNISQL 408 L+ F I+ Sbjct: 340 IKRLMGAFTAIATE 353 >gi|77359874|ref|YP_339449.1| pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] gi|76874785|emb|CAI86006.1| conserved protein of unknown function ; putative pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] Length = 1056 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 32/320 (10%), Positives = 71/320 (22%), Gaps = 74/320 (23%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + + G R + S + + Sbjct: 32 VETKEKPRVMMIFDTSGSMAWDVSD-------GDACYMRSGNRYYEVDCFQSKNSYNRNE 84 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 N C D +++ +NA+ ++ ED+ GL+ + Sbjct: 85 QCYKRSSYYNYEATCSD---SRLKVAQNAIKQLVN-------DNEDIEFGLMRFNGDNGG 134 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY-----QILTSDKKRSFFTNFFRQ 320 + + + + + + T T + +AY Q + + Sbjct: 135 YV-LARVGADKKSLLEKIEELPANGATPLTETLWEAYLYITGQKVYYGNNTNDRDKTAES 193 Query: 321 GVKIPSLPFQK-----------FIIFLTDGENNNFK---------------------SNV 348 S II +TDG+ +N SN Sbjct: 194 SGYYNSPFVPVTGEPLRCDNSINIILMTDGDPSNDSNRNNDIYYLHRDYFSENPPIVSNS 253 Query: 349 NTIKICDKAKE------------NFI----KIVTISINASPNG---QRLLKTCVSSPEYH 389 + + ++ I + L +T + Sbjct: 254 YLAALAKIIHGTDDVEVDLYKNTKNVLDIGRVFPIGFGTGMSNSGKNLLTETADYGGGQY 313 Query: 390 YNVVNADSLIHVFQNISQLM 409 + A L +N + Sbjct: 314 LHANTAAQLSDALKNTISRI 333 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 64/236 (27%), Gaps = 52/236 (22%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +RP + ++D SGSM K+ LK + L L + Sbjct: 288 AIEDRPPLNLVFLIDTSGSMQ-------------------DPAKLPLLKQSFGLMLGRLR 328 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + ++ Y + + P+ + ++ T + AY+ Sbjct: 329 PED------QVAIVTYAGSAGEVLAPTAA-NQRSTILSALDRLDAGGSTAGEEGLALAYR 381 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFI 362 + T ++ TDG+ N S+ ++ ++ I Sbjct: 382 TASEMAGAGEVT----------------RVVLATDGDFNLGISDPEELARLVAHERDTGI 425 Query: 363 KIVTISINASP-NGQRLLKTCVSSPEYHYNVVN--------ADSLIHVFQNISQLM 409 + + + + + + + D L I+ + Sbjct: 426 YLSVLGFGRGNLDDATMQALAQNGNGQAAYIDSLNEAQKVLVDQLSGALFPIADDV 481 >gi|45384200|ref|NP_990400.1| integrin alpha-1 [Gallus gallus] gi|2582830|dbj|BAA23160.1| alpha1 integrin [Gallus gallus] Length = 1171 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 66/194 (34%), Gaps = 26/194 (13%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ FL+S+ + + +G++ Y V + + + Sbjct: 174 YPWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYL-NTYSTTEEVMDAALRIRQ 232 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + F QK ++ +TDGE+++ Sbjct: 233 RGGTQTMTAL---------GIDTAREEAFTEAHGARRG--VQKVMVIVTDGESHDNYRLQ 281 Query: 349 NTIKICDKAKENFIKIVTISI-NASPNGQ-------RLLKTCVS--SPEYHYNVVNADSL 398 I C+ + I+ I+I + G +K+ S + ++ +NV + +L Sbjct: 282 EVIDKCE---DENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELAL 338 Query: 399 IHVFQNISQLMVHR 412 + + + + + + Sbjct: 339 VTIVEALGERIFAL 352 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 76/389 (19%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILA---GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI----KNH 101 + M A A LA A + ++ S + L + Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|324513792|gb|ADY45651.1| C-type lectin protein 160 [Ascaris suum] Length = 497 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 16/160 (10%), Positives = 42/160 (26%), Gaps = 25/160 (15%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + + + ++ M A + + Sbjct: 1 MVLFNDVARVGFPLTTYATNFDVTTAILGMNFNGGISNIAAGMHTAMTEVFAKSSERG-- 58 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 Q+ ++ +TDG + + + A +N I + I Sbjct: 59 -------------VQRVMVIITDGMDITDVGTEHAL-----ASKNNINTYVLGIGEEQGY 100 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + ++K P Y+ N L V + + + + Sbjct: 101 EEMVK-AAGDPSRLYDASNFTQLRDV----IDEVCNERVT 135 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 76/389 (19%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILA---GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI----KNH 101 + M A A LA A + ++ S + L + Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|291444745|ref|ZP_06584135.1| lipoprotein [Streptomyces roseosporus NRRL 15998] gi|291347692|gb|EFE74596.1| lipoprotein [Streptomyces roseosporus NRRL 15998] Length = 531 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 64/220 (29%), Gaps = 43/220 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 ERP + VVD+SGSM ++ ++ +L + D + Sbjct: 176 TSERPPAALTFVVDISGSMAE-------------------TGRLDLVRKSLTILADELRD 216 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + L+ ++ E + + + + + + T+ +K Y Sbjct: 217 DDS------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGY-- 268 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFI 362 + V+ ++ L+D N ++ KI +E I Sbjct: 269 --------------EESVEGHREGATNRVVLLSDALANTGETEAEGILKKIDSARREYGI 314 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + + + V + VF Sbjct: 315 TLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 354 >gi|182438638|ref|YP_001826357.1| hypothetical protein SGR_4845 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467154|dbj|BAG21674.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 578 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 63/224 (28%), Gaps = 43/224 (19%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ERP + VVD+SGSM ++ ++ +L + D Sbjct: 207 AAPSTAERPPAALTFVVDISGSMAE-------------------TGRLDLVRKSLAVLTD 247 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + V L+ ++ E + + + + + T+ + + Sbjct: 248 ELRDDDSVS------LVTFSDAAETRLPMTRLQGNRNRIKDAVDEMRPEQSTNVEAGITR 301 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAK 358 Y + V+ ++ L+D N ++ +I + Sbjct: 302 GY----------------EESVEGHRKGATNRVVLLSDALANTGDTEADGILERIDSTRR 345 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 E I + + + + + + V + VF Sbjct: 346 EYGITLFGVGVGSDYGDAFMERLTNKGDGNTTYVGDEAQARKVF 389 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 61/184 (33%), Gaps = 23/184 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ +K L +D + + +G++ Y +RV+ + K + Sbjct: 45 SSRSVRPSEFEQVKVFLAKVIDGLS---VGPDATRVGVVNYASRVKNEVSLKS--HKTKA 99 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + + T + + + + F S K I +TD Sbjct: 100 ALVKAVSKIEPLSTGTMTGLAIQFAM---------NVAFSEAEGGRKSPDISKVAIIVTD 150 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + N I +A+E I+I I + L+ S P ++ V + Sbjct: 151 GRPQD-----NIRDIAARAREAGIEIFAIGVGRVD--MTTLRQMASEPLEDHVDYVESYS 203 Query: 397 SLIH 400 + Sbjct: 204 LIEK 207 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/359 (6%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 192 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 251 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 252 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 300 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 301 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 353 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 354 FTQKALRDAGNMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 396 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + + D N + + + TI KE I+I Sbjct: 397 SSSNYYGTQFSNTQDQPGNTSRISNSYYAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 456 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 457 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 515 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + R N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 ISKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|239987768|ref|ZP_04708432.1| hypothetical protein SrosN1_10718 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 64/220 (29%), Gaps = 43/220 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 ERP + VVD+SGSM ++ ++ +L + D + Sbjct: 160 TSERPPAALTFVVDISGSMAE-------------------TGRLDLVRKSLTILADELRD 200 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + L+ ++ E + + + + + + T+ +K Y Sbjct: 201 DDS------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGY-- 252 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFI 362 + V+ ++ L+D N ++ KI +E I Sbjct: 253 --------------EESVEGHREGATNRVVLLSDALANTGETEAEGILKKIDSARREYGI 298 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + + + V + VF Sbjct: 299 TLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 338 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 68/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 319 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 373 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A ++ I+ LTDGE+N Sbjct: 374 RLPAAASGGTSICSGLRSASTVIRKKYPTDGSE-----------------IVLLTDGEDN 416 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 417 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 468 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 469 IDAFGALSSGNGAVSQRSIQLESKG 493 >gi|290474520|ref|YP_003467400.1| putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] gi|289173833|emb|CBJ80615.1| Putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] Length = 223 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 20/199 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY-TTRVEKNIEPSWGTEKV 276 K+ AL L L S S ++ ++ + +I + + E N+ + Sbjct: 23 TSGSMASDGKIEALNQGLKDMLLSFKDESRLRAEIQVSVITFGNNKAEVNLPLAPAH--- 79 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T A+ A Q++ IPS ++ ++ + Sbjct: 80 --LLQDFIPLSAEGSTPLGGALSLASQMIEDKSV-------------IPSRAYKPVVVLV 124 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG N+ + + +SI + L + + NA Sbjct: 125 SDGYPNDDWE-GPFSSFASGERSSKATRFAMSIGGDADEDMLSEFANDPEAPLFRAENAS 183 Query: 397 SLIHVFQNISQLMVHRKYS 415 + F+ ++ + + S Sbjct: 184 DIRRFFRAVTMSVSVQSRS 202 >gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus norvegicus] Length = 706 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 65/226 (28%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMAKNVLFVIDKSGSMA--------------------GKKIQQTREALIKILKDLSTQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V+ G + + E + + V T+ A+ A ++L Sbjct: 309 QFNIIVFSG----EANQWEQLLVQATEENLNRAVDYASKIPAQGGTNINKAVLSAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-IKIV 365 + + II LTDGE ++N I+ + N + Sbjct: 365 KSNQAELLPSKSVS-----------LIILLTDGEPTVGETNPKIIQKNTQEAINGRYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L +Q ++ Sbjct: 414 CLGFGFDVNYPFLEKLALDNGGLARRIYEDSDSALQLQDFYQEVAN 459 >gi|58429517|gb|AAW78162.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYSEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYASVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQGS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 66/233 (28%), Gaps = 36/233 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R I + +D+SGSM K+ K L L L + Sbjct: 134 RSPIDIVVALDVSGSMR--------------------VEKLDLCKETLHLLLRELHHDD- 172 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 LI ++ + +Q + + T+ A+ A Q++ Sbjct: 173 -----RFALISFSEDAVIEVPMQKVNERNKQQALHAIDRLSVKGRTNIASAVSLAAQVVN 227 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + F + + + I + K ++ I + T Sbjct: 228 GVAEPNKVRSVFLLTDGNANTGYTEAIDLV--------KLTSIFVEANRNPHTPPISLHT 279 Query: 367 ISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 P+ + L +S Y+V + + F + ++ +L Sbjct: 280 FGYGPEPDQKLLRGMAMATSGGSFYSVRDNSQVSSAFGDAIGGILSLALYSVL 332 >gi|297265792|ref|XP_002799252.1| PREDICTED: vitrin-like [Macaca mulatta] Length = 657 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 490 FRTVLQFVTNLTKEFEISDTDTRVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSG 549 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 550 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 590 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 591 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 646 Query: 409 MVHR 412 + Sbjct: 647 ICTE 650 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/343 (8%), Positives = 89/343 (25%), Gaps = 42/343 (12%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 +++ + ++ A K + + + +A V + + T + S+ Sbjct: 149 ALTYSSSKSPAAQAGETTKAYQRPPIPATTAQPVTLMQLLAVTVAVATPTILPRPS-PSA 207 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 +T + + + S + + + Sbjct: 208 ASTTSIRRPEPVGHR-----SQEMDLWSTTTYTSSQNRPGADPGIQRQDPSGAAFEKPVR 262 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 + D + K+R ++ L L MG++ Y Sbjct: 263 ADVSLDCKIDLSFLIDGSTSIGKRRFRIQK-----QLLAGVAQALDIGPAGPLMGVVQYG 317 Query: 261 TRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + ++ + + ++ A+ + S + Sbjct: 318 DNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAISFVTKNFFSKANGN------ 369 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ + DG + + A+E+ I I I++ + ++ Sbjct: 370 -------RSSAPNVVVVMVDGWPTD-----KVEEASRLARESGINIFFITVEGAAENEKQ 417 Query: 379 L---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 C ++ Y ++V + L Q + + + Sbjct: 418 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 460 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 66/229 (28%), Gaps = 34/229 (14%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +P + + L++D+SGSM +++M+A K A Sbjct: 22 IAVPTTAVADEISATAPKVNLLLDVSGSMRAKD--------------IDGQSRMSAAKQA 67 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 LD+ + R + + P D Sbjct: 68 FNEVLDATPKEVQLGIRTLGADYPGDDR--------------KTGCKDTAQLYPVGPLDR 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 T A K A L + + K I+ ++DGE+ + ++ Sbjct: 114 TEA-KTAVATLAPTGWTPIGPALLKAADDLEGGEGTKRIVLISDGEDTCAPLDPC--EVA 170 Query: 355 DKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + I I T+ + + + L ++ + ++ + D L Sbjct: 171 REIAAKGIGLTIDTLGLVPNAKLRVQLSCIAEATGGTYTSIEHRDELTD 219 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 43/319 (13%), Positives = 84/319 (26%), Gaps = 34/319 (10%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + S + D + + S+S + E +N + Sbjct: 1261 NVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLTMKVPQEDAKNINLQIEAKATE 1320 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRL-QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 A + D N + + + + ++N + I+ ++ + + Sbjct: 1321 ARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAGKDILIGDTGGTQLNVQAGKNY 1380 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I LVVD SGSM A S +++ LK+AL DS+ Sbjct: 1381 NIALVVDTSGSMKEASGSKT--------AWGTTISRIDLLKDALKNLADSLKGHDGKINV 1432 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + + + +T+ T+ A + + Sbjct: 1433 SIIDFDTNAKEPITFNDLTS--KNISDLITKIDALKAEGGTNYEDAFLKTTSWFDTQSVT 1490 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN----------VNTIKICDKAK--E 359 ++ FLTDG+ N + D K Sbjct: 1491 YGKAQG-----------YENLTYFLTDGDPTFSNRNTTSTGTTTEYSDMKDAVDAFKTLS 1539 Query: 360 NFIKIVTISINASPNGQRL 378 + I I N L Sbjct: 1540 GQSTVHAIGIGNGINENYL 1558 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 55/198 (27%), Gaps = 34/198 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + + KM A K A+ +LD I + + E + S G Sbjct: 159 DASGSMAGKVNGQVKMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCG 215 Query: 273 TEKV---------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +V Q+ T A++ Sbjct: 216 SSEVMYPLQPYKKEQFNAALSKFGPKGWTPLASAIESVNDDFKE---------------- 259 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINA-SPNGQRLLK 380 + ++DGE + ++ + I + + Q+L+ Sbjct: 260 YTGEENLNVVYIVSDGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLMN 316 Query: 381 TCVSSPEYHYNVVNADSL 398 T + + V NAD L Sbjct: 317 TAEAGKGNYATVSNADEL 334 >gi|111114995|ref|YP_709613.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|216263812|ref|ZP_03435806.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] gi|110890269|gb|ABH01437.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|215979856|gb|EEC20678.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] Length = 333 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + + + + Sbjct: 110 SSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + A+ ++I +I I +S K Y SL V+ Sbjct: 205 --SDEIYKDQVINLAQGLNVRIYSIGIGSSEEFSVEFKL---RSGKFYQ----GSLKEVY 255 Query: 403 Q-----NISQLMVHRKYSV 416 IS YSV Sbjct: 256 DPSMLVEISNKTGGLFYSV 274 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 56/167 (33%), Gaps = 21/167 (12%) Query: 233 NALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 N L F+ + ++ +GL+ Y++R E S + Sbjct: 549 NRLKNFVTMVSLQFDINRDVTQIGLVTYSSRPETVFGLDTHDSGSSLLQGIGRASYMGGS 608 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A+ + Y + + +K K ++ +TDG + Sbjct: 609 ASTGSALLRVYNDVMTVQKG-------------ARPGVNKAVVVITDGRGAEDAAVP--- 652 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 K ++N I + I I + LL+ S ++ +V + +SL Sbjct: 653 --AQKLRDNGIMVYVIGIG-NIQRNSLLRL-AGSEKFLISVPSYESL 695 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 17/172 (9%), Positives = 46/172 (26%), Gaps = 23/172 (13%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L + + +G + Y+ + K PT++ A+K Sbjct: 78 DALDIGSDLIRVGAMQYSGAPQVEFRLDSSFSKAAIKEKIKSIVFKGGPTETGLALKYIV 137 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + K +I ++DG++ + K I Sbjct: 138 ----------WKGFPGGRPAS-----VPKILIIVSDGKSQGNIKLP-----AAQIKGEDI 177 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 ++ T+ + + + + + D ++ + + Sbjct: 178 EVFTVGVKFPRWEELHALSSEPQEAHVLFAEHVD---DAVNGLATSLTNSSL 226 >gi|32477499|ref|NP_870493.1| hypothetical protein RB12409 [Rhodopirellula baltica SH 1] gi|32448053|emb|CAD77570.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 1032 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 46/117 (39%), Gaps = 8/117 (6%) Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-KFIIFLTDGENNNFKSNVNTIKICD 355 AM A +T F + K +I ++DG+ ++V Sbjct: 526 AMLAAVGRMTPGDMPEFDPAMRMAVTGLVRTDASVKHLIIISDGDPGPPSNSVI-----Q 580 Query: 356 KAKENFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K+N I I T+++ + + +RL ++ +Y V + +L +FQ ++ + Sbjct: 581 AFKDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRALPGIFQREARRVT 637 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 60/193 (31%), Gaps = 20/193 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 ++ + ALL LD I H + G +T + ++ R++ Sbjct: 291 SGSMSGREIQQTREALLKILDDIKEDDH-FNFILFGSDVHTWKETLIKATPENLDEARKF 349 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V T+ M + +L + + + II LTDG Sbjct: 350 VRGID---TKGLTNLYGGMMKGIDMLNAAHEGNLV-----------PKRSASIIIMLTDG 395 Query: 340 ENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVN 394 + N SN I+ KA E + + + L K + + + + + Sbjct: 396 QPNVGISNTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDS 455 Query: 395 ADSLIHVFQNISQ 407 A L + +S Sbjct: 456 ALQLQGFYDEVSN 468 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 214 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 322 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 323 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 375 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 376 FTQKALRDAGSRLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 418 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 419 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 478 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 479 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 537 >gi|229825155|ref|ZP_04451224.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] gi|229790527|gb|EEP26641.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] Length = 1109 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 79/294 (26%), Gaps = 34/294 (11%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFI-EHLLNQRYNQKIVS----FIPALLRIEMGERPI 190 + + + +++ + + L+ + + Sbjct: 71 EMVVKGDGSTTIEQKNLDVVLVVDRSYDNMRKNGRMAAAKAEAAKLVDYFLQSGNNKIRV 130 Query: 191 FLIEL----------VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 L+ V+ ++ A + A A + Sbjct: 131 GLVSFAGNNGGSPSPVLGVTQLTQDADELKNAIRGYNTAGWNNSPVLREAFTQA--GLIK 188 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ + + ++ + + + + ++ + + + Sbjct: 189 ANEMFGASNTNKKIIVLISGGAPTISYGLTLDFNQREEALSNT----PKEGYEIWSLTRW 244 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-E 359 L S+ + + + + + + NTI +K K + Sbjct: 245 ----LDPKYHSSWKKGYHKS-NEWTPYYVKLVKGLIDVPMGKTAEVKTNTIAEANKIKTD 299 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++I ++ I A + +LK S +Y V N L V I +++ K Sbjct: 300 SGVEIFSVGIKADGDAAEVLKKIASD-NRYY-VDNI--LKDV---IDKVIESLK 346 >gi|91201645|emb|CAJ74705.1| hypothetical protein kuste3942 [Candidatus Kuenenia stuttgartiensis] Length = 336 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 81/265 (30%), Gaps = 80/265 (30%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I V+D+S SM + +D K ++ K + + + ED Sbjct: 93 IVFVLDVSMSM---------------LAEDVKPNRLECAKMEIANLVRGL-------EDD 130 Query: 253 YMGLIGYTTRVEKNIE-PSWGTE----KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GL+ + R + P+ E ++ V + T+ A+ A + ++ Sbjct: 131 RVGLVVFAARAFSLLPYPTKDYEMVFLRILNMVNEHYVRFVPYGTNIGNALIAAMETFSN 190 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + +K II LTDGE + + I ++N I I Sbjct: 191 EAG-------------------KKIIILLTDGEEQLLRRSQVVEAIRLLLEKNDISTYII 231 Query: 368 SINASPNG------------------------------QRLLKTCVSSPEYHYNVVNADS 397 I N + L + + + + Sbjct: 232 GIGDPNNSTSIPKRDRLGNKSGYERTEDGEIIYTKPDPKLLREIAEMAGGSYQHDATGAE 291 Query: 398 LIHVFQNISQL----MVHRKYSVIL 418 L ++F+ + + + +K ++I Sbjct: 292 LQNIFKQVIETNKDIIGVKKKNIIY 316 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMVLKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|219684622|ref|ZP_03539565.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219685812|ref|ZP_03540621.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] gi|219671984|gb|EED29038.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219672645|gb|EED29675.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] Length = 333 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + + + + Sbjct: 110 SSKNRLEFSKELIRRFISQ-------RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + A+ +KI +I I +S K Y SL ++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFSVEFKL---RSGKFYQ----GSLKEIY 255 Query: 403 Q-----NISQLMVHRKYSV 416 IS YSV Sbjct: 256 DPSMLVEISHKTGGLFYSV 274 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 6 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 64 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 65 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 124 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 125 YFFLGPSRTVSATSTAKATNVD 146 >gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 [Mus musculus] Length = 941 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|198418440|ref|XP_002122148.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1270 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 53/169 (31%), Gaps = 23/169 (13%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 K +V +GL ++ +++ I S + + T A+ +A Sbjct: 211 SSDIDKHNVVVGLYSFSNIIKREIPLSARTYSTLSGMIDAV--RYPYGQTFIHTAINEAV 268 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 Q + + K +I +TDGE + + I Sbjct: 269 Q----------------EYQRAGRASVPKLLIVITDGEATVPSAVAPSANAARA---AGI 309 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + I +S N L + E + V + SL + + +++ Sbjct: 310 ILTAVGIGSSVNENELTTI-AGAAERVFRVSDFSSLGSILAPLQKVITD 357 >gi|151357765|emb|CAO78005.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 903 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus musculus] Length = 943 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 271 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 310 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 311 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 366 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 367 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 415 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 416 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 461 >gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus musculus] Length = 927 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 271 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 310 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 311 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 366 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 367 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 415 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 416 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 461 >gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus musculus] Length = 904 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 271 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 310 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 311 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 366 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 367 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 415 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 416 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 461 >gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus] Length = 902 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus] Length = 941 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 [Mus musculus] gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus] gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus musculus] gi|151357764|emb|CAO78004.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 941 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus] Length = 942 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/226 (12%), Positives = 57/226 (25%), Gaps = 40/226 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ L + Sbjct: 269 PTMSKNVIFVIDKSGSM--------------------SGKKIQQTREALVKILKDLSPQD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K Y +R T+ A+ A ++L Sbjct: 309 Q-FNLIEFSGEANQWKQSLVQATEENLNKAVNYASRI---RAHGGTNINNAVLLAVELLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + + II LTDG+ + +A + Sbjct: 365 RSNQAELLPSKSVS-----------LIILLTDGDPTVGETNPTIIQNNVREAINGQYSLF 413 Query: 366 TISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + N L K + + + + +A L + ++ Sbjct: 414 CLGFGFDVNYPFLEKMALDNGGLARRIYEDSDSALQLQDFYHEVAN 459 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 62/184 (33%), Gaps = 28/184 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +KN + D + L +G+I Y+ +VE+ + + K D Sbjct: 668 FQIIKNFVKTLTDGVALDLAT---ARIGIINYSHKVEEVAHLTQFSSKDALKRAVDNMQY 724 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A+ A ++ + +K + +TDG+ + + Sbjct: 725 LGEGTYTATALHAANRMFEAS-----------------RPEVKKVALVITDGQ-TDTRDE 766 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSLIH 400 N + A + ++I I + + + + + P+ + Y + +L Sbjct: 767 KNLTDVVKNASDINVEIFVIGVVKKNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITLQD 826 Query: 401 VFQN 404 + Sbjct: 827 TLKQ 830 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 60/165 (36%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D + L+ V D+ + + +++ V+ + S +Q V Sbjct: 65 FDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDLHTFKQRVKS 124 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 I + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 125 MN--FIGQGTFSYYAIANATRLLKREGRKDSV----------------KVALLMTDGIDH 166 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ I +TI ++ N +L +SP Sbjct: 167 PKNPD--VQSISEDARNAGIIFITIGLSTVVNETKLHLISGNSPG 209 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/389 (8%), Positives = 74/389 (19%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGTQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + + + + I + +M S Sbjct: 271 VLVVDWSGSMNENNR----IGEVQKGVNRFVDTLADSGITNNI----NMGYVGYSSDSYN 322 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 N+A K +KN D L + + + G Sbjct: 323 NNAIQMGPFDTVK-NPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVP 381 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 V+R Q + + K I Sbjct: 382 TFSYKVSRVQTE--------ADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRIN 433 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 434 ----------STFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|323491534|ref|ZP_08096713.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] gi|323314110|gb|EGA67195.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] Length = 411 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/158 (9%), Positives = 41/158 (25%), Gaps = 16/158 (10%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN---- 71 +K ++ +++++ + + I V K ++++ +AA LA A M S Sbjct: 12 RRQKGLVLVMVTVAMLALVGVAALAIDVSHAVLNKARLQNSVDAAALAAAIVMDSEGTNA 71 Query: 72 ----------LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + S + + + + S Y + Sbjct: 72 EATTAANTTLTNLASATGNSEMDFSSSTVTVQYSNDPTIFPQTSGYDADLDTYVRVTVSA 131 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + ++ S + + Sbjct: 132 YSLDNFFAYLFGVDKQLVS--SAVAGPSPGIDVVNVVP 167 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 65/227 (28%), Gaps = 51/227 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 RP +ELV+D+SGSM +++M+A K A +D++ Sbjct: 28 SARPAPKVELVLDVSGSMKTRD--------------IDGQSRMSAAKQAFNDVIDAVPEE 73 Query: 246 SHVKEDVYMGLIGYTTRV---------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + + + V T P Sbjct: 74 VELGIRTLGADYPGEDKARGCKDTRQLYPVGPI--DRTEAKTAVATLS---PTGWTPIGP 128 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A L + I+ ++DGE+ + ++ + Sbjct: 129 ALLGAADDLD------------------GDEGGSRRIVLISDGEDTCGPLDPC--EVARE 168 Query: 357 AKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + I T+ + + ++ L ++ + V + + L Sbjct: 169 IAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKEDLSD 215 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 214 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 322 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 323 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 375 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 376 FTQKALRDAGSMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 418 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 419 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 478 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 479 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 537 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 65/227 (28%), Gaps = 51/227 (22%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 RP +ELV+D+SGSM +++M+A K A +D++ Sbjct: 29 SARPAPKVELVLDVSGSMKTRD--------------IDGQSRMSAAKQAFNDVIDAVPEE 74 Query: 246 SHVKEDVYMGLIGYTTRV---------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + + + V T P Sbjct: 75 VELGIRTLGADYPGEDKARGCKDTRQLYPVGPI--DRTEAKTAVATLS---PTGWTPIGP 129 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A L + I+ ++DGE+ + ++ + Sbjct: 130 ALLGAADDLD------------------GDEGGSRRIVLISDGEDTCGPLDPC--EVARE 169 Query: 357 AKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + I T+ + + ++ L ++ + V + + L Sbjct: 170 IAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHKEDLSD 216 >gi|315644678|ref|ZP_07897809.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315279937|gb|EFU43236.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 562 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 51/161 (31%), Gaps = 20/161 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ D +G + Y+T V N+ + + +V T + + Sbjct: 413 LTGQKYLGRDNSIGFVSYSTDVAINLPIGKYDLNQQSMFVGAINSLEASGNTATFDGIVV 472 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + + + I L+DGE N S I + K Sbjct: 473 AMKMLHDEMA--------------VNPDVKPLIFVLSDGETNVGHS---LDDIRELIKAF 515 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + TI + N Q L + N D + + Sbjct: 516 KIPVYTIG--YNANIQALQSISSINEAASINADTDDVVYKI 554 >gi|297667858|ref|XP_002812180.1| PREDICTED: vitrin-like isoform 1 [Pongo abelii] Length = 693 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 526 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 585 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 586 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 626 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 627 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 682 Query: 409 MVHR 412 + Sbjct: 683 ICTE 686 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/143 (9%), Positives = 38/143 (26%), Gaps = 22/143 (15%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDST 295 L MG++ Y + + ++ + + ++ Sbjct: 331 LAAVAQALDIGPAGPLMGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVG 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + S + ++ + DG + + Sbjct: 389 RAISFVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD-----KVEEASR 430 Query: 356 KAKENFIKIVTISINASPNGQRL 378 A+E+ I I I+I + ++ Sbjct: 431 LARESGINIFFITIEGAAENEKQ 453 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 49/448 (10%), Positives = 103/448 (22%), Gaps = 115/448 (25%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 N D E+ + ++ G S + + + + Sbjct: 80 QNDGSEADEAETGQVADPSSSRSVNVLRAGGLVRGDTGGTSGGEGDLVLAKSLETDGQGD 139 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN-----------QRYNQKIVSFIP 178 + S + +I L + + Y+ K + + Sbjct: 140 KVITLEAYASGSMSIVEQGI-PCDIVLVLDQSGSMDDRFGSQGSYHALSGYSNKRLGDLA 198 Query: 179 ALLRIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKR----------- 225 + + + + VD+SG S+ + + + + Sbjct: 199 ENGNLYVRGGDGSYVAVDVDVSGFISLKYSYTWEGLEAPLTSEGRYTVPQFEGVTFYSFQ 258 Query: 226 -----TKMAALKNALLLFLDS--------------------------IDLLSHVKEDVYM 254 T++AALK+A F+ S + + V Sbjct: 259 TDQTVTRIAALKDAANSFVQSVRSNSLGEDGIAGTVDDVPHRIAVIGFASGDNTELFVGS 318 Query: 255 GLIGYTTRVEKN--IEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 Y + T + + T + A Sbjct: 319 ASYPYGQSAQSQYGSAFQDMTTEMGFDNALASIGQLSADGGTLVDDGLDMA--------- 369 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN----NNFKSNVNTIKICDKAKENFIKIVT 366 F + + + LTDG N+ I + K + + Sbjct: 370 ----NGVFSANPLVTGELRNRVTVVLTDGAPGLYGNDRGVANEAISQASELKTAGSTVFS 425 Query: 367 ISI------------------NASPNGQRLLKTCV--------------------SSPEY 388 I I + + + + SSP+Y Sbjct: 426 IGIFPGADASGDLPDQSAMGWGDNDSNRFMHLLSSNYPDASSMGSPGARFVDEEGSSPDY 485 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + ++ L +FQ+ISQ V Sbjct: 486 YLSASDSAGLNSIFQSISQSTGGSSVDV 513 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/397 (7%), Positives = 80/397 (20%), Gaps = 70/397 (17%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRY--------NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + + + N+ + + + + + + + + Sbjct: 271 VLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSSITNNINMGYVGYSSDGYNNNAIQMGP 330 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ++ + + + T L + + V + G+ ++ +V + Sbjct: 331 FDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFSYKVSRV 390 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + G Q+ R T + Sbjct: 391 QTEADGRFYGTQFTNRQDQ---PGSTSYISG-------------------------SYNA 422 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT----- 381 + + + TI K+ I+I + I + + L Sbjct: 423 PDQNNINKRIN-------STFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVED 475 Query: 382 -------CVSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 476 KMREMVSADENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|304310230|ref|YP_003809828.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] gi|301795963|emb|CBL44164.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] Length = 371 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/240 (9%), Positives = 75/240 (31%), Gaps = 10/240 (4%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + +G + + + + + ED +S + + + + + Sbjct: 59 AAVSPEGLRIVGNANPVQVFVPTQNNSGLDISDFQLREDGSSQTLVSAEPNSSLPFVTVF 118 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 ++ + S+ + + G+ + ++ V ++ R T + Sbjct: 119 VMDYSPSVRNNETALKRMEEAAKGFVSLMQP-----KDKAAVIKFNDRVEVMGA-GLTSN 172 Query: 295 TPAMKQAYQILTS--DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A + + + V + II L+DG++ + + Sbjct: 173 HDTLNAAIDSIPPQRGYSKLYDAISKAIEVSNCNPKLVCSIIVLSDGDDVGSALPLADLH 232 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + + I + N +L + +S +Y ++ V++ I + + Sbjct: 233 --NQLYQAGTAVFPIGYGDNINVNKLQELATNSGGAYYTSEDSSQFSAVYERIWARLSNE 290 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/326 (10%), Positives = 81/326 (24%), Gaps = 34/326 (10%) Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + + +T + + + + I S L Sbjct: 56 QVIYNPEFQTSDSTLLSYRKSIEQSDKDTFRITLEAVTGQDIKTIGGKSGSIVLVLDNSG 115 Query: 163 HLLN----QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + + + LV + S ++ Sbjct: 116 SMGWGSSPTPADYARDALKEFANEFLKNGNSGNKLGLVT-------YSSGSGVPIYDAYD 168 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + M + +D + + DV MG+ + T Q Sbjct: 169 EIKYVQGYSMTENSDIFGQVVDGLQT-PSGETDVQMGIKTARDI------LAADTSGNPQ 221 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ D + T A + ++ + + Q ++ Sbjct: 222 FILVFSDGATNRSARPTSAGELGGANISPCTFGDKIYDMTFKFDGF-DYGAQGSAVYNDG 280 Query: 339 GENNNFKSNVNTIKICDK---AKENFIKIVTISINASPNGQRL--------LKTCVSSPE 387 N N +T+ + AK+ I I ++ +P L +K SS Sbjct: 281 VYTTNGNVNAHTVAAVSEALLAKDQGIDIYSV-FYHNPALNDLEYGAGVFVMKNSASS-G 338 Query: 388 YHYNVV--NADSLIHVFQNISQLMVH 411 + + NA + +F I + + Sbjct: 339 QYTEISPGNAGAFAEIFTEIEKQIQE 364 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 23/184 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + ++ +K L +D + + +G++ Y +RV+ + K Sbjct: 45 SSRSVRPSEFEQVKVFLAKVIDGLS---VGPDATRVGVVNYASRVKNEVSLKSHKTKAAL 101 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + A++ A + F S K I +TD Sbjct: 102 VKAVSKIEPLSTGTMTGLAIQFAMNV-----------AFSEAEGGRKSPDISKVAIIVTD 150 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNAD 396 G + N I +A+E I+I I + L+ S P ++ V + Sbjct: 151 GRPQD-----NIRDIAARAREAGIEIFAIGVGRVD--MTTLRQMASEPLEDHVDYVESYS 203 Query: 397 SLIH 400 + Sbjct: 204 LIEK 207 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|239941305|ref|ZP_04693242.1| hypothetical protein SrosN15_09946 [Streptomyces roseosporus NRRL 15998] Length = 516 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 64/220 (29%), Gaps = 43/220 (19%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 ERP + VVD+SGSM ++ ++ +L + D + Sbjct: 161 TSERPPAALTFVVDISGSMAE-------------------TGRLDLVRKSLTILADELRD 201 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + L+ ++ E + + + + + + T+ +K Y Sbjct: 202 DDS------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQPAQSTNVEAGIKLGY-- 253 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFI 362 + V+ ++ L+D N ++ KI +E I Sbjct: 254 --------------EESVEGHREGATNRVVLLSDALANTGETEAEGILKKIDSARREYGI 299 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + + + + V + VF Sbjct: 300 TLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 339 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|219847682|ref|YP_002462115.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541941|gb|ACL23679.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 63/238 (26%), Gaps = 45/238 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q + + A+ + V+D SGSM Sbjct: 19 SSTPQVVYLLVEAVAPASPTSALPLNLCFVLDRSGSMQ--------------------GA 58 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ ++K A ++ + ++ + V+ I + + Sbjct: 59 KLESMKAATRRVIELLRPHDVA------AIVIFDDTVQTLIPATP-VGDRSALLAAVETI 111 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + M+ A L K ++ LTDG+ + Sbjct: 112 TEAGGTAMSLGMQAAQTELQ----------------KHLGPDRISRMLLLTDGQTWGDEP 155 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++I + + N Q L +S Y + + + FQ Sbjct: 156 IC--RDLARTLGQAGVRITALGLGTEWNEQLLDDIAAASDGYSDYIADPAQIETFFQQ 211 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|54296651|ref|YP_123020.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] gi|53750436|emb|CAH11830.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] Length = 1169 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/273 (9%), Positives = 72/273 (26%), Gaps = 28/273 (10%) Query: 159 QFIEH-LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 F N S P++ G + + + N+ Sbjct: 272 NFSLSNYNNGNVLLSYSSTRPSIGSFATGPTNAGFVPFSQQVMYVQRGFGYYSNQSFNTG 331 Query: 218 PICQD----KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + ++ NA+ FL + ++ + + Sbjct: 332 NMRVNMTTAGTNPTTTSVTNAINAFLPHLRPETNSTSTTEIKA------AAVQSPLAGLL 385 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-VKIPSLPFQKF 332 + R + + T KQ +++ + + + Sbjct: 386 TRSRSF------MKTVGTTSGNCPQKQYIILISDGLPTQDLQARYWPPLGSAAATGYGVT 439 Query: 333 IIFLTDG--ENNNFKSNVNTIKICDKAKENFIKIVTISINASPN------GQRLLKTCVS 384 F DG + N ++ + I + K + + I + A + L+ Sbjct: 440 ATFNADGSLNSTNSQALSDAINEINALKNDGVLTFVIGMGAGVDPAVNPEAAATLRAMAV 499 Query: 385 SPE--YHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +Y + +L++ +I + + +S Sbjct: 500 AGGTENYYPATSPQALVNSLNSILSNIQNGSFS 532 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEAISQTI 232 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/262 (14%), Positives = 83/262 (31%), Gaps = 58/262 (22%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F + Q + I + + + L +D+SGSM Sbjct: 62 NFSLSVEGQPIKDFSLEPISSA-------KSPLSVVLAIDVSGSM--------------- 99 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K MA + A +FLD ++ ++ LI + V + + +VR+ Sbjct: 100 -----KGEPMAEARKAAAIFLDELEKDD------HVALITFGQGVYHLSDFAAKKHEVRE 148 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T A +A + + + + +I LTD Sbjct: 149 QLQHLEA--KEQWTWLYQATLEAME------------------RAVQAPTTRAAVILLTD 188 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G++ + + D+ K + I + + L K +S + A L Sbjct: 189 GKDEGSPVSE--EAVLDRIKGAQVPIYAMGFGSKAQVDYLQKVASASQGAFLSTPQAADL 246 Query: 399 IHVFQNISQLMVH---RKYSVI 417 +++Q + + + +++ Sbjct: 247 TNLYQTVLDYLKNQYILRFTYE 268 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 ISKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 7 IARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 66 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 125 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 126 YFFLGPSRTVSATSTAKATNVD 147 >gi|52840865|ref|YP_094664.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52627976|gb|AAU26717.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 1172 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/273 (9%), Positives = 72/273 (26%), Gaps = 28/273 (10%) Query: 159 QFIEH-LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 F N S P++ G + + + N+ Sbjct: 275 NFSLSNYNNGNVLLSYSSTRPSIGSFATGPTNAGFVPFSQQVMYVQRGFGYYSNQSFNTG 334 Query: 218 PICQD----KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + ++ NA+ FL + ++ + + Sbjct: 335 NMRVNMTTAGTNPTTTSVTNAINAFLPHLRPETNSTSTTEIKA------AAVQSPLAGLL 388 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG-VKIPSLPFQKF 332 + R + + T KQ +++ + + + Sbjct: 389 TRSRSF------MKTVGTTSGNCPQKQYIILISDGLPTQDLQARYWPPLGSAAATGYGVT 442 Query: 333 IIFLTDG--ENNNFKSNVNTIKICDKAKENFIKIVTISINASPN------GQRLLKTCVS 384 F DG + N ++ + I + K + + I + A + L+ Sbjct: 443 ATFNADGSLNSTNSQALSDAINEINALKNDGVLTFVIGMGAGVDPAVNPEAAATLRAMAV 502 Query: 385 SPE--YHYNVVNADSLIHVFQNISQLMVHRKYS 415 + +Y + +L++ +I + + +S Sbjct: 503 AGGTENYYPATSPQALVNSLNSILSNIQNGSFS 535 >gi|73976417|ref|XP_539432.2| PREDICTED: similar to Matrilin-2 precursor [Canis familiaris] Length = 524 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 65/184 (35%), Gaps = 28/184 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + D + L +G+I Y+ +VEK + + K + D Sbjct: 195 FEIIKSLVKTLSDQVALDLAT---ARIGIINYSHKVEKVAHLTQFSNKDDFKLAVDNMQY 251 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A+ +A + + +K + +TDG+ + + Sbjct: 252 LGEGTYTATALHEANHMFEA-----------------ARPGVKKVALVITDGQ-TDSRDE 293 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSLIH 400 N ++ +A + ++I I + + + + + P+ + Y + +L Sbjct: 294 KNLTEVVKRASDINVEIFVIGVAKKNDPNFEMFHKEMNLIATDPDSEHVYQFDDFITLQD 353 Query: 401 VFQN 404 + Sbjct: 354 TLKQ 357 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V VR V T++ Sbjct: 199 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVRDVVEAASHIEQRGGTETRT 257 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K ++ +TDGE+++ I+ ++ Sbjct: 258 AFG-------------IEFARSEAFQKGGRKGAKKVMVVITDGESHDSPDLEKVIQQSER 304 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 305 ---DNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 361 Query: 407 QLMVHR 412 + Sbjct: 362 DRIFSL 367 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V VR V T++ Sbjct: 168 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVRDVVEAASHIEQRGGTETRT 226 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A K +K ++ +TDGE+++ I+ ++ Sbjct: 227 AFG-------------IEFARSEAFQKGGRKGAKKVMVVITDGESHDSPDLEKVIQQSER 273 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 + + +++ N + L K S P+ + +NV + +L + + Sbjct: 274 ---DNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALG 330 Query: 407 QLMVHR 412 + Sbjct: 331 DRIFSL 336 >gi|326922361|ref|XP_003207417.1| PREDICTED: collagen alpha-3(VI) chain-like [Meleagris gallopavo] Length = 3135 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 16/176 (9%), Positives = 54/176 (30%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + +GL+ ++ + + + + R + Sbjct: 665 DFVVTLVNNLDVGTDKIRVGLVQFSDTPKTEFSL-YSYQTKSDIIQRLGQLRPKGGSVLN 723 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 S +F G + + ++ +T G + + +++ + Sbjct: 724 TG---------SALNFVLSNHFTEAGGSRINEQVPQVLVLVTAGRSADP-----FLQVSN 769 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + T ++ + L+ +P Y V + L + Q +++ + Sbjct: 770 DLARAGVL--TFAVGVRNADKAELEQIAFNPRMVYFVDDFSGLTALPQELNKPITT 823 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--D 293 ++ I+ L + +++G++ Y+ + + + K + + Sbjct: 262 DFLVNLIESLRVGAQQIHIGVVQYSDQPRTEFALNSYSTKA-DVLDAVKALSFRGGKEAN 320 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + L F + G + ++ ++ GE+++ Sbjct: 321 TGAALEYVVENL-----------FTQAGGSRIEEAVPQILVLISGGESSDDIREGLL--A 367 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +A I + SI L+ + + + ++ +L Sbjct: 368 VKQAS-----IFSFSIGVLNADSAELQQIATDGSFAFTALDIRNLAA 409 >gi|331266438|ref|YP_004326068.1| hypothetical protein SOR_1070 [Streptococcus oralis Uo5] gi|326683110|emb|CBZ00728.1| conserved hypothetical protein [Streptococcus oralis Uo5] Length = 863 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/301 (9%), Positives = 67/301 (22%), Gaps = 29/301 (9%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ---KIVSFIPALLRIEMGERPIFL 192 LD ++ + NQ + +S I AL G Sbjct: 62 SLDITSKLGTETQTDPLDVVLVADLSGSMQNQDVQSFDGRTISRIDALKNTLRGTNGRKG 121 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + +M +++ + Q K + + + Sbjct: 122 LIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFR--PYWPYERMTKYYDGVSPW 179 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKR 311 + + T+ + A L + Sbjct: 180 DDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAGLTLA-NQLMGSARS 238 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----------- 360 + ++ + + + N + D+ N Sbjct: 239 NAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPT 298 Query: 361 --FIKIVTISINASPNGQRLL-------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +I S N + + S P + + D L F+NI+ ++ Sbjct: 299 LDG--FYSIKFRYSNNVDSITSLQYYIRQHNASIPNEILSANDEDQLRDSFKNITDKILP 356 Query: 412 R 412 Sbjct: 357 L 357 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|295789107|ref|NP_001171441.1| vitrin isoform 3 [Homo sapiens] Length = 657 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 490 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 549 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 550 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 590 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 591 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQN 646 Query: 409 MVHR 412 + Sbjct: 647 ICTE 650 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 23/305 (7%), Positives = 67/305 (21%), Gaps = 24/305 (7%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + A + ++ + ++ +++ Sbjct: 137 ESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVAT 196 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 L + + + + + S + + Sbjct: 197 PTTLPRPSPSAASTTSIPRPQS-VGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGA 255 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + + D + K+R ++ L D L Sbjct: 256 AFQKPVGADVSLDCKIDLSFLIDGSTSIGKRRFRIQK-----QLLADVAQALDIGPAGPL 310 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 MG++ Y + T + + ++ A+ + S + Sbjct: 311 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN- 369 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ + DG + + A+E+ I I I+I + Sbjct: 370 ------------RSGAPNVVVVMVDGWPTD-----KVEEASRLARESGINIFFITIEGAA 412 Query: 374 NGQRL 378 ++ Sbjct: 413 ENEKQ 417 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + + D +G + YT K Sbjct: 485 FRTVLQFVANISKEFEISETDTRVGAVQYTYEQRLEFGFDDYHTKSDILNAIKRVGYWSG 544 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + Sbjct: 545 GTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGRSYDDVRIPAM 588 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A + I + + Q L+ + P ++ + V D+L + + Sbjct: 589 V-----AHHKGVTTYAIGV--AWAAQDELEVIATHPASDHSFFVDEFDNLYKFVPKVIRN 641 Query: 409 MVHR 412 + Sbjct: 642 ICTE 645 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 14/145 (9%), Positives = 38/145 (26%), Gaps = 22/145 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D L MG++ Y + ++ + + ++ Sbjct: 288 QFLADVAQTLDIGPGGPLMGVVQYGDNPAAQFSLRTHMNSRDLKTAIDKITQR--GGLSN 345 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + S + + + DG + + Sbjct: 346 AGRAISYVTKNFFSKANGNRGGA-------------PNVAVVIVDGWPTD-----KVEEA 387 Query: 354 CDKAKENFIKIVTISINASPNGQRL 378 A+E+ I I I++ + ++ Sbjct: 388 SRLARESGINIFFITVEGATENEKQ 412 >gi|298705152|emb|CBJ28595.1| von Willebrand A domain containing protein [Ectocarpus siliculosus] Length = 382 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 50/178 (28%), Gaps = 28/178 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 K + S + +++ + + E +V D+ Sbjct: 117 WELSKEFAKNTVASFAEQNLFTNGGSASFAQFSSDASEGGTF-YSLEDFNAFVDA--DTE 173 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD + + ++L++ F+I TDG + Sbjct: 174 YSGGTDIIAGIAKGRELLSAS------------------PTTTSFMIVTTDGAAPD---- 211 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 D A+ + + + + P+ + LL +++ N D L +I Sbjct: 212 --PQDEADAARAEGTILYAVGVGSGPSEEILLAI-GGDEANVFDIDNFDELDVALDDI 266 >gi|296135239|ref|YP_003642481.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295795361|gb|ADG30151.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 336 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 16/163 (9%), Positives = 42/163 (25%), Gaps = 32/163 (19%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ L+ + + + P++ Q + T A+ Sbjct: 133 FADFARARQGDRFALVAFGSHAATLLPPTFDARAAGQMAGLLAVGQLGPDTALGDAIALG 192 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + +I TDG +N + + A+ Sbjct: 193 LRQ------------------AGALHGLKPVLILYTDGGQSNTGAISPADAVA-LARHLG 233 Query: 362 IKIVTISINASPN-------------GQRLLKTCVSSPEYHYN 391 ++I T+ + +P+ L ++ Y Sbjct: 234 VRIYTVEVGTTPDPGRPYTVPAYAGPQPDLRLIAEATGGRFYF 276 >gi|213966336|ref|ZP_03394518.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] gi|213951042|gb|EEB62442.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] Length = 543 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 58/186 (31%), Gaps = 24/186 (12%) Query: 225 RTKMAALKNALLLFLDSIDLLSHV----KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 ++A L+ + +D + ++ ++T V + + + R + Sbjct: 373 GDRIADLQATMRAIVDGSARTETGSVGLRNREIATILPFSTSVGEPTTTTIDGPESRAQL 432 Query: 281 TRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T +D T A+ QA+ +L S K S + I+ LTDG Sbjct: 433 TAAVDGLYAEGETALYDALIQAFDLLGSSDKNSIPS-----------------IVVLTDG 475 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + K+ N + I + + + + + L Sbjct: 476 QVTSGKTFSEFRDFYQSKGGNLPPVFVIRYG-EADPGEMQELANLTGGKVFESR-ETELA 533 Query: 400 HVFQNI 405 VF+ I Sbjct: 534 DVFKEI 539 >gi|170291084|ref|YP_001737900.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] gi|170175164|gb|ACB08217.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] Length = 328 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 51/154 (33%), Gaps = 27/154 (17%) Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G++G+ + P+ + V+ + +D + + T+ + +K+ +I Sbjct: 199 RYGVVGFNSTAFVIKSPAENKDSVK-VIEEILDLVPIGYTNISDGLKKGLEISY------ 251 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + +TDGE N + K+ + K + I T Sbjct: 252 --------------HLKNPKYLLITDGEYNVGED---PRKVARRFKNLCV-IHTRG-KRD 292 Query: 373 PNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 G L K ++ + + + V ++I Sbjct: 293 SRGSVLCKEIARIGGSKYFVIDDIKQIQRVMKSI 326 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 43/122 (35%), Gaps = 24/122 (19%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD + A + K I+ ++DG S Sbjct: 768 GATDIFAGIDIALTKFS------------------NDTKTAKAIVVVSDG----KTSKSK 805 Query: 350 TIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K+ ++AK+ +KI T+S+ + N L++ + +Y ++ L VFQ + Sbjct: 806 MTKVINEAKKQGVKIYTVSMGKKSQINDATLMQVSTETGGAYYYALDNLQLHQVFQKLID 865 Query: 408 LM 409 + Sbjct: 866 AI 867 >gi|58429465|gb|AAW78136.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 65/204 (31%), Gaps = 26/204 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + + +A+ L + I L+ Sbjct: 27 DVQNNIVDEIKYSEEVCNDQVDLYLLMDC-SGSIRRHNWVNHAVPLAMKLIQQLNLNDNA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILT 306 +++ ++ + I K ++ + +L T+ + A+ Q + L Sbjct: 86 IHLYANVFSNNAREIIRLHSDASKNKEKALIIIKSLLNTNLPFGRTNLSDALLQVRKHLN 145 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I + ++ LTDG ++ + + +K K + +KI Sbjct: 146 DR---------------INRENANQLVVILTDGIPDSIQGS---LKESRKLNDLGVKIAV 187 Query: 367 ISINASPNG--QRLLKTCVSSPEY 388 I N R L C S Sbjct: 188 FGIGQGINVAFNRFLVGCHPSDGK 211 >gi|254167839|ref|ZP_04874688.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623130|gb|EDY35696.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 1953 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 50/163 (30%), Gaps = 8/163 (4%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R I V+D SGSM+ + + T++ A + + + Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGD---VNGDGRSNTRIDVAIQAAIDAVKELGPQ 1300 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V + G + E+ + ++ T + ++ ++ D T + A Sbjct: 1301 DRVAVFTFNG----NSHPEEYMGFTYVTADNLQTIISDLKDIQAGGGTPLYDTLSWAVYY 1356 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + +G+ + + + + T + Sbjct: 1357 MDKYSTDNPDREDATRGILVLTDGLSNYDTYGTSNGARYDYAP 1399 >gi|68448495|ref|NP_001020335.1| inter-alpha (globulin) inhibitor H4 [Danio rerio] gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio] Length = 915 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 69/225 (30%), Gaps = 45/225 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + ++D SGSMH ++ ++ALL L +D Sbjct: 265 PRIPKNVVFIIDRSGSMH--------------------GRRIRQTRSALLTILKDLDEDD 304 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 H + I + + E E +V R D T+ A+ ++ Sbjct: 305 HFGLITFDAEIDF-WKRELLQATKANRENAESFVKRIQD---RGATNINDAVLAGVDMIN 360 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIV 365 + P +I LTDG+ ++N+ I +A + + Sbjct: 361 RN----------------PRKGTASILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLY 404 Query: 366 TISINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNIS 406 + N L K + + Y +AD L + ++ Sbjct: 405 CLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQLQGFYDEVA 449 >gi|78222187|ref|YP_383934.1| type IV pilin biogenesis protein [Geobacter metallireducens GS-15] gi|78193442|gb|ABB31209.1| type IV pilin biogenesis protein, putative [Geobacter metallireducens GS-15] Length = 1030 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/376 (9%), Positives = 91/376 (24%), Gaps = 34/376 (9%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A LA A + + ++D++ ++ + Y+ + + Sbjct: 21 AHLAAAPGIARADDTEIYSAGATMKPMVMIIMDNSGSMGDPVPYDNATAYAGTYTKDTVY 80 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR----LQFIEHLLNQRYNQKIVS 175 ++R + ++ D + + N+ Sbjct: 81 QYTCTARNKKGQCTATQWQVYTGVFTDQINRDGGVDVAGQDGIDDS-STALKTGNRLNYE 139 Query: 176 FIPALLRIEMGE---RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +P ++ + + + G M ++ +++A+ Sbjct: 140 ALPTTSKLSTAKGVLNNLVDLMYNDVDFGFMKFNTEDGGNIISKIGATITAMHGQISAIN 199 Query: 233 NAL-----LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG----TEKVRQYVTRD 283 D+ Y + I + + Sbjct: 200 ATTWTPLAEALTDAGKYFEDTYTGQYSPWNSNNWCQKAFIIIVTDGEPTHDTDTTIIGHF 259 Query: 284 MDSLILKPTDSTPAMKQA----YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +D TD+ K Y + ++ + + Q FL D Sbjct: 260 LDRGQTGITDANRGQKWDQDGDYNMHSTTTTDPLNNDVWVADDTYSDTVPQ---TFLDDV 316 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + ++ + TI + + L +T + ++ NA L Sbjct: 317 AKYLYTHDLRPDLQGT----QNVTTYTIG--YAHDSPLLQRTAQNGGGLYFTANNAAELE 370 Query: 400 H----VFQNISQLMVH 411 H +I++ + Sbjct: 371 HSLLMALDDIAKKLQT 386 >gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter fumaroxidans MPOB] gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter fumaroxidans MPOB] Length = 680 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 62/214 (28%), Gaps = 27/214 (12%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G +VV + S+ + + I K+ L + L + Sbjct: 279 GPDSAGTFMVVVTPAASLK-RIAEGVDWTFVLDISGSMTGRKITTLIEGVSRVLGKMSAN 337 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + + T P E V+ ++ R T + AY++L Sbjct: 338 DRFRIVTFNTTAADFTGGYVPASP----ENVQTWMQRVKQIQAGGSTALFDGLDLAYRLL 393 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 ++ I+ +TDG N + K++ +++ Sbjct: 394 DGERTTG--------------------IVLVTDGVCNVG--PTRHDEFLGLLKQHDVRLF 431 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 T I S N + + S + NV +D + Sbjct: 432 TFVIGNSANQPLMDRLAKESGGFAMNVSESDDIA 465 >gi|294339336|emb|CAZ87692.1| putative Von Willebrand factor, type A [Thiomonas sp. 3As] Length = 336 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 17/163 (10%), Positives = 44/163 (26%), Gaps = 32/163 (19%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ L+ + + + P++ Q + T A+ A Sbjct: 133 FADFARARQGDRFALVAFGSHAATLLPPTFDARAAGQMAGLLAVGQLGPDTALGDAIALA 192 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + +I TDG +N + + A+ Sbjct: 193 LRQAGAL------------------HGLKPVLILYTDGGQSNTGAISPADAVA-LARHMG 233 Query: 362 IKIVTISINASPN-------------GQRLLKTCVSSPEYHYN 391 ++I T+ + +P+ L ++ Y Sbjct: 234 VRIYTVEVGTTPDPGRPYTVPAYAGPQPDLRLIAEATGGRFYF 276 >gi|118096903|ref|XP_425157.2| PREDICTED: similar to type VII collagen [Gallus gallus] Length = 1645 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 51/189 (26%), Gaps = 21/189 (11%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ L + ++ L V + Y + + S V + + Sbjct: 875 DSSSDADTVRTLLSNTVTAMGRLGPDGTQVGLATYSYRSLPWLLLNRSSELPAVLEQIRT 934 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 A+ A L ++ L DG + Sbjct: 935 MRYEEPSGN-AIGAAITFARTYL-------------LSPGAGRRPSVPAVLVVLADGPSG 980 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + I AK +++ +++ Q L+ VS + + N SL + Sbjct: 981 DD-----AITAARDAKAAGVRV--LAVGLEGADQEQLRRMVSGEDPRFVFRNRGSLSELE 1033 Query: 403 QNISQLMVH 411 ++ + Sbjct: 1034 GELTDDLCT 1042 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 36/141 (25%), Gaps = 21/141 (14%) Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 S S T + ++ Sbjct: 2 CPDVPRVEFGFSQHINGTGIQKAIQQLSYKGGNTRTGAGLRYISDNFF------------ 49 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 G K I +TDG++ + + + K IK+ + I + + Q L Sbjct: 50 --GPTQLRPGVPKICILITDGKSQDD-----AEQPALRLKSLGIKVFAVGI-KNADRQEL 101 Query: 379 LKTCVS-SPEYHYNVVNADSL 398 ++ + + + V + L Sbjct: 102 IRVASLPTDSFFFYVGDFKLL 122 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 52/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + + K + Sbjct: 313 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAAIRNIRYM 372 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 373 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 413 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I + ++ + A + LK S P + + L + ++ Sbjct: 414 -VQGPAAAAHDAGITVFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 469 Query: 406 SQLMV 410 + + Sbjct: 470 IRGIC 474 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 52/162 (32%), Gaps = 28/162 (17%) Query: 246 SHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +G++ + T+V S +++ + R T + A+K+ Sbjct: 421 PISDNATRVGIVQFAGKTKVRVLANFSQNKSQLKTIIDR--SPFYSGTTFTNQALKKMAA 478 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K + ++ TDG + S + K + Sbjct: 479 LYEESK----------------RPNAKLKLMLFTDGYSAEDTSEGE-----EALKSQGVV 517 Query: 364 IVTISINASPNGQRLLKT---CVSSPEYHYNVVNADSLIHVF 402 + T+ I+ + +K +S E++Y+ + L+ F Sbjct: 518 VYTVGISTDKSAGLNMKELRGMATSSEHYYDSSDFADLLKHF 559 >gi|198438279|ref|XP_002126427.1| PREDICTED: similar to LOC779593 protein [Ciona intestinalis] Length = 1012 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 67/214 (31%), Gaps = 42/214 (19%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSWGTEKVRQYVTRD 283 K+ +K AL L S++ H ++ + E I + T + +T Sbjct: 343 KLKQVKQALADSLRSLNNEDHFN------IVVFGDTAEPWISGVLSTASTRSINDAITYV 396 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ------------- 330 T+ A++ A+ I+ + P Sbjct: 397 DAVSARGGTNMLVALQTAFAIMEPYLPSLPENETMVEDTTPFPTPVPLQPETNHFIRKRA 456 Query: 331 -----------KFIIFLTDGENNNFKSNVNTIKICDKA-KENF--IKIVTISINASPNGQ 376 K I+FLTDG K +V T I + K N + + TI + + + Sbjct: 457 TETQTELSNYAKMIVFLTDGRPT--KDDVGTDDIASRIEKINGGRVNLHTIGFGSLVDMR 514 Query: 377 RLLKTCVSSPE---YHYNVVNAD-SLIHVFQNIS 406 L K + + ++A + H F +S Sbjct: 515 FLEKLAALNGGVSRRVFESLDAATQIRHFFDEVS 548 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L +G++ Y+++V+ K Q V + + + T + Sbjct: 43 KFMIDIIHELDIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 101 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 102 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 147 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 148 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 187 >gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 [Sus scrofa] Length = 889 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 90/309 (29%), Gaps = 27/309 (8%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD-VMTSYDYRLQFIEHLLNQR 168 +A + + + +++ + N L S T ++ S+ L Sbjct: 185 TADIFEPQGISTLDAEASFI----TNDLLGSALTKSFSGKKGHVSFKPSLDQQRSCPTCT 240 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + FI + ++ + V + KM Sbjct: 241 DSLLKGDFIITYDVNRESPANVQIVNGYFVHFFAPQGLPVVPKNVVFVIDVSGSMYGRKM 300 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ALL LD I ++ ++ G + + + + +K R++V D Sbjct: 301 EQTRDALLKILDDIKEDDYLNFVLFSGDVTTWKDSLVQATP--ENIQKAREFVRNIRDQ- 357 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + +L ++ II LTDG+ N S Sbjct: 358 --GMTNINDGLLTGISMLNKAREEHKVPE-----------RSTSIIIMLTDGDANMGVSK 404 Query: 348 VNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVF 402 I+ A + + + N L + + Y +A+ L + Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFY 464 Query: 403 QNISQLMVH 411 + ++ ++ Sbjct: 465 EEVANPLLT 473 >gi|323320820|gb|ADX36428.1| complement factor B [Apostichopus japonicus] Length = 913 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/253 (9%), Positives = 67/253 (26%), Gaps = 40/253 (15%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + ++ I L D ++ + K Sbjct: 433 AVSNETTNTNV-DQTARFIPLSHDTGLDLYFMFDGSSSVGEDN----------FNMAKKF 481 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTD 293 + I + + +G + + + E + + + T+ Sbjct: 482 AKELVKEIGVTD-RPNSLRVGALVFNSEAEIGFH-TVAFDSTADVLDAIDSMEYKEGGTN 539 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A + ++ K+ K + +TDG+ + Sbjct: 540 IAKAFEVLSNVMIPQT------------AKLNREKSFKTVFLITDGDATEGGD---AQED 584 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL--- 408 ++ + I I I+ + + L S P E+ + + + +L Q ++ Sbjct: 585 ARAVRDQDVTIHCIGISENATRRT-LSGMASEPLSEHLFFLKDYSTLEEFIQIVTNQTID 643 Query: 409 -----MVHRKYSV 416 + R+ S+ Sbjct: 644 YSECGVSPRRVSI 656 >gi|297667864|ref|XP_002812183.1| PREDICTED: vitrin-like isoform 4 [Pongo abelii] Length = 656 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 489 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 549 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 589 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 590 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 43.3 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 22/212 (10%), Positives = 56/212 (26%), Gaps = 29/212 (13%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KM 228 ++S L+ + S+ DP D + + Sbjct: 228 LLSTATYTSSQNRPGADPGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRF 287 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDS 286 K L ++D+ MG++ Y + + ++ + + Sbjct: 288 RIQKQLLAAVAQALDIGPAGP---LMGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQR 344 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 ++ A+ + S + ++ + DG + Sbjct: 345 --GGLSNVGRAISFVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD--- 386 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + A+E+ I I I+I + ++ Sbjct: 387 --KVEEASRLARESGINIFFITIEGAAENEKQ 416 >gi|187939945|gb|ACD39081.1| hypothetical protein PACL_0293 [Pseudomonas aeruginosa] Length = 223 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 65/199 (32%), Gaps = 20/199 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKV 276 K+ AL L + S S ++ ++ + +I + E N+ + Sbjct: 23 TSGSMSVDGKIEALNKGLKDMISSFAGESRLRAEIQVSVITFGGSLAELNLPLTPAH--- 79 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T A+ A +++ IPS ++ I+ + Sbjct: 80 --QLQSFTPLVAEGMTPLGGALSLASEMIEDK-------------DSIPSRAYKPVIVLV 124 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG N+ ++ + + + ++I A + L ++ NA Sbjct: 125 SDGYPNDDWQ-GPFARLVNGERSSKATRFAMAIGADADEVMLSDFANDPEAPLFHAENAR 183 Query: 397 SLIHVFQNISQLMVHRKYS 415 + F+ ++ + R S Sbjct: 184 DIHRFFRAVTMSVSARSQS 202 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 69/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 118 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 170 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 171 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 212 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + + ++ + A+ +KI +I I +S + L++ Sbjct: 213 SDEIXK--DQVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 270 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 271 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 309 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 58/164 (35%), Gaps = 24/164 (14%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRS 312 G+ + V + + + + + + T+ A+++A Sbjct: 66 GIWTFGQYVNMLVPHKPVDAQWGRTASAASSEIKSIAQFTNIGAALEKAAY--------- 116 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFIKI 364 + +Q +I LTDG + + N K + +++ I + Sbjct: 117 -----DHKQQMKADQDYQTHVILLTDGMVDIDRDNRLNKKERQRILNNVLPMYQQSGITL 171 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 TI+++ + + + L K +++ NA+ L++VF + Sbjct: 172 HTIALSDNADKKLLNKLALATDGKVSVAKNAEELMNVFLRVFNQ 215 >gi|303246237|ref|ZP_07332517.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] gi|302492300|gb|EFL52172.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] Length = 382 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 52/145 (35%), Gaps = 5/145 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + RF + + + ++I ALS++ + YK++ +++A +AA LA Sbjct: 1 MIRFTETRRPPA--QAGSVAVIVALSMIVLAGFATLAVDYGFLEYKRSQLQNAADAAALA 58 Query: 64 GASKMVS-NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 GAS +V ++ ++ D+K + + S Q Sbjct: 59 GASVLVQYGANQEAVTDTAVLYGQANLNDSDSKEMAIRNSDVTYPDAVSVRATVGRTQER 118 Query: 122 VNSSRISMT-HMANNRLDSSNNTIF 145 N + + + N D + + Sbjct: 119 GNPVEMFLGRILGWNTQDIAATGVA 143 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 19/151 (12%) Query: 235 LLLFLDSIDLLSH-VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKP 291 + F+ + +DV +GL+ + T + S + + Sbjct: 66 VRSFISNFAGTMPLSPDDVRVGLVTFGTSAVTRWDLSDSRAQNADLLAAAAKKLPYAAGS 125 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + +A +IL S +K K I+ +TDG T+ Sbjct: 126 TYTHLGLAKAEEILFSFQKGGRDNA-------------PKMILVMTDG---ASSRRSQTL 169 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTC 382 +K + + IV + + N Sbjct: 170 SAAEKLRNRGVIIVVLGVGTGVNSAECRSIA 200 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 57/169 (33%), Gaps = 27/169 (15%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDST 295 + L+ +++ +G++ + + + + + +T ++ T Sbjct: 2 NQAAELYLIQIIEKGSLVGMVTFESLATILNYLTDIIGDNAYEKITANLPREASGGTSIC 61 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q + + + + II LTDGE++ I C Sbjct: 62 SGLRAGFQAIIHSNQSTSGSE----------------IILLTDGEDD-------QISSCF 98 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVF 402 ++ K++ I TI++ + L + Y + + L F Sbjct: 99 EEVKQSGAVIHTIALG-PSAARELETLSNMTGGYRFYANKDINGLTDAF 146 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 65/194 (33%), Gaps = 30/194 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 60 DKVKSFTKRFIDNLKDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K+++ +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEELLVGGSHLKEN---------------KYLVVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLMVH 411 + ISQ + Sbjct: 221 SRDAEEVISQTIET 234 >gi|51598434|ref|YP_072622.1| hypothetical protein BG0172 [Borrelia garinii PBi] gi|51573005|gb|AAU07030.1| hypothetical protein BG0172 [Borrelia garinii PBi] Length = 333 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + + + + Sbjct: 110 SSKNRLEFSKELIRRFISQ-------RENDNIGLVAFAKDASIVVPITTDRDFFNKKLDD 162 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 163 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 204 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + A+ +KI +I I +S K Y SL ++ Sbjct: 205 --SDEIYKDQVINLAQGLNVKIYSIGIGSSEEFSVEFKL---RSGKFYQ----GSLKEIY 255 Query: 403 Q-----NISQLMVHRKYSV 416 IS YSV Sbjct: 256 DPSMLVEISHKTGGLFYSV 274 >gi|73973310|ref|XP_867431.1| PREDICTED: similar to alpha 1 type XII collagen short isoform precursor isoform 2 [Canis familiaris] Length = 1901 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K + Sbjct: 1175 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKAQALGALQNIRY 1234 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1235 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1277 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1278 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1330 Query: 406 SQLMVH 411 ++ Sbjct: 1331 EDNLIT 1336 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGI--KNADEDELKMIATDPDDTHAYNVADFESLSKIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|311273688|ref|XP_003133992.1| PREDICTED: integrin alpha-2 [Sus scrofa] Length = 1186 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/283 (14%), Positives = 76/283 (26%), Gaps = 31/283 (10%) Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 S++ + M S L + + G + + Sbjct: 104 TSTSISNVTEMKTNMSLGLTLTRNAGTGGFLTCGPLWAQQCGSQYYTTGVCSDVSPDFQL 163 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 S S DV + A+KN L F+ + +GLI Sbjct: 164 LTSFSPAVQACPSLIDVVVVCDESNSIY-PWDAVKNFLEKFVQGL---DIGPTKTQVGLI 219 Query: 258 GYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y + K T T++ A++ A S Sbjct: 220 QYANNPRVVFNLNTFKTKAEMVEATSHTTQYGGDLTNTFKAIQYARDSAYSAAAG----- 274 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 275 --------GRPGATKVMVVVTDGESHDGSMLKAVIDQCN---NDNILRFGIAVLGYLNRN 323 Query: 377 RL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + L+ + + + Sbjct: 324 ALDTKNLIKEIKAIASIPTERYFFNVSDEADLLEKAGTLGEQI 366 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/352 (11%), Positives = 103/352 (29%), Gaps = 46/352 (13%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHY-------------KKNSMESANNAAILAGASKMVS 70 +I A+ +MS + LI +I+ + + SM+ + + S + Sbjct: 14 MIMAILLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFIL 72 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + ++ + + S A ++ N K+ + + + Sbjct: 73 DDTKFKGSKQGLSREWSYIGLSADGKKVMNYVWNKQKQDWDVSELGTKSLYNMKLDLEFK 132 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + + N + + + ++ K I Sbjct: 133 TEGAYQDNRLISYNLTGKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGVAIAYRTDPI 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ I + V D SGSM+ + + ++M L+ ++ + + Sbjct: 193 QGQMNIA-VSFVFDTSGSMNWDLQGRETKKS-------GNESRMDILRKKSVIMIKDLAE 244 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ ++ + GT + + + + T+ ++ Sbjct: 245 IGNISVNLVGFSTSAKYIQQNFSNLDNGTNTIIATINKRENLNPDGVTNPGDGLRYGMIS 304 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 L S K+I+ LTDG N + + + ++ Sbjct: 305 LQS------------------QPAQLKYIVLLTDGIPNAYLVDSRALYAGNR 338 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/339 (8%), Positives = 89/339 (26%), Gaps = 28/339 (8%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ + + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLTGKYPDTNNKLGIDTAISALNTKQVFSKVAKGKKGVAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-SGSM 203 + + + + + + G I + Sbjct: 191 PIQGQMNIAVSFVFDTSGSMNWDLQGR---------ETKKSGNESRMDILRKKSVIMIKD 241 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + + ++ + + L N + +I+ ++ D G+ + Sbjct: 242 LAEIGNISVNLVGFSTSAKYIQQNFSNLDNGTNTIIATINKRENLNPD---GVTNPGDGL 298 Query: 264 -EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI-LTSDKKRSFFTNFFRQG 321 I +++ Y+ D + + A+ ++ L+ R F N Sbjct: 299 RYGMISLQSQPAQLK-YIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDL 357 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI---KIVTISI-NASP---N 374 + + L ++ + + +K+ + ++ I + Sbjct: 358 SPTLGYEYIRLGYDL--YSRDSITRENSIAYAGEVSKKFGLGVKRVNVIGFSGVNHEIAY 415 Query: 375 GQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 GQ L + + N ++L F +I + + Sbjct: 416 GQSLTDRIGEGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|301758046|ref|XP_002914871.1| PREDICTED: LOW QUALITY PROTEIN: vitrin-like [Ailuropoda melanoleuca] Length = 686 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + + D +G + YT K Sbjct: 519 FRTVLQFVANISKEFEISETDTRVGAVQYTYEQRLEFGFDDYHTKSDILNAIKRVGYWSG 578 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + Sbjct: 579 GTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGRSYDDVRIPAM 622 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A + I + + Q L+ + P ++ + V D+L + + Sbjct: 623 V-----AHHKGVTTYAIGV--AWAAQDELEVIATHPASDHSFFVDEFDNLYKFVPKVIRN 675 Query: 409 MVHR 412 + Sbjct: 676 ICTE 679 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 14/145 (9%), Positives = 38/145 (26%), Gaps = 22/145 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D L MG++ Y + ++ + + ++ Sbjct: 322 QFLADVAQTLDIGPGGPLMGVVQYGDNPAAQFSLRTHMNSRDLKTAIDKITQR--GGLSN 379 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + S + + + DG + + Sbjct: 380 AGRAISYVTKNFFSKANGNRGGA-------------PNVAVVIVDGWPTD-----KVEEA 421 Query: 354 CDKAKENFIKIVTISINASPNGQRL 378 A+E+ I I I++ + ++ Sbjct: 422 SRLARESGINIFFITVEGATENEKQ 446 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/334 (11%), Positives = 77/334 (23%), Gaps = 54/334 (16%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + + S+ + N R+ +T R+D + Sbjct: 27 AFGADNGSSTAALNFTITSSQTEYAKPPNADAQGRLDITLTPQGRVD-------NIIRPP 79 Query: 152 TSYDYRLQFIEHLLN-----QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + ++ + S + L+ D+ S Sbjct: 80 IDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPNDRFALVPFSDDVQYSKVVP 139 Query: 207 MNSDPEDVNSA---------PICQDKKRTKMAALKNALLLFLDSIDL-----LSHVKEDV 252 S DV + + AL+ A F D L+ V Sbjct: 140 FPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQAQSFFNDPARKKYIIFLTDGMPTV 199 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + T +V + I +V + + T A + + Sbjct: 200 SIAKEPITYKVCEGILFWKTCNQVTENLNVQYILYSNGIT--------AARTIY------ 245 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 P P K K ++ + N I + +I + Sbjct: 246 -----------YPDHPETKTYSDREKYREFEEKIRLHGTNVAKTLGMNNITLYSIGFGNN 294 Query: 373 P--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + L K ++ +L +FQ Sbjct: 295 QEVDMGYLEKLSSTAGGQAKKG-TPQNLTEIFQQ 327 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 60/220 (27%), Gaps = 35/220 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + C I T L+ F +++ + V +GL Sbjct: 1385 GENCDHVCNTTISGRLDIQIVIDTSGSLTSAPNKDQVLMNFTNNLANMYDTINQVKIGLT 1444 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 ++ + ++ + T+ T ++ A + Sbjct: 1445 SFSESSVLEMPL-DFYNQLELQDGVSNMTWQGSFTNITSGVETALNDMD----------- 1492 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 S +I +TDG + ++ D+AK + ++++ + Sbjct: 1493 -------TSDAVDDVMILITDGFQ--STNTTLMFQMIDQAKADGVRLIALGFFGD----- 1538 Query: 378 LLKTCVSSPEYH------YNVVNADSLIHVFQNISQLMVH 411 SP + Y+ N L+ + I + + Sbjct: 1539 ---FAFYSPNLYLMTNEVYHAANYAELLAIDNTIFETICS 1575 >gi|307323133|ref|ZP_07602343.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890622|gb|EFN21598.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 422 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 70/242 (28%), Gaps = 48/242 (19%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 P +ELV+D+SGSM T+MAA K A Sbjct: 23 ISPPAAMAGEPAPDSPKVELVLDVSGSMRARDVDGD--------------TRMAAAKQAF 68 Query: 236 LLFLD------SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLI 288 LD + + + + E+ + + V Sbjct: 69 NEVLDATPEEVRLGIRTLGANYPGKDRVAGCRDSEQLYPVGQVDRTEAKAAVA---TLRP 125 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T A++ A + L+S + + I+ +TDGE++ + + Sbjct: 126 TGWTPIGLALRGASKDLSSGEGT-------------------RRIVLITDGEDSCGQPDP 166 Query: 349 NTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNI 405 + + + T+ + + L ++ + + + D L + + Sbjct: 167 C--DVARELAAQGTHLVVDTLGLTLDRKVREQLSCIAEATGGTYTAIQHRDQLSSRIKQL 224 Query: 406 SQ 407 + Sbjct: 225 VR 226 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 49/190 (25%), Gaps = 34/190 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---------R 277 KM A K A+ +LD I + + E + S G+ +V Sbjct: 146 KMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKE 202 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q+ T A++ + ++ Sbjct: 203 QFNAALSKFGPKGWTPLASAIESVNDDFKE----------------YTGEENLNVVYIVS 246 Query: 338 DGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKT-CVSSPEYHYNVVN 394 DGE + ++ + I + + Q+ LK + + V + Sbjct: 247 DGEETCGGD---PVNAAKNLNQSNTHAVVNIIGFDVKNSEQQQLKYTAEAGKGNYATVSS 303 Query: 395 ADSLIHVFQN 404 AD L Sbjct: 304 ADELHQTLNK 313 >gi|296118874|ref|ZP_06837447.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] gi|295967972|gb|EFG81224.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] Length = 674 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 53/189 (28%), Gaps = 37/189 (19%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLS--------HVKEDVYMGLIGYTTRVEKNIEPSW-G 272 T++ A K A ++S+ + + ++ Sbjct: 73 VDGGTRLDAAKQAANQLVNSLPETAVMGMLAYGASGSNAPDNRERGCQDIDVLAPVERID 132 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E+++ + T A++ A L S+ + Sbjct: 133 NEELKSEIGAL---EAQGYTPMGNALRAAADELGSEGD--------------------RS 169 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLK-TCVSSPEYH 389 II ++DG + ++ ++ + I T+ Q L+ +S + Sbjct: 170 IILVSDGIDTCAPPPAC--EVAEELAGDGFDLAIHTVGFKPDEAAQAELECISEASGGTY 227 Query: 390 YNVVNADSL 398 NA+ L Sbjct: 228 VEAENAEEL 236 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 72/237 (30%), Gaps = 53/237 (22%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + ++S P + V+D+SGSM+ Sbjct: 294 PGKDRGTFMLSITPGDDLPPITT--GSDWVFVLDISGSMNA------------------- 332 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTR 282 K+A L + + L + + ++ + R E+ +RQY + Sbjct: 333 --KLATLGDGVRQALGKLRGNDRFR------IVLFDDRAEELTSGFVDATPNNIRQYTQK 384 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 M T+ + A L +D+ I+ +TDG N Sbjct: 385 IMQLQSRGGTNLFGGLSLALTPLDADRPTG--------------------IVLVTDGVAN 424 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K D + + +++ T + S N L +S + +V N+D + Sbjct: 425 VGK--TRQKDFIDLLENHDVRLFTFVMGNSANRPMLTAMTDASNGFAISVSNSDDIA 479 >gi|108761783|ref|YP_632825.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] gi|108465663|gb|ABF90848.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] Length = 694 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 5/55 (9%), Positives = 24/55 (43%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 F ++ ++ ++ AL V+ + + + + +++ +++ ++A Sbjct: 2 FTRTLRQSFRRQEGQALVLAALMVLIMSIAVLTTVNIGHTVHERIRLQNTADSAS 56 >gi|302406604|ref|XP_003001138.1| von Willebrand and RING finger domain-containing protein [Verticillium albo-atrum VaMs.102] gi|261360396|gb|EEY22824.1| von Willebrand and RING finger domain-containing protein [Verticillium albo-atrum VaMs.102] Length = 872 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/259 (12%), Positives = 75/259 (28%), Gaps = 28/259 (10%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 QR + VS + + S + ++ + V PI + Sbjct: 318 DWQRSRPQRVSSVASSWGGAKSATTAPTEYTNFQRSPGLPALVHVPIDVVVVVPISSSMQ 377 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K+ +++AL ++++ MGL+ + + T K + Sbjct: 378 GVKINLVRDALKFMINTLGDRD------RMGLVTFGSGGGGV-PLVGMTTKAWPGWNNIL 430 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 ++ S A + + K + I+ ++D ++ Sbjct: 431 SAIKPIGQKSHRA---------DVVEGANVAMDLLMQRKHNNPIAT--IMLISDASTSDA 479 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH---- 400 S + + AK I I + + + +++ + + V + L Sbjct: 480 DSVDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSSRTKASYTYVKDWMMLRECLAG 536 Query: 401 ---VFQNISQLMVHRKYSV 416 Q +S V K + Sbjct: 537 CLGAMQTLSHQNVKLKLKL 555 >gi|73958268|ref|XP_848527.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 2 [Canis familiaris] Length = 1153 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T +K + +D L+ ++ + + + + + Sbjct: 163 NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSFLVK 217 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +L T + +++ + L + K ++ +TDGE Sbjct: 218 SIKQLLGRTHTATGIRKVVRELFHSSSGARENAL-------------KILVVITDGEKYG 264 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVSSP--EYHYNVVNADSL 398 + + +A I I + + N + L S P ++ + V N ++L Sbjct: 265 DPLD--YKDVIPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KTIQNQLQEKI 333 >gi|73958266|ref|XP_856286.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 3 [Canis familiaris] Length = 789 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T +K + +D L+ ++ + + + + + Sbjct: 163 NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSFLVK 217 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +L T + +++ + L + K ++ +TDGE Sbjct: 218 SIKQLLGRTHTATGIRKVVRELFHSSSGARENAL-------------KILVVITDGEKYG 264 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVSSP--EYHYNVVNADSL 398 + + +A I I + + N + L S P ++ + V N ++L Sbjct: 265 DPLD--YKDVIPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KTIQNQLQEKI 333 >gi|73958270|ref|XP_856370.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 [Canis familiaris] Length = 1036 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T +K + +D L+ ++ + + + + + Sbjct: 163 NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSFLVK 217 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +L T + +++ + L + K ++ +TDGE Sbjct: 218 SIKQLLGRTHTATGIRKVVRELFHSSSGARENAL-------------KILVVITDGEKYG 264 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVSSP--EYHYNVVNADSL 398 + + +A I I + + N + L S P ++ + V N ++L Sbjct: 265 DPLD--YKDVIPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KTIQNQLQEKI 333 >gi|73958264|ref|XP_547048.2| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 1 [Canis familiaris] Length = 1165 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 58/191 (30%), Gaps = 25/191 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T +K + +D L+ ++ + + + + + Sbjct: 163 NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSFLVK 217 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 +L T + +++ + L + K ++ +TDGE Sbjct: 218 SIKQLLGRTHTATGIRKVVRELFHSSSGARENAL-------------KILVVITDGEKYG 264 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVSSP--EYHYNVVNADSL 398 + + +A I I + + N + L S P ++ + V N ++L Sbjct: 265 DPLD--YKDVIPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEAL 322 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 323 KTIQNQLQEKI 333 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 73/238 (30%), Gaps = 42/238 (17%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S A +ELV+D+SGSM R++++ + A Sbjct: 24 SPAGADDETTEPVAEPPKVELVLDVSGSMRARD--------------IDGRSRISVAQQA 69 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +D++ + + V ++ G Q Sbjct: 70 FNEVVDALPDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKAAVAGLRPTG 129 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A Q L + + I+ +TDGE+ + Sbjct: 130 FTPVGLALRSAAQDLGTGS-------------------TARRIVLITDGEDTCAPPDPC- 169 Query: 351 IKICDKAKENFIKIVTISINASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ + ++V ++ +P+ ++LL ++ + +AD L + + Sbjct: 170 -EVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADELTGRIKQL 226 >gi|315498270|ref|YP_004087074.1| hypothetical protein Astex_1248 [Asticcacaulis excentricus CB 48] gi|315416282|gb|ADU12923.1| hypothetical protein Astex_1248 [Asticcacaulis excentricus CB 48] Length = 409 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 45/354 (12%), Positives = 109/354 (30%), Gaps = 25/354 (7%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L++R R Y + N ++ FALS+++ +G + + + + ++ A +A++L Sbjct: 14 LVTRLRHYVS----HKNGNTTLAFALSLVAVAGAVGGAVDMHAAYSVRAQLQDAMDASVL 69 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 G + + D I A + +++ + + + + Sbjct: 70 GG---LRQEQVQRSDVATRIFKSNLPASMSVTPKYVTDETTLTGTAVYDSPTVLLGLLGI 126 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 +S +I++T A + S I L + ++ ++ Sbjct: 127 SSLKINVTATAIGKASSQGPCIQ-----------VLDPSGSQALLVNSGAKITAPNCEIQ 175 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNS----DPEDVNSAPICQDKKRTKMAALKNALLLF 238 + P + L+ C S + V+ + + AAL + Sbjct: 176 VRSKGNPAAIFNSGSSLNFKKVCIEGSTVIRNSTTVDRLSLSCPTVADQWAALIPTVAST 235 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ ++ +V + Y N + + YV R + T + + Sbjct: 236 PCTVSNGNYNTSNVSLTPGVYCGWFNFNNSSATVSFAPGLYVIRSGGWNVNGGTWTGKGV 295 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 K F + + ++ + F G +N N + Sbjct: 296 TF---YFEDTSKIQFNSGISADLTAPDTGTYKNLLFFEKPGLSNTDFIFNNAVA 346 >gi|149634622|ref|XP_001513644.1| PREDICTED: similar to AMACO [Ornithorhynchus anatinus] Length = 801 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 18/191 (9%), Positives = 51/191 (26%), Gaps = 24/191 (12%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + D L + V +G + ++ + Sbjct: 81 FETAKYFVVKLCDALDINPDRVRVGAVQFSVASWLEFPLDSCLTRQEAKDKIKKIVFRGG 140 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T++ A+K + + + +I +TDG++ + Sbjct: 141 STETGLALKYILWKGFPGGR---------------NASVPQILILITDGKSQGNVTVP-- 183 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + K+ I + + + + L + ++ + D + + + Sbjct: 184 ---AQQLKDRGITVFAVGVRFPRWEELHLLASEPNEQHVLFAEDVD---DAYNGLMSTLT 237 Query: 411 HRKY-SVILKG 420 S + G Sbjct: 238 SSTVCSAVSPG 248 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 70/227 (30%), Gaps = 42/227 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +ELV+D+SGSM R++++ + A +D++ Sbjct: 41 TPVEPPKVELVLDVSGSMRATD--------------IDGRSRISVAQQAFNEVVDALPDE 86 Query: 246 SHVKEDVYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V + G Q T A++ A Sbjct: 87 TELGIRVLGATYPGDDKEQGCQDTQQIVPVGPVDRVQAKAAVATLRPTGYTPVGLALRSA 146 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L + + I+ +TDGE+ + ++ + Sbjct: 147 AEDLGTGS-------------------TARRIVLITDGEDTCAPPDPC--EVARELAAQG 185 Query: 362 IKIVTISINASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K+V ++ +P+ Q+LL ++ + +AD L + + Sbjct: 186 TKLVVDTLGLAPDEKVRQQLLCIAGATGGTYTAAQSADELTGRIKQL 232 >gi|291575288|gb|ADE10213.1| hypothetical protein [Actinoplanes liguriensis] Length = 432 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 53/164 (32%), Gaps = 17/164 (10%) Query: 11 FKKGI------ASEKANFSIIFALSVM--SFLLLIGFLIYVLDWHYKKNSMESANNAA-- 60 ++ I E + + A+ L + +I + + ++ ++S +AA Sbjct: 1 MRRLIHALFPPRGEHGVITALVAVLAGAGVLLGMAALVIDIGALYAEREQLQSGADAASW 60 Query: 61 --ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A A +L+ + ++A+R +AK + + ++ ++ SA T Sbjct: 61 KVAQACAGTAGRDLTSATCTVAAQRDNAQRYADRNAKDLV-SDVQFCITTVSAAGVTTAD 119 Query: 119 QNIVNSSRISMTHMANNRLDSSNNT----IFYNMDVMTSYDYRL 158 +S +T A TS L Sbjct: 120 AGCPSSWNTPVTCPAPPSASGPYRYVEVRTSTRNSDNTSVVPPL 163 >gi|297667860|ref|XP_002812181.1| PREDICTED: vitrin-like isoform 2 [Pongo abelii] Length = 678 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 511 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 570 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 571 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 611 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 612 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 667 Query: 409 MVHR 412 + Sbjct: 668 ICTE 671 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 14/139 (10%), Positives = 38/139 (27%), Gaps = 22/139 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 L MG++ Y + + ++ + + ++ A+ Sbjct: 320 AQALDIGPAGPLMGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAIS 377 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + S + ++ + DG + + A+E Sbjct: 378 FVTKNFFSKANGN-------------RSSAPNVVVVMVDGWPTD-----KVEEASRLARE 419 Query: 360 NFIKIVTISINASPNGQRL 378 + I I I+I + ++ Sbjct: 420 SGINIFFITIEGAAENEKQ 438 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/300 (9%), Positives = 78/300 (26%), Gaps = 10/300 (3%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 R D N + + + +A ++ +++ + + R + + Sbjct: 2966 SRLPTDGTLTLNGNPVTVNTAVSAADIAAGKLVFTPSANGLDTSIGFQVRDNGGTDHGGQ 3025 Query: 147 NMDVMTSYDYRLQF--IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 N ++ + V + I +G++ ++ + + ++ Sbjct: 3026 NTSGTYNFVLNTDNIVTGENVGSGTGNTPVLNGGSGNDIILGDKGGTVVTVEPGKNYNIA 3085 Query: 205 CA-MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 S ++++A +K+AL + + + +G Sbjct: 3086 LVVDTSGSMAYKLDGSTNGSGQSRIALVKDALTNLANQLVGHDGIVNVTLIGFATTAGTP 3145 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + V+ +T + T+ A A S + V Sbjct: 3146 VTLQNLTS--ANVQTLLTAITNLSATGGTNYEAAFNSAVSWFNSQTAAGKSVAAGYENVT 3203 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE----NFI-KIVTISINASPNGQRL 378 +G S + + + + + + I I + N L Sbjct: 3204 FFLTDGDPTYYLKNNGNVGGDGSTTDMTTVQESVNAFAPLSGVSTVHGIGIGSGVNEDYL 3263 >gi|327541351|gb|EGF27892.1| hypothetical protein RBWH47_01135 [Rhodopirellula baltica WH47] Length = 384 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 21/55 (38%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 S + I + ++ ++ + + + + + ++A LAGA + Sbjct: 18 SRRGAAGIFGLILCVALFTVLAVVSEFAYINTARTELSRSADSAALAGAWALYDA 72 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPATQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 ISKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|326332117|ref|ZP_08198401.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325950088|gb|EGD42144.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 359 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 51/205 (24%), Gaps = 17/205 (8%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + +C D + + + L + L+ ++ Sbjct: 92 NRYSTRDIVLCLDASTSMFDEDTQVTQAYSQLVAGL----NGERVSLVLWSDAAVTVFPL 147 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP--SL 327 + V+ + R + L + L ++ + + Sbjct: 148 TDDYGWVQDELARAGRAFALGDQEYVAG-----TYLGKERASLISDGIVSCVKRFDLSNP 202 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-----INASPNGQRLLKTC 382 + +I +D + + + + AKE +++ I K Sbjct: 203 ERGRAVIVASDNDPQGGPPVYSLPEATEFAKERDVQLYGIGSADLSFQEDKRA-TFDKAM 261 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQ 407 + + + S+ + I + Sbjct: 262 TDTGGTFSLLGDDGSIESILAGIDR 286 >gi|146162766|ref|XP_001010021.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146146274|gb|EAR89776.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 783 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 42/309 (13%), Positives = 80/309 (25%), Gaps = 92/309 (29%) Query: 167 QRYNQKIVSFIPALLRIEMGERP---IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 Q + S P + + V+DLS SM Sbjct: 58 QNDDDLTKSCTPTSSQASCNIDIDRQPLDLIFVIDLSISMR------------------- 98 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 KM LK + ++ ++ M LIG+ + S T++ ++ VT+ Sbjct: 99 -GKKMNQLKKTICNLINFLNEND------RMALIGFNNSAQNLFPLSHLTQQNKKKVTQI 151 Query: 284 MDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF------------- 329 ++S L + T+ T M +A + L S +G + Sbjct: 152 LNSILPMGLTNITAGMMEAIKQLESSLINISTVAETGEGEEDEKDTHAQKEKRMQQEMQT 211 Query: 330 -----------QKFIIFLTDGENNNFKSNVN--------------------TIKICD--- 355 + + + SN CD Sbjct: 212 EENKEKYKDLLSRSEVIKPQNAIDYDLSNSCINFGQKKEERVKSIFLLSDGLDDRCDKSL 271 Query: 356 ---KAKENFIKI----VTISINASPNGQRLLKTCVSSPEYHYNVVN--------ADSLIH 400 K + + I T + + + Y + + AD+L Sbjct: 272 EMLKVRIQSVPIQFSLNTFGYGEQHDAYLMNEIASYKEGNFYYIEDIDKTDLYFADALGG 331 Query: 401 VFQNISQLM 409 +F + + Sbjct: 332 LFSVVGNQL 340 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPATQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|75812633|ref|YP_320251.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705389|gb|ABA25062.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 464 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 70/254 (27%), Gaps = 42/254 (16%) Query: 154 YDYRLQFI-----EHLLNQRYNQKIVSFIP--ALLRIEMGERPIFLIELVVDLSGSMHCA 206 L E + + QK+ + + + RP V+D SGSM+ Sbjct: 1 MSNVLNVSITPHREFMPAETEGQKLFLMLKLRPTKEVAV-SRPPTTFAFVIDTSGSMYEI 59 Query: 207 MNSDPEDVNSAPICQDKKRT-------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + + K+ + K+ + +LL + ++ + ++ + Sbjct: 60 VTGETTPTGVTYTQDAKEYSQVTGGKSKIDIVIESLLALVR----SGRLEASDRVAIVQF 115 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + I+ + T +V Q T +++A +L+ Sbjct: 116 DDTASQIIDLTPAT-QVSQLENAIAQLRSFSGGTRMGLGLRRALDMLS------------ 162 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + TDG+ + I I + + L Sbjct: 163 ------GQDMAVRRTLLFTDGQT---FDEDICRALASDFATKNIPITALGVGEDFKEDLL 213 Query: 379 LKTCVSSPEYHYNV 392 S+ + V Sbjct: 214 SHLSDSTGGTLFYV 227 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLILTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPATQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DIGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|89070480|ref|ZP_01157773.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] gi|89043884|gb|EAR50075.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] Length = 432 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 3/43 (6%), Positives = 15/43 (34%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME 54 ++ + E ++ + L + + V + + ++ Sbjct: 5 RRLLRREDGAATVFALFLFILCLTVGAVALDVSHAYAARTRLQ 47 >gi|259415853|ref|ZP_05739773.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259347292|gb|EEW59069.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 182 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 26/58 (44%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 M + +K +E N +I FA+ L+++ + + +Y++ ++ A + Sbjct: 1 MIYRTSIPALVQKFRRNEDGNATIEFAILFPLMLMMLFAAVELGMINYRQIMLDRAMD 58 >gi|222150614|ref|YP_002559767.1| hypothetical protein MCCL_0364 [Macrococcus caseolyticus JCSC5402] gi|222119736|dbj|BAH17071.1| hypothetical protein [Macrococcus caseolyticus JCSC5402] Length = 1102 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/292 (11%), Positives = 77/292 (26%), Gaps = 41/292 (14%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLL-----NQRYNQKIVSFIPALLRIEMGERPIF 191 +D + + + Q + + + Sbjct: 490 IDLKTQGRASVTQETVDIVLVVDNSASMGTIIGNGKTRWQSMKDDVFQFIDEVTQANTAA 549 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ +D+ ++ A PE + ++K + + Sbjct: 550 NTKIRIDV---VNFASALRPEINSGFSGDPAVIKSKF---------YPSYVPSGQGGGTF 597 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKK 310 GLI + +++ S ++KV +T +L K + + ++ Sbjct: 598 TQQGLISGSNKLDT----SRASKKVMVVLTDGAPTLSYKGVSATGSESITSFSNTIKGNG 653 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTIS 368 +F I + G+ + K + I +I Sbjct: 654 TNFKLYGSFDYNGIHDPYKVNGHEIINHGQPTVSQ--------AKIIKNSRPGFDIFSIG 705 Query: 369 IN--------ASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVH 411 I+ + +L S PEY + +A L + ISQ + + Sbjct: 706 IDTTVVGGGATEEDMNNVLNNIASRPEYAFTTKDSASELSKILSKISQSVSN 757 >gi|254416823|ref|ZP_05030572.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176369|gb|EDX71384.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 538 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/260 (12%), Positives = 69/260 (26%), Gaps = 53/260 (20%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 S ++ +++F + + R + LV+D SGSM Sbjct: 2 KASYTLSQSLIAADSPSTVD-----LLITFQGSESSQQTSSRRPLNLSLVLDRSGSMA-- 54 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + A +D + ++ ++ Y E Sbjct: 55 ------------------GAPLRYAIQAAQNLIDYLTADD------FVSVVIYDDTAEVI 90 Query: 267 IE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 I ++ + + T+ + + +++ Sbjct: 91 IPPQLVGDQAALKAKIGKI---RARGCTNLSGGWLLGCSQVQANQ--------------- 132 Query: 325 PSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 S ++ LTDG N K K + E I T+ N L+ Sbjct: 133 -SPERINRVLLLTDGLANYGIKDPQVLTKTALEKAEADIVTTTLGFGNYFNEDLLINMAN 191 Query: 384 SSPEYHYNVVNADSLIHVFQ 403 ++ Y + + D VF+ Sbjct: 192 AARGNFYFIQSPDDASQVFE 211 >gi|167535479|ref|XP_001749413.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772041|gb|EDQ85698.1| predicted protein [Monosiga brevicollis MX1] Length = 2014 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/385 (8%), Positives = 90/385 (23%), Gaps = 42/385 (10%) Query: 51 NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 ++++ + A + L G +F + L ++ +L Sbjct: 138 ANLKAVLDR---MAAHEAYQTLLDAGIKFHVSKAYEAPPLDAPTNSENLHYEGRTLDLKF 194 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM------TSYDYRLQFIEHL 164 + + + + + ++ + + V + L L Sbjct: 195 WRNNQVGMYPSLIDTVMQLAWISGADFVRNGDARIELAVVPNSCGAVLDVVFALDGSASL 254 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + + + + + D Sbjct: 255 DGSYQQSGGHVDLWDKQIQFVKDVTSYFSVSSEETRVGVMSFHGPLLSHYSYVEGIPDCP 314 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + + + + I V + + G ++R Sbjct: 315 QRSIWTFEGSHCVCYPDIGCTGPDSTCLEVDFAITPNGCENRSAGDYQQGKCEIRHAFPI 374 Query: 283 DMDS----------LILKPTDSTPAMKQ----AYQILTSDKKRSFFTNFFR--------- 319 + DS T ++ A L + + Sbjct: 375 NCDSCECVGGPIIYPNASWTLYQDLIETTDQYALNDLLDNVEFPAGATHLSWGLDFIDRE 434 Query: 320 -----QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G++ + + +I LTDG +N + K+ I+I I + + Sbjct: 435 MFRLAAGMRSSNNSIPRVLIVLTDGRSNPGF---EPDEYSTALKDKGIEIYAIGVGDYYS 491 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLI 399 + + + + N D L Sbjct: 492 IEVQEMASEPKDRHAFELSNQDDLA 516 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/326 (9%), Positives = 76/326 (23%), Gaps = 20/326 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINL 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ + + S I + S I Sbjct: 76 SVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTT 135 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFL 192 +R N I V + + IP + P+ Sbjct: 136 SRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAG 195 Query: 193 IELVVDL-------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + +G+ + +S + A + + Sbjct: 196 ATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPG 255 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DS 294 D + + Y + + + D + + T Sbjct: 256 DKAATTNAYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASY 315 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQ 320 + S + R+ Sbjct: 316 QGDLITGINTGGSYNPSYYAAGADRR 341 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 70/227 (30%), Gaps = 42/227 (18%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +ELV+D+SGSM R++++ + A +D++ Sbjct: 32 APVEPPKVELVLDVSGSMRATD--------------IDGRSRISVAQQAFNEVVDALPDE 77 Query: 246 SHVKEDVYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + V + G Q T A++ A Sbjct: 78 TQLGIRVLGATYPGENKERGCQDTQQIVPVGPVDRVQAKAAVATLRPTGFTPVGLALRSA 137 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q L + + I+ +TDGE+ + ++ + Sbjct: 138 AQDLGTGS-------------------TARRIVLITDGEDTCAPPDPC--EVARELAAQG 176 Query: 362 IKIVTISINASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K+V ++ +P+ ++LL ++ + +AD L + + Sbjct: 177 TKLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAAQSADELTGRIKQL 223 >gi|88856118|ref|ZP_01130779.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] gi|88814686|gb|EAR24547.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] Length = 585 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/357 (9%), Positives = 90/357 (25%), Gaps = 31/357 (8%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDD----AKRFIKNHIKESLSGYSAVFYNT 116 A +++ + SR+ A A D A + + + Sbjct: 231 TRAAFAELEISDSRVVALATEREIVAHNASNPDEPLVALYPSEGTTALTYPFILVANSSI 290 Query: 117 EIQNIVNSSRIS-MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 ++ + + +S F ++ + + + Sbjct: 291 SARDESRLTDEERTALVETETSRASMLEEFADLIRGSEQVLNAEGFRDNVGGGEILASGV 350 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 +G+ L + ++ + + + S + ++ +N+ Sbjct: 351 IAAPTATKNVGDTAELTSILRNWNALNLRSRILTVVDVSGSMLERAWDGQRRIEMFQNSA 410 Query: 236 LLFLDSIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++ + +G + K Q + + S I Sbjct: 411 EDAFSTLSEDDELGMWLFSKERVGTEDWEDVAGIRSMNDPDHVKNLQAIIDSLPSRINGY 470 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A + D + ++ +TDG N + Sbjct: 471 TGLNDTVLAAVTHVREDYNSEKVNS----------------VMLITDGRNEDDNGISLQK 514 Query: 352 ------KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ K + + +V + I + + + K ++ Y N L +V Sbjct: 515 LLDKLTEMIQKDTDEPVPVVLVGIGPDTDVEAMRKIAQATGGTAYQANNPTELSNVM 571 >gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 806 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 37/265 (13%), Positives = 83/265 (31%), Gaps = 58/265 (21%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 ++ P + + V+D SGSM Sbjct: 288 GDEDGYFLLLASPGVDAKAKQIVS-KDVVFVLDTSGSM--------------------SG 326 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYV-TRD 283 KM K AL ++S++ + +I ++T E + + +++ R+ Sbjct: 327 KKMEQAKKALQFCVESLNDGDRFE------IIRFSTESEPLFDKLAAVSKENREKAGDFI 380 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T A+K+A + + + + ++FLTDG Sbjct: 381 KNLKAMGGTAIDEALKKALSLESKEGRPF-------------------VVVFLTDGLPTV 421 Query: 344 FK--SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-- 399 + + ++ KE +I I N L + + + V+ + L Sbjct: 422 GTTDEDQILKGMQERNKEKR-RIFCFGIGTDVNTHLLDRIAEETRAFSQYVLPEEDLEVK 480 Query: 400 --HVFQNISQLM---VHRKYSVILK 419 F I++ + K++ ++ Sbjct: 481 VSSFFSKINEPVLANPKLKFTADIR 505 >gi|308474504|ref|XP_003099473.1| CRE-MUP-4 protein [Caenorhabditis remanei] gi|308266662|gb|EFP10615.1| CRE-MUP-4 protein [Caenorhabditis remanei] Length = 2161 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/177 (9%), Positives = 56/177 (31%), Gaps = 20/177 (11%) Query: 230 ALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSL 287 + +L FL + + ++ Y+ ++ + Sbjct: 452 VFQTEVLRFLAEFTELFDIAPQKTRVSVVQYSDQIRHEFGL-DNYSDRKSLQNAIRNIEY 510 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ S+++ + + I +TDG + + + Sbjct: 511 LTGLTRTGAAIEHVANEAFSERRGARPVG-----------QVSRVAIVITDGRSQDNVTR 559 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + D A++ I++ + + + L+ + + ++V + L ++ Sbjct: 560 PS-----DNARKQEIQLFAVGV-TNHVLDAELEEISGAKDRTFHVSGFEDLNTRLRS 610 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 72/225 (32%), Gaps = 45/225 (20%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G R + L++D SGSM+ + K+ +++A + + Sbjct: 644 GVRKNSQVILLIDNSGSMNDEVAPGAA--------------KIDRVQSAANAAIGLLAPK 689 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD-STPAMKQAYQI 304 + + + + V + ++ Q T A++ A+ Sbjct: 690 D------ELAVWTFGSSV-HKTALAPMGNRISQVRAEIGAIEAGGTTTQLPAAVQAAHDA 742 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN--NFKSNVNTIKICDKA----- 357 L P P K ++ LTDG N ++ K + A Sbjct: 743 L--------------AQTNDPDNPKTKAVVLLTDGATNLTPDGADEEENKAANDALVADI 788 Query: 358 --KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 E+ ++I TI S + L K +S +Y +SLI+ Sbjct: 789 RGSESHVRIYTIPYGNSADKCLLEKVAAASGARYYGAGARESLIN 833 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 192 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 251 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 252 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNN----RIGEVKIGVDRFVDT 300 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 301 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGT 353 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 354 FTQKALRDAGSMLSVPNGH----------KKVIVLLTDCVPTFSYKVQ-------RVHAQ 396 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 397 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 456 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 457 HGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 515 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 62/223 (27%), Gaps = 44/223 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 ++LV+D+SGSM ++MAA K A LD+ Sbjct: 33 TTADAPKVDLVLDVSGSMRARD--------------IDGGSRMAAAKQAFNEVLDATPEE 78 Query: 246 SHVKEDVYMGLIG-------YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + + + + V T PA+ Sbjct: 79 VRLGIRTLGANYPGDDRKTGCKDTAQLYPVSTLDRTEAKTAVATLS---PTGWTPIGPAL 135 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 +A L I+ +TDGE+ + + AK Sbjct: 136 LKAADDLDGGTGS-------------------HRIVLITDGEDTCAPLDPCEVAREIAAK 176 Query: 359 ENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + I T+ + + + L ++ + +V + + L Sbjct: 177 GVGLTIDTLGLVPNSKLSKQLSCIAEATGGTYTSVEHKEDLTD 219 >gi|327270790|ref|XP_003220171.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 866 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 66/195 (33%), Gaps = 34/195 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV- 276 Q K ++ L A LFL I +++ ++G++ + ++ + Sbjct: 321 ASAQMGKDNRLNRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQAGLQKIFSDIE 375 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ +T + + + + A+Q+ + + I+ L Sbjct: 376 RQSLTSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE----------------IVLL 419 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 T+GE ++ C K + I I TI+ + L K + + Sbjct: 420 TNGEGSDLSP-------CLSKIQSQEIIIHTIAFGSKA-SNELEKLADMTGGKTFYATDS 471 Query: 394 -NADSLIHVFQNISQ 407 +++ LI F IS Sbjct: 472 LDSNGLIDAFGGISS 486 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 19/168 (11%) Query: 236 LLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPT 292 FL + +G+I ++ R E +++ + + V + L+ T Sbjct: 37 KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIP--LMNFTT 94 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A++ A + + + K II LTDG + + Sbjct: 95 RIDRALRLAQKDMFTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPER 141 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I D+ + + + I+ + I ++ N + L + Y DSL Sbjct: 142 IADELRNDGVVILGVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 188 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 19/168 (11%) Query: 236 LLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPT 292 FL + +G+I ++ R E +++ + + V + L+ T Sbjct: 253 KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIP--LMNFTT 310 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A++ A + + + K II LTDG + + Sbjct: 311 RIDRALRLAQKDMFTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPER 357 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I D+ + + + I+ + I ++ N + L + Y DSL Sbjct: 358 IADELRNDGVVILGVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 404 >gi|260814488|ref|XP_002601947.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] gi|229287250|gb|EEN57959.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] Length = 1386 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 55/194 (28%), Gaps = 29/194 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 + +K + + + D +G++ Y+T Sbjct: 276 SGSVSVSDFETVKQ---FVVAVVSAFTIGLADTRVGVLQYSTSSSLECNLGDHPDEASFV 332 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T + AM+ A Q P K +I LT Sbjct: 333 SAINTLVY-QKGGNTYTGAAMEFARQ-----------------NAAWRPAPVPKIMIVLT 374 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++++ D+ + I + S + LL+ + P + + + D Sbjct: 375 DGKSSDSVVAAAQALAADQV-----AVFAIGVG-SFDHSELLEITNNKPGRVFELDDFDV 428 Query: 398 LIHVFQNISQLMVH 411 L I + + Sbjct: 429 LAQSINRIVRAVCS 442 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 20/194 (10%), Positives = 56/194 (28%), Gaps = 29/194 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 + +K + + + + +G++ Y+T Sbjct: 485 SGSVSVSDFETVKQ---FVVAVVSAFTIGLAETRVGVLQYSTSSTLACNLGDHPDEASFV 541 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + A++ A Q P + +I LT Sbjct: 542 SAINTMTY-QKGGSTYTGAALEFARQ-----------------NAAWRPAPVSRIMIVLT 583 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++++ D+ + + I + S + LL+ + + + + + ++ Sbjct: 584 DGQSHDSVVAAAQALAADQ-----VTVFAIGVG-SFDHSELLEITSNKLGHVFELDDFNA 637 Query: 398 LIHVFQNISQLMVH 411 + I + + Sbjct: 638 MAQNITQIVRAVCK 651 >gi|159184836|ref|NP_354603.2| hypothetical protein Atu1609 [Agrobacterium tumefaciens str. C58] gi|159140120|gb|AAK87388.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 579 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 7/89 (7%), Positives = 24/89 (26%), Gaps = 6/89 (6%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + + N +I L+ F+ ++ + +K ++ + + Sbjct: 6 SLLPRFLKDKTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQTA------DLAAI 59 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRF 97 + + A + K Sbjct: 60 SAAANATDAEKAVQQYFALNGMDLGVKTD 88 >gi|288959068|ref|YP_003449409.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] gi|288911376|dbj|BAI72865.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] Length = 149 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 34/118 (28%), Gaps = 8/118 (6%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 L G + L + + + A +AA LAG M + R I + +A Sbjct: 33 PTALGAGCTVDALRSYAVEARLSQAVDAAALAGGRVMFDSQ-----RDGHIRSFFDKAFP 87 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 + +H+ A + + + I + + +I Sbjct: 88 NGFL---GSHLSPLTIAEDAAAGTLTVSAHASVNAIFLRLFGKKEVTVEAQSIVRRGQ 142 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 73/238 (30%), Gaps = 42/238 (17%) Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 S A +ELV+D+SGSM R++++ + A Sbjct: 24 SPAGADDETTEPVAEPPKVELVLDVSGSMRARD--------------IDGRSRISVAQQA 69 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +D++ + + V ++ G Q Sbjct: 70 FNEVVDALPDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVDRTQAKAAVAGLRPTG 129 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A Q L + + I+ +TDGE+ + Sbjct: 130 FTPVGLALRSAAQDLGTGS-------------------TARRIVLITDGEDTCAPPDPC- 169 Query: 351 IKICDKAKENFIKIVTISINASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ + ++V ++ +P+ ++LL ++ + +AD L + + Sbjct: 170 -EVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADELTGRIKQL 226 >gi|31789427|gb|AAP58542.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 329 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/264 (12%), Positives = 69/264 (26%), Gaps = 72/264 (27%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L + + +++D S SM ++ A + FL Sbjct: 78 ALATSTPQYQREDLVIMLDRSASMRAHDV---------------SPSRFARATAEIRDFL 122 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAM 298 + +GL+G+ + + V Y+ D L T+ A+ Sbjct: 123 QH-----KPENIDRVGLVGFAGTSLILSYLTRDLDTVAFYLDWIESDPRTLLGTNIGAAL 177 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + A + K +K + L+DGE+ + ++ + Sbjct: 178 RNALDV-----------------AKKDDRRARKIFVLLSDGEDYG---DEVARQLAVY-R 216 Query: 359 ENFIKIVTISINASPN---------------------------GQRLLKT-CVSSPEYHY 390 +I +I I + + +L+ + + Sbjct: 217 GQGYRINSIGIGSDEEVVVPEVQPDGREMSLRDENGRVVRTRFEESMLRDVAARTGGRYL 276 Query: 391 NVVNADSLIHVFQNISQLMVHRKY 414 + L I Q R+ Sbjct: 277 RSRSGGDLTRALHAIEQ--GERRL 298 >gi|325473817|gb|EGC77005.1| BatB protein [Treponema denticola F0402] Length = 286 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/249 (12%), Positives = 71/249 (28%), Gaps = 64/249 (25%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + R + + V D+S SM QD + +++A + L + L Sbjct: 36 SKPVSVRRRGVS-VMFVSDISKSMSL---------------QDIQPSRIAVQRQFLKILL 79 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAM 298 + + S +GL+ ++ S+ + + L T+ + Sbjct: 80 EKMHKTSPES---AVGLVITKGEGVLSVPLSFEKNALSSAINALSPLILSSTGTNLEAGV 136 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 +A ++ K I+ TDG + + +K K Sbjct: 137 LRALDSFGENRGN------------------SKIIVLCTDGGETSGS----LLHAAEKIK 174 Query: 359 ENFIKIVTISINA---------SPNGQRLLKTC-------------VSSPEYHYNVVNAD 396 + ++ + G LK + + +++ Sbjct: 175 KTDAILIIVGFGTLEETKIKVLDEKGNTQLKDARLEEAFLKKIASIAGGESTYISALSSG 234 Query: 397 SLIHVFQNI 405 S+ + + I Sbjct: 235 SIETILKII 243 >gi|218672134|ref|ZP_03521803.1| hypothetical protein RetlG_11055 [Rhizobium etli GR56] Length = 125 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 44/125 (35%), Gaps = 31/125 (24%) Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-------------CDKAKENFIKI--- 364 G + +K + F++DG +++K + T K C K+ +KI Sbjct: 1 GDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVL 60 Query: 365 VTISINASPNG-------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNIS-QLMV 410 T + N ++ C S P +++ V + + + + +++ Sbjct: 61 YTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTEGITDAMKALFLKVIR 119 Query: 411 HRKYS 415 + + Sbjct: 120 APRIT 124 >gi|149440247|ref|XP_001521494.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3, partial [Ornithorhynchus anatinus] Length = 390 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 65/211 (30%), Gaps = 38/211 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 213 NVVFVIDVSGSM--------------------YGRKLVQTKEALLKILEDMKEEDYLNFI 252 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ I T + +K +++V D T+ + + ++L ++ Sbjct: 253 LFSSEIT-TWKDTLIKATPENLKKAKEFVKNIKDE---GLTNINDGLMRGIKMLNEARET 308 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 + II LTDGE N + + I+ A + + Sbjct: 309 NVV-----------PKRSTSLIIMLTDGEANVGEIRADKIQENVRNAIGGKFPLYNLGFG 357 Query: 371 ASPNGQRLLKTCVSSPEYHYNV--VNADSLI 399 N L K + + + + L Sbjct: 358 YDLNYNLLEKMALENHGLARRIYEDSDADLQ 388 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 64/205 (31%), Gaps = 30/205 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K + + + L+ Y+ ++ + + Q D Sbjct: 179 QNDFNQMKGFVQAVMGQFEGTD-----TLFALMQYSNLLKIHFTFTQFRTSPSQQSLVDP 233 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + + L +K +I +TDG+ + Sbjct: 234 IVQLKGLTFTATGILTVVTQLF-------------HHKNGARKSAKKILIVITDGQK--Y 278 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 K + + +A++ I I + P ++ L T S+P ++ + V N +L Sbjct: 279 KDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHVFKVDNFAALG 338 Query: 400 HVFQNISQLM-----VHRKYSVILK 419 + + + + + + S + Sbjct: 339 SIQKQLQEKIYAVEGTQSRASSSFQ 363 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 64/205 (31%), Gaps = 30/205 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K + + + L+ Y+ ++ + + Q D Sbjct: 163 QNDFNQMKGFVQAVMGQFEGTD-----TLFALMQYSNLLKIHFTFTQFRTSPSQQSLVDP 217 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + + L +K +I +TDG+ + Sbjct: 218 IVQLKGLTFTATGILTVVTQLF-------------HHKNGARKSAKKILIVITDGQK--Y 262 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 K + + +A++ I I + P ++ L T S+P ++ + V N +L Sbjct: 263 KDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHVFKVDNFAALG 322 Query: 400 HVFQNISQLM-----VHRKYSVILK 419 + + + + + + S + Sbjct: 323 SIQKQLQEKIYAVEGTQSRASSSFQ 347 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 64/205 (31%), Gaps = 30/205 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K + + + L+ Y+ ++ + + Q D Sbjct: 163 QNDFNQMKGFVQAVMGQFEGTD-----TLFALMQYSNLLKIHFTFTQFRTSPSQQSLVDP 217 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + + L +K +I +TDG+ + Sbjct: 218 IVQLKGLTFTATGILTVVTQLF-------------HHKNGARKSAKKILIVITDGQK--Y 262 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 K + + +A++ I I + P ++ L T S+P ++ + V N +L Sbjct: 263 KDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHVFKVDNFAALG 322 Query: 400 HVFQNISQLM-----VHRKYSVILK 419 + + + + + + S + Sbjct: 323 SIQKQLQEKIYAVEGTQSRASSSFQ 347 >gi|332992330|gb|AEF02385.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas sp. SN2] Length = 1360 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/354 (9%), Positives = 80/354 (22%), Gaps = 33/354 (9%) Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIK--ESLSGYSAVFYNTEIQNIVNSSRIS 128 + D + + + EI + + Sbjct: 372 AANDADDFSSGNRYLLRNIGKTSGVTWSMGSFSSGNDYASPDLSGVVKEIVDRAGWQSGN 431 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + Y+ ++ ++F + + I + + Sbjct: 432 AMAFVFSDFTGYRGAYTYSGSQSSAAKLVVKFNGSATPGQTSTVREHLISKVDELTASGY 491 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L ++ ++ N +K T+++ L + S Sbjct: 492 TPIVDTLYEAVNYYGGRDVDYGLTRGNYKVSNTVRKNTRVSHRSAYLGGDAVQPNGCSDD 551 Query: 249 KEDVYMGLIGY-TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + Y + + + ++ D Sbjct: 552 NLSDSDCINEYIPSGATYISPVNDLQCQTNNHIVLLSDGEA------------------- 592 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + S G +K + L N S+ + I + + TI Sbjct: 593 NNNHSVSLIESLLGDSCTGSGGEKCGLNL---VKNVSDSDTSVI-------DRRVITHTI 642 Query: 368 SINASPNGQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 A+ L + V S Y ++ L+ F I + + + + G Sbjct: 643 GFAANSTANNFLNQLAVQSGGGFYQADDSTELLEAFNTILRSVKDVNSTFVSPG 696 >gi|114321541|ref|YP_743224.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] gi|114227935|gb|ABI57734.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] Length = 972 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 14/157 (8%), Positives = 37/157 (23%), Gaps = 20/157 (12%) Query: 251 DVYMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 MG++ ++ GT + Q T + L Sbjct: 364 RTAMGVVRFSASASVVHPMIAIPDPGTAEKDQLKDAIDSLPASGLTAMFDGLILGLDELQ 423 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + L+DG +N+ + + ++ + I+ Sbjct: 424 DYSAAN-------------DTDAGQVAFLLSDGGDNSSAATEP--QTVQAYQDANVPIIA 468 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVF 402 + L + ++ + + F Sbjct: 469 FGYGSFAPTGVLRRLADNTGGEFFASPTTLAEIQEAF 505 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 60/219 (27%), Gaps = 33/219 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + C I T L+ F +++ + V +GL Sbjct: 1226 GENCDHVCNTTISGRLDIQIVIDTSGSLTSAPNKDQVLMNFTNNLANMYDTINQVKIGLT 1285 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 ++ + ++ + T+ T ++ A + Sbjct: 1286 SFSESSVLEMPL-DFYNQLELQDGVSNMTWQGSFTNITSGVETALNDMD----------- 1333 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-----AS 372 S +I +TDG + ++ D+AK ++++ + S Sbjct: 1334 -------TSDAVDDVMILITDGFQ--STNTTLMFQMIDQAKAEGVRLIALGFFGEFAFYS 1384 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 PN + Y+ N L+ + I + + Sbjct: 1385 PNLYLMT-------NEVYHAANYAELLAIDNTIFETICS 1416 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 29/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D +++ KN L +D +G+I + + ++ Sbjct: 85 DLSESMNATDVVPSRIDRAKNELQGLIDRFSA-------DRIGIILFNSNAYLLTPLTFD 137 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 TE +R + +I TD +P ++ + L+ + K Sbjct: 138 TENIRNTIGNLKTHMIDKGSTDFSPMLEMINEKLSVGTQNRG-----------------K 180 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I +TDGE + ++ + K+N I + + + Sbjct: 181 VAIVVTDGETHYQID----EQLAKRLKQNNIHLFWLGVGT 216 >gi|240169099|ref|ZP_04747758.1| hypothetical protein MkanA1_07284 [Mycobacterium kansasii ATCC 12478] Length = 741 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/276 (11%), Positives = 75/276 (27%), Gaps = 61/276 (22%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + +S V+ +P + +V+D SGSM Sbjct: 261 RGALSSSALLVPDADG-----DEGTWSVTLVPPAE----PSSAPRDVVVVLDRSGSMGGW 311 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 KM A + A +D +D + ++ + R++ Sbjct: 312 --------------------KMVAARRAAGRIVDMLDTVD------RFCVLAFDDRIDTP 345 Query: 267 IEPSWGTEKVR-----QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + G + + T+ +++A ++L Sbjct: 346 TDMAPGLVEGSDQNRFAAASWLGSLRSRGGTEMAEPLRRAVELLAGS------------- 392 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 Q ++ +TDG+ + + I+I + I+ + N L + Sbjct: 393 ----DEGRQASVVLVTDGQITG--EDHLLRSLAQSLGR--IRIYCVGIDRAVNAGFLDRL 444 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 V + L +++ + + + Sbjct: 445 ARLGRGRAELVESEARLDEAMSRLARTIGRPALTAV 480 >gi|290991702|ref|XP_002678474.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284092086|gb|EFC45730.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 467 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 62/192 (32%), Gaps = 28/192 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TKM + ++++ +E +G++ + + E + + + ++ + + Sbjct: 115 TKMQVANEVICEIIENL------REFERLGIVLFDDKAETLLPLTIVQDLDKKSLKETVL 168 Query: 286 SLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 ++ T+ M++ + +S + II+LTD N Sbjct: 169 NIKEKGSTNFEAGMQRGIDLFSSLDSSDLSNSN--------------RIIYLTDACPNVG 214 Query: 345 KSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADS----L 398 + A I + + I N + + + +++V + + L Sbjct: 215 -GTATLDILTKDANSGPYSIFSTFVGIGLDFNSKIVDELTRVRGCNYFSVKSTEEFKKIL 273 Query: 399 IHVFQNISQLMV 410 F I + Sbjct: 274 NEDFNYIVTPIC 285 >gi|224090449|ref|XP_002195035.1| PREDICTED: integrin, alpha 1 [Taeniopygia guttata] Length = 1184 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 64/194 (32%), Gaps = 26/194 (13%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ FL+S+ + + +G++ Y V + Sbjct: 187 YPWESVTDFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYL-NTYSTTEDVMAAASRIRQ 245 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + F QK ++ +TDGE+++ Sbjct: 246 RGGTQTMTAL---------GIDTAREEAFTEAHGARRG--VQKVMVIVTDGESHDNYRLQ 294 Query: 349 NTIKICDKAKENFIKIVTISI-NASPNGQ-------RLLKTCVS--SPEYHYNVVNADSL 398 I C+ + I+ I+I + G +K+ S + ++ +NV + +L Sbjct: 295 EVIDDCE---DENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELAL 351 Query: 399 IHVFQNISQLMVHR 412 + + + + + + Sbjct: 352 LTIVEALGERIFAL 365 >gi|332365023|gb|EGJ42788.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK355] Length = 462 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/339 (8%), Positives = 89/339 (26%), Gaps = 28/339 (8%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ + + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-SGSM 203 + + + + + + + G I + Sbjct: 191 PIQGQMNIAVSFVFDTSGSMNWDLQGRNV---------EKTGNESRMDILRKKSVIMIKD 241 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + + ++ + + L N + +I ++ D G+ + Sbjct: 242 LAEIGNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPD---GVTNPGDGL 298 Query: 264 -EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI-LTSDKKRSFFTNFFRQG 321 I +++ Y+ D + + A+ ++ L+ R F N Sbjct: 299 RYGMISLQSQPAQLK-YIVLLTDGIPNAYLVDSRALYAGNRVDLSQGVGRVTFNNPIYDL 357 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI---KIVTISI-NASP---N 374 + + L ++ + + +K+ + ++ I + Sbjct: 358 SPTLGYEYSRLGYDL--YSRDSITRENSIAYAGEVSKKFGLGVKRVNVIGFSGVNHEIAY 415 Query: 375 GQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 GQ L + + N ++L F +I + + Sbjct: 416 GQSLTDRIGEGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 105/352 (29%), Gaps = 46/352 (13%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHY-------------KKNSMESANNAAILAGASKMVS 70 ++ A+ +MS + LI +I+ + + SM+ + + S V Sbjct: 14 MVMAIMLMSMIALIIGVIFNTMFSSRELIEREASIQAEMRTSMQY-VDRTVGKATSIFVL 72 Query: 71 NLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ + + S A ++ N K+ + + + Sbjct: 73 DDSKFKGSKQGLSREWSYIGLSADGKKVMNYVWNKKKQDWDVSELGTKSLYNMKLDLEFK 132 Query: 127 ISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + N + S N + + + ++ K I Sbjct: 133 TEGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPI 192 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 G+ I + V D SGSM+ + + + ++M L+ ++ + + Sbjct: 193 QGQMNIA-VSFVFDTSGSMNWDLQGRN-------VEKTGNESRMDILRKKSVIMIKDLAE 244 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + +V ++ + GT + +T+ + T+ ++ Sbjct: 245 IGNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGVTNPGDGLRYGMIS 304 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 L S K+I+ LTDG N + + + ++ Sbjct: 305 LQS------------------QPAQLKYIVLLTDGIPNAYLVDSRALYAGNR 338 >gi|301780866|ref|XP_002925835.1| PREDICTED: integrin alpha-D-like [Ailuropoda melanoleuca] Length = 1040 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 57/167 (34%), Gaps = 20/167 (11%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + LI Y+ ++ + + D + T + ++ Q L Sbjct: 195 GGTNTLFSLIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQLNGLTFTATGIRTVVQELFH 254 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 K +K +I +TDG+ +K + + +A+ I I Sbjct: 255 SKNG-------------ARKTARKILIVITDGQK--YKDPLEYSDVIPQAERAGIVRYAI 299 Query: 368 SINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + + ++ L T S P ++ + V N +L ++ + + + + Sbjct: 300 GVGDAFQKLTARQELNTIGSKPSQDHVFRVDNFAALSNIQEQLQEKI 346 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/359 (7%), Positives = 80/359 (22%), Gaps = 51/359 (14%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ + + D ++I+ +G A + + ++ + Sbjct: 214 NTNVLDHQGNKNGTNQWDGINSWNGDPNDRTHSYIEYGGTGNQADYAIRKYAKETSTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDT 322 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 I + + + ++KN + S Sbjct: 323 LADSGITDKI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGT 375 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + ++ + D + ++ Sbjct: 376 FTQKALRDAGNMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQ 418 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKI 364 + T F + + I D N + + + TI KE I+I Sbjct: 419 SSSNYYGTQFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEI 478 Query: 365 VTISINASPNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + I + L + +Y +A + + + Sbjct: 479 HGLGIQLQSDSAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 537 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/389 (7%), Positives = 73/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M AN A LA + + N D Sbjct: 151 AEARMAPANLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 210 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 211 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 270 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 271 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 317 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 318 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 367 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 368 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 427 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 428 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 483 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 484 DENGDLYYESADYAPDISDYLAKKAVQIS 512 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/280 (13%), Positives = 86/280 (30%), Gaps = 63/280 (22%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + L + + F VD++ Sbjct: 946 RKKLNLVIVLDISGSMGSSFDEYYYDRFG-----------------NRVDVN-------- 980 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 +D +++K+ A++ LD ++ +GL+ + T E Sbjct: 981 ----------ETEDAEKSKIEIAAAAIVALLDHLEDDD------RLGLVLFNTGAELAEP 1024 Query: 269 PSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S K Q + D+ + T + M+ A ++ + +++ Sbjct: 1025 VSLIGAKNMQKLKGDVLEISATDGTRLSAGMQMATELYD-------------EFLEVNQS 1071 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSS 385 ++ IIFLTD N +++ ++ +A N + T I I N + + Sbjct: 1072 EYENRIIFLTDAMPNLGQTSEESLLGMTEANANK-NVYTTFIGIGVDFNTELVEYITKIR 1130 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVH-----RKYSVILKG 420 +Y+V +A + + MV + ++ G Sbjct: 1131 GANYYSVHSAKQFKERMDDEFEYMVTPLVFDLRLNLDASG 1170 >gi|325285570|ref|YP_004261360.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321024|gb|ADY28489.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 235 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/235 (9%), Positives = 61/235 (25%), Gaps = 52/235 (22%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + + +G + + + + + Sbjct: 45 IDISGSMEGIDEGSVKDQVVRE-----VGNKAGSQVSKAIG------------GKIGSIL 87 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ TK+ A+K + +I L K+ + T+ + T + Sbjct: 88 GKQVTKEATKLGAVKR---KLIPAIKGLPDGKKFLVFSFNNNVTKQATEFRVASNTTRTS 144 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T++ + +A + I+ ++ Sbjct: 145 SNI-FVQNLKASGGTNTLEGLLEALSTAD-----------------------VQEIVLMS 180 Query: 338 DGENNNFKSNVNTIKICDKAKE---NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 DG N+ + ++ K+ + I I TI+ + + + Sbjct: 181 DGLPNSG-----PKAVLEEIKKVNTSNIIIHTIAFGEDADLDFMRTLAQENNGTF 230 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 40/146 (27%), Gaps = 19/146 (13%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D + L+ Y +EP+ G K T+ ++ AY Sbjct: 118 DDRVSLVVYAGASGVVLEPTPG-NKRATIEQALQQLSAGGSTNGGAGIRLAYAKAREAFI 176 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISI 369 +I TDG+ N N I + + ++ I + T+ Sbjct: 177 EGGIN----------------RVILATDGDFNVGTVNHQALIDLIKQQRQAGIALTTLGF 220 Query: 370 NASPNGQRLLKTCVS-SPEYHYNVVN 394 L++ + + + Sbjct: 221 GGGNYNDHLMEQLADQGDGNYAYIDS 246 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 66/192 (34%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D+++ + + G + Y+ VE + G ++++ V Sbjct: 60 DKVKSFTKRFIDNLNDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K+++ +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEELLVGGSHLKEN---------------KYLVVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEVISQTI 232 >gi|198426622|ref|XP_002122673.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 721 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 61/190 (32%), Gaps = 27/190 (14%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLIL 289 N + F+ SI + + + M + Y +V+ I S ++ Sbjct: 538 NTMKQFVRSIITTFNFGENEARMAVFRYNRQVDTRNQILLSDHINNKTTFLEAYDKLPYN 597 Query: 290 K-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A++ A ++ +++ + + I+ +TDG + + Sbjct: 598 GFGTFTGRALRHAKNVILANRNGN-------------RPNVKDVILTITDGRSQD----- 639 Query: 349 NTIKICDKAKENFIKIVTISI----NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-Q 403 N I + +E + I I A + +LL + V L F Sbjct: 640 NVATISTELREMGVTTFVIGIQPGNGAGLDQDQLLAMGGTPENTILAVGGFGGLDATFLN 699 Query: 404 NISQLMVHRK 413 +S + + Sbjct: 700 RLSASICTNR 709 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 86/282 (30%), Gaps = 31/282 (10%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 ++ +I ++ T+ L ++ + + I L Sbjct: 99 TSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPDFQL 158 Query: 200 SGSMHCAMNSDPEDVNSAPIC-QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 S S A P ++ +C + A+KN L F+ + +GLI Sbjct: 159 SASFSPAAQPCPSLIDVVVVCDESNSIYPWDAVKNFLEKFVQGL---DVGPTKTQVGLIQ 215 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNF 317 Y + K V S T++ A++ A + S Sbjct: 216 YANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASG------ 269 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 270 -------GRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 319 Query: 378 L--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 320 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 361 >gi|321460550|gb|EFX71591.1| hypothetical protein DAPPUDRAFT_255495 [Daphnia pulex] Length = 930 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/245 (11%), Positives = 68/245 (27%), Gaps = 41/245 (16%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIE--LVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + +++ P + IE + S + SD Sbjct: 276 MCGGTSTWTVITLHPDFAMNNNPPNDVAFIEPNFQIVRSSGSRFVLVSDTS-------GS 328 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYV 280 ++ L + ++ + + +G++ + + R+ + Sbjct: 329 MNDYNRIVRLYESSRRWIKY-----DISDGSKLGMVQFANNARILSPIVEINGDASREAL 383 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + T +++A +L I+ LTDGE Sbjct: 384 IARLPVTAVGGTCIGCGLQKALDLLRPGGPG-------------------GVILLLTDGE 424 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS--- 397 + + I D K + ++V+I+ ++ + Y + + DS Sbjct: 425 ETDRPFINDVIS--DVIK-SGARVVSIAFGRKA-EDKIEDLATKTNGKSYFIDDNDSSQG 480 Query: 398 LIHVF 402 L F Sbjct: 481 LNDAF 485 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 47/122 (38%), Gaps = 26/122 (21%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A+++A I ++ K ++ L+DG++N Sbjct: 110 NSGKTNTNGALQEARNIFVGSRQD-----------------VHKVVVLLSDGQSNTGGDP 152 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T ++ +++ ++I TI+I + L + +S + + + F+ ++ Sbjct: 153 TTT---AEELRQDGVEIFTIAIGL--FNKDELNSIATSDHHTFEYSSFRE----FKKLAS 203 Query: 408 LM 409 + Sbjct: 204 RI 205 >gi|297559546|ref|YP_003678520.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843994|gb|ADH66014.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 587 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 51/192 (26%), Gaps = 22/192 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 T+M A L+ S + + + ++ Sbjct: 391 PGTGMTRMQVTSAAATQGLEMFTPSSELGLWEFSTNVNNELHYQEIAPIRELQAA----- 445 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF-----FTNFFRQGVKIPSLPFQKFIIF 335 T + A L + + +++ + I+ Sbjct: 446 -------ADDGTAHRDVLAGALASLQPLPQGDTALYETYLAAYQEMSRTYQPDRTNVILM 498 Query: 336 LTDGENNN--FKSNVNTIKICDKAKE--NFIKIVTISINAS-PNGQRLLKTCVSSPEYHY 390 LTDG+N+N + + I I+TI+ N + L + ++ Y Sbjct: 499 LTDGDNDNPGGLGLDELMSQIESLASPSRPIPIITIAFGPDVQNLEPLQEIAAATGGAAY 558 Query: 391 NVVNADSLIHVF 402 + + +F Sbjct: 559 MTEDPTEIGEIF 570 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 56/185 (30%), Gaps = 18/185 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + L D S +E+V I ++ + + Sbjct: 256 IDDYRLQLQFVADLTQNFSIGRENVQFSTIVFSNEASVAFGFDRYLDPISLQQAIRSVEY 315 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 I T + A+ A Q+ T + I LTDG + + Sbjct: 316 IGSATFTDKALSLALQVFTPGSYGARANE-------------NHVAIVLTDGLSTD---R 359 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 T+ + + + I + I + L + ++V D++ ++ +I++ Sbjct: 360 NKTLIEAARLRNANVTI--LGIRIGQYQIQELIDITGDVKKVFSVEGFDAIHNLTYDITK 417 Query: 408 LMVHR 412 + + Sbjct: 418 AICKQ 422 >gi|328784200|ref|XP_003250409.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Apis mellifera] Length = 2258 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 29/188 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +S + +++ +K+ L F I + + + G V++ + Sbjct: 108 DSSGSVGSKNFHSELNFVKHVLSDF-PVIPSATRIAIVTFGGRRHIRRNVDQISRTNEND 166 Query: 274 EK---VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K + + +T S T + A+ +AY+IL + Sbjct: 167 NKCYLLNKQLTNI--SYTGGGTYTRGALLEAYRILE-----------------KARSNAK 207 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH- 389 K + +TDG +N + K I T I N L S + + Sbjct: 208 KAVFLITDGFSNGGD----PRPAANLLKGAGATIFTFGIRTG-NVDELHDIASSPKDTYS 262 Query: 390 YNVVNADS 397 Y + + Sbjct: 263 YFLDSFTE 270 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 51/190 (26%), Gaps = 34/190 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---------R 277 KM A K A+ +LD I + + E + S G+ +V Sbjct: 173 KMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKE 229 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q+ T A++ + + + ++ Sbjct: 230 QFNAALSKFGPKGWTPLASAIE----------------SINDDFKEYTGEENLNVVYIVS 273 Query: 338 DGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVN 394 DGE + ++ + I + + Q+ LK + + V + Sbjct: 274 DGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYATVSS 330 Query: 395 ADSLIHVFQN 404 AD L Sbjct: 331 ADELHQTLNK 340 >gi|311743549|ref|ZP_07717355.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311312679|gb|EFQ82590.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 445 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 12/150 (8%), Positives = 38/150 (25%), Gaps = 5/150 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I ++ +II A+ + S L + I + + ++A +A + + + + Sbjct: 12 IHEDRGATAIITAVIMTSLLSIGALTIDYGAASHVRRQTQNAADATTRSIV-ENCAKQAA 70 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKE---SLSGYSAVFYNTEIQNIVNSSRISMTH 131 + DA ++ + + Sbjct: 71 AAGQNVLDGACVSATGTADASTIVQGNAPGSAPDAPVIGGAGREVSVTVAEPVDYRLAQL 130 Query: 132 MANNRLDSSNNTIFYNMD-VMTSYDYRLQF 160 + + ++ + + F Sbjct: 131 LGKDSDVVRSSATAEWTNLRPVEGFPVVPF 160 >gi|218458154|ref|ZP_03498245.1| hypothetical protein RetlK5_01327 [Rhizobium etli Kim 5] Length = 156 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 49/156 (31%), Gaps = 35/156 (22%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + + ++ + N G + QK + F+ DG +++K Sbjct: 1 MSIPYQNYYNDQQTSFDEALKGIEGEITGNI---GKGTSNADRQKIVFFVADGVADSYKP 57 Query: 347 NVNT---------------IKICDKAKENFIKI---VTISINASPNG------------- 375 + T C K K+ IK+ T + NG Sbjct: 58 SGCTSPKGANGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETKI 117 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ C +P +++ V + + + + +V Sbjct: 118 AAKMEECA-TPGFYFAVSPTEGIEAAMKALFLKIVS 152 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 60/222 (27%), Gaps = 39/222 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM TKM K A+ + L+ + + Sbjct: 319 NVVFVIDVSGSMF--------------------GTKMKQTKKAMHVILNDLHHDDYFNIV 358 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + + + YV + TD A+ A + Sbjct: 359 TFSDAVSVWKASGSIQATPPNIKSAKVYVNKM---EADGWTDINAALLVAASVFNQSTG- 414 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAKENFIKIVTISI 369 G IIFLTDGE ++ K I + ++ Sbjct: 415 -------ETGRGKGLKKIP-LIIFLTDGEATAGVTVASRILSNAKQSLKG-NISLFGLAF 465 Query: 370 NASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + + + + + + + L + I+ Sbjct: 466 GDDADYHLMRRLSLENRGVARRIYEDADATLQLKGFYDEIAS 507 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 73/210 (34%), Gaps = 37/210 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-R 277 ++ + A FL L V+ ++G++ + T + R Sbjct: 77 SGSMGGYDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDTTATIVNKLIQIKGSDER 131 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + L T +K A+Q++ + + ++ LT Sbjct: 132 NTLMAGLPTYALGGTSICSGIKYAFQVIGELHSQLDGSE----------------VVLLT 175 Query: 338 DGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DGE+N T C D+ K++ + I++ ++ + + +++ + H+ + Sbjct: 176 DGEDN-------TASSCIDEVKQSGAIVHFIALGSAAD-EAVIEMSNITGGSHFYASDEA 227 Query: 397 S---LIHVFQNISQ---LMVHRKYSVILKG 420 LI F ++ + + + KG Sbjct: 228 QNNGLIDAFGALTSGNTDLSQKSLQLESKG 257 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/326 (9%), Positives = 76/326 (23%), Gaps = 20/326 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINL 75 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 I+ + + S I + S I Sbjct: 76 SVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTT 135 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFL 192 +R N I V + + IP + P+ Sbjct: 136 SRTGIVNWFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAG 195 Query: 193 IELVVDL-------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + +G+ + +S + A + + Sbjct: 196 ATYNIGDWLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPG 255 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DS 294 D + + Y + + + D + + T Sbjct: 256 DKAATTNAYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASY 315 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQ 320 + S + R+ Sbjct: 316 QGDLITGINTGGSYNPSYYAAGADRR 341 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 82 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 136 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 137 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 179 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 180 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 231 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 232 IDAFGALSSGNGAVSQRSIQLESKG 256 >gi|320334211|ref|YP_004170922.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319755500|gb|ADV67257.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 609 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 62/236 (26%), Gaps = 44/236 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + R + LV+D SGSM + Sbjct: 15 GTPAPTDLLLRFREDVPNATRRPLNVALVIDRSGSMA--------------------GSP 54 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + A F+D + + ++ Y V+ ++ +K Sbjct: 55 LRYALKAAADFVDRLTETD------VLSIVVYDDDVDTLLDAQPVRDKA-AIKDLLKGVR 107 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + + ++ LTDG+ N+ ++ Sbjct: 108 AGGITNLSGG----------------WLRGCELVAGARRADAVNRVLLLTDGQANHGVTD 151 Query: 348 VN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 IK E + T+ +S L+ +S Y + + D VF Sbjct: 152 TGVLIKTAASKAEAGVSTTTLGFGSSFEEDLLIGMARASGGNFYFIQSMDDAADVF 207 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 53/178 (29%), Gaps = 23/178 (12%) Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ K+D + GL+ +++ + + + T + Sbjct: 30 KNFINKFVKSFMVSKDDSHFGLVLFSSDSNVEFKFDDHYDAASITAAVNATKYPGMGTYA 89 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + A L S + +I +TDG S+ + Sbjct: 90 GKGLTLAKDDLYS---------------APVRSGVPRILIVMTDGI-----SSDDVAGPA 129 Query: 355 DKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ ++I + I + + +L P++ + D L V Q I + Sbjct: 130 KALRDMGVEIFALGIGKNYDQGQLDSMGSDPKPDHVVTA-DFDKLDPVIQTIKDKACN 186 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 53/172 (30%), Gaps = 23/172 (13%) Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F++ K+D + GL+ +++ + + + + T + Sbjct: 250 KNFINKFVKSFMVSKDDSHFGLVLFSSDSNVEFKFDDHYDAASITAAVNATNYPGMGTYA 309 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + A L S + +I +TDG S+ + Sbjct: 310 GKGLTLAKDDLYS---------------APVRSGVPRILIVMTDGI-----SSDDVAGPA 349 Query: 355 DKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNI 405 ++ ++I + I + + +L P++ + D L V Q I Sbjct: 350 KALRDMGVEIFALGIGKNYDQGQLDSMGSDPKPDHVVTA-DFDKLDPVIQTI 400 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T Sbjct: 187 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGT---- 241 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + K ++ +I +TDGE+++ + + K+ + Sbjct: 242 ---------ETRTAYGIEFARSEAFQKGGRKGAKRVMIVITDGESHD---SPDLEKVIED 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 SEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDEAALKDIVDALG 349 Query: 407 QLMVHR 412 + + Sbjct: 350 ERIFSL 355 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 62/190 (32%), Gaps = 26/190 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K F+D + + + G + Y+ VE + + R Sbjct: 208 EQVKIFTKKFIDKLTDRYYRCDRNLVWNAGALHYSDSVELIQGLTRMPSGQKNLKDRVEA 267 Query: 286 SLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A+K+ + L K++I +TDG Sbjct: 268 VQYIGKGTHTDCAIKRGIEELLIGGSHQKEN---------------KYLIVVTDGHPLEG 312 Query: 345 KSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSLI 399 ++AK IK+ +I+I+ RL + +++ + + V + +++ Sbjct: 313 YKEPCGGLEDAVNEAKHLGIKVFSIAISPGHLEPRL--SIIATDHTYRRNFTVNSEENVD 370 Query: 400 HVFQNISQLM 409 + ++ Sbjct: 371 ESIDTMINMI 380 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 66/206 (32%), Gaps = 44/206 (21%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYT-TRVEKNIEPSW----GTEKVRQY 279 K+ ++ +D + V +G++ Y+ ++ ++ + S +++ Sbjct: 771 FEIAKDFIIKVIDRLSRDEQVMFEPGQSKVGVVQYSHSKTQELVSMSDVNIRNIRDLKEA 830 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + I T + A+ A L Q+ I +TDG Sbjct: 831 VKKLN--WIAGGTFTGEALDFASNTLG------------------RPDGNQRIAIVITDG 870 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTIS----INASPNGQRLLKT-CVS-SPEY----- 388 ++ + +C ++V + + N + L C + +P Sbjct: 871 RSDTKRDPSPLNALCQT----GAQVVAVGINDIFSKPSNQEGLSAVTCEAPAPGQKKGLS 926 Query: 389 HYNVVNADSLIHVF-QNISQLMVHRK 413 A+ L F +NI+ + K Sbjct: 927 ISKENYAELLDDAFLKNITSQICIEK 952 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 61/209 (29%), Gaps = 32/209 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +++ + K +K +K FL S D V + + + + Sbjct: 974 DSSTSVGSHNFKTSK-QFVKRLAERFL-SADKTDTADVRVSVVQYSGRNQQKLEAQFLQN 1031 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 ++ + I TD A++ L +K Sbjct: 1032 YTEIASIIDDME--FINDATDVNAAIRY-VTTLYQKSSPRG---------------VKKR 1073 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN---------GQRLLKTCV 383 ++ +DG + +A+ I+I + + N G+ Sbjct: 1074 LLLFSDGNSQGITGKAI-EAAVQEAQRAGIEIYVLVVGRHANEPNIRVLVTGKTTEYDVA 1132 Query: 384 SSPEYHYNVVNADSLIH--VFQNISQLMV 410 + + V + +L+ +Q +S+ + Sbjct: 1133 YGERHLFRVPDYHALLQGVFYQTVSRKIS 1161 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 59/186 (31%), Gaps = 27/186 (14%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 ++ + + +G++ Y V V+ V T Sbjct: 187 FLINILKKFYIGPGQIQVGVVQYGEDVVHEFHL-NDYRSVKDVVAAASHIEQRGGT---- 241 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + K ++ +I +TDGE+++ + + K+ + Sbjct: 242 ---------ETRTAYGIEFARSEAFQKGGRKGAKRVMIVITDGESHD---SPDLEKVIED 289 Query: 357 AKENFIKIVTISI-NA----SPNGQRLL---KTCVSSPE--YHYNVVNADSLIHVFQNIS 406 ++++ + +++ N + L K S P+ + +NV + +L + + Sbjct: 290 SEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDEAALKDIVDALG 349 Query: 407 QLMVHR 412 + + Sbjct: 350 ERIFSL 355 >gi|148655419|ref|YP_001275624.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567529|gb|ABQ89674.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 824 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/313 (10%), Positives = 79/313 (25%), Gaps = 81/313 (25%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 ++ + L + + +G + Sbjct: 399 LNRPVQFLLILDVSGSMSWTFDGR-------------------------GVQNGQVVTCT 433 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEK 265 N V+ + + ++ K L F+ ID + + ++ +T R+ Sbjct: 434 NPTQGCVSIETAWPNVQERRIYTAKQVLRRFVQQIDQDRRSGLRPHDTVRMVTFTGRLGN 493 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ--------ILTSDKKRSFFTNF 317 + + + D L T+ ++ A + F Sbjct: 494 YVNNEGRVGDNNRALNDLTDVLPAGWTNDRATLEAAINEAGMVDGDPYMTAGATPSAVAF 553 Query: 318 FRQG--------VKIPSLPFQKFIIFLTDGENN--------------------------- 342 R + +++ +IF+TDG N Sbjct: 554 ARASQVFAAAPERAPNGMKYRRVVIFVTDGVANVLRNGMQNNYGPGCELGAENVGCQMGD 613 Query: 343 -----NFKSNVNTIKICDKAKENFIK-----IVTISINASPNGQRLLKTCVSSPEYHYNV 392 + + + KE +I+ + +++ + L S P+Y Sbjct: 614 PLPDGSLRPLNAMVAEAQALKEAYIRPSDGSVYVVAL-SPTFETTGLNLVASQPDYVKRA 672 Query: 393 VNADSLIHVFQNI 405 D L +F +I Sbjct: 673 DMPDQLQQIFDDI 685 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 12/161 (7%), Positives = 38/161 (23%), Gaps = 25/161 (15%) Query: 13 KGIASEKA-NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + N +I + + + + + + + ++ ++ NAA LAG ++++S Sbjct: 4 RFYHKTEGQNVVLIAGILAL-LIGMAALAVDLGVTYAEQRNIVRGTNAASLAGMNRLISG 62 Query: 72 LSRLGDRFESISNHAKRAL-----------------------IDDAKRFIKNHIKESLSG 108 + + + + S Sbjct: 63 GRDTDVARAIYESLRSNGIPVTPPGEAPQPGDRSFEAIYLGSDGAPIPGACSRVGACGSQ 122 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 +I+ ++L Sbjct: 123 RPEGVKYIQIRLDGEVETYFARLFGQSQLPVGATAYASVGA 163 >gi|330469792|ref|YP_004407535.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812763|gb|AEB46935.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 565 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 70/219 (31%), Gaps = 32/219 (14%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + ++D+SGSM + +++ A + L LF DS + V Sbjct: 368 MLCIIDVSGSM--------KKAVPTANGATRQQVTAEAARRGLNLFDDSWSIGLWVFSTR 419 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 G Y +V S + + + S T + AY+ + + + Sbjct: 420 LDGSRDYR-QVVPTGPLSRQRSTLERSLDTITSS--SGDTGLYDTLLAAYKDVQQNWEPG 476 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK----ENFIKIVTIS 368 + I+ TDG+N + + + + + I+++ I Sbjct: 477 KVNS----------------IVLFTDGKNEDADGISQRQLLAELKRIKDPDQPIQVIIIG 520 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNIS 406 I + L S+ + + + +F + I+ Sbjct: 521 IGTEVSKAELDTIAQSAGGGAFVAADPTKIGDIFLRAIA 559 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 48/143 (33%), Gaps = 24/143 (16%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 GT + S TD + A + Sbjct: 746 LGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKFS------------------NDSK 787 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSP 386 K I+ ++DG S K+ ++AK+ +K+ T+S+ + N L++ + Sbjct: 788 TSKAIVVVSDG----KTSKSKMTKVINEAKKQGVKVYTVSMGKKSQVNDATLMQLSSETS 843 Query: 387 EYHYNVVNADSLIHVFQNISQLM 409 +++ ++ L VFQ + + Sbjct: 844 GAYFHAIDNMQLHQVFQKLIDTI 866 >gi|218661390|ref|ZP_03517320.1| von Willebrand factor type A [Rhizobium etli IE4771] Length = 370 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 63/193 (32%), Gaps = 26/193 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K+ LK++ L ++ + + ++ Y +EP+ +EK + ++ Sbjct: 18 DKLPLLKSSFRLLVNRLKADD------TVAIVTYAGNAGTVLEPTRVSEKSK-ILSAIDR 70 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF- 344 T ++ AY + + ++ TDG+ N Sbjct: 71 LEAGGSTGGAEGIEAAYDLAQKAFVKDGVN----------------RVMLATDGDFNVGP 114 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S+ + +I ++ ++ I + + L++T + + D+L + Sbjct: 115 SSDEDLKRIIEEKRKEGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKT 172 Query: 405 ISQLMVHRKYSVI 417 + + + Sbjct: 173 LVDEAGSTLFPIA 185 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/375 (9%), Positives = 87/375 (23%), Gaps = 28/375 (7%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + + +SA + + + + + L+ + I Sbjct: 98 QYSQAVSRGQSAGIVSSVGRTLEEFKTSVTVAAHKKVTFELTYEELLKRRLGKYELQIHA 157 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY--DYRLQFIE 162 F N M + + +N + + + Sbjct: 158 RPMQPVKDFKVDVYINENAGINSFMVKGGLSTKELANAVTTTQAEKEAWVHFYPTVDQQK 217 Query: 163 HLLN-QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + I + V Sbjct: 218 SCDSCDDKGMNGDLIVVYDVNRNTSLGDIQRSPGYFVHHFAPSNLQRIPKNVVFIIDQSG 277 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ + AL+ L+ + + I + + E E + + Sbjct: 278 SMHGRKIEQTRTALIHILNDLAEDDFFGLLTFDSNI-FQWKRELVQATKANLESAKTFAR 336 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T+ A+ + +L P +I LTDG+ Sbjct: 337 NI---RANGATNINAAVLKGSSMLN----------------AHPREGSASILILLTDGDP 377 Query: 342 NNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD- 396 ++N I+ A + + N + L K + + Y +AD Sbjct: 378 TTGETNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEKMSLQNNGVARRIYEDSDADL 437 Query: 397 SLIHVFQNISQLMVH 411 L ++ ++ ++ Sbjct: 438 QLKGFYEEVATPLLT 452 >gi|332298718|ref|YP_004440640.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181821|gb|AEE17509.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 566 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 55/192 (28%), Gaps = 35/192 (18%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + V D+S SM S + A K +++ A+K LD ++ Sbjct: 83 RTGSSVSFVFDVSRSMTAKDVS-----SGAGAAVSKPVSRLDAVKLYAAALLDRMEGTEV 137 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILT 306 + + + VR + +L + ++ A Sbjct: 138 SAVIAKGSGV-------VAVPLTSDLNAVRSLLPSLSPALLTSTGSSIGSGIEAAVNSFP 190 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + I+ TDGE + + + + D K I +V Sbjct: 191 PLSAAA------------------RTIVVFTDGEETDGRMSG---AVADALK-FGIPVVF 228 Query: 367 ISINASPNGQRL 378 + + + L Sbjct: 229 VGFGSDTESEIL 240 >gi|145551564|ref|XP_001461459.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429293|emb|CAK94086.1| unnamed protein product [Paramecium tetraurelia] Length = 610 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/221 (10%), Positives = 71/221 (32%), Gaps = 50/221 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +R + VVD+SGSM+ K+ ++N+L + Sbjct: 124 QRANIDLMCVVDVSGSMN--------------------GEKIKLVQNSLRYIQKILKPTD 163 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQIL 305 + L+ + T+ N++ + + ++ + + + + T+ + +++ Sbjct: 164 ------RLALVTFGTQAGINLQWTRNIAENKKKIKKAIKDIKIRDSTNIASGVALGLRMI 217 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--- 362 K ++ T+ + L+DG +++ +++ + + I Sbjct: 218 RDRKFKNPVTS----------------MFVLSDGVDDDRGADLRCQQA---LHQYNIQDT 258 Query: 363 -KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I T + + + + + + F Sbjct: 259 LTINTFGYGSDHDAKVMNNIANLKGGQFVYIDQIQRVSEHF 299 >gi|321478616|gb|EFX89573.1| hypothetical protein DAPPUDRAFT_310509 [Daphnia pulex] Length = 999 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 55/188 (29%), Gaps = 31/188 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++ L ++ ++ + + +G+I ++T + + R Sbjct: 329 VSSSMNSFNRIGRLYDSARRWIKY-----DIPDGTSLGIIKFSTAATLLSGMTVVNDATR 383 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q + + + T ++ A IL I+ +T Sbjct: 384 QQLMNKIPNTAAGSTCIGCGLQIAIDILKPSGNGGI-------------------IVLVT 424 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG N I++ + +++V+I+ + + + + + S Sbjct: 425 DGVENVHPFI---IEMTPQLIAAQVQVVSIAFGR-AAENEIEVLATKTNGKSFFIDDEGS 480 Query: 398 ---LIHVF 402 L F Sbjct: 481 SDPLNDAF 488 >gi|309774616|ref|ZP_07669641.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] gi|308917647|gb|EFP63362.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] Length = 613 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/224 (11%), Positives = 53/224 (23%), Gaps = 32/224 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV- 276 ++ KN F+ + ++ + T + + + Sbjct: 53 TSGSMGDYQRLINAKNTAKEFVKKYAGDEPDSGRY-LAIVNFATNTNIILNWTDVSSADG 111 Query: 277 -RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF--- 332 T+ +KQA Q+ + + + K Sbjct: 112 KASADNAINQLYADGGTNLHAGIKQASQLFDDTAIQDIESKNTIVLTDGAPTYYLKNCTS 171 Query: 333 ----------------IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA----- 371 DG+ + N T K + TI A Sbjct: 172 DINCIFYTHVTISNTRYHVGGDGDKGSETINDATAAEAKTLKGKS-TVYTICYGASREMT 230 Query: 372 ----SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L S + Y+ ++D L+ F+ I++ + Sbjct: 231 YQGGPTVSAYLKNNIASEMKNAYDADDSDDLVDAFKAITETITS 274 >gi|149921342|ref|ZP_01909796.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] gi|149817775|gb|EDM77239.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] Length = 739 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 34/128 (26%), Gaps = 14/128 (10%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + ++ + L + Q I +TDG+ + + ++ Sbjct: 438 GSGTFTHRGLELVFDNLLDYQANPPALYPA-------DEDTQYVNILITDGQYSTYSTDA 490 Query: 349 NTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVS--SPEYH--YNVVNADSLIHV 401 + + K I N Q L + S ++ + L Sbjct: 491 QVQNALEALLDAGSKTYVIGFGDGLNTTEAQLQLMNMATWGSGGTEMPFDADSQAELELA 550 Query: 402 FQNISQLM 409 I + + Sbjct: 551 LGAIIEDI 558 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 22/261 (8%), Positives = 57/261 (21%), Gaps = 18/261 (6%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 ++ L + + + + + + + ++ M + + Sbjct: 35 KPNFLIILDYSSSMSEPFGMGQTRWEVAVDAVTTLMTTNGGFFQKNMHVAL-MRYGHDPE 93 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P++ + L +D L D Y I S Sbjct: 94 PDNPGTPIPGD-------------LSGLIDG-QSLDVGWYDPDAADKTYFDCNGAEIIQS 139 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + T + AM +++ K + Sbjct: 140 INATPPPLCPGGPGNCSGIG-TWTKGAMDLGADLISQSKADHPEDTVPGDERFYGIMVVT 198 Query: 331 KFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EY 388 G + S I + I +++ + N + + + Sbjct: 199 DGAWTPAQGFPTLSPPSENPAITAAQLFDQQDIPTYVVAVADAVNLPFADELAEAGGTDE 258 Query: 389 HYNVVNADSLIHVFQNISQLM 409 N L + Q + Sbjct: 259 SIAAENPAELTQAIGAVVQDI 279 >gi|325287696|ref|YP_004263486.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323150|gb|ADY30615.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 366 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 62/192 (32%), Gaps = 19/192 (9%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY---TTRVEKNIEPSWGTEKVRQYVTR 282 + + LK A + F++++ K D + I + ++ + + +++ + Sbjct: 139 SSLDELKAASISFINNVMPEEEDKSDSFTMSIHWFDGEDKLHELNPLTSSRDELVSAIES 198 Query: 283 DMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 ++ TD A ++ + + + + + ++ TDG + Sbjct: 199 ITSTISSDASTDLYGAAIRSTDLASKYLEDNTAKDVIGAA----------SVVLFTDGTD 248 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIH 400 + + I +I + + + + L + + N + L Sbjct: 249 QASRYTKAAALKKVTDADPNISFFSIGLGSEIDAEILKEL----GKTFSVFATNKEELET 304 Query: 401 VFQNISQLMVHR 412 F IS + R Sbjct: 305 TFNEISTKISER 316 >gi|297667862|ref|XP_002812182.1| PREDICTED: vitrin-like isoform 3 [Pongo abelii] Length = 657 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 52/184 (28%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 490 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 549 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K +K +I +TDG + + +V Sbjct: 550 GTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVRI 590 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 591 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQN 646 Query: 409 MVHR 412 + Sbjct: 647 ICTE 650 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 19/217 (8%), Positives = 52/217 (23%), Gaps = 27/217 (12%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + S + + + + D + K Sbjct: 226 MDLLSTATYTSSQNRPGADPGIQRQAPSGAAFQKPVGADVSLDCKIDLSFLIDGSTSIGK 285 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVT 281 +R ++ L L MG++ Y + + ++ + Sbjct: 286 RRFRIQK-----QLLAAVAQALDIGPAGPLMGVVQYGDSPATHFNLKTHMNSRDLKTAIE 340 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + ++ A+ + S + ++ + DG Sbjct: 341 KITQR--GGLSNVGRAISFVTKNFFSKANGN-------------RSSAPNVVVVMVDGWP 385 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + A+E+ I I I+I + ++ Sbjct: 386 TD-----KVEEASRLARESGINIFFITIEGAAENEKQ 417 >gi|198435588|ref|XP_002122129.1| PREDICTED: similar to von Willebrand factor precursor (vWF) [Ciona intestinalis] Length = 3684 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/347 (9%), Positives = 77/347 (22%), Gaps = 30/347 (8%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + A I + + + S T V + + Sbjct: 136 VTTTTIPSATGVTTTTTIPSVPGATTTFSSVPGVTTTTIPSVPGATTIPSVPGVTTTTVP 195 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM-GERPIFLIELVVDL 199 + + V + + G I + V Sbjct: 196 FLVQQQQQFPLFLVSTTTTI-PSVPGATTTFSSVPGVTTTTIPSVPGATTIPSVPGVTTT 254 Query: 200 SGSM--------------HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + H + V+ + + + + I Sbjct: 255 TVPFLPGLITTPPLLPQPHTTLGLTGCTVDLVLVVDSSYSIGIDGFITLKDILGNLIKKF 314 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQI 304 ++ + L+ Y+ R + S + + +L+ T + A+K Q Sbjct: 315 DVPNDETRVSLVQYSKRSQVEWLLSTYPGNLDGMLHTIAGMQMLQGVTYTYHALKLVLQT 374 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIK 363 + I+ +TDG + + + Sbjct: 375 VIDGSDSGR------------RPDVPFVIVLITDGRAKDEDIREEVLNKIQVVHDKWNET 422 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I+I L+ S +Y Y V + + + ++ + Sbjct: 423 FHLITIGVGSVDPHQLEGIASHDDYVYTVEDMNEASSLVDEVAASIC 469 >gi|149922101|ref|ZP_01910541.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] gi|149817038|gb|EDM76520.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] Length = 350 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 51/178 (28%), Gaps = 18/178 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQ 300 V + ++ K + + T+ A+ Q Sbjct: 124 FTSEVESNNKVAIYAFDGEEKIHKISDFTGSEGAAEARANSLAGFQPKDKSTNLNGAIVQ 183 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 ++L +++ + ++ TDG + + + ++ K+ Sbjct: 184 GLEVLDEGLEKAK------------NPLRLGTLVVFTDGTDRASRVSE--DEMLSAVKDT 229 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 ++ I + A + L K S ++ + F++I + + S L Sbjct: 230 PYEVFAIGLGAELSEGDLQKIGKSGTAL---AKDSGEIQTAFESIGERIEKLTRSYYL 284 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|260062899|ref|YP_003195979.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] gi|88784467|gb|EAR15637.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] Length = 378 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/245 (12%), Positives = 72/245 (29%), Gaps = 22/245 (8%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK-----MAA 230 + + E+ S ++ + + + + + Sbjct: 99 AGLTADQFTIYEQGRNDDCFSTISSSESQSRISPNSQIFQNHTLLVLDLSNSVLSGSLTE 158 Query: 231 LKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLI 288 LK+A F+D++ + + + + + E++ V D Sbjct: 159 LKSASASFIDNVMPAVPAESFQMAIYWFDGEDVLHELNPLTSSREELIAAVESIDSDFSN 218 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A+ ++ + T + S N I+ TDG + + + Sbjct: 219 DPSTDLYGAVIKSTDLATDLLRDSEQNNTIGAA----------SIVLFTDGTDQASRYSE 268 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQ 407 + + + I TI + A + + L + + N + L F +SQ Sbjct: 269 SQALDKVEKANSNISFFTIGLGAEIDSEVLEEI----GKTFSVFAGNKEELEVTFNQLSQ 324 Query: 408 LMVHR 412 + R Sbjct: 325 KVSER 329 >gi|225551939|ref|ZP_03772879.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] gi|225370937|gb|EEH00367.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] Length = 332 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 68/219 (31%), Gaps = 50/219 (22%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++ K + F+ +E+ +GL+ + + + E + + Sbjct: 109 SSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASIVVPITTDREFFNKKLDD 161 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + A L S ++ I+ LTDG N Sbjct: 162 IYIMDLGNGSALGLGISIALSHL------------------KHSEALKRSIVVLTDGVVN 203 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP----------------------NGQRLLK 380 + ++ + A+ +KI ++ I +S + L++ Sbjct: 204 --SDEIYKDQVINLAQGLNVKIYSVGIGSSEEFSVEFKLRSGKFYQGSFKEVYDPSMLVE 261 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 + Y+V + S Q+ S + + RK + + Sbjct: 262 ISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 300 >gi|29826731|ref|NP_821365.1| hypothetical protein SAV_191 [Streptomyces avermitilis MA-4680] gi|29603827|dbj|BAC67900.1| putative membrane protein [Streptomyces avermitilis MA-4680] Length = 150 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 34/134 (25%), Gaps = 4/134 (2%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ ++ A+ V++ + +IG + + A AG + + G Sbjct: 14 DDRGGVTVFVAVCVLALIGIIGVAVDGGSKMRATERADYIAGEAARAGGQAIDPAEAISG 73 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 A+ A + S + + ++ + Sbjct: 74 TAIVVDLQDAQAAAQAYLRSAGA----TGTVSVSGDGKTLNVNVTGSYDTKFLSVVGIGS 129 Query: 137 LDSSNNTIFYNMDV 150 L + + Sbjct: 130 LSVTGQGKATLLHG 143 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 51/185 (27%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 425 SNFRLMLEFVSNIAKTFEISDIGAKIAAVQFTYDQRMEFSFTDYNTKENVLAVIRNIRYM 484 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 485 SGGTATGDAISFTVRNVF---------------GPIRESPNKNFLVIVTDGQSYDD---- 525 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A + I I ++ + A + LK S P + + L + ++ Sbjct: 526 -VQGPAAAAHDAGITIFSVGVAWAPLDD---LKDMASKPKESHAFFTREFTGLEPIVSDV 581 Query: 406 SQLMV 410 + + Sbjct: 582 IRGIC 586 >gi|229588184|ref|YP_002870303.1| hypothetical protein PFLU0636 [Pseudomonas fluorescens SBW25] gi|229360050|emb|CAY46904.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 651 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 17/200 (8%), Positives = 52/200 (26%), Gaps = 2/200 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + ++ ++ A ++ L+ + ++ + +K ++S + + L A + Sbjct: 4 RLRSRQRGAIGLMAAGTLAVALVFMLLVVDSGRLYMEKRKLQSIADTSALEAAGRGGLCS 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + N + A + + + + Sbjct: 64 PTTTANDYAKENATRNGFTVVAGDNSRGLVVTCGLLTTNANSVRVFTPDATKNDAVRVVA 123 Query: 133 ANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + S I S L ++ +L + P+ Sbjct: 124 TRSVMTSIATGIWSMFSGAPVSSQMMLSATAV-AAYAPPVAQLTIRSSLAAVNSANSPLL 182 Query: 192 LIELVVDLSGSMHCAMNSDP 211 + + L GS+ Sbjct: 183 NVVIGNLLGGSLSLTAAGWN 202 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 64/216 (29%), Gaps = 34/216 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +ELV+D+SGSM ++M+A K A LD++ Sbjct: 34 TKESPKVELVLDVSGSMRAKDIDGK--------------SRMSAAKQAFNEVLDAVPEEV 79 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + R +Q + P D T A K A L Sbjct: 80 RLGIRTLGADYPGQDR--------------KQGCKDTRQLYPVGPLDRTEA-KTAVASLA 124 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KI 364 + + I+ +TDGE+ + ++ + I I Sbjct: 125 PTGWTPIGPALLGAAKDLEGGDATRRIVLITDGEDTCAPLDPC--QVAREIAAKGIHLVI 182 Query: 365 VTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 T+ + ++ L ++ + +V + L Sbjct: 183 DTLGLVPDAKTRQQLTCIAEATGGTYTSVHRTEDLS 218 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 67/216 (31%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L++D+SGSM +++MAA K A LD+ Sbjct: 37 RDAPKVNLLLDVSGSMRARD--------------IDGQSRMAAAKQAFNEVLDATPKEVE 82 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R ++ + P D T A K A LT Sbjct: 83 LGIRTLGANYAGDDR--------------KEGCKDTAQLYPVGPLDRTEA-KAAVATLTP 127 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + + K I+ ++DGE+ + ++ + I I Sbjct: 128 TGWTPIGPSLLKAADDLEGGNGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGLTID 185 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + R L ++ + +V + D L Sbjct: 186 TLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTD 221 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 56/210 (26%), Gaps = 51/210 (24%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +D+SGSM K+ L + + + + + Sbjct: 282 ADFVFALDVSGSMQ---------------------GKLHTLASGVKKAIGQLKPEDRFR- 319 Query: 251 DVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ + S +R+ R T+ + A + L +D Sbjct: 320 -----VVAFNNTAFDLNRGWVSATEANLRETFARLDQLNSNGGTNVYAGVHLALERLDAD 374 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +I +TDG N + + ++ Sbjct: 375 RVA--------------------TLILVTDGVTNQGIVD--PKAFYKLMHKQDLRFYGFL 412 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + S N + C +S + V N+D + Sbjct: 413 LGNSSNWPLMQLMCDASGGSYRAVSNSDDI 442 >gi|224531962|ref|ZP_03672594.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] gi|224511427|gb|EEF81833.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] Length = 333 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/284 (12%), Positives = 82/284 (28%), Gaps = 66/284 (23%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 F L + + + + I +V+D+S SM Sbjct: 59 FTYSFLYLSAMVMVFALAGPSVSKKKMIHLSAGADIVIVLDISPSMGAVEF--------- 109 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + ++ K + F+ +E+ +GL+ + + + + Sbjct: 110 -----SSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASIVVPITTDRDFFN 157 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + + + A L S ++ II LT Sbjct: 158 KKLDDIYIMDLGNGSALGLGISIALSHL------------------KHSEAPKRSIIVLT 199 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINA----------------------SPNG 375 DG N + ++ + A+ +KI +I I + + Sbjct: 200 DGVVN--SDEIYKDQVINLAQGLNVKIYSIGIGSFEEFGVEFKLRSGKFYQGSLKEVYDP 257 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHRKYSVIL 418 L++ + Y+V + S Q+ S + + RK + + Sbjct: 258 SMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAV 301 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 63/221 (28%), Gaps = 52/221 (23%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + VVD SGSM + +K + + + Sbjct: 710 SEVMFVVDHSGSMKARDAKNYT---------------ANKVKQTIKQIGANPSHVYRFNN 754 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 V + R T+ A++ A Sbjct: 755 RPNHEATDKADIVSSIDSLLTYKNENRS-------------TNIVKALETAI-------- 793 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 G + K I+ +TDG + ++ ++ AK I I T+S+ Sbjct: 794 ----------GNFTTNQYTSKAIVLVTDG----YSNSNGLEQVLRDAKLKGIAIHTVSVG 839 Query: 371 ASPN-GQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLM 409 + ++LLK S + + N+ + ++L Q I + Sbjct: 840 SYTTVNEKLLKDISSETNGTYQNITSIENLHGSLQAIITTI 880 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/305 (7%), Positives = 75/305 (24%), Gaps = 58/305 (19%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 S + + + + + + + I+ ++ +SL SA +I Sbjct: 51 STEGDEGQKDDSYEDLLDESVRTSESNKLETVIQTNLNQSLFVTSASNVKPQIDFTYLGI 110 Query: 126 RISMTHMANNRL---DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + N +N+ L + + + + R Sbjct: 111 TPINPMQGQEFTVRYKLTPNPFQHNISKPKEIVLVLDGSGSMSGTKLTNLKNAAKDFIDR 170 Query: 183 I-EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + + ++ + TK+ + + + + Sbjct: 171 LKGVDNLKVAIVVF----------------SSNATINPISVSGTTKIKSTDKSSESSIPN 214 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 L + ++ + T++ +++A Sbjct: 215 YKTLQNEYF------------------LDINDSRLITMINNID---AQGGTNTGDGLRKA 253 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKICDKAKEN 360 +L+ K II ++DG + + + + + K++ Sbjct: 254 EYLLSQ----------------KGDSVANKTIILMSDGLPTYYSGSTESGVNYYKEIKDD 297 Query: 361 FIKIV 365 + I Sbjct: 298 VVGIF 302 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 49/190 (25%), Gaps = 34/190 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---------R 277 KM A K A+ +LD I + + E + S G+ +V Sbjct: 146 KMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKE 202 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q+ T A++ + ++ Sbjct: 203 QFNAALSKFGPKGWTPLASAIESVNDDFKE----------------YTGEENLNVVYIVS 246 Query: 338 DGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVN 394 DGE + ++ + I + + Q+ LK + + V + Sbjct: 247 DGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYATVSS 303 Query: 395 ADSLIHVFQN 404 AD L Sbjct: 304 ADELHQTLNK 313 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 5 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 63 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 64 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 123 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 124 YFFLGPSRTVSATSTAKATNVD 145 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 66/160 (41%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + +D K T++ K L + + K + G++ +T++ + Sbjct: 98 DVSDSMKARDLKPTRLENAKTMLKMLI-------SAKGEQRTGIVAFTSKAYTQCPITND 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E ++ +V + + L K T PA+++A ++L+ P +K Sbjct: 151 VEALKYFVNQLRPEMLNAKGTALAPAVQRAAEMLSK-------------------YPGKK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +I LTDGE++ + A++ IKI+ + I Sbjct: 192 ALILLTDGEDH---EPEQIEEAIKTAQKEGIKIIAVGIGT 228 >gi|282899257|ref|ZP_06307228.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] gi|281195826|gb|EFA70752.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] Length = 431 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 58/188 (30%), Gaps = 12/188 (6%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I +++D SGSM+ + + + A K + D+ + Sbjct: 106 PPAWIIVLLDFSGSMNQTDSGGSKKITGAINAIG------QFAKVSSERGGDTQISIVPF 159 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 E + + S K++ + T+ +K++ Q L Sbjct: 160 GEAGKNCPEYPVDKDTLDKFISASDFKLQNSLEYLSTLNPCGSTNLYQPLKKSLQFL--- 216 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-NFIKIVTI 367 + F + II L+DG +N + ++ K+ I + T+ Sbjct: 217 --GNPEDPRFTLPENSSDPQPRLSIILLSDGYHNAMNEYRDFSELKSLLKDYENITVHTL 274 Query: 368 SINASPNG 375 +P Sbjct: 275 GYGLTPEQ 282 >gi|148694465|gb|EDL26412.1| procollagen, type XII, alpha 1, isoform CRA_a [Mus musculus] Length = 1722 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 1175 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1234 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1235 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1277 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1278 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1330 Query: 406 SQLMVH 411 ++ Sbjct: 1331 EDNLIT 1336 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 61/182 (33%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 112 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSKIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 47/142 (33%), Gaps = 6/142 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 20 IARERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + SN A + + + Y V +++ Sbjct: 79 PNAAAAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVP 138 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 139 YFFLGPSRTVSATSTAKATNVD 160 >gi|99081992|ref|YP_614146.1| TadE-like [Ruegeria sp. TM1040] gi|99038272|gb|ABF64884.1| TadE-like protein [Ruegeria sp. TM1040] Length = 182 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 8/54 (14%), Positives = 25/54 (46%), Gaps = 1/54 (1%) Query: 6 RFRFY-FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 RF ++ E+ N ++ FA+ L+++ + + +++ ++ A + Sbjct: 5 RFISKPLRQFRRDEEGNATVEFAMLFPLMLMVLFASVELGMITFRQIMLDRAMD 58 >gi|326670666|ref|XP_003199265.1| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 1455 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/355 (11%), Positives = 108/355 (30%), Gaps = 32/355 (9%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A+++ K+ S + ++ + I++ + + L ++ + + Sbjct: 281 ALISALVKIKPKPSVEINIGAALDFVRTNMFRAERGSRIQDGVPQLLLLLTSK----KSK 336 Query: 120 NIVNSSRISMTHMANNRLDSSNNT--IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V ++ M L + ++ + + + R ++I+S + Sbjct: 337 DSVQQPAEALRQMGVLTLAAGTRAAQEAELKEIAFDDSVVFMYKDFRVLSRNPKEIISPL 396 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 L + + E P +V + + + + +K+ + Sbjct: 397 STLSGVVVTETPTETGPVVEVTTVHTQKVVRDIVF----LVDGSNYVGNNLRFVKDFING 452 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTP 296 ++ + + V +GL+ + + K SL+ ++ Sbjct: 453 VVNKL---DVRPDRVRIGLMQFAEGQRTEFYLNSHNTKQNVLSAIAQLSLMGGRALNTGA 509 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+K A +F Q+ ++ +T G + + +I D+ Sbjct: 510 ALKYAL-----------DNHFQTSSGSRRRQGVQQVLVLITGGPSQDD-----AKRIADR 553 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + T ++ A LKT + Y N L V + I ++ Sbjct: 554 LALEGVL--TFAVGAGQVDLTFLKTAAFVEDLAYYRRNFADLSSVVEKIMTPLIT 606 >gi|194220813|ref|XP_001500011.2| PREDICTED: vitrin [Equus caballus] Length = 662 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 50/194 (25%), Gaps = 27/194 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + + L + D +G + YT + Sbjct: 486 GSSSVGTSNFRTVLQFVANLSKEFEISDTDTRIGAVQYTYEQRLEFGF-DDYNTKPDILN 544 Query: 282 RDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + A+ A + L K +K +I +TDG Sbjct: 545 AIKRVGYWSGGTSTGAAINYALEQLF----------------KKSKPNKRKLMILITDGR 588 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + + + A + I + + Q L+ + P ++ + V D+L Sbjct: 589 SYDDVRIPAMV-----AHHKGVITYAIGV--AWAAQDELEVIATHPAKDHSFFVDEFDNL 641 Query: 399 IHVFQNISQLMVHR 412 I Q + Sbjct: 642 YKSVPKIIQNICTE 655 >gi|317122731|ref|YP_004102734.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592711|gb|ADU52007.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 1122 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 42/174 (24%), Gaps = 30/174 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ A + +G+I + T+ + V S Sbjct: 471 KLEMAVEAARRVAQLLTPAD------RLGVILFDTQAYVTRPIEPVDSAGQ--VEAAFPS 522 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T ++ A ++ + + +I LTDG Sbjct: 523 VAGGGTSLGVGLEAALHLMKDVRADV------------------RHVIALTDGV----SE 560 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + E I + ++I A + L Y + + Sbjct: 561 PFDVAGTARAFHEQGITVSAVAIGADADTTTLGWLAQEGGGQLYVAADPGQIPS 614 >gi|297183644|gb|ADI19770.1| hypothetical protein [uncultured gamma proteobacterium EB000_37F04] Length = 1181 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/233 (9%), Positives = 55/233 (23%), Gaps = 46/233 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYV 280 R ++ ++ ++ +++ + + + R + Sbjct: 254 TSRQRIDIARDTIISLINATPGVDFGLQVFNFNSSQGNDDHGGRIIAGVREMSAANRAAL 313 Query: 281 -TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI------ 333 T + T ++ +AY+ + F S+ Sbjct: 314 VTTINELTATTWTPLCESLFEAYRYFSGGGVLGGFNGGSLAPPADTSIMNNNRYQSPMRS 373 Query: 334 -------IFLTDGENNNFKS------------------NVNTIKICDKAK---------- 358 + +TDG + + + + Sbjct: 374 CQKQSYLVVITDGVPFYDNDYDSLLRSELALKTGDRFDDSYLPGVAEWMQTRDVNPNLLG 433 Query: 359 ENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I TI A+ L +T +Y +A +L Q I + Sbjct: 434 QQNIVTYTIGFSQGANDAADLLAETATRGGGQYYAASDALALQGSLQQIFSEI 486 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 42/154 (27%), Gaps = 1/154 (0%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 F++ + S N + AL + + +G I ++ ++S + A +A A+ Sbjct: 5 TTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAAD 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + S + F S + + + + V I Sbjct: 65 VSSAEEAVLAHFRSNDLGFAVSTPSGLMTVDGKLLPNDVETAAKGVATIVRGRYVPDPAI 124 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 R + + ++ F Sbjct: 125 DAGQ-RFIRGALPADAVQVTLEKKGDIYLAALFS 157 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 64/205 (31%), Gaps = 30/205 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K + + + L+ Y+ ++ + + Q D Sbjct: 163 QNDFNQMKGFVQAVMGQFEGTD-----TLFALMQYSNLLKIHFTFTQFRTSPSQQSLVDP 217 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + + L +K +I +TDG+ + Sbjct: 218 IVQLKGLTFTATGILTVVTQLF-------------HHKNGARKSAKKILIVITDGQK--Y 262 Query: 345 KSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 K + + +A++ I I + P ++ L T S+P ++ + V N +L Sbjct: 263 KDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHVFKVDNFAALG 322 Query: 400 HVFQNISQLM-----VHRKYSVILK 419 + + + + + + S + Sbjct: 323 SIQKQLQEKIYAVEGTQSRASSSFQ 347 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 50/158 (31%), Gaps = 25/158 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V D + G + R + + + R PAM A + L Sbjct: 492 VHWDDFTGRDNWLWRDGAGKGLVRIGGQQKSMLGRLDRMAPGDMPQFEPAMTMALKDLKP 551 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + K +I ++DG+ + + I ++ K+ IKI T+ Sbjct: 552 ------------------NPASVKHMIIISDGDP-----SPPSGTILNQYKQAGIKITTV 588 Query: 368 SINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ L ++ +Y N +L +FQ Sbjct: 589 AVGTHGPAGSTPLQNIANATGGKYYVATNPKALPRIFQ 626 >gi|326674787|ref|XP_003200204.1| PREDICTED: integrin alpha-10-like [Danio rerio] Length = 1170 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 27/188 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPTDS 294 + + E + +G++ Y T +++ + T + Sbjct: 184 NFLSNILSKFHISPEQMQVGVLQYGEISVHEWSLRDYQTTADVVEAAKNISRQEGRETRT 243 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + K +I +TDGE+++ + + C Sbjct: 244 AYAIQMA-------------CTEAFSPDRGAREGATKVMIVVTDGESHDGEDLPEALIEC 290 Query: 355 DKAKENFIKIVTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQN 404 + + I I++ +K S P+ Y +NV + +L + Sbjct: 291 E---KRNITRYAIAVLGHYIRRQQDPETFINEIKYISSDPDEKYFFNVTDEAALNDIVDA 347 Query: 405 ISQLMVHR 412 + + Sbjct: 348 LGDRIFSL 355 >gi|221109526|ref|XP_002169886.1| PREDICTED: similar to coagulation factor C homolog, cochlin [Hydra magnipapillata] Length = 336 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 57/168 (33%), Gaps = 19/168 (11%) Query: 236 LLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPT 292 FL + +G+I ++ R E +++ + + V + L+ T Sbjct: 37 KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIP--LMNFTT 94 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A++ A + + + K II LTDG + + Sbjct: 95 RIDRALRLAQKDMFTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPER 141 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I D+ + + + I+ + I ++ N + L + Y DSL Sbjct: 142 IADELRNDGVVILGVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 188 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/323 (10%), Positives = 66/323 (20%), Gaps = 24/323 (7%) Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF-------Y 146 F + G S N + + SS++ N + Sbjct: 15 FFSFYLGDATNIVFGESNDSNNATLDFTITSSQLEYAKPPNGDAQGRLDVTLIPKGRVDN 74 Query: 147 NMDVMTSYDYRLQFIEHL---LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 + + + Q + S + LI D+ Sbjct: 75 IVRPPIDVVFVFDVSGSMTPLKLQSAKYALQSAVDYFKANANPNDRFALIPFSSDVQY-- 132 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 V P + + + N TR Sbjct: 133 --------NKVVPFPTGAYDVKQHLERIANVANDLRAYGGTNYTQSLQQAQSFFNDPTRK 184 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + I + G V + + + L + + Sbjct: 185 KYIIFLTDGMPTVSIAKEPITYKVCEGILFWRTCKQVT-EDLDVQYILYSNGITAARTIY 243 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKT 381 P P K T K ++ I N I + +I + L K Sbjct: 244 YPDEPETKTYWDRTKYREFEEKIRLHGTNIAKTLGMNNIVLYSIGFGNYQEVDMGYLEKL 303 Query: 382 CVSSPEYHYNVVNADSLIHVFQN 404 ++ +L +FQ Sbjct: 304 SSTAGGQAKQG-TPQNLTEIFQQ 325 >gi|85859136|ref|YP_461338.1| hypothetical protein SYN_01495 [Syntrophus aciditrophicus SB] gi|85722227|gb|ABC77170.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 364 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/287 (9%), Positives = 63/287 (21%), Gaps = 50/287 (17%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-------------------- 54 + K ++I AL++ FL + + V KN + Sbjct: 12 LRERKGAVAVIVALAMTVFLGIAAMAVDVGHIMVVKNELHNAADADALARANLLYAHTPS 71 Query: 55 ------------SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 +A +A S+ L + Sbjct: 72 GFTSATPPTPDWAAAESAASTIDPANKSDGVTLTSYEVETGYWNLDQNPAGLQPKSITPG 131 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 ++ + N + + N ++S I Sbjct: 132 TRDVAAVKVTVRRVDGTNGGSIRHWFGAFVGNLTSNASATAIAICS------------SP 179 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + R P L+ + + +N D + + Q Sbjct: 180 GTAKPGTLVPMAIPLWIAKRASHYNSPSNLLTIGSAYHYDAYAPVNPDGDPYTNTVAGQW 239 Query: 223 KKRTK-----MAALKN-ALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 T LK+ + + + + + + +GY Sbjct: 240 TSLTSDPINNANHLKDIIVNGNSNPLSIGDPLYIEPGTMDVGYHDNY 286 >gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 427 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 15/170 (8%), Positives = 41/170 (24%), Gaps = 17/170 (10%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSL 287 + A+ LD + + ++ ++ I E ++ + + Sbjct: 61 MVIAAVERLLDRLQPGD------RISVVAFSGSATVIIPNQIVEDPESIKTQIRK--KLQ 112 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++Q L + + F + Sbjct: 113 ASGGTVIAEGLQQGITELMKGTRGAVSQAFLLTDGHGEDS-------LKIWKWEIGPDDS 165 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + KA + + I T+ + N L + ++ + Sbjct: 166 RRCQEFAKKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQ 215 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 15/176 (8%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +++ S+ +D L + T E + S + ++ T+ Sbjct: 328 WTINATQNTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEPDSTTNMED 387 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICD 355 A+ +A I + K + + I+FLTDGE N + +K Sbjct: 388 ALNKALSI----------AKLGKMRFKDSAKTPKPIIVFLTDGEMNEGITNPQALMKYVS 437 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQ 407 + I ++ + + L K +++ + Y +A L + ++ IS Sbjct: 438 DINVDNYPIYSLGFGKGADIEFLKKLSLNNTGFARVIYEASDASLQLHNFYKEISS 493 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 61/159 (38%), Gaps = 30/159 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + +D K +++ K + ++ MG++ ++ ++ Sbjct: 94 DISKSMLAEDSKPSRLEKAKMIVSKIVEE-------NPGEKMGIVVFSGTAMWQCPLTFD 146 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ ++ +L L T + A+ A + + + + Sbjct: 147 LHALKMFLQSVETTNLPLGGTRISSAIMLASKAASCESAG------------------SR 188 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +I ++DGEN+ + + + AK+ ++I++I I Sbjct: 189 VMILISDGENH----DSKIKEAVNAAKKAGLRIISIGIG 223 >gi|325292999|ref|YP_004278863.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] gi|325060852|gb|ADY64543.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] Length = 579 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 5/53 (9%), Positives = 17/53 (32%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 + + + N +I L+ F+ ++ + +K ++ Sbjct: 1 MQNISSLLPRFLKDSTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQT 53 >gi|322434933|ref|YP_004217145.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162660|gb|ADW68365.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 347 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 60/199 (30%), Gaps = 39/199 (19%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +K + AL FL + + ++G+ + ++ + E V R + Sbjct: 136 TSKFGRARKALSEFLRTSNPAD------EFFVVGFNDKPAVIVDYTSDVEDVEA---RMV 186 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T A+ L K K ++ ++DG +N Sbjct: 187 MLKPENRTALIDAVYLGVNKLKE--------------AKYDR----KALLIVSDGGDNRS 228 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQR--------LLKTC-VSSPEYHYNVVNA 395 + ++ +E+ ++I +I I LLK + + V + Sbjct: 229 RYTEGELR--RVVRESDVQIYSIGI-YDAYAPTEEEQLGPVLLKDISEMTGGRMFPVTDI 285 Query: 396 DSLIHVFQNISQLMVHRKY 414 + + IS + + Sbjct: 286 ADMADIASRISAELRNEYV 304 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 67/216 (31%), Gaps = 34/216 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + L++D+SGSM +++MAA K A LD+ Sbjct: 30 RDAPKVNLLLDVSGSMRARD--------------IDGQSRMAAAKQAFNEVLDATPKEVE 75 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + R ++ + P D T A K A LT Sbjct: 76 LGIRTLGANYAGDDR--------------KEGCKDTAQLYPVGPLDRTEA-KAAVATLTP 120 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIV 365 + + + K I+ ++DGE+ + ++ + I I Sbjct: 121 TGWTPIGPSLLKAADDLEGGNGSKRIVLISDGEDTCAPLDPC--EVAREIAAKGIGLTID 178 Query: 366 TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 T+ + + R L ++ + +V + D L Sbjct: 179 TLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTD 214 >gi|241612563|ref|XP_002406425.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215500814|gb|EEC10308.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 840 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 17/234 (7%), Positives = 58/234 (24%), Gaps = 39/234 (16%) Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCA------------MNSDPEDVNSAPICQDKK 224 + + I + + +S + D Sbjct: 89 VTFFQANPQAKASIMFLPYIDGISQFCDNDTHNPDALNLQNGRYGGRSTWDVITKHDDYV 148 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTR 282 ++ L+++ + + + + + ++ + + + + V + Sbjct: 149 NNRIGFLRHSFAHLIRHV-----INDGENVAIVQFNSSGKILQDLKEVKDIATREALVQK 203 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T ++ ++L S + II ++DGE N Sbjct: 204 VEQFSLGDYTCIGCGLETGIEVLESKSMLANG----------------GTIILISDGEEN 247 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + + + + T ++ +L + + Y + Sbjct: 248 KGPWIAEELP---RLLAKGVIVHTFALG-PSAEVKLHDMTLQTGGTAYFFPDYQ 297 >gi|213961715|ref|ZP_03389981.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] gi|213955504|gb|EEB66820.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] Length = 550 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/344 (11%), Positives = 96/344 (27%), Gaps = 53/344 (15%) Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 E N F + + S + + ++ M + + Sbjct: 82 NETYKEISENPFVAVAQQPVTTFSADVDRASYANLRRMLGYGQLPPKDAIRIEEMINYFD 141 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + V ++ ++++ + P I Sbjct: 142 YDYPAPTKEATSPLRVTPELAPTP--------WNPEHLLLRIGLQAKKLDLAQAPPSNIV 193 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 ++D+SGSM + K+ LK++ L L + + Sbjct: 194 FLIDVSGSM-------------------DEPNKLPLLKSSFKLLLTQLKPTD------RV 228 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 ++ Y + + + + ++ ++ + T + ++ AY+ + ++ Sbjct: 229 AIVTYASGTKVALSSTP-VKERQKIEKVLDNLYASGSTSGSSGIQLAYKEAQKNFIKNGN 287 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISINASP 373 II TDG+ N SN K +K +E+ I + + Sbjct: 288 N----------------RIILATDGDFNVGISNPRELEKFIEKQRESGIYMSVLGFGMGN 331 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + +T +Y D L + + ++V Sbjct: 332 YRDDMAETIADKGNGNYA--YIDDLTEAKKVLVNEFSGMLFAVA 373 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/354 (10%), Positives = 104/354 (29%), Gaps = 45/354 (12%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 +++ F ++ + + + + ++ T + ++ Sbjct: 20 SNLDVEEPIVFREDAASFGQTVVQFGGSRLVVGAPLEAVAVNQTGRLYDCAPATGMCQPI 79 Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI--------FLIELVVD 198 + + L+ N ++++ P R + ++ + Sbjct: 80 VLRSPLE-AVNMSLGLSLVTATNNAQLLACGPTAQRACVKNMYAKGSCLLLGSSLQFIQA 138 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + SM + + + A +K+ + + S L+ Sbjct: 139 VPASMPECPRQEMDIAFLIDGSGSINQRDFAQMKDFVKALMGEFASTS-----TLFSLMQ 193 Query: 259 YTTRVEKNIEPSWGTEKV--RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y+ ++ + + + + V + T + ++ + L K S Sbjct: 194 YSNILKTHFTFTEFKNILDPQSLVDPIVQLQ--GLTYTATGIRTVVEELFHSKNGS---- 247 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINA---S 372 +K ++ +TDG+ + + I DKA I I + Sbjct: 248 ---------RKSAKKILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQE 295 Query: 373 PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM-----VHRKYSVILK 419 P + L T S+P ++ + V N +L + + + + + + S + Sbjct: 296 PTALKELNTIGSAPPQDHVFKVGNFAALRSIQRQLQEKIFAIEGTQSRSSSSFQ 349 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 59 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 118 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 119 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 161 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 162 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 219 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 220 SRDAEETISQTI 231 >gi|198436258|ref|XP_002122777.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 4258 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 45/158 (28%), Gaps = 19/158 (12%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + + V + T D + T + A+ A L ++ Sbjct: 2119 NSTRIAALRFNRDVIPLWSFAQSTSSEDLIERIDRVTYDGSGTHTGKALTYAANRLFTEA 2178 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + LTDG + + K +K++ I++ Sbjct: 2179 EG-------------DRPDVPDLAVVLTDGRAQDNPGDT-----VQALKNKGVKVIAIAV 2220 Query: 370 NASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNIS 406 + + + + V + + L V +NI+ Sbjct: 2221 TNRVDINEIYAIASDPDEQNAFFVNDFEGLFSVVENIA 2258 >gi|47551047|ref|NP_999700.1| complement factor B [Strongylocentrotus purpuratus] gi|3928787|gb|AAC79682.1| factor B SpBf [Strongylocentrotus purpuratus] Length = 833 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 58/181 (32%), Gaps = 21/181 (11%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILK 290 K + + ++ +G + Y++ + + T V + K Sbjct: 409 KTFAKQLVGKLQ-VNTSPGGTRVGAVSYSSEARRLFNVNDFTSTVDVVKAIEANVNYTNK 467 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A++ + ++ + +TDG +N + Sbjct: 468 GTNLPAALET------------IGVMITETADESGYSSRKRILFIITDGFSNVGGAPS-- 513 Query: 351 IKICDKAKENFI-KIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 K KE+ KI I I+ + + + L S P E+ + + + + L + I+ Sbjct: 514 -KSAQPLKEDAALKIHCIGISRNTD-KTALAEIASPPVSEHVFYLSDYNELERAVEAITS 571 Query: 408 L 408 Sbjct: 572 T 572 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSW---GTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G + ++ V Sbjct: 60 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEQLLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK +K+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEVISQTI 232 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 58/166 (34%), Gaps = 24/166 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDS 294 +D I L+ +G++ Y+++V+ K Q V + + + T + Sbjct: 43 KFMIDIIHELNIGLAATRIGVVQYSSQVQNVFSLKA-FSKTEQMVKAINEIIPLAQGTMT 101 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + S ++ + +TDG + + V Sbjct: 102 GLAIRYAMNVAFSAEEG-------------ARPNVPHVAVIVTDGRPQDRVAEVAAAAR- 147 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSL 398 E+ I+I + + + L+ S P ++ + V + D + Sbjct: 148 ----ESGIEIYAVGVARADMTS--LRAMASPPFEDHVFLVESFDLI 187 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 54/155 (34%), Gaps = 18/155 (11%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +G+I ++ R E +++ + + V + L+ T A++ A + + Sbjct: 160 SSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIP--LMNFTTRIDRALRLAQKDM 217 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K II LTDG + +I D+ + + + I+ Sbjct: 218 FTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVIL 264 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ N + L + Y DSL Sbjct: 265 GVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 298 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 1/72 (1%) Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 K II LTDG + +I D+ + + + I+ + I ++ N + L + Sbjct: 12 VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVN-ETELSHITGGKKN 70 Query: 389 HYNVVNADSLIH 400 Y DSL Sbjct: 71 AYTAATFDSLTD 82 >gi|126282074|ref|XP_001368568.1| PREDICTED: similar to Coch-5B2 gene product [Monodelphis domestica] Length = 549 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + + +T + T K + Sbjct: 380 SNFRLMLEFVSNIAKIFEISDIGAKIAAVQFTYDQRPEFSFTDYTTKENVLAVIRNIRYM 439 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + I P + F++ +TDG++ + Sbjct: 440 SGGTATGDAISFTVRNVF---------------GPIRDGPNKNFLVIVTDGQSYDDVRAP 484 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + I + +I + A + LK S P+ + + L + +I Sbjct: 485 AAAAH-----KAGITVYSIGVAWAPLDD---LKDMASEPKETHAFFTREFSGLEQIATDI 536 Query: 406 SQLMV 410 + + Sbjct: 537 IRGIC 541 >gi|75676719|ref|YP_319140.1| hypothetical protein Nwi_2535 [Nitrobacter winogradskyi Nb-255] gi|74421589|gb|ABA05788.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255] Length = 554 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 21/49 (42%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 R ++ I+ + NF+I+ A ++ L + V + + +S Sbjct: 1 MRNLLRRFISDTRGNFAIMGAGCMVLVLGCTALGVDVGAIYADRRKAQS 49 >gi|162448822|ref|YP_001611189.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161159404|emb|CAN90709.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 563 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/256 (10%), Positives = 73/256 (28%), Gaps = 51/256 (19%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + S + +++D SGSM Sbjct: 8 MFASVAGGLVASAAGMSAALAQVAPLDTNHVVIIDRSGSM-------------------- 47 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYM---GLIGYTTRVEKNIEPSWGTEKVRQYV 280 ++ K A ++ +++ + + G+ Y+ + Sbjct: 48 YGDRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVASYSDTSSVTYPLTALPASGLDT- 106 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T +++A +L S+ P+ ++ +I L+DG+ Sbjct: 107 -AVDALVADGSTSIGAGLEEALDMLISES---------------PTKSARECVILLSDGQ 150 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS---SPEYHYNVVNAD- 396 +N + + D ++ +I++ + + + + SP + + Sbjct: 151 HNTPPAPSDF--YADYFSRVD-EVHSIALGSGADEAMMSDIAANYGFSPGLYLRADSDTA 207 Query: 397 ----SLIHVFQNISQL 408 LI F ++ Sbjct: 208 IDQLELIGAFNRMAND 223 >gi|47216962|emb|CAG04904.1| unnamed protein product [Tetraodon nigroviridis] Length = 519 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 53/171 (30%), Gaps = 21/171 (12%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + + I ++ R + + ++ + + R T +K Sbjct: 64 VSPRMRVSYIVFSARAVVILPLTGHRAEIDEGLERLRQIKPAGETFMHEGLKA------- 116 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +K + P +I LTDG+ + +++ D+A+ ++ + Sbjct: 117 ----------VSGQMKAQTSPSSSIVIVLTDGKLEVY-PYELSVQEADRARGLGARVFCV 165 Query: 368 SINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRKYSVI 417 + + L PE + V +L V I +++ Sbjct: 166 GVMD--FDHKQLAEIADGPERVFPVLSGFHALKDVVATILTQSCADVFTIE 214 >gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120] gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120] Length = 427 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/170 (8%), Positives = 41/170 (24%), Gaps = 17/170 (10%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSL 287 + A+ LD + + ++ + I E ++ + + Sbjct: 61 MVVEAVEKLLDRLQPGD------RISVVAFAGSATVIIPNQIVENPESIKTQIRK--KLQ 112 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++Q L + + F + Sbjct: 113 ASGGTVIAEGLQQGITELMKGTRGAVSQAFLLTDGHGEDS-------LKIWKWEIGPDDS 165 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 ++ KA + + I T+ + N L + ++ + Sbjct: 166 RRCLEFAKKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQ 215 >gi|3024063|sp|P97280|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus] Length = 886 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 75/226 (33%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM K+ + ALL LD + ++ Sbjct: 281 NIVFVIDISGSMA--------------------GRKIQQTRVALLKILDDMKQDDYLNFI 320 Query: 252 -VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G+ + + + E+ R +V D T+ + + ++LT ++ Sbjct: 321 LFSTGVTTWKDSLVQATP--ANLEEARTFVRSISDQ---GMTNINDGLLRGIRMLTDARE 375 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 + II LTDG+ N +S I+ KA E + + Sbjct: 376 QHTVPE-----------RSTSIIIMLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGF 424 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 425 GNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYEEVANPLLT 470 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/219 (10%), Positives = 64/219 (29%), Gaps = 37/219 (16%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 H + P + K+ + A++ F + + ++ + + Sbjct: 130 HFSSEDAPVSIGVIFDMSGSMSNKIDKSREAIVEFFKTANPDD------EFFVVAFNDKP 183 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + ++ + T A+ + Sbjct: 184 EVLQDFT---NRIEDIQEKLTILQPKDRTSLLDAIYLGMNKM------------------ 222 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLL-- 379 + +K ++ ++DG +N+ + N IK +E ++I I I + + Sbjct: 223 RQAKYERKALLIISDGGDNHSRYTENEIKS--MVREADVQIYAIGIYDLAPTTTEEMAGP 280 Query: 380 ----KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + + + N + L V I + ++ Sbjct: 281 ALLGEISDWTGGRMFPIDNVNELADVATKIGVELRNQYV 319 >gi|156382085|ref|XP_001632385.1| predicted protein [Nematostella vectensis] gi|156219440|gb|EDO40322.1| predicted protein [Nematostella vectensis] Length = 1221 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 52/191 (27%), Gaps = 20/191 (10%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +A K+ + ++ + G+ Y ++ + Sbjct: 82 SDADLAKTKSLVTTLVNQFSDSE---NSIRFGITTYGQEARTLANFKQNFDEAKLR---- 134 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T K Q D N +F+IF + G N Sbjct: 135 --------TAIKGIQKTGVQARRHDLAAMAVKNDLFSLEGGMRQGHPRFVIFFSAGANTG 186 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVF 402 + K + + ++ I +N++ + L + S ++ + L ++ Sbjct: 187 TADD--LKKASKPLTDLGVNMIAIGVNSNADQASLAELA--SENRFIFSANSPAELDALW 242 Query: 403 QNISQLMVHRK 413 +I M K Sbjct: 243 PSIEAQMCQEK 253 >gi|118579649|ref|YP_900899.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118502359|gb|ABK98841.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 337 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 75/241 (31%), Gaps = 36/241 (14%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + R + + +D+S SM + P + ++ + L L + Sbjct: 72 TKTVFNRSGIDLAIGIDVSKSMLAEDETLPPEGKKL----FSIPNRLNRARYCALTILSA 127 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQ 300 + +G+ + ++ + + + + DS + +D A+ Sbjct: 128 LK-------GERVGVFLFASKGVPIVPLTNDYGYCQYILKHANDSTISTPGSDLGQAITT 180 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + S K I+ ++DGE+ N + + + +A Sbjct: 181 GIYLFEESSRTSV-----------------KSIVLISDGEDIN-EDSSVMHEAAQRAAAK 222 Query: 361 FIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL-----IHVFQNISQLMVHRKY 414 I I T+ ++ + + E +Y + L ++IS R + Sbjct: 223 GIAIYTVGTGRGQGVMVPIRDAIGAAIEGYYQDEDGSYLKTRLEQDSLKSISNTSGGRYF 282 Query: 415 S 415 S Sbjct: 283 S 283 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 59/180 (32%), Gaps = 25/180 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTD 293 + + L ++ V +GLI Y+ + S + +T + + T Sbjct: 54 NWTIKIVRDLPIHEDAVRVGLIQYSDAAKTEFNLSR-YSERNDIITHLETLTFMPGEDTR 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ +A + + + L + II TDG S K Sbjct: 113 TGVALDKADEEMFNYIGG-------------ARLKATRLIILFTDG-----LSMDKPTKS 154 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +KI TIS+N+ +L V + + + D + + + H + Sbjct: 155 AKTLRRKGVKIYTISVNSIGFVPEMLGI-VGDADNVFGPTDEDRIEE---RLLTEVEHSR 210 >gi|223670960|dbj|BAH22727.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 68/200 (34%), Gaps = 30/200 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +SA + +D R + + + F +S + +I ++ + Sbjct: 259 DSSASVGEDNFRKGIQFARTIIDEF-----GISATPSGTRVAVIVFSDAAQVIFNLKSNR 313 Query: 274 EKVRQYVTR--DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ R + T + A++ +T + + + +K Sbjct: 314 IVDKEEAVRRLENLQFQGGGTATKLALQAVIDTVTPELRNNS----------------KK 357 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSP--EY 388 + +TDG++N S + +I I ++ S + LK+ S P + Sbjct: 358 ALFLITDGKSNKGGSPDRP---AKVLRAGFNFEIFAIGVSDSVDKDE-LKSIASEPFRTH 413 Query: 389 HYNVVNADSLIHVFQNISQL 408 Y + + +L+ + + I+ Sbjct: 414 VYQIKDYATLVKLKELITTK 433 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 36/110 (32%), Gaps = 2/110 (1%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS--KM 68 K ++ ++ +++ FL + + V K +++A ++A LA A+ Sbjct: 5 VSKSPRKQQGLVVVLVTAALLVFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAATILDN 64 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 + + + N + + F I S F T Sbjct: 65 SKDKDAVDAEIGTALNAMAASTGNHEIDFTTASINIDYSNDPQDFTGTAT 114 >gi|118096699|ref|XP_414253.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Gallus gallus] Length = 886 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 59/193 (30%), Gaps = 20/193 (10%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 ++ + ALL LD I H + G + + ++ R++ Sbjct: 291 SGSMSGREIEQTREALLKILDDIKEDDH-FNFILFGSDVHIWKETLIKATPENLDEARKF 349 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V T+ + + +L + + + II LTDG Sbjct: 350 VRSID---TEGMTNLYGGIMKGIDMLNAAHEGNLV-----------PKRSASIIIMLTDG 395 Query: 340 ENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVN 394 + N SN I+ KA E + + + L K + + + + + Sbjct: 396 QPNVGISNTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMALENKGLARRIYPDSDS 455 Query: 395 ADSLIHVFQNISQ 407 A L + +S Sbjct: 456 ALQLQGFYDEVSN 468 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/334 (10%), Positives = 78/334 (23%), Gaps = 54/334 (16%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + + S+ + N R+ +T R+D + Sbjct: 27 AFGADNGSSTAALNFTITSSQTEYAKPPNADAQGRLDITLTPQGRVD-------NIIRPP 79 Query: 152 TSYDYRLQFIEHLLN-----QRYNQKIVSFIPALLRIEMGERPIFLIELVVDL------- 199 + ++ + S + L+ + Sbjct: 80 IDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPNDRFALVPFSDGVQSDKVVP 139 Query: 200 --SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-----LSHVKEDV 252 SG+ + + + + + AL+ A F D L+ V Sbjct: 140 FPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQAQSFFNDPARKKYIIFLTDGMPTV 199 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + T +V + I +V + + T A + + Sbjct: 200 SIAKEPITYKVCEGILFWKTCNQVTENLNVQYVLYSNGIT--------AARTIY------ 245 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 P P K K ++ + N I + +I + Sbjct: 246 -----------YPDHPETKTYSDREKYREFEEKIRLHGTNVAKTLGMNNITLYSIGFGNN 294 Query: 373 P--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + L K ++ +L +FQ Sbjct: 295 QEVDMGYLEKLSSTAGGQAKKG-TPQNLTEIFQQ 327 >gi|307154093|ref|YP_003889477.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984321|gb|ADN16202.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 240 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 52/196 (26%), Gaps = 29/196 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVR 277 + + K A ++ + + ++ Y VE + P ++ Sbjct: 49 SGSMAGSALKYAKMAAQKLVEYLTPEDI------LSVVVYDDFVETIVAPQPVKDQAAIK 102 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + R T+ + + ++ ++ LT Sbjct: 103 NQINRIN---ARGCTNLSGGWLTGCDHVKANLSAERLN----------------RVLLLT 143 Query: 338 DGENNNFKSNV-NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG N S+ K + E I T+ N L+ Y + + Sbjct: 144 DGLANVGNSDPKILTKTATEKAEQGIITTTLGFGTYFNEDLLINMANGGKGNFYFIQSPQ 203 Query: 397 SLIHVFQ-NISQLMVH 411 VF+ I L+ Sbjct: 204 DAAQVFEIEIESLVSD 219 >gi|162452618|ref|YP_001614985.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] gi|161163200|emb|CAN94505.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] Length = 1031 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 63/261 (24%), Gaps = 42/261 (16%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T ++ + L + +S + R F L Sbjct: 584 ATASVSLVRPVTLFVALDRSNSMRETLSGTD-ISALRWDPARNA-LRAFFQDPASAGLGV 641 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + + +P + NS+ C +NAL+ F + Sbjct: 642 AFRFWPHDNPGNCNSSSSCGVAFG---DGCRNALIPFDGTAARR---------------- 682 Query: 262 RVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + Q T M A T+ + + Sbjct: 683 -------LTEDPAPWDQQENALVNALFPGPGTSGDTPMYPALHGATTWAINYKNAHPEEE 735 Query: 321 GVKIPSLPFQKFIIFLTDGENNN-FKSNVNTIKICDKA-KENFIKIVTISINASPNGQRL 378 ++ +TDG ++ + + A + +++ + +S N + + Sbjct: 736 VA----------VVLVTDGIPSDCDTVPNHIAALARNAFANHGVRVHAVGFGSS-NAEII 784 Query: 379 LKTCVSSPEYHYNVVNADSLI 399 +N+ +L Sbjct: 785 NLIADQGGGQAFNLNAGTTLQ 805 >gi|160878421|ref|YP_001557389.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160427087|gb|ABX40650.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 551 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/252 (10%), Positives = 76/252 (30%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K+ +I+ + + ++D+SGSM K Sbjct: 172 NPDTKLFLAGIQTEKIDFSKSAPSNLVFLIDVSGSMMDED-------------------K 212 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ A LL +++ + ++ Y + + G +K + + Sbjct: 213 LPLVQRAFLLLTENLTEKD------RISIVTYAGNDTVVLSGAKGNQK-EKIQNAITELE 265 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + ++ AYQ+ + +I TDG+ N S Sbjct: 266 AGGSTFGSKGIETAYQLAMENYIEGGNN----------------RVILATDGDLNVGVTS 309 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + ++ +++ + + + ++ + + + L+ + + Sbjct: 310 ESELTNLIEEKRKSGVALSVLGFGTGNIKDNKMEALADHGNGNYAYIDS---LMEARKVL 366 Query: 406 SQLMVHRKYSVI 417 + M +V Sbjct: 367 VEEMGATLVTVA 378 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 9/57 (15%), Positives = 25/57 (43%), Gaps = 4/57 (7%) Query: 4 LSRFRF----YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 +SR F G ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 1 MSRVTFSSGAALHGGRRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 [Sus scrofa] gi|3024051|sp|P79263|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Major acute phase protein; Short=MAP; Flags: Precursor gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa] gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Sus scrofa] gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor Length = 921 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 58/200 (29%), Gaps = 24/200 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + K+ + AL+ L + L+ ++ + + E V + Sbjct: 279 TSGSMRGRKIQQTREALIKILGDLGSRDQFN------LVSFSGEAPRRRAVAASAENVEE 332 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ AM A Q+L + + FII LTD Sbjct: 333 AKSYAAEIHAQGGTNINDAMLMAVQLLE------RANREELLPARSVT-----FIILLTD 381 Query: 339 GENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNV 392 G+ + + + + + L K + + + + Sbjct: 382 GDPTVGETNPSKIQKNVREAIDGQH-SLFCLGFGFDVPYAFLEKMALENGGLARRIYEDS 440 Query: 393 VNADSLIHVFQNISQLMVHR 412 +A L +Q ++ ++ Sbjct: 441 DSALQLEDFYQEVANPLLRL 460 >gi|296481520|gb|DAA23635.1| inter-alpha globulin inhibitor H2 polypeptide [Bos taurus] Length = 946 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTEDHFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + + Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQVADAKNYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGMLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQNIRD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 + L + + Y + L + +S Sbjct: 454 GFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLKKFYNQVST 495 >gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 [Bos taurus] gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus] Length = 946 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTEDHFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + + Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQVADAKNYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGMLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQNIRD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 + L + + Y + L + +S Sbjct: 454 GFDVDYDFLKRLSNDNRGIAQRIYGNQDTSVQLKKFYNQVST 495 >gi|222055940|ref|YP_002538302.1| hypothetical protein Geob_2856 [Geobacter sp. FRC-32] gi|221565229|gb|ACM21201.1| conserved hypothetical protein [Geobacter sp. FRC-32] Length = 391 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 35/373 (9%), Positives = 96/373 (25%), Gaps = 34/373 (9%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS----- 70 E+ + + ++ FL++ G I + + ++ + +A + LAGA + Sbjct: 4 QEEQGMDTTYVGIMLVVFLVIAGLAINIGYMYVSEDDLHNAAELSALAGAQAIGQQMQLS 63 Query: 71 --NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + + + ++ +N + + Sbjct: 64 ARTGTGKLKETIYDQVQPAARAAAIDHVSGHHQASALIEIRNSNINRLTTENDLTMGFWN 123 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ + N + E I+S I ++ Sbjct: 124 ISSRTYTPGGTPVNAVQVRTRRTAE-------SES-AGLGTLGSILSKISGSQKLNYTPE 175 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPI------CQDKKRTKMAAL---KNALLLFL 239 + I + D + S+ + D +M + + A Sbjct: 176 AVAAIPALADANFSVCVDACGTECTYPNICTIQERKLSPDSSDPRMDSPTKSRYAYTSLS 235 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMD---SLILKPTDST 295 + + MG+ ++ +R + + + K D Sbjct: 236 YPPGDTMSLSNLICMGMPPKEVCGKEIYTIRDRDDNALRDMESVMYNPNMNKANKDYDKA 295 Query: 296 PAMKQAYQILTSDK----KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 Q + ++ ++ + K + + + T G + N + Sbjct: 296 TGALQGWWVIAPVTDCLPATQENSHERHRVTKYALIRISRICVTGTTGCSQNDTAFDAPA 355 Query: 352 KICDKAKENFIKI 364 C + +N + I Sbjct: 356 AQCRE--DNGLYI 366 >gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster Length = 889 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 75/226 (33%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM K+ + ALL LD + ++ Sbjct: 284 NIVFVIDISGSMA--------------------GRKIQQTRVALLKILDDMKQDDYLNFI 323 Query: 252 -VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G+ + + + E+ R +V D T+ + + ++LT ++ Sbjct: 324 LFSTGVTTWKDSLVQATP--ANLEEARTFVRSISDQ---GMTNINDGLLRGIRMLTDARE 378 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 + II LTDG+ N +S I+ KA E + + Sbjct: 379 QHTVPE-----------RSTSIIIMLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGF 427 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 428 GNNLNYNFLETMALENHGVARRIYEDSDANLQLQGFYEEVANPLLT 473 >gi|316941492|gb|ADU75526.1| hypothetical protein Clo1313_2521 [Clostridium thermocellum DSM 1313] Length = 1112 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 13/124 (10%), Positives = 36/124 (29%), Gaps = 4/124 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + SI + ++ + + L+ + +ESA + + + + Sbjct: 1 MRFFKKTEGAISIFLCIVLLIMIAIPAILVEGTRIKAAQTQIESALDISAKSALANYNYL 60 Query: 72 LSRLGDRFESISNHAKRALIDDAK----RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 L L S+ + + + IKE + + V + + Sbjct: 61 LKELYGLMALSSDDPDLLMEEIIYYLERNLMVEGIKEHKTKAESTLDTINESETVKNIKK 120 Query: 128 SMTH 131 + Sbjct: 121 FLGV 124 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 20/42 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|156409365|ref|XP_001642140.1| predicted protein [Nematostella vectensis] gi|156229281|gb|EDO50077.1| predicted protein [Nematostella vectensis] Length = 156 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 46/156 (29%), Gaps = 26/156 (16%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + +K + + + +GL+ Y+ K + T K Sbjct: 26 RFKGVKRFIQNVISRFH---ISPKHTRVGLVLYSNNPYKIFGFNKYTNKNAAMKATGRIP 82 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T + A+ + L S ++ +I LTDG + + Sbjct: 83 YPRRGTKTGRALAYTGRYLF------------------RSSKRRRVLILLTDGRSYD--- 121 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 K ++ I I + + + N ++L Sbjct: 122 --RVSAPARKLRQAGIHIYAVGVGRNYNIKQLRSIA 155 >gi|326382230|ref|ZP_08203922.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] gi|326198960|gb|EGD56142.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] Length = 552 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 90/374 (24%), Gaps = 46/374 (12%) Query: 42 YVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 ++ K+ M + N+++ S + + + + + + Sbjct: 201 DLVQVLTKRALMMNDNDSSPDLVQVAASSGPTAALTTERAYTKFTRENPDAQLEAAVPGT 260 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 SL + + + +A + D + Sbjct: 261 GTVSLDYRA--VNVAPAADQTVAGDAIEALVATLQTDKGKQIRQSEAVRSADGEPLPDNE 318 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + +LR + D+SGSM Sbjct: 319 GVDGVTVIKPPARELVDNILRKWTALTQPIRALVAQDVSGSMEQDAGG------------ 366 Query: 222 DKKRTKMAALKNA----LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--- 274 R++ A L+ A L F + L G V S T+ Sbjct: 367 ---RSRAALLREASLFGLQQFPKNTALGYWEFSIDRGGKGQDYREVMPIAPISDKTDGRT 423 Query: 275 --KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + R + T A++ + + Sbjct: 424 NRDLLADAIRQTLGNVHGGTGLYDTALAAFKTVYDSY----------------DPAYSNS 467 Query: 333 IIFLTDGENNNFKSNVN--TIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEY 388 +I +TDG N + S + + K I I+ + I+ + L + ++ Sbjct: 468 VIIMTDGRNEDRDSITLQHLVSELNIMKNPARRIPIIAVGISEDADADALKQIADATGGS 527 Query: 389 HYNVVNADSLIHVF 402 + + + + Sbjct: 528 SFIARDPKDIPAIL 541 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 20/42 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|320103720|ref|YP_004179311.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] gi|319751002|gb|ADV62762.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] Length = 738 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/359 (10%), Positives = 104/359 (28%), Gaps = 61/359 (16%) Query: 20 ANFSIIFALSVMSFLLLIGF--LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + ++ + V+ LL I + + +A + ++ + + Sbjct: 138 GSMAVHAGVLVILLLLPGAAEEAIDLSK------RLNAAVDTSVASVEEPLKIMADPSQV 191 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 R E +++ + + + + + Q + S+ I+ + + Sbjct: 192 RSEYLTDPNANPVGG-VLNPSPSATPVVAAAATVSSKSAMPQAVELSASIASPLASMPSM 250 Query: 138 DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + ++ L+Q + + +R + + Sbjct: 251 P--AVDFAGGGGIAGDVTQAVDNVDQALDQLVREILREL----------QRTKLTVVWMF 298 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D S SM + +++ +D+ + ++ Sbjct: 299 DESNSM-----------KDDQQAIRDRFERIS------SELIDNTSSEQRRSAALLHAVV 341 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 G+ ++ ++EP+ ++++ +T + T + +S Sbjct: 342 GFGQQIHFDLEPTPDVTRIKEAITNLRIDMTG----------------TENTMQSVQAVI 385 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS----INAS 372 R G K + + I+ +TD ++ TI+ C AK + + I Sbjct: 386 ARLGSKYITKERRMLIVLVTDESGDDGARVEETIQACLAAK---VSVYVIGRQSLFGTD 441 >gi|326779287|ref|ZP_08238552.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] gi|326659620|gb|EGE44466.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] Length = 575 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 63/224 (28%), Gaps = 43/224 (19%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ERP + VVD+SGSM ++ ++ +L + D Sbjct: 204 AAPSTAERPPAALTFVVDISGSMAE-------------------TGRLDLVRKSLAVLTD 244 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + V L+ ++ E + + + + + T+ + + Sbjct: 245 ELRDDDSVS------LVTFSDAAETRLPMTRLQGNRNRIKDVVEEMRPEQSTNVEAGITR 298 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK--SNVNTIKICDKAK 358 Y + V+ ++ L+D N ++ +I + Sbjct: 299 GY----------------EESVEGHRKGATNRVVLLSDALANTGDTEADGILERIDSTRR 342 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 E I + + + + + + V + VF Sbjct: 343 EYGITLFGVGVGSDYGDAFMERLTNKGDGNTTYVGDEAQARKVF 386 >gi|313903839|ref|ZP_07837228.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313466027|gb|EFR61552.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 1151 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/174 (8%), Positives = 38/174 (21%), Gaps = 30/174 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ A + + ++ + ++ V R + Sbjct: 486 KLQMAVEAARRVAQLLTPAD------RLAVVLFDSQAYVTRPLEPVRNPGE--VDRAFPA 537 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A + + +I LTDG Sbjct: 538 AAQGGTSLGSGLAAAL------------------PLMEGVKADVRHVIALTDGV----SE 575 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + + + + ++I + L + Y + L Sbjct: 576 PFDVTGLARAFRRQGVTLSAVAIGPDADRNTLAQLAREGGGALYEAADPGQLPT 629 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 60/186 (32%), Gaps = 21/186 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-KNIEP-SWGTEKVRQYVTRDMDS 286 +K FL + L E+ + G++ + +V+ + + + Sbjct: 4 QKIKQTYEAFLKILADLP---EEDHFGILIFDDKVDKWQNTLVKAVPDNIIKAKQFVSKI 60 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A+ A ++L + + K+ I+FL+DGE + + Sbjct: 61 SARGGTDINKALLAAVKMLKNTSRN-----------KLLPKISTSIILFLSDGEPTSGVT 109 Query: 347 N-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHV 401 N I KA E + + + L K + + + + A L Sbjct: 110 NHNEIINNVKKANERQTTLYCLGFGNDVDFNFLEKMALENGGLARRIYEDSDAALQLQGF 169 Query: 402 FQNISQ 407 + ++ Sbjct: 170 YNEVAN 175 >gi|223670958|dbj|BAH22726.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 67/200 (33%), Gaps = 30/200 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +SA + +D R + + + F +S + +I ++ + Sbjct: 259 DSSASVGEDNFRKGIQFARTIIDEF-----GISATPSGTRVAVIVFSDAAQVIFNLKSNR 313 Query: 274 EKVRQYVTR--DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ R + T + A++ + + + + +K Sbjct: 314 IVDKEEAVRRLENLQFQGGGTATKLALQAVIDTVNPELRNNS----------------KK 357 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSP--EY 388 + +TDG++N S + +I I ++ S + LK+ S P + Sbjct: 358 ALFLITDGKSNKGGSPDRP---AKVLRAGFNFEIFAIGVSDSVDKDE-LKSIASEPFRTH 413 Query: 389 HYNVVNADSLIHVFQNISQL 408 Y + + +L+ + + I+ Sbjct: 414 VYQIKDYATLVKLKELITTK 433 >gi|221116226|ref|XP_002166419.1| PREDICTED: similar to collagen, type XXVIII [Hydra magnipapillata] Length = 680 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 51/176 (28%), Gaps = 21/176 (11%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +G+I Y + E + + + + D T + A+K+A + + Sbjct: 161 STTRVGIIRYASDAEIALHLTRFNDTTSRDTAIDNIFYKTGGTRTDIALKKAADVFQFSE 220 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +++ G + + D D+ K+ + I I I Sbjct: 221 QKNQVLILVTDGPTNSLEINKDHFVEGKD----------LVAGPVDRLKDAGVAIFCIGI 270 Query: 370 ----NASPNGQRL---LKTCVSSPE--YHYNVVNADSLIHVFQNISQL--MVHRKY 414 + + ++ S P + + L IS+ +V K Sbjct: 271 VPDSETPDEIETMKEEMRVIASEPTKLHLFMSDGYHELQRKVHAISEAACVVTEKL 326 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 33/200 (16%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDL----LSHVKEDVYMGLIGYTTRVEKNIEP------ 269 +++AAL+ AL + D + + + +I + R ++ Sbjct: 350 SGSMAGSRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFAGRANDPVDFAVNDPR 409 Query: 270 --SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S V +V T A++ Y+ + + Sbjct: 410 PGSADLAGVNTFVDGLRLQD---GTAIYSALEAGYRAAGAAVE--------------ADP 452 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDK--AKENFIKIVTISINASPNGQRLLKTCVSS 385 + I+ +TDGENN+ S + + A ++ TI+ + L + Sbjct: 453 GYLTSIVLMTDGENNSGISAADFRSSYQRLPAAARAVRTFTIAFG-EADPAALRDISADT 511 Query: 386 PEYHYNVVNADSLIHVFQNI 405 ++ SL F++I Sbjct: 512 GGAVFDART-SSLADAFKDI 530 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 64/194 (32%), Gaps = 22/194 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKV 276 + K A+ L + V +I + + + + Sbjct: 396 TSGSMAGESIVQAKRAVDFALTQLHPEDSVN------VIEFNDAPQALWNLAMPATANNI 449 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ T+ PA+ A TN +Q + S + ++F+ Sbjct: 450 QRARNWVASLSANGGTEMAPALSMALH----------KTNLEQQNINEGSPVQLRQVVFI 499 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG +N + ++ I+ ++ ++ ++ TI I ++PN + + + + + + Sbjct: 500 TDGSVSNEDALMSLIE--NQLADS--RLFTIGIGSAPNSYFMTQAAQAGRGTFTYIGDIN 555 Query: 397 SLIHVFQNISQLMV 410 + + + Sbjct: 556 QVQQKMTELFNKLT 569 >gi|297678516|ref|XP_002817116.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 2 [Pongo abelii] Length = 1899 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1173 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1232 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1233 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1275 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1276 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1328 Query: 406 SQLMVH 411 ++ Sbjct: 1329 EDNLIT 1334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFDIGPKRVQVALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|223462031|gb|AAI46869.1| Collagen, type XII, alpha 1 [Homo sapiens] Length = 1899 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1173 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1232 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1233 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1275 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1276 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1328 Query: 406 SQLMVH 411 ++ Sbjct: 1329 EDNLIT 1334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|119569134|gb|EAW48749.1| collagen, type XII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1899 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1173 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1232 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1233 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1275 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1276 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1328 Query: 406 SQLMVH 411 ++ Sbjct: 1329 EDNLIT 1334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|114608142|ref|XP_001142833.1| PREDICTED: collagen alpha-1(XII) chain isoform 1 [Pan troglodytes] Length = 1899 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1173 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1232 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1233 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1275 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1276 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1328 Query: 406 SQLMVH 411 ++ Sbjct: 1329 EDNLIT 1334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|93141049|ref|NP_542376.2| collagen alpha-1(XII) chain short isoform precursor [Homo sapiens] Length = 1899 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1173 DNFNKVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1232 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1233 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1275 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1276 -EVKKAALVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1328 Query: 406 SQLMVH 411 ++ Sbjct: 1329 EDNLIT 1334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + + +K Sbjct: 52 RTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ +F + +K + +TDG++ + Sbjct: 112 TLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVADFESLSRIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|34783791|gb|AAH56811.1| Zgc:112265 protein [Danio rerio] Length = 927 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 74/249 (29%), Gaps = 47/249 (18%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERP--IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + +S + + P + ++D SGSMH Sbjct: 247 RGNPKGEVQISNGYFVHYFAPSDVPHIPKNVVFIIDRSGSMH------------------ 288 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K+ ++ALL L +D H + I + R E E +V R Sbjct: 289 --GRKIRQTRSALLTILKDLDEDDHFGLITFDAEIDF-WRRELLQATKANRENAESFVKR 345 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D T+ A+ ++ + P +I LTDG+ Sbjct: 346 IQD---RGATNINDAVLAGVDMINRN----------------PRKGTASILILLTDGDPT 386 Query: 343 NFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD-S 397 ++N+ I +A + + + N L K + + Y +AD Sbjct: 387 AGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVARRIYEDSDADIQ 446 Query: 398 LIHVFQNIS 406 L + ++ Sbjct: 447 LQGFYDEVA 455 >gi|326670654|ref|XP_002663415.2| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 3218 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 53/181 (29%), Gaps = 18/181 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + L+ IDL ++ + +GL Y T V + K + + Sbjct: 1626 NNFKEVMEFILNLIDLFYTERDRLQIGLAHYATDVTDVFYLNTYNNKD-DIINAITRAEY 1684 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + S + T+F ++ + ++ +T G S Sbjct: 1685 KGGREIRTG---------SAIRHVQKTHFVKEKGSRKDEGIPQILMVVTGGR-----SRD 1730 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ K + +++ + + + + L S L + + I Sbjct: 1731 DSKSAALGLKASGVRVYAVGVG---DIEDELNNLGSEATTVARASTFQELSELNEQILDT 1787 Query: 409 M 409 + Sbjct: 1788 L 1788 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 42/371 (11%), Positives = 96/371 (25%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 46 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 103 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 104 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 156 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 157 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 204 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ N + Sbjct: 205 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPGANFYLNTY 261 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 262 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 309 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 310 QILVLLSGGR-----SFDSVNAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 362 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 363 SVSDFSELPNV 373 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 79/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + VN++ S+ + + + Y L + +++++ + Sbjct: 322 DSVNAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 381 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 382 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 429 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 430 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGG----RP 481 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + + F + +I L+ G S + Sbjct: 482 LYTGAAL-----QYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 531 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 532 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 583 >gi|220941746|emb|CAX15447.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 2026 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 53/181 (29%), Gaps = 18/181 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + L+ IDL ++ + +GL Y T V + K + + Sbjct: 496 NNFKEVMEFILNLIDLFYTERDRLQIGLAHYATDVTDVFYLNTYNNKD-DIINAITRAEY 554 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + S + T+F ++ + ++ +T G S Sbjct: 555 KGGREIRTG---------SAIRHVQKTHFVKEKGSRKDEGIPQILMVVTGGR-----SRD 600 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ K + +++ + + + + L S L + + I Sbjct: 601 DSKSAALGLKASGVRVYAVGVG---DIEDELNNLGSEATTVARASTFQELSELNEQILDT 657 Query: 409 M 409 + Sbjct: 658 L 658 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 64/184 (34%), Gaps = 28/184 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + D + +G+I Y+ +VEK + + K + D Sbjct: 814 FEIIKSLVKTLSDQVA---LDLAAARVGIINYSHKVEKVAHLTQFSTKDDFKLAVDRMQY 870 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A+ +A + + +K + +TDG+ + + Sbjct: 871 LGEGTYTASALHEANHMFEA-----------------ARPGVKKVALVITDGQ-TDTRDE 912 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSLIH 400 N ++ KA + ++I I + + + + + P+ + Y + +L Sbjct: 913 KNLTEVVKKASDINVEIFVIGVVKKNDPNFEVFHKEMNLIATDPDSEHVYQFDDFITLQD 972 Query: 401 VFQN 404 + Sbjct: 973 TLKQ 976 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 20/42 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 44/443 (9%), Positives = 117/443 (26%), Gaps = 66/443 (14%) Query: 23 SIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA---------GASKMVSNLS 73 +++ + + + + + G +DW + AA + Sbjct: 9 AVLISSASLMTIAIAGIASQTIDWQDLGKKLTDTTEAAGRMTSSGKPDGDASVSTAELPK 68 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI--------VNSS 125 + A A + S S + T + Sbjct: 69 QEETHTVVAEIARPVATPQPAPAPALPQKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLE 128 Query: 126 RISMTHMANNRLDSSNNTIFY---NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 +M + + + + + + + +P Sbjct: 129 PPAMPAVQLEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDA 188 Query: 183 IEMGERPIFL-----IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM---AALKNA 234 + + E + + S + ++ P + ++ + + K+ Sbjct: 189 VRVEEMVNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQN 248 Query: 235 LLLFLDSIDLLSHVK-----------------EDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 L+ +D+ ++ ++ + ++ Y +EP+ +K R Sbjct: 249 LVFLIDTSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTR 308 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T +K AY + + Q II T Sbjct: 309 -ILEKINALTSGGSTAGHEGLKGAYAL----------------AETMTGDGEQTRIILAT 351 Query: 338 DGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 DG+ N S+ + + + +EN + + L++T + + D Sbjct: 352 DGDFNVGLSDPDSLKRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQNGQGV--AAYID 409 Query: 397 SLIHVFQNISQLMVHRKYSVILK 419 +L + + +V S+I + Sbjct: 410 TLSEARKVLVDQVVSS-ISMIAQ 431 >gi|282898869|ref|ZP_06306855.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] gi|281196242|gb|EFA71153.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] Length = 1499 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 39/395 (9%), Positives = 108/395 (27%), Gaps = 72/395 (18%) Query: 53 MESANNAAILAGASKMVSNLSRL-----GDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 + +A + A + A+ + + + + + + + A F ++ S Sbjct: 680 LSNATDGATITTATAVGTIQNDDVVAPGLAIAPTNAIQTEGNIGTKAFTFTVTRSGDTTS 739 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL--------- 158 SA + T + ++ + + V D + Sbjct: 740 SSSANWAVTGSSTNQADVTDFGGTLPTGTVNFTAGETSKTIIVDVLGDTTVEPDEGFTVT 799 Query: 159 ----------------------QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + ++ + + L I V Sbjct: 800 LSNPTNATITTATAVGTITNDDDNTGSIQGFKWKDINGNGVREDLIQGDSPN----IVFV 855 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D+SGS P + +D + + + A + L++ + + + Sbjct: 856 IDVSGSTRGPFQGIPVGD----VNKDGIQNTILDAEIAGFIALNNSLVRKGFGSRAKVSI 911 Query: 257 IGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + + + + + + + T+ A+++A + L Sbjct: 912 VSFASDAKTLLTTNPETDSNKNGTKDVEEKLISLKSGGETNFEIALQEAAKTL------- 964 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 I + +IF++DG+ N + ++ +K+ + Sbjct: 965 ---------RDIGTTAGNGNVIFMSDGQPNQGNYTDEVLD----LQKAGVKLSAFGVGTG 1011 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + LK + + D L+ VF + Sbjct: 1012 ASIDS-LKLINPNASIF---TSTDQLLGVFDGLGS 1042 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 20/42 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 20/42 (47%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ SI+ AL + + +G + + + ++ ++++ Sbjct: 16 RRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 57 >gi|291223815|ref|XP_002731903.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 996 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/258 (12%), Positives = 88/258 (34%), Gaps = 51/258 (19%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 L + + + I P + R + LV+D SGSM Sbjct: 316 VMLNHSDFINGTNPTRVIDDTSPNFIIKRSEVRR---VVLVLDTSGSMA----------- 361 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 ++ L F+D+ +++ ++G++G+++ + Sbjct: 362 ---------GNRIQRLYQTATYFIDT-----KIEDGSFVGIVGFSSTAVILAGMTEIKYG 407 Query: 276 VRQY-VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + ++ ++ T ++ A Q+L + S + ++ Sbjct: 408 FQRDDIASNVPQVVDGFTSIGAGLELALQVLENGNVASEGAS----------------LL 451 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVV 393 +TDG N N + ++ +++ T + + + Q +L+ ++ ++ V Sbjct: 452 LITDGAENRSPFIANVLP---DIYDSGVRVDTFA--YTESAQLILQNLSDTTGGLYFYVP 506 Query: 394 NADSLIHVFQNISQLMVH 411 + D+ +++ + Sbjct: 507 DNDNSTAFIDSLAATITD 524 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/257 (11%), Positives = 76/257 (29%), Gaps = 47/257 (18%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 ++++ R+ + P + +VD SGSM Sbjct: 192 SAAPWNPQRQLLKIGIQGYRVAPQDIPAANLVFLVDTSGSMAERD--------------- 236 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 K+ +K AL + + + ++ Y + ++ + G +K R Sbjct: 237 ----KLPLIKGALKQLVAQLRPQD------RVAIVTYAGQASMTLDSTPGDQKARIN-AA 285 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T+ + AY + I+ +DG+ N Sbjct: 286 IDELRAAGSTNGGAGLDLAYAQAAKGFVKGGVN----------------RILLASDGDFN 329 Query: 343 NFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRL-LKTCVSSPEYHYNVVNADSLIH 400 +++ + ++ I + T+ + L ++ + ++ DSL Sbjct: 330 VGATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHY---LDSLRE 386 Query: 401 VFQNISQLMVHRKYSVI 417 + ++ M ++ Sbjct: 387 ARKVLAAQMSSTLLTIA 403 >gi|154250522|ref|YP_001411346.1| hypothetical protein Plav_0066 [Parvibaculum lavamentivorans DS-1] gi|154154472|gb|ABS61689.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] Length = 566 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/299 (7%), Positives = 71/299 (23%), Gaps = 24/299 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + E+ S++ ++ L + +I Y + +++A +AA L Sbjct: 10 LTGLLGRYRRDERGVISVMAVGALFLVLAVAMVVIDTGSMLYARRDLQAATDAAALGAVR 69 Query: 67 KMVSNLSRLGDRFESISNHAKRA--------LIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 ++ + + + A D + ++ + ++ + Sbjct: 70 QIGNAENAARSILDLNGYSPGDAPQVVTGIYSADPSLAPRDRFVEADGATEASQINAVRV 129 Query: 119 QNIVNSSRISMTHMANNRLD-SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + L + + + Sbjct: 130 IKYAEAPTYFASLFGFENLTRINAVSTAAYTKT---------VSFSAGTRVAELNSGLAN 180 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM----AALKN 233 L + + L++ + ++ M D + A + + Sbjct: 181 QLLGGLLGTTLNLSLVDYNGLANANIDALMFLDALATQVGLEAGSDTYGDLLSGNATVAD 240 Query: 234 ALLLFLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + +D ++ + L P ++ R + S+ Sbjct: 241 LVRAAVDVLNSETFDGNPAVARGALEAALNPAGNISVPLNDILNATPFLNRTIGSVASG 299 >gi|149601522|ref|XP_001515070.1| PREDICTED: similar to integrin alpha 2 subunit, partial [Ornithorhynchus anatinus] Length = 329 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 64/192 (33%), Gaps = 28/192 (14%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 NA+ FL+ L +GLI Y + K + Sbjct: 124 YPWNAVKNFLEKFVQGLDIGPTKTQVGLIQYGNEPRVVFNMNKFKTKEEMVQETSRTNQN 183 Query: 289 LKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A++ A Q S K ++ +TDGE+++ + Sbjct: 184 GGSLTNTFKAIEFARQNAFSLASG-------------GRPHATKVMVVVTDGESHDGSNL 230 Query: 348 VNTIKICDKAKENFIKIVTISI-NA-------SPNGQRLLKTCVSSP--EYHYNVVNADS 397 I C++ + I I++ + N +K SSP +Y +NV + + Sbjct: 231 KKVIDQCEQ---DNIIRFGIAVLGYLLRNELDTKNLITEIKAIASSPTEKYFFNVSDEAA 287 Query: 398 LIHVFQNISQLM 409 L+ + + + Sbjct: 288 LLEKAGTLGERI 299 >gi|149019071|gb|EDL77712.1| procollagen, type XII, alpha 1, isoform CRA_c [Rattus norvegicus] Length = 1721 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ + + + + Y+ V+ + + +K Sbjct: 1174 DNFNKVVKFIFNTVGAFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 1233 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 1234 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 1276 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 1277 -EVKKAAFIIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 1329 Query: 406 SQLMVH 411 ++ Sbjct: 1330 EDNLIT 1335 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 23/182 (12%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ I ++ + V + L Y+ + +K Sbjct: 52 RTVRSFISRIVEVFEIGPKRVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGN 111 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ Q + +K + +TDG++ + Sbjct: 112 TLTGMALNFIRQQSFKTQAGM-------------RPRARKIGVLITDGKSQDD-----VE 153 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 K K+ +++ I I + + LK + P+ + YNV + +SL + +++ + Sbjct: 154 APSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDIHAYNVADFESLSKIVDDLTINL 211 Query: 410 VH 411 + Sbjct: 212 CN 213 >gi|330448513|ref|ZP_08312161.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492704|dbj|GAA06658.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 899 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/333 (11%), Positives = 88/333 (26%), Gaps = 50/333 (15%) Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL 165 L+ S + + ++ ++T + L + T + +Y + + Sbjct: 328 LASTSHHYQVDGVVVQGDNGDNTITGTNGSDLLIGDLTPPQAEQQPVNINYVIDISGSMY 387 Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 R E+ + S + D A + D Sbjct: 388 YGRMLTLDAVKDHVAKSYEVYVDSNSTLTASDGTKLSDRPGWVTVSYDQLKAGLQYDAGS 447 Query: 226 TK----------------------MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 M +K + I K + ++ ++ V Sbjct: 448 RAGIDIKASDGEYFHTKFDALPYLMDIVKQSYQTLTQEILHNIDDKSKLEFNIVTFSNAV 507 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 N + + Q+V + ++ LT+ F Sbjct: 508 RGNTTFHYD-DSTHQFVNKQNVTIENY-----------IHDLTAGGGTQFEWPLKDASAH 555 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 I + + FL+DG++ + + +IV+I + S + +++ + Sbjct: 556 ITDSSKRNVVYFLSDGKDEDKLDTTGIHFL------KGTEIVSIGVGPSADAKQMGEIAQ 609 Query: 384 SSPEY---------HYNV-VNADSLIHVFQNIS 406 Y + V N + L +F NI Sbjct: 610 MGTGYDKDNPNAPSYSKVITNGNELNDIFHNIG 642 >gi|294632153|ref|ZP_06710713.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] gi|292835486|gb|EFF93835.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] Length = 403 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 42/130 (32%), Gaps = 6/130 (4%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + ++ + P + T A K A L+ + + + I Sbjct: 76 DDRKEGCKDTAQLYPVGPLNRTEA-KTAVATLSPTGWTPIGPALLKAATDLEGGDGTRRI 134 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHY 390 + ++DGE+ + ++ + I I T+ + + L ++ + Sbjct: 135 VLISDGEDTCQPLDPC--EVAREIAAKGIGLTIDTLGLVPDAKTRDQLSCIADATGGTYT 192 Query: 391 NVVNADSLIH 400 +V + D L Sbjct: 193 DVRHKDELSD 202 >gi|148360718|ref|YP_001251925.1| type IV fimbrial biogenesis PilY1-like protein [Legionella pneumophila str. Corby] gi|296106216|ref|YP_003617916.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] gi|148282491|gb|ABQ56579.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila str. Corby] gi|295648117|gb|ADG23964.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] Length = 1169 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/250 (10%), Positives = 69/250 (27%), Gaps = 21/250 (8%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P++ + + + + + + + ++ Sbjct: 292 PSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVNN 351 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +++ L H+K + + + R ++ T S Sbjct: 352 AINAF--LPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTV-------GTTSGNC 402 Query: 298 MKQAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKI 353 ++ Y IL SD + + + F DG + N ++ + I Sbjct: 403 PQKQYIILISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINE 462 Query: 354 CDKAKENFIKIVTISINASPN------GQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K + + I + A + L+ + +Y + ++L+ +I Sbjct: 463 IKALKNDGVLTFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPEALVSSLNSI 522 Query: 406 SQLMVHRKYS 415 + + +S Sbjct: 523 LSNIQNGSFS 532 >gi|66793453|ref|NP_001019751.1| vitrin [Gallus gallus] gi|56744182|dbj|BAD81032.1| Akhirin [Gallus gallus] Length = 748 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 57/195 (29%), Gaps = 29/195 (14%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQY 279 + + L + D +G I YT + K V Sbjct: 572 GSSSVGTSNFRTVLQFVANISKEFEISDTDTRIGAIQYTYEQRLEFSFDKYSTKQDVLSA 631 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + R T + A+ A + L + K +K +I +TDG Sbjct: 632 IKRINYW--SGGTSTGAAISYASEQLFTKSK----------------PNKRKIMILITDG 673 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADS 397 + + A +N + +I + + Q L+ + P+ + + V D+ Sbjct: 674 RSYDDVRMPAL-----TAHQNGVIAYSIGV--AWAAQDELEAIATDPDKEHSFFVDEFDN 726 Query: 398 LIHVFQNISQLMVHR 412 L + Q + Sbjct: 727 LYQFVNPLIQNICTE 741 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 41/145 (28%), Gaps = 22/145 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTD 293 ++ L MG++ Y ++R + + ++ Sbjct: 384 QFLINVAHALGINNAGPLMGIVQYGDDPSTEFNLKTYASPKELRNAIEKIP--QKGGLSN 441 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + SD + ++ + DG + + Sbjct: 442 VGKALSFVNKNFFSDANGNRGGA-------------PNVVVVMVDGWPTD-----RVEEA 483 Query: 354 CDKAKENFIKIVTISINASPNGQRL 378 A+E+ I I ++I A+ ++ Sbjct: 484 SRLARESGINIFFVTIEAAAQNEKQ 508 >gi|189465624|ref|ZP_03014409.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224539998|ref|ZP_03680537.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] gi|189437898|gb|EDV06883.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224518388|gb|EEF87493.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] Length = 348 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 69/223 (30%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLVAQLVDK-------MENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESINPSLISKQGTAIGAAINLATRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + +K I++ + + Sbjct: 192 AVIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPDGAPIPVEGTNDFRRDRDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N Q + + + V N+++ V I+Q + Sbjct: 248 NVVVTRLNEQMCQEIAQAGDGIYVRVDNSNAAQKV---IAQEI 287 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/254 (13%), Positives = 70/254 (27%), Gaps = 45/254 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 NQ Q+ V+ + + + L++D SGSM Sbjct: 16 ANQSNTQRQVAISLSAVSESSDRTLPLNLGLILDHSGSM--------------------T 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + +K A + ++S+ + ++ + + + + + ++ + Sbjct: 56 GKPIKTVKEAAIRLVESLGSGD------RLSVVAFDHKAKVIVPNQP-IDDIKTVNQQIQ 108 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T MK + + K I LTDGE N Sbjct: 109 RLEPAGGTCIDEGMKLGIKEVALGKDDRVSQ-----------------IFLLTDGE-NEH 150 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N +K+ A E I + T+ N L + + + I F Sbjct: 151 GDNERCLKLAQVAAEYNITLNTLGFGNHWNQDVLESIADAVGGTLCYIEQPEQAITEFIR 210 Query: 405 ISQLMVHRKYSVIL 418 + + + Sbjct: 211 LFTRIQSVGLTNAY 224 >gi|110679844|ref|YP_682851.1| hypothetical protein RD1_2615 [Roseobacter denitrificans OCh 114] gi|109455960|gb|ABG32165.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 181 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 22/55 (40%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 FR Y ++ E ++ F + V + + + + ++ ++ + A+ Sbjct: 5 FRTYLRRFRREEDGQIAVEFVILVPLVFTIFMTAMELGIYSMRQMWLDRGLDIAV 59 >gi|149913213|ref|ZP_01901747.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] gi|149813619|gb|EDM73445.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] Length = 512 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 15/177 (8%), Positives = 49/177 (27%), Gaps = 4/177 (2%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ + E S+ +V+ LLL G + + + ++ + A AG ++ Sbjct: 10 RTSRRILHQEDGAGSVFGIFAVVMILLLGGVALDATNLWRYQQMLKQTADVAAHAGTVQL 69 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S + + + + + + + ++ Sbjct: 70 ASGGDATNAYNAAFALVEANMPQSWYGNLF-ANPQADIEVVHYDPDTDLLGGSGPLNAVA 128 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 +T + S N + + + + + + +F+ + Sbjct: 129 VTLRRDGD---SGNPVPTYLLRIADIMNMGDSTDLSQWNVKTRGVAAFVGTKKCVST 182 >gi|56696061|ref|YP_166415.1| hypothetical protein SPO1165 [Ruegeria pomeroyi DSS-3] gi|56677798|gb|AAV94464.1| conserved domain protein [Ruegeria pomeroyi DSS-3] Length = 257 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 57/234 (24%), Gaps = 54/234 (23%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 +V D SGSM + + R ++ + AL L I L + Sbjct: 41 DAMIVFDASGSMAEMGYNGLD------------RPRILDAREALHDALPRIAALRRLGLV 88 Query: 252 VYMGLIGYTT----RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 Y + ++ + Q + T T A+ A ++ Sbjct: 89 TYGAAMDGDADGDLCKRVSMPFTPSPNAAGQILNLIDAIEPDGNTALTDAVNLAARVF-- 146 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVT 366 P I+ +TDG+ + D A++ + + Sbjct: 147 -----------------DQPPRPGVIVLVTDGDETCGGAPCAL--AADLARDTPGLTVHV 187 Query: 367 ISI----------NASPNGQRLLKT------CVSSPEYHYNVVNADSLIHVFQN 404 I N ++ + + + LI Sbjct: 188 IGFRVRSQFFGWEGGDGNPTVPETISPAECLAQATGGEYVSTETVEELIRALNQ 241 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/276 (12%), Positives = 67/276 (24%), Gaps = 32/276 (11%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+L SR + +A + + A+ + I + ++ ++A+ AA Sbjct: 1 MNL-SRMTRRVAQPVAGQS---LVFSAILFFVMIAFAALAIDTGEAFSRQRQQQAASTAA 56 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE---------------- 104 +AG M S + + A A +I Sbjct: 57 SIAGLESMNSEIDGTDGAVQQAIRDALAANGITNAVYINGDWGNLDPSQNYYRAFYTQRG 116 Query: 105 --------SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 S S F ++ + I + + L+ S T+ Sbjct: 117 SRVEYPVGSGGQVSTEFNGLRVEVRSARNTIFGQALGIDTLEVSAENKATLCQCATNIFP 176 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA----MNSDPE 212 + + + E+ F+ LSG Sbjct: 177 LAVATQKMTGMLPGESKPIAWTKNKVGSSVEKDYFVWAEWQTLSGFNADNRLKESLGGTG 236 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 DV A N ++ D I + Sbjct: 237 DVIKGVTEATAPPGYANASNNGVINIDDWIKVSDST 272 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 13/194 (6%), Positives = 49/194 (25%), Gaps = 15/194 (7%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + + +Q+ S + + ++D+ + Sbjct: 374 TTSYDISLVVDISGSMQWCYDSQRTCSVDANARWYRVKDFLAKFSYKMLDVWNAPAGQNM 433 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 ++ + ++AA++ + ++ + + + Sbjct: 434 NNASLFPGEALVGKGGDNRIAAVRFSGNAVT-------SSPSFGFVTSPAGSDQASVSAR 486 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + ++ + + T +++ + V P Sbjct: 487 TTTMRSNMNSLISWITKANMSGSTSGGRGLREGIRYF--------DNVSAHTRVDRFGRP 538 Query: 329 FQKFIIFLTDGENN 342 + ++ LTDG N Sbjct: 539 IKLVMVMLTDGLTN 552 >gi|327458562|gb|EGF04912.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1] Length = 463 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/341 (7%), Positives = 86/341 (25%), Gaps = 31/341 (9%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K R I ++ + + +N+ ++ + + T Sbjct: 131 FKTEGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAIAYRTD 190 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIV----SFIPALLRIEMGERPIFLIELVVDLS 200 + + + + + + + + + E I ++ + ++S Sbjct: 191 PIQGQMNIAVSFVFDTSGSMDWDLQGRNVNPNSGTESRMTILRKKAEIMIKDLKGIGNIS 250 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 ++ +S + T + +K+ L D + + L Sbjct: 251 VNLVGFSSSGKYIQKEFSNLDNGADTIIGTIKDPKKLVPDGVTNPGDGLRYGLISLQSQP 310 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFR 319 +++ + + G + T A + T Sbjct: 311 AQLKYVVLLTDG----------------IPNTYIVDPSALYAGNRVDHGNAVGRVTFNNP 354 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI---KIVTISI-NASPNG 375 +L ++ + ++ + +K+ + ++ I Sbjct: 355 IYDVSTNLGYEYRRLGYDLYSRDSVTRENAITYAGEVSKKFGVGVKRVNVIGFSGVDKEI 414 Query: 376 Q--RLLKTCVSSPEY---HYNVVNADSLIHVFQNISQLMVH 411 + L + + + N +L F +I + + Sbjct: 415 AYGKDLTNSIGKGGMETSYESATNEAALQKTFSDIKKQIQQ 455 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/289 (11%), Positives = 82/289 (28%), Gaps = 25/289 (8%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN----NTIFYNMDVMTS 153 H+ ++ + + S I + + D + + V+ + Sbjct: 103 NVPHVAVIVTDGRSSDSVDQAALETRQSGIVLYAVGVGNYDLGQLTDIASTNETLGVVDN 162 Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 ++ LL+ + +F+ L + I + L + N+ + Sbjct: 163 FNLLDDVRNSLLSSVCSVLNDNFLTTLTIQDCNSDHISITMPCYTLLEKVTPPCNNPVDI 222 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V R ++ + + + + +G++ Y++ V + + Sbjct: 223 VFVLDGSGSVGRRNFEKVQAGVKKIV---GDFNIALDSTRVGVVQYSSIVRQEFALDTFS 279 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + T + AM+ A Q + I Sbjct: 280 NLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFTSANG-------------ARPDVGHVI 326 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + +TDG + + S + KAK+ I + + I +L + Sbjct: 327 VLVTDGRSYDDVS-----QASQKAKQAGIVVFAVGIGDGAVESQLNQIA 370 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 50/164 (30%), Gaps = 20/164 (12%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + ++ Y++ V + V + + T + A+ + + Sbjct: 36 GPSNTRVAVMQYSSSVRQEFALDAFNTLEDLLVGIEEIRYMRGGTRTGKALTRLRRQGFL 95 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + +TDG S+ + + + +++ I + + Sbjct: 96 ESNG-------------ARKNVPHVAVIVTDGR-----SSDSVDQAALETRQSGIVLYAV 137 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + GQ L S+ E V N + L V ++ + Sbjct: 138 GVGNYDLGQ--LTDIASTNETLGVVDNFNLLDDVRNSLLSSVCS 179 >gi|209886335|ref|YP_002290192.1| hypothetical protein OCAR_7223 [Oligotropha carboxidovorans OM5] gi|209874531|gb|ACI94327.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 610 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 15/149 (10%), Positives = 43/149 (28%), Gaps = 4/149 (2%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R + ++ A+E+ N + I A ++ + V K +SA + A + Sbjct: 61 ARGQGLLRRFAANERGNVAFIAAACMVLVTGCAALGVDVGAVFTDKRRAQSAADLAAIVA 120 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH---IKESLSGYSAVFYNTEIQNI 121 AS + + + + N ++ S + + Sbjct: 121 ASDLSRASRAAAATVAKNHYPPDSLVAVETGVYTANTSLGPQQRFSAGTTPASAVRVTMQ 180 Query: 122 VNSSRISMTHM-ANNRLDSSNNTIFYNMD 149 + + + ++ ++ Sbjct: 181 TRTPLFFGKVLTGDPTVNLRVTSVATTTQ 209 >gi|74224883|dbj|BAE37940.1| unnamed protein product [Mus musculus] Length = 633 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 56/186 (30%), Gaps = 27/186 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +++ V + + + Y+ V+ + + +K Sbjct: 301 DNFNKVVKFIFNTVGAFDEVNPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRY 360 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ +K ++ + K ++ +TDG + + Sbjct: 361 RGGNTRTGKAL-----TFIKEKVLTWESGM--------RKNVPKVLVVVTDGRSQD---- 403 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 K +++ + + + + L S P + + V + +S F+ I Sbjct: 404 -EVKKAAFVIQQSGFSVFVVGVADVDYNE--LANIASKPSERHVFIVDDFES----FEKI 456 Query: 406 SQLMVH 411 ++ Sbjct: 457 EDNLIT 462 >gi|146304257|ref|YP_001191573.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702507|gb|ABP95649.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 383 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 35/105 (33%), Gaps = 3/105 (2%) Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + L++ + +FFT ++I LTDG + + N Sbjct: 94 EDLTAEISSLSAGGQTAFFTALLTAFNLHNKHGIPSYVILLTDGNPTD---DTNVETYKR 150 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 A N ++ ++ + N L S Y+V +A + Sbjct: 151 IAIPNGVQTISFGLGDDYNETILKSLADRSGGVFYHVNDAMEIPE 195 >gi|156383257|ref|XP_001632751.1| predicted protein [Nematostella vectensis] gi|156219811|gb|EDO40688.1| predicted protein [Nematostella vectensis] Length = 161 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 46/156 (29%), Gaps = 22/156 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + E ++GL+ Y++ + ++ + T + Sbjct: 27 NFMINIVKSFKISSERTHVGLVLYSSFTQLKFNFDKYSDSASIVKAINTTDYPKGGTRTG 86 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K A L RS K +I LTDG S+ Sbjct: 87 EALKMAKSQLFGASMRS----------------VPKVLIVLTDGR-----SSDKVEAPSK 125 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHY 390 K+ + I + + + L S ++ + Sbjct: 126 ALKDEGVVIFAVGVGDQIDPSELNVMASDSKSDHVF 161 >gi|329850149|ref|ZP_08264995.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] gi|328842060|gb|EGF91630.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] Length = 399 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 38/109 (34%), Gaps = 3/109 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R E ++IFA++ + L+ IG + D KN +++A +A LAGA Sbjct: 3 LRKLLN-MWRCESGQSAMIFAIAAVPLLVAIGGALEYSDMSGLKNRLQTAADAGALAGAG 61 Query: 67 --KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + S R +I D A + +A Sbjct: 62 RLSLASTTGDEDARRVAIRTAQDNLKGDSATFTVDIDRTAGTLTVNAAA 110 >gi|323700441|ref|ZP_08112353.1| hypothetical protein DND132_3035 [Desulfovibrio sp. ND132] gi|323460373|gb|EGB16238.1| hypothetical protein DND132_3035 [Desulfovibrio desulfuricans ND132] Length = 389 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 57/150 (38%), Gaps = 12/150 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA------- 59 R +F++ +E +++ +L + + + L + + + K++++++A +A Sbjct: 1 MRTHFRRP-DNESGFATVMVSLCMAALMGLTALAVDLGRAYLKRSALQTAADAGALAGAN 59 Query: 60 AILAGASKMVSN---LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 ++LA + ++ R + ++ +AL D F+++ + + T Sbjct: 60 SLLAAGKDLDKLRLIVTNYTTRNLTDADGPAKALTDADIVFLRDGVPDEEQPNQVEVTVT 119 Query: 117 EIQNIVNS-SRISMTHMANNRLDSSNNTIF 145 N+ + +D + Sbjct: 120 LSGERENAFPLYFGKAVGKPAMDIVVTSRA 149 >gi|291386938|ref|XP_002709809.1| PREDICTED: vitrin [Oryctolagus cuniculus] Length = 869 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 47/384 (12%), Positives = 98/384 (25%), Gaps = 51/384 (13%) Query: 51 NSMESANN---AAILAGASKMVSNLSRLGDRFE-SISNHAKRALIDDAKRFIKNHIKESL 106 + A + A L G + N + S K A+ +R ++ ++ Sbjct: 508 TDVAQALDIGPAGPLMGVVQYGDNPATQFSLKTHMNSRDLKAAIEKITQRGGLSNAGRAI 567 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 S + F++ N + +++ + D + + + + Sbjct: 568 SFVTKNFFSKVNGNRGGAPNVAVVLVDGWPTDKVEEASRLARESGINIFFI-----TIEG 622 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS--DPEDVNSAPICQDKK 224 N+K P + R V +H + C Sbjct: 623 AVENEKQYVVEPNFANKAV-CRTNGFYSFNVQSWFGLHKTVQPLVKRVCDTDRLACSKTC 681 Query: 225 RTKMAA--------------LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + L + + D +G + YT Sbjct: 682 FNSADLGFVIDGSSSVGTGNFRTVLQFVANLSKEFEISETDTRIGAVQYTYEQRLEFGFD 741 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 K T + A+ A + L K + Sbjct: 742 KYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF----------------KKSKPNKR 785 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EY 388 K +I +TDG + + +V + K + I + + Q L+ + P ++ Sbjct: 786 KLMILITDGRSYD---DVRIPAMAAHHK--GVITYAIGV--AWAAQDELEVIATYPAKDH 838 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 + V D+L I Q + Sbjct: 839 SFFVDEFDNLYKFVPRIIQNICTE 862 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 35/137 (25%), Gaps = 22/137 (16%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D L MG++ Y + ++ + + ++ Sbjct: 505 QFLTDVAQALDIGPAGPLMGVVQYGDNPATQFSLKTHMNSRDLKAAIEKITQR--GGLSN 562 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ + S + + L DG + + Sbjct: 563 AGRAISFVTKNFFSKVNGNRGGA-------------PNVAVVLVDGWPTD-----KVEEA 604 Query: 354 CDKAKENFIKIVTISIN 370 A+E+ I I I+I Sbjct: 605 SRLARESGINIFFITIE 621 >gi|221132796|ref|XP_002166108.1| PREDICTED: similar to fibrillar collagen [Hydra magnipapillata] Length = 2213 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 29/77 (37%), Gaps = 4/77 (5%) Query: 335 FLTDGENNNFKSNVNTIK---ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 LTDG S + + K+ ++I ++ I + LL S + Sbjct: 91 VLTDGRQTRGSSAPDAVDLHVASRPLKDIGVQIYSLGIGRDYDIGELL-DIASDDASVFR 149 Query: 392 VVNADSLIHVFQNISQL 408 + D L+ + +I++ Sbjct: 150 SSDVDELVSIVASITET 166 >gi|126316414|ref|XP_001380743.1| PREDICTED: similar to integrin alpha 2 subunit [Monodelphis domestica] Length = 1214 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 66/193 (34%), Gaps = 30/193 (15%) Query: 230 ALKNALLLF-LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +A+ F + + L + +GLI Y S K + V ++ Sbjct: 220 YPWSAVKNFLVKFVQGLDIGPDKTQVGLIQYGNYPRVVFNMSTFKTK-EEMVKATSQTIQ 278 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A F K ++ +TDGE+++ Sbjct: 279 HGGDLTNTFKAIQFAR-------------EFAYSEASGGRPSATKVMVVVTDGESHDGSF 325 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNAD 396 I C+ ++ I I++ N L +K S+P Y +NV + D Sbjct: 326 LKEVIGQCN---DDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVSDED 382 Query: 397 SLIHVFQNISQLM 409 +L+ + + + Sbjct: 383 ALLEKAGTLGERI 395 >gi|302036308|ref|YP_003796630.1| hypothetical protein NIDE0941 [Candidatus Nitrospira defluvii] gi|300604372|emb|CBK40704.1| conserved exported protein of unknown function, contains von Willebrand factor, type A and vault protein inter-alpha-trypsin domain [Candidatus Nitrospira defluvii] Length = 712 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 74/247 (29%), Gaps = 51/247 (20%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + ++ +P E R I ++D SGSM Sbjct: 331 DGTSYAMLMLVPPTQHRETTARVPRDITFIIDRSGSMA--------------------GA 370 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDM 284 + K +L L + +I + V T+ ++Q + Sbjct: 371 SIEQAKGSLTAALSRLTTQDRFN------IIQFNHTVRSLFPIPQPVTTKSMQQAIRYTE 424 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ PA++QA + + + II +TDG+ N Sbjct: 425 HLAADGGTEILPALRQALKSPQDSAR-------------------LQQIILITDGQVGNE 465 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ ++ TI I ++PN + K + + N + + Sbjct: 466 EELFELLHQRVGSR----RLFTIGIGSTPNSHLMRKAAETGRGTFTYIGNVNEVKDKLDG 521 Query: 405 ISQLMVH 411 + + + H Sbjct: 522 LFRKLEH 528 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/317 (11%), Positives = 83/317 (26%), Gaps = 55/317 (17%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF- 160 + + S + V S ++ YR+ Sbjct: 170 VGHAQSVVQLAIDDERRITEVKSPTHTLALEGVGTEHVRVRATGRGSAGALELHYRVDGG 229 Query: 161 ----IEHL-LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + + ++ +++ + + E + LV+D SGSM A + +D Sbjct: 230 DWALSAFVHRDAGHDAYLIANLATPPGLAATEIADKDVTLVLDRSGSMSGAPLARAKDAA 289 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGT 273 A + + ++ + + V+ Sbjct: 290 KAVVARLGDGDRVNV--------------------------MAFDDGVDALFLRPVPISA 323 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 E+ Q V TD A+ +A + + + I Sbjct: 324 ERRSQAVEYIDRLSDGGGTDLAGALAEALDAQHPSESEA------------DTGSRPHVI 371 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 +FLTDG++++ + A+ + TI + L + + Sbjct: 372 LFLTDGQSDSQATLQVARGDAGDAR-----VFTIGVGDGVEKPLLARLASEKRGRFTFIA 426 Query: 394 NADSLI----HVFQNIS 406 + + ++ I+ Sbjct: 427 SPSEIERKVSRLYSEIA 443 >gi|330811037|ref|YP_004355499.1| hypothetical protein PSEBR_a4091 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327379145|gb|AEA70495.1| Hypothetical protein; putative exported protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 426 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 47/185 (25%), Gaps = 8/185 (4%) Query: 42 YVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 K +++A +AA L GA + + + + + A Sbjct: 21 DGGHMLLNKTRLQNAVDAAALGGAKTLSQVTGGMNMASTTRAAALDTLSRN-ASAVGNAE 79 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + +++G F E+ + V ++ + + Sbjct: 80 LATAVAGNPGAFAAVELSSSVYGPFSY---PGPTDAKYVRVSVANYQLNGFFWSFV---- 132 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + K V+ I + + + D S A N + + Sbjct: 133 QSMGSGGLGGKAVAAIATAGPSPTSPCDVAPLMVCGDASQYDPAAGNFWSYHFGDLVVLK 192 Query: 222 DKKRT 226 Sbjct: 193 TAAGN 197 >gi|327270794|ref|XP_003220173.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 917 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 67/195 (34%), Gaps = 34/195 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV- 276 Q K +++ L A LFL L ++E ++G++ + ++ + Sbjct: 321 ASAQMGKDNRLSRLIQAAKLFL-----LQIIEEGSWVGIVAFNSKGNIQAGLQKVFSDIE 375 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ +T + + + + A+Q+ + + I+ L Sbjct: 376 RESLTSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE----------------IVLL 419 Query: 337 TDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 T+GE ++ C K + I I TI+ + L K + + Sbjct: 420 TNGEGSDLSP-------CLSKIQSQEIIIHTIAFGSKA-SNELEKLADMTGGKTFYATDS 471 Query: 394 -NADSLIHVFQNISQ 407 +++ LI F IS Sbjct: 472 LDSNGLIDAFGGISS 486 >gi|323341955|ref|ZP_08082188.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322464380|gb|EFY09573.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 162 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/167 (10%), Positives = 42/167 (25%), Gaps = 6/167 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 E N I+FA +M L + V+ KK+ + A L + Sbjct: 1 MVNFKNEEDGNILIMFAFILMFLLGFVALSTDVVLAMNKKDKL---TEVAYL--VREARL 55 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +++ + A + KR + + L + +++ + + Sbjct: 56 DITEEMLHSQQGQQIALDNAHEILKRNGIDPSQFELVWNEQRGMSPNGNYMISLTGTIIL 115 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 I F + ++ + Sbjct: 116 RDVYKTTTLKALGIDEFPI-KVELPVWQNFTDQQPIWDGRTPTINGV 161 >gi|159465497|ref|XP_001690959.1| hypothetical protein CHLREDRAFT_188557 [Chlamydomonas reinhardtii] gi|158279645|gb|EDP05405.1| predicted protein [Chlamydomonas reinhardtii] Length = 434 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 15/186 (8%), Positives = 42/186 (22%), Gaps = 53/186 (28%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 ++A ++ +D + + +G++ Y+ V ++ T R Sbjct: 179 SGSMSGERIALVRETCHFLIDQLTPDDY------LGIVSYSGGVRADVPLLRMTPAARGL 232 Query: 280 VTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +D T + + Sbjct: 233 AHAMVDALEADGSTALYDGLVAGVRQ---------------------------------- 258 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + + T A + + L + +Y + D + Sbjct: 259 ------------QMEAEAPTDQHVTVHTFGFGAGHSVELLQAVADAQSGVYYYISCVDDI 306 Query: 399 IHVFQN 404 F + Sbjct: 307 PSGFGD 312 >gi|13928960|ref|NP_113879.1| integrin alpha-D precursor [Rattus norvegicus] gi|48428189|sp|Q9QYE7|ITAD_RAT RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|6648592|gb|AAF21241.1|AF021334_1 alpha D integrin [Rattus norvegicus] Length = 1161 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/354 (10%), Positives = 104/354 (29%), Gaps = 45/354 (12%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 +++ F ++ + + + + ++ T + ++ Sbjct: 20 SNLDVEEPIVFREDAASFGQTVVQFGGSRLVVGAPLEAVAVNQTGRLYDCAPATGMCQPI 79 Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI--------FLIELVVD 198 + + L+ N ++++ P R + ++ + Sbjct: 80 VLRSPLE-AVNMSLGLSLVTATNNAQLLACGPTAQRACVKNMYAKGSCLLLGSSLQFIQA 138 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + SM + + + A +K+ + + S L+ Sbjct: 139 VPASMPECPRQEMDIAFLIDGSGSINQRDFAQMKDFVKALMGEFASTS-----TLFSLMQ 193 Query: 259 YTTRVEKNIEPSWGTEKV--RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 Y+ ++ + + + + V + T + ++ + L K S Sbjct: 194 YSNILKTHFTFTEFKNILDPQSLVDPIVQLQ--GLTYTATGIRTVMEELFHSKNGS---- 247 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINA---S 372 +K ++ +TDG+ + + I DKA I I + Sbjct: 248 ---------RKSAKKILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQE 295 Query: 373 PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM-----VHRKYSVILK 419 P + L T S+P ++ + V N +L + + + + + + S + Sbjct: 296 PTALKELNTIGSAPPQDHVFKVGNFAALRSIQRQLQEKIFAIEGTQSRSSSSFQ 349 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 37/353 (10%), Positives = 94/353 (26%), Gaps = 59/353 (16%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 AG+S+ +S + + +A + + I I S NT + Sbjct: 471 AGSSQDSIVISSQTENRVDLYYNAIAEEKSEDEVTITGAIPRSYFRSVKADDNTTPGKVF 530 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + AN + + + + + ++++ + + Sbjct: 531 PTKTAEWVDEANGIGKINFTIYGNPIRRGSDVILVIDSSGSMEGEKWSTAKTAAKGFIDN 590 Query: 183 IEMGERPIFL---IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + + I +V S + + + + + K + K+ L ++ Sbjct: 591 LYQNKDGVVSDDRIAIVDFDSSAKAYPGTNSGSETFLKVDDKITIKNKTYSAKDYLKSYV 650 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D TD A++ Sbjct: 651 ------------------------------------------LDSQMKDTGGTDYNKALQ 668 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A ++ + + S P+ + DG+ + K Sbjct: 669 TAQSVINNRRDSSRPAYIVFMSDGEPNGYWDWLTYRYYDGQKY-----------ATELKS 717 Query: 360 NFIKIVTISINASPNG-QRLLKTCVSSPEYHY--NVVNADSLIHVFQNISQLM 409 + + I ++ +N + + S P Y N+V L+ ++ I+ + Sbjct: 718 DGVTIYSLGLNIGSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIYDAIASSI 770 >gi|149921119|ref|ZP_01909577.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] gi|149818006|gb|EDM77465.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] Length = 428 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/303 (9%), Positives = 58/303 (19%), Gaps = 92/303 (30%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 V + + ++ S ++ Sbjct: 96 VPPQVMLVVDASGSM----------------------------------VNNSWDHDLDP 121 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE---KN 266 + V T M A+ + + M Y + Sbjct: 122 NTPQVTRWNTLHGVVSTVMDNFGPAMYAGIQRFPSEEACPDATPMSSNCYNSGSCIVGTQ 181 Query: 267 IEPSWGTEKVRQYVTRDMDSLIL-----KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 E + + T +T M A L + Sbjct: 182 PEVGVSLDNGASVIAAIPGPTAGNTEIVGGTPATKGMNSAVSHLEQQPEAF--------- 232 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKS----------NVNTIKICDKAKENFIKIVTISI-- 369 ++++ +TDG N ++ + A + I + I Sbjct: 233 --------PRYVLLITDGAANCDQALSFPDYIEQYDETLPTTVQAAFDGGITTFVVGIDI 284 Query: 370 -----NASPNGQRL------LKTCVSSPE----------YHYNVVNADSLIHVFQNISQL 408 +G L + YN N L+ Q I Sbjct: 285 EDMLQGVGTDGSPEANPFERLNDVAIAGGAPKNEGMDLEKFYNTTNQQELLDAIQAILGE 344 Query: 409 MVH 411 + Sbjct: 345 VTD 347 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 8/61 (13%), Positives = 30/61 (49%), Gaps = 1/61 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E+ F+++ + + + ++G + + + +++ ++ + A LAG ++ + S+ Sbjct: 21 RRRERGAFAMMTIIFMTVMIAVLGM-LDIGNVFFQRRDLQRIADMAALAGVQRLDATCSQ 79 Query: 75 L 75 Sbjct: 80 A 80 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 55/202 (27%), Gaps = 22/202 (10%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSF--LLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++R R + + S++ +I+FAL +M L I + + H K +++ Sbjct: 1 MARLRSIKR--LHSQRGVVTILFALVLMVLVGL--IALAVDLTRLHLVKAELQN------ 50 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG-----YSAVFYNT 116 A+ + S+ A A+ F + + + V Sbjct: 51 ---AADAAALAGAGSLIDTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQEQDVNVAI 107 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY--DYRLQFIEHLLNQRYNQKIV 174 + + ++ N + + S+ + + I Sbjct: 108 QPGYWNLITPSFTSNTGLVTHTGDGNIPAVQVTITLSHLKFFFAPILGIPEGTVQATAIA 167 Query: 175 SFIPALLRIEMGERPIFLIELV 196 + P + I Sbjct: 168 AVSPPTGGTGLFPMAIGGCLFN 189 >gi|15897956|ref|NP_342561.1| hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|284173931|ref|ZP_06387900.1| hypothetical protein Ssol98_04630 [Sulfolobus solfataricus 98/2] gi|13814281|gb|AAK41351.1| Hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|261602668|gb|ACX92271.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 380 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 60/223 (26%), Gaps = 45/223 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L+ + + H + D + K+ K + I Sbjct: 24 KVLLVPEKLGSATGFHYIVALDTSGSMTGY--------KIELAKQGAIELFKRI------ 69 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + I +++ V E + + T A+ A + Sbjct: 70 PNGNKVSFITFSSNVNVIKEFVDPLD----LTNEILQITAGGQTALYTAILTANSLAKK- 124 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 +++ LTDG + + N +K+ K I++ + Sbjct: 125 ------------------YQMPTYLLLLTDGNPTDETNIGNYLKLPYYEK---IQVYSFG 163 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSL-----IHVFQNIS 406 I N Q L + Y++ +A+ + I+ Sbjct: 164 IGDDYNEQLLQSVSDKTGGVMYHISDANEIPQKLPQKAVTQIA 206 >gi|321460552|gb|EFX71593.1| hypothetical protein DAPPUDRAFT_255504 [Daphnia pulex] Length = 983 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 36/328 (10%), Positives = 99/328 (30%), Gaps = 53/328 (16%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + SN+ ID A + + + + ++S +N Sbjct: 222 DMTSNNNAACKIDPATNSYDQNCRFKFESQFKPTTSLASYHQLDSVVHFCKESEHNAAAQ 281 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + S + + + ++ ++ D+ Sbjct: 282 NKHNDMCERFDTWSVIMESD--DFKNGNGVPEDLHELATDFKIVKPTGTRFVVVM---DI 336 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM ++ L ++ ++ + ++ ++G++ + Sbjct: 337 SGSMKEFD-------------------RIGKLSESIRSWIKT-----DLRNGSHLGMVQF 372 Query: 260 TTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 + E E + + + + + L T ++ A Q+L + Sbjct: 373 SATAEILSELTMISDEKSREEMIAKLPKQLQA-ATCIGCGLELAVQMLNEN--------- 422 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 + I+ +TDG+N+ +++ ++ + + I+++TI+ + Sbjct: 423 ------KGTSETGGVIVLVTDGKNSPGYLHISDVQ--EDILKAKIRVITIAFGEKADKN- 473 Query: 378 LLKTCVSSPEYHYNVVNAD---SLIHVF 402 L + Y V + D +L F Sbjct: 474 LEDLARQTDGKSYFVKDEDGGAALQEAF 501 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/339 (6%), Positives = 70/339 (20%), Gaps = 51/339 (15%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 + ++I+ A + + + + L + Sbjct: 255 NGWDGNPTNRNNSYIEYGGEKEDADYAIRKFAKETATPGLF-------DLYLNVRGNTQK 307 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + + + + + + + + I + Sbjct: 308 NITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDTLADSGITDKI-------NMG 356 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 + + ++KN + S + + + Sbjct: 357 YVGYSIEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGH- 415 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 ++ + D + ++ + T F + + Sbjct: 416 ---------KKVIVLLTDGVPTFSYKVQ-------RVHAQSSSNYYGTQFSNTQDRPGNT 459 Query: 328 PFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINASPNGQRLLKTCV- 383 I D N + + + TI KE I+I + I + L Sbjct: 460 SLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEV 519 Query: 384 -----------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + +Y +A + + + Sbjct: 520 ESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 558 >gi|149925131|ref|ZP_01913438.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] gi|149813994|gb|EDM73632.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] Length = 413 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 34/347 (9%), Positives = 83/347 (23%), Gaps = 64/347 (18%) Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 +A E SG + T ++ +++ + V Sbjct: 22 AGEADGPGIGTESEDGSGDATDSTTTAEEDSGSTTAEEDE---STTGPDCGTVEITPEYV 78 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + ++N ++ + P + R + + + M + Sbjct: 79 PPNVMLVVDASGSMVNNSWDHDLDPNTPQVTRWNTLHGVVDNVMTNFGST------MYAG 132 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + SA C D +G T+ E + + Sbjct: 133 VQRFPSADACP---------------------DATPQSSNCYNLGSCIVGTQPEVGVG-A 170 Query: 271 WGTEKVRQYV--TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF-FTNFFRQGVKIPSL 327 + + + ++ I+ T +T + A L + + + + F G + Sbjct: 171 DNGDAILAAIPGPGAGNTEIVGGTPATLGINSAVDHLLAQNETNPRYILFITDGAANCNT 230 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-------NASPNGQRL-- 378 + + ++ I + I A +G Sbjct: 231 DLPYPEYIES-------YDETLPTTVEAAFEDEGITTYVVGIDIVDMLQGAGTDGSPEAN 283 Query: 379 ----LKTCVSSPE----------YHYNVVNADSLIHVFQNISQLMVH 411 L + +N N L+ Q I + + Sbjct: 284 PFVRLNDVALAGGAPKNEGMDAEKFFNTTNQQELLDALQVILEGVTE 330 >gi|301780868|ref|XP_002925836.1| PREDICTED: integrin alpha-D-like, partial [Ailuropoda melanoleuca] Length = 926 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 56/160 (35%), Gaps = 20/160 (12%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 LI Y+ ++ + + D + T + ++ Q L K Sbjct: 2 SLIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQLNGLTFTATGIRTVVQELFHSKNG--- 58 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS-- 372 +K +I +TDG+ ++ + + +A+ I I + + Sbjct: 59 ----------ARKTARKILIVITDGQK--YRDPLEYSDVIPQAERAGIVRYAIGVGDAFQ 106 Query: 373 -PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 ++ L T S P ++ + V N +L ++ + + + + Sbjct: 107 KLTARQELNTIGSKPSQDHVFRVDNFAALSNIQEQLQEKI 146 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 70/226 (30%), Gaps = 50/226 (22%) Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I R LV+D SGSM K K+ +K A+ + +D Sbjct: 35 DIMAQVRMPLNFSLVIDHSGSM--------------------KGAKLRNVKEAVKMVIDR 74 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ + + ++ + + I P+ ++ + R D+ T + M Sbjct: 75 LEPSDY------ISVVIFDDSAQVIIPSMPANDPVGMKAAIDRIQDA---GGTTMSLGM- 124 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 Q L ++ + R +I LTDG ++ A Sbjct: 125 --IQSLGELRRWNIPNAVSR-------------MILLTDGVTYGDTD--RCRQLARDAAA 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQN 404 I I + I A + L S + + + + +F+ Sbjct: 168 AGISIYPLGIGADWDENLLDDVGQLSGGTPAEFIRSPNDAMTIFEQ 213 >gi|229051629|ref|ZP_04195099.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] gi|228721740|gb|EEL73214.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] Length = 452 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 50/184 (27%), Gaps = 34/184 (18%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---------R 277 KM A K A+ +LD I + + E + S G+ +V Sbjct: 173 KMEAAKKAIYNYLDKI---PDNANVMLRVYGHKGSNNENDKSLSCGSSEVMYPLQPYNKE 229 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q+ T A++ + + + ++ Sbjct: 230 QFNAALSKFGPKGWTPLASAIE----------------SVNADFKEYTGEENLNVVYIVS 273 Query: 338 DGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVN 394 DGE + ++ + I + + Q+ LK + + V Sbjct: 274 DGEETCGGD---PVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYATVST 330 Query: 395 ADSL 398 AD L Sbjct: 331 ADEL 334 >gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus] Length = 889 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 74/226 (32%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL L+ + ++ Sbjct: 282 SVVFVIDVSGSMH--------------------GRKMEQTKDALLKILEDVKQDDYLNFI 321 Query: 252 VYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + E +++ MD T+ A+ + +L ++ Sbjct: 322 LFSGDVTTWKDSLVPATP-----ENIQEASKFVMDIQDRGMTNINDALLRGISMLNKARE 376 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 II LTDG+ N +S I+ A + + Sbjct: 377 EHTVPE-----------RSTSIIIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGF 425 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 426 GNNLNYNFLENMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 471 >gi|87308731|ref|ZP_01090870.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] gi|87288442|gb|EAQ80337.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] Length = 625 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/287 (10%), Positives = 69/287 (24%), Gaps = 42/287 (14%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N ++ +P + I D+ + + M N + Sbjct: 327 NSAVKNAFGLNHVPYPYASGSWDDYINYCRNDNDVRSADYRNMYGGVTLANYWLAKKYNS 386 Query: 225 RTKMAALKN-------ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS------- 270 K F + L + +G++ Y Sbjct: 387 WQTQDLWKAPHYPFHAVKNGFTLFLSFLDDLDFGDEVGIVSYDESARVEHTLDEHGEYAT 446 Query: 271 ---WGTEKVRQYVTRDMDSLILK----PTDSTPAMKQAYQILTS----DKKRSFFTNFFR 319 + + T +K+A ++LT + + Sbjct: 447 LGGNNISEDYDTLDLIQRHKQSGHYGSYTGMGYGVKEASELLTEHSRHGARPTMVVMTDG 506 Query: 320 QGVKIPSLPFQKFII-------FLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTISI 369 Q + P+ + F DG + S+ + +A + + I T+S+ Sbjct: 507 QANRYPNRWSLPYNWNWADYTDFDGDGNADYSTSDRSKQYAFYEAVQAHKLDVTIHTMSV 566 Query: 370 NASPNGQRLLKTCVSSPEYHYN-------VVNADSLIHVFQNISQLM 409 + + + H L+ F I+ + Sbjct: 567 GLDADRSLMTAIAFACGGVHIAVPGGATVAEMEQQLLDAFGQIAAKV 613 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/344 (5%), Positives = 63/344 (18%), Gaps = 36/344 (10%) Query: 34 LLLIGFLIYVLDWHYKKNSME-------------------SANNAA----ILAGASKMVS 70 I F I ++ M+ A ++ ++ + Sbjct: 31 FGFISFGIDTGLISLEQTRMQNAVDAAALAASQEITSAVAQAGDSGGDPNSISISFAKQM 90 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI--------QNIV 122 + ++ + E Sbjct: 91 AVDVAAANGVYLNADRDIVFGKRTYDPGSGEWAYDWTTGPYNVVKVEAHRDQPNLEAPDG 150 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + + L F + + + I + Sbjct: 151 RVPLAFGWAVGVPSIPLVTSATSFVEAR--DMVVVLDFSGSMNDDSQFKAINRLGNDAVT 208 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + V L + + + + Sbjct: 209 QNLRDIFTSMSPNVGSLPLDPTYLTVVGAAPTSGCDSQVQVTFKGTEIYVKSSKDLSNVV 268 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + Y L T + ++ + +V +S + + + Sbjct: 269 LQFDNGNKYKYDNLNQGKTGTFQGNGGNYNRVITKAWVKSGCNS-SGEGSGYGERFEDTN 327 Query: 303 QILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDGENNNF 344 + + + + G + + D N Sbjct: 328 SAVKNAFGLNHVPYPYASGSWDDYINYCRNDNDVRSADYRNMYG 371 >gi|46124769|ref|XP_386938.1| hypothetical protein FG06762.1 [Gibberella zeae PH-1] Length = 1158 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 79/260 (30%), Gaps = 28/260 (10%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 L QR + VS + + S +M +++ + V PI Sbjct: 573 LDWQRSRPQRVSSVASSWNGARSTTTAPTDYTNFPRSPAMSKSVHVPIDVVVVVPISSSM 632 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + K+ +++AL ++++ MGL+ + + T K + Sbjct: 633 QGVKINLVRDALKFMVNTLGERD------RMGLVTFGSGGGGV-PIVGMTTKAWPGWSNI 685 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + S+ S A + + K + I+ ++D ++ Sbjct: 686 LTSIKPVGQKSHRA---------DVVEGANVAMDLLMQRKYNNPIAS--IMLISDASTSD 734 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH--- 400 S + + AK I I + + + +++ + + V + L Sbjct: 735 ADSVDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKDWMMLRECLA 791 Query: 401 ----VFQNISQLMVHRKYSV 416 Q +S V K + Sbjct: 792 GCLGSMQTLSHQNVKLKLRL 811 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 63/233 (27%), Gaps = 40/233 (17%) Query: 172 KIVSFIPALLR----IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 V+ +PA + + P + S A + S + +T+ Sbjct: 30 LSVASVPAFAQNPTGVTPSAEPSSEAAPNGGAANSQSGATMLVLDSSGSMNVQDAGGQTR 89 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIG--------YTTRVEKNIEPSWGTEKVRQY 279 + A K+A F+ + + Y G + + P + Sbjct: 90 LDAAKDATKKFVSELGGTIPLGLVTYGGTVDEAPENQEAGCQDIHVVSGPKEDVGD--SF 147 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T ++K+A + L I+ ++DG Sbjct: 148 TGPIDALQAKGYTPIGDSLKKAAEELGGQHGT---------------------IVLVSDG 186 Query: 340 ENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYH 389 + V ++ + E I I TI N ++ L ++ + Sbjct: 187 IDTCAPPPVC--EVAKELHEQGIDLVINTIGFNVDEEARKELSCIAEAAGGEY 237 >gi|251799254|ref|YP_003013985.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247546880|gb|ACT03899.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 562 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 50/161 (31%), Gaps = 20/161 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ D +GL+ Y+ V + + T + +V T + + Sbjct: 413 LKGQKYLGRDNSIGLVSYSDGVTIKLPIAKYDTNQQSLFVGAVNSLQSGGGTATFDGIVV 472 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + I L+DGE N + + + + K Sbjct: 473 ALKLLEDQLAT--------------DPNTKPVIFVLSDGETNQGYTLQDIKGLIETYK-- 516 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I I TI + N Q L + N D + + Sbjct: 517 -IPIYTIG--YNANIQALQNISSINEAASINADTDDVVYKI 554 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 45/139 (32%), Gaps = 10/139 (7%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NL 72 A E+ +F++ A+ ++ + +G + + + + + ++ + A LAGA +M + Sbjct: 20 RARERGSFAVAAAIWMLVAIAALGV-VDIGNVFFVRRDLQRVADMAALAGAQRMDDQCSQ 78 Query: 73 SRLGDRFESISNHAKRALIDDAKRF-------IKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + SN RA + N S S ++ N Sbjct: 79 PNAAAAANASSNGFDRAASGNTLTVSCGRWDTQSNAAPSYFSTTSTPLNAVQVTATQNVP 138 Query: 126 RISMTHMANNRLDSSNNTI 144 + S+ Sbjct: 139 YFFLGPSRTVSATSTAKAT 157 >gi|149546336|ref|XP_001514218.1| PREDICTED: similar to Procollagen, type VI, alpha 2 [Ornithorhynchus anatinus] Length = 1023 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 44/162 (27%), Gaps = 17/162 (10%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + L+ Y E + + + ++ + Sbjct: 865 TLARSDDDNMNARVALLQYGGEAEQEVVFPLTSNLTVISSALANLRYLDS--SSNVGNGI 922 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + K + +FLTDG + + + K Sbjct: 923 VHAINHI---------VIPPVDNQKKARPNAELSFVFLTDGVTESKS----LKEATEAMK 969 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + + LL+ + + + DSL Sbjct: 970 KQDVVPTVVALGSDVDMDVLLEISLGDRASIFREKDYDSLSQ 1011 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 58/173 (33%), Gaps = 23/173 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDL---LSHVKEDVYMGLIGYTTRVEKNIEP 269 D + + Q + + +K+ ++ F ++ + V + G + ++ VE Sbjct: 57 DTSESVAMQPPIESLVDHIKDFVIQFTSQLENEFYQNQVAISWHYGGLHFSDLVEIFSPL 116 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 E R ++ + T + A+ + +T + Sbjct: 117 PSNKEAFRNKLSGVK--YFGRGTFTDCAIANMTEQVTQSQVAGV---------------- 158 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 F + +TDG ++A++ IK+ ++++N + L + Sbjct: 159 -NFAVVITDGHVT-GSPCGGMKLQAERARDAGIKLFSVAVNENLYESGLREIA 209 >gi|289596620|ref|YP_003483316.1| von Willebrand factor type A [Aciduliprofundum boonei T469] gi|289534407|gb|ADD08754.1| von Willebrand factor type A [Aciduliprofundum boonei T469] Length = 2166 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 51/163 (31%), Gaps = 8/163 (4%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R I V+D SGSM+ + + T++ A + + + Sbjct: 1246 NKRKPIDIIFVIDTSGSMNSVVPGATVGD---VNGDGRSNTRIDVAIQAAIDAVKELGPQ 1302 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V + G + E+ + ++ T + + ++ D T + A Sbjct: 1303 DRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYY 1358 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + + +G+ + + + + N + + Sbjct: 1359 MDTQSADNPDREDATRGILVLTDGLSNSDNYGPNNVRNGARFD 1401 >gi|254167891|ref|ZP_04874740.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623182|gb|EDY35748.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 2164 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 51/163 (31%), Gaps = 8/163 (4%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 +R I V+D SGSM+ + + T++ A + + + Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGD---VNGDGRSNTRIDVAIQAAIDAVKELGPQ 1300 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 V + G + E+ + ++ T + + ++ D T + A Sbjct: 1301 DRVAVFTFDG----NSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYY 1356 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + + +G+ + + + + N + + Sbjct: 1357 MDTQSADNPDREDATRGILVLTDGLSNSDNYGPNNVRNGARFD 1399 >gi|118763608|gb|AAI28261.1| VIT protein [Homo sapiens] Length = 657 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 51/185 (27%), Gaps = 27/185 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLIL 289 + L + D +G + YT + Sbjct: 490 FRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDK-YSSRPDIINAIKRVGYWS 548 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + A+ A + L K +K +I +TDG + + +V Sbjct: 549 GGTSTGAAINFALEQLF----------------KKSKPNKRKLMILITDGRSYD---DVR 589 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 + K + I + + Q L+ + P ++ + V D+L I Q Sbjct: 590 IPAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRIIQ 645 Query: 408 LMVHR 412 + Sbjct: 646 NICTE 650 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/305 (7%), Positives = 67/305 (21%), Gaps = 24/305 (7%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + A + ++ + ++ +++ Sbjct: 137 ESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVAT 196 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 L + + + + + S + + Sbjct: 197 PTTLPRPSPSAASTTSIPRPQS-VGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGA 255 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + + D + K+R ++ L D L Sbjct: 256 AFQKPVGADVSLDCKIDLSFLIDGSTSIGKRRFRIQK-----QLLADVAQALDIGPAGPL 310 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 MG++ Y + T + + ++ A+ + S + Sbjct: 311 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN- 369 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ + DG + + A+E+ I I I+I + Sbjct: 370 ------------RSGAPNVVVVMVDGWPTD-----KVEEASRLARESGINIFFITIEGAA 412 Query: 374 NGQRL 378 ++ Sbjct: 413 ENEKQ 417 >gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] gi|160332333|sp|P56652|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus] gi|296474789|gb|DAA16904.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] Length = 891 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 74/226 (32%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL L+ + ++ Sbjct: 284 SVVFVIDVSGSMH--------------------GRKMEQTKDALLKILEDVKQDDYLNFI 323 Query: 252 VYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + E +++ MD T+ A+ + +L ++ Sbjct: 324 LFSGDVTTWKDSLVPATP-----ENIQEASKFVMDIQDRGMTNINDALLRGISMLNKARE 378 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 II LTDG+ N +S I+ A + + Sbjct: 379 EHTVPE-----------RSTSIIIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGF 427 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 428 GNNLNYNFLENMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 473 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 68/205 (33%), Gaps = 40/205 (19%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 C K+A + A+ FL S + L+ + R + + T++++ Sbjct: 117 DCSGSMGPKLAKSRAAVAAFLSSANPED------EFSLVLFNDRAQLVSGFNRQTDELQS 170 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T A+ A + K K ++ ++D Sbjct: 171 ---KLFYAQSKGRTALLDAIYLAMDQMKHAKHSR------------------KAVLVISD 209 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISI-------NASP---NGQRLLKTCVS-SPE 387 G +N + + ++ ++ KE +I +I I S G LL S S Sbjct: 210 GGDNCSRYS--MREVKNRVKEGDAQIYSIGILEAMGFRGRSAEELAGPALLDDIASQSGG 267 Query: 388 YHYNVVNADSLIHVFQNISQLMVHR 412 + + N + L V I + ++ Sbjct: 268 RLFEIDNLNELSDVASKIGMALRNQ 292 >gi|260797475|ref|XP_002593728.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] gi|229278956|gb|EEN49739.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] Length = 186 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 57/183 (31%), Gaps = 15/183 (8%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + +D + + D +G++ Y+ ++ + Sbjct: 16 DNFETVKQFVVDVVSAFTISLTDTRVGVVQYSDFNTLACNLGDHPDEASFVTAINTMQYQ 75 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + AM+ A L + + + F + +I LTDG++ + Sbjct: 76 GGGTATGDAMEYARVKLQAVWRPAPTPRKFPLF---------QIMIVLTDGKSGDDVVAA 126 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 D + + I + A+ + LL+ + + + + +L +I + Sbjct: 127 AQALAAD-----GVTVYAIGV-ANFDTAELLEITNGNQDRVIELKDYTALTASINSIIRA 180 Query: 409 MVH 411 + Sbjct: 181 LCK 183 >gi|157694069|ref|YP_001488531.1| hypothetical protein BPUM_3318 [Bacillus pumilus SAFR-032] gi|157682827|gb|ABV63971.1| hypothetical protein YwmD [Bacillus pumilus SAFR-032] Length = 225 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 57/201 (28%), Gaps = 29/201 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV------KEDVYMGLIGYTTRVEKN 266 D++ + + K+ K ++ F + + V E + Sbjct: 38 DLSGSMAQSVEGEKKIDIAKRSIQSFASILSDDTQVLLRVFGHEGTNKNAGKAISCASSE 97 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 +G+ + + T A+ Q + Sbjct: 98 AVYGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQA---------------- 141 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCV- 383 + + ++DG+ S + + E+ I + I + + + LK+ Sbjct: 142 -EGKNIVYVVSDGKETCGGSPS---QAAKELHEDGIDTIVNIIGFDVNEKEAKSLKSVAK 197 Query: 384 SSPEYHYNVVNADSLIHVFQN 404 + + NA+ L H+ QN Sbjct: 198 AGGGQYQPAANAEELNHILQN 218 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 55/181 (30%), Gaps = 25/181 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + ++ +K+A +D + + L+ + R + I +K Sbjct: 58 RGERLMQVKDAAARIVDQLGQDDY------FSLVVFNDRADVVIPAQRAIKKA-DLKAAI 110 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ M A Q F +G+ II LTDG Sbjct: 111 AQIEAAGGTEMAQGMALALQ--------EVQRPFLTRGISR--------IILLTDGRTYG 154 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +S ++I + + I + + I N L S + A ++ VF Sbjct: 155 DES--RCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSRAQYIATAQDVVKVFA 212 Query: 404 N 404 + Sbjct: 213 D 213 >gi|317419404|emb|CBN81441.1| von Willebrand factor A domain-containing protein 2 [Dicentrarchus labrax] Length = 761 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 234 ALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + F+ + ++ M L+ Y R T + + Sbjct: 519 TMRDFVRGLSVQFDINRDVAQMALVAYGRRATTVFNL--DTHDTGSAI-----LKAVGDA 571 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + L + K K ++ +TDG + + Sbjct: 572 NYMGGVASTGTALLH------VHSDILTVAKGARPGVNKAVVVVTDGSGGDDAAVP---- 621 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K ++N + + I I +RLL+ S E+ +V++ + L F+++ Sbjct: 622 -AQKLRDNGVSLFVIGIG-DIQKERLLQI-AGSEEHMISVLSYEDL-KYFEDV 670 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 41/155 (26%), Gaps = 19/155 (12%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 L + V +GLI + + T K S T + A+K Sbjct: 48 QALDVSPDKVRVGLIQFGSVPRLEFALDSHTTKQDLKKHMKKVSYRGGSTQTGLALKYVL 107 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + S + I L+DG + N ++ + KE + Sbjct: 108 RKGFEGGRNSSAAA--------------RIAILLSDGRSQG-----NAVQAAAQLKETGV 148 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + + + + + + Sbjct: 149 VLFAVGLRYPRWEELHALASAPMESHVFFAEHFHD 183 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 49/162 (30%), Gaps = 26/162 (16%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQ 303 + +I + + + + + +K + + T + A+K Sbjct: 420 PISPNATRVAIIQFAGKTKLRVLADFAQKKSAAELKTIIGRSHFFSGTTFTNGALKTMAD 479 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + K+ ++ TDG S +T + + K + Sbjct: 480 LFQKSKRADAKLK----------------VVLFTDG-----YSAEDTSEGAEALKSQGVV 518 Query: 364 IVTISINASPNGQRLLKT---CVSSPEYHYNVVNADSLIHVF 402 + T+ I+ + +K +SP + +N + L F Sbjct: 519 VYTVGISTEKSTGLNMKELHGMATSPNHFFNASDFVELSKNF 560 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/311 (9%), Positives = 67/311 (21%), Gaps = 48/311 (15%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV---- 197 ++ + V + + + + + E I V Sbjct: 47 SSTKGQLTVKHNIMLLIDDSGSM-EYVPSSDRRPYSSYYYGYGAREISRLDITKNVLNKV 105 Query: 198 ------DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + S+ N+ D + K+ ++ + V + Sbjct: 106 LDKYQDRFNWSLQTLHNNGRTDTQDFSSSWRTIKNKVNGIR--AENGTPTTRRYYEVVSN 163 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP--------------- 296 + M I Y + + S G + D +P Sbjct: 164 IVMPNIKYRCQKSYVVLMSDGDANLS---CDIYDYWGRPGYGISPFYYGRRYYSDYYRDY 220 Query: 297 ----AMKQAYQILTSDKKRSFFTNFFRQGVKIPS----------LPFQKFIIFLTDGENN 342 M +Y K I + Sbjct: 221 IPSVGM-SSYNYFGPHSGGYCRNQTGGAYDTFWDREDGLAFFSRTLATKDIKVGGKDKAG 279 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + K ++ T+ + G+ L+ S ++N + + L Sbjct: 280 KSWDGDYSDPRGTNFKNQLVQTYTVGFGSGVSATGEAYLRNGASGSNNYFNARDENGLFA 339 Query: 401 VFQNISQLMVH 411 F I+ + + Sbjct: 340 AFDAITDSIAN 350 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 32/177 (18%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + + ++ VE+ + + ++ + ++ + T + A + L Sbjct: 152 DILRPQDRLSVYAFSETVEEIVPFTSDLRRIDRGISEIIAGSA---TAMYDTIFLASKAL 208 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +K ++ +TDG + S+ + + A ++ + Sbjct: 209 MK-------------------HDGRKVMVLITDGGDTF--SSTSYEQAARAATQSETLLY 247 Query: 366 TISI-------NASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +I + G+ L++ + HY + SL F+ IS + + Sbjct: 248 SIIVVPVANSAGRDTGGEHALIQISQDTGGKHYYATDMGSLDVAFKQISDELRTQYL 304 >gi|3183041|sp|Q90615|ITA1_CHICK RecName: Full=Integrin alpha-1; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1 gi|497990|gb|AAA59067.1| alpha 1 integrin [Gallus gallus] Length = 285 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 67/194 (34%), Gaps = 26/194 (13%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ FL+S+ + + +G++ Y V + + + + Sbjct: 75 YPWESVTAFLNSLLRNMDIGPQQTQVGIVQYGQTVVHEFYLNT-YSTTEEVMDAALRIRQ 133 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + F QK ++ +TDGE+++ Sbjct: 134 RGGTQTMTAL---------GIDTAREEAFTEAHGARRG--VQKVMVIVTDGESHDNYRLQ 182 Query: 349 NTIKICDKAKENFIKIVTISI-NASPNGQ-------RLLKTCVS--SPEYHYNVVNADSL 398 I C+ + I+ I+I + G +K+ S + ++ +NV + +L Sbjct: 183 EVIDKCE---DENIQRFAIAILGSYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELAL 239 Query: 399 IHVFQNISQLMVHR 412 + + + + + + Sbjct: 240 VTIVEALGERIFAL 253 >gi|147898495|ref|NP_001088330.1| inter-alpha (globulin) inhibitor H2 [Xenopus laevis] gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis] Length = 935 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 62/227 (27%), Gaps = 42/227 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I V+D+SGSM KM +A+ L+ ++ Sbjct: 300 PTLPKNILFVIDVSGSMW--------------------GLKMKQTVDAMKSILEDLNSDD 339 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + I + E S +YV R T+ A+ +A IL Sbjct: 340 QFGIIDFNHNI-RCWKDELVYASSVEKGDASKYVQRI---QPNGGTNINDALLRAIFILK 395 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKI 364 + I+ ++DG+ K + ++ I + Sbjct: 396 EASNKGLLEQNSVS-----------LIVLVSDGDPTVGELKLPKIQKNVRTNIQD-DIAL 443 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 ++ I + L + + + A L + +S Sbjct: 444 HSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVST 490 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 63/182 (34%), Gaps = 35/182 (19%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V++ ++G++ + + E + R + + + ++ T ++ A+ ++ Sbjct: 339 VEQGSWVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVIK 398 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIV 365 I+ LTDGE+N TI C D K++ I Sbjct: 399 KKYPTDGSE-----------------IVLLTDGEDN-------TISSCFDLVKQSGAIIH 434 Query: 366 TISINASPNGQRLLKTCVSSPE--YHYN--VVNADSLIHVFQNISQ---LMVHRKYSVIL 418 T+++ + L + + + + V N + + F +S + + Sbjct: 435 TVALG-PAAAKELEQLSKMTGGLQTYSSDQVQN-NGFVDAFAALSSGNAAIAQHSIQLES 492 Query: 419 KG 420 +G Sbjct: 493 RG 494 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 51.0 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 55/182 (30%), Gaps = 25/182 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L L + +G I +T + K + Sbjct: 388 FRLVLDLLVSIARSFDISDIGSRIGAIQFTYDQRMEFNFNDHVLKDNALRALQKIPYMSG 447 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K S +KF+I +TDG++ + +V Sbjct: 448 GTATGDAINFAVRSLF----------------KPRSSSNRKFLIIITDGQSYD---DVRV 488 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A+ I + + + +P LK S P + + L Q I + Sbjct: 489 PAMA--AQREGITVYAVGVAWAPMED--LKAMASEPKESHVFFTREFTGLGQFQQPIVRG 544 Query: 409 MV 410 + Sbjct: 545 IC 546 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/174 (10%), Positives = 47/174 (27%), Gaps = 28/174 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + KN + + + + +G++ + + T Sbjct: 183 QRRFNLQKNFVSKLATMLKVGTQGPH---VGVVQTSETPRTEFYLTNYTTAKDVTFAIKE 239 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 I T++ A+ + S + + I+ DG Sbjct: 240 IPYIGGNTNTGKAILHTVRNFFS-------------PDFGVRRGYPRVIVVFVDGWP--- 283 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYN 391 + N + A+E+ I I +S+ + Q ++ V + Sbjct: 284 --SDNVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKAVCKDNEFFT 335 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 59/195 (30%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + VR Sbjct: 301 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSVR 355 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A++ A ++L IIFLT Sbjct: 356 DGKVYIHHMSPTGGTDINEALQTAIRLLNKYVA-----------HSDSGDRSVSLIIFLT 404 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 405 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 462 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 463 EEDAGSQLIGFYDEI 477 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 65/169 (38%), Gaps = 23/169 (13%) Query: 252 VYMGLIGYTTRVEKNIEP----SWGTEKVRQYVT---RDMDSLILKPTDSTPAMKQAYQI 304 + G+I Y T + I S + V + + D T++ A+++ Y + Sbjct: 200 IQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSM 259 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA------K 358 ++S K+ + + II LTDG+ N +TIK + + Sbjct: 260 MSSQKETYKNESVWNS--------IHHIIILLTDGKANLGGRPAHTIKRIEDFLDIKHKR 311 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 E+++ + T I + L + + + + +A+ + VFQ I Sbjct: 312 EDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 360 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 65/169 (38%), Gaps = 23/169 (13%) Query: 252 VYMGLIGYTTRVEKNIEP----SWGTEKVRQYVT---RDMDSLILKPTDSTPAMKQAYQI 304 + G+I Y T + I S + V + + D T++ A+++ Y + Sbjct: 291 IQFGIISYATVPKVIIPIYDEESDNNDHVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSM 350 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA------K 358 ++S K+ + + II LTDG+ N +TIK + + Sbjct: 351 MSSQKETYKNESVWNS--------IHHIIILLTDGKANLGGRPAHTIKRIEDFLDIKHKR 402 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 E+++ + T I + L + + + + +A+ + VFQ I Sbjct: 403 EDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFRMESANEMKTVFQKI 451 >gi|260837282|ref|XP_002613634.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] gi|229299020|gb|EEN69643.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] Length = 1460 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 56/190 (29%), Gaps = 22/190 (11%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +V+D+SGSM P +++ ++ K A L LD++ Sbjct: 200 VSAASPKKKNVVIVMDVSGSMRE------------PHGVPEEQNRLNLAKQAALTVLDTL 247 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + + + T K+A+ Sbjct: 248 TPRDWAGVVSFSARAKAPEGCLGDSLGEANPTNIGIMKDFINQRVPETITVYAEGFKKAF 307 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K + Q IIFLTDG+ + ++ I E + Sbjct: 308 NMFFESKNKKPEQ----------FEDCQNIIIFLTDGQPTDTYFTLDDIVKGQDLMERSV 357 Query: 363 KIVTISINAS 372 I T + A+ Sbjct: 358 HIFTYGLGAN 367 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 54/190 (28%), Gaps = 23/190 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +V+D+SGSM +++ ++ K A L LD++ Sbjct: 1157 VSAASPKKKNVVIVMDVSGSMREPPGP-------------EEQNRLNLAKQAALTVLDTL 1203 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + + + T ++A+ Sbjct: 1204 TPRDWGGVVSFSARAETPEGCLGDSLGEANPTNIGIMKDFINQRVPETITMYGVGFRKAF 1263 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + IIFL+DG + +N I + + + Sbjct: 1264 DMFAEARNKKPEQ----------FEDCYNIIIFLSDGSPTDKDFALNAITQGQELMDRSV 1313 Query: 363 KIVTISINAS 372 I T + A+ Sbjct: 1314 YIFTYGLGAN 1323 >gi|47216852|emb|CAG11659.1| unnamed protein product [Tetraodon nigroviridis] Length = 1042 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/230 (11%), Positives = 75/230 (32%), Gaps = 37/230 (16%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTK-----MAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + +C + R + +K+ + ++ + +E MG++ Sbjct: 669 DKWLWFPVQSRSSGTGLCDRRFRERRTPKNFELVKDFVNALVERVW---LSQEAARMGVV 725 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + + + S+ + T + A+ +A+Q+ + Sbjct: 726 LFSHTSLAVASLQPRSS--LSELKATVRSMPYLGEGTFTGSAIHRAHQLFQAS------- 776 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--- 372 +K + LTDG+ + + + A+ I++ I + Sbjct: 777 ----------RPGVRKVALVLTDGQADPRDVVQVGVSAAE-AQARGIEMFVIGVMNESHP 825 Query: 373 --PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 P+ +K S+P + + + + SL + + I + + +V Sbjct: 826 LYPDFSAEMKAIASNPKEGHVHLIQDFGSLHVLEKVIVNQICEQNAAVAF 875 >gi|333026931|ref|ZP_08454995.1| hypothetical protein STTU_4435 [Streptomyces sp. Tu6071] gi|332746783|gb|EGJ77224.1| hypothetical protein STTU_4435 [Streptomyces sp. Tu6071] Length = 1066 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 55/214 (25%), Gaps = 24/214 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE------PSWGTE 274 + + ALK A F+D++ R N Sbjct: 221 SGSVQGSVTALKTAANTFVDALQGTPSSMARFTFSSFSPAERGGANAPQLASVSTRADAT 280 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + +S T+ A+ L +++ + + Sbjct: 281 AFKNTYANWNESTAQGSTNWDRALY--VPALAAERYDLAVVITDGMPTRYGTPSGPGG-- 336 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY------ 388 +G F+ N I + K +++ + + A G L S + Sbjct: 337 ---NGSLTRFRELENGIFSANALKAENTRVLAVGVGAGVQGNANLNLAALSGQTRYNGTN 393 Query: 389 -----HYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ N + ++++ SV+ Sbjct: 394 IETADYFQEQNFAQVGAALRSLALANCTPSLSVV 427 >gi|326789665|ref|YP_004307486.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540429|gb|ADZ82288.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 564 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 52/158 (32%), Gaps = 20/158 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 I ++ +D +GL+ Y+ V N+ + + MD T + M Sbjct: 415 ITGAEYIGKDNSIGLVTYSDDVNINLPVGKFDLNHRSLFTGAVMDLEASGGTATFDGMIV 474 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A ++L + K + + + L+DGE N S + I K Sbjct: 475 ALKMLMEE--------------KAANPDAKLMLFVLSDGETNRGHSLNDIEGILRTLK-- 518 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I + TI + N L + N + D + Sbjct: 519 -IPVYTIG--YNANISALNNISSINEAASINAESDDVI 553 >gi|194217500|ref|XP_001502733.2| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Equus caballus] Length = 1163 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 58/185 (31%), Gaps = 20/185 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + + + L+ Y ++ + + + + + Sbjct: 205 FQRAKDFISNMMRNFYEKCFECSFALVQYGEVIQTEFDLQDSQDVMASLARVQNITQVGN 264 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + AM+ + + K ++ LTDG + F +N Sbjct: 265 VTKTASAMQHVLDHIFT-------------PSHSSRKKASKIMVVLTDG--DIFGDPLNL 309 Query: 351 IKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + + K ++ I + + + LK S P+ + + V N +L + + Sbjct: 310 TTVINSPKMQGVERFAIGVGEAFQKAKTDKELKLIASDPDETHAFKVTNYAALDGLLSKL 369 Query: 406 SQLMV 410 Q ++ Sbjct: 370 QQNII 374 >gi|170738572|ref|YP_001767227.1| hypothetical protein M446_0220 [Methylobacterium sp. 4-46] gi|168192846|gb|ACA14793.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 494 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/202 (9%), Positives = 47/202 (23%), Gaps = 14/202 (6%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG- 64 + + + AN +IF L ++G + W +K +++A +AA LA Sbjct: 4 KLAKLARSFWLAGAANVLVIFGLGAPVLFGVVGITVDYAIWLNQKLILQNAADAAALAIV 63 Query: 65 ----ASKMVSNLSRLGDRFESISNHAKRALID---------DAKRFIKNHIKESLSGYSA 111 + + + + K + G Sbjct: 64 ADIQIAAPQLERMQAIAESQVRRFVPLKEGDGPVAVRTAPRYRKTSDDPFAPGPVEGDVR 123 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + I + D + + L +Q Sbjct: 124 TANGWSVTLTQRKRAIMSRLVTPTLTDITVEATAEMAGTIKVCVLVLDAAASGALALRDQ 183 Query: 172 KIVSFIPALLRIEMGERPIFLI 193 + + + +R + Sbjct: 184 SRIDAGNCSVYVMSADRRGLQM 205 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 55/190 (28%), Gaps = 30/190 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 ++ LK+ + + ++ + +I ++ + I + G + Sbjct: 182 GQRITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVISETLGAVGSPSRI 241 Query: 281 T---RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 R T A+ AY + + I+ +T Sbjct: 242 DLQQRVEALQADGGTGIYDAVLAAY-------------------AESAGGDYIPSIVLMT 282 Query: 338 DGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DGE ++ + + I + I N + + ++ ++ +N Sbjct: 283 DGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYG-EANVADMEQLAATTGGKTFDAING 341 Query: 396 DSLIHVFQNI 405 D L F+ I Sbjct: 342 D-LDEAFKEI 350 >gi|47207521|emb|CAG14087.1| unnamed protein product [Tetraodon nigroviridis] Length = 1048 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 38/103 (36%), Gaps = 13/103 (12%) Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NA 371 + K +I +TDGE+++ + + ++ C+ + I I++ Sbjct: 55 SPERGAREGATKVMIVVTDGESHDGEELPDALQECE---DRNITRYAIAVLGHYIRRQQD 111 Query: 372 SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +K S P+ Y +NV + +L + + + Sbjct: 112 PETFISEIKYIASDPDDKYFFNVTDEAALNDIVDALGDRIFTL 154 >gi|218509026|ref|ZP_03506904.1| hypothetical protein RetlB5_16575 [Rhizobium etli Brasil 5] Length = 125 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 43/123 (34%), Gaps = 31/123 (25%) Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-------------CDKAKENFIKI---VT 366 + +K + F++DG +++K + T K C K+ +KI T Sbjct: 3 ARRTPSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYT 62 Query: 367 ISINASPNG-------------QRLLKTCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHR 412 + N ++ C S P +++ V D + + + +++ Sbjct: 63 TYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTDGITDAMKALFLKVIRAP 121 Query: 413 KYS 415 + + Sbjct: 122 RIT 124 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 52/159 (32%), Gaps = 13/159 (8%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILT 306 + G+ + V I P+ + R + T+ AM++A L Sbjct: 80 PDGSQAGVWTFGQWVNMLIPPAEVNSEWRTNAKEKAKMINSFGLRTNIGEAMERATWKLA 139 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---KENFIK 363 +D G+ I D + +N + D K +K Sbjct: 140 ADSDFEQHAILLTDGIVD--------IAADDDPQKDNKNEAERQRILTDVLSEYKNLGVK 191 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I TI+++ + + L K + + V N++ L+ F Sbjct: 192 IHTIALSNAADKVLLEKLALETGGMAEVVENSEQLVKAF 230 >gi|86559780|gb|ABD04185.1| collagen alpha-1 chain, type XII-like protein [Anthopleura elegantissima] Length = 101 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 11/91 (12%), Positives = 25/91 (27%), Gaps = 1/91 (1%) Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 K +TDG + K +++ + I + L Sbjct: 9 PAAGMRRNVPKMAFVITDGVQTRRDPITPLATASGRLKAKGVRVYALGIGKNIKDNE-LN 67 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 S+ ++ Y + L + I + + Sbjct: 68 AIASNRKFVYRTSSFKLLTPLINGIVKSICE 98 >gi|42526760|ref|NP_971858.1| batB protein, putative [Treponema denticola ATCC 35405] gi|41817075|gb|AAS11769.1| batB protein, putative [Treponema denticola ATCC 35405] Length = 322 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 63/218 (28%), Gaps = 48/218 (22%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D++ + QD +++A + L + L+ + S +GL+ ++ S Sbjct: 87 VSDISKSMSLQDIHPSRIAVQRQFLKILLEKMHKTSPES---AVGLVITKGEGVLSVPLS 143 Query: 271 WGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 + + + L T+ + +A ++ Sbjct: 144 FEKNALSSAINALSPLILSSTGTNLEAGVLRALDSFGENRGN------------------ 185 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---------SPNGQRLLK 380 K I+ TDG + + +K K+ ++ + G LK Sbjct: 186 SKIIVLCTDGGETSGS----LLHAAEKIKKTDAILIIVGFGTLEETKIKVLDEKGNTQLK 241 Query: 381 TC-------------VSSPEYHYNVVNADSLIHVFQNI 405 + + +++ S+ + + I Sbjct: 242 DARLEEAFLQKAASIAGGESTYISALSSGSIESILKII 279 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 69/241 (28%), Gaps = 45/241 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I ++ ++D SGSM + A++NAL F + +++ ++ Sbjct: 119 IPTVQFLIDFSGSMDQNFGG---------------IKRSQAVRNAL--FDEDDGVVALLQ 161 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G YT+ P +V D + + Sbjct: 162 SQVRFGASLYTSFDGNEAPPCPRLTQVAPAFNNLTALRA----DIGGPLNDPPNAGDTPT 217 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV------------NTIKICDKA 357 S P+ + I+ TDGE ++ + + +A Sbjct: 218 GESIDAIAENFPDNGPND--KPLIVLATDGEPDSCTDPDPNTDPGRAATRRLSEEATQRA 275 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSS----------PEYHYNVVNADSLIHVFQNISQ 407 E I++ +S+ L + + P Y N L+ F I + Sbjct: 276 FEAGIELYVLSVGNDVGADHLQRVANAGVGKALDESNDPATVYIGNNQQELVDAFSEIIR 335 Query: 408 L 408 Sbjct: 336 S 336 >gi|270008949|gb|EFA05397.1| hypothetical protein TcasGA2_TC015569 [Tribolium castaneum] Length = 2194 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 75/261 (28%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + + +++ Q + + L + + ++ V +S Sbjct: 1794 VVFVLDYSASMEGRKHEQLMQAMDKILSDLN-PDDLFHIVRFSVIVSV------------ 1840 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ + +K A +++D + Sbjct: 1841 -------WNFEKNRFDQIKFAQKPEYENLDSFLAEFNLGDAAQVS-----------EDNI 1882 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 +K ++ D+D T+ + ++ ++ + N Q I Sbjct: 1883 KKAKEIKDHDVDMDC---TNIIGGLVVGLYLVRQTLEKFYEKNIE--------TKHQPMI 1931 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK---IVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N + I I ++S + L K + Sbjct: 1932 IFLTDGLPNVGLIIRDEITDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 1991 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 1992 HIYEAADAALQLQNFYRTVSS 2012 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 79/262 (30%), Gaps = 51/262 (19%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + + L + + ++ ++S Sbjct: 1006 VVFVLDHSGSMEGRKYEQLMQAMDKILSDLN-PDDLFHIVRFSENVSV------------ 1052 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K + + + D ++ + T Sbjct: 1053 -------WNFEKNKFDQVS--------FLQKPEYRNLDSFLAEFNLGDAAQV-------T 1090 Query: 274 EKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E + + D + T+ + ++ ++ + N Q Sbjct: 1091 EGNIKKAKKIKDHDVDMGCTNIIGGLVVGLYLVRRTLQKFYEKNVE--------TKHQPM 1142 Query: 333 IIFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE-- 387 IIFLTDG N SN + I KI K + N I ++S + L K + Sbjct: 1143 IIFLTDGLPNEGISNPDKITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFS 1202 Query: 388 -YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 1203 RHIYEAADAALQLQNFYRTVSS 1224 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 79/261 (30%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 L + +++ Q + + L + + ++ D+S + D Sbjct: 249 VVLVLDHSASMRGRKHEQLMQAMDKILSDLN-PDDLFHVVCFSEDVSV---WNLEKKQFD 304 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +D ++ + D + + V+ + Sbjct: 305 ------------------------LIDFMEKFDYENLDSCLTELNLGNAVQ------FTE 334 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +++ D + + T+ + ++ K+++ N Q I Sbjct: 335 DNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVE--------TKHQPMI 386 Query: 334 IFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE--- 387 I LTDG N S+ I KI K + N I ++S + L K + Sbjct: 387 ILLTDGLPNVGLSDPVEITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 446 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ + Sbjct: 447 HIYEAADAALQLQNFYRTVFS 467 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 55/182 (30%), Gaps = 25/182 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L L + +G I +T + K + Sbjct: 388 FRLVLDLLVSIARSFDISDIGSRIGAIQFTYDQRMEFNFNDHVLKDNALRALQKIPYMSG 447 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L K S +KF+I +TDG++ + +V Sbjct: 448 GTATGDAINFAVRSLF----------------KPRSSSNRKFLIIITDGQSYD---DVRV 488 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + A+ I + + + +P LK S P + + L Q I + Sbjct: 489 PAMA--AQREGITVYAVGVAWAPMED--LKAMASEPKESHVFFTREFTGLGQFQQPIVRG 544 Query: 409 MV 410 + Sbjct: 545 IC 546 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/174 (10%), Positives = 47/174 (27%), Gaps = 28/174 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + KN + + + + +G++ + + T Sbjct: 183 QRRFNLQKNFVSKLATMLKVGTQGPH---VGVVQTSETPRTEFYLTNYTTAKDVTFAIKE 239 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 I T++ A+ + S + + I+ DG Sbjct: 240 IPYIGGNTNTGKAILHTVRNFFS-------------PDFGVRRGYPRVIVVFVDGWP--- 283 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASP-------NGQRLLKTCVSSPEYHYN 391 + N + A+E+ I I +S+ + Q ++ V + Sbjct: 284 --SDNVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKAVCKDNEFFT 335 >gi|91084771|ref|XP_972223.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 606 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 75/261 (28%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + + +++ Q + + L + + ++ V +S Sbjct: 252 VVFVLDYSASMEGRKHEQLMQAMDKILSDLN-PDDLFHIVRFSVIVSV------------ 298 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ + +K A +++D + Sbjct: 299 -------WNFEKNRFDQIKFAQKPEYENLDSFLAEFNLGDAAQVS-----------EDNI 340 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 +K ++ D+D T+ + ++ ++ + N Q I Sbjct: 341 KKAKEIKDHDVDMDC---TNIIGGLVVGLYLVRQTLEKFYEKNIE--------TKHQPMI 389 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK---IVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N + I I ++S + L K + Sbjct: 390 IFLTDGLPNVGLIIRDEITDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 449 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 450 HIYEAADAALQLQNFYRTVSS 470 >gi|116332274|ref|YP_801992.1| hypothetical protein LBJ_2829 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116125963|gb|ABJ77234.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 379 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 57/199 (28%), Gaps = 27/199 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + K+ K + ++ ++ + + Y I + Sbjct: 72 DASGSMNEYLGIYQKIHLAKKHVSRYISTLPTETEIGFIAYGNRIPGCSSSRLYEPLQRE 131 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + R T +++ A +++ KK + Sbjct: 132 NHG--TFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE------------------- 170 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT-CVSSPEYH 389 II +TDG + + K K+ I K + + P+ +R +K + Sbjct: 171 IILITDGVESCYGD---PKKELQALKQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKY 227 Query: 390 YNVVNADSLIHVFQNISQL 408 + + + S ++ Sbjct: 228 FGIEDDSSFYTALDSLKNQ 246 >gi|116327069|ref|YP_796789.1| hypothetical protein LBL_0242 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116119813|gb|ABJ77856.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] Length = 379 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 57/199 (28%), Gaps = 27/199 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + K+ K + ++ ++ + + Y I + Sbjct: 72 DASGSMNEYLGIYQKIHLAKKHVSRYISTLPTETEIGFIAYGNRIPGCSSSRLYEPLQRE 131 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + R T +++ A +++ KK + Sbjct: 132 NHG--TFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE------------------- 170 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKT-CVSSPEYH 389 II +TDG + + K K+ I K + + P+ +R +K + Sbjct: 171 IILITDGVESCYGD---PKKELQALKQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKY 227 Query: 390 YNVVNADSLIHVFQNISQL 408 + + + S ++ Sbjct: 228 FGIEDDSSFYTALDSLKNQ 246 >gi|321464007|gb|EFX75018.1| hypothetical protein DAPPUDRAFT_323732 [Daphnia pulex] Length = 923 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 55/182 (30%), Gaps = 31/182 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ L ++ ++ + E +G++ + + + ++ RQ + + Sbjct: 316 YNRIGRLYDSAQRWIQY-----DIPEGTSLGIVKFHSYATVLSNMTVVNDETRQQLMDKV 370 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T ++ A IL I+ +TDG N + Sbjct: 371 PNKTEDSTCIGCVLQMAIDILHPGGNG-------------------GVIVLVTDGLENEY 411 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIHV 401 + + + I++V+I+ + + Y + N D L Sbjct: 412 PFIRDVTP---ELIDAKIQVVSIAFGRDA-ENEIENLATKTNGKSYFINDNGNNDELNDA 467 Query: 402 FQ 403 F Sbjct: 468 FT 469 >gi|325108192|ref|YP_004269260.1| hypothetical protein Plabr_1627 [Planctomyces brasiliensis DSM 5305] gi|324968460|gb|ADY59238.1| protein of unknown function DUF1355 [Planctomyces brasiliensis DSM 5305] Length = 938 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/238 (10%), Positives = 56/238 (23%), Gaps = 43/238 (18%) Query: 170 NQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + +P + + + + + +D SGSM ++ TKM Sbjct: 376 NSPLDDVLPVSMEMREEHRKNRLALAIALDRSGSMTAPVSGGK--------------TKM 421 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + + +I + + + + + Sbjct: 422 DLANLGTAECIRLLSPSD------EVAVIAVDSTPHTIVPLTNVSNPDDIAQQVLGIQSM 475 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+ A L K II +D + + Sbjct: 476 GGGIFVYEALVAAGNELMKS------------------DLATKHIILFSDAA--DSEEPG 515 Query: 349 NTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + I + I + A + + L + + L +F Sbjct: 516 AYRSLIKDYENAGITVSVIGLGTTADVDAKLLQEIATLGSGNIMFTQDVAELPRLFTE 573 >gi|301606537|ref|XP_002932887.1| PREDICTED: cochlin-like [Xenopus (Silurana) tropicalis] Length = 789 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 51/178 (28%), Gaps = 24/178 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L D + +G I +T + + K + T + Sbjct: 626 LDFMADVVRAFEVSDVGTKVGAIQFTYDQRLEFGLNDHSNKDDVLNALRSIRYMSGGTAT 685 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A + L + F+I +TDG++ + Sbjct: 686 GDAINFAVRNLFQ---------------PTKDGHNKNFLIIVTDGQSYDD-----VRGPA 725 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 A + + I ++ + +P LK S P + + L + ++ + + Sbjct: 726 SSAHISGVTIFSVGVAWAPLED--LKDMASEPKNSHTFFTREFPGLEQITPDLIRGIC 781 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/389 (6%), Positives = 72/389 (18%), Gaps = 54/389 (13%) Query: 49 KKNSMESANNAAILAG-------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + M A A LA + + N D Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD-------SSNNTIFYNMDVMTSY 154 + + E + + ++ A + Sbjct: 262 TSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRGNVQKEITPLDL 321 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + + + ++ G + SG + Sbjct: 322 VLVVDWSGSMNENNR----------IGEVQKGVNRFVD---TLADSGITNNINMGYVGYS 368 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + +KN + + L + Sbjct: 369 SDGYNNNAIQMGPFDTVKNPIKNITP---SSTRGGTFTQKALRDAGDMLAT-------PN 418 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 ++ + D + + +A + + S Sbjct: 419 GHKKVIVLLTDGVPTFSYKVSRVQTEADGRFYGTQFTNRQDQPGSTSYISGSYNAPDQNN 478 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT------------C 382 + + TI K+ I+I + I + + L Sbjct: 479 INKRINST----FIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSA 534 Query: 383 VSSPEYHYN-VVNADSLIHVFQNISQLMV 410 + + +Y A + + + Sbjct: 535 DENGDLYYESADYAPDISDYLAKKAVQIS 563 >gi|257790640|ref|YP_003181246.1| hypothetical protein Elen_0884 [Eggerthella lenta DSM 2243] gi|257474537|gb|ACV54857.1| hypothetical protein Elen_0884 [Eggerthella lenta DSM 2243] Length = 210 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 48/150 (32%), Gaps = 9/150 (6%) Query: 10 YFKKGIASEK--ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 +K + + ++IF +++ L+ +G + V + K + A A+ + Sbjct: 2 ASRKPLRTRGLPGQVAVIFVVAISCLLIAVGLGVDVSTAYSAKTG-QEAVLEAVRQSSLG 60 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIK----NHIKESLSGYSAVFYNTEIQNIVN 123 M + + + L D+ + ES SG + + ++ + Sbjct: 61 MSNAVKYSENPGREAREEVVELLSDNGYTGTAEIWYAELPESESGSNDRYAGVYVRLSND 120 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + + ++L + S Sbjct: 121 TETTFLKLAGRDKLT--ARSTAVWTIHPYS 148 >gi|194221223|ref|XP_001915876.1| PREDICTED: inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Equus caballus] Length = 834 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 64/228 (28%), Gaps = 44/228 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ +D + Sbjct: 267 PTIPKNVIFVIDQSGSMA--------------------GRKIQQTREALIKIVDDLGPKD 306 Query: 247 HVKEDVYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQI 304 L+ + + + + + T+ A+ A Q+ Sbjct: 307 QFN------LVCFNEEATQWKPSLVPASAENMKEARNFAAGIMARGGTNINDAVLLAVQL 360 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIK 363 L K+ II LTDG+ ++N I+ +A Sbjct: 361 LERANKQELLPAGSVS-----------LIILLTDGDPTVGETNRANIQKNVQEAISGQCS 409 Query: 364 IVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + + + L K + + + + +A L +Q ++ Sbjct: 410 LFCLGFGFHVSYAFLEKLALDNGGLARRIYEDSDSALQLQDFYQEVAN 457 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 47/186 (25%), Gaps = 23/186 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + K L +G+ Y+T I T Sbjct: 52 SACGNFKRMLDFTQRIASGFGIASSQTRVGVGLYSTFASVPIPFGKYTSLQETVEGIKKL 111 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+K L S K +I LTDG Sbjct: 112 RYPGEGTRTGRALKLMKTHLFSQS----------------RPKAHKVLIVLTDG-----T 150 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 S + +E+ +++ + I R LK + + L+ V + + Sbjct: 151 SVDDVKAPAKALRESGVEVFAVGIG-EHYRPRELKDIATDTGHVLTA-GFRDLMSVERKL 208 Query: 406 SQLMVH 411 + Sbjct: 209 RDAVCK 214 >gi|313239130|emb|CBY14106.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 44/170 (25%), Gaps = 18/170 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + + V +GLI Y R + + D + T++ Sbjct: 343 NFVINLLMQSNLGQHGVRVGLITYNRRPTLRFHMNEMENHQQAINAVDSIVYEGRGTNTG 402 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K + ++ +TDG + Sbjct: 403 AAIKW-------------VVENAFRPEFGDRPEVPNKVLLITDGRARDPPVLKVQSGRLQ 449 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + + I + L + S + V N L FQ Sbjct: 450 EQS----TVYALGIGKQIDYVELNRIASDPSERHVLYVDNFSFLERAFQK 495 >gi|313214907|emb|CBY41128.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 44/170 (25%), Gaps = 18/170 (10%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + + V +GLI Y R + + D + T++ Sbjct: 343 NFVINLLMQSNLGQHGVRVGLITYNRRPTLRFHMNEMENHQQAINAVDSIVYEGRGTNTG 402 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+K + ++ +TDG + Sbjct: 403 AAIKW-------------VVENAFRPEFGDRPEVPNKVLLITDGRARDPPVLKVQSGRLQ 449 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + + I + L + S + V N L FQ Sbjct: 450 EQS----TVYALGIGKQIDYVELNRIASDPSERHVLYVDNFSFLERAFQK 495 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 24/278 (8%), Positives = 73/278 (26%), Gaps = 48/278 (17%) Query: 124 SSRISMTHMANNRLDSSNNTIFY---NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA- 179 S I + A N ++ D + L + + +I Sbjct: 14 VSAIFLQVSARNLAQVASTFQNQVNKYQDSRADIVFMLDSSGSVGENNFQTEITFVENFL 73 Query: 180 -----------LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + + I+ + + + Sbjct: 74 SQLTISPQASRVAIVSFDDAARTHIDYIKNPKNKCSFLRELKTVKYTGGSTNAEDAFRLA 133 Query: 229 AAL---KNA-----------LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 L K+A + + + L+ + + ++ + +I+ + Sbjct: 134 QELLRPKSAFANNQPVKQTEITFVENFLSQLTISPQASRVAVVSFDNHARTHIDYINSPK 193 Query: 275 KVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 ++ T++ + A ++L + + ++ + Sbjct: 194 NKCSFLRELKAVKYTGGSTNAEDGFRLAQELLRPQSAFTNYQP------------VKQVV 241 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKE-NFIKIVTISIN 370 ++LTDG K + + + + K +I +I ++ Sbjct: 242 VYLTDG-----KPDKDPVGRANNLKSVYNAEIYSIGVD 274 >gi|270008952|gb|EFA05400.1| hypothetical protein TcasGA2_TC015572 [Tribolium castaneum] Length = 767 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/261 (13%), Positives = 82/261 (31%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + + L + + ++ D+S Sbjct: 174 VVFVLDHSGSMRGRKYEQLMQAMDKILSDLN-PDDLFHIVRFSDDVSV------------ 220 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K ++ + + + + + I+ + Sbjct: 221 -------WNLEKNKFDQIR------------FEQMPDYENLDTSLAELNLGEAIQVTEDN 261 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K + + D ++ T+ + ++ ++ + N Q I Sbjct: 262 IKKAKRIKDDDVNM--GCTNIIGGLAVGLYLVRRTLQKFYEKNVA--------TKHQPMI 311 Query: 334 IFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N SN + I KI K + N I ++S + L K + Sbjct: 312 IFLTDGLPNVGISNPDEITKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSR 371 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 372 HIYEAADAALQLQNFYRTVSS 392 >gi|300776964|ref|ZP_07086822.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502474|gb|EFK33614.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 800 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 65/211 (30%), Gaps = 44/211 (20%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + ++D+SGSM K+ LK++ + L+ + Sbjct: 436 PASNLVFLIDVSGSMSDE-------------------NKLPLLKSSFKVLLNQLRPKD-- 474 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 +G++ Y + P+ EK + + T ++ AY++ + Sbjct: 475 ----KVGIVVYAGSAGMVLPPTSAGEKDK-IIEALDRLQAGGSTAGGAGIELAYKLAQEN 529 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 + +I TDG+ N S+ + + + +++ + + + Sbjct: 530 FVKEGNN----------------RVIIATDGDFNVGTSSISDLKTLIEDRRKSGVFLTCL 573 Query: 368 SINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 L+T + + N Sbjct: 574 GFGMGNYKDNTLETLADKGNGNYAYIDNMQE 604 >gi|189238321|ref|XP_972429.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 653 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/261 (13%), Positives = 82/261 (31%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + + L + + ++ D+S Sbjct: 248 VVFVLDHSGSMRGRKYEQLMQAMDKILSDLN-PDDLFHIVRFSDDVSV------------ 294 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K ++ + + + + + I+ + Sbjct: 295 -------WNLEKNKFDQIR------------FEQMPDYENLDTSLAELNLGEAIQVTEDN 335 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K + + D ++ T+ + ++ ++ + N Q I Sbjct: 336 IKKAKRIKDDDVNM--GCTNIIGGLAVGLYLVRRTLQKFYEKNVA--------TKHQPMI 385 Query: 334 IFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N SN + I KI K + N I ++S + L K + Sbjct: 386 IFLTDGLPNVGISNPDEITKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSR 445 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 446 HIYEAADAALQLQNFYRTVSS 466 >gi|187466178|emb|CAQ51888.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQRIVHM 382 >gi|148680762|gb|EDL12709.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQRIVHM 382 >gi|88702489|gb|ABD49099.1| integrin alpha E [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQRIVHM 382 >gi|25742632|ref|NP_113956.1| integrin, alpha E [Rattus norvegicus] gi|3236342|gb|AAC23662.1| integrin alpha E1 [Rattus norvegicus] gi|187466179|emb|CAQ51889.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQRIVHM 382 >gi|2497428|sp|Q60677|ITAE_MOUSE RecName: Full=Integrin alpha-E; AltName: Full=Integrin alpha M290; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor gi|535477|gb|AAC52142.1| alpha M290 integrin [Mus musculus] Length = 1167 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDRFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQRIVHM 382 >gi|74150598|dbj|BAE32321.1| unnamed protein product [Mus musculus] Length = 1166 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 207 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 263 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 264 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 308 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 309 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 368 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 369 GLLSKLQQRIVHM 381 >gi|84685160|ref|ZP_01013059.1| hypothetical protein 1099457000257_RB2654_09844 [Maritimibacter alkaliphilus HTCC2654] gi|84666892|gb|EAQ13363.1| hypothetical protein RB2654_09844 [Rhodobacterales bacterium HTCC2654] Length = 208 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 19/188 (10%), Positives = 48/188 (25%), Gaps = 15/188 (7%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + E + + + FLLL LI + + A A+ A Sbjct: 3 LRGHIIRFRRDEGGAALVEMGIIMPLFLLLAFGLIDFGRLGFAYVMAQKATEQAVRAAVV 62 Query: 67 KMVSNLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + SI + + D + + + A + Sbjct: 63 RTPACDGVPAATTRSIGWSLTPYGTACSAGDGLCADPGSVSCTAAQSGATGAEIYARITA 122 Query: 123 NSSRISM-----------THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + ++ + +D+ + + + L+ + Sbjct: 123 LLPTNATPANLRFGYDYDGNLGFVGGPYTPRVTVEIVDLDFEFVTPIGALATLMGANADG 182 Query: 172 KIVSFIPA 179 K+ + I Sbjct: 183 KVGAAIAF 190 >gi|186472639|ref|YP_001859981.1| von Willebrand factor type A [Burkholderia phymatum STM815] gi|184194971|gb|ACC72935.1| von Willebrand factor type A [Burkholderia phymatum STM815] Length = 332 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 73/248 (29%), Gaps = 46/248 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 I +++D SGSM+ A++S +V AP + TK ++A+ Sbjct: 71 GFSGEQTRITGTGAEILILMDGSGSMNQAISSGSMNVADAPTAGE---TKNQMARDAITA 127 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F+ + + + + + T + + + + T Sbjct: 128 FVAQ-------RANDRLAFMLFGTHPMLAVPFTRNRTVIDAAIAATGVGRGTPDTLLDRG 180 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++ A ++ + S + I+ ++DG +V I Sbjct: 181 IQSAVELFDGRPRTS-----------------SRAIVLVSDGG--ARLDDVAREHIRAGL 221 Query: 358 KENFIKIVTISINASPNGQRL---LKTCVSSPE--------------YHYNVVNADSLIH 400 N + + I + + L L SPE Y + + Sbjct: 222 SRNGVALYFIYLRSGIYSPDLHVRLADADHSPEAELHRFFLSLPTAYRLYQADSPQQVAR 281 Query: 401 VFQNISQL 408 +I++ Sbjct: 282 AMSDIART 289 >gi|257791506|ref|YP_003182112.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] gi|257475403|gb|ACV55723.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] Length = 213 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 42/162 (25%), Gaps = 11/162 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA-------ILAGASK 67 E+ + ++ F + L + F + + KK +E+ N A + +K Sbjct: 17 CRRERGDLNLTFLMLFFLVLGFLAFAVDASQFLTKKTLLENTLNLAREERMAPSVTLVAK 76 Query: 68 MVSNLSRLGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + R + + D + + S Y ++ Sbjct: 77 NSQEPDAVIARSVAGTLRDANYQGDIDVYFYEVPEAQLPPSKRDTER-VYAYQVVLTEQV 135 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 + + + + + + +R F Sbjct: 136 KTVFASIFGVYDIPLKSTVVAVSNPYSEYKVWRPAFSRSTKW 177 >gi|160889564|ref|ZP_02070567.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270296688|ref|ZP_06202887.1| conserved hypothetical protein [Bacteroides sp. D20] gi|317480056|ref|ZP_07939168.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861081|gb|EDO54512.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270272675|gb|EFA18538.1| conserved hypothetical protein [Bacteroides sp. D20] gi|316903798|gb|EFV25640.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 342 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 65/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D ++ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDK-------MQNDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESIDPSLISKQGTAIGAAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + E I++ + + Sbjct: 192 TVIVITDGENHEGGAVEAAKAA----AEKGIQVNVLGVGMPEGAPIPIEGTNDYRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N Q + + + V N + ISQ + Sbjct: 248 NVIVTRLNEQMCQEIAQAGNGIYVRVDNTNGAQKA---ISQEI 287 >gi|45382993|ref|NP_990865.1| collagen alpha-3(VI) chain precursor [Gallus gallus] gi|1345652|sp|P15989|CO6A3_CHICK RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|211622|gb|AAA03201.1| alpha-3 collagen type VI [Gallus gallus] Length = 3137 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 53/176 (30%), Gaps = 17/176 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ L + + +GL+ ++ + + + + R + Sbjct: 665 DFVVTLVNYLDVGTDKIRVGLVQFSDTPKTEFSL-YSYQTKSDIIQRLGQLRPKGGSVLN 723 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 S +F G + + ++ +T G + +++ + Sbjct: 724 TG---------SALNFVLSNHFTEAGGSRINEQVPQVLVLVTAGRSAVP-----FLQVSN 769 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + T ++ + L+ +P+ Y + + L + Q + + + Sbjct: 770 DLARAGVL--TFAVGVRNADKAELEQIAFNPKMVYFMDDFSDLTTLPQELKKPITT 823 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 54/167 (32%), Gaps = 21/167 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--D 293 ++ I+ L + +++G++ Y+ + + + K + + Sbjct: 262 DFLVNLIESLRVGAQQIHIGVVQYSDQPRTEFALNSYSTKA-DVLDAVKALSFRGGKEAN 320 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + L F + G + ++ ++ GE+++ Sbjct: 321 TGAALEYVVENL-----------FTQAGGSRIEEAVPQILVLISGGESSDDIREGLL--A 367 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +A I + SI L+ + + + ++ +L Sbjct: 368 VKQAS-----IFSFSIGVLNADSAELQQIATDGSFAFTALDIRNLAA 409 >gi|326428615|gb|EGD74185.1| hypothetical protein PTSG_12412 [Salpingoeca sp. ATCC 50818] Length = 1720 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 53/178 (29%), Gaps = 21/178 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 L DSI L + G + ++T E + + T Sbjct: 232 DNWNRVLNFTADSISTLRTIDPQAQFGAVVFSTTAEIAVPLQAFDSQQAVQDTVRTLPYA 291 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T + A+ + + SD + + ++ +TDGE + + Sbjct: 292 GESTATGNALNLVRREIFSDDAAAISGFRGGRA----------VVVLVTDGETL--ELDG 339 Query: 349 NTIKICDKAKEN---FIKIVTISINASPNGQRLLKT---CVSSP--EYHYNVVNADSL 398 D+ + + + + A + +LL+ S P + + D L Sbjct: 340 VLENAADRLHATGPLGVDVFVLGVGA-ASSPQLLQDVYTAASGPPETHVFTPDVFDQL 396 >gi|221133174|ref|XP_002171310.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 221 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 54/155 (34%), Gaps = 18/155 (11%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +G+I ++ R + +++ + + V + L+ T A++ A + + Sbjct: 50 SSNGSRVGVITFSYRAKLSVKLNSFTDLSSFNEAVDKIP--LMNFTTRIDRALRLAQKDM 107 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K II LTDG + +I D+ + + + I+ Sbjct: 108 FTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVIL 154 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I ++ N + L + Y DSL Sbjct: 155 GVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 188 >gi|284031227|ref|YP_003381158.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] gi|283810520|gb|ADB32359.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] Length = 279 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 13/138 (9%), Positives = 36/138 (26%), Gaps = 11/138 (7%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNS----MESANNAAILAGASKMVSNL 72 +E+ S + + L G +I K+ + A A + Sbjct: 44 NERGALSPAVVILALMIFTLAGLVIDGGRQLGAKSRAVGYAQEAARAGVGTIDFNSAQAR 103 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + E++ N + +D + ++Q + + + + Sbjct: 104 IDVAKAGEAVGNFCAKVTENDPAVTSCATSELDPEH-------LKVQVQIANKTSFLGMI 156 Query: 133 ANNRLDSSNNTIFYNMDV 150 L ++ + Sbjct: 157 GIQSLTANGEGEAHAEQG 174 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 20/291 (6%), Positives = 60/291 (20%), Gaps = 44/291 (15%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + + + + + + + + + I Sbjct: 3 DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNNR----IGEVKIGVDRFVDTLADSGITD 58 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + + + ++KN + S + Sbjct: 59 KI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRD 111 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 + + ++ + D + ++ + T Sbjct: 112 AGSMLSVPNGH----------KKVIVLLTDGVPTFSYKVQ-------RVHAQSSSNYYGT 154 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINAS 372 F + + I D N + + + TI KE I+I + I Sbjct: 155 QFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQ 214 Query: 373 PNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + L + +Y +A + + + Sbjct: 215 SDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 265 >gi|85705210|ref|ZP_01036309.1| hypothetical protein ROS217_17117 [Roseovarius sp. 217] gi|85670083|gb|EAQ24945.1| hypothetical protein ROS217_17117 [Roseovarius sp. 217] Length = 179 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 12/59 (20%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 4 LSR-FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++R F+ + ++ E+ S+ F L FL ++ F I + + +E + A+ Sbjct: 1 MTRAFKNFLRRFRRDERGTASLEFVLVAPFFLGIMIFSIELGFVTMRATFLERGLDMAV 59 >gi|145596106|ref|YP_001160403.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145305443|gb|ABP56025.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 576 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 100/361 (27%), Gaps = 43/361 (11%) Query: 51 NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 ++ + AA L+ + N + ++ +D + S Sbjct: 248 TTLARSLGAAALSEEDLLSYNARKPAVPLAALYPEPAANPLDYPYAVLPGIGPAKASAAQ 307 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 +F + + +S + + + Sbjct: 308 MLFDVLTTASFKDRLALSSLRAPDGTWGAGFSAPAGAPSPAAD-----------GGNAAG 356 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + + + V+D+SGSM + S +++ + A Sbjct: 357 DLDPLAVERAVSSWSIATQSGRMLCVIDVSGSMREPVASAN--------GVSRQQVTLDA 408 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L LF DS + + +G R+ + S ++ Q + + Sbjct: 409 AGRGLHLFDDSWQ-IGLWEFSTNLGSGRDYRRLVEIGPLSSQRSELEQALAQIQ--PTRG 465 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + AY+ + D I+ TDG+N++ Sbjct: 466 DTGLFDTVLAAYEAVQEDW----------------DEGQVNSIVLFTDGKNDDDNGISQQ 509 Query: 351 IKICD--KAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNI 405 I + + K+ +++V I I A + L + + + + +F + I Sbjct: 510 QLIAELERIKDPERPVQVVLIGIGADVSKAELESITEVTGGGSFITEDPTKIGDIFLKAI 569 Query: 406 S 406 + Sbjct: 570 A 570 >gi|4809045|gb|AAD30063.1| Itgae protein [Mus sp.] Length = 895 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 59/193 (30%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 195 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 251 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG+ Sbjct: 252 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDGDIFG- 297 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 298 -DPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 356 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 357 GLLSKLQQRIVHM 369 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 52/160 (32%), Gaps = 29/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D +++ K + +D +GL+ + +++ + Sbjct: 98 DVSRSMLAEDVSPSRLDRAKQQIKDMVDE-------MSGDRVGLVVFAGETRQSVPLTSH 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E +Q + S+ + A++ A K Sbjct: 151 YEDFKQSLDAVGPHSVRRGGSLLGDAIRSATAGFIDKTND------------------HK 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I+ TDGE+ K + +N I+I T+ + Sbjct: 193 AIVVFTDGEDQESKP---VEAAKEAFTKNGIRIFTVGLGD 229 >gi|71988814|ref|NP_499400.2| CUTiclin family member (cut-6) [Caenorhabditis elegans] gi|35210138|emb|CAA97806.2| C. elegans protein M142.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 572 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 53/176 (30%), Gaps = 22/176 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + L+ ++ +M LI Y + + Sbjct: 61 DTFKKQLSFAMHLASRLNISEDGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ 120 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ +G++ K I +TDG++ + S Sbjct: 121 SGATNTGQALRLTL----------------EKGLQGARPGIPKVAIVITDGQSQDDVSEP 164 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + ++ + + I + N +L + +P + V + + L + Sbjct: 165 S-----QLLRDADVMVYAIGVTNLVNVHQLHQM-TGNPVRVFTVESFEQLDRALAD 214 >gi|298247107|ref|ZP_06970912.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297549766|gb|EFH83632.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 30/185 (16%) Query: 228 MAALKNALLLFLDSIDLLSH---VKEDVYMGLIGYTTR--VEKNIEPS---WGTEKVRQY 279 LK + L D + ++ + + + + +K+R Sbjct: 383 WDQLKESSELLFDQTKARQYLLQTHPQDLTSVMVFNSDIAAGPDGSWTVEGNDPQKMRGL 442 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ ++++ ++ P+ ++ II +TDG Sbjct: 443 YDNIQAREPDGGTNMYACLQRSVELFKQQ----------------PNENRKRLIIVMTDG 486 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 ++ +I + +++++ + + +L + ++ +V D+++ Sbjct: 487 QSEKGNG---VDQIIQSVASLGVPVISVAFGSDADVTQLNEISTATHG---SVTKKDNMV 540 Query: 400 HVFQN 404 + Sbjct: 541 DAMRE 545 >gi|47218289|emb|CAG04121.1| unnamed protein product [Tetraodon nigroviridis] Length = 979 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/271 (9%), Positives = 70/271 (25%), Gaps = 55/271 (20%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 + + + + + Q + + L + + + +++ H + Sbjct: 313 NIVFVIDVSGSMWGVKMKQTVEAMQAILDDLTV-DDSFSIVDFN-------HNVRCWSED 364 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 V + + +T + +K + + Sbjct: 365 LVPGSSVQVKDAKTYIENIK--PNGGETQTQEAPPPPSPHADCGVSVAS----------- 411 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T+ A+ +A Q+L + Sbjct: 412 ------------------GTNINEALMRAVQML-----------VRASHQGLVDPRSVSM 442 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY---- 388 II ++DGE + K +A + ++ I + L + + Sbjct: 443 IILVSDGEPHREIKLSAIQKNVKRAMREEFSLFSLGIGFDVDFDFLERIATENRGVAQRI 502 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + N AD L ++ ++ ++ R+ V Sbjct: 503 YANQDTADQLRSFYRQVAAPLL-RRIDVQFP 532 >gi|307544656|ref|YP_003897135.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] gi|307216680|emb|CBV41950.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] Length = 612 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 48/157 (30%), Gaps = 11/157 (7%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRS 312 G+ + RV+ + +E+ R+ +L + TD A++QA + + Sbjct: 77 GVWTFGERVDNPLPLGEVSEQWRERALALPPALRDYQQYTDIETALEQAASAEANGWRHL 136 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + + + + + + + I+ + Sbjct: 137 ILMTDGVIDLPPSRGNKPGI---------DEASRQRLVESMAPRFADRGVVVHAIAFSDE 187 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + + + +SL+ F +I + + Sbjct: 188 ADLALVEQLAQRTGGLASVAKSPESLLGAFLDIIERI 224 >gi|167842405|ref|ZP_02469089.1| hypothetical protein Bpse38_37423 [Burkholderia thailandensis MSMB43] Length = 142 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 34/113 (30%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + E+ S+ F L +L++ ++ K + +A+ A AG V Sbjct: 4 VSRLVRDERGVVSLEFVLVFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGVVVRVP 63 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 L+ ++S + ++ +S + Sbjct: 64 QLAATDIANVALSYAQGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYTYQ 116 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 27/232 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 +I V+ + G V D + K+ K A L LD + + Sbjct: 22 IIPFVLSVEGVYSAHPPIAFLIVIDTSYSMDGE--KIFRAKQAALRLLDILRDKDY---- 75 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G+ G+ + K +EP T + + + T+ + + L + K+ Sbjct: 76 --VGVYGFAGKFYKVLEPVPATNRNEVE-KAIIGLKLGSGTNIYDTL----KKLVEETKK 128 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIVTISIN 370 + IIF+TDGE + +++ K +E + I + Sbjct: 129 VLESGAISLVR----------IIFITDGEPTTGQKKPEKILEMAKKLREAGASALIIGVG 178 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV---FQNISQLMVHRKYSVILK 419 N + L + + +V + SL + + +Q + + +V+ + Sbjct: 179 TEYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYAKSTQEISAKNVAVLFR 230 >gi|113476849|ref|YP_722910.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167897|gb|ABG52437.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 441 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 70/248 (28%), Gaps = 56/248 (22%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 RP V+D SGSM + R K+ + AL + I Sbjct: 35 SASRPSTTFTFVIDTSGSM--------------YDDSEVGRPKIDIVVEALERLVTDIQA 80 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + L+ + + + T+ V T ++++ + Sbjct: 81 DPRDR----IALVQFDDSASVLLPLTAATDTVTLQNAISKLRSFSGGTRMALGIEKSLNL 136 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + + TDG+ + ++ + + I I Sbjct: 137 LKDSVLS------------------SRRTLIFTDGQTIDEID---CRELAVQFAQAGIPI 175 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNV------VNADSL------IHVFQNI----SQL 408 + + N L+ + +NV + +FQ + +++ Sbjct: 176 TALGVG-DYNEDLLVYLSDHTGGRVFNVVEQASNTGTTDIAISELPQTIFQEVQQAQAEV 234 Query: 409 MVHRKYSV 416 + + K SV Sbjct: 235 INNLKLSV 242 >gi|326668779|ref|XP_002662551.2| PREDICTED: collagen alpha-1(VII) chain-like [Danio rerio] Length = 2698 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 51/167 (30%), Gaps = 26/167 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ-AYQILT 306 + G + Y+ + + + T + +K A + Sbjct: 69 GPNGIRFGAVQYSDTARVEFTFTAYLNGTELITAVENINYKGGNTRTGAGLKYIADNFFS 128 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 R K I +TDG++ + N ++ K + +KI Sbjct: 129 PASIRD----------------VPKISILITDGKSQD-----NVLEPSQKLRGLGVKIFA 167 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVV--NADSLIHVFQNISQLMVH 411 + I ++ + LK S P+ + N +L + +S+ + Sbjct: 168 VGIKSADPAE--LKLIASPPQSEFTSQIGNFRALSSLLPLVSRRVCT 212 >gi|262199490|ref|YP_003270699.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082837|gb|ACY18806.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 808 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/257 (14%), Positives = 77/257 (29%), Gaps = 49/257 (19%) Query: 163 HLLN-QRYNQKIVSFIPALLRIEMG-------ERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + R + + LL + ERP ++ +VD+SGSM Sbjct: 427 SMGMVSRPSLTQDGAVDYLLGANVTVPNLTREERPHAVVTFLVDISGSMA---------E 477 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PSW 271 S + T+M ++ L + ++ V ++ + + +E Sbjct: 478 YSPTVDAGGAPTRMDIVREGLWKAVSALKPGDIVN------VVSFDDAAQIELERGEIRP 531 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G R Y+ + L T+ + ++ AY++ + Sbjct: 532 GAATPRPYLRSVLRLLPRGGTNLSAGIEVAYRV----------------ARRNYDPYRIN 575 Query: 332 FIIFLTDGENNNFKSNVNTIK----ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 +I LTD N + + I I D + I + + N L Sbjct: 576 RVIILTDAYANRGSIDPSLIGDHVLIGD---DEGIHFSGLGVGYDFNEDFLNTLTDVGRG 632 Query: 388 YHYNVVNADSLIHVFQN 404 +++++ F Sbjct: 633 TYFSLITERDAARAFGE 649 >gi|153002167|ref|YP_001367848.1| putative outer membrane adhesin-like protein [Shewanella baltica OS185] gi|151366785|gb|ABS09785.1| putative outer membrane adhesin like proteiin [Shewanella baltica OS185] Length = 1215 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 58/198 (29%), Gaps = 30/198 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGTEKVRQ 278 + + K A + +D+ S MGL+ ++ R + + Sbjct: 326 GSPIDNAKQAAKILVDATAEGS-----TAMGLVSFSGRSSVKQDFAMQKMPKPDNGVKQA 380 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + A L++ + + + L D Sbjct: 381 LKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGA-------------PGVVFVLAD 427 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DS 397 G++NN + +++ + + I + ++ L+ ++ +++ Sbjct: 428 GDDNNSIKSESSVITAYQ--NANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLAE 485 Query: 398 LIHVF---QNISQLMVHR 412 +I F I+ + Sbjct: 486 IIDAFLQANAIATDNQNL 503 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 58/167 (34%), Gaps = 20/167 (11%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + LI Y+ ++ + + D + T + +++ + L K Sbjct: 192 NTLFSLIQYSHLLKIHFTFTQFQSSWNPLSLVDPIVQLDGLTYTATGIRKVVEELFHSKN 251 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +K +I +TDG+ +K + + +A+ I I + Sbjct: 252 G-------------ARKSAKKILIVITDGQK--YKDPLEYSDVIPQAERAGIIRYAIGVG 296 Query: 371 AS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 + P+ ++ L S P ++ + V N +L + + + + + Sbjct: 297 DAFWKPSAKQELDNIGSEPAQDHVFRVDNFAALSSIQEQLQEKIFAL 343 >gi|149377773|ref|ZP_01895506.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter algicola DG893] gi|149357945|gb|EDM46434.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter algicola DG893] Length = 973 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/283 (13%), Positives = 73/283 (25%), Gaps = 75/283 (26%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I ++D+SGSM A T++ +K+ ++ + Sbjct: 42 PNIMFILDVSGSMGTADVGGK--------------TRLRVMKDVTKDLFADMEDV----- 82 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL----- 305 +V + + G S K + D T + + Q+ + Sbjct: 83 NVGLMVFGGNEGGYFKSAVSPIENKRAALIDSIEDLSDGGNTPLSETLFQSMRYFQGEDY 142 Query: 306 --------TSDKKRSFFT---NFFRQGVKIPSL----PFQKFIIFLTDGENNNFK----- 345 D ++ + K S ++ LTDGE Sbjct: 143 FIRYWDEPYQDANGNWVADVPPGVTEDGKYKSPIEYECQPNSVVLLTDGEPTEDTNHEGD 202 Query: 346 ----------SNVNTIKICDKA-----------------KENFIKIVTISINASPNGQRL 378 + +I + I T+ Q+L Sbjct: 203 FEAVLGSGACVDNCLDEIAGYMWENDMIPPSNDPDDDFRGDQRISTYTVGFKTD---QKL 259 Query: 379 LKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 L + ++ NA SL F ++ ++ R + G Sbjct: 260 LGDAAAKGNGTYFKAENAASLKSAFDDLFTDVLARSTTFAAPG 302 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 38/113 (33%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + E + + A+ + L+ + L +L + ++ +++ + A L A S Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAARYTGERPS 60 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S++ + A R I+ + S + N + Sbjct: 61 AVRQARVSVAVNDSFRAERLASRQIELGRWQDGSFSDMDASDPRRPNAARVTV 113 >gi|325914146|ref|ZP_08176499.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] gi|325539649|gb|EGD11292.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] Length = 525 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 59/202 (29%), Gaps = 33/202 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 K+ L+++L L + + L+ Y + P+ G ++ R Sbjct: 165 VSGSMGAPDKLPLLQSSLKLLTRQLRAQD------RITLVTYAGNTAVVLPPTPGNQQAR 218 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 V T ++ AY+ R I+ T Sbjct: 219 -IVEAIDSLQSGGGTAGASGIELAYKAAQQSYLRDGIN----------------RILLAT 261 Query: 338 DGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-- 394 DG+ N ++ + + + + + + T+ L++ + + Y ++ Sbjct: 262 DGDFNVGVTDFDTLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSP 321 Query: 395 -------ADSLIHVFQNISQLM 409 L + I++ + Sbjct: 322 LEARKVLTHELGATLETIARDV 343 >gi|160876887|ref|YP_001556203.1| outer membrane adhesin-like protein [Shewanella baltica OS195] gi|160862409|gb|ABX50943.1| outer membrane adhesin like proteiin [Shewanella baltica OS195] gi|315269091|gb|ADT95944.1| outer membrane adhesin like proteiin [Shewanella baltica OS678] Length = 1215 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 58/198 (29%), Gaps = 30/198 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGTEKVRQ 278 + + K A + +D+ S MGL+ ++ R + + Sbjct: 326 GSPIDNAKQAAKILVDATAEGS-----TAMGLVSFSGRSSVKQDFAMQKMPKPDNGVKQA 380 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + A L++ + + + L D Sbjct: 381 LKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGA-------------PGVVFVLAD 427 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DS 397 G++NN + +++ + + I + ++ L+ ++ +++ Sbjct: 428 GDDNNSIKSESSVITAYQ--NANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLAE 485 Query: 398 LIHVF---QNISQLMVHR 412 +I F I+ + Sbjct: 486 IIDAFLQANAIATDNQNL 503 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 52/194 (26%), Gaps = 33/194 (17%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K+ LK + L L + + ++ Y + +EP+ +++ + + Sbjct: 367 NKLPLLKQSFRLMLGQLGEED------MVSIVTYAGSAGRVLEPTKASDR-QTILDALDR 419 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++QAY T + +I TDG+ N Sbjct: 420 LEAGGSTAGQAGLQQAYATATE----------------MARDGAVSRVILATDGDFNVGI 463 Query: 346 SN-VNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVN--------A 395 S+ + + + + + + + + Sbjct: 464 SDPDDMKDYIETQRGTGTYLSVLGFGRGNLDDATMQALAQHGNGMAAYIDTLSEAQKVLV 523 Query: 396 DSLIHVFQNISQLM 409 D L I+ + Sbjct: 524 DQLTGALVPIADDV 537 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/334 (10%), Positives = 93/334 (27%), Gaps = 24/334 (7%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA---GASKMVSNL 72 ++ + ++I A+S++S L L I + + +N +++A +AA LA + Sbjct: 7 QKQRGSVAVITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLYQRAQCG 66 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + K + A + Y + + T Sbjct: 67 NTTATEPDWTTATQKASSFATASTSNTVQGSVIKVAQTGSGYWNITGSPGTLETVPFTP- 125 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 N L + TI + + + V+ + A Sbjct: 126 GTNDLPAIQVTITKSAANA---------NGSVPVYLASILGVTSLSASATATAAVSRPGY 176 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + +M + + D ++ K+ + + Sbjct: 177 VGPGGLFPLAMSKCLFDNYWDSSTNSPKLATSTAKIP----------GQDFNQTPNTPYI 226 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + Y + + + T + + VT + + TDS Q + ++ + Sbjct: 227 FQVSSSYQVNGCEAGQWTTLTSQ-QNNVTFVRGLIAGQNTDSLGIGSQPGTYIQPGEENT 285 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 +T + + ++ +++ Sbjct: 286 LYTAVNDCSAAGDHTCEYETVPVVSSLGTGSYQP 319 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 64/192 (33%), Gaps = 35/192 (18%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI----------GYTTRVEKNIEPS 270 K+RT++ K AL L ++ V ++ + Y+TR+ Sbjct: 302 GTKRRTRLDVAKAALSTILSTLLPQDKVGVVLFNSKVTLAGSSGVDECYSTRLAPAGRF- 360 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 ++ ++ R S T A K A+ +L S K Q Sbjct: 361 -NVNYLKDFINR---SRPGGGTQYQNAFKAAFTLLKSAKSGDGGGE-------------Q 403 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLL-KTCVS 384 F++FLTDG + + I KE + I+TI + + + L + + Sbjct: 404 SFLLFLTDGGPKD-DALEVERLIAQNKKEMEESRERVTIMTIGLGKDEHMKDFLGRLSKN 462 Query: 385 SPEYHYNVVNAD 396 + V N Sbjct: 463 VGSKYSQVDNEA 474 >gi|296166599|ref|ZP_06849029.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295898085|gb|EFG77661.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 897 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 57/224 (25%), Gaps = 30/224 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSW 271 D + + D K ++ A K + D D + + G Y R S Sbjct: 66 DESGSLKSTDPKAARVDAAKYLVKTLGDYADRTNVNLDVAIAGFSEGYAVRQGWRRLDSA 125 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF----------FTNFFRQG 321 + V + TD A+ A Q L + + Sbjct: 126 SVDSVDDQLQTLAQHNSGADTDYWLALDGARQTLADHVQGNPNRCQAIAWFSDGKLDFSK 185 Query: 322 VKIPSLPFQKFIIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 + DG S + D+ + + ++ + + + Sbjct: 186 RPGGRPYAPGVDLGSDDGVAEMIRRATDSICRPGGLADQLRAAHVVMLGVGLGNDDSDFD 245 Query: 378 LLKTCVSS---------------PEYHYNVVNADSLIHVFQNIS 406 ++ + P Y V N D ++ F ++ Sbjct: 246 VMSAISTGAGPNGTRCGGITDPKPGAFYRVSNIDQMLFAFDALN 289 >gi|197106455|ref|YP_002131832.1| hypothetical protein PHZ_c2994 [Phenylobacterium zucineum HLK1] gi|196479875|gb|ACG79403.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 416 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/218 (9%), Positives = 51/218 (23%), Gaps = 9/218 (4%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 F++ ++ LS++ L +G W+ + + A ++A + A + Sbjct: 6 LFRRLRDDRSGASALTTGLSMVVILGFVGLGTDAGAWYAARRDAQHAADSAAFSAAVAVH 65 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + +I+ +D ++ E+ + R Sbjct: 66 AGGGDATGEARAIAATYGATHGED---------GVTVRVNRPTADRVEVIVERPARRFFS 116 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + + L V+ L Sbjct: 117 RLYRDQDGVIRARAVAQTGEGGDGCVLALDPKADAAASLTGGASVNLNGCDLVANSTSAT 176 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 L+ +A +D RT+ Sbjct: 177 ALDATGGARLNADGVALGGDYRVRGGAAINARDGVRTR 214 >gi|217974748|ref|YP_002359499.1| outer membrane adhesin-like protein [Shewanella baltica OS223] gi|217499883|gb|ACK48076.1| outer membrane adhesin like proteiin [Shewanella baltica OS223] Length = 1215 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 58/198 (29%), Gaps = 30/198 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGTEKVRQ 278 + + K A + +D+ S MGL+ ++ R + + Sbjct: 326 GSPIDNAKQAAKILVDATAEGS-----TAMGLVSFSGRSSVKQDFAMQKMPKPDNGVKQA 380 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + A L++ + + + L D Sbjct: 381 LKGAIDNIYANGSTALFDGSQLALDNLSAYQASAASGA-------------PGVVFVLAD 427 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DS 397 G++NN + +++ + + I + ++ L+ ++ +++ Sbjct: 428 GDDNNSIKSESSVITAYQ--NANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLAE 485 Query: 398 LIHVF---QNISQLMVHR 412 +I F I+ + Sbjct: 486 IIDAFLQANAIATDNQNL 503 >gi|209886336|ref|YP_002290193.1| TadE family protein [Oligotropha carboxidovorans OM5] gi|209874532|gb|ACI94328.1| TadE family protein [Oligotropha carboxidovorans OM5] Length = 145 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 14/132 (10%), Positives = 44/132 (33%), Gaps = 7/132 (5%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + +RFR + + + ++ FA+ + FL+L+ ++ + + ++ L Sbjct: 1 MTTRFRRF-RDFSKARSGASAVEFAIVLPVFLMLVFGIVMFGAYLALVHDVQQ------L 53 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + S S++ + + + + S + Sbjct: 54 AAEAARTSVAGLNETERRSLAASYVAQNAASYPLIVPAQLSVNAATSSTDPNVFIVTVNY 113 Query: 123 NSSRISMTHMAN 134 ++S + + + Sbjct: 114 DASHTFIYTLPS 125 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 56/184 (30%), Gaps = 37/184 (20%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + ++D+S SM A + ++ K I +S Sbjct: 86 RKGADMVFLIDVSRSMRAADV---------------QPDRLTQAKF-------EISRISR 123 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILT 306 +I + + + + + D + + TD A++ A+ +L Sbjct: 124 AVTGGRRAIILFAAKPLVQCPLTADMDAFEALLGMASPDLIEAQGTDFRSALELAHNVLE 183 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + K ++ L+DGE++ ++ K I + Sbjct: 184 PSSESRLASAAKGE----------KIMVLLSDGEDHAGD----LPAAANRIKNGRIHVFA 229 Query: 367 ISIN 370 I + Sbjct: 230 IGVG 233 >gi|60681980|ref|YP_212124.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|60493414|emb|CAH08200.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] Length = 341 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 66/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGAAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + + + I++ + + Sbjct: 192 AIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + V N++S I+Q + Sbjct: 248 NVIVTRLNEAMCQEIAKEGNGIYIRVDNSNSAQKA---INQEI 287 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 62/179 (34%), Gaps = 25/179 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 A+ + K D +I TR + + S + VR+ S Sbjct: 66 FQRAVQAIQTMVAK------SKRDNRHAVITIATRAKTFMNFSSRADAVRKL---RGISR 116 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A++ A+Q+ T+ K S R ++ +TDG +N K Sbjct: 117 SGGKTNTQDALELAFQMFTTSKYGSTPGGLAR-------------VLVVTDGRSNIEKHR 163 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNI 405 K K N I++ I+I G L ++ + Y V + L V + I Sbjct: 164 TERKAF--KLKANGIEVFVIAIGDYLEGMDELARMANTKYAHMYRVEDVKGLARVVKLI 220 >gi|261854814|ref|YP_003262097.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261835283|gb|ACX95050.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 339 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 62/208 (29%), Gaps = 29/208 (13%) Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 S P +D+ ++ A+K LL F+ + +I T Sbjct: 114 DASPSMQAEDFPAGKDRFIARIDAMKQGLLRFIA-------ARPQDRFSVIVVTNSAGTL 166 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + + ++ + + T + A + + + + Sbjct: 167 VPMTTDHAVLDYWIRQLRAGINGSDTALGDGLAMAIRSIAAQSQAGQPAP---------- 216 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQR--LLKT 381 ++ TDG + + A+ IK+ T+++ + P+ + L + Sbjct: 217 -----LLVVWTDGFSTGGLMT--PAEALALARAYGIKLFTVNLAPKGSPPDQGQPSLAQL 269 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + ++ V I+ + Sbjct: 270 ADLTGGKPILASDLAAMNAVTDQIAASV 297 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 11/150 (7%), Positives = 44/150 (29%), Gaps = 2/150 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + + S++ A+ + + + I V ++ +++ + A ++ A ++ Sbjct: 10 RTRRNTRGAVSVMAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQLKRAD 69 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S + + + I + + T + + + H Sbjct: 70 SPANLNAAVLGTVSNIGAKNGYPSGIAM--GCAEAAGGGADAMTACVGVWDPATAGPRHF 127 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + ++ + + + F Sbjct: 128 SAVYNAATVSPNAVRVQATQTVPILFVFSG 157 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 63/215 (29%), Gaps = 42/215 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G R + S N + V + + T+ K L +DS+ Sbjct: 28 GRRNHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSVRPTEFEKAKEFLQDLVDSL--- 84 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + +GL+ Y + + T + A++ A + Sbjct: 85 EVGLDSTRVGLVNYAS-----TPLAS-------------------GTMTGLAIRTAVEKA 120 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + G ++ S + + +TDG + ++ A+E+ I+I Sbjct: 121 FA----------AEAGARLNSTKVARVAVVVTDGRPQD-----EVERVSAAARESGIEIY 165 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++ + L ++ + V + Sbjct: 166 AVGVDRADRTSLRLMASQPHEDHVFYVETYGVIEK 200 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 62/189 (32%), Gaps = 27/189 (14%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + D +K+A L +++ +G++ +++ P T Sbjct: 341 DVSGSMADGNPSKLALAVAGALSLVETARPED------RLGIVTFSSGPRWLFPPRPMTA 394 Query: 275 KVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + +D T A +QA + L + K I Sbjct: 395 RGKLEAKTLLDRLRPGGSTRMLEAYRQAIEALEAL------------------ELETKQI 436 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + LTDG+ + + + + A+ I+ ++++ + L + ++V Sbjct: 437 LVLTDGQ--VEEDPAALVALAEAARAQGIRTNSVALGGDADRALLARMSRVGEGRFWDVP 494 Query: 394 NADSLIHVF 402 + L +F Sbjct: 495 TPEDLPRLF 503 >gi|323702113|ref|ZP_08113781.1| Protein of unknown function DUF2134, membrane [Desulfotomaculum nigrificans DSM 574] gi|323532995|gb|EGB22866.1| Protein of unknown function DUF2134, membrane [Desulfotomaculum nigrificans DSM 574] Length = 342 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 47/149 (31%), Gaps = 7/149 (4%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R K+ + E + ++ AL++ + + + K+ + +A +AA LAG Sbjct: 33 RLSNMLKQLASREHGSAMVLLALAMTVLIGAVALVTDFGLVALNKSRLANAADAAALAGV 92 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + + IK ++ + E+ + Sbjct: 93 QELPEYPMEALAVAR-------SFGQQNGITDEQMQIKLVQDLATSRYVGVEVTATKTVN 145 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSY 154 I + +D + V ++Y Sbjct: 146 YIMAKVLGYTTVDVQAKAAAQVIPVTSAY 174 >gi|312963689|ref|ZP_07778166.1| calcium-binding outer membrane-like protein [Pseudomonas fluorescens WH6] gi|311282060|gb|EFQ60664.1| calcium-binding outer membrane-like protein [Pseudomonas fluorescens WH6] Length = 892 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/362 (8%), Positives = 83/362 (22%), Gaps = 29/362 (8%) Query: 53 MESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + A +AA + S + ++ + + + + + Sbjct: 503 LSGAADAATYKALLESFQYSSTSKYPAIGDRSIDVSVTDNEGGNSLSSSVATTTITVKSQ 562 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 YNT + + I + + N + + + + + + + N Sbjct: 563 VYNTVAEGGAAADTIDLGTSSANNIVVGDKGGITTPGSNYNIAFVVDTSGSIGSSAMNSI 622 Query: 173 I-------VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP---ICQD 222 + I G + L++ + + + + Sbjct: 623 KGQLSSIFDTLIANAKLTGSGVVKVLLVDFDTKVESLVSVNLADQAAAKAALQVVLNNMA 682 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY-TTRVEKNIEPSWGTEKVRQYV- 280 M ++A L + T ++ + + Sbjct: 683 SNPNDMTNYQDAFNAATAWFKSTDATSNIGAKNLTYFITDGEPNVYNYTYASVSLGSASY 742 Query: 281 ----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 D DS + T + + + T+ + + Sbjct: 743 WWGGNVDFDSKVNGNTYTLGQTTALTATVYDQTRVIVDTDGKVYSYDSSGTRILQGYVVS 802 Query: 337 TDG---ENNNFKSNVNTIKICDKAKEN----------FIKIVTISINASPNGQRLLKTCV 383 + S+ D AK +++ I + ++ + L K Sbjct: 803 DGHGGFDVAGAASSSRYDYALDNAKAAYTALTTSVPGGVEVQAIGLGSNVDKDMLQKYFD 862 Query: 384 SS 385 S Sbjct: 863 SD 864 >gi|260834995|ref|XP_002612495.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] gi|229297872|gb|EEN68504.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] Length = 443 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 46/195 (23%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVR 277 MA LK + FL + ++ + ++ + S V Sbjct: 11 TSGSMNGRGMAELKKGVRHFLLGVQETANKMSLRENVAVVEFGGGARIIQPLSGNYGTVM 70 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Q V T + +A + + +I +T Sbjct: 71 QSVDNLK---AGGTTPMFEGLMEAMKEILQRG------GVLTLPGGR---KMTPRVILMT 118 Query: 338 DGENNNFKSNVNTIKICDKAKENF---------IKIVTISINASPNGQRLLKTCVSSPEY 388 DG ++ N +K I I + + L + Sbjct: 119 DGYPDDK---ENVLKAALSFGPAGWQAVGLPHPIPIACVGCGDDVDKDLLQAIAKLTNGM 175 Query: 389 HYNVVNADSLIHVFQ 403 + + L F+ Sbjct: 176 YILG-DVSQLSEFFR 189 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 49/164 (29%), Gaps = 27/164 (16%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILT 306 + G+ + V + ++ R D++ T+ A++ A Sbjct: 89 PDGATAGVWTFGQYVNMLVPHREVSDAWRDMAIEQSDAINSVAMRTNLGAAIETASDGYF 148 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------K 358 + I LTDG+ + ++ ++ K Sbjct: 149 TGGV-----------------LSNTHFIVLTDGKVDISRNPSANKAEANRILDTLVPPLK 191 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + ++++A + + L K S + NA+ L F Sbjct: 192 QQGARFHAVALSAEADTEFLRKLASDSNGSFHVAENANDLSRAF 235 >gi|118357550|ref|XP_001012024.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293791|gb|EAR91779.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 853 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 39/350 (11%), Positives = 89/350 (25%), Gaps = 55/350 (15%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + + I + +L + + +N + + Sbjct: 154 NKYFVATIPLQYCEDNISAKLGINLLTLDLFCTGKITYAESRGQKTEKVIVDDNTVKFNL 213 Query: 142 NTIFYNMDVMTSYDYRLQFI-------EHLLNQRYNQKIVSFIPALLRIEMGERPIFL-- 192 + + + + + + + +P + + P F Sbjct: 214 SQALNQKGNELQFVFSFEGMFEPQIIFGSSRIFHEDYVKRAILPVSNSVMVSFIPNFNEE 273 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-------- 244 I VD + D + + + K+ + LD Sbjct: 274 ITKEVDDAVKAALNKGEDVICEEFQQKISSELVDHLNSSKSEFIFLLDRSGSMSGQSIKQ 333 Query: 245 --------LSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLIL-KPTD 293 L + D Y ++ + + EK S + ++ + Q V T+ Sbjct: 334 ACEALVLFLQSLPIDSYFNVVSFGSSFEKLFPSSQKYNSQNLEQAVQIINQYSANLGGTE 393 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + + + K I LTDG+ +N ++ Sbjct: 394 IYQPLHNVFN-------------------EKKIEGYNKQIFLLTDGQVDN------PKQV 428 Query: 354 CDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + K+N +I +I + Q + +T V V L V Sbjct: 429 VNLIKKNNKFSRIHSIGFGNDADKQLIQETAVYGKGISKIVNQNCDLQEV 478 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 70/223 (31%), Gaps = 37/223 (16%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + V+D+SGSM TKM K A+ + L + + Sbjct: 280 MEKNVVFVIDVSGSMF--------------------GTKMEQTKKAMNVILSDLRANDYF 319 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + I + Y+ R TD A+ A +L Sbjct: 320 NIISFSDTINVWKAGGSIQATIQNVHSAKDYLHRM---EADGWTDINSALLAAASVLNHS 376 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTI 367 + + + PS+ IIFLTDGE + + +A + + + T+ Sbjct: 377 NQ---------EPGRGPSVGRIPLIIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTL 427 Query: 368 SINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNIS 406 + + L + + + Y +A L +++ IS Sbjct: 428 AFGDDADFTLLRRLSLENRGIARRIYEDTDAALQLEGLYEEIS 470 >gi|297473612|ref|XP_002686715.1| PREDICTED: collagen, type XXVIII-like, partial [Bos taurus] gi|296488704|gb|DAA30817.1| collagen, type XXVIII-like [Bos taurus] Length = 327 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 66/186 (35%), Gaps = 32/186 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +KN + D + L +G+I Y+ +VE+ + + K +++ V Sbjct: 27 FQIIKNFVKTLTDRVALDLAT---TRVGIINYSHKVEEVAHLTQFSSKDDLKRAVDNMQY 83 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ A ++ + K +K + +TDG+ + + Sbjct: 84 L--GEGTYTATALHAANRMFEAAK-----------------PGVKKVALVITDGQ-TDTR 123 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSL 398 N ++ A + ++I I + + + + + P+ + Y + +L Sbjct: 124 DEKNLTEVVKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDDFITL 183 Query: 399 IHVFQN 404 + Sbjct: 184 EDTLKQ 189 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 30/213 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 ++LV+D+SGSM A T+MAA K A LD+ + Sbjct: 37 TAPQVDLVLDVSGSMRTAD--------------IDGGTRMAAAKRAFNEVLDATPEEVQL 82 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + + D T A K L+ Sbjct: 83 GIRTLGANYPGDNQ--------------KTGCKDTAQLYPVSTLDRTEA-KTQVATLSPT 127 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K I+ ++DGE+ + + AK + I T+ Sbjct: 128 GWTPIGPALLKSAGDFTDSASSKRIVLISDGEDTCAPLDPCEVAREIGAKGIGLTIDTLG 187 Query: 369 INASPNGQRLLK-TCVSSPEYHYNVVNADSLIH 400 + + ++ L ++ + +V +AD L Sbjct: 188 LVPNTKMRKQLSCIAEATGGTYTSVEHADELTD 220 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 66/186 (35%), Gaps = 32/186 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +KN + D + L +G+I Y+ +VE+ + + K +++ V Sbjct: 813 FQIIKNFVKTLTDRVALDLAT---TRVGIINYSHKVEEVAHLTQFSSKDDLKRAVDNMQY 869 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ A ++ + K +K + +TDG+ + + Sbjct: 870 L--GEGTYTATALHAANRMFEAAK-----------------PGVKKVALVITDGQ-TDTR 909 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSL 398 N ++ A + ++I I + + + + + P+ + Y + +L Sbjct: 910 DEKNLTEVVKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDDFITL 969 Query: 399 IHVFQN 404 + Sbjct: 970 EDTLKQ 975 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 59/156 (37%), Gaps = 25/156 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ V D+ + + +++ V+ + S + +Q V Sbjct: 63 FDKQKDFVGSLSDKIFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQRVKS 122 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 I + T S A+ A ++L + ++ K + +TDG ++ Sbjct: 123 MN--FIGQGTFSYYAIANATRLLKREGRKDG----------------MKVAVLMTDGIDH 164 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + I + A+ I +TI ++A N +L Sbjct: 165 PKNPD--VQSISEDARTAGILFITIGLSAVVNETKL 198 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 62/220 (28%), Gaps = 16/220 (7%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + N +V + + + I L++D SGSM P Sbjct: 2710 DGKGAIEGTFGNDVLVGDLGGAADPVVKPAENYNIALILDRSGSMA-----------DDP 2758 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +++A LK+A+ F+ + + + ++ + + + + + + Sbjct: 2759 DGSGGYGSRLALLKDAVNAFIGKLGTHT-GQINIALISFSSSASLLLSGTLAQIQTALAA 2817 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 M T+ AM+QA + + + P+ Sbjct: 2818 PNNVLMALTASGATNYEAAMQQANAWFGGVEVNGYNNLAYFLTDGDPTTYNGDNS---NS 2874 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 G NF + ++ I I N L Sbjct: 2875 GSTVNFNDVNRALDDATTLMAR-AEVHAIGIGTGVNSNVL 2913 >gi|149916833|ref|ZP_01905335.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] gi|149822550|gb|EDM81939.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] Length = 447 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/288 (10%), Positives = 63/288 (21%), Gaps = 68/288 (23%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 V + L + Sbjct: 137 ATLEPVPPNVMMVLDKSGSM---------------------------------------- 156 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-----EDVYMGLIGY 259 N+ D N + + + N F DSI+ +++ +++Y Sbjct: 157 -FTNTWDHDNNGGTPQITRWNSLYDVVDNITTTFDDSINFGANLFPSTLAQNIYGPQACT 215 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 T+ + + ++ + + T +T + AY LTS Sbjct: 216 TSNFPEVTVGENNSAQILATIPGPGVTASYGGTPATLGVTTAYNHLTSLDPELPRAMILV 275 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI-----------S 368 L D + I A N + + Sbjct: 276 TDGAANCDQNAANNFQLFD------VYDDGLPVIVGTAAANGVPTYVVGIDIINQTINDG 329 Query: 369 INASPNGQR----LLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVH 411 I PNG L + + +N + L ++ + Sbjct: 330 IGGDPNGINPTVVLNEVAAAGGTGSFFNTEDQAELEAALTDVVASVQT 377 >gi|326671057|ref|XP_003199352.1| PREDICTED: integrin alpha-2-like [Danio rerio] Length = 1181 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 57/184 (30%), Gaps = 23/184 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--T 292 + + ++ L+ + + ++ Y + K + + + T Sbjct: 188 VDFLVKLLENLNIGPQSTQVSVMQYGVDTTFQFYLNSYKTK-ESMIKAASNMQQKQGLET 246 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ A+ A K ++ +TDGE+++ I Sbjct: 247 NTFKAIDFART-------------NAFLPQNGGRPGATKVMVVVTDGESHDANIRNKVIA 293 Query: 353 ICD--KAKENFIKIVTISINASPNGQRLLKTCVS-----SPEYHYNVVNADSLIHVFQNI 405 CD I ++ I + +L+ S + ++ +NV +LI + + Sbjct: 294 ECDSQNITRFGIAVLGYYIRNDIDTSKLIAEIKSIASKPTEKFFFNVSEEAALIEIVGTL 353 Query: 406 SQLM 409 + Sbjct: 354 GDRI 357 >gi|296124235|ref|YP_003632013.1| hypothetical protein Plim_4003 [Planctomyces limnophilus DSM 3776] gi|296016575|gb|ADG69814.1| protein of unknown function DUF1355 [Planctomyces limnophilus DSM 3776] Length = 1023 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 45/163 (27%), Gaps = 29/163 (17%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y + E + K + + I M+ L + Sbjct: 497 AYDYNDGEKWIFELTPA-GKYEELSLLINSAEIGDMPSFQQTMQMGIDGLEASDAS---- 551 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SP 373 K +I ++DG+ + ++ I + + I T+++ Sbjct: 552 --------------SKHMIIISDGDPSPASPDLLKRFI-----DAKVTISTVAVFPHGDV 592 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + + + +Y N + L +F + + S+ Sbjct: 593 DTPTMTSIAQITGGRYYKPTNPNQLPAIF---IKESKTLRRSM 632 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 46/123 (37%), Gaps = 7/123 (5%) Query: 292 TDSTPAMKQAYQILTSDKKRSF-FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ + A K + + + +K ++ ++DG +N Sbjct: 134 TNRPEDLTYAISHSPPTGKTALYDAVWKAREWVARGSRDKKVLVVVSDGGDNASTHT--L 191 Query: 351 IKICDKAKENFIKIVTISINASPNGQR----LLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +I + A ++ I++ TI I + + L + ++ + ++ + ++I+ Sbjct: 192 SEILEAANKSNIQVFTIGIFDPDDPDKNPGVLRQLARATGGEAFVPDELSEVVAICESIA 251 Query: 407 QLM 409 + + Sbjct: 252 KDI 254 >gi|54038464|gb|AAH84380.1| LOC495168 protein [Xenopus laevis] Length = 554 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 62/227 (27%), Gaps = 42/227 (18%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I V+D+SGSM KM +A+ L+ ++ Sbjct: 300 PTLPKNILFVIDVSGSMW--------------------GLKMKQTVDAMKSILEDLNSDD 339 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + I + E S +YV R T+ A+ +A IL Sbjct: 340 QFGIIDFNHNI-RCWKDELVYASSVEKGDASKYVQRI---QPNGGTNINDALLRAIFILK 395 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKI 364 + I+ ++DG+ K + ++ I + Sbjct: 396 EASNKGLLEQNSVS-----------LIVLVSDGDPTVGELKLPKIQKNVRTNIQD-DIAL 443 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 ++ I + L + + + A L + +S Sbjct: 444 HSLGIGFDVDYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVST 490 >gi|157278345|ref|NP_001098275.1| complement factor B [Oryzias latipes] gi|1777334|dbj|BAA12207.1| Bf/C2 [Oryzias latipes] Length = 754 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 51/199 (25%), Gaps = 28/199 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K A++ + I + + V +E G + + D Sbjct: 268 HFKRAKLAIITLIKKIAAFTVSPNYEILFFSADVYEVVSIVEFYEGKITLESAIKNLEDF 327 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + A + + Q + + TDG N S Sbjct: 328 Q-IGDKSTGTDVNAALKKFEEGM------AWIEQKTGDKFSEHRHVFLLFTDGAYNMGGS 380 Query: 347 NVNTIKICDKAKENFIKI---------------VTISINASPNGQRLLKTCVSSPE--YH 389 + T+ + K + + I A+ LL + ++ Sbjct: 381 PLPTLA---RIKNR-VYMSPTGDPGSRLDYLESYVFGIGANIFDDDLLPLTAGTEGELHY 436 Query: 390 YNVVNADSLIHVFQNISQL 408 + + +L F +I Sbjct: 437 FRLKKETNLAATFDDIIDE 455 >gi|304314705|ref|YP_003849852.1| cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] gi|302588164|gb|ADL58539.1| predicted cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] Length = 663 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 52/168 (30%), Gaps = 23/168 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + ++ + +D + + I + R V + T Sbjct: 498 VKGLIERFIEDAQRHKDRISVVGFRGRDAKVIIPSTARASSFRDTVDSI---RVGGTTPM 554 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF---KSNVNTI 351 +K+ +IL +K+ + F++ L+DG N + Sbjct: 555 AQGIKRGLEILREEKR---------------HSEYVPFMVILSDGMPNVGVERNPKREAV 599 Query: 352 KICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHYNVVNADS 397 + + +E I I+ G ++ ++S +Y++ + + Sbjct: 600 EAAARLREEDIPSAVINFEQGSRGGRDLNMEIALASGGSYYDLHDLED 647 >gi|293357059|ref|XP_345157.4| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1269 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 62/193 (32%), Gaps = 30/193 (15%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A+ FL+ L + + LI Y + K V ++ Sbjct: 274 YPWEAVKNFLEKFVQGLDIGPKKTQVALIQYANDPRVVFNLTTYKNK-EDMVQATSETRQ 332 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A K ++ +TDGE+++ Sbjct: 333 YGGDLTNTFKAIQFARD-------------IAYLPESGGRPGATKVMVVVTDGESHDGSK 379 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNAD 396 I+ C+ ++ I I++ N L +K S+P Y +NV + Sbjct: 380 LQTVIQQCN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVADEA 436 Query: 397 SLIHVFQNISQLM 409 +L+ + + + Sbjct: 437 ALLEKAGTLGEHI 449 >gi|187956243|gb|AAI50691.1| Integrin alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQHIVHM 382 >gi|226823202|ref|NP_032425.2| integrin alpha-E isoform 1 [Mus musculus] gi|56206400|emb|CAI24788.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQHIVHM 382 >gi|118346233|ref|XP_977011.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila] gi|89288362|gb|EAR86350.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila SB210] Length = 603 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/223 (9%), Positives = 52/223 (23%), Gaps = 49/223 (21%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-YVTR 282 K+ K +++ + ++ LI + I + +Q Sbjct: 92 VNQKIQNAKKSVIELIQNLTPYD------RFCLIPFGGSNGVAIPFTDSNSINKQETFEI 145 Query: 283 DMDSLILKPTDSTPAMKQAYQILTS-------------------------------DKKR 311 + + TD ++ A + + Sbjct: 146 IQNIVCKGKTDIVSVIQTAINTIKQEQIHQNTIKQEFEKTTKKSLQQSINSSLTRVSRNI 205 Query: 312 SFFTNFFRQGVKIPSLPFQKF--------IIFLTDGENNNFKSNVNTIKICDKAKEN--F 361 + K + L+DGE+N ++ I + K Sbjct: 206 NVDELELLNMSKKHQNKNSSNQIVDRTYCFVLLSDGEDNIHQNYALQR-IRECIKNETLN 264 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I +G L + +Y + +S+ + Sbjct: 265 YSINCFGFGIEHDGNLLSSIAQLTGGQYYYIKENESIYDYLKE 307 >gi|118399120|ref|XP_001031886.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila] gi|89286221|gb|EAR84223.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila SB210] Length = 994 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 61/201 (30%), Gaps = 29/201 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T + K + + + + L+ + T ++ + ++Q Sbjct: 53 NSGSMDGTNIQLTKQLCNELVQFVIKTQPHSK---ISLMTFNTSIDHVENL--HLKSLKQ 107 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T + L ++ N I++LTD Sbjct: 108 VEQFISNINANGGTIFHI----TFDKLRDICQKFTNQNEEL------------VIVYLTD 151 Query: 339 GENNNFKSNVNTIK----ICDKAKE--NFIKIVTISINASPNGQRLLKTCV--SSPEYHY 390 G+ + + + N + K+ N +++ + + S + L K ++ + Sbjct: 152 GQVQSGQDSTNLKDSFIFLQQVLKKFVNNVEVHALGMGTSHDPVILDKIISLQTTQSTYQ 211 Query: 391 NVVNADSLIHVFQNISQLMVH 411 + + + F+NI ++ Sbjct: 212 FIKESSEIEGAFKNIVDIIGQ 232 >gi|27370456|ref|NP_766532.1| integrin alpha-E isoform 2 [Mus musculus] gi|26334103|dbj|BAC30769.1| unnamed protein product [Mus musculus] gi|56206399|emb|CAI24787.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KN + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKNFISTMMRNFYEKC---FECNFALVQYGAVIQTEFDLQESRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 +N + + K + I + + N R LK S P+ + + V N +L Sbjct: 310 GDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q +VH Sbjct: 370 GLLSKLQQHIVHM 382 >gi|313230976|emb|CBY18974.1| unnamed protein product [Oikopleura dioica] Length = 522 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 50/197 (25%), Gaps = 35/197 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKV 276 + K +A+K+ L D + VK + G +V + P+ T + Sbjct: 289 TSGSQVGDKQSAVKDFFKDLLKEFDTQTAVKISITDIGDGREGQVNTVLGPTQFVDTSDI 348 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T + + + + +I + Sbjct: 349 NSALNSLT--WYGTTTAIADGITEGASQMD------------------TTDNVNDVMIVI 388 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN----GQRLLKTCVSSPEYHYNV 392 TDG + + S A I I+I N G L Sbjct: 389 TDGFDGDLSSLQTASAAVATA---GIT--AIAIGYDENGGIIGSTLEDIANGVSSNVIEA 443 Query: 393 VNADSLI----HVFQNI 405 + L VF +I Sbjct: 444 TSTSELDGLALSVFNSI 460 >gi|23008710|ref|ZP_00050041.1| hypothetical protein Magn03003732 [Magnetospirillum magnetotacticum MS-1] Length = 167 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 15/152 (9%), Positives = 41/152 (26%), Gaps = 15/152 (9%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA----- 60 R R K +++ L+++ + IG + K +++A +AA Sbjct: 10 RLRGRAKALHRDSGGALNVVIGLTLIPIVGFIGLGVDYGMATAGKTRLDNAADAAALAAV 69 Query: 61 -------ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + V+ L+ + +++ A + + + + Sbjct: 70 VSAKAYIVANAKQSNVTTLALTEGQNQAVKAFNINAGKLLYGTVSLATPQVTRNRQTLSS 129 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 T + + +N Sbjct: 130 SIT---YTATIQTLFGRIFGSQATTFTNTVTA 158 >gi|13475416|ref|NP_106980.1| hypothetical protein mll6478 [Mesorhizobium loti MAFF303099] gi|14026168|dbj|BAB52766.1| mll6478 [Mesorhizobium loti MAFF303099] Length = 500 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/357 (9%), Positives = 88/357 (24%), Gaps = 20/357 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + ++ I+ ++++ + + I + + N ++ A +A LAGA+ Sbjct: 1 MLRTIRAFWHDQRGIALILVSVTLPALIGFSLLAIDMSRVNNLHNDLQKAADAFALAGAA 60 Query: 67 KMV------SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ + R + + + SA Sbjct: 61 ELDGLPGSWARAERAMATLVTNQAAFSTVGTNGRFTLTSGQPGGTARCNSAGN--ISWCF 118 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + T +++ ++ +I + V + + Sbjct: 119 LKTIPASDATPVSSANYANATQSIGEDETVFVEVTVA--PTGFAAIFPASFLTGNAASNS 176 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + V D + C D + Q A + + Sbjct: 177 FNV-AAKATAGFTSGVCDYTPVFMCNPYEDTSITGGVTLEQ------AAQTRQYRRRQIV 229 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S+ + + K + ++ Y +D+ + Sbjct: 230 LRGDGSYFPGNFAFLSSPFGNGANKLEQMLADSKPQNCYSRDGVDTEPGQNAGPVENGIN 289 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + S R+G K S + + + E + K C A Sbjct: 290 SRFGIDSSNYSDGPAVNVRKGAKNGSQFVKSNKV---NYETDPTKGVGLERDSCQIA 343 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/267 (12%), Positives = 78/267 (29%), Gaps = 43/267 (16%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + P E + + VD+SGS + + N+ Sbjct: 577 NSPSTSPTLRPVVPETLDPTARPTCPTEEDF-DLCIAVDMSGS----VCNSGFFGNNCVG 631 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSH------------VKEDVYMGLIGYTTRVEKNI 267 C +L + + + ++ ++ + + E Sbjct: 632 CSPFVF--CQSLFVSQETCCANFGDVQQFARLMVYNLSQFGDKNTSFSVVSFASDAEILS 689 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S +K + + + T+ A+ ++ Sbjct: 690 GLSSA-DKTINVLDQLI--YSGGSTNHGQAI---------------NACQESLFTSDQNI 731 Query: 328 PFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLKTCV 383 +KFI+ +TDG ++ + I + AK + I I+ I I+ + + + Sbjct: 732 NRKKFIMLITDGVSATDDLNPEADAIDAAETAKSSGITIIPIFISPYNDIDAVSFMSSLS 791 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMV 410 + E + V N DSL + + + + Sbjct: 792 NDGEVY--VTNFDSLDSLKDQLVEQVS 816 >gi|312068041|ref|XP_003137027.1| hypothetical protein LOAG_01440 [Loa loa] gi|307767808|gb|EFO27042.1| hypothetical protein LOAG_01440 [Loa loa] Length = 808 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 79/248 (31%), Gaps = 32/248 (12%) Query: 164 LLNQRYNQKIVSFIPALLRIEM--GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 +L + I L + + + + +D + ++ + Q Sbjct: 1 MLPTYGTVVSLILITPTLITAIDYEDNDLAKVCRPLDRQLDLLFILDGSGSVSGNTFATQ 60 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K+ +D++ ++ + ++ Y++ S K + Sbjct: 61 MAMLNKI-------------VDMIEIGPKNTQIAVMQYSSYTRVEFNFSANPNKESLRAS 107 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 I T + A+ +A + D G ++ + + +TDG + Sbjct: 108 LQKIRHISGTTKTGKALDKALHVFRHDSSF---------GARLNQDDVAQVAVVVTDGHS 158 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ I + ++ ++I+T+ I A N L++ N+ + SL Sbjct: 159 HDD-----PIPAAVRLRQAGVQILTLGIGAHINMGELMEITGDENLAFQNLTSQASLDQF 213 Query: 402 ---FQNIS 406 F+ I+ Sbjct: 214 VYQFKKIA 221 >gi|299532826|ref|ZP_07046213.1| membrane protein-like protein [Comamonas testosteroni S44] gi|298719050|gb|EFI60020.1| membrane protein-like protein [Comamonas testosteroni S44] Length = 628 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 57/202 (28%), Gaps = 3/202 (1%) Query: 8 RFYFKKGIASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + I + + + + AL +++ + L I + + ++K ++ + A LAGA Sbjct: 6 TGLSRTRIRRQSGSVATLGALWLMIAVICLA--TIDIGNVFWQKRELQKIADLAALAGAQ 63 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + N + + + Y + + V +R Sbjct: 64 GETPAACQANAARIATLNGMTGVPQVECGNWTPSPGVADTRTYFVNGASPLNASCVTVAR 123 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + ++ + S I L ++K P + + G Sbjct: 124 TVPYLFLFSVTAANGRQVKAVATAARSRPLAQLRIRSTLLVIDDKKSAILNPIIGGLLGG 183 Query: 187 ERPIFLIELVVDLSGSMHCAMN 208 I + + + Sbjct: 184 NLNIQAVGWNGLANAHVSLLDF 205 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 62/224 (27%), Gaps = 45/224 (20%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 NQ +Q+ V+ + + + L++D SGSM Sbjct: 16 ANQSNSQRQVAISLSAVSESSDRSLPLNLGLILDHSGSM--------------------S 55 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 M +K A ++ + + ++ + R + + +++ Sbjct: 56 GKPMKTVKEAASYLVEGLGPDD------RLSVVAFDHRAKVIVPNQP-VDEIDGVKDAIA 108 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T MK + + K+ I LTDGE N Sbjct: 109 SLKAEGGTSIDEGMKLGIKQVALGKEDRVSQ-----------------IFLLTDGE-NEH 150 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 N +K+ A E I + T+ N L S Sbjct: 151 GDNERCLKLAQVAGEYNITLNTLGFGNHWNQDVLESIADSVGGT 194 >gi|301780608|ref|XP_002925750.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Ailuropoda melanoleuca] Length = 1307 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 56/188 (29%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L ++ V +GL Y+ + + + K Sbjct: 188 SHFQQVKDFLASVIE---PFEIGPNKVQVGLTQYSGDPQTEWDLNAFRTKEVVLAAVYSL 244 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + P K +I +TDG++ + Sbjct: 245 RYKGGNTFTGLALTH-------------VREQNLKPGAGPRPEAAKVVILVTDGKSQDD- 290 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K+ + + + + + + L+ S P +NV + L + Sbjct: 291 ----ARAAGRILKDLGVAVFAVGVKNADEAE--LQLLASPPLDITIHNVQDFPQLSTLAD 344 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 345 LLSRLICQ 352 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 71/239 (29%), Gaps = 15/239 (6%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 ++ +I + S + + S ++ N ++ + + Sbjct: 394 VDGESNAATIIPTLEGNYISKNKSDFSKKKNTNPIYRDKNYDMDAFQKRQFKHFTNVTYE 453 Query: 243 DLLSHVKEDVYMGLIGY-----TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 L + L Y V+ + + V Q ++ D + ++ + + Sbjct: 454 HRLDNSFNFKEPELRKYYQEGNACEVKIIKPTNTIYDSVTQELSLDYEHIVTQQNGKEES 513 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +++AY + +K S + K +N S+ + K Sbjct: 514 IQEAY----ASQKGSGAWLHNNWSAATGNTYVSKVGKLYNTPDNYLKNSSPQSRKAIKDM 569 Query: 358 KENFIKIVTISINASPNGQRLLKTC-----VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++ I+ + S + L + + + + L +FQ I + + + Sbjct: 570 VTKK-RVYIIAFSRSVSTSGLNAIADEFSIKGDSKRVFRANSGEDLSKIFQEIGESINN 627 >gi|46205147|ref|ZP_00048963.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Magnetospirillum magnetotacticum MS-1] Length = 125 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 14/88 (15%) Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN------------ASPNGQRLL 379 ++ L+DG NN + + A+E +++ TI++ + + L Sbjct: 1 MVVLLSDGANNAGQ--TAPKDVAQLARELGVRLYTIALGPIDMADNPDNEQDVVDVETLR 58 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQ 407 S + V D L +V I + Sbjct: 59 AMAEVSGGKAFRVKTTDDLENVANAIDE 86 >gi|296134298|ref|YP_003641545.1| Protein of unknown function DUF2134, membrane [Thermincola sp. JR] gi|296032876|gb|ADG83644.1| Protein of unknown function DUF2134, membrane [Thermincola potens JR] Length = 314 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 41/146 (28%), Gaps = 12/146 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + E+ + +I A++ FL + + + + K+N + Sbjct: 1 MKHLISRLLKKEEGSVIVIVAIAFTVFLGFVAIVTDIGLLYVKQNK------------VA 48 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 V + G + + A+ + + + + + + + + Sbjct: 49 HAVDAAALAGAQELPDNPANAVAIAKSYAQLNGLDPLQVTATVAPDNKSITVTSNDTVNL 108 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMT 152 + + S + T Sbjct: 109 VFARLFGTDFSTVSASAKARVAPART 134 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 57/208 (27%), Gaps = 30/208 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGT 273 + + A +D+ S MGL+ +++R + T Sbjct: 321 SGSMYGSPINNAIQAAKTLVDATAEGS-----TAMGLVSFSSRSSVKQDFAVQQIPKPDT 375 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + T A L + + + + Sbjct: 376 GIKQALKAAIDNIYASGSTALFDGSSLALDNLITYQ-------------TAAASGAPGVV 422 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 L DG++N+ N T+ + + I + ++ L+ ++ +++ Sbjct: 423 FVLADGDDNSSIKNEATVITAYQ--NANVPIFSFGYGSASPTGPLVTMANATGGKYFSSP 480 Query: 394 NA-DSLIHVF---QNISQLMVHRKYSVI 417 +I F I+ + S + Sbjct: 481 TTLSEIIDAFLQANAIATDNQNLVSSTV 508 >gi|292618048|ref|XP_699485.3| PREDICTED: integrin alpha-1-like [Danio rerio] Length = 1201 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 56/176 (31%), Gaps = 30/176 (17%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDK 309 +G++ Y V S + T + + A + + + Sbjct: 200 PARVGIVSYGDDVGHVFNLSQFSNTKELVKNAADIRQRTGHKTMTALGIDTARKEAFTVE 259 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +K ++ +TDGE+++ + + I C E+ I+ +++ Sbjct: 260 RG-------------ARPGVKKVMVIVTDGESHDHHNLKSVIDQCQ---EDGIERFAVAV 303 Query: 370 NASPNGQR-----------LLKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVHR 412 N Q ++ S ++ +NV + +L+ + + + Sbjct: 304 LGDYNRQNKSIDEIKKFIEEIEYIASETKSDHFFNVSDERALVTIVDTLGSKIFAL 359 >gi|219125320|ref|XP_002182931.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405725|gb|EEC45667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 523 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 22/214 (10%), Positives = 55/214 (25%), Gaps = 35/214 (16%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-Q 278 K+ K L + L + GLI + + +++ + Sbjct: 77 SGSMTGNKLKLCKKTLTMLLRVLQTQD------RFGLISFGSDARVEFPAQAMSKQNKAS 130 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT- 337 + + T+ + A+ A Q L +K + + F + + ++ Sbjct: 131 ALQKIQSLTTRGCTNMSAALGLAVQELKIIEKSNPVRSLFFLTDGLANEGISDLDGLVSL 190 Query: 338 --------------------------DGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 D + + + + I + T Sbjct: 191 TRNCLLPSDNPSNVLNSEVMIAECLDDLATSQHQITRLPVAEIESVCRAPITLHTFGYGR 250 Query: 372 SPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQN 404 N L ++ +Y + + ++ F N Sbjct: 251 DHNAALLESLADTTQGGAYYFIEDDSNVGSAFGN 284 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/307 (9%), Positives = 78/307 (25%), Gaps = 29/307 (9%) Query: 120 NIVNSSRISMTHMANN-RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 +VN+ + N+ + + ++ + Sbjct: 97 ALVNTENNFSNPDPTKHTKKNYTNSEETFNTSWEQMKTLVDNMGASGYTPATRRYYEVVA 156 Query: 179 ALLRIEMG---ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + + ++ ++ D + S + S N + N Sbjct: 157 SAVMPNIKYRCQKSYVVMMSDGDANFSCNHDSTSFHYRFNYDGNNGRRNS-------NYP 209 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +L+ L K + + ++ G + Sbjct: 210 KAYLNYEADLYRTKPFITAYNYFGESPMKYFDPYELGNTCRDKRENVHYSKF-------- 261 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + FF + + + + G + + N++ + Sbjct: 262 ------WDTANRYRGVEGGMAFFSRKLAVKDIKTAAVDRVDAAGVSWDGDPNIDPKGV-- 313 Query: 356 KAKENFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++ T+ P G++ L S +++N D L F+ I + Sbjct: 314 DYSKQTVQTFTVGFGNGISPAGEQYLIRGASHDGWYFNAAKPDDLYKAFEKIISQISDDN 373 Query: 414 YSVILKG 420 S+ +G Sbjct: 374 ASIPFEG 380 >gi|310791552|gb|EFQ27079.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 1039 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/251 (12%), Positives = 75/251 (29%), Gaps = 28/251 (11%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 VS + + S +M +++ + V PI + K+ ++ Sbjct: 463 RVSSVASSWGGPKSVTTAPTEYTNFQRSPTMPTSIHVPIDVVVVVPISSSMQGVKINLVR 522 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +AL ++++ MGL+ + + T K + + S+ Sbjct: 523 DALKFMVNTLGERD------RMGLVTFGSGGGGV-PIVGMTTKAWPGWSNVLSSIKPVGQ 575 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 S A + + K + I+ ++D ++ S + Sbjct: 576 KSHRA---------DVVEGANVAMDLLMQRKYNNPIAT--IMLISDASTSDADSVDFVVS 624 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH-------VFQNI 405 + AK I I + + + +++ + + V + L Q + Sbjct: 625 RAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKDWMMLRECLAGCLGAMQTL 681 Query: 406 SQLMVHRKYSV 416 S V K + Sbjct: 682 SHQNVKLKLKL 692 >gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 78/250 (31%), Gaps = 44/250 (17%) Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 N + ++ + + + + ++S+ E + + Sbjct: 192 PNDESITIETQIKDKDNNIAIWSDGYIAISTFTYFETKVHSNSEFYFIIDCSGSMSGSCI 251 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN---IEPSW-GTEKVRQYVTRDM 284 K L +F+ S+ +I + + E + + + + + Sbjct: 252 QNAKLCLNIFMHSL------PIGCRFSIIKFGSDYEVALHPCDYTDENVSEAMKQLNNID 305 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + TD +K LT + K + LTDG+ Sbjct: 306 AEM--GGTDILSPLKY-VMELTPKQGFI------------------KQVFLLTDGQ---- 340 Query: 345 KSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + NT ++C A+EN +I +I I + + ++ S + V + +S Sbjct: 341 --DSNTNELCALAQENRTNNRIFSIGIGSGADKDLIINVSQKSGGNYVFVDDDES----- 393 Query: 403 QNISQLMVHR 412 + +++ ++ Sbjct: 394 EKLNEKVIEL 403 >gi|148655604|ref|YP_001275809.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567714|gb|ABQ89859.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 425 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/258 (13%), Positives = 68/258 (26%), Gaps = 46/258 (17%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCA 206 M + L + V + + + LV+D S SM Sbjct: 1 MTPGINLQQTLSRTMLAVGDEPQLVYVLLEAHAEGVAQQLPKLPLNLCLVLDRSSSMR-- 58 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ++ +K A +D + + L+ + R + Sbjct: 59 ------------------GERLMQVKEAAARIVDQLGPDDY------FSLVVFNDRADVV 94 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 I +K T+ + A Q F +G+ Sbjct: 95 IPAQRAIKK-SDLKAAIAQIEAAGGTEMAQGLALALQ--------EVQRPFLTRGISR-- 143 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +I LTDG +S ++I + + I + + I N L S Sbjct: 144 ------LILLTDGRTYGDES--RCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASEN 195 Query: 387 EYHYNVVNADSLIHVFQN 404 + A ++ VF + Sbjct: 196 SRAQYIATAQDVVKVFAD 213 >gi|302755028|ref|XP_002960938.1| hypothetical protein SELMODRAFT_402429 [Selaginella moellendorffii] gi|300171877|gb|EFJ38477.1| hypothetical protein SELMODRAFT_402429 [Selaginella moellendorffii] Length = 2174 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 71/233 (30%), Gaps = 36/233 (15%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V+D SGSM P +++ + A++ F+ + Sbjct: 1975 DAANHVVFVIDKSGSMGSPDAK--------PRMASFPPSRLGCVYEAMVRFIRTRQAA-- 2024 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +M ++ + + + V + T + ++ A +IL Sbjct: 2025 -GIRDFMSVVLFESSAVIAM---EKVAMAESQVYALLGHDSSGGTVYSAGLQLAEEIL-- 2078 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIV 365 + IIFL+DG N + K K E + + Sbjct: 2079 --------------GRSAGDAKAPAIIFLSDGGNAGGSD---PVAFVRKIKSVEPRLVVH 2121 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQLMVHRKYSVI 417 TI + +++L + V + L F+ +++ + S++ Sbjct: 2122 TIVFGSDLLPRKVLVDMAREGGGVFQVSLDELQLGRSFEALAKSLHPTVASLV 2174 >gi|297559736|ref|YP_003678710.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296844184|gb|ADH66204.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 547 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 59/192 (30%), Gaps = 27/192 (14%) Query: 225 RTKMAALKNALLLFLDS-----IDLLSHVKEDVYMGLIGYTTRVEK----NIEPSWGTEK 275 ++A L++AL + +E + L+ ++T +EP E Sbjct: 368 GGRLAELQSALGALTGADGGSLARSTQAFQEREVVTLLPFSTWPADPRTFVVEPGSVDEV 427 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 T + A+ +AY++L SD ++ Sbjct: 428 NADLSAAVEGLEAEGDTAAYDALVRAYELLESDTGSDGDPLMS--------------VVL 473 Query: 336 LTDGENNNFKSNVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 +TDGE N + E + + T+ S + + + + ++ Sbjct: 474 MTDGEVNRGVGLEGFRESLAARSEPVARVPVFTVLFGES-DVPEMTELAELTGGRVFDAR 532 Query: 394 NADSLIHVFQNI 405 L VF+ I Sbjct: 533 -EQDLEQVFREI 543 >gi|225468694|ref|XP_002269894.1| PREDICTED: hypothetical protein, partial [Vitis vinifera] Length = 585 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 51/172 (29%), Gaps = 22/172 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ +K A+ L + S+ + ++ ++ ++ + T R+ Sbjct: 194 GGGMTGAKLQMMKRAMRLVISSLSSTD------RLSIVAFSASSKRLMPLKRMTTTGRRS 247 Query: 280 VTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R + + + T + A+K+A ++L ++R+ + K Sbjct: 248 ARRIIESLIAGQGTSAGEALKKASKVLEDRRERNPVASIMLLSDGQNERVSSKS------ 301 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKTCVSS 385 N N + A I + + + CV Sbjct: 302 ---TNPNRPSNVVSSTRYAH-LEIPVHAFGFGENGAYGAEPAEDAFAKCVGG 349 >gi|225438159|ref|XP_002262605.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 830 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 51/172 (29%), Gaps = 22/172 (12%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ +K A+ L + S+ + ++ ++ ++ + T R+ Sbjct: 439 GGGMTGAKLQMMKRAMRLVISSLSSTD------RLSIVAFSASSKRLMPLKRMTTTGRRS 492 Query: 280 VTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 R + + + T + A+K+A ++L ++R+ + K Sbjct: 493 ARRIIESLIAGQGTSAGEALKKASKVLEDRRERNPVASIMLLSDGQNERVSSKS------ 546 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKTCVSS 385 N N + A I + + + CV Sbjct: 547 ---TNPNRPSNVVSSTRYAH-LEIPVHAFGFGENGAYGAEPAEDAFAKCVGG 594 >gi|297698645|ref|XP_002826428.1| PREDICTED: integrin alpha-X-like, partial [Pongo abelii] Length = 836 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 61/202 (30%), Gaps = 31/202 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S I D T M ++ + F + L+ ++ + + + Sbjct: 157 DGSGSISSDNFATMMNFVRAVISQF---------QRPSTQFSLMQFSNKFQTHFTFEKFR 207 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 D + T + A++ L K + Sbjct: 208 SSSNPLSLLDSVHQLRGLTHTATAIQNVVHRLFHASYG-------------ARRDAAKIL 254 Query: 334 IFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSPE-- 387 I +TDG+ + + + I + D A I I + ++ N + L S P Sbjct: 255 IVITDGKKEGDRLDYKDVIPMADAA---GIIRYAIGVGSAFEYINSWKELNDIASKPSQE 311 Query: 388 YHYNVVNADSLIHVFQNISQLM 409 + + V + ++L + + + + Sbjct: 312 HIFKVEDFEALKDIQNQLKEKI 333 >gi|226326039|ref|ZP_03801557.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] gi|225205581|gb|EEG87935.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] Length = 823 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/374 (9%), Positives = 107/374 (28%), Gaps = 35/374 (9%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L + + L +++ ++ + D I ++ + + T Sbjct: 302 TLHHVDEKGNELPTTLSSDATLNYNSDISFSDYV--EIIDNYDYKYAKVDKISGKTVTSA 359 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + N S + + + + + + Sbjct: 360 VTNGSSGKRAKLTLYNGKDNKIETLSYSTTTKNIYLVYTA-KSTGSTGSGTIEDTAPQLS 418 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA--PICQDKKRTKMAALKNALLLF 238 + + ++ +L +++S + D+ K K A+ Sbjct: 419 HQKYIKKKDGNNYDLTLNVSSKKGTTSSKKKFDIVLIMDTSTSMSNNNKWRNSKTAVNKL 478 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D++ + V + L+ + T + + G E V+ ++ + T+ + Sbjct: 479 IDTLSSQTTVDVNYR--LVTFGTTAQIQTNWTTG-ETVKSTLSNYSIKE-DQGTNYEDGL 534 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + L+S + P ++ + G + +++ + + K K Sbjct: 535 VKTKEALSSGTRADAEKIIVFLTDGQP-TFYKSGTSYAGPGSSTSYRVYQHALDAAAKLK 593 Query: 359 ENFIKIVTISIN-----------------ASPNGQRLLKTC-----VSSPEYHYNVV-NA 395 + +I+ + I + LL+ ++ + +N + Sbjct: 594 CD--RIIPVGIGLKSVEKYSYDSYSDQQKVPDSATVLLQNVANKVNATTKDTVFNSDSSG 651 Query: 396 DSLIHVFQNISQLM 409 +L F I+ + Sbjct: 652 SNLSTKFSEIAADI 665 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 62/190 (32%), Gaps = 28/190 (14%) Query: 231 LKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DS 286 +K+ F+D + + + G + Y+ V + + R + Sbjct: 66 VKDFTNQFIDKLTQRYYRCDRNLVWNAGALHYSDEVVLIKSLTPMPSGQSELKNRVSAIN 125 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 I K T + A+K+ + L K++I +TDG Sbjct: 126 YIGKGTYTDCAIKRGIEELLIGG---------------SHHKENKYLIVVTDGHPLEGYK 170 Query: 347 NVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVN---ADSLI 399 ++AK IK+ +++I + + + L + ++ + + Sbjct: 171 EPCGGLDDAANEAKLLGIKVFSVAI-SPNHLDQRLNIIATDHAYRRNFTATSLKPTREI- 228 Query: 400 HVFQNISQLM 409 V + I+ ++ Sbjct: 229 DVEETINTII 238 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 17/173 (9%), Positives = 51/173 (29%), Gaps = 23/173 (13%) Query: 207 MNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSI-DLLSHVKEDVYMGLIGYT--TR 262 + P D+ +K K + + E V + ++ Y+ + Sbjct: 822 TFTGPADITLLVDSSTSVGSKNFETTKKFVKQLSGRFLEASKPTDESVRISVVQYSGRNQ 881 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + + + + + + + TD A++ Sbjct: 882 QKVEAQFQYNYTVIAKAIDNME--FMNDATDVNSALQF----------------ITELYR 923 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + +K ++ +DG + + + A++ I+I +++ + N Sbjct: 924 RSARAAAKKRVLVFSDGHSQGITARAIERAV-QDAQKAGIEIYVLAVGSQANE 975 >gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa] gi|3024050|sp|O02668|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa] Length = 935 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 299 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRAEDQFSLV 338 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + + Y+ + T+ A+ +A IL Sbjct: 339 DFNHNI-RTWRNDLVSATKTQVADAKTYIEKI---QPSGGTNINEALLRAIFILNEANN- 393 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ + + + ++ + + ++ I Sbjct: 394 ----------LGLLDPNSVSLIILVSDGDPTVGELQLSKIQKNVKQNIQD-NVSLFSLGI 442 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + A L + +S Sbjct: 443 GFDVDYDFLKRLSNDNRGMAQRIYGNQDTASQLKKFYNQVST 484 >gi|156350127|ref|XP_001622153.1| hypothetical protein NEMVEDRAFT_v1g828 [Nematostella vectensis] gi|156208600|gb|EDO30053.1| predicted protein [Nematostella vectensis] Length = 339 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 48/168 (28%), Gaps = 16/168 (9%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 D + + +E +G++ Y+ I+ + + + + + +D + T Sbjct: 21 NFIKDIVSKQNIGRELTRVGVVVYSHEAIVAIKLTDYSST--EDLNKAIDGIDYYGKTTR 78 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A F + ++ LTDG Sbjct: 79 IDKGLMTA---------DRIDQVFTVAAGS--RPYLPRVLVLLTDGRQTRAPGYKPLKTA 127 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 K+ +K + + + + + L + + V + D L Sbjct: 128 VCPLKKKQVKTLAVGVGRGIDARELRELVDHDDDMV-MVSDFDDLRTA 174 >gi|17567601|ref|NP_508552.1| hypothetical protein F40F4.6 [Caenorhabditis elegans] gi|1065514|gb|AAA81430.1| Hypothetical protein F40F4.6 [Caenorhabditis elegans] Length = 2214 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 66/224 (29%), Gaps = 16/224 (7%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + G + N D ++ ++ + + + + Sbjct: 1990 VPGTQPTQGPVATTQNPYTSAQFDVVFMIDGSQSAQSSFDSLTKFVQTFMVSFNVGQSGA 2049 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GLI + I P+ + + ++ L TD A QIL + Sbjct: 2050 RVGLIVVGGDITNPIPPAANLNSLSSQAMLNSNLAQLSGGYTDFED----AGQILNYTLQ 2105 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---KENFIKIVTI 367 +F S +I+LT + + K+ I TI Sbjct: 2106 IVSSPDFMAANNGYRSGISNHVLIYLT----TTTAFDTDPTPAAQTILAQKQYGII--TI 2159 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + L+T + + SL + + I QL+++ Sbjct: 2160 GYGGATDNNK-LQTISGGSACSFTAPDFASLNNQIKTIQQLILN 2202 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K F+D++ + + G + Y+ VE + G ++++ V Sbjct: 60 DKVKAFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 119 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 120 VK--YFGKGTYTDCAIKKGLEELLVGGSHLKEN---------------KYLIVVTDGHPL 162 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 163 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGQ 220 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 221 SRDAEEIISQTI 232 >gi|319782171|ref|YP_004141647.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168059|gb|ADV11597.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 509 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 3/53 (5%), Positives = 19/53 (35%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 + ++ I+ ++++ + + I + + N ++ ++ Sbjct: 1 MLGTIRAFWNDQRGIALILVSITLPAIIGFSLLAIDMSRINNLHNDLQKGADS 53 >gi|291229678|ref|XP_002734799.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1003 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 62/191 (32%), Gaps = 28/191 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 ++ ++ L A+ ++ ++ L +E +G++ ++T E Sbjct: 293 TSGSMEENGRIDKLHQAVSNYI--LNTLDDGEE---VGVVTFSTTATIQSHLVLINNESR 347 Query: 277 RQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + ++R + T + +A+ +L ++ + I+ Sbjct: 348 TELLSRVPSMQSVGRWTSIGSGLLKAFDVLEEGERNA----------------AGGVIVV 391 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV-- 393 ++DGE N + I + E + + +I I + + + Y Sbjct: 392 ISDGEENRDPLIADIIPMVL---EKGVTVDSIGIGTDASTNLEVLPAATDGMTFYYSEDS 448 Query: 394 NADSLIHVFQN 404 N++ L Sbjct: 449 NSNGLNEALAA 459 >gi|260823577|ref|XP_002606157.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] gi|229291496|gb|EEN62167.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] Length = 550 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 57/177 (32%), Gaps = 16/177 (9%) Query: 236 LLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TD 293 F+++I + + +G++ Y+ V +E + T Sbjct: 231 KDFINTIIQSFTIGPDLTRVGVVQYSFSVNLELELKDHLSAAPLIQAVEEIEYDEGLFTH 290 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + G + K I LTDGE+ ++ Sbjct: 291 TGEALQYITTHSFA----------VENGAREVRKGVPKVSILLTDGESGDYHDVTV---W 337 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +A+E I + + + + + Q L+T Y V L + ++ + + Sbjct: 338 SQRAQEAGITVFPVGVTNNVDQQE-LETMAGHAGKVYRVGWFRDLEKILTDMEESLC 393 >gi|296119163|ref|ZP_06837733.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] gi|295967789|gb|EFG81044.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] Length = 585 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 46/135 (34%), Gaps = 30/135 (22%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D T A++QA + L ++ + + I+ ++DG + Sbjct: 38 EVDDIEASGYTPMGSALRQAAEELPAEGE--------------------RSIVLVSDGID 77 Query: 342 NNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSL 398 V ++ ++ + I I T+ + L+ + + + +A+SL Sbjct: 78 TCAPPPVC--EVAEELHDQGIDLIINTVGFLVDDEARSELECIADAGGGQYLDADDAESL 135 Query: 399 IHVFQNISQLMVHRK 413 + ++H + Sbjct: 136 AESMK-----ILHSR 145 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/251 (13%), Positives = 73/251 (29%), Gaps = 42/251 (16%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + A+ R + +VDLS SM + Sbjct: 427 SAPSGALAFPDRNVVAAVAEAFPEVRKDAQVLFLVDLSASMGYDNTAG------------ 474 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 +T++ ++A+ LD +GL G+TT + V Sbjct: 475 --QTRLEGAQHAITAALDHFTAGD------RVGLAGFTTTDGTITP------GLVAPVAD 520 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN---FFRQGVKIPSLPFQKFIIFLTDG 339 D+ A+ + L + F +Q + I+ L+DG Sbjct: 521 IADNR--------GALVEGVNSLEPVAHTPLYQAVADFAQQQAAMWDPDRINAIVLLSDG 572 Query: 340 ENNNFKSNVN-TIKICDKAK----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 N + E + + T+ + + + L ++ ++Y+ + Sbjct: 573 VNATGDIETIGQDDMIHVLHGLHAETPVLVFTLGYSPDADVETLQAISSATGAHYYDATD 632 Query: 395 ADSLIHVFQNI 405 + V ++ Sbjct: 633 PTEVEAVLGDL 643 >gi|297562484|ref|YP_003681458.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296846932|gb|ADH68952.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 505 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 17/175 (9%), Positives = 50/175 (28%), Gaps = 29/175 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTR 282 ++ ALL ++ + + GL+ + + + P +VR+ ++ Sbjct: 54 GGRLDGAVRALLSLVERLAPSDN------FGLVSFNDQARVEVPCGPLEDKARVRRLISG 107 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 TD + + + Q + ++ ++DG N Sbjct: 108 L---HASGGTDLSSGLLRGVQE-----------------ARRAGADRGGTLLLISDGHAN 147 Query: 343 NFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 ++ ++ A + + ++ + + L + D Sbjct: 148 QGVTDHDLLRQVAADAYAHGVTTTSLGYGLGYDEELLGAVADGGAGSALFAEDPD 202 >gi|115623666|ref|XP_789748.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960627|ref|XP_001186460.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 846 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 53/201 (26%), Gaps = 41/201 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + V+D+SGSM KM K A LD + + Sbjct: 333 PKTRKNVVFVIDVSGSMR--------------------GRKMDQTKRAFTTILDDVRPID 372 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + + + +V T+ + A +L Sbjct: 373 RINIVLFESNVRVWRSNQMVEATGDNIAAAKNHVNDIS---AGGGTNLYDGLTNAVDLLM 429 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI--CDKAKENFIKI 364 II LTDG+ + + I + + + Sbjct: 430 EHGNGE----------------AMPLIIMLTDGQPTSGSVTSTSEIIKRITNLIDGRLSL 473 Query: 365 VTISINASPNGQRLLKTCVSS 385 ++ + L K +S+ Sbjct: 474 FSVGFGNGVDFSFLEKLSLSN 494 >gi|53713709|ref|YP_099701.1| hypothetical protein BF2418 [Bacteroides fragilis YCH46] gi|253565657|ref|ZP_04843112.1| BatB [Bacteroides sp. 3_2_5] gi|265764033|ref|ZP_06092601.1| BatB [Bacteroides sp. 2_1_16] gi|4838139|gb|AAD30859.1|AF116251_2 BatB [Bacteroides fragilis] gi|52216574|dbj|BAD49167.1| conserved hypothetical protein BatB [Bacteroides fragilis YCH46] gi|251945936|gb|EES86343.1| BatB [Bacteroides sp. 3_2_5] gi|263256641|gb|EEZ27987.1| BatB [Bacteroides sp. 2_1_16] gi|301163418|emb|CBW22969.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 341 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 66/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGAAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + + + I++ + + Sbjct: 192 AIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + V N++S I+Q + Sbjct: 248 NVIVTRLNEAMCQEIAKEGNGIYVRVDNSNSAQKA---INQEI 287 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 72/191 (37%), Gaps = 15/191 (7%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRD-- 283 +K L L +D++ + + ++ Y++ V++++ KV + + R Sbjct: 236 FEHVKRTLGLMIDNLCDG-ISPDTNRVAMLRYSSDVKEDLNFIEGSNEPKVMRNIQRLKY 294 Query: 284 --MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T + AM +A + + + + + G+K+ + ++ +TDGE+ Sbjct: 295 KPITDDRHGSTYTAHAMDKALKTIFTSEAG-WRNGTTEDGIKVRTE-----VVIITDGES 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N+ K E IK+ + + + + E + +++ L Sbjct: 349 NDPDETFTIQGQKVKYDEYGIKVYALGVG-DIKKDEIRQLTSMDDESIFYLMSWKDLA-A 406 Query: 402 FQNISQLMVHR 412 F I + ++ Sbjct: 407 FNRIIETLIET 417 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 61/220 (27%), Gaps = 18/220 (8%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I+ V S + + L D + + + Sbjct: 226 IDNDVGKSLCDAAIHGCRTAYHDLLFLLDGSSSISPNDFSTTLTWMRDIAEQFTVGSQFT 285 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ Y + + + + + I + A+ Q + + + S Sbjct: 286 RVGMMQYGDEPHTEFDLNTFQNGSQVFEAISNVTQIGGESGPYAAILQVLRRSLTAQYGS 345 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + IIF+TDG + + + I ++ + + + TI + Sbjct: 346 -------------RENVSQIIIFVTDGGVVD--DSEESQTILNELRFSGALVYTIGVGRM 390 Query: 373 PNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMV 410 + L+ S P + + + L I + Sbjct: 391 V-SRPQLRMIASRPASHHVTTIASYSELSATKSQIIDRIC 429 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 82/338 (24%), Gaps = 33/338 (9%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 + F SL+ A+ + ++ + + N F Sbjct: 828 IQFDNSTNINFNLLQHNSSLTLVQAIADLSPEYDVAGEPPCLLGSALTEVREQLLNNNFG 887 Query: 147 NMDVMTSYDYRLQFIEHLLNQR--------YNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + + + + S + + I Sbjct: 888 PRPNVPKVVLVITASNSMDDLTYPATELMHDGTMLFSICVGIKVNRTQIQKIVTAPYSSF 947 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 H E S IC +M A+ ++ +D +++ L+ Sbjct: 948 FYSVSHFRELPKIEKSLSRGICAAIAPECRMRAVD--VIFLIDGSSSVTNADFTAVKLLM 1005 Query: 258 GYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFT 315 T + + M T + A+K S ++ Sbjct: 1006 VNLTDSFDIGPEGARDFDNREDLRESIMSLEHCTGNTFTAKAIKFVTNYAFSTERG---- 1061 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 K + +TDG ++++ ++A IKI I Sbjct: 1062 ---------GRPNVLKVALLITDGRAQDYRNI---RSEAEQAHTEGIKIF--GIGVGEAQ 1107 Query: 376 QRLLKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVH 411 L+ S + ++ V N + + + + Sbjct: 1108 LSELEDMSSLPTSQHTMFVQNYQDIEKLRSRLVHRICQ 1145 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 23/208 (11%), Positives = 58/208 (27%), Gaps = 30/208 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP------SWGT 273 + + A +D+ S MGL+ +++R + T Sbjct: 321 SGSMYGSPINNAIQAAKTLVDATAEGS-----TAMGLVSFSSRSSVKQDFAVQQIPKPDT 375 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + T A L + + + + Sbjct: 376 GIKQALKAAIDNIYASGSTALFDGSSLALDNLITYQ-------------TAAASGAPGVV 422 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 L DG++N+ N +T+ + + I + ++ L+ ++ +++ Sbjct: 423 FVLADGDDNSSIKNESTVITAYQ--NANVPIFSFGYGSASPTGPLVTMANATGGKYFSSP 480 Query: 394 NA-DSLIHVF---QNISQLMVHRKYSVI 417 +I F I+ + S + Sbjct: 481 TTLSEIIDAFLQANAIATDNQNLVSSTV 508 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTEDQFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQITDAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + + I+ ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGMLNPDSVSLIVLVSDGDPTVGELKLSKIQKNVKQNIQD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLKKFYNQVST 495 >gi|47219204|emb|CAG11222.1| unnamed protein product [Tetraodon nigroviridis] Length = 4421 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 42/385 (10%), Positives = 107/385 (27%), Gaps = 45/385 (11%) Query: 51 NSMESANNAAI-------------LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRF 97 + ++ + +AA+ L + R + E++ A Sbjct: 40 SVVQYSRDAAVHFYLNTYTTKREILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSR 99 Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + + L + + V+S + + + S+ Sbjct: 100 RTEGVPQVLILLTGG----RSSDSVDSPASDLKQLGVLIFAIGSRGSDNREIQRISHSPT 155 Query: 158 --LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-SGSMHCAMNSDPEDV 214 L E Q+++S + ++ + E P + VVD+ + A+++ +D+ Sbjct: 156 SALVVPEFTDLPSVQQQLLSSVRDVVDVG-PEAPTPAVRDVVDVGPEAPTPAVDTSKKDI 214 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 D R A+++ + ++ + + ++ Y+ E + Sbjct: 215 VFLLDGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYS 270 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + ++ + + F + +I Sbjct: 271 TREDIIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLI 321 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 L G + + K+ + I I + + L+ P Y V Sbjct: 322 LLNGGRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTE 374 Query: 395 ADSLIHVFQNISQLMVHRKYSVILK 419 L +I + + +V+LK Sbjct: 375 FTDLP----SIQEQLSSVMSTVLLK 395 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/267 (10%), Positives = 75/267 (28%), Gaps = 33/267 (12%) Query: 162 EHLLNQRYNQKIVSFIPALLRI--------EMGERPIFLIELVVDL-SGSMHCAMNSDPE 212 + + +S P + + ++ + + VVD+ + A+++ + Sbjct: 1912 GSRGSDNREIQRISHSPTSALVVPEFTDLPSVQQQLLSSVRDVVDVGPEAPTPAVDTSKK 1971 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D+ D R A+++ + ++ + + ++ Y+ E + Sbjct: 1972 DIVFLLDGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NT 2027 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + ++ + + F + Sbjct: 2028 YSTREDIIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQM 2078 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +I L G + + K+ + I I + + L+ P Y V Sbjct: 2079 LILLNGGRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAV 2131 Query: 393 VNADSLIHVFQNISQLMVHRKYSVILK 419 L +I + + +V+LK Sbjct: 2132 TEFTDLP----SIQEQLSSVMSTVLLK 2154 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/384 (9%), Positives = 99/384 (25%), Gaps = 58/384 (15%) Query: 51 NSMESANNAAI-------------LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRF 97 + ++ + +AA+ L + R + E++ A Sbjct: 452 SVVQYSRDAAVHFYLNTYTTKREILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSR 511 Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + + L + + V+S + + + S+ Sbjct: 512 RTEGVPQVLILLTGG----RSSDSVDSPASDLKPLGVLIFAIGSRGSDNREIQRISHSPT 567 Query: 158 LQFIEHLLNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDL-SGSMHCAMNSDPEDVN 215 +P + ++ + + VVD+ A+++ +D+ Sbjct: 568 ---------------SALVVPEFTDLPSVQQQLLSSVRDVVDVGPEVPTPAVDTSKKDIV 612 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 D R A+++ + ++ + + ++ Y+ E + Sbjct: 613 FLLDGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYST 668 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + ++ + + F + +I Sbjct: 669 REDIIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLIL 719 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 L G + + K+ + I I + + L+ P Y V Sbjct: 720 LNGGRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEF 772 Query: 396 DSLIHVFQNISQLMVHRKYSVILK 419 L +I + + +V+LK Sbjct: 773 TDLP----SIQEQLSSVMSTVLLK 792 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 52/201 (25%), Gaps = 24/201 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+++ + ++ + + ++ Y+ E + Sbjct: 2687 DGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYSTRED 2742 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ + + F + +I L Sbjct: 2743 IIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQSVPQMLILLNG 2793 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + K+ + I I + + L+ P Y V L Sbjct: 2794 GRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 2846 Query: 399 IHVFQNISQLMVHRKYSVILK 419 +I + + +V+LK Sbjct: 2847 P----SIQEQLSSVMSTVLLK 2863 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 52/202 (25%), Gaps = 24/202 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+++ + ++ + + ++ Y+ E + Sbjct: 3201 DGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYSTRED 3256 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ + + F + +I L Sbjct: 3257 IIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLILLNG 3307 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + K+ + I I + + L+ P Y V L Sbjct: 3308 GRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 3360 Query: 399 IHVFQNISQLMVHRKYSVILKG 420 +I + + +V+LK Sbjct: 3361 P----SIQEQLSSVMSTVLLKA 3378 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 52/201 (25%), Gaps = 24/201 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+++ + ++ + + ++ Y+ E + Sbjct: 829 DGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYSTRED 884 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ + + F + +I L Sbjct: 885 IIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLILLNG 935 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + K+ + I I + + L+ P Y V L Sbjct: 936 GRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 988 Query: 399 IHVFQNISQLMVHRKYSVILK 419 +I + + +V+LK Sbjct: 989 P----SIQEQLSSVMSTVLLK 1005 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 52/201 (25%), Gaps = 24/201 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+++ + ++ + + ++ Y+ E + Sbjct: 1042 DGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYSTRED 1097 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ + + F + +I L Sbjct: 1098 IIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLILLNG 1148 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + K+ + I I + + L+ P Y V L Sbjct: 1149 GRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 1201 Query: 399 IHVFQNISQLMVHRKYSVILK 419 +I + + +V+LK Sbjct: 1202 P----SIQEQLSSVMSTVLLK 1218 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 52/201 (25%), Gaps = 24/201 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+++ + ++ + + ++ Y+ E + Sbjct: 1255 DGSDGTRNGFPAMRDFVERVVEKLS---VGPNKDRVSVVQYSRDAEVHFNL-NTYSTRED 1310 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ + + F + +I L Sbjct: 1311 IIDSVRGLRHRGG----APVRTGAAL-----QYVRDNVFTNSSGSRRLQGVPQMLILLNG 1361 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G + + K+ + I I + + L+ P Y V L Sbjct: 1362 GRSYDSVDTP-----ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 1414 Query: 399 IHVFQNISQLMVHRKYSVILK 419 +I + + +V+LK Sbjct: 1415 P----SIQEQLSSVMSTVLLK 1431 >gi|194335402|ref|YP_002017196.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307879|gb|ACF42579.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 343 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 43/143 (30%), Gaps = 15/143 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSL 287 L + L I +S M ++ + + + + D + Sbjct: 105 DVLPDRLGQAKQEITSISRAVTGGRMSILLFAASPLVQCPLTTDRDAFDALLGMASPDLI 164 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T A + A ++L + + + K ++ L+DGE++ Sbjct: 165 EEQGTSFRAAFELAGRLLEPTLEDRMASGVKGE----------KIVVLLSDGEDHTG--- 211 Query: 348 VNTIKICDKAKENFIKIVTISIN 370 + K+ + + I + Sbjct: 212 -EVRSAVQQLKKANVHLFVIGVG 233 >gi|260793444|ref|XP_002591722.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] gi|229276931|gb|EEN47733.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] Length = 987 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 61/160 (38%), Gaps = 22/160 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + ++G++ + + ++ + TE+ RQ + + + T + +A ++L+ Sbjct: 343 PDGSHVGMVKFHQWATRLLDLTEIATEEDRQEIADAVPNEASGGTCIGCGLTEALEVLSM 402 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + P +I L+DG+ + S T+ + + ++ Sbjct: 403 NGA----------------DPAGGIVIILSDGDESLNVSPNLTVAT-QHLVAAGVTVHSV 445 Query: 368 SINASPNGQRLLKTCVSSPEYHYN---VVNADSLIHVFQN 404 + ++S + + + + S+ + N++SL + Sbjct: 446 TYSSSADTR-MEEVAASTHGRAFFYSGAANSNSLEEGLRE 484 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 68/236 (28%), Gaps = 51/236 (21%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC-QDK 223 + ++S P + V+D+SGSM + + + V+ A + Sbjct: 280 PGKDRGTFMLSITPGDDLPPITT--GSDWVFVLDISGSMSAKLATLGDGVSQALGKLRGG 337 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 R ++ + + +RQY + Sbjct: 338 DRFRIVLFDDRAEELTSGFVDATPNN--------------------------IRQYTKKV 371 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 M T+ + A L +D+ I+ +TDG N Sbjct: 372 MQLQSRGGTNLFGGLSLALNPLDADRPTG--------------------IVLVTDGVANV 411 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 K D + + +++ T + S N L +S + +V N+D + Sbjct: 412 GK--TQQKDFIDLLENHDVRLFTFVMGNSANRPMLTAMTNASNGFAISVSNSDDIA 465 >gi|123469529|ref|XP_001317976.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3] gi|121900723|gb|EAY05753.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3] Length = 394 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 59/188 (31%), Gaps = 16/188 (8%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + + +M +++ + F++ + + + + L Y + S + Sbjct: 70 TNGLEMRLFNQNRMKLIQDQTIDFINDFFVQNPLSQ--LSILATYESTCRILTPLSCNVQ 127 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + ++ A IL + +K K ++ Sbjct: 128 DHVKKIKSLSLEDHGGEPSLESSLSIATAILGNGEKNPGLAQIST----------TKEVL 177 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + S + K + +++ IK+ I + A L K + ++ V+ Sbjct: 178 IVYGSLTTCDNSPIY--KTLNLVRDSKIKVSIIGLGAKVFV--LEKIAAETKGEYFVPVS 233 Query: 395 ADSLIHVF 402 + L +F Sbjct: 234 IEHLHDIF 241 >gi|73980138|ref|XP_540147.2| PREDICTED: similar to vitrin isoform 1 [Canis familiaris] Length = 649 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 25/283 (8%), Positives = 66/283 (23%), Gaps = 30/283 (10%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + + + M ++S+ ++ S L Sbjct: 151 PTAKTGEATKAYQKPSVPGTTAQPVTLMQVTGATASEASHTSLPKPSPSAVSTTSSL-RS 209 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + ++ + + S + + S+ DP Sbjct: 210 QPMGHRSWELDLWSTSSDASGQNSPRANAGFVPKEELSTQSLEPVSQGDPNCKIDLSFLI 269 Query: 222 DKKR----TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEK 275 D + K D L MG++ Y + Sbjct: 270 DGSSGIGKRRFRIQKQ---FLADVAQTLDIGPAGPLMGVVQYGDNPAAQFNLRTHMNSRD 326 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 ++ + + +++ A+ + S + + Sbjct: 327 LKTAIEKITQR--GGLSNAGRAISYVTKNFFSKANGNRGGA-------------PNVAVV 371 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + DG + + A+E+ I I I+I + ++ Sbjct: 372 IVDGWPTD-----KVEEASRFARESGINIFFITIEGATENEKQ 409 >gi|88811039|ref|ZP_01126295.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] gi|88791578|gb|EAR22689.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] Length = 930 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 35/276 (12%), Positives = 68/276 (24%), Gaps = 50/276 (18%) Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-LLRI 183 + + ++ Y + + V + + I Sbjct: 335 PTGGFGTPQGASVTVTADSNLPTAAARIWLSYTSTNAGDTASGSVTVRCVQTGQSWTINI 394 Query: 184 EMGE--RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 RP + LV+D SGSM+ TK+ L+ A +F++ Sbjct: 395 NANTIARPRSAVSLVIDRSGSMNDD--------------AGDGITKVQKLREAANVFINI 440 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-----GTEKVRQYVTRD--MDSLILKPTDS 294 + +GL+ + ++ +E + G + D T Sbjct: 441 MLPGD------GIGLVRFNDTAQRLMEITDVGASPGGAGRTDALNHIAGSDIDPSGATSI 494 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + +L + ++ LTDG N + Sbjct: 495 GDGIVNGRNMLNDAQAAPMPDYDVTA------------MVVLTDGMWN------RPPSLA 536 Query: 355 DKAKENFIKIVTISINASPNGQ--RLLKTCVSSPEY 388 D A + + N L C Y Sbjct: 537 DVAGSINANTYAVGLGIPSNISVPALTTLCQGHAGY 572 >gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] Length = 947 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 66/223 (29%), Gaps = 42/223 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNQNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTIS 368 + + II ++DG+ K + KI KEN I + ++ Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGKCELKLSKIQKNVKENIQDNISLFSLG 454 Query: 369 INASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + + L + + + + L + +S Sbjct: 455 MGFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 497 >gi|153812898|ref|ZP_01965566.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] gi|149830976|gb|EDM86066.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] Length = 838 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 12/154 (7%), Positives = 32/154 (20%), Gaps = 4/154 (2%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + S+ L ++ L+ G ++ + + K + A + + A ++ Sbjct: 1 MRFFCGKNRGAISVFLTLILVPVLIFSGIIVDISRLYAAKTVISGAGDLTMNAALARYDK 60 Query: 71 NLSRLGDRFESISN----HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 L + K L + Sbjct: 61 QLKDSYGLITMADDPSSPSMKTYLEQSFLESCNASALKDTKSTDLHSMIQLELGTEGVEV 120 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + + I M Sbjct: 121 QGVKNSSLADTQVLQQQILEYMKFRAPVYMVSDI 154 >gi|119505574|ref|ZP_01627646.1| transporter [marine gamma proteobacterium HTCC2080] gi|119458683|gb|EAW39786.1| transporter [marine gamma proteobacterium HTCC2080] Length = 554 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 27/251 (10%), Positives = 67/251 (26%), Gaps = 43/251 (17%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ----DKKRTKMAALKN 233 P + R+ + L+ L+ + P ++AP+ + + Sbjct: 55 PPIARLNPNSVLVVLMLLLTLVMAGPSWQQQPSPLAEDTAPLVIVIDVSESMATTDIAPS 114 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 LL + I L D +G+I Y + + E ++ + Sbjct: 115 RLLRGIQKIRDLLKRVPDKRIGVIAYAGSAHTVLPLTSDHEITESFLAVMTPGIA----- 169 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + + ++ ++ +TDG +S Sbjct: 170 -------------PRAGKFPEYALPGIDRLLAQSYYRSSVLMVTDG--LGARSPELLKSW 214 Query: 354 CDKAKENFIKIVTISINASP--------NGQRLLKTCVSSPEYHYNVV----NAD----S 397 C ++ +++ + L+ S +Y+ + D + Sbjct: 215 C---RDRDYQLIVYGFGDPSLSQSTVPLEREALMNLASSCNGKYYDATIDPGDVDAIAGA 271 Query: 398 LIHVFQNISQL 408 L F+ + Sbjct: 272 LTDAFKIVDDE 282 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 61/187 (32%), Gaps = 32/187 (17%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL- 287 A + + + + + +I ++++++ + V + + Sbjct: 150 ANFYSEIKFVKKLLSDFNVSYNYTRVAVITFSSQMKIFRHIDQISTSVEDNDKCLLLNYQ 209 Query: 288 ------ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A+K+A +I + + S +K I +TDG + Sbjct: 210 IPKIEFSGGGTYTYGALKEAEEIFQNARADS-----------------KKIIFLITDGFS 252 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N I + + K + I +I I + + + + + + + D Sbjct: 253 NGRD----PIPLAESLKRKNVVIYSIGIQSGNYAELYNMSSSPGDSHSFLLDSFDH---- 304 Query: 402 FQNISQL 408 F+ +++ Sbjct: 305 FETLARK 311 >gi|149053317|gb|EDM05134.1| rCG33209 [Rattus norvegicus] Length = 1169 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 61/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + K+ + + + + L+ Y ++ + + Sbjct: 210 PSDFQKAKDFISTMMRNFYEKC---FECNFALVQYGGVIQTEFDLLDSRDINASLAKVQS 266 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 267 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 311 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 + +N + +K + I + + N R LK S P+ + + V N +L Sbjct: 312 RDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNTYRELKLIASDPKAAHTFKVTNYSALD 371 Query: 400 HVFQNISQLMVHR 412 + + Q ++H Sbjct: 372 GLLSKLQQRIIHM 384 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 56/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + + K L+ Y+ + + + Sbjct: 166 FDRMKKFVSTVMSRFQ-----KSKTLFALMQYSDDFRTHFTFNDFKRNSDLELLVRPIGQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L + K +I +TDGE + Sbjct: 221 LFGRTHTATGIRKVVRELFHSSNGARNHAL-------------KIMIVITDGEK--YLDP 265 Query: 348 VNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + + +A I I + S ++ L T S P ++ + V N ++L + Sbjct: 266 LEYRDVIPEADRKGIIRYVIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|3236344|gb|AAC23663.1| integrin alpha E2 [Rattus norvegicus] Length = 1167 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 61/193 (31%), Gaps = 23/193 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + K+ + + + + L+ Y ++ + + Sbjct: 208 PSDFQKAKDFISTMMRNFYEKC---FECNFALVQYGGVIQTEFDLLDSRDINASLAKVQS 264 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + T + AM+ + + K ++ LTDG + F Sbjct: 265 IVQVKEVTKTASAMQHVLDNIF-------------IPSRGSRKKALKVMVVLTDG--DIF 309 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 + +N + +K + I + + N R LK S P+ + + V N +L Sbjct: 310 RDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNTYRELKLIASDPKAAHTFKVTNYSALD 369 Query: 400 HVFQNISQLMVHR 412 + + Q ++H Sbjct: 370 GLLSKLQQRIIHM 382 >gi|89068267|ref|ZP_01155677.1| von Willebrand factor type A domain protein [Oceanicola granulosus HTCC2516] gi|89046184|gb|EAR52242.1| von Willebrand factor type A domain protein [Oceanicola granulosus HTCC2516] Length = 1065 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 47/195 (24%), Gaps = 27/195 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + Q K+ + + L G + + Sbjct: 30 DGSGSMWGQIDGVNKIVIAREVVSEILGDF-PADQNLGLTVYGHRTRGDCTDIETVVAPA 88 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T Q V T T A+ A + L ++R+ Sbjct: 89 TGTAGQIVEVVEGINPRGKTPMTDAIIAAAEALRYTEERA-------------------T 129 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINA-SPNGQRLLKTCVS-SPEY 388 ++ ++DG N + ++ I I + P+ ++ + Sbjct: 130 VVLVSDGIETC---NPDPCAAARALEQAGIDFTAHVIGFDVTDPDALGQMQCIADETGGT 186 Query: 389 HYNVVNADSLIHVFQ 403 NA L Sbjct: 187 FTTAANAGELSEAMS 201 >gi|188586927|ref|YP_001918472.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] gi|179351614|gb|ACB85884.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] Length = 301 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 46/142 (32%), Gaps = 13/142 (9%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K + E+ + +I AL + + ++ V + +++ + + A + ++ Sbjct: 5 KLLKDEEGSSVVIVALMLTVLMSFTALVVDVGLMYAERSRL-------VSAAEASALAGA 57 Query: 73 S-----RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY-NTEIQNIVNSSR 126 S G+ E ++A++A A+ + L A E+ Sbjct: 58 SNFPYRDDGEYREEDYDYARQAAEKIAEENGLEDYQIRLLPEDAQQKEKIEVTASEEVEF 117 Query: 127 ISMTHMANNRLDSSNNTIFYNM 148 M D + ++ Sbjct: 118 SFARVMGFQEGDVTGQAAAKSV 139 >gi|303239192|ref|ZP_07325721.1| hypothetical protein AceceDRAFT_1069 [Acetivibrio cellulolyticus CD2] gi|302593237|gb|EFL62956.1| hypothetical protein AceceDRAFT_1069 [Acetivibrio cellulolyticus CD2] Length = 486 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/343 (9%), Positives = 86/343 (25%), Gaps = 36/343 (10%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 D R A V ++ + + + Y++ Sbjct: 21 DGNVRINCGSGVVGEGFIVATGNIIFNTGEVTATAGDDGGLVDKEKKTIAGVGLYSVGGN 80 Query: 152 TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + ++ I + + + ++ + + Sbjct: 81 VEFAV--------GGSEFSGLIYAPKGTVNLQTKDMTINGSVVGKEVITKTNGINITYKD 132 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 + + + + + K+ + F++S D +G++ Y N Sbjct: 133 TSAHETITETETENSHLITAKDVIGKFVNSFDSKD-----AKIGILTYQNDANTNTFDLL 187 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + + S + +++AY +L +S P+ Sbjct: 188 DVDGAKGLIGGITASKEDSN-NLGDGLRRAYYLLKDKGSKSSDKYIVVLSCADPNKCTLS 246 Query: 332 FIIFLTDGENNNFKSNVNTIKI-----CDKAKE----------NFIKIVTISINASPNGQ 376 + + + + + + AKE IK + + Sbjct: 247 GLGETGEYKTVAGNAVNILPALNVTKGTEYAKEILGKINEDTEANIKTHFVDFRFGEDLA 306 Query: 377 RLLKTCVSSPEYHYNVVNADS------LIHVFQNISQLMVHRK 413 ++ K S +YN + L +IS ++ K Sbjct: 307 KI-KEIAESGTNYYNPTTDNELEFPGKLEATMNSISTEILGEK 348 >gi|194211467|ref|XP_001916467.1| PREDICTED: similar to collagen, type VI, alpha 3 [Equus caballus] Length = 3165 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/354 (7%), Positives = 95/354 (26%), Gaps = 40/354 (11%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A+L ++ + +++ + A+ + A I++ + + L + +I Sbjct: 1497 AVLDAIRRLRFKGGSPLNTGKALEHVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDIS 1556 Query: 120 NIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 ++ L + I + E +KI++ Sbjct: 1557 RYSQV----ISSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEEKIITSF 1612 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + + + + + + L Sbjct: 1613 GPSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLRF 1660 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDST 295 + +D L + + +GL+ Y + ++ + + ++ Sbjct: 1661 VSEIVDTLYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKQEIIDAINKVVYKGGRHANTK 1719 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ +F + + +T G +V + Sbjct: 1720 VGIEH-----------LRLNHFVPEAGSRLDQRVPQIAFVITGG------KSVEDAQEAS 1762 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1763 ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1814 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 48/168 (28%), Gaps = 20/168 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ ++ LS + + +G++ Y+ Q + + Sbjct: 262 DFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSL-DTYSTKAQVLDAVKALGFTGGELAN 320 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + +F R G + ++ ++ G +++ + Sbjct: 321 VGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRDGVV----- 366 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ ++ + V + L Sbjct: 367 ALKQ--ASVFSFGLGAQAASKAELQQIATNDNMVFTVPEFRSFGDLQE 412 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 59/187 (31%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ + +D + + + +I ++ + + K Sbjct: 1247 FQYIRTLIERLVDYL---DVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDEVQ-NAVRRLR 1302 Query: 288 ILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + A++ + + S +F++ ++ G++N+ Sbjct: 1303 PKGGRQINIGGALEYVSKNIFKRPLGSRIEEG-----------VPQFLVLISSGKSNDEV 1351 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + K+ + TI+ + + + L+K SPEY ++V L + Q + Sbjct: 1352 DDS-----AAELKQFGVAPFTIA--RNADPEELVKIS-LSPEYVFSVSTFRELPSLEQKL 1403 Query: 406 SQLMVHR 412 + Sbjct: 1404 LTPITTL 1410 >gi|301785912|ref|XP_002928373.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Ailuropoda melanoleuca] Length = 1188 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 58/187 (31%), Gaps = 20/187 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + + L+ Y ++ + + + + + K Sbjct: 228 FQRAKDFISNMMRNFYEKCFQCSFALVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGK 287 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + AM+ + + S K ++ LTDG + F+ +N Sbjct: 288 VTKTASAMQHVLDNIFTPSHGS-------------RKNASKVMVVLTDG--DIFEDPLNL 332 Query: 351 IKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + K ++ I + + LK S P+ Y + V N +L + + Sbjct: 333 TTVISSPKMQGVERFAIGVGKAFENNKTYNELKLIASDPDDRYAFKVTNYTALDGLLSKL 392 Query: 406 SQLMVHR 412 Q ++ Sbjct: 393 QQNIIQM 399 >gi|301778757|ref|XP_002924796.1| PREDICTED: integrin alpha-M-like [Ailuropoda melanoleuca] Length = 1153 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 62/225 (27%), Gaps = 29/225 (12%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALKNALLLFLDSIDLLSHVK 249 + S + P + D + +K + +D Sbjct: 129 PNLWQQPQSFPEKIRECPRQDSDIAFLIDGSGSINPIDFQRMKEFVSTVMDRFK-----N 183 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 L+ ++ + + + + + T + + + + L Sbjct: 184 SKTLFSLMQFSEDFQTHFTFNEFKANPNPRFLVNAIIQLYGRTHTATGILKVVRELFHSS 243 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + K ++ +TDGE + + +A I I + Sbjct: 244 SGARENAL-------------KILVVITDGEKFGDPLD--YKDVIPEADREGIIRYVIGV 288 Query: 370 N---ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 +P + L T S P ++ + V N ++L + + + + Sbjct: 289 GEAFDNPKHREELNTIASKPARDHVFRVNNFEALKTIQNQLQEKI 333 >gi|281352695|gb|EFB28279.1| hypothetical protein PANDA_014199 [Ailuropoda melanoleuca] Length = 1110 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 62/225 (27%), Gaps = 29/225 (12%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT----KMAALKNALLLFLDSIDLLSHVK 249 + S + P + D + +K + +D Sbjct: 84 PNLWQQPQSFPEKIRECPRQDSDIAFLIDGSGSINPIDFQRMKEFVSTVMDRFK-----N 138 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 L+ ++ + + + + + T + + + + L Sbjct: 139 SKTLFSLMQFSEDFQTHFTFNEFKANPNPRFLVNAIIQLYGRTHTATGILKVVRELFHSS 198 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + K ++ +TDGE + + +A I I + Sbjct: 199 SGARENAL-------------KILVVITDGEKFGDPLD--YKDVIPEADREGIIRYVIGV 243 Query: 370 N---ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 +P + L T S P ++ + V N ++L + + + + Sbjct: 244 GEAFDNPKHREELNTIASKPARDHVFRVNNFEALKTIQNQLQEKI 288 >gi|281346139|gb|EFB21723.1| hypothetical protein PANDA_018300 [Ailuropoda melanoleuca] Length = 1151 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 58/187 (31%), Gaps = 20/187 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + + L+ Y ++ + + + + + K Sbjct: 194 FQRAKDFISNMMRNFYEKCFQCSFALVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGK 253 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + AM+ + + S K ++ LTDG + F+ +N Sbjct: 254 VTKTASAMQHVLDNIFTPSHGS-------------RKNASKVMVVLTDG--DIFEDPLNL 298 Query: 351 IKICDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + K ++ I + + LK S P+ Y + V N +L + + Sbjct: 299 TTVISSPKMQGVERFAIGVGKAFENNKTYNELKLIASDPDDRYAFKVTNYTALDGLLSKL 358 Query: 406 SQLMVHR 412 Q ++ Sbjct: 359 QQNIIQM 365 >gi|160858155|emb|CAP19997.1| collagen type VI alpha 5 precursor [Homo sapiens] Length = 591 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 57/189 (30%), Gaps = 23/189 (12%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVY 253 + +++ D++ + K +K +L + + V Sbjct: 424 YAEQRNLDKTGCVDTKEADIHFLIDGSSSIQKKQFEQIKRFMLEVTEMFS---IGPDKVR 480 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G++ Y+ E + + + + T + A+ QI+ + K Sbjct: 481 VGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRM 540 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++I LTDG + + ++ + + I + + I A Sbjct: 541 -------------SKVPCYLIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGA-A 581 Query: 374 NGQRLLKTC 382 N L + Sbjct: 582 NKIELQEIA 590 >gi|34481900|emb|CAE46498.1| trap [Plasmodium falciparum] Length = 327 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 69/200 (34%), Gaps = 22/200 (11%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ + + +D D+ C + +++A+ L + I L+ + Sbjct: 27 DVQNNIVDEIKYREEVCNDEVDLYLLMDC-SGSIRRHNWVRHAVPLAMKLIQQLNLNESA 85 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKK 310 +++ + ++ + + + S EK + +L T+ T A+ Q + L Sbjct: 86 IHLYVNIFSXIIRLHSDASKNKEKALIIIRSLLSTNLPYGRTNLTDALLQVRKHLNDR-- 143 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I + ++ LTDG ++ + + +K K + +KI I Sbjct: 144 -------------INRENASQLVVILTDGIPDSIQDS---LKESRKLNDRGVKIAVFGIG 187 Query: 371 ASPNG--QRLLKTCVSSPEY 388 N R L C S Sbjct: 188 QGINVAFNRFLVGCHPSDGK 207 >gi|73949158|ref|XP_535195.2| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Canis familiaris] Length = 946 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRAEDQFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATRTQVTDAKKYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGLLDPESVSLIILVSDGDPTVGELKLSKIQKNVKQHIRD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 29/259 (11%), Positives = 67/259 (25%), Gaps = 53/259 (20%) Query: 162 EHLLNQR-YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + +++V + RP + +VD SGSM Sbjct: 284 GVMDSPWAEGRQLVHIGLQGALPPVEARPPLNLVFLVDTSGSMDAPD------------- 330 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K+ LK + L L + + ++ Y + + P+ + + Sbjct: 331 ------KLGLLKQSFRLMLSELRPQD------EIAIVAYAGSAGEVLAPTPA-GERATIL 377 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T+ ++QAY + + I+ TDG+ Sbjct: 378 AALERLAAGGSTNGAGGLEQAYATAEAMTEDGEV----------------SRILLATDGD 421 Query: 341 NNNFKSN-VNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVN---- 394 N S+ +++ + + + + + + Sbjct: 422 FNVGLSDPSALEDFIADKRDSGTYLSVLGFGRGNLDDATMQALAQNGNGTAAYIDTLHEA 481 Query: 395 ----ADSLIHVFQNISQLM 409 D L I+ + Sbjct: 482 QKVLVDQLTGALFPIADDV 500 >gi|255537858|ref|XP_002509994.1| protein binding protein, putative [Ricinus communis] gi|223550695|gb|EEF52181.1| protein binding protein, putative [Ricinus communis] Length = 767 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 48/145 (33%), Gaps = 17/145 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ LK A+ L + S+ + ++ +++ ++ + T ++ R +D Sbjct: 374 KLQMLKRAMRLVISSLGSAD------RLSIVAFSSVPKRLLPLRRMTAHGQRAARRIIDR 427 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L+ T A+++A ++L ++R+ + + + N Sbjct: 428 LVCGQGTSVGDALRKATKVLEDRRERNPVASIMLLSDGQDERVQTSSVNQRHTSGHINST 487 Query: 346 SNVNTIKICDKAKENFIKIVTISIN 370 + I + + Sbjct: 488 RFAHIE----------IPVHSFGFG 502 >gi|225621507|ref|YP_002722766.1| hypothetical protein BHWA1_02609 [Brachyspira hyodysenteriae WA1] gi|225216328|gb|ACN85062.1| putative membrane protein containing von Willebrand factor (vWA) type A domain [Brachyspira hyodysenteriae WA1] Length = 324 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 63/217 (29%), Gaps = 50/217 (23%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D +++ K + F+ D LS + L+G+ + Sbjct: 74 DLSRSMDADDVWPSRLERAKLEIEKFVKKTDNLS-------VALVGFAGTSFIASPFTQD 126 Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + S+ L+ T A+ A +K Sbjct: 127 METFTYILDNLTTKSVTLQGTRIADALVTAKNTFN------------------VDAVSKK 168 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGE++ + + E + + T+ + Sbjct: 169 SIILITDGEDHGGYFDDIL----KQLNEMNVSVYTVGVGTEVGATISTDLGVREKSVVSK 224 Query: 374 -NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L S+ Y N L +F ++ Q M Sbjct: 225 RDDNTLKLIADSTHGKSYIAENVS-LESIFDDMKQSM 260 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 41/129 (31%), Gaps = 6/129 (4%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R R + S++ + L ++ + + +++ + A L Sbjct: 16 ARVRGKSASQCRDDGGQISLLIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGAALTA 75 Query: 65 ASKMVSNLSRLGDRFE------SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A + + + E + A + + R ++ +LS +A T+ Sbjct: 76 AQHVSLAGAYTTELAEWLPLSDAEVYAAVADYVGEPGRAPQSCRSGTLSITAATLDATDR 135 Query: 119 QNIVNSSRI 127 V+ S Sbjct: 136 TVSVSLSCT 144 >gi|327277464|ref|XP_003223484.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like, partial [Anolis carolinensis] Length = 750 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 50/169 (29%), Gaps = 25/169 (14%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G I ++ K + T++ A+K Sbjct: 79 DALDISPEKVKVGAIEFSNTAYLEFSLDAYFTKHQIKDKLKRIVFKGGRTETGLALKYIL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + K +I LTDG++ + + KE I Sbjct: 139 RKGFHGSR---------------NSTVPKILIILTDGKSQGNIA-----APAKQLKEMGI 178 Query: 363 KIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADS-LIHVFQNISQL 408 + + ++ + L S P + + D + + +++ Sbjct: 179 TVFVVGVSFPRWEE--LHILASDPTEWHLLFAEDTDDAVNGFYTTLTRS 225 >gi|118390982|ref|XP_001028302.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila] gi|89281272|gb|EAR80639.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila SB210] Length = 202 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 9/47 (19%), Positives = 14/47 (29%) Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 K +E I + +G + K Y V D + F Sbjct: 18 KLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFF 64 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 61/194 (31%), Gaps = 29/194 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM- 284 A +K+ F+D + + G + Y+ V + + + Sbjct: 60 AQVKDFTNRFIDKLTERYFRCDRFLAWNAGALHYSDSVVIIKDLTAMPSGRAELKNSVSA 119 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + I K T + A+KQ + L K++I +TDG Sbjct: 120 INYIGKGTHTDCAIKQGIERLLVGGSHLKEN---------------KYLIVVTDGHPLEG 164 Query: 345 KSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD---- 396 ++AK IK+ +++I + + + L + ++ + Sbjct: 165 YKEPCGGLDDAANEAKHLGIKVFSVAI-SPHHLDQRLNIIATDHAYRRNFTATSLKPTRD 223 Query: 397 -SLIHVFQNISQLM 409 + NI +++ Sbjct: 224 LDVEETINNIIEMI 237 >gi|332227200|ref|XP_003262779.1| PREDICTED: vitrin isoform 3 [Nomascus leucogenys] Length = 658 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 33/343 (9%), Positives = 92/343 (26%), Gaps = 41/343 (11%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 +++ + ++ + A K + + + G +A V + + T + S+ Sbjct: 149 ALTYSSSKSPVAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVATPTTILPRPFPSA 208 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 +T + + + S + + + Sbjct: 209 ASTTSIPRPEPVGHR-----SQEMDLWSTTTYTSSQNRPGADPGIQRQDPSGAAFQKPVG 263 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 + D + K+R ++ L D L MG++ Y Sbjct: 264 VDVSLDCKIDLSFLIDGSTSIGKRRFRIQK-----QLLADVAQALDIGPAGPLMGVVQYG 318 Query: 261 TRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + ++ + + ++ A+ + S + Sbjct: 319 DNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVGRAISFVTKNFFSKANGN------ 370 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ + DG + + A+E+ I I I+I + ++ Sbjct: 371 -------RSGAPNVVVVMVDGWPTD-----KVEEASRLARESGINIFFITIEGAAENEKQ 418 Query: 379 L---------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 C ++ Y ++V + L Q + + + Sbjct: 419 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 461 >gi|148253063|ref|YP_001237648.1| hypothetical protein BBta_1525 [Bradyrhizobium sp. BTAi1] gi|146405236|gb|ABQ33742.1| putative membrane protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 174 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 37/122 (30%), Gaps = 5/122 (4%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +RF + SE +I FAL + +L+ + + Y+ + +A A +L Sbjct: 20 TRFGKLSGR--RSESGTAAIEFALFIPFLFILLVGTVDLGFAMYEAMQVSNAVEAGMLYA 77 Query: 65 ASKMVSNLSRLGDRFESISNHA---KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A + + S + A + + + + Sbjct: 78 AKNGWDSAGITNSVLNASSVYPGGTPALTATPAPSQFCGCPQATGIAVATCSSTCPDGSA 137 Query: 122 VN 123 V+ Sbjct: 138 VS 139 >gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca] Length = 885 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 64/223 (28%), Gaps = 42/223 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 247 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRAEDQFSVI 286 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 287 DFNHNV-RTWRNDLVSATKTQIVDAKKYIEKI---QPSGGTNINEALLRAIFILNEANN- 341 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIKIVTIS 368 + + II ++DG+ + KI K+N I + ++ Sbjct: 342 ----------LGMLDPESVSLIILVSDGDPTVGNCELKLSKIQKNVKQNIRDNIALFSLG 391 Query: 369 INASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 I + L + + + + L + +S Sbjct: 392 IGFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 434 >gi|258624851|ref|ZP_05719779.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582849|gb|EEW07670.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 189 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 37/124 (29%), Gaps = 17/124 (13%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LVVDLS SM + ++ A+K L F+ Sbjct: 78 STSTSHRDLMLVVDLSYSMSQEDM----------QSGQQMVDRLTAVKQVLSEFITK--- 124 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +E MGLI + + + V + + + LI T + A + Sbjct: 125 ----REGDRMGLILFADHAYLQTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKT 180 Query: 305 LTSD 308 Sbjct: 181 FIDS 184 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 64/192 (33%), Gaps = 25/192 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ + + + LI Y+ ++ + + D Sbjct: 174 SSFKQMKDFVRALMGHF-----EGTNTLFSLIQYSHLLKIHFTFTQFQNSWNPLSLVDPI 228 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + +++ + L K +K +I +TDG+ +K Sbjct: 229 VQLKGLTYTATGIRKVVEELFHSKNG-------------ARKSAKKILIVITDGQK--YK 273 Query: 346 SNVNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + + +A+ I I + + P+ ++ L S P ++ + V N +L Sbjct: 274 DPLEYSDVIPQAERAGIIRYAIGVGDAFWKPSAKQELDNIGSEPAQDHVFRVDNFAALSS 333 Query: 401 VFQNISQLMVHR 412 + + + + + Sbjct: 334 IQEQLQEKIFAL 345 >gi|109466064|ref|XP_001075558.1| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1178 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 62/193 (32%), Gaps = 30/193 (15%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A+ FL+ L + + LI Y + K V ++ Sbjct: 183 YPWEAVKNFLEKFVQGLDIGPKKTQVALIQYANDPRVVFNLTTYKNK-EDMVQATSETRQ 241 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A K ++ +TDGE+++ Sbjct: 242 YGGDLTNTFKAIQFARD-------------IAYLPESGGRPGATKVMVVVTDGESHDGSK 288 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNAD 396 I+ C+ ++ I I++ N L +K S+P Y +NV + Sbjct: 289 LQTVIQQCN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVADEA 345 Query: 397 SLIHVFQNISQLM 409 +L+ + + + Sbjct: 346 ALLEKAGTLGEHI 358 >gi|296482552|gb|DAA24667.1| vitrin precursor [Bos taurus] Length = 652 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 50/401 (12%), Positives = 108/401 (26%), Gaps = 52/401 (12%) Query: 39 FLIYVLDWHYKK---------NSMESANN---AAILAGASKMVSNL-SRLGDRFESISNH 85 FLI K+ + + A L G + N ++ + S Sbjct: 270 FLIDGSSSIGKRRFRIQKQFLTDVAQTLDIGPAGPLMGVVQYGDNPATQFNLKTHMNSQD 329 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 K A+ ++R ++ ++S + F++ N + +++ + D Sbjct: 330 VKAAIEKISQRGGLSNAGRAISFVTKNFFSKFNGNRGGAPNVAVVMVDGWPTDKVEEASR 389 Query: 146 YNMDVMTSYDYRLQFIEH------LLNQRYNQKIVSFIPALLRIEMGE------RPIFLI 193 + + + +L + K V + + L+ Sbjct: 390 LARESGVNIFFITIEGASENEKQYMLEPNFANKAVCRTNGFYSLTVQNWFSLHKTVQPLV 449 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + V D + D+ T + + L + D Sbjct: 450 KRVCDTDRLACSKTCLNSADIGFVIDGSSSVGT--SNFRTVLQFVANLSREFEISDTDTR 507 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G + YT + K T + A+ A + L Sbjct: 508 IGAMQYTYEQRLEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF------- 560 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K +K +I +TDG + + + A + I + + Sbjct: 561 ---------KKSKPNKRKLMILITDGRSYDDIRIPAML-----AHHKGVITYAIGV--AW 604 Query: 374 NGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 Q L + P ++ + V D+L V + Q + Sbjct: 605 AAQDELDIIATHPARDHAFFVDEFDNLYKVVPKVIQNICTE 645 >gi|291411003|ref|XP_002721794.1| PREDICTED: integrin, alpha X (complement component 3 receptor 4 subunit-like [Oryctolagus cuniculus] Length = 1100 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 60/193 (31%), Gaps = 29/193 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR--QYVT 281 + A +KN + + + L+ ++ + + + Q + Sbjct: 164 SFSNFATMKNFVKAVMSQFP-----RPSTQFSLMQFSNEFQTHFTFNDFVSSTNPLQLLD 218 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R + T + A+ + L K +I +TDG+ Sbjct: 219 RVYQLM--GTTHTATAILRVVDQLFHASTG-------------ARKDATKILIVITDGQK 263 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNAD 396 + + + KA+ I I + + + R L S P E+ + V N D Sbjct: 264 LD--DPLGYEDVIPKAEAAGIIRYAIGVGLAFQVVSSLRELHDIASEPAHEHVFRVENFD 321 Query: 397 SLIHVFQNISQLM 409 +L + + + + Sbjct: 322 ALRDIQGRLKEKI 334 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 52/167 (31%), Gaps = 22/167 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTD 293 + + L ++ V +G++ Y+ + S + +T + + T Sbjct: 54 NWTIKIVRDLPVHEDAVRVGIVQYSDEAKTEFNLSR-YSERNDIITHLETLKFMPGEDTR 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ +A + L + II TDG S Sbjct: 113 TGVALSKADDEIFDYDGG-------------ARLKATRLIIVFTDG-----LSMDKPTLA 154 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + +KI TIS+N+ +L V + + + + + Sbjct: 155 AKALRRKGVKIYTISVNSIGFVPEMLGI-VGDADNVFGPTDENRIEE 200 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------MGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIRD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLRKFYNQVST 495 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 61/194 (31%), Gaps = 29/194 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM- 284 A +K+ F+D + + G + Y+ V + + + Sbjct: 60 AQVKDFTNRFIDKLTERYFRCDRFLAWNAGALHYSDSVVIIKDLTAMPSGRAELKNSVSA 119 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + I K T + A+KQ + L K++I +TDG Sbjct: 120 INYIGKGTHTDCAIKQGIERLLLGGSHLKEN---------------KYLIVVTDGHPLEG 164 Query: 345 KSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD---- 396 ++AK IK+ +++I + + + L + ++ + Sbjct: 165 YKEPCGGLDDAANEAKHLGIKVFSVAI-SPHHLDQRLNIIATDHAYRRNFTATSLKPTRD 223 Query: 397 -SLIHVFQNISQLM 409 + NI +++ Sbjct: 224 LDVEETINNIIEMI 237 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 32/338 (9%), Positives = 99/338 (29%), Gaps = 31/338 (9%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ--N 120 A + N+ L +++ A +A +F K ++ + + V + Sbjct: 673 AMGDANIDNIGALKQAVKNLKWIAGGTHTGEALQFSKENLLRRFTSNNNVAIVITDGRSD 732 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + + + S + + +L I L R + Sbjct: 733 TLRDRTPLTSLCEVTPVVSLGIGDIFRNNPNPD---QLNDIACLGMPRRQGLSIQRDNYA 789 Query: 181 LRIEMG--ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLL 237 ++ + + ++P D+ +K KN + Sbjct: 790 ELLDDSFLQNITSYVCREKKCPDYTCPITFANPADIMLLVDSSTSVGSKNFDTTKNFVKR 849 Query: 238 FLDSI-DLLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + ++ V + ++ Y+ + + + + + V + + TD Sbjct: 850 LAERFLEASKPAEDSVRVSVVQYSGRNQQKVEVPFQRNYTVIAKAVDNME--FMNEATDV 907 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ + +K ++ +DG + + + Sbjct: 908 NAALQY----------------IMGLYQRSSRSGAKKKVLVFSDGNSQGITARAIERTV- 950 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + ++ I++ +++ + N + + V+ +Y+V Sbjct: 951 QEVQQAGIEVYVLAVGSQVNEPNV-RVLVTGKSTNYDV 987 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 3/38 (7%), Positives = 18/38 (47%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 E+ +++ A+ + ++ + + + +N +++ Sbjct: 14 EEGAVAVLTAILMAVLIMFAALAVDLGYLYGVRNELQN 51 >gi|154250534|ref|YP_001411358.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154484|gb|ABS61701.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 140 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 11/135 (8%), Positives = 30/135 (22%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + E+ + +I FA FL ++ I + ++ A A + + Sbjct: 1 MSRFGRDERGSVAIEFAFIAAVFLAILFGTISYGFQFATRIALSYAVTEGGRAAVAGLSD 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + A + D + + + + Sbjct: 61 QERTQRAADAIYAVVDAYAPLIDRGGISLLDPQWRETEVGRTGDIAIEYTDARFTFLPFV 120 Query: 131 HMANNRLDSSNNTIF 145 + + Sbjct: 121 PAIPGTMRVQTTFVV 135 >gi|327283159|ref|XP_003226309.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1076 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 69/180 (38%), Gaps = 31/180 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +K+ + +D + + +GL+ Y+ V + T + V+Q + + Sbjct: 820 FEIIKDFVTALVDRVT---VGRNATRIGLVLYSLEVRLEFGLNRYTTQQDVKQAIRKM-- 874 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T + A+++A Q +G +K I LTDG+ + + Sbjct: 875 LYMGEGTYTGTAIRKATQ----------------EGFFGARTGVRKVAIVLTDGQADKRE 918 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRL-----LKTCVSSPE--YHYNVVNADSL 398 + V +A I++ I I + + ++ L S P+ + Y + + ++L Sbjct: 919 A-VKLDIAVREAHAANIEMYAIGIVNTSDPTQVEFLRELNLIASDPDREHMYLIDDFNTL 977 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 62/206 (30%), Gaps = 30/206 (14%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 ++ ++ ++ D K + L L + V+ Sbjct: 291 FMLTVTPGDDLPAITEGRDWTLVLDRSGSMSGKFSTLLEGLRKGFAKFNRNDRVR----- 345 Query: 255 GLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +I + + E ++Q V ++ T+ A++ A L +D+ Sbjct: 346 -VIMFNDNATEVTNGWVQATPENLQQVVGAVENAGPSGGTNLMSAIQSALTGLDADR--- 401 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 I +TDGE N + + ++ I++ T + S Sbjct: 402 -----------------TNAIWLVTDGEANVGE--TKQKAFIELLEKKDIRLFTFIMGNS 442 Query: 373 PNGQRLLKTCVSSPEYHYNVVNADSL 398 N L S + +V N+D + Sbjct: 443 ANRPLLEAITKHSNGFAISVSNSDDI 468 >gi|323444971|gb|EGB01833.1| hypothetical protein AURANDRAFT_69450 [Aureococcus anophagefferens] Length = 376 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 48/171 (28%), Gaps = 25/171 (14%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSD 308 + +GL+ Y + + + T V ++ L T+ ++ L Sbjct: 38 DTCRLGLVVYDAQARVVVGLARVTPAHVAKVHAALEKLAPGTSTNLWGGLELGVDELVGG 97 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENF---IKI 364 + + ++ LTDG NN + + ++ + + Sbjct: 98 AGDNA-----------------RAVLLLTDGVPNNSPPEGEVAALRAKRLTKDGSETVAV 140 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + LL V +A + F ++ + + S Sbjct: 141 FAAGFGYALKSDLLLSLAREGGGLFSFVPDAGMVGTSFNHL---VASLRSS 188 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 71/191 (37%), Gaps = 15/191 (7%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRD-- 283 +K L L +D++ + + ++ Y++ V++++ V + + R Sbjct: 236 FEHVKRTLGLMIDNLCDG-ISPDTNRVAMLRYSSDVKEDLNFIEGSNEPTVMRNIQRLKY 294 Query: 284 --MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T + AM +A + + + + + G+K+ + ++ +TDGE+ Sbjct: 295 KPITDDRHGSTYTAHAMDKALKTIFTSEAG-WRNGTTEDGIKVRTE-----VVIITDGES 348 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N+ K E IK+ + + + + E + +++ L Sbjct: 349 NDPDETFTIQGQKVKYDEYGIKVYALGVG-DIKKDEIRQLTSMDDESIFYLMSWKDLA-A 406 Query: 402 FQNISQLMVHR 412 F I + ++ Sbjct: 407 FNRIIETLIET 417 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 54/192 (28%), Gaps = 27/192 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT-----RVEKNIEPSWGTEKVRQYVTR 282 +K + D D+ + V Y T ++ I + + Sbjct: 196 FDRVKQWVKNVTDRFDISTFANVGVIQYSHYYETRTVQPYMKVEIGLGQYKTQAEFQMAV 255 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D T + A+ + + + + +K I+ LTDG++N Sbjct: 256 DSIQFQGFTTFTAHALNRTVEEFMNSTR-------------YSDPTTRKVIVLLTDGQSN 302 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP----EYHYNVVNADSL 398 + + T I I + + + L+ S E + + L Sbjct: 303 DREFLEETSAYARGL---GITIFAVGVE--GYSEEELQIITSGELGNNERVFGLDTFSDL 357 Query: 399 IHVFQNISQLMV 410 V ++ + Sbjct: 358 NKVVDSLQLSIT 369 >gi|297671393|ref|XP_002813835.1| PREDICTED: collagen alpha-1(VII) chain-like [Pongo abelii] Length = 2889 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 42/139 (30%), Gaps = 15/139 (10%) Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 QY +DS + +L V K I Sbjct: 80 ATVQYSDDPRGQWASDASDSFEGTGLSINVLMIRA--------ILSSVTQARDCVPKVCI 131 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 +TDG++ + + K IK+ + I + + + L + + ++ + V Sbjct: 132 LITDGKSQD-----LVDTAAQRLKGQGIKLFAVGI-KNADPEELKRVASQPTSDFFFFVN 185 Query: 394 NADSLIHVFQNISQLMVHR 412 + L + +S+ + Sbjct: 186 DFSILRTLLPLVSRRVCTT 204 >gi|83589209|ref|YP_429218.1| hypothetical protein Moth_0341 [Moorella thermoacetica ATCC 39073] gi|83572123|gb|ABC18675.1| hypothetical protein Moth_0341 [Moorella thermoacetica ATCC 39073] Length = 212 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 31/91 (34%), Gaps = 1/91 (1%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK-NSMESANNAAILAGASKMV 69 K+ + ++ I F + + +L + +L + ++ A A+ A A ++ Sbjct: 2 LKRLLKDQRGPALIWFLIFLPVLMLGMAYLADYTQATTESDIDVQRALEMAVRAAAMQVT 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + G +I + A + Sbjct: 62 PDSQAAGHPRINIVAANIAFRRELASNLGLD 92 >gi|303241024|ref|ZP_07327534.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591449|gb|EFL61187.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 569 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 62/191 (32%), Gaps = 21/191 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLD---SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + ++A+ + + + ++ + L+ + T V + + Sbjct: 392 TSGSMSGEPIDNARSAIQNLFNKEAQEKNYTSIDDEDTISLMTFNTDVSDV--YTVKGKD 449 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + T A+ +A + K+ I+ Sbjct: 450 ISEMSVVIDSLSASGNTHLYDAVDKAITEHQALKQSESEKKI-------------DIIVV 496 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIK-IVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 L+DG + N + + ++ K KE + I+TI + + L + +Y N Sbjct: 497 LSDGADTNSQIQFSQLESMLKQKEGNLPVIITIGYG-NVDKDVLESISDKTGGKYYEG-N 554 Query: 395 ADSLIHVFQNI 405 D++ VF+ I Sbjct: 555 PDTIKKVFEEI 565 >gi|301781662|ref|XP_002926252.1| PREDICTED: collagen alpha-3(VI) chain-like [Ailuropoda melanoleuca] Length = 3167 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/355 (8%), Positives = 97/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA+L ++ + ++ A+ + A I++ + + L + + Sbjct: 1497 AAVLDALRRLRFRGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLFLGG----KS 1552 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 Q+ ++ ++ L + I + E +++ Sbjct: 1553 QDDISRFSQVISSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPSIEDRVMHA 1612 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 P I + + + + + + L Sbjct: 1613 FGPSGVTPAP--PGVDIP----------SPSRPEKKKADVVFLLDGSINFRRDTFQEVLR 1660 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D L + + +GL+ Y + +Q + + ++ Sbjct: 1661 FVSEIVDTLYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKQQIIDAINKVVYKGGRHANT 1719 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ +F + + +T G KS + + Sbjct: 1720 KVGIEH-----------LRQNHFVPEAGSRLDQRVPQIAFVITGG-----KSVEDAQEAS 1763 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1764 LALTQKGVKVFAVGV-KNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1816 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 49/168 (29%), Gaps = 20/168 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ ++ LS + + +G++ Y+ Q + + + Sbjct: 263 DFLVNLLERLSVGAQQIRVGVVQYSDEPRTVFSL-DTYSTKAQVLDAVKALVFTGGELAN 321 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + +F R G + ++ ++ G +++ + Sbjct: 322 VGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRDGVV----- 367 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ ++ + V + L Sbjct: 368 ALKQ--ASVFSFGLGAQAASRAELQHIATNDNLVFTVPEFRSFGDLQE 413 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 58/187 (31%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ + +D + + + +I ++ + + K Sbjct: 1248 FQYIRTLIERLVDYL---DVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDEVQ-NAVRRLR 1303 Query: 288 ILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + A++ + + S +F++ ++ G K Sbjct: 1304 PKGGRQVNIGGALEYVSRNIFKRPLGSRIEEG-----------VPQFLVLISSG-----K 1347 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 S+ + K++ + T++ + + + L+K SPEY ++V L + Q + Sbjct: 1348 SDDEIDDSAAELKQSGVAPFTVA--RNADPEELVKIS-LSPEYVFSVSTFRELPSLEQKL 1404 Query: 406 SQLMVHR 412 + Sbjct: 1405 LTPITTL 1411 >gi|149059383|gb|EDM10390.1| integrin, alpha 2 [Rattus norvegicus] Length = 1140 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 62/193 (32%), Gaps = 30/193 (15%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A+ FL+ L + + LI Y + K V ++ Sbjct: 183 YPWEAVKNFLEKFVQGLDIGPKKTQVALIQYANDPRVVFNLTTYKNK-EDMVQATSETRQ 241 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A K ++ +TDGE+++ Sbjct: 242 YGGDLTNTFKAIQFARD-------------IAYLPESGGRPGATKVMVVVTDGESHDGSK 288 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNAD 396 I+ C+ ++ I I++ N L +K S+P Y +NV + Sbjct: 289 LQTVIQQCN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVADEA 345 Query: 397 SLIHVFQNISQLM 409 +L+ + + + Sbjct: 346 ALLEKAGTLGEHI 358 >gi|159900726|ref|YP_001546973.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893765|gb|ABX06845.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 455 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 79/293 (26%), Gaps = 44/293 (15%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 L + Q + + + + +R L+VD S SM + Sbjct: 2 TSLDLRLTLSRS-SVPANTGEQVLYVLAEVVGQGKTNQRAPVHCVLLVDCSPSMRVPIAD 60 Query: 210 DPEDVN------------SAPICQDKKRTKMAALKNALLLFLDSID-----LLSHVKEDV 252 + M ++ + + D Sbjct: 61 AALFRELMRRGAAHEVMLDGVPVWKIRDDLMDEVRRRAQSPSRWLGQAVRGAAERLNADD 120 Query: 253 YMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +IG+ + ++ ++ V + + T P ++ +L S Sbjct: 121 QLSIIGFAEKAHAVLKAKSPTDVVRLEGSVAILERGDLGRSTRLAPGLRSTIDLLDS--- 177 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +PS F + I+ LTDG + I + TI + Sbjct: 178 -------------MPSTGFSQRIVVLTDGFV---EDEQQAFAYARHLAARSIPVSTIGLG 221 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV----FQNISQLMVHRKYSVILK 419 Q L+ S + + L ++ F + ++ +K + L+ Sbjct: 222 VEFQEQLLMSFADQSGGQSSFITDPSDLPNLLDVEFGK-AHAIIAQKARLDLR 273 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 64/228 (28%), Gaps = 44/228 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + ALL L ++ Sbjct: 247 PTLPKSVVFVIDKSGSMD--------------------GRKIVQTREALLKVLGDLNPED 286 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 V+ +I + +++V +D T+ A+ A +L Sbjct: 287 QFNLVVFNSMIS-QWQPSLLKATQENVGSAKKFV---LDIRASGGTNINEAVLAAVHLLD 342 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---ENFIK 363 +R II LTDGE + N I + + Sbjct: 343 ESNQRELLPENSVS-----------MIILLTDGEPTVGE--TNPENIQQNIQRSLDGKYA 389 Query: 364 IVTISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQ 407 + + + L K + + + + A L +Q ++ Sbjct: 390 LFCLGFGFDVSYSFLEKMALDNSGLARRIYEDSDAALQLQDFYQEVAT 437 >gi|12957162|dbj|BAB32650.1| complement factor B/C2-A3 [Cyprinus carpio] Length = 754 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 59/196 (30%), Gaps = 24/196 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD---- 283 K + ++ I ++ + T V + + + + Sbjct: 267 FEKAKGVIKTLIEKISYYEVSPNYE---ILIFATDVARIVSMRDFKSAQKNNLLEILKRL 323 Query: 284 -MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K + + QAY+ + + + K Q +I TDG+ N Sbjct: 324 KDYEYNSKGDRTGTNIAQAYRSILESMQ------IEQMTNKEEFKTTQHIVIMFTDGQAN 377 Query: 343 -NFKSNVNTIKICDKAKEN-------FIKIVTISINASPNGQRL--LKTCVSSPEYHYNV 392 +I D K+N + + + N + + LKT + ++ + + Sbjct: 378 MGGNPRPWVDQIKDLVKKNSPSEEEENLDLYVFGMGDDVNAEDINDLKTDRGNEKFFFKL 437 Query: 393 VNADSLIHVFQNISQL 408 N + L F ++ Sbjct: 438 KNLEDLQETFDSMIDE 453 >gi|319778370|ref|YP_004129283.1| exopolysaccharide biosynthesis domain protein [Taylorella equigenitalis MCE9] gi|317108394|gb|ADU91140.1| exopolysaccharide biosynthesis domain protein [Taylorella equigenitalis MCE9] Length = 553 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/258 (10%), Positives = 59/258 (22%), Gaps = 52/258 (20%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 + +L + + + + +I Sbjct: 332 SKELTPQERPQMVLVFDNSISML-----HSMDAATANFEQQMLNAMLSGVIP-------- 378 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + T++ K A ++SID + T Sbjct: 379 ------------SGGIPMPSNGPTRIQTAKKAANSIINSIDPYIDIGFISLDRCPTATNH 426 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++ + R + T ++QA ++ + +F Sbjct: 427 GFFKP---SKRGALKSTINRMDTNNSSG-TVLANGIQQAANMVDGKNRDAF--------- 473 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 I+ L+DG + + + K +KI + I ++ L Sbjct: 474 ----------ILVLSDGASTCGPNICTVAANAKRNK-PKLKINVVDIGSTGEANCL---A 519 Query: 383 VSSPEYHYNVVNADSLIH 400 + Y A L Sbjct: 520 RVTGGQVYKANTAAQLAK 537 >gi|487830|gb|AAA59180.1| eukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1163 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHFTFEEFRRTSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDATKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|72093926|ref|XP_787298.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] gi|115954441|ref|XP_001184332.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] Length = 504 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 58/201 (28%), Gaps = 41/201 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + V+D+SGSM K+ K A LD + + Sbjct: 332 PKTRKNVVFVIDVSGSMR--------------------GRKIDQTKRAFTTILDDVRPID 371 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + + S +++V R T+ ++ A +L Sbjct: 372 RINIVLFESDVRVWRSNQMVEATSDNIAAAKRHVNRI---RAGGGTNLYDGLRNAVDLLM 428 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI--CDKAKENFIKI 364 II LTDG+ + + I + + + Sbjct: 429 EHGNGE----------------AMPLIIMLTDGQPTSGSVKSTSEIIQRITNLIDGRLSL 472 Query: 365 VTISINASPNGQRLLKTCVSS 385 ++S + L K +S+ Sbjct: 473 FSVSFGNGVDFSFLEKLSLSN 493 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 42/140 (30%), Gaps = 14/140 (10%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 + + + + ++ +Q +F L + I ++ D+ M Sbjct: 337 NVVFVIDVSGSMRGRKIDQTKRAFTTILDDVR-PIDRINIVLFESDVRVWRSNQMVEATS 395 Query: 213 DV------NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 D + I L+NA+ L ++ M LI T + Sbjct: 396 DNIAAAKRHVNRIRAGGGTNLYDGLRNAVDLLMEH-------GNGEAMPLIIMLTDGQPT 448 Query: 267 IEPSWGTEKVRQYVTRDMDS 286 T ++ Q +T +D Sbjct: 449 SGSVKSTSEIIQRITNLIDG 468 >gi|266631670|emb|CBH29316.3| integrin, alpha X (complement component 3 receptor 4 subunit [Equus caballus] Length = 1160 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 71/224 (31%), Gaps = 47/224 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I ++D SGS++ + ++ +K + F + Sbjct: 152 DIVFLIDGSGSIYFKDFAKM----------------LSFVKAVMSQF---------QRPS 186 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 L+ ++ + + + + S + T + A++ + L S + Sbjct: 187 TQFSLMQFSNKFLVHFTFKDFMDSSDPLGLLNSVSQLRGLTHTASAIQVVIKELFSATRG 246 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISIN 370 K +I +TDG+ ++ + I + + A I + + Sbjct: 247 -------------ARKDASKILIVITDGQKQGDYLGYDDVIPMAEAA---GIIRYAVGVG 290 Query: 371 A---SPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + S + L S P + + V + D+L + + + + Sbjct: 291 SAFQSTQAWQELNDIASKPSHEHIFKVDDFDALRDIQNQLKEKI 334 >gi|166795266|ref|NP_001107649.1| integrin alpha-X [Equus caballus] gi|164507177|gb|ABY59790.1| integrin alpha X [Equus caballus] Length = 1160 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 71/224 (31%), Gaps = 47/224 (20%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I ++D SGS++ + ++ +K + F + Sbjct: 152 DIVFLIDGSGSIYFKDFAKM----------------LSFVKAVMSQF---------QRPS 186 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 L+ ++ + + + + S + T + A++ + L S + Sbjct: 187 TQFSLMQFSNKFLVHFTFKDFMDSSDPLGLLNSVSQLRGLTHTASAIQVVIKELFSATRG 246 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISIN 370 K +I +TDG+ ++ + I + + A I + + Sbjct: 247 -------------ARKDASKILIVITDGQKQGDYLGYDDVIPMAEAA---GIIRYAVGVG 290 Query: 371 A---SPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + S + L S P + + V + D+L + + + + Sbjct: 291 SAFQSTQAWQELNDIASKPSHEHIFKVDDFDALRDIQNQLKEKI 334 >gi|156383259|ref|XP_001632752.1| predicted protein [Nematostella vectensis] gi|156219812|gb|EDO40689.1| predicted protein [Nematostella vectensis] Length = 157 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 44/143 (30%), Gaps = 21/143 (14%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++G++ + T +E + D T + A++ + Sbjct: 36 SFDISASGTHIGIVTFATDPTVELEFDQSFDNTSIATIIDNIRNPDALTFTGKALETVKK 95 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + +I LTDG S + + KE+ + Sbjct: 96 DLFEKSQ---------------RANVHRMLIVLTDGR-----SWDAVQEPAKQLKESGVT 135 Query: 364 IVTISINASPNGQRLLKTCVSSP 386 + + + + ++ LK S+P Sbjct: 136 LYAVGVGQDYDLEQ-LKDIASNP 157 >gi|197117534|ref|YP_002137961.1| hypothetical protein Gbem_1146 [Geobacter bemidjiensis Bem] gi|197086894|gb|ACH38165.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 383 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/258 (8%), Positives = 64/258 (24%), Gaps = 18/258 (6%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM-----------VSNLSRLG 76 + ++ FL+L G I + + ++ + A L GA + L+++ Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKQRLLFQAQQSPEKLAQVS 67 Query: 77 D---RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + S A + + + ++ + S T ++S + Sbjct: 68 ADPLQSAARSTAVDNATGKHSASALVALMNDNGNALSENNDITVGFWNMSSRSYTPGGTP 127 Query: 134 NNRLDSSNNTIFYNMDVMTS--YDYRLQFIE--HLLNQRYNQKIVSFIPALLRIEMGERP 189 N + + V + + + + Sbjct: 128 VNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAICAAAC 187 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + ++ + T + D + V+ Sbjct: 188 EPSCSYPDVCNIAERKMSHAPWDPRRENSSANRYLYTSLLHPVTITNTMSDLVCQEMPVQ 247 Query: 250 EDVYMGLIGYTTRVEKNI 267 E + + + + + Sbjct: 248 EVCGLPIFTAAMKTDAIL 265 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + AL+ L+ + + I + + + +R Sbjct: 276 KLQQTREALVTILNDLRPQDRFNIIGFSNRIKM--WKDHLLPVTPD--NIRNGKIYMYHL 331 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ A ++L + ++ IIFLTDG+ ++ Sbjct: 332 SPTGGTDINGALQTAIKLLNNYVAQNDIE-----------DRSVSLIIFLTDGKPTFGET 380 Query: 347 NV-NTIKICDKAKENFIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHV 401 N + +A I I T+ I + + L L+ C + H LI Sbjct: 381 NTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGF 440 Query: 402 FQNI 405 + I Sbjct: 441 YDEI 444 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 59/184 (32%), Gaps = 20/184 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + AL+ L+ + + I + + + +R Sbjct: 310 KLQQTREALVTILNDLRPQDRFNIIGFSNRIKM--WKDHLLPVTPD--NIRNGKIYMYHL 365 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ A ++L + ++ IIFLTDG+ ++ Sbjct: 366 SPTGGTDINGALQTAIKLLNNYVAQNDIE-----------DRSVSLIIFLTDGKPTFGET 414 Query: 347 NV-NTIKICDKAKENFIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHV 401 N + +A I I T+ I + + L L+ C + H LI Sbjct: 415 NTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCGLTRRVHEEEKAGAQLIGF 474 Query: 402 FQNI 405 + I Sbjct: 475 YDEI 478 >gi|327270796|ref|XP_003220174.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 948 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 66/210 (31%), Gaps = 35/210 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 + K ++ L A LFL L ++ ++G++ + + V Sbjct: 301 TSGKMGKDNRLGRLNQAAKLFL-----LQIIESGSWVGIVTFNNEAATKTLLQKIVNDGV 355 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 RQ +T + + + + +Q+ + S I+ L Sbjct: 356 RQTLTSYLPTTAAGESKICDGVLAGFQVFLNKYPSSEGCE----------------IVLL 399 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV--- 393 T GE+ +S + + + I TI+ + L K + + Sbjct: 400 THGEDPAIRSCFP------QIQNSGSIIHTIAFGSG-TSNELEKLADMTGGLAFYATDSL 452 Query: 394 NADSLIHVFQNISQ---LMVHRKYSVILKG 420 +++ L+ F IS + + + K Sbjct: 453 DSNGLMDAFSGISSGGGDISQQSIQLESKA 482 >gi|308497300|ref|XP_003110837.1| CRE-CUT-6 protein [Caenorhabditis remanei] gi|308242717|gb|EFO86669.1| CRE-CUT-6 protein [Caenorhabditis remanei] Length = 576 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 52/176 (29%), Gaps = 22/176 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + L+ +E +M LI Y + + Sbjct: 62 DTFKKQLSFAMHLASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ 121 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ +G++ K I +TDG++ + S Sbjct: 122 SGATNTGQALRLTL----------------EKGLQGARPGIPKVAIVITDGQSQDDVSEP 165 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + ++ + + I + N +L + +P + V + L + Sbjct: 166 S-----QLLRDADVMVYAIGVTNLVNVHQLHQM-TGNPVRVFTVETFEQLDRALAD 215 >gi|302783060|ref|XP_002973303.1| hypothetical protein SELMODRAFT_99409 [Selaginella moellendorffii] gi|300159056|gb|EFJ25677.1| hypothetical protein SELMODRAFT_99409 [Selaginella moellendorffii] Length = 99 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 8/58 (13%), Positives = 22/58 (37%), Gaps = 4/58 (6%) Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + + T + + + +L ++ V ++ H F +Q + SV+ + Sbjct: 21 VPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEEFAVQHAF---AQFIGGL-LSVVAQ 74 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 56/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + + K L+ Y+ + + + Sbjct: 166 FDRMKKFVSTVMSRFQ-----KSKTLFALMQYSDDFRTHFTFNDFKRNSDPELLVRPIGQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L + K +I +TDGE + Sbjct: 221 LFGRTHTATGIRKVVRELFHSSSGARNHAL-------------KIMIVITDGEK--YLDP 265 Query: 348 VNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + + +A I I + S ++ L T S P ++ + V N ++L + Sbjct: 266 LEYRDVIPEADRKGIIRYVIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|291232650|ref|XP_002736273.1| PREDICTED: mind bomb 2-like [Saccoglossus kowalevskii] Length = 847 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/194 (10%), Positives = 56/194 (28%), Gaps = 22/194 (11%) Query: 228 MAALKNALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 +A LK A FL + + + ++ + + V++ + + Sbjct: 6 IAELKRAGNEFLIGVQLTANQTGLRENVAVVEFGRNTRIVQSLTDNYVSVKRAIDSLV-- 63 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + +A + + S+ +I +TDG ++ Sbjct: 64 -PGGTTPMFEGLMEAMKEIISNGGVLTL---------KGGKKMTPRVILMTDGYGDDKDK 113 Query: 347 NVNT-IKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + K I I + + + + L + + L Sbjct: 114 VAKSAMSFGPLWKAVGLPHPIPIACVGCGPNVDKELLAIIAEITNGMYVTGE-MGQLSGF 172 Query: 402 FQNISQLMVHRKYS 415 F+ + ++ +++ Sbjct: 173 FR---RQVLLLRFA 183 >gi|255009406|ref|ZP_05281532.1| hypothetical protein Bfra3_09717 [Bacteroides fragilis 3_1_12] gi|313147165|ref|ZP_07809358.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135932|gb|EFR53292.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 341 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 66/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISKLVD-------GMENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESINPSLISKQGTAIGAAISLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + + + I++ + + Sbjct: 192 AIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDGAPIPIEGSNDFRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N + + V N++S I+Q + Sbjct: 248 NVIVTRLNEAMCQEIAKVGNGIYVRVDNSNSAQKA---INQEI 287 >gi|119470035|ref|ZP_01612840.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] gi|119446745|gb|EAW28018.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] Length = 1090 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/343 (7%), Positives = 74/343 (21%), Gaps = 87/343 (25%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYN---------QKIVSFIPALLRIEMGERPIFLIELV 196 D + N + S Sbjct: 34 VETDENPRVLMIFDTSGSMDWSVINGDNQVCYIKKITQSNGNGNGNGNGNGNGNGNGNGG 93 Query: 197 VDLSGSMHCAMNSDPEDVN----------SAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + ++ + D ++ KNA+ ++ + Sbjct: 94 GGQVYYENVTCFASKDEYAEFNEQCYIGVNGVAVADCHDRRIDVAKNAMTQLVNDNSDID 153 Query: 247 HVKEDVYMGLIGYTTRV--EKNIEPSWGTEKVRQYVTRDMDSLILKPTD---------ST 295 GL+ + + V + + + S T+ + Sbjct: 154 -------FGLMRFRSNVGGYVVAKLGTDKTSLISDINSLPASGSTPMTETLWEAYRYITG 206 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK-----FIIFLTDGENNNFK----- 345 ++ A+ + + K + + + K + + +I +TDG+ N Sbjct: 207 QSLDYAFNV-SDRDKSADNSVVYTSPFKPNNGDPLRCDNSINVILMTDGDPTNDDGRDTS 265 Query: 346 --------------------SNVNTIKICDKAK---ENFI-------------KIVTISI 369 + + + + + ++ TI Sbjct: 266 IAQTHNSYFNDDIPFDDGTYDDSYLVAMAKILHGTSDTKVDLYTPSTDVLDTGRLYTIGF 325 Query: 370 NASPNG---QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L K + A++L +N + Sbjct: 326 GTGMSQGGIDLLDKAARVGGGQYIEARTAEALSEALKNTISRI 368 >gi|157961836|ref|YP_001501870.1| hypothetical protein Spea_2015 [Shewanella pealeana ATCC 700345] gi|157846836|gb|ABV87335.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345] Length = 445 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 37/105 (35%), Gaps = 5/105 (4%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ ++F + + S L + K +++A +AA L A ++ + Sbjct: 16 RNQSGAILVMFTIGLFSILAFAALALDGGHILLSKGRLQNAVDAAALNAAKELQQGATLF 75 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 R + + + + + S+S S F T++ Sbjct: 76 EARQAAYTILLQNLGFSE-----NGELNNSISLSSPDFNTTQVTA 115 >gi|239906055|ref|YP_002952794.1| hypothetical protein DMR_14170 [Desulfovibrio magneticus RS-1] gi|239795919|dbj|BAH74908.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 137 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 45/135 (33%), Gaps = 6/135 (4%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ ++ AL + L ++ +I + + ++++ +A A ++ Sbjct: 9 DQRGASAVEMALILPLLLTVVFAIIDYSRFFFLRSTVTAAVADATRLAVLPGTTDAMIAA 68 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 +++ + +A ++ + S SA T ++ S I + + Sbjct: 69 AISQALLDPINQADGQ------TPNVSVTPSQRSAGQPVTVTASLPFSPLILPQFLGMSL 122 Query: 137 LDSSNNTIFYNMDVM 151 + N + Sbjct: 123 FPQNINAAATMVVEP 137 >gi|62531155|gb|AAH92555.1| LOC594926 protein [Xenopus (Silurana) tropicalis] Length = 895 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 57/186 (30%), Gaps = 20/186 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM K ALL LD + H ++ + + + E + + + Sbjct: 291 KMQQTKEALLKILDDVKEHDHFNFVIFDWGV----EIWEQSLVKATPENLNRAKAYVRNL 346 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+ A +L IIF+TDG+ + + Sbjct: 347 YPKGWTNINDALLSAISLLDQAHDARSV-----------PKRSASLIIFMTDGQPSTGER 395 Query: 347 NVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHV 401 N++ I+ A + ++ + L K + + Y +A + Sbjct: 396 NLDKIQENARNAIRGKYSLYSLGFGVGVDYPFLEKLSLENSGVARRIYEESDAALQMEGF 455 Query: 402 FQNISQ 407 + ++ Sbjct: 456 YDEVAN 461 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 65/183 (35%), Gaps = 22/183 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K A L LD + + +G+ G+ + K +EP T K + + Sbjct: 55 KIFRAKQAALGLLDILRDKDY------VGVYGFAGKFYKVLEPVPAT-KRGEVERAIISL 107 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T+ + + L + K+ IIF+TDGE K Sbjct: 108 KLGSGTNIYDTL----KKLVEETKKVLQNGALSLVR----------IIFITDGEPTVGKK 153 Query: 347 NVNTI-KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 N I ++ K +E + I + N + L + ++ ++ + SL + Sbjct: 154 NPKKILEMAKKLREAGASALIIGVGTEYNEKLLSRMAMALNGEFEHISDPASLEKLISEY 213 Query: 406 SQL 408 ++ Sbjct: 214 AKS 216 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 67/190 (35%), Gaps = 37/190 (19%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-------VRQYVTRD 283 ++ L + ++ + + +++ V KNI+ K + + +++ Sbjct: 95 FRSELKFVKKLLSDVTVDYNHTRVAIATFSSSVSKNIDQISDPRKENNKCFLLSKLLSKI 154 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T++ A + A +I T + S +K + +TDG +N Sbjct: 155 E--YTGGGTNTLKAFEVAKEIFTQSRNDS-----------------EKVLFLITDGFSNG 195 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + + K++ +KI TI I + L S+P Y L+ F+ Sbjct: 196 GD----PIPLAAELKKDQVKIFTIGIANGNYKE--LYELASTPGEIY-----SYLLDSFE 244 Query: 404 NISQLMVHRK 413 L H K Sbjct: 245 EFESLARHLK 254 >gi|258627502|ref|ZP_05722283.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] gi|258580308|gb|EEW05276.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] Length = 406 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/207 (9%), Positives = 55/207 (26%), Gaps = 21/207 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + G ++ ++ +++S ++L I V + +++A +AA LA A + Sbjct: 7 RNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADAT 66 Query: 72 LSRLGDR--FESISNHAKRALIDDAKRFIKNHIKESLSGY-----------SAVFYNTEI 118 +S A + + + S + Y Sbjct: 67 HDTATATDVAKSTLTSMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVS 126 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-------IEHLLNQRYNQ 171 ++ + M + + S + + + + + + Sbjct: 127 VADLSLTEFFMQALGMAK-QVSAAAVAGPSSSINTIGNVVPIGVCKGSESGGIYGYHSSD 185 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVD 198 + MG L++ Sbjct: 186 VYALKVGDSSMTAMGSGNYNLLDFGSG 212 >gi|150024647|ref|YP_001295473.1| outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] gi|149771188|emb|CAL42655.1| Probable outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] Length = 631 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/249 (12%), Positives = 75/249 (30%), Gaps = 52/249 (20%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +K+V I + P + ++D+SGSM+ K+ Sbjct: 252 RKLVQIGLQGKNIPTNDLPASNLVFLIDVSGSMNAD-------------------NKLPL 292 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L +L + ++ + + ++ Y + P+ G EK + + Sbjct: 293 LIESLKILVEQLRKQD------KVAIVVYAGAAGLVLPPTAGNEK-QTIINALEKLNAGG 345 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-SNVN 349 T ++ AY+ + + +I TDG+ N S+ Sbjct: 346 STAGGAGIELAYKTAQENFIKDGNN----------------RVILATDGDFNVGSTSDSA 389 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS----LIHVFQ- 403 + + +E+ + + + ++ + + N L F+ Sbjct: 390 MQTLIEDKRESGVFLTCLGYGMGNYKDSKMEILADKGNGNYAYIDNIQEANRFLGKEFKG 449 Query: 404 ---NISQLM 409 I++ + Sbjct: 450 SMFAIAKDV 458 >gi|57163755|ref|NP_001009219.1| integrin alpha-E [Felis catus] gi|20530606|gb|AAM27173.1|AF420018_1 alpha E integrin [Felis catus] Length = 1160 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 57/185 (30%), Gaps = 18/185 (9%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + A + + + L+ Y ++ + + + + + Sbjct: 207 FQKAKDFISNMMKNFYAKCFECNFALVQYGEVIQTEFDLRDSQDALASLARVQNITQVKN 266 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A++ + + S K I+ +TDG + F +N Sbjct: 267 VTKTASAIQHVLDNIFTPSHGS-------------RKNAPKVIVVITDG--DIFGDPLNL 311 Query: 351 IKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 + K ++ I + + LK S P + + V N +L + + Q Sbjct: 312 TTVISSPKMQGVERFAIRVGNESTKTLKELKLIASDPFERHAFTVTNYSALDGLLSKLQQ 371 Query: 408 LMVHR 412 ++H Sbjct: 372 NIIHT 376 >gi|218131125|ref|ZP_03459929.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|317476997|ref|ZP_07936239.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986645|gb|EEC52979.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|316906790|gb|EFV28502.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 342 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 65/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDK-------MENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESIDPSLISKQGTAIGAAINLASRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + E I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAA----AEKGIQVNVLGVGMPEGAPIPAEGTNDYRRDRDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N Q + S + V N + ISQ + Sbjct: 248 NVIVTRLNEQMCQEIAKSGNGIYVRVDNTNGAQKA---ISQEI 287 >gi|301772358|ref|XP_002921601.1| PREDICTED: integrin alpha-2-like [Ailuropoda melanoleuca] Length = 1369 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/285 (12%), Positives = 82/285 (28%), Gaps = 37/285 (12%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI--ELVV 197 + +I ++ T+ L ++ + + + + + Sbjct: 287 TATSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAKQCGSQYYTTGVCSDVSADFQL 346 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGL 256 S + DV + A+ FL+ L + +GL Sbjct: 347 LTSFAPAAQACPSLIDVVVVCDESNSIY-----PWEAVKNFLEKFVQSLDIGPKKTQVGL 401 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFF 314 I Y + + K + + T++ A++ A Sbjct: 402 IQYANKPRVIFNLNTFKTKAE-MIEATSQTYQYGGDLTNTFKAIQYAKD----------- 449 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 F L K ++ +TDGE+++ I CD + I I++ N Sbjct: 450 --FAYAAGAGGRLGAAKVMVVVTDGESHDGSMLKAVIDQCD---NDNILRFGIAVLGYLN 504 Query: 375 GQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + + +NV + +L+ + + + Sbjct: 505 RNALDTKNLIKEIKAIASIPTETFFFNVSDEAALLEKAGTLGEQI 549 >gi|116622485|ref|YP_824641.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225647|gb|ABJ84356.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/279 (7%), Positives = 66/279 (23%), Gaps = 38/279 (13%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + ++ I + + + + I P +RI P L Sbjct: 15 IPLVSSQIDSRARYKVEVNMVVLN--FSVTDKSGNPITGLKPTDMRIFEDGIPQKLASFA 72 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKK--RTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 + + +++A+ F+ ++ + Sbjct: 73 EGSTVFRLVDDAPGSPGTAVFVLFDTSNRMYNSYPYVRDAIAEFIRRLEPVDSA------ 126 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + ++ + + + + + + A+ + Sbjct: 127 AIYTFSRNLYRAAPLTRDHVRAGAGLDNIS---AGDDSALFNALLLTLR----------- 172 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI----SIN 370 +P +K I+ ++ +N + + A + I I + Sbjct: 173 --------DAAEVPGRKAIVVFSNSSDNASVLSPY--DVGRLAVNEGVPIYVISTRDGVQ 222 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L ++ Y+ FQ I + + Sbjct: 223 DPVTNSALHYLTAATGGKLYSARTWQKQAGAFQAIREDI 261 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/291 (6%), Positives = 60/291 (20%), Gaps = 44/291 (15%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + + + + + + + + + I Sbjct: 3 DVYLNARGNVQKDITPLDLVLVVDWSGSMNDN----NRIGEVKIGVDRFVDTLADSGITD 58 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + + + ++KN + S + Sbjct: 59 KI-------NMGYVGYSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRD 111 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 + + ++ + D + ++ + T Sbjct: 112 AGSMLSVPNGH----------KKVIVLLTDCVPTFSYKVQ-------RVHAQSSSNYYGT 154 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENFIKIVTISINAS 372 F + + I D N + + + TI KE I+I + I Sbjct: 155 QFSNTQDRPGNTSLISRIYDAPDQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQ 214 Query: 373 PNGQRLLKTCV------------SSPEYHYN-VVNADSLIHVFQNISQLMV 410 + L + +Y +A + + + Sbjct: 215 SDPAAGLSKAEVESRMRQMVSSDEKGDLYYESADHATDISEYLAKKAVQIS 265 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 42/153 (27%), Gaps = 1/153 (0%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 F++ + S N + AL + + +G I ++ ++S + A +A A+ Sbjct: 5 TTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSVADLASIAAAAD 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + S + F S + + + + V I Sbjct: 65 VSSAEEAVLAHFRSNDLGFAVSTPSGLMTVDGKLLPNDVETAAKGVATIVRGRYVPDPAI 124 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 R + + ++ F Sbjct: 125 DAGQRFIRR-ALPADAVQVTLEKKGDIYLAALF 156 >gi|326330152|ref|ZP_08196463.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325951965|gb|EGD43994.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 141 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 11/136 (8%), Positives = 42/136 (30%), Gaps = 5/136 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ E+ + S + + +L+G + + + + + A++ A A Sbjct: 1 MKQRTHDERGSVSTWLVTASLVMAMLVGLAVDLGGQIHAQ---QRAHDLAAQAARVGGEQ 57 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++ A + S++ + +++ + Sbjct: 58 LQAAPAIQGRYVAVDTAAARRAATSYLRSAGVSGSVAVRTGDTVAVTVRDTY--ETRFLG 115 Query: 131 HMANNRLDSSNNTIFY 146 + +R+ + + Sbjct: 116 IIGISRVRVTGTATAH 131 >gi|262171234|ref|ZP_06038912.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] gi|261892310|gb|EEY38296.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] Length = 406 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/207 (9%), Positives = 55/207 (26%), Gaps = 21/207 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + G ++ ++ +++S ++L I V + +++A +AA LA A + Sbjct: 7 RNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADAT 66 Query: 72 LSRLGDR--FESISNHAKRALIDDAKRFIKNHIKESLSGY-----------SAVFYNTEI 118 +S A + + + S + Y Sbjct: 67 HDTATATDVAKSTLTSMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVS 126 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-------IEHLLNQRYNQ 171 ++ + M + + S + + + + + + Sbjct: 127 VADLSLTEFFMQALGMAK-QVSAAAVAGPSSSINTIGNVVPIGVCKGSESGGIYGYHSSD 185 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVD 198 + MG L++ Sbjct: 186 VYALKVGDSSMTAMGSGNYNLLDFGSG 212 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 65/195 (33%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + VR Sbjct: 259 SSASMVGTKLRQTKDALFTILHDLRPQDHFNVIGFSNRIKVWKDHLVSVTP-----DSVR 313 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L + + I IIFLT Sbjct: 314 DGKVYIHHMSPTGGTDINGALQRAIRLLNN----------YVAHNDIEDRSVS-LIIFLT 362 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 DG+ + NT+KI + KE I I TI I + + L K + + V + Sbjct: 363 DGKPTVGE--TNTLKILNNTKEATRGQICIFTIGIGDDVDFKLLEKLSLENCGLTRRVHD 420 Query: 395 ADS----LIHVFQNI 405 D LI + I Sbjct: 421 EDDAGSQLIGFYDEI 435 >gi|46201041|ref|ZP_00207942.1| hypothetical protein Magn03010639 [Magnetospirillum magnetotacticum MS-1] Length = 142 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 41/135 (30%), Gaps = 2/135 (1%) Query: 15 IASEKANFSIIFALSVMSF-LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I E+ ++ FAL++ L+++G + +K + SA A +G + + Sbjct: 5 IRDERGVSAVEFALALPILALMMVGLA-DMGLAVNEKMRLTSAVRAGAQSGYGNWNDSAA 63 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + A G + T V ++ + Sbjct: 64 IISAVKDASGVSPTALTVTTATSCACADGSAMACGNTCSGGATRTYLTVTATERYSLLVD 123 Query: 134 NNRLDSSNNTIFYNM 148 L + + Sbjct: 124 YPGLSGPASLSATAV 138 >gi|94309589|ref|YP_582799.1| TadE-like protein [Cupriavidus metallidurans CH34] gi|93353441|gb|ABF07530.1| flp pilus assembly protein (TadG-like) [Cupriavidus metallidurans CH34] Length = 176 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 12/154 (7%), Positives = 42/154 (27%), Gaps = 1/154 (0%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R ++ ++ ++ F + ++ LL+ + Y+ +++ A +A Sbjct: 1 MKRLTNAHRQSRTRQRGVAAVEFGIMLVPMLLMACGVAEFGRAIYQYDTLTKATRSAARY 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI-V 122 + +++ + ++ + + Sbjct: 61 LSQYSPDDVAYPTAATKCLAAYGNTGCSGQPLAPGLTTAMVIICDRVDSSGCPGATQTFS 120 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 N + T + + N I + T Sbjct: 121 NVATYDSTGGGSGTQAGTVNLIAVRISGYTYTPL 154 >gi|258621449|ref|ZP_05716483.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] gi|258586837|gb|EEW11552.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] Length = 406 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/207 (10%), Positives = 56/207 (27%), Gaps = 21/207 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + G ++ ++ +++S ++L I V + +++A +AA LA A + Sbjct: 7 RNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADAT 66 Query: 72 LSRLGDRFESISNHAK--RALIDDAKRFIKNHIKESLSGY-----------SAVFYNTEI 118 + S A A + + + S + Y Sbjct: 67 HDTATATDVAKSTLASMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVS 126 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-------IEHLLNQRYNQ 171 ++ + M + + S + + + + + + Sbjct: 127 VADLSLTEFFMQALGMAK-QVSAAAVAGPSSSINTIGNVVPIGVCKGSESGGIYGYHSSD 185 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVD 198 + MG L++ Sbjct: 186 VYALKVGDSSMTAMGSGNYNLLDFGSG 212 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 67/226 (29%), Gaps = 58/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGEAINLATRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + +K I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPEGAPIPVEGTNDYRRDREG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N + + V N++S ISQ + Sbjct: 248 NVIVTRLNEGMCQEIAKDGKGIYVRVDNSNSAQKA---ISQEISKM 290 >gi|186683831|ref|YP_001867027.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186466283|gb|ACC82084.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 615 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 42/159 (26%), Gaps = 18/159 (11%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ D + ++ Y V+ + P T K T+ + + Sbjct: 68 DQLEPDDILSVVVYDDEVDSVVPPQAVTNKA-TLKDSIRKVRAGGITNLSGGWLKG---- 122 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKI 364 ++ LTDG N + + K E I Sbjct: 123 ------------CEHVKTRLDPQKINRVLLLTDGHANMGIQDPKVLTATSGQKAEEGITT 170 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 T+ N L+ ++ Y + + D VF Sbjct: 171 TTLGFAQGFNEDLLIGMARAARGNFYFIQSIDEATEVFS 209 >gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Ailuropoda melanoleuca] Length = 946 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRAEDQFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQIVDAKKYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------LGMLDPESVSLIILVSDGDPTVGELKLSKIQKNVKQNIRD-NIALFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|198436264|ref|XP_002122997.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 1993 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/354 (9%), Positives = 80/354 (22%), Gaps = 44/354 (12%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 + +D + Y+ V T + I + + + Sbjct: 1653 AAIRYNRDIEHLWNLDQYTTRAALIEGINNIPYNGVGTLTGAAINYTAENIFLPELGRRK 1712 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 I + SYD + L + V + Sbjct: 1713 ---GVPKIVVVLTDGVSYDDVSIPSQRLKSDNTIIVAVGIGRYDQDQINEIASDPDQDFA 1769 Query: 197 VDLSGSMHCAMNSDPEDV--------NSAPICQDKKRTKMAALKNALLLFLDS------- 241 ++G + C + + ++ D Sbjct: 1770 TTVAGFDGLNRVVTTISSQIKLCQAITNLTTCPLITVMDIVLIVDSSSSIGDDNFELMRN 1829 Query: 242 -----IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + ++ +G + Y V++ + + T + Sbjct: 1830 FILELVRSFNVSRDTTRIGYVRYNNAVDERFQLNTFNTSEEVQEAIRAVPYRGSGTLTGQ 1889 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ A + + I +TDG + + ++ Sbjct: 1890 ALSYASRT-------------SVRAPAGRRPGVPGVAIVITDGRAQDA-VDAPARELQRL 1935 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + ++I L S Y +N+ + + L V +IS + Sbjct: 1936 MQ-------VVAIGIRGAVPEQLNAIASQQGYVFNIEDFNRLDEVLGSISSTVC 1982 >gi|121534348|ref|ZP_01666172.1| hypothetical protein TcarDRAFT_1914 [Thermosinus carboxydivorans Nor1] gi|121307118|gb|EAX48036.1| hypothetical protein TcarDRAFT_1914 [Thermosinus carboxydivorans Nor1] Length = 307 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 9/48 (18%), Positives = 24/48 (50%), Gaps = 1/48 (2%) Query: 4 LSRF-RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK 50 +SRF K + +++ +++ AL++ + L ++ V +Y + Sbjct: 1 MSRFCVKLVGKLLRNQQGAVAVLTALALTAMLGFAAIVVDVGLLYYNR 48 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 45/175 (25%), Gaps = 32/175 (18%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + L + + + S +R + R T A+ A Q L Sbjct: 118 GNPEDMAALYTFNWEIREQQPFSRD---LRAFDNRLKMMHGEAGTAMYDAVYLAAQRLEP 174 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +K I+ +TDG + + +V + + I I Sbjct: 175 RDG-------------------RKVIVVVTDGGDTVSRLSVQKALEAAQLAD--AVIYAI 213 Query: 368 SI-------NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + + G+ L + + L F +I + + Sbjct: 214 VVVPITNDAGRNIGGEHALDFMAKGTGGRIFMPTLGAELDKAFADIITELRTQYL 268 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 12/125 (9%), Positives = 31/125 (24%), Gaps = 7/125 (5%) Query: 35 LLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + F I + K +++A + A LAGA + Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVNQAETAVRTTL-------GG 82 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + + S + + S+ + + ++ S Sbjct: 83 ISTEPGNSELTFSDSNTAVTFSHDMQTFVSAASFTPPLGEYDIYVRVAVTDISLTQYLSG 142 Query: 155 DYRLQ 159 + + Sbjct: 143 LFGID 147 >gi|262203982|ref|YP_003275190.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262087329|gb|ACY23297.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 461 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 66/248 (26%), Gaps = 37/248 (14%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 L ++ + A ++P + +++D S SM A P Sbjct: 7 VLVLLSAVSMMLAACTTSDGTPTGAGGLSTAADKPTTPVVVILDGSESMQIADAPGP--- 63 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 ++ A +NA+ F+ + + GL+ Y P Sbjct: 64 ------------RIDAARNAVSTFISDLTSGTP------FGLVAYGNTESAKTTPQAVGC 105 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + R A + A + + R + + ++ Sbjct: 106 EDVSTLARLGPIDK-------EAARSAIDGVRAQGWTPLSAALTRAAEMLGTEA--GSVV 156 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 ++DGE N T +E + I T+ + + +L Sbjct: 157 LVSDGEANCLPDPCAT---ARSLREQNPNLTISTVGFKS--DAAQLQCVAREGGGVFVTA 211 Query: 393 VNADSLIH 400 N L Sbjct: 212 DNTAQLSA 219 >gi|198426777|ref|XP_002120162.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 367 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 59/173 (34%), Gaps = 28/173 (16%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 SI+ + + T+ T V D L L T + A+ + Sbjct: 178 SINEADPNNFNTVKNWVKNITKRFDI------TSSGSAAVAMDQIKLRLGATFTAAALSK 231 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + + + + +K ++ LTDG++N+ + + + Sbjct: 232 ATTVFKNSSRFN-------------DPLTKKVLVLLTDGQSNDREGLN---ASATQVRNL 275 Query: 361 FIKIVTISINASPNGQRLLKTCVSS----PEYHYNVVNADSLIHVFQNISQLM 409 I TI++ + + L+ + + Y + + +L + Q+I Q + Sbjct: 276 NIT--TIAVGVKADVLQELQIIANGVIGNNDRVYQLRDFSNLDSIVQSIFQEI 326 >gi|149184582|ref|ZP_01862900.1| TadE-like protein [Erythrobacter sp. SD-21] gi|148831902|gb|EDL50335.1| TadE-like protein [Erythrobacter sp. SD-21] Length = 153 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 11/83 (13%), Positives = 29/83 (34%), Gaps = 3/83 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA-NNAAI- 61 + R +E + I A+ + +L+ +++ +++A A Sbjct: 1 MIRMLRKLAAIRRNENGSVVIETAIVLPVLVLMALGGFETSRIVSRESELQAAIAEGAAV 60 Query: 62 -LAGASKMVSNLSRLGDRFESIS 83 LA + L + + E+ + Sbjct: 61 VLATFPEDQEELDTIEEIIEAST 83 >gi|284029672|ref|YP_003379603.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] gi|283808965|gb|ADB30804.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] Length = 144 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 14/132 (10%), Positives = 43/132 (32%), Gaps = 1/132 (0%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E+ +++ LL+I + + + +++ + A+LA A+ + + + Sbjct: 6 RRDERGQMTVLIIGFTAIVLLMIAVVTDISKVFLVRRDLDATADGAVLAAANGLAAIYGQ 65 Query: 75 LGDRFESISNH-AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + R L + + + +SA T + + S+ Sbjct: 66 TSAGSTAELDVEQARRLAAEYLDTVAASNRFDGLDWSAEVTGTTVTVELTSTVDLPFQPP 125 Query: 134 NNRLDSSNNTIF 145 + + + Sbjct: 126 GWQGSAGIASTA 137 >gi|194221225|ref|XP_001492399.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Equus caballus] Length = 883 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 75/226 (33%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K+ALL L+ + ++ Sbjct: 284 NVVFVIDVSGSM--------------------YGRKIQQTKDALLKILEDVKEDDYLNFI 323 Query: 252 VYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G + + + + E ++Q M+ T+ A+ + +L ++ Sbjct: 324 LFSGDVTTWKDNLVQATP-----ENIQQAREFVMNIHSQGMTNINDALLRGISMLNKARE 378 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 + II LTDG+ N +S I+ A + + Sbjct: 379 ENAVPE-----------RSTSIIIMLTDGDANVGESKPEKIQENVHNAIRGKFPLYNLGF 427 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 428 GNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 473 >gi|269126097|ref|YP_003299467.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268311055|gb|ACY97429.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 138 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 6/70 (8%), Positives = 19/70 (27%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 +++ ++ A + L+LI +I + + A + + Sbjct: 20 ARTDRGAVTVEMAFLLPIMLMLIFMIIDFGRAFNAQLRINEAVRQGVRTAVLTQPPADPK 79 Query: 75 LGDRFESISN 84 + Sbjct: 80 TAAADIMTTA 89 >gi|268574618|ref|XP_002642288.1| C. briggsae CBR-CUT-6 protein [Caenorhabditis briggsae] gi|187025290|emb|CAP35757.1| CBR-CUT-6 protein [Caenorhabditis briggsae AF16] Length = 357 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 53/176 (30%), Gaps = 22/176 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K L + L+ +E +M LI Y + + Sbjct: 62 DTFKKQLSFAMHLASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQ 121 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A++ +G++ K I +TDG++ + S Sbjct: 122 SGATNTGQALRLTL----------------EKGLQGARTGIPKVAIVITDGQSQDDVSEP 165 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + ++ + + I + N +L + +P + V + + L + Sbjct: 166 S-----QLLRDADVMVYAIGVTNLVNVHQLHQM-TGNPVRVFTVESFEQLDRALAD 215 >gi|260837284|ref|XP_002613635.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] gi|229299021|gb|EEN69644.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] Length = 2411 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 54/190 (28%), Gaps = 23/190 (12%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + +V+D+SGSM +++ ++ K A L LD++ Sbjct: 131 VSAASPKKKNVVIVIDVSGSMREPPGP-------------EEQNRLNLAKQAALTVLDTL 177 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + + + + T ++A+ Sbjct: 178 TPRDWGGVVSFSARAETPEGCLGDSLGEANPTNIGIMQDFINQRVPETITMYGVGFRKAF 237 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + IIFL+DG + ++ I + + + Sbjct: 238 DMFAEARNKKPEQ----------FEDCYNIIIFLSDGSPTDKAFALDEITKGQELMDRSV 287 Query: 363 KIVTISINAS 372 I T + A+ Sbjct: 288 YIFTYGLGAN 297 >gi|309792255|ref|ZP_07686727.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225796|gb|EFO79552.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 391 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 25/177 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ ++K A ++++ V ++ + V+ + + T++ + Sbjct: 36 KLQSMKAATRKVIETLTDQDVVS------IVIFDDTVQTLVPATLATDRT-ALLAAIDTI 88 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + M+ L K ++ LTDG+ + Sbjct: 89 SEAGGTAMSLGMQAGQVELQ----------------KHSGPDRLSHMLLLTDGQTWGDEE 132 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + ++I + + A N Q L S + +A+ + FQ Sbjct: 133 TC--RNIARALGQADVRITALGLGAEWNEQLLDDLAEFSDGTSDYIADANQIGTFFQ 187 >gi|302915967|ref|XP_003051794.1| predicted protein [Nectria haematococca mpVI 77-13-4] gi|256732733|gb|EEU46081.1| predicted protein [Nectria haematococca mpVI 77-13-4] Length = 1166 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/229 (12%), Positives = 70/229 (30%), Gaps = 21/229 (9%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 QR + VS + + S M +++ + V PI + Sbjct: 584 WQRSRPQRVSSVASSWGGARSVTTAPTEYTNFPRSPVMTNSIHVPIDVVVVVPISSSMQG 643 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K+ +++AL ++++ MGL+ + + T K + + Sbjct: 644 VKINLVRDALKFMVNTLGERD------RMGLVTFGSGGGGV-PIVGMTTKAWPGWSNILS 696 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S+ S A + + K + I+ ++D ++ Sbjct: 697 SIKPVGQKSHRA---------DVVEGANVAMDLLMQRKYNNPIAS--IMLISDASTSDAD 745 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 S + + AK I I + + + +++ + + V + Sbjct: 746 SVDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKD 791 >gi|184199785|ref|YP_001853992.1| hypothetical protein KRH_01390 [Kocuria rhizophila DC2201] gi|183580015|dbj|BAG28486.1| hypothetical protein [Kocuria rhizophila DC2201] Length = 455 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 59/214 (27%), Gaps = 31/214 (14%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D+SGSM D T+M K A + + + Sbjct: 259 AIDVSGSMGTKALPDGS-------------TRMDLTKAAATTAVGLFPEHDALGLWTFER 305 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 + V S+ + + D + Sbjct: 306 HLDGDKDYRSVTP-----------VRELSASVDGGTQRDQLSQDVQSLTFSPDGYTGLYD 354 Query: 316 NFFRQGVK---IPSLPFQKFIIFLTDGENNNFKSNVNTIKI----CDKAKENFIKIVTIS 368 + + + +I L+DG N++ S + ++ EN ++I+T+ Sbjct: 355 TTLAAYRQVLHDDAPGHLRTVIVLSDGMNHDPDSIALDELLSTLKAEQDAENPVRIITVG 414 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ + L + ++ + + VF Sbjct: 415 VSKDADATVLRQIAEATGGSSHVARTPQDIQKVF 448 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 8/129 (6%), Positives = 35/129 (27%), Gaps = 4/129 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK----MVSN 71 ++ + L+++ +L + + +K +++ + A +A + +N Sbjct: 7 KKQQGAIGLFGVLTLLMAVLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVGGCAQNN 66 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 S + A + + + + ++ Sbjct: 67 SSEAYKAAAQAAAAANGYQGNLLAAPNAVQLGGYHTDSDGIRTFAANNERSAVRVLATQE 126 Query: 132 MANNRLDSS 140 + ++ Sbjct: 127 VPSSLFAGG 135 >gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 [Mus musculus] gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 950 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 314 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTDDQFSVV 353 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 354 DFNHNV-RTWRNDLVSATKTQIADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 408 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + + II ++DG+ K + + ++ I + ++ I Sbjct: 409 ----------MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQD-NISLFSLGI 457 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 458 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 499 >gi|109090611|ref|XP_001091779.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 3 [Macaca mulatta] Length = 781 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + TD+ A+K Sbjct: 79 DALDISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + ++ +TDG++ + + KE + Sbjct: 139 NRGFPGGR---------------NASVPQILVIVTDGKSQGHVALP-----AKQLKEKGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|109090613|ref|XP_001091550.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 1 [Macaca mulatta] Length = 725 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + TD+ A+K Sbjct: 79 DALDISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + ++ +TDG++ + + KE + Sbjct: 139 NRGFPGGR---------------NASVPQILVIVTDGKSQGHVALP-----AKQLKEKGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTDDQFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQIADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus] Length = 950 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 314 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTDDQFSVV 353 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 354 DFNHNV-RTWRNDLVSATKTQIADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 408 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + + II ++DG+ K + + ++ I + ++ I Sbjct: 409 ----------MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQD-NISLFSLGI 457 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 458 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 499 >gi|3024068|sp|Q61703|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus] gi|122889675|emb|CAM13914.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRTDDQFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNV-RTWRNDLVSATKTQIADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + + II ++DG+ K + + ++ I + ++ I Sbjct: 405 ----------MGLLNPDSVSLIILVSDGDPTVGELKLSKIQKNVKQSIQD-NISLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|159899109|ref|YP_001545356.1| FHA domain-containing protein [Herpetosiphon aurantiacus ATCC 23779] gi|159892148|gb|ABX05228.1| FHA domain containing protein [Herpetosiphon aurantiacus ATCC 23779] Length = 785 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 76/249 (30%), Gaps = 50/249 (20%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P + + I LV+D SGSM+ + +K A Sbjct: 80 PPVQAVPQDPTTPIKIALVMDQSGSMNPF---------------------IEEVKRAANQ 118 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +D + + + + + S V+ Y+ ++ S T A Sbjct: 119 AIDQAPANAKIAVFTFTRMNS-VDVYLPAFDFSDDRNAVKDYINQNYRSEPGGETCLYTA 177 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI---- 353 QA L + K ++ II TDG++ + N + K Sbjct: 178 AHQATDFLLNTLK----------------PEERRAIILFTDGKDEDINGNQCSDKTVIDV 221 Query: 354 ---CDKAKENFIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I TI + S + R+ L+ + + V D + F I Sbjct: 222 TTKAGPTQGTKTPIYTIGL-CSADCARIQPESLRQISENTQAISLVGPRDQMSAAFITIM 280 Query: 407 QLMVHRKYS 415 + ++KY+ Sbjct: 281 DSIKNQKYT 289 >gi|304320962|ref|YP_003854605.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] gi|303299864|gb|ADM09463.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] Length = 439 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 17/210 (8%), Positives = 49/210 (23%), Gaps = 19/210 (9%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + ++ A ++ + + + + ++ L L ++ +E + A + Sbjct: 21 RHARRL-AHQRGSIATLLMILLVPSLGLSALIVDGSRMRNAYLELEIVAES---AALAAA 76 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS--- 125 ++ +K E A T + Sbjct: 77 QQLPDTDAATSAAVDYAETNLDPTVYGEVVKASDIEFGQYDEASSTFTSGGDATAVKVTA 136 Query: 126 -----------RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 + + ++ S I L HL + I Sbjct: 137 RREAERGNSLVTLFGAVIGRKEINLSATAIALLDYTEMDPVCILALGHHLYGVDMDILID 196 Query: 175 SFIPA-LLRIEMGERPIFLIELVVDLSGSM 203 IP +++ +++ + Sbjct: 197 VDIPDCGIQVNSDSHTALKSRYNAEVNAAY 226 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 63/188 (33%), Gaps = 30/188 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTR 282 ++ + A FL L V+ ++G++ + + + R + + Sbjct: 268 SSDRLNRMNQAAKYFL-----LQVVENGSWVGMVHFDSTASVKSQLIQIKSNSERNQLLQ 322 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ T ++ A+Q+ + I+ LTDGE Sbjct: 323 SLPTVAGGGTQICLGIRAAFQVFKQQNSEIDGSE----------------IVLLTDGE-- 364 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD--SLIH 400 + ++ E+ I I++ S + + + ++ ++Y NA LI Sbjct: 365 ----DSTASSCVNEVIESGTIIHFIALGPSADASVIQMSNLTGGSHYYASDNAQNNGLID 420 Query: 401 VFQNISQL 408 F ++ Sbjct: 421 AFGALTSE 428 >gi|307591429|ref|YP_003900228.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986283|gb|ADN18162.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 426 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 46/163 (28%), Gaps = 11/163 (6%) Query: 222 DKKRTKMAALKNALLLFLDSIDL--LSHVKEDVYMGLIGYTTRVEKNIEPSWG------T 273 + TK+ A+ F + + V G G + Sbjct: 123 SRGTTKIEGAIKAIREFTEIAKDRGGNTQVSIVPFGDPGKNCAGYPIDSNTLDNFSRVDD 182 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K++ ++ T+ + +A + L F+ P I Sbjct: 183 AKLQIFLDNLASLSPCASTNLYEPLSKAVRFLGKKNDSRFYPLDSSGNPIEPQPRLS--I 240 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENF-IKIVTISINASPNG 375 I L+DG +N + + K+N I + T+ + Sbjct: 241 ILLSDGYHNKPNEAQDFQSLNQLLKKNNQIIVHTLGYGLTAQQ 283 >gi|297265178|ref|XP_002799142.1| PREDICTED: collagen alpha-3(VI) chain [Macaca mulatta] Length = 2568 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/353 (7%), Positives = 92/353 (26%), Gaps = 39/353 (11%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +L ++ + ++ A+ + A I++ + + L ++ Sbjct: 892 VLDAIRRLRLRGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR 951 Query: 121 IVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + L + I + E ++I++ Sbjct: 952 FAQV----IRSSGIVSLGVGDRNIDRAELQTITNDPRLVFTVREFRELPNIEERIMTSFG 1007 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + + + + + L Sbjct: 1008 TSAATPAPPGVATPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLRFV 1055 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTP 296 + +D + + + +GL+ Y + RQ + + ++ Sbjct: 1056 SEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKVVYKGGRHANTKV 1114 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ +F + + +T G++ +V+ Sbjct: 1115 GLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSL-----A 1158 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1159 LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1209 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 17/196 (8%), Positives = 56/196 (28%), Gaps = 21/196 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L+ V +G++ ++ V + + Sbjct: 841 RPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAIRRL 899 Query: 287 LILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + ++ A++ + L F + + ++ + G++ + Sbjct: 900 RLRGGSPLNTGRALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKSQDD 948 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S + + I ++ + + L+T + P + V L ++ + Sbjct: 949 VSR-----FAQVIRSSGIV--SLGVGDRNIDRAELQTITNDPRLVFTVREFRELPNIEER 1001 Query: 405 ISQLMVHRKYSVILKG 420 I + G Sbjct: 1002 IMTSFGTSAATPAPPG 1017 >gi|109101588|ref|XP_001084624.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Macaca mulatta] Length = 2969 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/353 (7%), Positives = 92/353 (26%), Gaps = 39/353 (11%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +L ++ + ++ A+ + A I++ + + L ++ Sbjct: 1293 VLDAIRRLRLRGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR 1352 Query: 121 IVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + L + I + E ++I++ Sbjct: 1353 FAQV----IRSSGIVSLGVGDRNIDRAELQTITNDPRLVFTVREFRELPNIEERIMTSFG 1408 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + + + + + L Sbjct: 1409 TSAATPAPPGVATPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLRFV 1456 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTP 296 + +D + + + +GL+ Y + RQ + + ++ Sbjct: 1457 SEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKVVYKGGRHANTKV 1515 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ +F + + +T G++ +V+ Sbjct: 1516 GLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSL-----A 1559 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1560 LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1610 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFHSFGDLQE 207 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 17/196 (8%), Positives = 56/196 (28%), Gaps = 21/196 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L+ V +G++ ++ V + + Sbjct: 1242 RPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAIRRL 1300 Query: 287 LILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + ++ A++ + L F + + ++ + G++ + Sbjct: 1301 RLRGGSPLNTGRALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKSQDD 1349 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S + + I ++ + + L+T + P + V L ++ + Sbjct: 1350 VSR-----FAQVIRSSGIV--SLGVGDRNIDRAELQTITNDPRLVFTVREFRELPNIEER 1402 Query: 405 ISQLMVHRKYSVILKG 420 I + G Sbjct: 1403 IMTSFGTSAATPAPPG 1418 >gi|116625273|ref|YP_827429.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228435|gb|ABJ87144.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/273 (8%), Positives = 81/273 (29%), Gaps = 58/273 (21%) Query: 157 RLQFIEHLLNQRYN---QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + HL+ + + P +++ E + L++D SGSM Sbjct: 29 VVDKSGHLVTNLTQDAFTVMENGAPQSVKVFRREDVPVSMGLIIDNSGSMRDKRG----- 83 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWG 272 ++ A L + D + ++ + + Sbjct: 84 ----------------KVEAAALALVR------DSNPDDEVFIVNFNDEAFLDNPHGKDF 121 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T +++ T A++ + + + +K Sbjct: 122 TTNIKEMEEALTRIDSRGGTAMRDALRMSIDHVKEKAHKD-----------------KKV 164 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-------CVSS 385 ++ +TDG +N+ + + ++++ + I + + + + ++ Sbjct: 165 LVVVTDGNDNSSVVS--LENLVKASQQSEVLIYGVGLLGDEERREAQRAQRALKALAEAT 222 Query: 386 PEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + + + + V +++ + ++ Y++ Sbjct: 223 GGEVFFPKDVNEVDKVAHQVARDIRNQ-YTIEY 254 >gi|73992734|ref|XP_543096.2| PREDICTED: similar to Protein KIAA1510 precursor [Canis familiaris] Length = 1405 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 52/186 (27%), Gaps = 21/186 (11%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L ++ V +GL Y+ + + + K Sbjct: 220 SHFQQVKDFLASVIE---PFEIGPSKVQVGLTQYSGDPQTEWDLNALRTKEDVLAAVRRL 276 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + P K +I +TDG++ + Sbjct: 277 RYKGGNTFTGLALTH-------------VLEHNLRPAAGPRPEATKVLILVTDGKSQDD- 322 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K+ + I + + + + L +NV + L + + Sbjct: 323 ----ARAAGRILKDLGVAIFAVGVKNADEAELQLLASQPLDITVHNVQDFPQLGTLAGLL 378 Query: 406 SQLMVH 411 +L+ Sbjct: 379 GRLICQ 384 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 67/172 (38%), Gaps = 14/172 (8%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGL-----IGYTTRVEKNIEPSWGTEKVRQYV 280 T M K A +++++ + + + + + + ++ V Sbjct: 206 TNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQSTVASDQCFATTFADASPQNKKKV 265 Query: 281 TRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +D++ T+ PA+++A+ + S F +I + I+F++DG Sbjct: 266 EDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIKKIDPSEID-----RVILFMSDG 320 Query: 340 ENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 N+ S + + +I ++ N + I+T + + G +L+ ++ Y Sbjct: 321 IPNDPGSTILSAQIRANEQLNNSVIILTYGLGNADFG--VLRNMATNKGDVY 370 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 69/205 (33%), Gaps = 38/205 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTR 282 ++ L A LFL L V+ ++G++ + + E + R + + Sbjct: 32 TGNRLNRLNQAGQLFL-----LQTVELGSWVGMVTFDSAAHVQSELIQINSGSDRDTLAK 86 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T ++ A+ ++ I+ LTDGE+N Sbjct: 87 RLPAAASGGTSICSGLRSAFTVIRKKYPTDGSE-----------------IVLLTDGEDN 129 Query: 343 NFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA---DSL 398 TI C ++ K++ I T+++ Q L + + + + L Sbjct: 130 -------TISGCFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASDQVQNNGL 181 Query: 399 IHVFQNISQ---LMVHRKYSVILKG 420 I F +S + R + KG Sbjct: 182 IDAFGALSSGNGAVSQRSIQLESKG 206 >gi|66775067|gb|AAY56127.1| complement component Bf/C2-B [Ginglymostoma cirratum] Length = 757 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 61/195 (31%), Gaps = 30/195 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD----- 283 K A+ + ++ + V GL+ + +R + + V + Sbjct: 276 EDFKKAINALTTFVTMIKQFEVGVNYGLVMFGSRSCVEVNIAHDDVSDSDSVLQILPTLK 335 Query: 284 ---MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ T A+K ++++ K + + I+ TDG Sbjct: 336 YEDITRYSDAGTNMTGALKTIFEMMVLKKASMKDKQAEWR-------EVRHAIMIFTDGR 388 Query: 341 NNNFKSNVNTIKICDKAKEN-FIK--------IVTISINASPNGQRLLKTCV--SSPEYH 389 NN N + D+ + I+ + + + + ++ ++ Sbjct: 389 NNMG---GNPKPMMDRIRNFLDIEKVGEEFLDVYVFGLGNDVDTDEMNSVASKKNNEKHM 445 Query: 390 YNVVNADSLIHVFQN 404 + + + L VF + Sbjct: 446 FFIQ-MEDLTTVFNS 459 >gi|310114429|ref|XP_003119946.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 481 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 59/192 (30%), Gaps = 32/192 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-- 285 LK+ + I K +I E EK++++V +D Sbjct: 70 FDLLKDIQKEVVQDICSSEACKNS-KADIIFLIDGSESISP--KDFEKMKRFVESMVDIY 126 Query: 286 ------SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T + A+ F K Q+++I +TDG Sbjct: 127 RAIFDVQQMRDGTRTGKALNFTL--------------PFFDSSKGGRPSVQQYLIVITDG 172 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + N I ++ I I + +LL+ +++V N ++L Sbjct: 173 VAQD-----NVIIPAKALRDKNTIIFAIGVG-EAKKSQLLEITNDEDNVYHDV-NFEALQ 225 Query: 400 HVFQNISQLMVH 411 ++ + I + Sbjct: 226 NLEKEILSKVCD 237 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 9/90 (10%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + +++I +TDG KS+ + + + + N + I I I N L + Sbjct: 6 ADTGRINVARYLIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGI-REANIDELKEI 59 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + V D L + + + Q + Sbjct: 60 A---KDKIFFVYEFDLLKDIQKEVVQDICS 86 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 43/141 (30%), Gaps = 19/141 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I + ++ + +GL Y+ + + K + T + A+ Sbjct: 263 ISPFNIARDKISIGLSQYSGDPRTEWDLNKFASKDKVLEAVRNLRYKGGNTFTGLALTH- 321 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +L + K P K +I LTDG++ + K+ Sbjct: 322 --VLEQNLKLEAG----------PRPEADKIVILLTDGKSQD-----EANAAAQALKDLG 364 Query: 362 IKIVTISINASPNGQRLLKTC 382 I I I + + + L + Sbjct: 365 ISIFAIGV-KNADEAELRQVA 384 >gi|89053332|ref|YP_508783.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88862881|gb|ABD53758.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 686 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/252 (10%), Positives = 68/252 (26%), Gaps = 52/252 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V + +RP + ++D SGSM+ K Sbjct: 304 NPDTQLVHIGIQGDLPVVEDRPPLNLVFLIDTSGSMN-------------------DPAK 344 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L + L L+ + + ++ Y +EP+ ++ Sbjct: 345 LPLLIQSFRLMLNRLSPED------EVAIVTYAGSAGVALEPTAASDTATIN-AALTTLQ 397 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ +++AY++ ++ TDG+ N S+ Sbjct: 398 AGGSTNGVGGLEEAYRLAGEMMVDGEV----------------SRVLLATDGDFNVGLSD 441 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVN--------ADS 397 + ++ I + + ++ + + D Sbjct: 442 AGALEDYIAEQRDTGIYLSVLGFGRGNLQDDTMQALAQNGNGTASYIDTLHEAQRVLVDQ 501 Query: 398 LIHVFQNISQLM 409 L I+ + Sbjct: 502 LAGALYPIADDL 513 >gi|330465087|ref|YP_004402830.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] gi|328808058|gb|AEB42230.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] Length = 141 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 36/130 (27%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + SI A++++ L +IG E+ A AG + + Sbjct: 5 RGDAGRVSIFLAIALVGVLAIIGMAFDGAGQLRTLQRAENLAAEAARAGGQAIDLATAIE 64 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G + A+RA+ D + S ++ + +N Sbjct: 65 GGPKQINRRQARRAVADYLAAAGASGHTVSFPVVDGETQIRVRVSVTYRRAMLGLFGFSN 124 Query: 136 RLDSSNNTIF 145 + S Sbjct: 125 TVTVSGEATA 134 >gi|126340361|ref|XP_001365240.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain2 [Monodelphis domestica] Length = 951 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 311 NILFVIDVSGSMWGI--------------------KMKQTVEAMKTILDDLRAEDQFSVV 350 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + R + + ++Y+ + T+ A+ +A IL Sbjct: 351 DFNHNV-RNWRDDLVLASKAQITDAKKYIEKI---QPNGGTNINEALLRAIFILNEASN- 405 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ I + ++ I Sbjct: 406 ----------LGMLDPNSVSLIILVSDGDPTVGELKLSQIQKNVKQSMQD-NISLFSLGI 454 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 + L + + Y + L + +S Sbjct: 455 GFDVDYDFLERLSQENHGVAQRIYGNQDTSLQLKQFYNQVST 496 >gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 733 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 60/188 (31%), Gaps = 34/188 (18%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMD 285 + ++A+L LD++ +I + + K + ++Q Sbjct: 373 IQQARDAVLQALDTLTPRDRFN------VIEFNSHARKLFPQAVPAQERALQQARRFVRG 426 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ A+ +A + + + ++FLTDG Sbjct: 427 LKADGGTEIAEALDRALSDAAPEG-------------------YVRQVVFLTDGS--VGN 465 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +I + ++ ++ T+ I SPN + K + ++ + + I Sbjct: 466 ELALFKQIDQQLGDS--RLFTVGIGPSPNRFFMRKAAQFGRGAYSHINDTAEVSD---KI 520 Query: 406 SQLMVHRK 413 ++L + Sbjct: 521 AELTAALR 528 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/373 (8%), Positives = 86/373 (23%), Gaps = 91/373 (24%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K+ R + +K +G + +N+ ++ + + Sbjct: 135 FKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALNTKQVFSKVAKGKRGIALAYRDD 194 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + + + V Sbjct: 195 PIEGQANVAISFVFDASGSMEFSLDGTEKV------------------------------ 224 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 N K R+++ L+ + + + +V ++ + + Sbjct: 225 ----------NPYSNNPLKNRSRIDILREKTKKMMADLQPIGNVSVNLVQFNSHASFVQQ 274 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 IE G + + T+ ++ L S+ + + GV Sbjct: 275 NFIELDKGLTSINSAIDNLNPEHA---TNPGDGLRYGMVSLQSNAAQLKYVVLLTDGVPN 331 Query: 325 P-------SLPFQKFIIFLTDGENNNFKSNV----------------------------- 348 + K DG N N Sbjct: 332 SYMVGPQYNYWGNKIETRTRDGFVNQAYFNTWRYDLSTGFHSTRANSAFAGGNPSWAVDG 391 Query: 349 ---NTIKICDKAKENFI-KIVTISINA-SPNGQRLLKT-CVSSPE----YHYNVVNADSL 398 + ++ + K+ I ++ I +A + Q+ + + +V + L Sbjct: 392 SIEYSGEVAKQFKK-GIKRVNVIGFSAKEADKQQGSRLTAAIKEGVPETSYTDVSDDKQL 450 Query: 399 IHVFQNISQLMVH 411 F +I + + Sbjct: 451 EQTFADIKKQVEQ 463 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 91 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 150 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 151 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 193 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 194 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGH 251 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 252 SRDAEEVISQTI 263 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 59 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 118 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 119 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 161 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 162 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGH 219 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 220 SRDAEEVISQTI 231 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 65/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 59 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 118 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 119 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 161 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 162 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGH 219 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 220 SRDAEEVISQTI 231 >gi|325108015|ref|YP_004269083.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968283|gb|ADY59061.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 710 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 63/238 (26%), Gaps = 40/238 (16%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----- 268 + P A+KN + LFL +D L ++G++ Y + + Sbjct: 471 NHQTPDLWKTPHYPFHAMKNGMSLFLGFLDELE---FGDHVGIVTYDSSARIEKKVKNEH 527 Query: 269 -----------PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF---- 313 S V R T ++ ++L + + Sbjct: 528 NATLADLGEDWISNDYTAVDTIQRRKQAGHYDIYTGLGYGIETGIELLDNHARPGSRPTL 587 Query: 314 ----FTNFFRQGVKI---PSLPFQKFIIFLTDGENNNFKSNVNTIKI---CDKAKENFIK 363 N R + + + DG N + + +A + Sbjct: 588 LVMTDGNANRSPNGWSLPGDWDWDELTDYDGDGVANYRTWDRHKQYAFYQAKQAIDKGYT 647 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-------HVFQNISQLMVHRKY 414 + T+S+ A + + + NV ++ F I+ + K Sbjct: 648 VHTLSVGAGADRDLMRAIAFAGHGEWINVPGGTTIEEIEEQMLDAFARIAANVPPAKL 705 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 22/281 (7%), Positives = 59/281 (20%), Gaps = 42/281 (14%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-------------------------- 57 ++ A ++ + + + + K +++ Sbjct: 2 LLAAAGLVGMIGFVALSVDLGVISLTKTNLQKTADAAALAAAQEIIASVESASSEASSPG 61 Query: 58 --N-------AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + +A A A+K+ D ++ + D I SG Sbjct: 62 GPDDINSISISASKAMAAKVAEMNGVYIDPDRDVAFGRRVFNEDTQTFDIDWGDPPYTSG 121 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT------SYDYRLQFIE 162 S + S + + L F Sbjct: 122 ASFNVVQVTCRRDNPDSSKPDGVLELFFAPVNGTKTAAISASAIAFVEARDIVSILDFSG 181 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL-VVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + +F L M + L + + DL + + + Sbjct: 182 SMNYDSALRSVDTFGHDALADNMEDIYDALQPVDLGDLEFTPEWLTVEGEDPDSEGEAKI 241 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 ++ + + + G + Sbjct: 242 YVTFKHTEVFIDSTKDIQEVYLIFEDGGWQYFSGCGCSSGS 282 >gi|302796874|ref|XP_002980198.1| hypothetical protein SELMODRAFT_38056 [Selaginella moellendorffii] gi|300151814|gb|EFJ18458.1| hypothetical protein SELMODRAFT_38056 [Selaginella moellendorffii] Length = 55 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 21/57 (36%), Gaps = 4/57 (7%) Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + T + + + +L ++ V ++ H F +Q + SV+ + Sbjct: 1 PVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAF---AQCIGGL-LSVVAQ 53 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/298 (9%), Positives = 74/298 (24%), Gaps = 31/298 (10%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLN---QRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + + + + Q + + + L+ + Sbjct: 189 AVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNYNQNDRFALVPFSDGV 248 Query: 200 S--------GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + A D + AAL A F D Sbjct: 249 REASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTD----------P 298 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK-- 309 I + T + + T R+ T ++ Y++ + + Sbjct: 299 TRKKYIIFLTDGMPTVLNTVDTITYREVKQNFWGGYSYTGNRVTDSLSVTYELYSDGRTA 358 Query: 310 --KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENFI 362 + + + R+ + + +G + + S + + N I Sbjct: 359 GIRFTDNKGYSRRFYSDGQDYVNGWRVSWDNGYSFTYSSIEGKIRADATAVAKTLGMNNI 418 Query: 363 KIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + +I + L++ ++ +L +FQ SQL + ++ Sbjct: 419 TLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAITGTIR 476 >gi|293346541|ref|XP_001058131.2| PREDICTED: alpha 1 type VII collagen-like [Rattus norvegicus] gi|293358308|ref|XP_230973.5| PREDICTED: collagen, type XX, alpha 1 [Rattus norvegicus] Length = 1320 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 42/159 (26%), Gaps = 20/159 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + I + + V +GL Y+ + + + K + Sbjct: 190 HNNFQQVKDFLASIITHFAIGPDKVQVGLTQYSGDPQTEWDLNSFHTKEQVLAAVHHLHY 249 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ + K +I +TDG++ + Sbjct: 250 KGGNTFTGLALTH-------------VLGQNLKPAAGVRPEAAKVLILVTDGKSQDD--- 293 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ I + T+ + + LK S P Sbjct: 294 --VRTAARVLKDQDIDVFTVGVKNVDEAE--LKLLASQP 328 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 67/224 (29%), Gaps = 44/224 (19%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + IE I V+D SGS+ + + A Sbjct: 31 FVSTIERYSTTKNDIVFVLDESGSIGADVFP---------------------AELAFTEM 69 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPA 297 + + ++S + + V + S G + ++V T + A Sbjct: 70 VARLLVVSPEYSRLTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGTRTREA 129 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A +IL + + ++ ++DG+ N+ +I Sbjct: 130 LGYAGEILWN-----------------ARQEANRIVVLISDGQANSGS---EPSEIARLL 169 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I I + + + L SSP + Y + N + + V Sbjct: 170 RVKGIVIF--GVGVAHINKDELLDVASSPAHTYMLRNFEYIKKV 211 >gi|149033988|gb|EDL88771.1| similar to Protein KIAA1510 precursor (predicted) [Rattus norvegicus] Length = 1320 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 42/159 (26%), Gaps = 20/159 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + I + + V +GL Y+ + + + K + Sbjct: 190 HNNFQQVKDFLASIITHFAIGPDKVQVGLTQYSGDPQTEWDLNSFHTKEQVLAAVHHLHY 249 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ + K +I +TDG++ + Sbjct: 250 KGGNTFTGLALTH-------------VLGQNLKPAAGVRPEAAKVLILVTDGKSQDD--- 293 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ I + T+ + + LK S P Sbjct: 294 --VRTAARVLKDQDIDVFTVGVKNVDEAE--LKLLASQP 328 >gi|322435057|ref|YP_004217269.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162784|gb|ADW68489.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 393 Score = 49.1 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 55/167 (32%), Gaps = 3/167 (1%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + D Y+ +I Y R + + + V Q + T+ A+ + TS Sbjct: 173 RPDDYIAVITYDLRTHILTDFTNNKDTVAQSLNSLQIPGFSD-TNMFDALYETLDR-TSR 230 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + G S ++ N ++ T + + + +I I Sbjct: 231 IEGRKYIILIGSGRDTFSKLTLDKMLAKIKATPNVTIFSIGTGALAQELGDARGQIGGIG 290 Query: 369 -INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 +N +L + +++ + +L +F I+ + ++ Sbjct: 291 RMNILQAQNQLKTFATMTGGLYFDPMFQGALPDIFSQINDSIRNQYI 337 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 49.1 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 55/171 (32%), Gaps = 27/171 (15%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNI-------EPSWGTEKVRQYVTRDMDSLI--LK 290 D + + + +G++ Y+ + + + +Q +T+ + L Sbjct: 442 DIVRSFNLGVTEQDVGVVVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKILKKLAYEGG 501 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A K A ++LT +K II LTDG + Sbjct: 502 TTYTGYAFKLANEMLT---------------GNKSRPDAKKMIILLTDGATTAANTLQLK 546 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 ++ D ++ + I+ + + Q L + + V L V Sbjct: 547 EEL-DVSRAANVMILAVGVG--KFNQTELIQIAGDRKNFFAVTKFSELEKV 594 >gi|159039503|ref|YP_001538756.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157918338|gb|ABV99765.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 583 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 101/381 (26%), Gaps = 47/381 (12%) Query: 54 ESANNAAILAGASKMVSN--------LSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 ++A A+ A ++ + + + A +D + Sbjct: 216 QAATVGALRALSTNSANLRQELLAKFPTSPDSTSVARGLGAAALSEEDVLSYNARKPAVP 275 Query: 106 LSGYSAVFYNTEIQNIVNS-SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-QFI-E 162 L + I + ++ D + R Sbjct: 276 LVPLYLEPAAMPLDYPYAVLPGIEPAKASAAQMLFEVLATASFKDRLAPLSLRAPDGTWG 335 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS------------MHCAMNSD 210 A G+ + + VD + + ++ Sbjct: 336 AGFGAPQGAPSPEVGGASTEPGSGDAAGAVDPVAVDRAVASWSIATQSGRMLCVIDVSGS 395 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + + +++ + A + L LF DS + + +G R+ + S Sbjct: 396 MKGSVAGAGGASRQQVTLDAARRGLSLFDDSWQ-IGLWEFSTNLGSGRDYRRLVEIGPLS 454 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 ++ Q +T+ T + AY+ + + Sbjct: 455 NQRSRLEQALTQIQ--PTRGDTGLFDTVLAAYEAVQEEW----------------DPGQV 496 Query: 331 KFIIFLTDGENNNFKSNVNTIKICD--KAKE--NFIKIVTISINASPNGQRLLKTCVSSP 386 I+ TDG+N++ + + + K+ +++V I I A + L + Sbjct: 497 NSIVLFTDGKNDDDNGISQQQLLAELERIKDAERPVQVVLIGIGADVSKAELESITKVTG 556 Query: 387 EYHYNVVNADSLIHVF-QNIS 406 + + + +F + I+ Sbjct: 557 GGSFVTEDPTKIGDIFLKAIA 577 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/174 (10%), Positives = 41/174 (23%), Gaps = 35/174 (20%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + +G + +V+ + T Sbjct: 71 MWNDVSYYPDWSWSYGVGRQPVAYCPTEAKRLQNYHNNRSGFVSYLNGLVARGGTYHDIG 130 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSL------PFQKFIIFLTDGENNNFKSN---- 347 M + L++ T + +K++IF+TDG+ + ++ Sbjct: 131 MIWGARFLSTTGLFKSATPETNDVTDPDNPAKIRGFSVKKYMIFMTDGDMSPTWNDYSAY 190 Query: 348 -------------------------VNTIKICDKAKENFIKIVTISINASPNGQ 376 C+ AK I I I+ + + Sbjct: 191 GIEYLDGRVNGSPTTDNAALLARHLQRFRMACNAAKAKGIDIWVIAFSTTLTAD 244 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 28/294 (9%), Positives = 71/294 (24%), Gaps = 5/294 (1%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L + + + + + + FAL + L+ + Y ++ A A Sbjct: 241 LTADMTNLITRFLTATRGATVLEFALILPVLCALLAGGFELGYRAYVNAILQGALLEASR 300 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 S + ++ + I K+ + F Sbjct: 301 QATVGDRSGAQIDKTITDRMATLSGSISIQSIKKESFYNFSNVGKPEKLTFDRNGDGAYD 360 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-----IEHLLNQRYNQKIVSFI 177 ++ N D N+ D + Y LQ+ + L Q+I + Sbjct: 361 STQDCYEDANNNGAYDVKTNSGIGTADDIVRYTVSLQYPSIMPVGSLFGWGSQQEITTST 420 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + D + + +++ + + A ++ Sbjct: 421 VLRNQPFTSRAMPTIAATSADNAARVRDSISEADVNEVLLGGKIVGEPMDFAGSGRIVMS 480 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + S + + + +P +GT + Sbjct: 481 DVTSNGFSDGRRGQRIVWQRCTGALNVPDSQPQYGTSGKGESDATLQAMGAPDN 534 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 60/186 (32%), Gaps = 22/186 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + K I + ++ + +GL Y++ + S + + Sbjct: 255 SNFKLIKEFLSALISPFNIAQDKIRVGLSQYSSDPRTEWDLSAYATRDQVLEAVRNLRYK 314 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + L +K +I LTDG++ + + Sbjct: 315 GGNTFTGLALTH-------------VLEQNLKPDAGARLEAEKLVILLTDGKSQDDANL- 360 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 K I+I I + + + LK S P YNV++ L + ++ Sbjct: 361 ----AAQTLKNMGIEIFAIGVKNADEAE--LKQVASEPLELTVYNVLDFPLLSSLVGRLT 414 Query: 407 QLMVHR 412 +++ R Sbjct: 415 RVLCTR 420 >gi|332256817|ref|XP_003277513.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Nomascus leucogenys] Length = 3172 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/353 (7%), Positives = 93/353 (26%), Gaps = 39/353 (11%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +L ++ + +++ A+ + A I++ + + L ++ Sbjct: 1496 VLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR 1555 Query: 121 IVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + L + I + E ++I++ Sbjct: 1556 FAQV----IRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFG 1611 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + + + + + L Sbjct: 1612 PSAATPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLRFV 1659 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTP 296 + +D + + + +GL+ Y + RQ + + ++ Sbjct: 1660 SEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKVVYKGGRHANTKV 1718 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ +F + + +T G++ +V+ Sbjct: 1719 GLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQDVSL-----A 1762 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1763 LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1813 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 259 LDFLINLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 317 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 318 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 363 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 364 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 410 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 64/204 (31%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + + ++ + +D + + + +I ++ + Sbjct: 1226 GGKRDVVFLIDGSQRAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1282 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1283 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1332 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1333 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1384 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1385 VFSVSTFRELPSLEQKLLTPITTL 1408 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1284 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1335 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1336 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1390 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1391 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1441 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1442 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1500 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1501 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1549 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1550 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1602 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1603 EERIMNSFGPSAATPAPPG 1621 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/299 (8%), Positives = 71/299 (23%), Gaps = 33/299 (11%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLN---QRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + + + + Q + + + LI + Sbjct: 72 AVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGV 131 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + +++ A+ N + Sbjct: 132 REASVVPFG-----------KYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTD 180 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLI-----LKPTDSTP----AMKQAYQILTSDK- 309 TR + I + G V V + + ++ Y++ + Sbjct: 181 PTRKKYIIFLTDGMPTVLNTVDTITYREVKPKFRGGYQYTGNRVTGSLSVTYELYSDGWT 240 Query: 310 ---KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-----TIKICDKAKENF 361 + + + RQ + + +G + + S + N Sbjct: 241 AGIRFTDNKGYSRQFYSDGQDYVNGWWVSWDNGYSFTYSSIERKIRADATAVAKTLGMNN 300 Query: 362 IKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + +I + L++ ++ +L +FQ SQL + ++ Sbjct: 301 ITLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAITGTIR 359 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 59/217 (27%), Gaps = 50/217 (23%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + +LQF + V+ +P + RP Sbjct: 23 TGGKEVAVQAASSSLPSLQLQFSSSQSEYAKPPNGDAQGRLDVTLVPQ-GAVSGIIRPPI 81 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ + K+AL ++ + + Sbjct: 82 DVVFVMDVSGSMTAM--------------------KLQSAKSALQAAVNYFKSNYNQNDR 121 Query: 252 VYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + R + S ++ + T+ + A+ A T Sbjct: 122 FALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFT-- 179 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 +K+IIFLTDG Sbjct: 180 -----------------DPTRKKYIIFLTDGMPTVLN 199 >gi|296225412|ref|XP_002758286.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Callithrix jacchus] Length = 860 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 71/226 (31%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 SVAFVIDVSGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ + +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEAKMFVKSIDD---RGMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIRGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH----YNVVNAD-SLIHVFQNISQLMVH 411 + N L+ + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYN-FLENMALENQGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|237716506|ref|ZP_04546987.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408104|ref|ZP_06084652.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645098|ref|ZP_06722824.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809499|ref|ZP_06768202.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|229444153|gb|EEO49944.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354912|gb|EEZ04004.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639604|gb|EFF57896.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443317|gb|EFG12081.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 342 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 55/162 (33%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESINPSLISKQGTAIGEAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II +TDGEN+ + +K I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPD 229 >gi|87199536|ref|YP_496793.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] gi|87135217|gb|ABD25959.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] Length = 520 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/189 (8%), Positives = 58/189 (30%), Gaps = 5/189 (2%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + G+ E ++ +AL++ + ++G V + +++A + A LA +++ Sbjct: 5 LIRSGLKDETGAVAVTYALALTGLIAMVGIGYDVSQVLTLDSELQNAADQAALAAVTQLD 64 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + G + + + + +G++ + + N Sbjct: 65 R---QSGAIDRAKNAANSLIANNTLLAKDAGVVTIKEAGFAFYATRADAEAGAN-PTTDN 120 Query: 130 THMANNRLDSSNNTIFYNMDV-MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 +++ + + Y + ++ + + +P + Sbjct: 121 ARARFAKVEVNGRSAQYALTPIGSAVASWVSPTLSAAAVAGIGSAICKVPPMKICAPSND 180 Query: 189 PIFLIELVV 197 F + V Sbjct: 181 LSFNQDYVG 189 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/299 (8%), Positives = 72/299 (24%), Gaps = 33/299 (11%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLN---QRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + + + + Q + + + LI + Sbjct: 189 AVSGIIRPPIDVVFVMDVSGSMTAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGV 248 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 + +++ A+ N + Sbjct: 249 REASVVPFG-----------KYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTD 297 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDM---------DSLILKPTDSTPAMKQAYQILTSDK- 309 TR + I + G V V T ++ Y++ + + Sbjct: 298 PTRKKYIIFLTDGMPTVLNAVDTITYREVKQNFWGGYSYTGNRVTDSLSVTYELYSDGRT 357 Query: 310 ---KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-----NVNTIKICDKAKENF 361 + + + R+ + + +G + + S + + N Sbjct: 358 AGIRFTDNKGYSRRFYSDGQDYVNGWRVSWDNGYSFTYSSIEGKIRADATAVAKTLGMNN 417 Query: 362 IKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 I + +I + L++ ++ +L +FQ SQL + ++ Sbjct: 418 ITLYSIGFGDNDEVDMDYLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAITGTIR 476 >gi|152993598|ref|YP_001359319.1| hypothetical protein SUN_2020 [Sulfurovum sp. NBC37-1] gi|151425459|dbj|BAF72962.1| hypothetical protein [Sulfurovum sp. NBC37-1] Length = 940 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 66/202 (32%), Gaps = 28/202 (13%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + + Q TK+ ++AL + + + G +E I Sbjct: 31 FDASGSMWGQINGVTKIEIARDALKNVVREWNPNVELGLTVYGHRSKGDCNDIEVVIPI- 89 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 G ++ + M T + ++++A L ++++ Sbjct: 90 -GKVDKKRVIDTVMKIKPKGKTPISRSLRKAAGELKYTEEKA------------------ 130 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI--VTISINASPNGQRLLK-TCVSSPE 387 II ++DG+ + + + K+ I + N + L+ ++ Sbjct: 131 -TIILISDGKETC---DPDPCATAKELKKEGIDFVAHVVGFNVDKKTDKQLECIANATGG 186 Query: 388 YHYNVVNADSLIHVFQNISQLM 409 +++ NA +L + I + + Sbjct: 187 EYFSAKNAAALNKAMKTIVKKV 208 >gi|320106178|ref|YP_004181768.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924699|gb|ADV81774.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 365 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/190 (10%), Positives = 54/190 (28%), Gaps = 33/190 (17%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 S+D + L+ ++ + + + +K+ T A+ Sbjct: 140 ASLDFVRLSNPKDEAFLVDFSDEAFIDQDFTSDVKKLE---DGLGYVKASGGTAIYDAVV 196 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + L + ++ ++ +TDG++N S + +E Sbjct: 197 ASADYL-----------------AKNAKLPKQVLLIVTDGDDNASGST--LEDAIRRVQE 237 Query: 360 -NFIKIVTIS--INASPNGQR-------LLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + I + N L + Y + + ++Q + Sbjct: 238 LDGPVIYCVGLLFGPDSNKSESRHARRVLETLAAQTGGLAYFPRKLSEVDSIATEVAQDI 297 Query: 410 VHRKYSVILK 419 ++Y++ + Sbjct: 298 -RQQYTISYR 306 >gi|2645090|dbj|BAA23627.1| collagen-like protein [Hydra vulgaris] Length = 354 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 53/167 (31%), Gaps = 20/167 (11%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQA 301 + +G+I + + E I+ S + ++ + + T A+++A Sbjct: 29 NFDVDNQLTRIGIIRFDSDAEIIIQLS-DHKTLKDLLNDIDSIRYNEGIQTRIDKALERA 87 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + K ++ L DG+N+ + + + + + Sbjct: 88 MEAFSEKNGGRADA--------------TKALVLLADGQNSFIEGSQDLNEELKPLIDAK 133 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI-HVFQNISQ 407 + I I + + L + + D L V + IS+ Sbjct: 134 VFRYVIGIGRELDLKELEDIA--TNNIAIYADSFDELKFKVDEQISR 178 >gi|227819050|ref|YP_002823021.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] gi|227338049|gb|ACP22268.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] Length = 155 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/129 (10%), Positives = 38/129 (29%), Gaps = 7/129 (5%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + E ++I A+++ + +G W+ K ++ A + R Sbjct: 21 LKDESGTVAVIAAVTLPVLVGAMGLGAETGYWYLKDRKLQH-------AADVSAHAAAVR 73 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + I A + + S + + ++ R+ + ++ Sbjct: 74 YRAGDQKPALETTAKRIAMASGYSPGSLSVSTGAGAGGSHKVTVELSETHPRLFSSIFSS 133 Query: 135 NRLDSSNNT 143 + S Sbjct: 134 EPVTMSARA 142 >gi|326911040|ref|XP_003201870.1| PREDICTED: collagen alpha-1(XIV) chain-like [Meleagris gallopavo] Length = 438 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 54/183 (29%), Gaps = 21/183 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 LK+ + S + + K + + + Y + IE + V S+ Sbjct: 70 QLLKDFISSMARSFENVVMGKGGIRLAVALYGEKPRMCIELTDYVTIEEMLVAIQEISIK 129 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+ A ++ K ++ +T G KS+ Sbjct: 130 GSSLKVGSALAFAAHAMS--------------HPDTLREDAAKVVVLITSG-----KSSD 170 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQ 407 ++ + + + I + L K + E+ V + L + Q +S+ Sbjct: 171 LVEDKAQVLQDAGVTVFAVGI-KDADKHELNKIASEPTAEHVIYVDDFHLLHNAAQKLSR 229 Query: 408 LMV 410 + Sbjct: 230 RLC 232 >gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Macaca mulatta] Length = 946 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNQNV-RTWRNDLISATKTQVSDAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|17545367|ref|NP_518769.1| transmembrane protein [Ralstonia solanacearum GMI1000] gi|17427659|emb|CAD14178.1| putative transmembrane protein [Ralstonia solanacearum GMI1000] Length = 347 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/296 (9%), Positives = 66/296 (22%), Gaps = 29/296 (9%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +I +++ + + V +N +++A A+ + Sbjct: 11 ARRQRGAVGLIMPAMLIAIFSVGALAVDVARLIVVRNELQNAA-----DAAALAGAAGLY 65 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + SN + N S + + + T + S I+ Sbjct: 66 PTNPTPNWSNGVAQGTSAVKLNKSSNVQLVSGTVQAGYWNLTGTPAGLQSQSITPGANDV 125 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + N N + F N V P P L Sbjct: 126 PAVQVTINRSAGNNGGPVATLLAPIFGALSANSSATAVAVIAAPGSAG------PGALFP 179 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L + + + + D K ++ I+ + Sbjct: 180 LAISK------CLYDLYWNYATGQPKIDPSTGK---------PYVFQINTSYPSSSNCTS 224 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G + + + T +K + + Sbjct: 225 GEWTGFNGPTDASTLKGLVQNGNTSTLSIGNMI---NTALASGVKSSVYMAIPSTP 277 >gi|319956804|ref|YP_004168067.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319419208|gb|ADV46318.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 398 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 58/202 (28%), Gaps = 26/202 (12%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D + + Q K+ + A+ L + + VY + Sbjct: 27 FDASGSMWGQVDGERKIDIARRAMRDLLRDWNPQIPLGLTVYGHRRK-GDCNDIETPIPV 85 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 G ++ + T A+K+ + Sbjct: 86 GPLDRQRMIRAVEGIRPKGKTPIARALKRVAAQFRGSED-------------------PA 126 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKI--VTISINASPNGQRLLKTCV-SSPEY 388 II ++DG+ + + + + +++ I+ I + R L ++ Sbjct: 127 TIILISDGKESC---DADPCATARELRKSGIRFVAHVIGFHVDRTTDRQLACIARATGGS 183 Query: 389 HYNVVNADSLIHVFQNISQLMV 410 +++ NA +L +++ + Sbjct: 184 YFSARNAAALNRAITQVAKKIT 205 >gi|116625580|ref|YP_827736.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] gi|116228742|gb|ABJ87451.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] Length = 376 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 81/294 (27%), Gaps = 30/294 (10%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA------AILAGASKMV 69 S + + ALS ++ L ++G + V KN ++ ++ A L G + + Sbjct: 6 KSRRGFVLVTMALSTVAVLGVVGLAVDVGRIFIAKNEVQVYCDSAAVAAAAALDGTTYGI 65 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS--GYSAVFYNTEIQNIVNSSRI 127 + + + N A++ F + L+ +A + + V Sbjct: 66 AAAAGAAAASPNKWNFGTTAILSPLVTFAQAPGGPWLANPNSAAGYTYAHVAASVTVPLY 125 Query: 128 SMTHM------------ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN--QKI 173 + + A ++ + S + + Y Sbjct: 126 FLPLVVGQSTSAVTAAAAAGQISLAALPRGLAPYTAVSTNTAGPTFGFVAGNSYTIHWPT 185 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + R++ ++ D + S+ + + + + +AA Sbjct: 186 YNANRSHCRVDNPDKCFNSPPCAGDSAASLSAVVANWGSKYHGY--WGSNSNSDIAAA-- 241 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +D+I L D L+ + + + + Sbjct: 242 ----VMDTIQLTPLAVGDNVDPLLTPGNKQSEAGYLDQRASQDTDTTDNTVAGY 291 >gi|307107471|gb|EFN55714.1| hypothetical protein CHLNCDRAFT_134017 [Chlorella variabilis] Length = 611 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 15/137 (10%), Positives = 42/137 (30%), Gaps = 23/137 (16%) Query: 251 DVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +I ++ + + T+ A+++A Q+L Sbjct: 474 RSKVAVIQFSNDIRVEQGPVDVDVGAFEALMAGMARM--NGGTNIALAVQKAGQLLKPLS 531 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE--NFIKIVTI 367 + ++ LTDG ++ +S + + + +++ Sbjct: 532 AT-----------------AHRVLVLLTDGRIDSHQSR-EARDMAARLGDEQANVRVHAY 573 Query: 368 SINASPNGQRLLKTCVS 384 + + Q LL+ C + Sbjct: 574 GVGRGVDKQELLRICAA 590 >gi|185132132|ref|NP_001118067.1| complement factor B [Oncorhynchus mykiss] gi|3982897|gb|AAC83699.1| complement factor Bf-1 [Oncorhynchus mykiss gairdneri] Length = 743 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 58/192 (30%), Gaps = 17/192 (8%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDS 286 + +NA+ + + S + V I+ S K + + + Sbjct: 248 FNSARNAVKKLITKVSSFSVSPNYEIIFFASDVLEVVNIIDFSGDKRKPLVDVLAELNNF 307 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + A++ + ++ ++ + +IF TDG N S Sbjct: 308 KYDARDNVGTNLNLAFKTILERM------AIQKKRNEMLFMEIHHVLIFFTDGAYNMGGS 361 Query: 347 NVNTIKIC--------DKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 NT+ +E ++ + + + + ++ +++ + N Sbjct: 362 PENTMAKIRESVYMNNKTKREKYLDVYVFGVGSDIFDEDIMPLVTKRNGERHYFKLKNVI 421 Query: 397 SLIHVFQNISQL 408 L F +I Sbjct: 422 DLERTFDDIIDE 433 >gi|17229274|ref|NP_485822.1| hypothetical protein all1782 [Nostoc sp. PCC 7120] gi|17130872|dbj|BAB73481.1| all1782 [Nostoc sp. PCC 7120] Length = 615 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 43/159 (27%), Gaps = 18/159 (11%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + ++ Y V+ + P T+K T+ + + + + Sbjct: 68 DQLEPKDILSVVVYDDAVDTVVPPQPVTDK-PALKKSIRQVRAGGITNLSGGWLKGCEYV 126 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKI 364 ++ LTDG N + + E I Sbjct: 127 KHQL----------------DPQKINRVLLLTDGHANMGIQDPKILTATSTQKAEEGITT 170 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 T+ N L+ ++ Y + + D VF Sbjct: 171 TTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 63/180 (35%), Gaps = 28/180 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ + D + +G+I Y+ +VEK + + K + D Sbjct: 643 FEIIKSLVKTLSDQVA---LDLAAARVGIINYSHKVEKVAHLTQFSTKDDFKLAVDRMQY 699 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + T + A+ +A + + +K + +TDG+ + + Sbjct: 700 LGEGTYTASALHEANHMFEA-----------------ARPGVKKVALVITDGQ-TDTRDE 741 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPE--YHYNVVNADSLIH 400 N ++ KA + ++I I + + + + + P+ + Y + +L Sbjct: 742 KNLTEVVKKASDINVEIFVIGVVKKNDPNFEVFHKEMNLIATDPDSEHVYQFDDFITLQE 801 >gi|262196282|ref|YP_003267491.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079629|gb|ACY15598.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 340 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/262 (9%), Positives = 48/262 (18%), Gaps = 28/262 (10%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPA------LLRIEMGERPIFLIELVVDLSGSMHCA 206 + L + + + G A Sbjct: 79 DIVFVLDVSGSMEEPASGRIAELSVLPPAASHPDGPPPPPPGSGPGAPPSGAEPGPPADA 138 Query: 207 MNS-DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 DP + A + SI Sbjct: 139 YGPGDPCAAPDYQPEPPYEGDPCAPFEPYGPGDPRSIGGYDERPAGELGPEYPADEPAAP 198 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDST--------PAMKQAYQILTSDKK------- 310 + + V T AM L + Sbjct: 199 LVPRKIDVARAE-LVHAIERLEP--GTRMNVLFFNNRLEAMAATIVGLDEPGRDALVHFV 255 Query: 311 --RSFFTNFFRQGVKIPS-LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + ++ L+DG N ++ +A ++I TI Sbjct: 256 LSTEPYGPTALAPALRTALSMNPRRLVLLSDGLGNVGGDASAVLRDAREAMLGGVRIDTI 315 Query: 368 SINASPNGQRLLKTCVSSPEYH 389 I + + + L S + Sbjct: 316 GIGQAQDDRLLRALAEESGGLY 337 >gi|48477361|ref|YP_023067.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] gi|48430009|gb|AAT42874.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] Length = 379 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 55/189 (29%), Gaps = 35/189 (18%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 ++ K+ K + ++ I D + LI + + +E ++ + Sbjct: 51 RKGKLDLAKEGAMNLIEKI------PRDNIVSLIAFGDTAKVIVE----GKEPTFALEAI 100 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T A+ A ++ II LTDG + Sbjct: 101 PSLKVAGNTAMYTALLTATKL-------------------ADKYNMPGRIILLTDGMPTD 141 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVF 402 N + + I I I LLK Y++ N + L V Sbjct: 142 VSMNESYENLQVP---EGFTIDCIGIG-DNYRDDLLKLLADKGNSIFYHLENPEELPKVM 197 Query: 403 QN-ISQLMV 410 ++ +S + Sbjct: 198 ESTVSSDIS 206 >gi|74011920|ref|XP_548489.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Canis familiaris] Length = 897 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 74/226 (32%), Gaps = 42/226 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSMH KM K+ALL L + ++ Sbjct: 290 NVVFVIDVSGSMH--------------------GRKMEQTKDALLKILGDMKGEDYLNFI 329 Query: 252 VYMG-LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 ++ G +I + + + E+ R +V D T+ + + +L ++ Sbjct: 330 LFSGDVITWKDDLVQATP--ENIEEARIFVKNIHD---RGLTNINDGLLRGISMLNRARE 384 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 II LTDG+ N +S I+ A + + Sbjct: 385 EHRVPE-----------RSTSIIIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGF 433 Query: 370 NASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQLMVH 411 + N L + + Y +A+ L ++ ++ ++ Sbjct: 434 GNNLNYNFLESMALENHGLARRIYEDSDANLQLQGFYEEVANPLLT 479 >gi|160882769|ref|ZP_02063772.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|237720675|ref|ZP_04551156.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|260170238|ref|ZP_05756650.1| hypothetical protein BacD2_00060 [Bacteroides sp. D2] gi|293373991|ref|ZP_06620332.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145609|ref|ZP_07038677.1| BatB protein [Bacteroides sp. 3_1_23] gi|315918601|ref|ZP_07914841.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111793|gb|EDO13538.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|229449510|gb|EEO55301.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292631067|gb|EFF49704.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516100|gb|EFI39981.1| BatB protein [Bacteroides sp. 3_1_23] gi|313692476|gb|EFS29311.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 342 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 55/162 (33%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESISPSLISKQGTAIGEAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II +TDGEN+ + +K I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPD 229 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 51/160 (31%), Gaps = 29/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D ++ K + +D +GL+ + + + + Sbjct: 134 DVSRSMLAEDVSPNRLGRAKQQIKDMVDE-------MPGDRVGLVVFAGETRQTLPLTRH 186 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E +Q + + S+ + A++ A K Sbjct: 187 VEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDK------------------TTDHK 228 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ LTDGE+ + + E I+I TI + Sbjct: 229 AMVILTDGEDQESDPVSEAKRAYE---EQGIRIFTIGLGD 265 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 54/192 (28%), Gaps = 33/192 (17%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 RTK A K AL F K+ + + + + G E +++ Sbjct: 96 CSGNRTKSEAAKTALAQFARITP-----KDANMGLAVFDAYGIAERVPL--GLENRDKFI 148 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T A+ Y+ L +R + ++ +TDG+ Sbjct: 149 AAVNATAPGNGTPLHDALLLGYRRLEETARRQAGYGEY-------------HLVVITDGQ 195 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEY--HYNVVNAD 396 + + + +++ + I TI + + P + N Sbjct: 196 AYPQNQDPTPV-VAYILRQSPVVIHTIGFCIGTDHSLNQ--------PGRTVYRAADNPR 246 Query: 397 SLIHVFQNISQL 408 L + + Sbjct: 247 ELQQGLEEVLAE 258 >gi|320104266|ref|YP_004179857.1| hypothetical protein Isop_2740 [Isosphaera pallida ATCC 43644] gi|319751548|gb|ADV63308.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 1239 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 48/167 (28%), Gaps = 28/167 (16%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 Y G++ +T R + + + + P M A L Sbjct: 469 SNYDYAGVLFWTGRDQWLFPLITVGPNRNRMLALIDQMIPGDMPAFGPGMTVASNSLLQK 528 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI- 367 + K +I ++DG+ + ++ I + T+ Sbjct: 529 -----------------TDAITKHMIIISDGD-----PAPPPPGLINQLVRGKITVTTVL 566 Query: 368 ----SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 N + + ++ YNV N +L ++Q +L+ Sbjct: 567 TAAHG-NDPGSFNTMQSIAQATKGRFYNVTNPRALPRIYQKEIRLIS 612 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 48/151 (31%), Gaps = 29/151 (19%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++A+L ++ ED+ G+I + PS + + + + Sbjct: 94 RSAILDVVNRSQERQRRPEDL-SGVIVFGKNARVETPPSLNP----RRLQGLENPPDPQY 148 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD +K A + + I+ ++DG N + Sbjct: 149 TDIAAGLKLALAVF--------------------PKDTARRIVLISDGNANRGSV----L 184 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTC 382 + AK I I T+ ++ + L++ Sbjct: 185 EQAAAAKRQGIPIDTLVVDYKYDDDVLVEKI 215 >gi|157273368|gb|ABV27267.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 324 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/203 (9%), Positives = 58/203 (28%), Gaps = 42/203 (20%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + A + A+ + L+ + + E E + + + V + Sbjct: 111 RRAKVIEAVKFLARAAKPGD------EFFLVDFKNKAELAEEFTPRPADIEEAVDNIVW- 163 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A++ + + +K I+ +DG++ + Sbjct: 164 --GGGTALLDAIQLSAEY-----------------ADKEGKNRRKAIVVFSDGDDRDSYY 204 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-------------TCVSSPEYHYNVV 393 + ++ +E +++ + + L + + Sbjct: 205 DR--RQLIKLLQEYQVQVYIVGFPDDDDDGGLFGRSTRKRAVQLIKDIANETGGRAFFPK 262 Query: 394 NADSLIHVFQNISQLMVHRKYSV 416 + D L + + I+ + + YS+ Sbjct: 263 SVDELPEIVRTINADLRTQ-YSI 284 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/140 (9%), Positives = 36/140 (25%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 S++ AL + + + I V ++ +++ + A ++ A ++ S Sbjct: 17 GAISVMAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQLKRADSAANLNA 76 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + + I ++ G + S Sbjct: 77 AVLGTVRNIGAKNGYPSGIAMGCGDATGGGADAMTACLGVWDPASGGPKHFSATYEATKV 136 Query: 140 SNNTIFYNMDVMTSYDYRLQ 159 S N + + + Sbjct: 137 SPNAVRVQATQTVPILFVID 156 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 56/190 (29%), Gaps = 30/190 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLS----HVKEDVYMGLIGYTTRVEKNIEPSWG---TEKVR 277 ++ LK+ + + +++ + +I ++ + I + G + Sbjct: 353 GQRITLLKDTMSDLISGGATTDLANVSLRDREKVSIIPFSFGPHEVISETLGAVGSPSRT 412 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 R T A+ AY + + I+ +T Sbjct: 413 DLQQRVEALQADGGTGIYDAVLAAY-------------------AESAGGDYIPSIVLMT 453 Query: 338 DGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DGE ++ + + I + I N + + ++ ++ +N Sbjct: 454 DGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYG-EANVADMEQLAATTGGETFDAING 512 Query: 396 DSLIHVFQNI 405 D L F+ I Sbjct: 513 D-LDEAFKEI 521 >gi|253996767|ref|YP_003048831.1| hypothetical protein Mmol_1398 [Methylotenera mobilis JLW8] gi|253983446|gb|ACT48304.1| conserved hypothetical protein [Methylotenera mobilis JLW8] Length = 463 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 12/160 (7%), Positives = 50/160 (31%), Gaps = 19/160 (11%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS- 73 + ++ ++I A+ ++ + ++G ++ + + K +++A +++ L+GA ++ ++ Sbjct: 12 MQRQRGAVALIVAICLVLLVGMLGLVLDLGHLYVTKTELQNAADSSSLSGARELNGKVTG 71 Query: 74 -------RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN------ 120 + ++ N A+ +A T+ Sbjct: 72 INSAITRAIEAAGKNNFNLNSTAVTVNASNMWVGSCPSDGCMVPISSVTTDALAGDKTFL 131 Query: 121 -----IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + + + + + + Sbjct: 132 KVDTGLRTINTWFIQVLPGVANTTQTFGMAVAGKYAVDIT 171 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 58/188 (30%), Gaps = 34/188 (18%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR------VEKNIEPSWGTEKVRQYVT 281 + K+ L + E + L+ ++++ + P K + Sbjct: 90 ASNFKSELRFVRKMLSDFPVAPEATRVALVTFSSKSHVVTRADYVSAPKAHQHKCSLFSK 149 Query: 282 RD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T + A ++A QIL K I +TDG Sbjct: 150 EIPSITYRGGGTYTRGAFQRAAQILRQS-----------------RENATKVIFLITDGY 192 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLI 399 +N + +E ++I T+ I N + L + + H + V N Sbjct: 193 SNGGD----PRPVAAALRERGVEIFTLGI-WQGNIRELHEMASQPKDQHCFFVHNFAE-- 245 Query: 400 HVFQNISQ 407 F+ +++ Sbjct: 246 --FEALAR 251 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 45/138 (32%), Gaps = 6/138 (4%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLG 76 + +F+++ A+ ++ + ++G + + + + + ++ + A LAGA +M Sbjct: 1 RGSFALVAAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAA 59 Query: 77 DRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + SN A + + + Y V +++ Sbjct: 60 AAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFL 119 Query: 134 NNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 120 GPSRTVSATSTAKATNVD 137 >gi|299532595|ref|ZP_07045984.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] gi|298719398|gb|EFI60366.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] Length = 412 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 39/145 (26%), Gaps = 5/145 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ I AL+++ L +GF + K +++A ++ LA A ++ Sbjct: 8 LRRQRGAIIITVALALLFLLGFMGFALDFGRLFVVKTELQTATDSCALAAAQELDGASDA 67 Query: 75 -----LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 SN ++L+G + + Sbjct: 68 RTRAVNAGLAAGNSNRVHFQGASAGIAVSDISFSDALNGSYSTTFPVTSAKYARCEHART 127 Query: 130 THMANNRLDSSNNTIFYNMDVMTSY 154 + + + S Sbjct: 128 GLLPWLLQALGAFSGNASYGANQSV 152 >gi|126730248|ref|ZP_01746059.1| hypothetical protein SSE37_10849 [Sagittula stellata E-37] gi|126708981|gb|EBA08036.1| hypothetical protein SSE37_10849 [Sagittula stellata E-37] Length = 183 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 9/56 (16%), Positives = 23/56 (41%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 +SR + + E N ++ FA F L++ + + + + +E + + Sbjct: 7 ISRLGRFAGRFCREENGNSTLEFAFYFSIFFLVLAAGVEIAYMNLRHAMLERSVDR 62 >gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] Length = 946 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVV 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNHNI-RTWRNDLVSATKTQIADAKRYIEKI---QPNGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + ++ + + ++ I Sbjct: 405 ----------MGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKQNIQD-NVSLFSLGI 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENRGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|261416578|ref|YP_003250261.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261373034|gb|ACX75779.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325972|gb|ADL25173.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 236 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 64/191 (33%), Gaps = 19/191 (9%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNI---EPSWG 272 + K+ +LK+AL + S S + ++Y+ +I + + I + + Sbjct: 23 VSGSMNEIGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQAANIILEPQSASE 82 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + T A+ +L + + PS ++ F Sbjct: 83 IANDPSKMNVINKMQAIGNTPLGKALTSLVDLLENREI-------------YPSRAYRPF 129 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 I+ +DG N+ ++ + + + ++I A + L K + + Sbjct: 130 IVLASDGMPNDLWQ-QPLDRLLNSERSKKANRLALAIGADADESMLKKFVNNEEMPIFKA 188 Query: 393 VNADSLIHVFQ 403 NA + F+ Sbjct: 189 NNAIEIQKFFK 199 >gi|269836152|ref|YP_003318380.1| hypothetical protein Sthe_0119 [Sphaerobacter thermophilus DSM 20745] gi|269785415|gb|ACZ37558.1| hypothetical protein Sthe_0119 [Sphaerobacter thermophilus DSM 20745] Length = 598 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/219 (5%), Positives = 43/219 (19%), Gaps = 18/219 (8%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + +IFA + + + + + V ++ +++ A+ + Sbjct: 11 RGQILVIFAAATVLLIAFLAAAVDVGYLLSQRRGVQN---------AADAAALAVAHAVH 61 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + + ++ + N + + + Sbjct: 62 SGVTNTATLEQIAAYYVEENGYDATPEITFSPDMKQVRVTVN-AEVPKFFVGLVYPGDWK 120 Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 V + + + ++ D Sbjct: 121 VGARAAASLEPVPFDAALLALNPSSGGIKTSGNTGIEVTGGSVMSNYTIETSGSTSIIAD 180 Query: 199 LSG--------SMHCAMNSDPEDVNSAPICQDKKRTKMA 229 S + AP D + K+ Sbjct: 181 QQVNANDGFRESGASRIEGGMGTNPYAPEVPDPLKDKIQ 219 >gi|226365428|ref|YP_002783211.1| hypothetical protein ROP_60190 [Rhodococcus opacus B4] gi|226243918|dbj|BAH54266.1| hypothetical protein [Rhodococcus opacus B4] Length = 548 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 58/211 (27%), Gaps = 26/211 (12%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D+SGSM + + ++ L F + ++ Sbjct: 350 AIDVSGSMETPAGNRSRMDLTVDAALAGNAMFPDSVSAGLWAFSQGLGGGPQDYQE---- 405 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ + ++ ++ LI T A++ + Sbjct: 406 LVPIRRYDTVVDGLTQ--RELMAQQAGKVEGLIGGGTGLYDTTLAAFRTVQDSY------ 457 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINA 371 +I LTDG N + S + +E + IVTI I Sbjct: 458 ----------DPRAVNSVIILTDGANEDPDSITKEQLLGILEREMDPARPVIIVTIGITD 507 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + + Y + + +VF Sbjct: 508 DADAATLAEISRVTGGSSYVAKDPADIANVF 538 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 20/184 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + AL+ L+ + + I + + + +R Sbjct: 276 KLQQTREALVTILNDLRPQDRFNIIGFSNRIKM--WKDHLLPVTPD--NIRNGKIYMYHL 331 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ A ++L + ++ IIFLTDG+ ++ Sbjct: 332 SPTGGTDINGALQAAIKLLNNYVAQNDIE-----------DRSVSLIIFLTDGKPTFGET 380 Query: 347 NVNTIKICDKAKENF-IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHV 401 N I K I I T+ I + + L L+ C + H LI Sbjct: 381 NTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGF 440 Query: 402 FQNI 405 + I Sbjct: 441 YDEI 444 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 20/184 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + AL+ L+ + + I + + + +R Sbjct: 310 KLQQTREALVTILNDLRPQDRFNIIGFSNRIKM--WKDHLLPVTPD--NIRNGKIYMYHL 365 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ A ++L + ++ IIFLTDG+ ++ Sbjct: 366 SPTGGTDINGALQAAIKLLNNYVAQNDIE-----------DRSVSLIIFLTDGKPTFGET 414 Query: 347 NVNTIKICDKAKENF-IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHV 401 N I K I I T+ I + + L L+ C + H LI Sbjct: 415 NTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGF 474 Query: 402 FQNI 405 + I Sbjct: 475 YDEI 478 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 51/187 (27%), Gaps = 23/187 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVR 277 + + +D +V +I + V + P+ WGT Sbjct: 46 ASGSMAGNDIEQARAGAEWVFGLLDEDDYVS------IIAFDNEVTTVLAPTRWGTISRE 99 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 V D TD + +A L + ++ L+ Sbjct: 100 TAVDAVADISAGGGTDMYSGLLEAKASLQD---------------LPTDDNTARRVLLLS 144 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG++N+ + + I+I I + + + + ++ A Sbjct: 145 DGKDNSH-DPEAFGTLAREIDTEGIRIKAAGIGSDYREETIRTLGTVARGEWTHLDAAGD 203 Query: 398 LIHVFQN 404 + F + Sbjct: 204 IESFFGD 210 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 58/184 (31%), Gaps = 20/184 (10%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + AL+ L+ + + I + + + +R Sbjct: 310 KLQQTREALVTILNDLRPQDRFNIIGFSNRIKM--WKDHLLPVTPD--NIRNGKIYMYHL 365 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ A ++L + ++ IIFLTDG+ ++ Sbjct: 366 SPTGGTDINGALQAAIKLLNNYVAQNDIE-----------DRSVSLIIFLTDGKPTFGET 414 Query: 347 NVNTIKICDKAKENF-IKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNADSLIHV 401 N I K I I T+ I + + L L+ C + H LI Sbjct: 415 NTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGF 474 Query: 402 FQNI 405 + I Sbjct: 475 YDEI 478 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/359 (10%), Positives = 91/359 (25%), Gaps = 62/359 (17%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT-----EI 118 G + +N + + D RF + + Sbjct: 1027 GTTYTFNNSNTSFSIALGTNKGTLYIENDGDYRFTPPSGGAEGAPVEVTYKIKDGDGDTS 1086 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + +N S + + V + ++ ++ + Sbjct: 1087 SAKLTIINPVLVVGSNANDTGSGASTAIDDHVRPNPLLAPDVDGAIVGGVGADVLIGDV- 1145 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + G + + +D+SGS+ T+ +K+A+ Sbjct: 1146 --GGVTSGSYNLTFM---IDMSGSI--------------------SGTEFQLMKDAINNL 1180 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 L + + + + + S T + + T+ A+ Sbjct: 1181 LAKFSGI----SQLQVEIGTFADNSNVVGTYSSVTAAQQAVSN---LTRSGGGTNYQAAL 1233 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-----IKI 353 ++T D K++ FLTDGE +T + Sbjct: 1234 TTLNTMMTVDPVAD-----------------HKYVYFLTDGEPTVGSWTNSTQIANGMAA 1276 Query: 354 CDKAKENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + I + I L ++P+ + V N D L ++ + Sbjct: 1277 LNALTAPGVVINAVGIGVPSGASFGNNLNAIDNTPDNYLAVDNFDDLSSGLGSLFTAVS 1335 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 22/343 (6%), Positives = 72/343 (20%), Gaps = 19/343 (5%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + M N ++ ++ + + D + ++ V Sbjct: 726 AFSVLMDGNGKVTVTQYAALEHGVDGSSASDHDDSVSLAAAAAIKVVQTVTDRDGDSTTA 785 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S+ ++ + +TI + + L + + + + Sbjct: 786 TSASGLNIAFKDDGPSAVADTITSGTNGTVNLVLVLDSSGSIGDTNMQVIKDAVTNLMNS 845 Query: 183 IEMGERPIFLIELVV--DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + L++ + + + T A+ Sbjct: 846 YGNSLVKVMLVDFAGSATVKSVGSQVWLTKDQATGQLTTISSGGSTDYDDALQAVQDNYG 905 Query: 241 SIDLLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA- 297 + + + + + W ++ + + + T A Sbjct: 906 TPPSADNTFVFFISDGVPSSTDDAINTEERNEWTNFLTQKGIDGAYA-VGIGSTSVLDAD 964 Query: 298 ---MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + + T G + D + + + + Sbjct: 965 LQSVAWSPSGAHNSNVVVINTAGQLSGTLTNLTQVIDGNVTANDSAGADGFATPKLVSV- 1023 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 E T N + + + + N Sbjct: 1024 ----EYNGTTYTF-----NNSNTSFSIALGTNKGTLYIENDGD 1057 >gi|170591963|ref|XP_001900739.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158591891|gb|EDP30494.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 447 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 39/116 (33%), Gaps = 21/116 (18%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ A+K A + + + K + +TDG++ + + Sbjct: 12 SGATNTGAALKFALERGFQNARGGN---------------IPKVAVVVTDGQSQDSVA-- 54 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + ++ + + + + N +L + +P V + D L + Sbjct: 55 ---ESAQQLRDAHVMVYAVGVTNLVNVHQLHQI-AGNPARVLTVESFDELSKTLAD 106 >gi|156394499|ref|XP_001636863.1| predicted protein [Nematostella vectensis] gi|156223970|gb|EDO44800.1| predicted protein [Nematostella vectensis] Length = 175 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 37/156 (23%), Gaps = 19/156 (12%) Query: 236 LLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F+ + LI Y+ + + G + T Sbjct: 38 KQFVQGLSDIFDISPGQSRASLIIYSDFPKLIFDLEDGVTNQNITSVLKNLEYLRGRTRI 97 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A ++ + LTDG+ + Sbjct: 98 DKALMMAEEVFAD-----------------ARPTVPRIAFILTDGKQTQDYDAIPLDVSS 140 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + K+ +KI I + + L+ P + Sbjct: 141 QRLKKMGVKIYVIGVGPYVDISE-LRLLTEKPGDVF 175 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 K+ K+AL L + + I + + S K+ Sbjct: 302 SSASMVGAKLRQTKDALFTILHDLRPQDRFNIIGFSNRIKVWKDNLISVTPNSIRDGKIY 361 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A++ A ++L + + I I+FLT Sbjct: 362 -----IHHMSPTGGTDINGALQTAIRLLNN----------YVAHNDIEDRSVS-LIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + KE + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTKEAAQGRVCIFTIGIGNDVDFKLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EENAGAQLIGFYDEI 478 >gi|118093056|ref|XP_421769.2| PREDICTED: similar to AMACO [Gallus gallus] Length = 800 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 48/173 (27%), Gaps = 20/173 (11%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + ++ + L+ Y +R + L + Sbjct: 562 FTINRDVTQIALVAYGSRARTVFAL-DTYTNNSALLQAINQMPFLG------------DV 608 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 ++ + K K ++ LT+G + ++N + + Sbjct: 609 ASASSALLHIHSDVMTVQKGARPGVSKVVVLLTNGG-----GMEDAAAPARHLRDNGVMV 663 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + I LL+ SP Y ++ + + L I++ + S + Sbjct: 664 FVVVIG-DAERDTLLRV-AGSPSYLVHISSYEDLQRYQDLITERICEEAKSPV 714 Score = 43.3 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 47/170 (27%), Gaps = 21/170 (12%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G++ +++ K + T++ A+K Sbjct: 78 DALDIHPGRVRVGMVQFSSAPHLEFSLDSYLTKQEVKERIKRTAFRGGSTETGRALKYIL 137 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K +I ++DG++ + + KE Sbjct: 138 HKGFPGGRNL---------------TVPKILIIISDGKSQGSTAVPAM-----QVKERGT 177 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH-VFQNISQLMVH 411 + + I + + ++ +A + ++ +S + Sbjct: 178 TVFAVGIKFPRWEELHAVASEPTEQHVLFAGDASDAANGLYSALSSSICS 227 >gi|294997269|ref|NP_001025043.3| integrin alpha-D isoform 1 [Mus musculus] Length = 1202 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 67/215 (31%), Gaps = 39/215 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 170 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 224 Query: 283 DMDSLILKPTDSTPAMKQA-----------YQILTSD------------KKRSFFTNFFR 319 + T + +++ + L ++ F Sbjct: 225 IVQLQ--GLTYTASGIQKVVDSNRLLLLRVWSALKPGCHYNTHMPWWVWQRVRFSWKELF 282 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 +K +I +TDG+ F+ + + +A++ I I + + L Sbjct: 283 HSKNGARKSAKKILIVITDGQK--FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTAL 340 Query: 380 KT-----CVSSPEYHYNVVNADSLIHVFQNISQLM 409 + S ++ + V N +L + + I + + Sbjct: 341 QELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 375 >gi|189238317|ref|XP_972173.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 604 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 79/262 (30%), Gaps = 51/262 (19%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + + L + + ++ ++S Sbjct: 262 VVFVLDHSGSMEGRKYEQLMQAMDKILSDLN-PDDLFHIVRFSENVSV------------ 308 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K + + + D ++ + T Sbjct: 309 -------WNFEKNKFDQVS--------FLQKPEYRNLDSFLAEFNLGDAAQV-------T 346 Query: 274 EKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 E + + D + T+ + ++ ++ + N Q Sbjct: 347 EGNIKKAKKIKDHDVDMGCTNIIGGLVVGLYLVRRTLQKFYEKNVE--------TKHQPM 398 Query: 333 IIFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE-- 387 IIFLTDG N SN + I KI K + N I ++S + L K + Sbjct: 399 IIFLTDGLPNEGISNPDKITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFS 458 Query: 388 -YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ +S Sbjct: 459 RHIYEAADAALQLQNFYRTVSS 480 >gi|74220882|dbj|BAE33629.1| unnamed protein product [Mus musculus] Length = 1202 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 67/215 (31%), Gaps = 39/215 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV--RQYVTR 282 ++ +K+ + + + S L+ Y+ ++ + + + + V Sbjct: 170 QSDFTQMKDFVKALMGQLASTS-----TSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDA 224 Query: 283 DMDSLILKPTDSTPAMKQA-----------YQILTSD------------KKRSFFTNFFR 319 + T + +++ + L ++ F Sbjct: 225 IVQLQ--GLTYTASGIQKVVDSNRLLLLRVWSALKPGCHYNTHMPWWVWQRVRFSWKELF 282 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 +K +I +TDG+ F+ + + +A++ I I + + L Sbjct: 283 HSKNGARKSAKKILIVITDGQK--FRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTAL 340 Query: 380 KT-----CVSSPEYHYNVVNADSLIHVFQNISQLM 409 + S ++ + V N +L + + I + + Sbjct: 341 QELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 375 >gi|313239054|emb|CBY14036.1| unnamed protein product [Oikopleura dioica] Length = 442 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGEN---NNFKSNV 348 +S K A+ + + +G + F K T G N S Sbjct: 226 NSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGTVVLLTNAASTD 285 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 ++ D+ KE +++ + + GQ L SSP E Y + L + + I+ Sbjct: 286 EVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENLYTAEESSDLAGLVRTIA 344 Query: 407 QLMVHRKYS 415 + + S Sbjct: 345 DEICSTELS 353 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 66/294 (22%), Gaps = 35/294 (11%) Query: 127 ISMTHMANNRLDSSNN---TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 +++ + L T+ L + + + ++ Sbjct: 58 VALFVLGLATLGVGAARPQASVPIAANSTTILLALDVSGSMCSTDVSPNRLTAAEKAATD 117 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +P +V +GS + + + ++ A+ +L +D+I Sbjct: 118 FIRAQPGGSRIGLVTFAGSAGLLVPPTDDTDKLLAALKSLTTSRGTAIGQGILTSIDAIA 177 Query: 244 LLSHVKEDV---YMGLIGYTTRVEKNIEPSWGTEKV----RQYVTRDMDSLILKPTDSTP 296 + G G + + + G V R + T Sbjct: 178 EVDPSVPPTGADVPGGTGGEYAADVIVVLTDGANTVGVDPRTAAGEAATRRLRVYT---- 233 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + S G G + I Sbjct: 234 -IGFGTTTPAPMVCDSSQIGDDAFGFGGGFGGGGGGGFGGGGGGFGGRNGERDPRAI--- 289 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L + ++ +Y NA+ L ++ + + Sbjct: 290 -----------------DEDALKQVASTTGGTYYRAQNANELQDALTDLPRTIT 326 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 44/138 (31%), Gaps = 6/138 (4%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLG 76 + +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 1 RGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAA 59 Query: 77 DRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + SN A + + + Y V +++ Sbjct: 60 AAANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFL 119 Query: 134 NNRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 120 GPSRTVSATSTAKATNVD 137 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 20/261 (7%), Positives = 48/261 (18%), Gaps = 20/261 (7%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + +++ +L L + I + K ++ A A ++ Sbjct: 1 MRHFFRDRAGQITVLASLLSPVGLGVAALAIDLSTLQMVKQRLKVTA-DAASLAAVAVLP 59 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + R +I+ A S N V + Sbjct: 60 DTGTALSRALAIAADNAGAGAGSVTTAADVQFGSYDSATRTFTAGATPANAVQVTASRSE 119 Query: 131 HMANNRLDSSNN-----------TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 N + + L +S Sbjct: 120 ARGNPVITGFARALGWATPDLSASAVAVRFSPAYCFLVLDPSASGALSVSGTGRLSVPNC 179 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 +++ + + + + + A Sbjct: 180 GVQVNSTSASAA--------TTGNNSTAQARSFCITGGYSGSSFSPVPITRCLPAADPLA 231 Query: 240 DSIDLLSHVKEDVYMGLIGYT 260 D + Y GL + Sbjct: 232 DIPEPAPPTAGCYYNGLNTGS 252 >gi|126334042|ref|XP_001370610.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1154 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 61/181 (33%), Gaps = 23/181 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ +I + L+ Y+ + + + + + T + Sbjct: 169 MKEFVRTIM-GQFRGTNTLFSLMQYSNDFRIHFTFNIFKNNPDPGILVNRIEQLGGLTFT 227 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKI 353 A+++ + + K +I +TDG ++ + I + Sbjct: 228 ATAIQKVVKEIFQSWNG-------------ARKDAVKILIVITDGRKESDRLEYEDVIPL 274 Query: 354 CDKAKENFIKIVTISIN---ASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQL 408 +KA I I + +P+ + L+T S P + + V N +L ++ + + Sbjct: 275 AEKA---GIIRYAIGVGNTFTTPSALQELRTIASQPSQEHVFQVNNFAALRNIQNQLQEK 331 Query: 409 M 409 + Sbjct: 332 I 332 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 72/223 (32%), Gaps = 38/223 (17%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +P+ + ++D SGS+ + + K A+ ++ + ++S Sbjct: 38 TQPVSELVFILDSSGSVAQSDF-----------IISVEFLKFASKIISVSASTTRVAVIS 86 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + + Y + E + ++ + T + A++ A + Sbjct: 87 YSSCNQIHIRVNYISSPENKNKCTFDN-------DLTSVNYHPGGTCTAGALEAAGR--- 136 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + Q+ ++ LTDG +N+ K K +KI T Sbjct: 137 -------------DVLSHGRPGAQRVVMLLTDGASNDG---GPPHANAQKLKSEGVKIFT 180 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I I S L S EY Y + + + ++ + + Sbjct: 181 IGIG-SIKLSELNAIATSVDEYVYILADFGDVRNLATVVKDDI 222 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/328 (10%), Positives = 83/328 (25%), Gaps = 67/328 (20%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + A AL + + +++ +S M + Sbjct: 322 TQADSAALDALKQQLLSPAEKTYRSEQGFRDGGSDAPAGGSVLPASLPMMAPASQAT-TV 380 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 S +T + N+ + + + + + I + Sbjct: 381 SADTAWTNLKKDARMLVLIDVSGSMQTKIDGGQSRIELMESTAIAA-----LDVLPKTTR 435 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 G+ + N V+ P+ + + + GLI Sbjct: 436 LGAWAFSSNLQKNHVDYLPL---------------------TNGEQPILDDTYRNGLIA- 473 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + T + AY+ +T Sbjct: 474 --KAHTLPGLAA----------------KNGDTALYDTIAAAYKSVTDTY---------- 505 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-----ENFIKIVTISINASPN 374 + ++ LTDG N++ + ++ + K + +KIVTIS+ + Sbjct: 506 ------DPNYVNSVVVLTDGTNDDPNGGLALDQLLARLKSQYSADKPVKIVTISLGTGTD 559 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVF 402 L + ++ Y + + VF Sbjct: 560 PDALKRIAKATDGLSYQTKTPEQISGVF 587 >gi|298484180|ref|ZP_07002346.1| BatB protein [Bacteroides sp. D22] gi|298269684|gb|EFI11279.1| BatB protein [Bacteroides sp. D22] Length = 342 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 55/162 (33%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESINPSLISKQGTAIGEAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II +TDGEN+ + +K I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPD 229 >gi|262201266|ref|YP_003272474.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262084613|gb|ACY20581.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 594 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 60/202 (29%), Gaps = 20/202 (9%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + T+ L +A + L+ + + G + + + + Sbjct: 392 DTSGSMETPVGNTTRAGLLIDASMTGLEMFP-----RNAMVGGWAFGIDKGGPDQDWTEM 446 Query: 273 TEKVRQYVTRDMDSLILKPT---DSTPAMKQAYQILTSDKKRSFFTNFFRQGVK---IPS 326 + R T A+++ + K Sbjct: 447 -----APIRRLDAPSGSGGTHREALARAVREGLAPARLGGGTGLYDTTLAAFKKVQSTYD 501 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAK----ENFIKIVTISINASPNGQRLLKTC 382 + +I +TDG+N + S T + + + + I+TI I+ + L + Sbjct: 502 PNYSNSVIIMTDGQNEDPGSITLTELLAELKELEDPARPVLILTIGISEDADTNALRQIA 561 Query: 383 VSSPEYHYNVVNADSLIHVFQN 404 ++ Y A + VF N Sbjct: 562 QATGGTTYVAKTAADIKQVFTN 583 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 58/191 (30%), Gaps = 45/191 (23%) Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + + + +D+S SM +D K ++ K + + + Sbjct: 84 VSAQQTVGVDLAFCIDVSNSMAA---------------RDVKPDRIGFAKQIVTHTMQQL 128 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQA 301 + +I + + + R ++ + + T+ A++++ Sbjct: 129 AGS-------RIAMIVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQGTNLGQALERS 181 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q L+ K +I LTDGE++ + + KE Sbjct: 182 AQALS------------------APSRAGKAVILLTDGEDHEG----GLEEGIKRLKEQG 219 Query: 362 IKIVTISINAS 372 IK ++I Sbjct: 220 IKAYVVTIGLP 230 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 23/191 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDMD 285 + L A+ L+S ++ + + +I + + + + + Sbjct: 33 IDTLNKAVESMLNSFKKAETMETFIKLSIITFGSENGVDLHTPLTE-----VSKID-FKP 86 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A K ++ ++ I+ L+DGE N+ Sbjct: 87 LTVSGSTPMGAAFKMGKAMIEDKDIFK-------------GRDYRPTIVLLSDGEPNDDW 133 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + ++I A + L + +A+++I F+ I Sbjct: 134 -RQPLDDFVSTGRTKKCDRMALAIGA-ADKTVLNMFIEGCENSLFYAEDAENIIDEFKKI 191 Query: 406 SQLMVHRKYSV 416 + + R SV Sbjct: 192 TMSVTQRTKSV 202 >gi|86748912|ref|YP_485408.1| TadE-like [Rhodopseudomonas palustris HaA2] gi|86571940|gb|ABD06497.1| TadE-like [Rhodopseudomonas palustris HaA2] Length = 181 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 44/133 (33%), Gaps = 1/133 (0%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAI 61 LLSR ++ + + + +I FA+ F+ L+ +I +E+A ++A Sbjct: 5 LLSRAVLLARRFSRNRRGSAAIEFAMIAPIFIALLFAIIETAFVFLASQVLETAVQDSAR 64 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L + + + N K D S S +N Sbjct: 65 LILTGQAQAASYTQSQFKTDLCNRLKALFSCDGVYVDVQSYGSDFSTVSITTPIDSSKNF 124 Query: 122 VNSSRISMTHMAN 134 VN+ + S + Sbjct: 125 VNTMKYSPGAAGD 137 >gi|262196568|ref|YP_003267777.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] gi|262079915|gb|ACY15884.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] Length = 564 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 31/87 (35%), Gaps = 4/87 (4%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS---SPE 387 K I ++DG + + ++ +A N I+I TI A N L S Sbjct: 174 KLIAVVSDGRDADPSPENY-RRVAKRAARNDIRIHTIGFPADRNRYPLYGLAEMSKQSEG 232 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKY 414 V+ + F +++ + + Sbjct: 233 TFRLVLTESAFGSHFGQLAREINEQYV 259 >gi|260824347|ref|XP_002607129.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] gi|229292475|gb|EEN63139.1| hypothetical protein BRAFLDRAFT_118666 [Branchiostoma floridae] Length = 642 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 46/390 (11%), Positives = 102/390 (26%), Gaps = 40/390 (10%) Query: 43 VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 + KK + A +AA+ AG S +++ R + ++ F + Sbjct: 126 YVGTVMKKGAAREAYDAAVEAGESAGH--VAQESPRGREEFRISINVGAEEKVTFQLTYQ 183 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTH-----MANNRLDSSNNTIF-YNMDVMTSYDY 156 + + + I+ + ++ + L + D T Sbjct: 184 EFLVRRLGIYEHVISIRPEQIIPDMRISVDLRESQGFSFLRVPDIRTSDLLTDQSTDELS 243 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 S G F+++ V + V+ Sbjct: 244 SATVSRASPEHYTVTYSPSEQEQAAMSTQGIMGDFVVQYDVIRQQQIGFIQVYGDYFVHY 303 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 T + + ++ +D V + E + E + Sbjct: 304 ------FAPTGLPVMPKNVIFVIDL--REMDFFNIVTFSSRTKRWKSELQV---ANQENI 352 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R T T+ A+ +A +L + + + I+ L Sbjct: 353 RAADTYVTSMAAHGGTNINDAILEASVLLDPELRSRHDS-------------HASMIVLL 399 Query: 337 TDGENNNFKSNVNTI--KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 TDG+ +N N I D N + + + + L + + + + + Sbjct: 400 TDGQPTGGVTNTNHIIANARDSLAGNHA-LFCLGFGYDVSFEFLERLALQNGGFARRIYP 458 Query: 395 ADS----LIHVFQNISQLMVHRKYSVILKG 420 D L F ++ + +S G Sbjct: 459 DDDGELQLTSFFDEVA-DVTQTAFSNYFNG 487 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 63/193 (32%), Gaps = 21/193 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMD- 285 K+ + ++ I + T R+ E G K + + + + D Sbjct: 267 FERAKDVIKTLIEKISYYEVSPNYEILLFATDTDRIISMREFKNGQGKDLLKIIQKLQDY 326 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + K S + QAY + K Q +I TDG+ N Sbjct: 327 AYDKKGQRSGTNIAQAYNKIYESM------TIELMTNKEDFKATQHIVIMFTDGQANMGG 380 Query: 346 SNVNTIKICDKAK--------ENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNA 395 S + DK K E +++ + + + + LKT ++ ++ + + + Sbjct: 381 S---PKPLVDKIKSLVRQNSVEEKLELYVFGLGNDVHAEDINDLKTDRANEKFFFKLKSL 437 Query: 396 DSLIHVFQNISQL 408 D L F N+ Sbjct: 438 DDLKETFDNMIDE 450 >gi|284053937|ref|ZP_06384147.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 339 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 36/139 (25%), Gaps = 19/139 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + ++ + R + + + + T +K + L Sbjct: 1 NVGDRISVVAFDHRAKVLVP-NQDIADPDGIKKKIDGLRCSGGTAIDEGLKLGIEELGKG 59 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 K+ F LTDGEN + + + A E + I ++ Sbjct: 60 KQDRISQGF-----------------LLTDGENEHGDNKRCLK-LAKLATEYKLTINSLG 101 Query: 369 INASPNGQRLLKTCVSSPE 387 N L K + Sbjct: 102 FGDDWNQDILEKIADAGGG 120 >gi|218693235|gb|ACL01121.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFFNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|194016210|ref|ZP_03054824.1| YwmD [Bacillus pumilus ATCC 7061] gi|194011683|gb|EDW21251.1| YwmD [Bacillus pumilus ATCC 7061] Length = 225 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 57/205 (27%), Gaps = 29/205 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV------KEDVYMGLIGYTTRVEKN 266 D++ + + K+ K ++ F + + V E + Sbjct: 38 DLSGSMAQSVEGEKKIDIAKRSIQSFASILSDDTQVLLRVFGHEGTNKNAGKAISCESSE 97 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 +G+ + + T A+ Q + Sbjct: 98 AVFGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQA---------------- 141 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCV- 383 + + ++DG+ S + + E I + I + + R LK+ Sbjct: 142 -EGKNIVYVVSDGQETCGGSPS---QAAKELHEGGIDTIVNIIGFDVNEKEARSLKSVAK 197 Query: 384 SSPEYHYNVVNADSLIHVFQNISQL 408 + + NA+ L ++ QN + Sbjct: 198 AGGGQYQPAANAEELNYILQNAAST 222 >gi|116876155|gb|ABK30937.1| complement component 2/factor B variant 2 [Carcinoscorpius rotundicauda] Length = 889 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 50/150 (33%), Gaps = 17/150 (11%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 G + +++ V + + + T + A+ + K Sbjct: 463 TRFGAVSFSSTVSASF-LPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSK 521 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISI 369 +F + I LTDG+ N + ++ + K + +I +I++ Sbjct: 522 HTF-----------ADRAMKTIIFLLTDGKAN---MRGDPKQVAKELKADVKAEIYSIAL 567 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + +L + S ++ Y + + ++L Sbjct: 568 TGDYDINKLREVASSKKDHVYILKDYETLD 597 >gi|115315541|gb|AAV65032.2| complement component 2/factor B variant 1 [Carcinoscorpius rotundicauda] Length = 889 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 50/150 (33%), Gaps = 17/150 (11%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKK 310 G + +++ V + + + T + A+ + K Sbjct: 463 TRFGAVSFSSTVSASF-LPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSK 521 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISI 369 +F + I LTDG+ N + ++ + K + +I +I++ Sbjct: 522 HTF-----------ADRAMKTIIFLLTDGKAN---MRGDPKQVAKELKADVKAEIYSIAL 567 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + +L + S ++ Y + + ++L Sbjct: 568 TGDYDINKLREVASSKKDHVYILKDYETLD 597 >gi|110799635|ref|YP_695225.1| von Willebrand factor type A/Cna B-type domain-containing protein [Clostridium perfringens ATCC 13124] gi|110674282|gb|ABG83269.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens ATCC 13124] Length = 928 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/300 (10%), Positives = 65/300 (21%), Gaps = 67/300 (22%) Query: 153 SYDYRLQFIEHLLNQRY------------NQKIVSFIPALLRIEMGERPIFLIELVVDLS 200 + L + ++ + L ++D S Sbjct: 141 DVVFLLDNSNSMTTNNRAIKIKEQIKNVMDKLNTNNTRYALVTYASTILDGRYYHLIDRS 200 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 + N + ++ + + + + + Y+G Sbjct: 201 IGDNKYTVYKGYTSNQCYLNFTSNIQEI---------YNKIPTTVPNQRNNGYVGGTFTQ 251 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLI-----------------LKPTDSTPAMKQAYQ 303 + K IE ++ + + D L T + Sbjct: 252 EGLLKAIELLKNSDADEKIIIHLTDGLPTFSFLLKEFGGNEKAIFDYNTQYN---GIGVR 308 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-- 361 + + T + I + NN + K+ Sbjct: 309 GFGTSYFFNTKTQKPYIYSREEVYSALNRSINKNESIWNNGFPTTL---EAENIKKENPD 365 Query: 362 IKIVTISINAS---------------------PNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I I TI I P ++ L++ SSP + N D + Sbjct: 366 INIYTIGIELKKEVYKWDDYRKYYNAEGVVELPEIKKFLESISSSPAEAFVNENVDDIDE 425 >gi|296224100|ref|XP_002757907.1| PREDICTED: vitrin [Callithrix jacchus] Length = 656 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 51/184 (27%), Gaps = 25/184 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + L + D +G + YT + K Sbjct: 489 FRTVLQFVTNISKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSG 548 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A + L +K +I +TDG + + +V Sbjct: 549 GTSTGAAINFALEQLF----------------MKSKPNKRKLMILITDGRSYD---DVRI 589 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQL 408 + K + I + + Q L+ + P ++ + V D+L + Q Sbjct: 590 PAMAAHLK--GVITYAIGV--AWAAQEELEVIATHPARDHSFFVDEFDNLYQYVPRVIQN 645 Query: 409 MVHR 412 + Sbjct: 646 ICTE 649 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 52/186 (27%), Gaps = 31/186 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDST 295 D L MG++ Y + ++ ++ + + ++ Sbjct: 294 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSQDLKTAIEKIPQR--GGLSNVG 351 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + S + ++ + DG + + Sbjct: 352 RAISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASR 393 Query: 356 KAKENFIKIVTISINASPNGQRL--------LKTCVSSPEYHY-NVVNADSLIHVFQNIS 406 A+E+ I I I+I + ++ K + ++ +V + L Q + Sbjct: 394 LARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQTLV 453 Query: 407 QLMVHR 412 + + Sbjct: 454 KRVCDT 459 >gi|149067646|gb|EDM17198.1| integrin alpha M [Rattus norvegicus] Length = 1151 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 59/187 (31%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMDQFQ-----KSKTLFSLMQYSDEFRTHFTFNDFKRNPDPKSHVRPIRQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L + K ++ +TDGE Sbjct: 221 LNGRTKTASGIRKVVRELFQKINGARDNAA-------------KILVVITDGEKFG--DP 265 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 +N + +A+E I I + + P +R L T S P ++ + V N ++L + Sbjct: 266 LNYEDVIPEAEEAGIIRYVIGVGNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIR 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|29789036|ref|NP_036843.1| integrin alpha-M [Rattus norvegicus] gi|8917587|gb|AAF81280.1| integrin beta 2 alpha subunit [Rattus norvegicus] Length = 1151 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 59/187 (31%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + +D K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMDQFQ-----KSKTLFSLMQYSDEFRTHFTFNDFKRNPDPKSHVRPIRQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L + K ++ +TDGE Sbjct: 221 LNGRTKTASGIRKVVRELFQKINGARDNAA-------------KILVVITDGEKFG--DP 265 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 +N + +A+E I I + + P +R L T S P ++ + V N ++L + Sbjct: 266 LNYEDVIPEAEEAGIIRYVIGVGNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIR 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|87311196|ref|ZP_01093319.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] gi|87286104|gb|EAQ78015.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] Length = 179 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 11/127 (8%), Positives = 27/127 (21%), Gaps = 5/127 (3%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-A 63 + R + + FA + L I + + + A A Sbjct: 40 RKIRPTKRSRSTQRHGAAIVEFAFAAPILFLFILASVEFGRLTMIR----HTADNAAYEA 95 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 MV + + ++ A ++ + N Sbjct: 96 ARYAMVPGATSSEAKEKATQLLATIGARKVEVDVDPAVLQPETKTITVTVRVPAAGNSWV 155 Query: 124 SSRISMT 130 + + Sbjct: 156 VPKYFSS 162 >gi|332845803|ref|XP_510951.3| PREDICTED: integrin alpha-X [Pan troglodytes] Length = 1209 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHQLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 66/210 (31%), Gaps = 33/210 (15%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 +AP ++ + +D+ D+ +G++ Y+ I G K Sbjct: 7 TAPRPIFNPLEDFEKVRQWIGKLVDTFDIEEDGG-GTRVGVVIYSDAPRMEISLGNGLGK 65 Query: 276 -------------VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + ++ T + +++ A S G Sbjct: 66 TDLIKAVLVIYLNLIDFLFAQSLMYERGNTLTGESIRYA----------SEVAFSETSGA 115 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + S + +I LTDG + + I A+E+ I I + + + L Sbjct: 116 RALSEGINRIMIVLTDGRAQDNVAGPAVI-----AQEDGIVIYAVGVG--HAIKDELDEI 168 Query: 383 VSSPE--YHYNVVNADSLIHVFQNISQLMV 410 S P + ++V ++ + N+ + + Sbjct: 169 ASKPTHRHKFSVSEYGAIESIRSNLRRTIC 198 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 66/210 (31%), Gaps = 33/210 (15%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 +AP ++ + +D+ D+ +G++ Y+ I G K Sbjct: 7 TAPRPIFNPLEDFEKVRQWIGKLVDTFDIEEDGG-GTRVGVVIYSDAPRMEISLGNGLGK 65 Query: 276 -------------VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + ++ T + +++ A S G Sbjct: 66 TDLIKAVLVIYLNLIDFLFAQSLMYERGNTLTGESIRYA----------SEVAFSETSGA 115 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + S + +I LTDG + + I A+E+ I I + + + L Sbjct: 116 RALSEGINRIMIVLTDGRAQDNVAGPAVI-----AQEDGIVIYAVGVG--HAIKDELDEI 168 Query: 383 VSSPE--YHYNVVNADSLIHVFQNISQLMV 410 S P + ++V ++ + N+ + + Sbjct: 169 ASKPTHRHKFSVSEYGAIESIRSNLRRTIC 198 >gi|310824462|ref|YP_003956820.1| hypothetical protein STAUR_7237 [Stigmatella aurantiaca DW4/3-1] gi|309397534|gb|ADO74993.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 913 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 9/94 (9%), Positives = 29/94 (30%), Gaps = 6/94 (6%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCV 383 S + ++ +D ++ + + + + I + + L + Sbjct: 491 SDKATRHVLLFSDAADSEEPDDYRATLAA--LRRENVTVSVIGLGTPKDSDADLLREVAQ 548 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 Y +A SL +F + + + + + Sbjct: 549 LGGGRIYFAEDALSLPRIFSQ--ETITVARAAFV 580 >gi|115377018|ref|ZP_01464236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365996|gb|EAU65013.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 884 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 9/94 (9%), Positives = 29/94 (30%), Gaps = 6/94 (6%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCV 383 S + ++ +D ++ + + + + I + + L + Sbjct: 462 SDKATRHVLLFSDAADSEEPDDYRATLAA--LRRENVTVSVIGLGTPKDSDADLLREVAQ 519 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 Y +A SL +F + + + + + Sbjct: 520 LGGGRIYFAEDALSLPRIFSQ--ETITVARAAFV 551 >gi|312138512|ref|YP_004005848.1| hypothetical protein REQ_10630 [Rhodococcus equi 103S] gi|325676202|ref|ZP_08155882.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] gi|311887851|emb|CBH47163.1| putative secreted protein [Rhodococcus equi 103S] gi|325552986|gb|EGD22668.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] Length = 545 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/363 (11%), Positives = 86/363 (23%), Gaps = 40/363 (11%) Query: 51 NSMESA----NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 ++A + A A ++ ++ + N A+ +A + Sbjct: 202 AQAQAANMHQTDPAARLAAVASDGGIAIATEQQVAGQNAGDSAVRLEATTPNSGAVFLDY 261 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 + ++ + D+ + L + + Sbjct: 262 PVVA----TAPGSEHDDARSAGVALAEVMGSDAGRAALAQRGFRAPDRAP-LDAGRGVGD 316 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVV-DLSGSMHCAMNSDPEDVNSAPICQDKKR 225 A + + LVV D+SGSM + + + R Sbjct: 317 VAVLTVEDPTETAQVLRRYAVLALPSRALVVEDVSGSMAEQAGPETRIALTVQASETGAR 376 Query: 226 TKMAALKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + L F + + + L V + + V Sbjct: 377 LFPDNAQLGLWAFSIGLGGGSQDYKELAPIRRLDETVDGVSHRQRLTDAVRTLPSLVK-- 434 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T A++ + +I LTDG N + Sbjct: 435 ------GGTGLYDTTLAAFRKVKEGY----------------DPAAINSVILLTDGANED 472 Query: 344 FKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + +E + IVTI I + L K ++ + + Sbjct: 473 PSTISLDELLATLKREQDPARPVIIVTIGITEDADAAVLQKISAATGGTSHVARTPAEIP 532 Query: 400 HVF 402 VF Sbjct: 533 GVF 535 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 69/197 (35%), Gaps = 28/197 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 + +M L+ ++ ++ LS +++ ++G++ + K + + V Sbjct: 173 VSGSMTGQGRMERLRRSVSTYI-----LSTIEDGAWLGIVTFRGTSHKICDLEQLNGDSV 227 Query: 277 RQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 R+ + + T+ T A+ A QIL + K+ + Sbjct: 228 REEILNMTLDGLTNRTGKVGTNITRAVTLAVQILGPAVQD----------RKLGDSTGPR 277 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 +I +TDG + ++V + D AK + I TI++ L + + Sbjct: 278 QMILITDGRDRRLNNSVIFMLQNDTAK--GVVIDTIALGDGAEEGLPL-LSEVTGGQFFF 334 Query: 392 VVNAD----SLIHVFQN 404 ++D +L Sbjct: 335 SPDSDAGGSALDDALTA 351 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 39/137 (28%), Gaps = 20/137 (14%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 Q + + T A+ A Q + I Sbjct: 411 AASAQVLAYADSLVADGGTAIYDALTLAQQQ--------------ARQELRADPERFVSI 456 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRLLKTCVSSPEYHY 390 + LTDG N + + A++ +++ I + +G+ + + + Sbjct: 457 VLLTDGANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGE-MQALAALTGGRAF 515 Query: 391 NVVNA--DSLIHVFQNI 405 + N L VF+ I Sbjct: 516 DARNTGKSGLPLVFKEI 532 >gi|42524419|ref|NP_969799.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] gi|39576628|emb|CAE80792.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] Length = 469 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 73/244 (29%), Gaps = 36/244 (14%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + ++ + + ++ I L+E V+D SGSM M Sbjct: 43 DFDPSKIPANASGEVKAEDLPSTAIPLVEYVIDSSGSMGQLMGPKK-------------- 88 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 TK+ LK L +L S +G + + K+ Sbjct: 89 TKIYVLKKLLARYLMSQWTEKTSSGLRVIGSRRKKDCKDNYLAIEPAQSKLGAIEGIVKG 148 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T A+K AY+ + K ++ TDGE + Sbjct: 149 FEPVGMTPIGQALKDAYK-------------------DVEHYKGPKRVVLFTDGEETCGQ 189 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I K+ +K ++ N +L + + + L +FQ++ Sbjct: 190 -DPCKIAAELSGKDVDLKFFVVAFGL-QNQPDVLDKLACIGDM-SQADDEEKLEELFQDL 246 Query: 406 SQLM 409 + + Sbjct: 247 DKQL 250 >gi|293190491|ref|ZP_06608878.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] gi|292820902|gb|EFF79858.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] Length = 338 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 51/215 (23%), Gaps = 22/215 (10%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 D +C D + + F + I + + + Sbjct: 90 DKSASRDIVLCLDASGSMLPYDSKIGGAFREIISHFEGERISLQLW----DAYSMTMFPL 145 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ--------- 320 + E + D++ T A Q L N Sbjct: 146 TDDYEMATDVLQDMSDTIDTGLTRIG-GRLSATQELFDYLAPVMDENQEVSSIVGDGLAS 204 Query: 321 ---GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS---PN 374 G + I+ TD E + I + AK + + + + + Sbjct: 205 CVMGFDHNDKQRSRTILLATDNEVYGDGVYNLSEAI-EFAKSQGVTVTALYPGSDITLSS 263 Query: 375 GQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQL 408 L+ + Y+ + S+ V + I Sbjct: 264 EALQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|296130910|ref|YP_003638160.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296022725|gb|ADG75961.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 330 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 14/177 (7%), Positives = 48/177 (27%), Gaps = 13/177 (7%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + + L + L + + + E V + ++ +D+ Sbjct: 115 VVATFEQLVDGFTGERVALSVFDSTSATVFPLTDDYELVTSRLAAAREAFADPSSDAAAD 174 Query: 298 MKQAYQILT--SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + L + + + + + ++ TD E + + D Sbjct: 175 VLRGTDGLDGQASLIGDGVASCALLFDQYDADR-SRSVVLATDNE-LAGEPVYTLAEAAD 232 Query: 356 KAKENFIKIVTIS------INASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNI 405 A + + + AS + L+ + + ++ + + Sbjct: 233 LAASRGVVLHALYPDEDRFFGASTAAE--LRIAAEGTGGTFAAAQDPGAVPAILAQV 287 >gi|168217441|ref|ZP_02643066.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens NCTC 8239] gi|182380509|gb|EDT77988.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens NCTC 8239] Length = 928 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/291 (11%), Positives = 66/291 (22%), Gaps = 49/291 (16%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR-IEMGERPIFLIELVVD--LSGSMHCAMNS 209 + L +L KI I ++ ++ L+ H S Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLIDRS 200 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + N ++ + + + + Y G + K IE Sbjct: 201 IGNNQYTVYDLYTSNQCHLNFTSNIQEIYNKIPSTVPNQRNNPYAGGTFTQEGLLKAIEL 260 Query: 270 SWGTEKVRQYVTRDMDSLI-----------------LKPTDSTPAMKQAYQILTSDKKRS 312 + + + D L T+ + L + Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEFGGNKKAIFDYNTNYN---GIGVRGLGTSYFFD 317 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISIN 370 T + + NN + K I I TI + Sbjct: 318 TKTQKPYIYSREEVYSALNRSKNKYESIWNNGFPTTL---EAENIKNENPDINIYTIGVE 374 Query: 371 A-------SPNGQ--------------RLLKTCVSSPEYHYNVVNADSLIH 400 Q + L++ SSP + N D + Sbjct: 375 LKKEVYKWDDYRQYYNAEGTVELPEIRKFLESISSSPAEAFVNENVDDIDE 425 >gi|311899781|dbj|BAJ32189.1| hypothetical protein KSE_64300 [Kitasatospora setae KM-6054] Length = 154 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 11/136 (8%), Positives = 39/136 (28%), Gaps = 2/136 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ + S++ A++ +S + ++G ++ ++ A ++ R G Sbjct: 17 DRGSISVMVAITAVSLVAVVGLVLDFGGQLRSTERADALAQEAARVAGQQLDITKLRAGG 76 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 ++ A A + + + + + + L Sbjct: 77 GYQVDPKQAVSAGRAYLASQHVSGSVDFPRYPDLSEISVTVDTTY--PTALLGAVGITSL 134 Query: 138 DSSNNTIFYNMDVMTS 153 + + +T Sbjct: 135 SVQGHGKATLLHGITE 150 >gi|154497289|ref|ZP_02035985.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] gi|150273688|gb|EDN00816.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] Length = 234 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 45/161 (27%), Gaps = 17/161 (10%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + L L FLD + ++ + + + + + Sbjct: 34 YGQPIEELNAGLRQFLDELRKDELTCTSAETAVVTFGSSAQCVADF------ATADQIQV 87 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T + A +L K+R +Q ++ ++DG N Sbjct: 88 EPLEANGLTYMGEGLTMALDLLEQRKERYKAAGV---------DYYQPILVVMSDGCPNG 138 Query: 344 F--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 +IC + + +V + I + ++L + Sbjct: 139 DPRVLREAAQRICQMVEARRLTVVAVGIGEGADMEQLRRIS 179 >gi|118081930|ref|XP_414992.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 1794 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 49/188 (26%), Gaps = 28/188 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K+ L + + V + R + + V Sbjct: 495 FNKVKDFLFRIISYFPKIGPEGTQVAVAQYSEEPRAAFHFNQHQDRNGALKAVKEL--HY 552 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + + + L Q + F ++ +TDG + + Sbjct: 553 AGGNTKTGRGIAFMLKELF-------------QPSRGMRPEFPHVLMLVTDGRSQDDVLP 599 Query: 348 VNTIKICDKAKENFIKIVTISI-NASP---NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 A I+I+ + + A P N L + + V D + + Sbjct: 600 P-----ARAAHALGIRIIAVGVSGADPAELNDILLQQNLQ----NVFYVSTFDEFPQILR 650 Query: 404 NISQLMVH 411 + +++ Sbjct: 651 ELIEVICS 658 >gi|83941161|ref|ZP_00953623.1| hypothetical protein EE36_02993 [Sulfitobacter sp. EE-36] gi|83955720|ref|ZP_00964300.1| hypothetical protein NAS141_07935 [Sulfitobacter sp. NAS-14.1] gi|83840014|gb|EAP79190.1| hypothetical protein NAS141_07935 [Sulfitobacter sp. NAS-14.1] gi|83846981|gb|EAP84856.1| hypothetical protein EE36_02993 [Sulfitobacter sp. EE-36] Length = 179 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 6/60 (10%), Positives = 20/60 (33%), Gaps = 1/60 (1%) Query: 3 LLSRFRFYFKKGIASEKANFSII-FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 +++R ++ E +I F + + + V + ++ + A+ Sbjct: 1 MIARISQALRRFRRGEDGAVVLIEFVILFPVIFGMFLASVEVSLYSLRQFHLNRGLETAV 60 >gi|158337332|ref|YP_001518507.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307573|gb|ABW29190.1| von Willebrand factor, type A domain protein [Acaryochloris marina MBIC11017] Length = 686 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 71/230 (30%), Gaps = 52/230 (22%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++D+SGSM K+ K+A +K +L L + + V Sbjct: 309 PSNLVFLIDVSGSM-------------------KRPNKLALVKKSLCLLVHQLKPEDRVS 349 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y R + + GT+K + T +K AY + Sbjct: 350 ------LVVYAGRAGIVLPSTPGTQKA-TIMNAIDRLEAGGSTAGAAGIKMAYDMAERHF 402 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS-NVNTIKICDKAKENFIKIVTIS 368 ++ +I TDG+ N +S + ++ ++ ++ + + + Sbjct: 403 LKNGNN----------------RVILATDGDFNVGQSSDAELERLIEQKRDRGVFLTVLG 446 Query: 369 INASPNGQRLLKTCVSSP-EYHYNVVN--------ADSLIHVFQNISQLM 409 ++ + + + + L I++ + Sbjct: 447 YGTGNYKDNKMELLANKGNGNYAYIDTLLEAQKVLVNDLRGTLFTIAKDV 496 >gi|94498562|ref|ZP_01305117.1| hypothetical protein SKA58_08314 [Sphingomonas sp. SKA58] gi|94422005|gb|EAT07051.1| hypothetical protein SKA58_08314 [Sphingomonas sp. SKA58] Length = 216 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/114 (14%), Positives = 34/114 (29%), Gaps = 1/114 (0%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAG 64 R + +K + F L++ FL + I + ++ N + AA L Sbjct: 2 RVPHFVQKLARDRSGLALVEFGLALPLFLGFVLSGIEMANYVMANNRTQRLTTMAADLVA 61 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 S + + ++ S A D + + I + E Sbjct: 62 QSGAGAIGTSEAQIYDLFSALDLTAQPFDLRNHGRIVITAVKGTDTNADNVVES 115 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/372 (8%), Positives = 82/372 (22%), Gaps = 89/372 (23%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTI 144 K+ R + +K +G + +N+ ++ + + Sbjct: 135 FKKNDDYKDNRLVSYELKGKYAGSQNQLSIHTAMSALNTKQVFSKVAKGKRGIALAYRDD 194 Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + + + V Sbjct: 195 PIEGQANVAISFVFDASGSMEFSLDGTEKV------------------------------ 224 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 N K R+++ L+ + + + +V ++ + + Sbjct: 225 ----------NPYSNNPLKNRSRIDILREKTKKMMADLQPIGNVSVNLVQFNSHASFVQQ 274 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 IE G + + T+ ++ L S+ + + GV Sbjct: 275 NFIELDKGLTSINSAIDNLNPEHA---TNPGDGLRYGMVSLQSNAAQLKYVVLLTDGVPN 331 Query: 325 P-------SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK------------------- 358 + K DG N N + Sbjct: 332 SYMVGPQYNYWGNKIETRTRDGFVNQAYFNTWKYDLSTGFHSTRANSAFAGGNPSWAVDG 391 Query: 359 ------------ENFI-KIVTISINA-SPNGQRLLKT-CVSSPE----YHYNVVNADSLI 399 + I ++ I +A + Q+ + + +V + L Sbjct: 392 SIEYSGEVSKQFKKGIKRVNVIGFSAKEADKQQGSRLTAAIKEGVPETSYTDVSDDKQLE 451 Query: 400 HVFQNISQLMVH 411 F +I + + Sbjct: 452 QTFADIKKQVEQ 463 >gi|125586597|gb|EAZ27261.1| hypothetical protein OsJ_11198 [Oryza sativa Japonica Group] Length = 405 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/185 (8%), Positives = 44/185 (23%), Gaps = 42/185 (22%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTR 282 K+ +K ALL + + + + ++ ++ + + + + Sbjct: 82 AGDKLDRMKAALLFVIRKLADVDC------LSIVTFSNDAARLYPLRFDAGDAAWADLKA 135 Query: 283 DMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 +D L T+ ++ + + Sbjct: 136 LVDGLVADGNTNIRAGLEIGLAVAAGRRLTVGRAQNRGD--------------------- 174 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIH 400 + + + T + A + L S + V + +L Sbjct: 175 ------------ATRLDPGGVPVHTFGLGADHDPAVLQAIAGKSREGMFHYVADDVNLTA 222 Query: 401 VFQNI 405 F + Sbjct: 223 PFSQL 227 >gi|163848230|ref|YP_001636274.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526140|ref|YP_002570611.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669519|gb|ABY35885.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450019|gb|ACM54285.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 419 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/257 (13%), Positives = 71/257 (27%), Gaps = 57/257 (22%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 V + R + V+D SGSM K Sbjct: 20 TTPQVAYVLLEAQPGAVLTQVRVPIHVCFVLDRSGSM--------------------KGE 59 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L+ A++ ++ ++ + ++ + R E + + + + Sbjct: 60 KIERLRQAVVRAIEQLNEQDI------ISIVIFDHRTEVLVP-AQPVRQRTAILDLVHRI 112 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T PA+++ Q L + + ++ LTDG+ + Sbjct: 113 RDAGGTRIAPALEKGIQELQKMPQGV------------------RRLVLLTDGQ-TEHEK 153 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ--- 403 D A I I + I N L++ S + ++ FQ Sbjct: 154 ECLLR--ADDAGRLGIPITALGIGKDWNEDLLIEMANRSRGVADYIAQPGEIVQYFQHTV 211 Query: 404 ------NISQLMVHRKY 414 I ++ + Sbjct: 212 QRAQQTTIQNSVLTLRL 228 >gi|13475415|ref|NP_106979.1| hypothetical protein mll6477 [Mesorhizobium loti MAFF303099] gi|14026167|dbj|BAB52765.1| mll6477 [Mesorhizobium loti MAFF303099] Length = 183 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 38/140 (27%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + Y + ++ + L L+L + + + K ME+ A Sbjct: 4 MFKMLSRYLGRFRHDQRGAVMVEMTLITPLMLVLSAGVFEFGNLIHDKLLMEAGLTDAAR 63 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + G + + A + + + + + + + + Sbjct: 64 YAARCNSQLYTDSGLAAINCTTTAANIAVYGKPSVTVVNGAVTDTPRVSGWQTSNVTVTT 123 Query: 123 NSSRISMTHMANNRLDSSNN 142 N+S + S+ Sbjct: 124 NNSCQDTVVGGVTQYRSTTA 143 >gi|302870519|ref|YP_003839156.1| hypothetical protein Micau_6084 [Micromonospora aurantiaca ATCC 27029] gi|302573378|gb|ADL49580.1| hypothetical protein Micau_6084 [Micromonospora aurantiaca ATCC 27029] Length = 148 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 42/151 (27%), Gaps = 12/151 (7%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++R R ++ ++ + V+ L+L G ++ + K + L Sbjct: 1 MNRLRTRLRQARHDDRGQITPWTIFGVVLVLILAGLVLDLGLGMSDKVRL--------LD 52 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A +R D + A + A ++ +++ T V Sbjct: 53 VAQGAARAGAREIDLATYRATGAVQLQPAQAAAAARSFAQQAGVDGQVTASATPTAVTVT 112 Query: 124 SS----RISMTHMANNRLDSSNNTIFYNMDV 150 S + + L S + Sbjct: 113 ISGTRQTQLLHLVGITALPVSATATANPLTG 143 >gi|188581386|ref|YP_001924831.1| hypothetical protein Mpop_2134 [Methylobacterium populi BJ001] gi|179344884|gb|ACB80296.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 543 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ SI+ AL + + L F + + + + ++ A + A L+ AS + + Sbjct: 1 MRFRSDQRGAVSIVTALGLTTLLGAAAFGLDLSRLYGTQRRVQGAADLAALSAASDLSTA 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN---IVNSSRIS 128 + + A ++++ + S ++ Sbjct: 61 DAAARRALADNGFGDGTRIGVQAGAYLRSAAVANGSRFTPGATAPNAVRVSLTAPVPLTF 120 Query: 129 MTHMANNR-LDSSNNTIF 145 ++ D S Sbjct: 121 GRYLGLPTHYDVSAVGTA 138 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 9/194 (4%), Positives = 39/194 (20%), Gaps = 15/194 (7%) Query: 35 LLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 +G + + +++A ++ LA A ++ + + A + Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPDAISRAVSAG----AAAGNLNG 98 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +A + ++ ++ + + Sbjct: 99 VNLQSANWSGQGQITAADISFRDASY-----------LSTTSATAAVYSQCTHTQSNIGM 147 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 N + + + I + + + Sbjct: 148 WLLKALGAFSGNSASFPATGNVGTYAVATRASAQSACPIPVALQAKTPGATRPDYGYTVG 207 Query: 215 NSAPICQDKKRTKM 228 + + Sbjct: 208 DWLTLLAKSNGGST 221 >gi|90409149|ref|ZP_01217268.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] gi|90309757|gb|EAS37923.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] Length = 226 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 46/139 (33%), Gaps = 17/139 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + L +DLSGSM+ + T+++ +K+ L F+ + ++ Sbjct: 87 NMILSLDLSGSMNEVDM----------RLDGQSVTRLSLVKSLLKKFVAT-------RQG 129 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +GLI + ++ + + Q + L+ T ++ A + +K + Sbjct: 130 DRLGLILFADHAYLQTPLTFDLKTIAQRIEETQIGLVGIRTAIGESIAIAIKRFVKNKNK 189 Query: 312 SFFTNFFRQGVKIPSLPFQ 330 G Sbjct: 190 QRILILLTDGSNTAGRIKP 208 >gi|323137420|ref|ZP_08072498.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322397407|gb|EFX99930.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 208 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 27/69 (39%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 IA + ++ FA+ + FL LIG + +++ ++ A A A + + Sbjct: 24 FIADREGTTAVEFAMIAVPFLGLIGAIFETGTIYFRTAQLQMATETASRAVLTHSTAAGL 83 Query: 74 RLGDRFESI 82 ++ Sbjct: 84 TYKQFVDNN 92 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/152 (11%), Positives = 41/152 (26%), Gaps = 24/152 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD--SLILKPTDSTPAMK 299 + + +I ++ + + + S T++ ++ Sbjct: 77 LKGFDVAPNTTRVAVISFSEDAVVHADFLKDPGNKCHLSRKMQGVHSANQGATNTGAGLQ 136 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A+++ +K +I +TDG +K +K K Sbjct: 137 AAWEVFQ-----------------RSRPTAKKLLILVTDGMATMGPD---PVKKAEKLKN 176 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + I I L+ S+P + Sbjct: 177 MGVDIFVFGIGRMLKQH--LEQLASTPANVSD 206 >gi|156743748|ref|YP_001433877.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235076|gb|ABU59859.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 936 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/360 (10%), Positives = 92/360 (25%), Gaps = 54/360 (15%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A V N +R + ++ + + A + + + Sbjct: 271 AYLEVPNDARTQNNAIETFSYVRGTPRVLLVAQAPDDAISLERALRAARIEVTVVSPASI 330 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTS--YDYRLQFIE----------HLLNQRYNQK 172 + + + N + Y F Sbjct: 331 PATFGELIRYDAIALINVPRALFSNETVQRIAAYVRDFGGGLLMVGGPQSFGPGGWRGTP 390 Query: 173 IVSFIPALLRIE-MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 + + +P + I +P I +V+D+SGSM + P K++ Sbjct: 391 VEAALPVTMDIPERQRQPPVSIVVVIDISGSMAATEDGIP---------------KLSLA 435 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + +I + R + P G+ + + L Sbjct: 436 LEGARRIAALLRDED------ELTVIPFDDRPGVIVGPLPGSRRDVAIEQLNQVRLGGSG 489 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + A++ A + S + II +TDG + + + Sbjct: 490 INIHDALRVAARY------------------TRASERPVRHIITITDGNDTTQQ--EGAL 529 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + + + +++I + + + A + + +Q ++ Sbjct: 530 DIVRSLHDEGVTLTSVAIGQGDHVPFIRDMAAVGGGRTFLTERAADVPDLLTGETQTIMT 589 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/268 (11%), Positives = 79/268 (29%), Gaps = 45/268 (16%) Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL---IELVVDLSGSM 203 T F +H+ Q Y + ++ + V+D+SGSM Sbjct: 286 WRPKATDLPSTALFTQHVEGQGYGLLLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSM 345 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + K AL L + + + Sbjct: 346 --------------------YGESIEQAKQALRYGLQQLQPEDSFNIVTFNHEAMLYS-- 383 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E+ + + + + + T+ A+K A+ + + Sbjct: 384 EQLLPVTS--STITRALRFVDGLDADGGTEMAAALKAAFS--------------IKTHDQ 427 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + S + I+F+TDG N + + I+ + + ++ T+ I ++PN + + + Sbjct: 428 LNSTRWLNQIVFITDGSVGNESALFDLIE--QQLVDR--RLFTVGIGSAPNSYFMTRAAM 483 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + + + + Sbjct: 484 KGKGTYTYIGDVKEVNTKMRLLFSKISQ 511 >gi|73998866|ref|XP_535021.2| PREDICTED: similar to A-domain containing protein similar to matrilin and collagen [Canis familiaris] Length = 788 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G I +++ + + T++ A+K Sbjct: 97 DALDINPERVRVGAIQFSSAPHLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLL 156 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + +I +TDG + + + KE + Sbjct: 157 RRGFPGGR---------------NASVPQILIVITDGRSQGHVAVPT-----KQLKERGV 196 Query: 363 KIVTISI 369 I + + Sbjct: 197 TIFAVGV 203 >gi|320533438|ref|ZP_08034121.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320134346|gb|EFW26611.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 369 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + + + +GL+ + + + + + + +R + D L + P + T Sbjct: 132 SSVLTTFSEILEDFDGERVGLVAWNSAAQTIVPLTDDYDLLRDQLQELGDVLDIDPKNVT 191 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL---------PFQKFIIFLTDGENNNFKS 346 ++ Y+ + + G + S + II TD + + Sbjct: 192 LKQQRDYEEAFGGTQTTGVNGSSLAGDGLASCAQAFDNQGLDRSRSIILATDNQVIDPDD 251 Query: 347 NV--NTIKICDKAKENFIKIVTISINASPNGQ-------------RLLKTCVSSP--EYH 389 + E I++ +I A + LKT P Sbjct: 252 EQIYPLPDAVNLLAERKIRLFSIY-GADDDQDFQNLLDKSPEESREELKTVTEGPGKGRF 310 Query: 390 YNVVNA 395 Y+V ++ Sbjct: 311 YDVEDS 316 >gi|320105056|ref|YP_004180647.1| hypothetical protein Isop_3541 [Isosphaera pallida ATCC 43644] gi|319752338|gb|ADV64098.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 818 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 60/200 (30%), Gaps = 7/200 (3%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + + + I ++VD S S + D+ A +A A D Sbjct: 86 ASLTVERTGLPTIVVMVDDSASQGLSDRPADPDLARAIRDLGAYAPGIATNPTASQSQSD 145 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAM 298 S L + + ++ + + V+ +L + PA+ Sbjct: 146 SRASLEIPRIALTQAILSRDNGAF-LRALQAKHKVRIETVSTATRTLAVIEDEDGIAPAL 204 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + A L + S RQ + II TDG+ + + AK Sbjct: 205 E-AIAKLEASGSSSKLGAGVRQVLTQLRGVAPTAIIMFTDGQTTEGED---LASAAELAK 260 Query: 359 ENFIKIVTISINASPNGQRL 378 + I+T+ + + L Sbjct: 261 RKGVPIITVGLGDPDAPRDL 280 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 26/186 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + T K + + Sbjct: 591 SNFRLMLEFISNVAKAFEISDIGSKIATVQFTYDQRTEFSFTDYTTKEKVLSAIRNIRYM 650 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + G + F++ LTDG++ + Sbjct: 651 SGGTATGDAI---------SFTTRNVFGPVKDGAN------KNFLVILTDGQSYDD---- 691 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A++ I + ++ + A + LK S P + + L + ++ Sbjct: 692 -VRGPAVAAQKAGITVFSVGVAWAPLDD---LKDMASEPRESHTFFTREFTGLEQMVPDV 747 Query: 406 SQLMVH 411 + + Sbjct: 748 IRGICK 753 >gi|301755498|ref|XP_002913610.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Ailuropoda melanoleuca] Length = 765 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/127 (10%), Positives = 36/127 (28%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G + +++ + + T++ A+K Sbjct: 79 DALDINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + ++ +TDG + + KE + Sbjct: 139 RRGFPGGR---------------NASVPQILVVITDGRSQGPVELP-----AKQLKERGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|281347736|gb|EFB23320.1| hypothetical protein PANDA_001404 [Ailuropoda melanoleuca] Length = 708 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/127 (10%), Positives = 36/127 (28%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G + +++ + + T++ A+K Sbjct: 37 DALDINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLL 96 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + ++ +TDG + + KE + Sbjct: 97 RRGFPGGR---------------NASVPQILVVITDGRSQGPVELP-----AKQLKERGV 136 Query: 363 KIVTISI 369 + + + Sbjct: 137 TVFAVGV 143 >gi|118346597|ref|XP_976944.1| TPR Domain containing protein [Tetrahymena thermophila] gi|89288544|gb|EAR86532.1| TPR Domain containing protein [Tetrahymena thermophila SB210] Length = 1163 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 83/245 (33%), Gaps = 42/245 (17%) Query: 187 ERPIFLIELVVDLSGSMHCA------MNSDPEDVNSAPICQDKKRTKMAALKN------- 233 E+ I ++ D S SM D +D + + + + K+ + KN Sbjct: 922 EKKWKDIVIIADFSYSMRHKVIGSEENELDADDTDEDESGKTENKEKIGSNKNQRQARSQ 981 Query: 234 ---ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSL 287 A+ FL D ++V+ D + I + V+ E + T +R + + Sbjct: 982 KYIAVKKFLQIFD--NYVQNDDRVAFIKFNENVDVIFELNEKENNTIFLRNSIKSFLKIC 1039 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + A+ A ++ + K+I+ TDG ++ K + Sbjct: 1040 PEGESAVRQAIYTAIKLFQKAVPKD----------------HSKWIVIFTDGGDSCSKIS 1083 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSSPEYHYNVVN--ADSLIHVFQN 404 ++ ++ + I+ I ++ + C + + N + + FQ Sbjct: 1084 EP--ELMQMLSQSDVNIMIIGCGLDKYATKIFRQYCEKTKNGIFIKTNNKKEEIDIAFQA 1141 Query: 405 ISQLM 409 I+ + Sbjct: 1142 ITNKI 1146 >gi|73954244|ref|XP_546326.2| PREDICTED: similar to Integrin alpha-2 precursor (Platelet membrane glycoprotein Ia) (GPIa) (Collagen receptor) (VLA-2 alpha chain) (CD49b) [Canis familiaris] Length = 1171 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 80/284 (28%), Gaps = 35/284 (12%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI--ELVV 197 + +I ++ T+ L ++ + + + + + Sbjct: 89 TATSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGSQYYATGVCSDVSPDFQI 148 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGL 256 S + DV + A+ FL+ L + +GL Sbjct: 149 LTSFAPAAQTCPSLIDVVVVCDESNSIY-----PWEAVKNFLEKFVQGLDIGPKKTQVGL 203 Query: 257 IGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I Y + + K T T++ A++ A Sbjct: 204 IQYANKPRVVFNLNTFKTKAEMTEATSQTYQYGGDLTNTFKAIQYAKD------------ 251 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 F K ++ +TDGE+++ I C+ + I I++ N Sbjct: 252 -FAYAAAAGGRPGATKVMVVVTDGESHDGSMLKAVIDQCN---NDNILRFGIAVLGYLNR 307 Query: 376 QRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 308 NALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQI 351 >gi|283807143|pdb|3K6S|A Chain A, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807145|pdb|3K6S|C Chain C, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807147|pdb|3K6S|E Chain E, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807149|pdb|3K6S|G Chain G, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807156|pdb|3K71|A Chain A, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807158|pdb|3K71|C Chain C, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807160|pdb|3K71|E Chain E, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807162|pdb|3K71|G Chain G, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807164|pdb|3K72|A Chain A, Structure Of Integrin Alphax Beta2 gi|283807166|pdb|3K72|C Chain C, Structure Of Integrin Alphax Beta2 Length = 1095 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 164 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 223 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 224 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 267 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 268 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 314 >gi|256827824|ref|YP_003156552.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] gi|256577000|gb|ACU88136.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] Length = 2452 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/354 (11%), Positives = 74/354 (20%), Gaps = 26/354 (7%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A S + G IS N + Sbjct: 1537 AAAIQSAVTPLYGSDGSGGTVISGFELAISGGGELDSGLNSNGLDIILSKDGDDVVGRTT 1596 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA- 179 RIS+ L S L + + Sbjct: 1597 AGEVFRISVDSTGTVVLKQSAEVDHIEGTPNDDLISLADSKVFLEATATVTDGDNDVATR 1656 Query: 180 --LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--RTKMAALKNAL 235 + + R + + + S S + + +T+M A A+ Sbjct: 1657 TLSVDLGGNIRFVDDVPSAENDSVSAAENWTGAKTYNLMLIVDRSGSINQTEMQAAVVAM 1716 Query: 236 LLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 LD ++ E V + ++ + + + Y+ +S T+ Sbjct: 1717 NSLLDKYAEVALGGEAGVQVQVVTFAVDGTLVHGSPVSIAEAKAYLDILDNSGGSGNTNY 1776 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 G + + + F++DG V Sbjct: 1777 DA--------------AVAAATPAIDGWPVATADHDNVVYFISDGAPTVGNGTVGLTNAE 1822 Query: 355 DKA-----KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA-DSLIHVF 402 + A + I + S L N DSL+ Sbjct: 1823 EAAWETTLESRGATSWAIGVGTSNAVDDDLADVAYPDGNVLLASNFNDSLLDAL 1876 >gi|154507989|ref|ZP_02043631.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] gi|153797623|gb|EDN80043.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] Length = 338 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 39/166 (23%), Gaps = 18/166 (10%) Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + + + + D++ T A Q L N Sbjct: 135 WDAYSMTMFPLTDDYDMATDVLQDMSDTIDTGLTRIG-GRLSATQELFDYLAPVMDENQE 193 Query: 319 RQ------------GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 G + I+ TD E + I AK + + Sbjct: 194 VSSIVGDGLASCVMGFDHNDKQRSRTILLATDNEVYGDGVYNLSEAI-QFAKSQGVTVTA 252 Query: 367 ISINAS---PNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + + L+ + Y+ + S+ V + I Sbjct: 253 LYPGSDITLSSEALQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|34452173|ref|NP_000878.2| integrin alpha-X precursor [Homo sapiens] gi|146345441|sp|P20702|ITAX_HUMAN RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leu M5; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor Length = 1163 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|23398603|gb|AAH38237.1| ITGAX protein [Homo sapiens] gi|123983342|gb|ABM83412.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] gi|123998043|gb|ABM86623.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] Length = 1169 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|60654537|gb|AAX29959.1| integrin alpha X [synthetic construct] Length = 1170 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 56/191 (29%), Gaps = 30/191 (15%) Query: 225 RTKMAALKNALLLFLDSIDL----LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK----V 276 ++ ALK AL + + + + LI ++ VE + Sbjct: 361 GKRLEALKAALGNLSGADTSLGWRFAAFADRERVTLIPFSGDVEAVRSFQVNKASRAADL 420 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T+ A+ +AY+ + S+ + ++ + Sbjct: 421 QAIAAAGGALQAGGGTNIYGALSEAYRQAAAAPAGSYTS-----------------VVLM 463 Query: 337 TDGENNNFKSNVNTIK--ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 TDGE S A +K T+ + Q + + + ++ Sbjct: 464 TDGEGTAGPSLNEFRDFYAALPAGARSVKTFTVLFG-DSDVQEMNEVAALTGGRTFD--G 520 Query: 395 ADSLIHVFQNI 405 +L F+ I Sbjct: 521 QQNLAAAFKEI 531 >gi|269128868|ref|YP_003302238.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268313826|gb|ACZ00201.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 515 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 10/85 (11%), Positives = 30/85 (35%), Gaps = 4/85 (4%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQR 377 K ++ +TDG N++ + + +++V I + + Sbjct: 418 RKTYKPDMVNTVLLMTDGRNDDDDGPTLRQTLAKLRAGHDPERPVQLVIIGFGDEVDRKE 477 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVF 402 L + ++ + A+ + ++F Sbjct: 478 LQQLAEATGGSVHFAKTAEDMRNIF 502 >gi|156403935|ref|XP_001640163.1| predicted protein [Nematostella vectensis] gi|156227296|gb|EDO48100.1| predicted protein [Nematostella vectensis] Length = 247 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/237 (9%), Positives = 63/237 (26%), Gaps = 41/237 (17%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-LS 246 R + V+D S SM + F+ + Sbjct: 29 RRRMDVAFVIDRSASMGDENFG------------------------YIKQFIKKVSHEFL 64 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 K+ V +G++ ++ + + L Sbjct: 65 LSKDSVQVGVVPFSHHYALEFGLT-NYTNHKALDAAIDKIQFEGSFSMLSG------ALK 117 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 +++ F K + ++ + DG N + + + K+ ++ Sbjct: 118 VVQQKLFMPQLVEVKSKAKKDKPLQAVVIVGDGGNLSGSDALYESSLA--LKDTGKRLFV 175 Query: 367 ISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQLMVH------RKYSV 416 + + + P+ + +N +L + + ++ + R++SV Sbjct: 176 VGLGRLEYEASMRMLASEPPKTHFFNAGTGKNLKNFVKRLANSICKDICPERRRFSV 232 >gi|114569600|ref|YP_756280.1| von Willebrand factor, type A [Maricaulis maris MCS10] gi|114340062|gb|ABI65342.1| von Willebrand factor, type A [Maricaulis maris MCS10] Length = 555 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 68/239 (28%), Gaps = 49/239 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ERP + ++D+SGSM+ K+ A+ L +D + Sbjct: 183 ERPRANLVFLIDVSGSMN-------------------SPDKLPLAVQAMHLLVDELHPDD 223 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ Y + + P+ R+ T + AY + Sbjct: 224 ------TVALVVYASASGVVLPPTEARNA-REIHRALDSLSAGGSTAGGAGLALAYDLAE 276 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIV 365 + ++ LTDG+ N + + +++ I + Sbjct: 277 QNF----------------DEDAVNRVMLLTDGDFNVGVTQDERLEDFVARKRDSGIYLS 320 Query: 366 TISINASPNGQRLLKTCV-SSPEYHYNVVNADS-----LIHVFQNISQLMVHRKYSVIL 418 + ++++T + + + + F ++ + K V Sbjct: 321 VMGFGRGNYNDQMMQTIAQAGNGTAAYIDSRQEARRMLVEESFSSLFTIANDVKIQVEF 379 >gi|168204511|ref|ZP_02630516.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens E str. JGS1987] gi|170663839|gb|EDT16522.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens E str. JGS1987] Length = 928 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/291 (11%), Positives = 66/291 (22%), Gaps = 49/291 (16%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR-IEMGERPIFLIELVVD--LSGSMHCAMNS 209 + L +L KI I ++ ++ L+ H S Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLIDRS 200 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + N ++ + + + + Y G + K IE Sbjct: 201 IGNNQYTVYDLYTSNQCHLNFTSNIQEIYNKIPSTVPNQRNNPYAGGTFTQEGLLKAIEL 260 Query: 270 SWGTEKVRQYVTRDMDSLI-----------------LKPTDSTPAMKQAYQILTSDKKRS 312 + + + D L T+ + L + Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEFGGNKKAIFDYNTNYN---GIGVRGLGTSYFFD 317 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISIN 370 T + + NN + K I I TI + Sbjct: 318 TKTQKPYIYSREEVYSALNRSKNKYESIWNNGFPTTL---EAENIKNENPNINIYTIGVE 374 Query: 371 A-------SPNGQ--------------RLLKTCVSSPEYHYNVVNADSLIH 400 Q + L++ SSP + N D + Sbjct: 375 LKKEVYKWDDYRQYYNAEGTVELPEIRKFLESISSSPAEAFVNENVDDIDE 425 >gi|313239055|emb|CBY14037.1| unnamed protein product [Oikopleura dioica] Length = 1055 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGEN---NNFKSNV 348 +S K A+ + + +G + F K T G N S Sbjct: 839 NSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGTVVLLTNAASTD 898 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 ++ D+ KE +++ + + GQ L SSP E Y + L + + I+ Sbjct: 899 EVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENLYTAEESSDLAGLVKTIA 957 Query: 407 QLMVHRKYS 415 + + S Sbjct: 958 DEICATELS 966 >gi|310119165|ref|XP_003118918.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] gi|310126588|ref|XP_003120448.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 535 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 60/192 (31%), Gaps = 32/192 (16%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-- 285 LK+ + I K +I E EK++++V +D Sbjct: 70 FDLLKDIQKEVVQDICSSEACKNS-KADIIFLIDGSESISP--KDFEKMKRFVESMVDIY 126 Query: 286 ------SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T + A+ F K Q+++I +TDG Sbjct: 127 RAIFDVQQMRDGTRTGKALNFTL--------------PFFDSSKGGRPSVQQYLIVITDG 172 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + N I ++ I I + +LL+ + + Y+ VN ++L Sbjct: 173 VAQD-----NVIIPAKALRDKNTIIFAIGVG-EAKKSQLLEI-TNDEDKVYHDVNFEALQ 225 Query: 400 HVFQNISQLMVH 411 ++ + I + Sbjct: 226 NLEKEILSKVCD 237 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 9/90 (10%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + +++I +TDG KS+ + + + + N + I I I N L + Sbjct: 6 ADTGRINVARYVIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGI-REANIDELKEI 59 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + V D L + + + Q + Sbjct: 60 A---KDKIFFVYEFDLLKDIQKEVVQDICS 86 >gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291346197|gb|EFE73101.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 597 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 61/196 (31%), Gaps = 24/196 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + +++M K +L+ LD +GL + T ++ + E Sbjct: 403 PGRNQSRMDVTKESLIQALDQFTPND------EIGLWEFATTLDGEKDYRRLMET----- 451 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTS--DKKRSFFTNFFRQGVKIPSLPFQ---KFIIF 335 R D T + A+ L + + S + ++ Sbjct: 452 KRLGDPAAGGGTHR-EKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFNALVI 510 Query: 336 LTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 LTDG N + + + + KE + ++ I++ + + + + Y Sbjct: 511 LTDGSNQDTNGISRS-GLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGGDGY 569 Query: 391 NVVNADSLIH-VFQNI 405 V + + + Q I Sbjct: 570 EVSDPAEIQAVILQAI 585 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 62/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + Sbjct: 259 SSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLVSVTP--DNVRDGK 316 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 Y+ TD A+++A ++L + I I+FLT Sbjct: 317 IYIHHMS---PTGGTDINGALQRAIKLLND----------YVAHNDIEDRSVS-LIVFLT 362 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + KE + I TI I + + L L+ C + Sbjct: 363 DGKPTVGE--THTLKILNNTKEAARGQVCIFTIGIGDDVDFKLLEKLSLENCGLTRRVLE 420 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 421 EEDAGSQLIGFYDEI 435 >gi|72180809|ref|XP_798930.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] gi|115975272|ref|XP_001180569.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] Length = 964 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 48/156 (30%), Gaps = 15/156 (9%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 TK+A +K+AL LD + + + + + +++ Sbjct: 358 SGSMSGTKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGMLYSTKENVRRAKRF 417 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V + T+ A+ +L ++ ++ +I LTDG Sbjct: 418 VMGLQEMD---NTNLHKAIISGVNMLRAESEQDPQEEEIVS-----------MLIVLTDG 463 Query: 340 ENNNFKSN-VNTIKICDKAKENFIKIVTISINASPN 374 N+ + + + +A + I A + Sbjct: 464 NPNHGEIDKTIIERNVHEAINGDFSLFCIGFGADAD 499 >gi|308492612|ref|XP_003108496.1| hypothetical protein CRE_11099 [Caenorhabditis remanei] gi|308248236|gb|EFO92188.1| hypothetical protein CRE_11099 [Caenorhabditis remanei] Length = 439 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 49/152 (32%), Gaps = 14/152 (9%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + T++V + D TD + AY + K+ Sbjct: 293 WAADDVPTNANVTQEVLTMLDYIPDVTQDNNTDLECLFRYAYDGIDDSKEFD-------- 344 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISINASPNGQRLL 379 +K +I N+ + + + K + E K++ + + + +L Sbjct: 345 ----ERYGIEKVVIVFVAANANDDQDYNESFEFAHKIRTEQDAKVIVVGMGTGLDQAKLS 400 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 K +S + + + D+L + I+ + Sbjct: 401 KLAYAS-GFAFFSTSYDNLQSLIPQINNAICS 431 >gi|15897953|ref|NP_342558.1| hypothetical protein SSO1089 [Sulfolobus solfataricus P2] gi|13814278|gb|AAK41348.1| Hypothetical protein SSO1089 [Sulfolobus solfataricus P2] Length = 436 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 17/86 (19%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-------FIKIVTISINASPNGQRL 378 II LTDG+ + K+ +I+TI I + N + L Sbjct: 202 QSQVPNKIIMLTDGKPTDK----------RNVKDYEKFDIPPNTQIITIGIGSDYNERIL 251 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQN 404 K S Y++ + L VF++ Sbjct: 252 KKLADKSSGKFYHLKDISELPDVFES 277 >gi|242016552|ref|XP_002428850.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513586|gb|EEB16112.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1945 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 64/178 (35%), Gaps = 30/178 (16%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAM 298 I + + +G+I Y+T+ ++ + + + +V K T++ A+ Sbjct: 61 ISDVEVGPKAFRLGVIIYSTQAVDHLSVTDELDLEKCSLLHVQIPKIKYPGKNTNTKEAL 120 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 ++A +ILT K I +TDG +N + + K Sbjct: 121 EKAEKILT-----------------RARKGAVKIIFLITDGFSNMGN----PLPMAQILK 159 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + I T I + N + LLK E+ Y + D F+ ++ +H S Sbjct: 160 DQDTIIYTFGI-INGNARELLKISSQPIEEHAYLFTSFDE----FEKFARRALHEDLS 212 >gi|156355315|ref|XP_001623615.1| predicted protein [Nematostella vectensis] gi|156210333|gb|EDO31515.1| predicted protein [Nematostella vectensis] Length = 1190 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 46/169 (27%), Gaps = 27/169 (15%) Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +G I ++ + + I + + V T + ++ AY + Sbjct: 616 QNNPMSRVGAIDFSEQANEAIPLTSDQAALETGVQALKYQYQNGMTRTDKGLELAYDLFK 675 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNVNTIKICDKAKENFIK 363 ++ + + +I +TDG D + Sbjct: 676 TNVRAD----------------AGQMLIVVTDGHTTPWDGKTGIELIKPPSDAIRSLG-- 717 Query: 364 IVTISINASPN-GQRLLKTCVSSP--EYHYNVVNADSLI---HVFQNIS 406 + TI + L S P + + N D LI F I+ Sbjct: 718 VHTIVAGIDERVNEEELYYMASDPVDKNMVRMNNYDKLISHVEDFTTIA 766 >gi|157375629|ref|YP_001474229.1| vault protein inter-alpha-trypsin subunit [Shewanella sediminis HAW-EB3] gi|157318003|gb|ABV37101.1| vault protein inter-alpha-trypsin domain protein [Shewanella sediminis HAW-EB3] Length = 770 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 62/197 (31%), Gaps = 11/197 (5%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + M K A+ L + + + ++ + E + Sbjct: 383 TSGSMSGSAMEQAKKAMKYALAGLGSDDTFNVIEFNSKVSSLSK-GPIPASTKNIEMANR 441 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +V T+ A++ A L + S + QG S + ++F+TD Sbjct: 442 FVHSLTS---DGGTEMALALEHA---LGQESGGSSWQETGLQGKDEESTSRLRQVLFMTD 495 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G N IK + ++ T+ I ++PN + + + + D + Sbjct: 496 GAVGNEAELFKLIKY----RIGKSRLFTLGIGSAPNSHFMQRAAEFGRGTFTYIGDLDEV 551 Query: 399 IHVFQNISQLMVHRKYS 415 Q + + H + + Sbjct: 552 QEKIQGLLYKIEHPQIT 568 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/413 (9%), Positives = 98/413 (23%), Gaps = 76/413 (18%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSS 125 ++ + L D + F +++ + + V + Sbjct: 158 DYAADNGSAASEEDVNVPAQTVQLWGDERNFENSYLDYNGAYIKKWVEPVPSNNPTADLH 217 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIP 178 T + N LD + + L + Q N ++ + Sbjct: 218 PEDATTLYNVYLDVIGSEKQEIS--PIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVN 275 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHC-----------AMNSDPEDVNSAPICQDKKRTK 227 + + P I + ++ + + D+N ++ + Sbjct: 276 E-ISENLLSDPNMDIR-IGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNR 333 Query: 228 MAA--------LKNALLLFLDSIDLLSHVKEDVYM----GLIGYTTRVEKNIEPS----- 270 LKN + E + + G ++ + + Sbjct: 334 TPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSIWGA 393 Query: 271 ---WGTEKVRQYVTRDMDSLI----LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV- 322 W + + R T + + TN + Sbjct: 394 WSSWSVMGDKIAIDRGDLFKNFETFSGNTS-NAGFTYPVTYASDFDRPVNGTNVQYRYGE 452 Query: 323 -KIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-----------DKAKENFIKIVTISIN 370 K ++ + + N +K +EN I +I + Sbjct: 453 VKEGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLG 512 Query: 371 ASPN----------GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 + G+ +LK + +Y+ N + +I ++IS Sbjct: 513 IDGSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 565 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 58/180 (32%), Gaps = 25/180 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTD 293 + + L ++ V +GL+ Y+ + S + + + ++ T Sbjct: 54 NWTIKIVRDLPIHEDAVRVGLVQYSESAKTEFNLSK-YSERNDIIAHMETLTFMQVEDTR 112 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ +A + + L + II TDG S K Sbjct: 113 TGVALNKADEEIFDFNGG-------------ARLKATRLIIIFTDG-----LSMDKPSKA 154 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + +KI TIS+N+ +L V + + + + + + + H + Sbjct: 155 AKALRRKGVKIYTISVNSIGFIPEMLGI-VGDADNVFGPNDEERIEE---RLLGEVEHSR 210 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/264 (12%), Positives = 74/264 (28%), Gaps = 35/264 (13%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + + + L +D+SGSM +T M K+ Sbjct: 51 IIDVKVDITGANPITRNSDVVLAIDISGSM-------------GNTISGDFKTSMDYAKD 97 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPT 292 A L FL+ + + ++ Y+T I ++ V Q + T Sbjct: 98 AALAFLNQAKANPQNR----IAIVAYSTTASLKIGLTYLNATGVTQITNQINALQATNST 153 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + + ++ L ++ + T + + + + I Sbjct: 154 NIYAGIVRSETELETNGRFDCSTARAIILLTDGVTNVTGTSGNTNCNVSKTSQCVTDAIN 213 Query: 353 ICDKAKE-------NFIKIVTIS----INASPNGQRLLKTCV------SSPEYHYNVVNA 395 AK ++ ++ I+ + N + Y + Sbjct: 214 AATNAKTTTKSSVVYNNQVFSVGLFGGISGNINTNNSDQNIAKYTLDNIQGSAAYITQSG 273 Query: 396 DSLIHVFQNISQLMVHRKYSVILK 419 +L ++ I+ + S+I K Sbjct: 274 ANLTAIYNQIATQISWIAQSLIAK 297 >gi|239943867|ref|ZP_04695804.1| hypothetical protein SrosN15_22911 [Streptomyces roseosporus NRRL 15998] gi|291447330|ref|ZP_06586720.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291350277|gb|EFE77181.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 396 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 67/248 (27%), Gaps = 37/248 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLS----GSMHCAMNSDPEDVNSAPICQDKKRT 226 + +V+ + +R ++ S + V K Sbjct: 169 RPVVAAARPADPLAPEQRRELPFPGTRSVADGLLSSYEHRLRRPSRTVYVLDTSGSMKGR 228 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ----YVTR 282 ++A LK+AL +E + L+ + + V++ + + Sbjct: 229 RLAQLKSALNGLTGDF------REREQVTLLPFGSTVKQVRTHTVDPADPKAGPAAIRAD 282 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T ++ AY L D + +F + I+ +TDGEN Sbjct: 283 AAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS-----------------IVLMTDGENT 325 Query: 343 NFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---S 397 +S + + + L + ++ + S Sbjct: 326 AGRSAAEFGAFYRALPEARRVTPVFPVVFG-DSDRSELEAIAALTGGRLFDGTKEEGPGS 384 Query: 398 LIHVFQNI 405 L F+ I Sbjct: 385 LDGAFEEI 392 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 51/164 (31%), Gaps = 20/164 (12%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + L+ Y++ ++ + S + D + T + + L K Sbjct: 184 NTLFSLMQYSSGLKTHFTFSKFRSSLSPLRLVDPIVQLQGLTFTATGILAVVNELFHSKN 243 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +K +I +TDG+ +K + +A+ I I + Sbjct: 244 G-------------ARRSAKKILIVITDGQK--YKDPWEYRDVIPQAERAGIIRYAIGVG 288 Query: 371 A----SPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLM 409 Q L S ++ + V N +L + + + + + Sbjct: 289 DAFQEPIARQELNTIGSAPSQDHVFKVDNFAALSSIQKQLQEKI 332 >gi|113931618|ref|NP_001039260.1| inter-alpha (globulin) inhibitor H2 [Xenopus (Silurana) tropicalis] gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis] Length = 942 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 63/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ L+ + Sbjct: 305 NILFVIDVSGSMW--------------------GLKMKQTVEAMKSILEDLRPDDQFSII 344 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I + E S + +YV T+ A+ +A IL + Sbjct: 345 DFNHNI-RCWKDELVYASSVEKQDASKYVQSI---QPNGGTNINEALLRAIFILKEASNK 400 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 I+ ++DG+ K + + +++F + ++ I Sbjct: 401 GMLE-----------PNSVSLIVLVSDGDPTVGEIKLSKIQKNVRTNIRDDFA-LHSLGI 448 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + A L ++ +S Sbjct: 449 GFDVDYDFLERLAQENHGMAQRIYGKQDTAAQLKEFYKKVST 490 >gi|326772466|ref|ZP_08231750.1| von Willebrand factor type A domain-containing protein [Actinomyces viscosus C505] gi|326637098|gb|EGE38000.1| von Willebrand factor type A domain-containing protein [Actinomyces viscosus C505] Length = 370 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + + + +G++ + + + + + E +R ++ D L + P + T Sbjct: 133 SSVLTTFSEILEDFDGERVGIVAWNSAAQTIVPLTDDYELLRDQLSELGDVLDIDPENVT 192 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL---------PFQKFIIFLTDGENNNFKS 346 + AYQ S G + S + II TD + + + Sbjct: 193 YKQQLAYQEAFGGTVNSSINGSSLAGDGLASCAQAFDNQGLERSRSIILATDNQVIDPDN 252 Query: 347 NV--NTIKICDKAKENFIKIVTISINASPN-------------GQRLLKTC--VSSPEYH 389 E I++ +I A + + LKT Sbjct: 253 EQIYPLPDAAKLLAERKIRLFSIY-GADEDQPYQDLLDKTPEESREELKTVTEEQGKGRF 311 Query: 390 YNVVNA 395 Y+V ++ Sbjct: 312 YDVEDS 317 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/300 (12%), Positives = 83/300 (27%), Gaps = 51/300 (17%) Query: 145 FYNMDVMTSYDYRLQFIEHL----LNQRYNQKIVSFIPALLRIEMG---ERPIFLIELVV 197 +DV T+ ++ + + L + + S + + G Sbjct: 3446 AKIVDVPTNVNFVIDTSGSMYYGRLLNLDSIHMNSAEKYKVFVNYGATLTAADGTQLYNG 3505 Query: 198 DLSGSM-------------------HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + K A + Sbjct: 3506 SSQSGWVTVTYDQMKAGLQYDGYRAEDPIYIKSSIGEDQTYKLTDFPSVFDMTKQAYQVL 3565 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 +D I ++ K + ++ + + V + + E + + TD + Sbjct: 3566 VDEILTNTNDKSSLNFNVVTFNSTVGGDSSFHYDAES--------NSFVNSRGTDIHNYL 3617 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVK-IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 L + F + + + FLTDG++N SN D A Sbjct: 3618 ----NSLIAGGGTEFEAPLKTISDHIVTDGNTRNVVYFLTDGKDNTGFSNSANNS--DYA 3671 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEY---------HYNV-VNADSLIHVFQNISQ 407 ++++I++ S + ++ + Y + V N + L +F++I Q Sbjct: 3672 ALKHAEVISIAVGPSGDADQVNQIAQLGEGYNNNNDSEPSYSKVITNTNELTDIFKDIGQ 3731 >gi|237689767|gb|ACQ59162.1| factor B/C2 [Larimichthys crocea] Length = 755 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/198 (9%), Positives = 50/198 (25%), Gaps = 20/198 (10%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 ++ + + + I + + V ++ + + + + Sbjct: 257 KYISNTTSVIKKLIQKISSFTVSPNYEIVFFSSEIEEVVNILDFWEKKDVRKDILEKLDT 316 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I + + + + + + II TDG N Sbjct: 317 FGINDRSSTGTDLNLVFFTFWERMSF-----IKTRVGANAFKDHRHVIILFTDGAYNMGG 371 Query: 346 SNVNTIKICDKA-------------KENFIKIVTISINASPNGQRL--LKTCVSSPEYHY 390 S T+ +E ++ + +I A L + ++ + Sbjct: 372 SPAPTLARIKDMVYMNQTGEKEVNSREEYLDVYVFAIGAETFDDDLKSFTVGKAEEKHFF 431 Query: 391 NVVNADSLIHVFQNISQL 408 V D + F +I Sbjct: 432 KVKGIDKMYDAFDDIIDE 449 >gi|163858495|ref|YP_001632793.1| hypothetical protein Bpet4177 [Bordetella petrii DSM 12804] gi|163262223|emb|CAP44525.1| hypothetical protein Bpet4177 [Bordetella petrii] Length = 183 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 54/157 (34%), Gaps = 20/157 (12%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 M +I + ++V + + ++ Q D T A++ A ++ Sbjct: 1 MAIITFGSQVALHQPLASASDIHWQ------DLSAGGMTPLGTALQMAKAMIEDK----- 49 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 +PS ++ ++ ++DG N+ I D + + ++I A Sbjct: 50 --------DVVPSRAYRPTVVLVSDGGPNDAWEKPLNAFISDG-RSAKCDRLAMAIGADA 100 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + L K + + NA L F+ ++ + Sbjct: 101 DEAVLGKFIEGTSNRLFYAENAKQLRDFFKFVTMSVT 137 >gi|331002541|ref|ZP_08326058.1| hypothetical protein HMPREF0491_00920 [Lachnospiraceae oral taxon 107 str. F0167] gi|330409144|gb|EGG88600.1| hypothetical protein HMPREF0491_00920 [Lachnospiraceae oral taxon 107 str. F0167] Length = 1277 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 43/148 (29%), Gaps = 3/148 (2%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 +++ + ++ A + ++ LI +N + A + A+ AG + S L Sbjct: 6 NKRGSLTVFMAGILTVVIIFTTVLIDGGRIILARNIVSGAGDMALNAGLTYYNSVLQDTY 65 Query: 77 ---DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 +++ K + + V + S I +A Sbjct: 66 GLFAISQNMDELEKNLEVYFNATLKSGGLYNQGLVKELVDIALSGRGTEEVSDIMKMKLA 125 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 S T +V L ++ Sbjct: 126 KGDFSISQATGANLSNVNILRAQVLDYM 153 >gi|294011132|ref|YP_003544592.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] gi|292674462|dbj|BAI95980.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] Length = 157 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 40/126 (31%), Gaps = 2/126 (1%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-AILAGASKMV 69 K+ ++ N ++ A+ + +LL V K ++ A A LA A Sbjct: 2 MKRLLSDRYGNSTVELAIIMPVLVLLTCMAGDVAMAFKAKIGLQRAAERTAQLAAAGGYT 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRF-IKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 ++ + + +++ A A + + + + +A ++ Sbjct: 62 NDTTDTSKAYNNLAADAAAAAGVPTGNVTVTPTLLCNATVQTASPEVPCPDGQQTKRYVA 121 Query: 129 MTHMAN 134 + Sbjct: 122 IAISGT 127 >gi|307196324|gb|EFN77937.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Harpegnathos saltator] Length = 2255 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/257 (12%), Positives = 67/257 (26%), Gaps = 46/257 (17%) Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 L ++ +L + + + + + +VD SGS+ + + Sbjct: 36 LDDVDKMLKSKTDILSRRL-KIDIDRLRNKTNQIELMFLVDASGSVGAENFRSELNFVTK 94 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + +A + I S + I + + Sbjct: 95 LLS--------DFTVDATAARVAIITFGSPRNVTRNVDQISRHGGNDHKCYL------LN 140 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + T + A+ +A IL K + +T Sbjct: 141 KQFNEIA--YSGGGTYTRGALLEALAILEKS-----------------REEASKVVFLIT 181 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNA 395 DG +N + K + T I + L S P + Y + + Sbjct: 182 DGFSNGGD----PRPAANLLKNAGATVFTFGIRTGNVEE--LHDIASFPGYTHSYLLDSF 235 Query: 396 DSLIHVFQNISQLMVHR 412 F+ +++ +HR Sbjct: 236 AE----FEALARRALHR 248 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 22/63 (34%), Gaps = 1/63 (1%) Query: 359 ENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ I TI A+ L Y N+ L+ FQ+I + + + + Sbjct: 633 DSRIITHTIGFAANTQANSFLNQIALQGGGGFYQADNSQELLGAFQSILKTVKDVNATFV 692 Query: 418 LKG 420 G Sbjct: 693 SPG 695 >gi|156382057|ref|XP_001632371.1| predicted protein [Nematostella vectensis] gi|156219426|gb|EDO40308.1| predicted protein [Nematostella vectensis] Length = 297 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 26/189 (13%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + LK + + + + ++ Y + + + S+G + V R + Sbjct: 45 SSNFDRLKEFVSTVIGGF---VISPQGTQISVVVYHSSAKTH--LSFGDAQDLISVRRII 99 Query: 285 DSLILKP---TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S+ T + + +A+Q + S + ++ + DG++ Sbjct: 100 SSIAYPSGPQTYTDRGLVEAHQRFAKENGAR-------------SSRTTRVVVVINDGKS 146 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N + K+ I I+ + +S + L S + H A +L + Sbjct: 147 NG----ESLEASSKPLKDEGIVIMALGFGSSVRVEELQTMASSQGDMHLY-STAIALKND 201 Query: 402 FQNISQLMV 410 +I + Sbjct: 202 ANSIVNSIC 210 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/317 (10%), Positives = 74/317 (23%), Gaps = 40/317 (12%) Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + + N V + + S++ S L Sbjct: 838 VNKGQHEDAEANLTTSKTVKVTDYNKREYEITLGASTSGREAGTEAKAASVVLVLDRSGS 897 Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + +V+ + P + +V N ++ ++ ++ Sbjct: 898 M-GADGMTALVNAADTFIDTLKTASPDSQVAVVY---------FNGTQDEDDNTTTSKNF 947 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTR 282 + ++ FL G +EK ++YV Sbjct: 948 TKLNTDEKVKSIKDFL--------SNNGYSYGGTPMGDALEKAKGLLDADQTGNQKYVLF 999 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS--LPFQKFIIFLTDGE 340 D L +D A + + G + + + G Sbjct: 1000 FTDGLPGHSSDDAFNCMVANSAVNCATDIKANATIYTVGYHLSGTLYWHKGDSATSSAGR 1059 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL-KTCVSS------PEYHYNVV 393 ++ N C + S + L K +S +Y Y V Sbjct: 1060 DHGGYYNPYFGGSCKNH------------DLSTSASDFLEKYIATSVQEGSNSKYAYTVD 1107 Query: 394 NADSLIHVFQNISQLMV 410 D L F+ ++ + Sbjct: 1108 KTDDLEAEFKKLAAQIG 1124 >gi|224090451|ref|XP_002195054.1| PREDICTED: similar to Integrin alpha-2 [Taeniopygia guttata] Length = 1178 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 65/204 (31%), Gaps = 32/204 (15%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + A++ L F+ + +GLI Y + K Sbjct: 173 VCDESNSIYPWDAVRAFLKKFVQGL---DIGLNKTQVGLIQYANDPRVVFNLNTYQTKDE 229 Query: 278 QYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 V + T++ A+ A Q S + K ++ Sbjct: 230 -VVKAMEKTFQKGGDLTNTFKAIDNARQYAFSAESG-------------GRPTATKVMVV 275 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISI-NA----SPNGQRLLKTCVS-----S 385 +TDGE+++ + I C+ E+ I I++ + + L+K + Sbjct: 276 VTDGESHDGSNLKTVIGKCN---EDNITRFGIAVLGYLIRHELDTKNLIKEIKGIASHPT 332 Query: 386 PEYHYNVVNADSLIHVFQNISQLM 409 +Y +NV + +L+ + + + Sbjct: 333 EKYFFNVSSEAALLEEAGTLGERI 356 >gi|83648073|ref|YP_436508.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83636116|gb|ABC32083.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 261 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 64/207 (30%), Gaps = 31/207 (14%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + + D C D K K+ K A+ F++ + ++V + G Sbjct: 73 NYYIVFDGSGSMDNTDCGDGK-RKLDVAKTAVKKFVEQLPADANVGVYAFDGQGV----G 127 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E+ + V+Q + + + T + ++ LT+ + + Sbjct: 128 ERTHLATQNRPLVKQMIDQLV---AGGGTPLSAGLEDGKAALTAQAGKQLGYGEY----- 179 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 ++ +TDG + ++ ++ I I TI + L + Sbjct: 180 --------HLVIITDGLASLGYETDGAVQ--TILQDTPINIHTIGFCIGDD-HSLHQ--- 225 Query: 384 SSPEYHYN--VVNADSLIHVFQNISQL 408 P + + DSL+ + Sbjct: 226 --PGLTFYRSASDPDSLMAGLNEVLAE 250 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 60/175 (34%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTR------VEKNIEPSWGTEKVRQYVTRD-MDSLILKPTDS 294 + + + + ++ +++R V+ +PS+ K T + Sbjct: 102 LADFTLAENAARVAIVTFSSRNKVVNHVDHLSKPSYHKHKCSLLEEELPRIKYAGGGTYT 161 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 AM +A ++L + K + +TDG +N Sbjct: 162 KGAMIKAQEVL-----------------RHARPNATKAVFLMTDGYSNGGDPLPE----A 200 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 K K+N ++I T I + + L+ + P E+ Y + + F+ +++ Sbjct: 201 RKLKQNDVQIFTFGIRSGN--VKELQNMATDPAEEHSYFLDSFAE----FEALAR 249 >gi|294012237|ref|YP_003545697.1| hypothetical protein SJA_C1-22510 [Sphingobium japonicum UT26S] gi|292675567|dbj|BAI97085.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 543 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 14/143 (9%), Positives = 40/143 (27%), Gaps = 2/143 (1%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + S + ALS+ + G +++A + A LA AS++ Sbjct: 6 KGLLQSTGGAVAPTVALSLFGLIAAGGIAFDYARMAGLDTELQNAADQAALAAASQLDGT 65 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + A + + ++ + + + + + Sbjct: 66 STAITRATSAAQTLIVNQ-TRFANDGNASGLSVTVPTLTFYSSYDQDADTGTVTTSATA- 123 Query: 132 MANNRLDSSNNTIFYNMDVMTSY 154 ++ Y + + Sbjct: 124 AKFVQVSVGTREAVYALTPVVQL 146 >gi|198430661|ref|XP_002124079.1| PREDICTED: similar to alpha-2 IX collagen [Ciona intestinalis] Length = 1221 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 58/173 (33%), Gaps = 14/173 (8%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ED +G+I Y++ + +I + T+ + K T + A++ A Sbjct: 2 VGNFQIGSEDTQVGVIQYSSCIRPHIWLNNFTDTEQLQSAIARIKFEAKGTATGKALRFA 61 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + KK + + ++ +I +TDG + + + Sbjct: 62 SNRMFNWKKGARELSGG----------VRRILILITDGR--WSVRDDIPGALA-SLESKQ 108 Query: 362 IKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 ++ + + L + +++ D L V + I + +K Sbjct: 109 VERFAVGVGRVVKTLDLGALVSEPKSNHSFHIPFFDELDEVGRKILSTICQQK 161 >gi|268529744|ref|XP_002629998.1| Hypothetical protein CBG13361 [Caenorhabditis briggsae] gi|187028722|emb|CAP32249.1| hypothetical protein CBG_13361 [Caenorhabditis briggsae AF16] Length = 359 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/225 (10%), Positives = 58/225 (25%), Gaps = 24/225 (10%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + L+ V +S S+ I + + + + I Sbjct: 8 TVLLMLVPSVWSISDRPCGNDTSNLWLDLVLVIDNSQPTNTIYETISGIFGAQSRIGTHY 67 Query: 247 HVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQ 303 + +I Y ++++ +T + + + A Sbjct: 68 EDPRSSRVAIITYNNDATTVAQFPTFKSLDQLKTELTVLKNDGTSGNESYLDTGLSSAQA 127 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 IL + ++K ++ + I K N + Sbjct: 128 ILNK---------------TEDRMNYKKVVLVF---ASTYDLLYDRPDYIAHALKSNGVT 169 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNIS 406 ++T+ + LK+ S + + +V L + I+ Sbjct: 170 VITVYTGNDKTVKGQLKSVASD-GFAFQMSVNTTTDLQNALTMIN 213 >gi|328859780|gb|EGG08888.1| hypothetical protein MELLADRAFT_115926 [Melampsora larici-populina 98AG31] Length = 1673 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 38/129 (29%), Gaps = 25/129 (19%) Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 AM AY I+ K+++ + F + + + + +T Sbjct: 1249 ATAMNLAYDIILQRKQKNPLSGFLLLHDQRDTTTKPQMALVMT----------------- 1291 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-------NISQ 407 +A+ + + TI N L + + + V + + +I Sbjct: 1292 -RAEAANVPVHTIGWGKVHNPSCLWQLSNHTGGTYTYVNDLYEIKDTLTGCIGGMISIGA 1350 Query: 408 LMVHRKYSV 416 + SV Sbjct: 1351 TNAKLQISV 1359 >gi|325954651|ref|YP_004238311.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437269|gb|ADX67733.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 336 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 55/161 (34%), Gaps = 30/161 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ QD +++ + L+S+ L+ + + Sbjct: 96 DVSTSMDAQDVAPSRLMKASRIISQSLNSLG-------GDRAALVIFAADGYTISPLTND 148 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + Y+ + + + TD + A ++A +L + K Sbjct: 149 YAAIDSYLGSLSTNLISNQGTDFSAAFREAVSVL------------------KGAPNTSK 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 ++ L+DGE++ N A +N I +V+I I Sbjct: 191 LVVLLSDGEDHESGENQ----SIKLANDNQIHVVSIGIGTD 227 >gi|198436525|ref|XP_002124218.1| PREDICTED: similar to laminin, beta 2 [Ciona intestinalis] Length = 671 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/179 (9%), Positives = 52/179 (29%), Gaps = 4/179 (2%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + ++ + H+ + + N ++ + Sbjct: 53 NIVKEWVKQLAAGMHIGHGHIQIGVVQYSHWYHNRPL-NNQRYIKTEIELGEHMNKDDFD 111 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + +K + + F G K ++ +I LTDG+ + + I+ Sbjct: 112 RAVDRIKYQGFKTYTAHAINKTLEFDFLGPKNRYPDAKRALILLTDGKATDSSYLPDVIQ 171 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++ + I + + + L + + N + L + ++ + Sbjct: 172 HAE--RDRGVIIYAVGVG-DFDSSELQLITHHHENREFELENFEDLDSIVNSLQFQLAT 227 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 81/225 (36%), Gaps = 39/225 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D KN L F++ + + G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGIGN--------FTRAKNCLRDFIEKVASYGVKPKY---G 310 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK------PTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ +VTR ++ + + T++ A+++ Y +++ Sbjct: 311 LVTYATIPKVWVKLRDDNSSDADWVTRILNQISYEDHKLKAGTNTKKALQEVYNMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + N +G II +TDG +N V+ I +E Sbjct: 368 ---WPGNALLEGWNHTRH----VIILMTDGLHNMGGDPVSVIHEIRDFLYIGRDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVF 402 +++ + + N + + ++ + V + ++L VF Sbjct: 421 DYLDVYVFGVGPLVNQENINALASKKDKEQHVFRVKDMENLEDVF 465 >gi|239939820|ref|ZP_04691757.1| hypothetical protein SrosN15_02385 [Streptomyces roseosporus NRRL 15998] gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_03262 [Streptomyces roseosporus NRRL 11379] Length = 592 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 61/196 (31%), Gaps = 24/196 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + +++M K +L+ LD +GL + T ++ + E Sbjct: 398 PGRNQSRMDVTKESLIQALDQFTPND------EIGLWEFATTLDGEKDYRRLMET----- 446 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTS--DKKRSFFTNFFRQGVKIPSLPFQ---KFIIF 335 R D T + A+ L + + S + ++ Sbjct: 447 KRLGDPAAGGGTHR-EKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFNALVI 505 Query: 336 LTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 LTDG N + + + + KE + ++ I++ + + + + Y Sbjct: 506 LTDGSNQDTNGISRS-GLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGGDGY 564 Query: 391 NVVNADSLIH-VFQNI 405 V + + + Q I Sbjct: 565 EVSDPAEIQAVILQAI 580 >gi|149914293|ref|ZP_01902824.1| hypothetical protein RAZWK3B_19871 [Roseobacter sp. AzwK-3b] gi|149811812|gb|EDM71645.1| hypothetical protein RAZWK3B_19871 [Roseobacter sp. AzwK-3b] Length = 177 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 22/56 (39%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 ++ ++ +E N SI F + F++ + F + + + +E + Sbjct: 1 MIRALSGRLRRFRRAEDGNASIEFVIIFPVFMMFLIFALELSFITLRHAMLERGLD 56 >gi|116249090|ref|YP_764931.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253740|emb|CAK12133.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 176 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 60/184 (32%), Gaps = 17/184 (9%) Query: 4 LSRFRFY--FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + R + + ++ + K +I FA+ + ++I +I V + + A +A++ Sbjct: 1 MRRGKHFASLRRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDASV 56 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + + D K I N + S S S + + + Sbjct: 57 HKISRMIRTG-----------EVASSNITLADFKAGICNDMLLSFSCSSGLLVKVNVLSD 105 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 ++S+ + + L + L + + + +S LL Sbjct: 106 LSSAASADPIDDSGNLTVTETYDIGKGSDYILVQTFLPWTAVVNFFSLSSAKLSDGRYLL 165 Query: 182 RIEM 185 + Sbjct: 166 GSSV 169 >gi|296128023|ref|YP_003635273.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296019838|gb|ADG73074.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 500 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 77/241 (31%), Gaps = 46/241 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 R + ++V + R + LVVD SGSM A Sbjct: 135 FRPDHRLVRVGISSAPASAVSRADADLVLVVDCSGSMDEA-------------------G 175 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM K AL + S+ + ++ Y+T + +EP+ + + Sbjct: 176 KMETTKYALRTLVSSLRRTD------RVAMVCYSTEADVYLEPTP-VAEREGVLAAIDRL 228 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ + Y + S + T ++ ++DG N ++ Sbjct: 229 APRDSTNAAAGLALGYDLAMSMRTEGRLT----------------RVVLVSDGVANVGET 272 Query: 347 NVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQ 403 + +I +AK I ++++ + + LL+ +H V VF Sbjct: 273 DPEGILARISSQAKA-GISLISVGVGITTYNDHLLEQLADQGDGWHVYVDGEAEAERVFA 331 Query: 404 N 404 Sbjct: 332 T 332 >gi|187923652|ref|YP_001895294.1| cyclic nucleotide-binding protein [Burkholderia phytofirmans PsJN] gi|187714846|gb|ACD16070.1| cyclic nucleotide-binding protein [Burkholderia phytofirmans PsJN] Length = 596 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 49/164 (29%), Gaps = 17/164 (10%) Query: 4 LSRFRFYFKK-------------GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK 50 + R F ++ G + +++ + + + +G + V + + + Sbjct: 1 MRRLSFAMRRAACRAHPLRMRCAGARRQHGAVAVVAVIWLSVAIAALG-ALDVGNVFFAR 59 Query: 51 NSMESANNAAILAGASKMVSN---LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 ++ + A LA + S + ++ + + A + S S Sbjct: 60 RQLQRTADLAALAAVQMIGSTGGCATATTAAQQNAAANGFTAGGTTSINTTCGRWDTSKS 119 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 Y N V ++++ + S + ++ Sbjct: 120 TYFGTSGNPLNAVQVTTTQVVPYFFLGPSRNVSATATAFASNID 163 >gi|284173928|ref|ZP_06387897.1| hypothetical protein Ssol98_04615 [Sulfolobus solfataricus 98/2] gi|261602665|gb|ACX92268.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 356 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 17/86 (19%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-------FIKIVTISINASPNGQRL 378 II LTDG+ + K+ +I+TI I + N + L Sbjct: 122 QSQVPNKIIMLTDGKPTDK----------RNVKDYEKFDIPPNTQIITIGIGSDYNERIL 171 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQN 404 K S Y++ + L VF++ Sbjct: 172 KKLADKSSGKFYHLKDISELPDVFES 197 >gi|72168566|ref|XP_796840.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115961659|ref|XP_001187264.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 958 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 61/190 (32%), Gaps = 26/190 (13%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++ L A +L + + ++G+ ++ + T+ R Sbjct: 326 VSGSMGGNDRLTKLNQAATQYLRYT-----IDDGSFVGIAHFSDYSRIIENLTEITDNSR 380 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + + S+ PT + +IL G P +I+ ++ Sbjct: 381 EDLVMGLPSIANGPTCIGCGVLDGIKILK--------------GETGMEDPAGGYILLIS 426 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD- 396 DG+ N ++A + + TI+ + + + LL+ V + + + Sbjct: 427 DGQQNRQPYIDEVFDEVEEA---GVIVDTIAFSDAADPN-LLELSVRTNGLGFFYPDTAT 482 Query: 397 --SLIHVFQN 404 +L F Sbjct: 483 STALNDAFTA 492 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 52/161 (32%), Gaps = 31/161 (19%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D ++ K + +D +GL+ + + + + Sbjct: 87 DVSRSMLAEDVSPNRLGRAKQQIKDMVDE-------MPGDRVGLVVFAGETRQTLPLTRH 139 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E +Q + + S+ + A++ A K Sbjct: 140 VEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDK------------------TTDHK 181 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKE-NFIKIVTISINA 371 ++ LTDGE+ + +A E I+I TI + Sbjct: 182 AMVILTDGEDQESD----PVSEAKRAHEEQGIRIFTIGLGD 218 >gi|239945536|ref|ZP_04697473.1| hypothetical protein SrosN15_31402 [Streptomyces roseosporus NRRL 15998] gi|239992004|ref|ZP_04712668.1| hypothetical protein SrosN1_32194 [Streptomyces roseosporus NRRL 11379] Length = 416 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 64/208 (30%), Gaps = 30/208 (14%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++MAA K A LD++ + Sbjct: 31 VELVLDVSGSMRTRD--------------IDGQSRMAAAKQAFNDVLDAVPEQVQLGIRT 76 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R + + P D T A K A L Sbjct: 77 LGADYPGEDR--------------KVGCKDTRQLYPVGPLDRTEA-KTAVATLAPTGFTP 121 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + + I+ +TDGE+ + + A+ + + T+ + + Sbjct: 122 IGPALLGAADDLEGGEGSRRIVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPN 181 Query: 373 PNGQRLLK-TCVSSPEYHYNVVNADSLI 399 ++ L ++ + V + + L Sbjct: 182 AKIRQQLTCIAEATGGTYTAVQHKEELS 209 >gi|152990153|ref|YP_001355875.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422014|dbj|BAF69518.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 549 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 53/160 (33%), Gaps = 33/160 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ D ++ K + F+ +++ + L+ + PS Sbjct: 92 DISNSMKATDYYPDRLQFAKKKIEEFIK-------ASKNLNIALLAFADEAYIVSPPSSD 144 Query: 273 TEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + + SL L+ T+ A+ A +L QK Sbjct: 145 KEALLYMLKHLDTESLALQGTNFLAALMSADMLL--------------------GKEGQK 184 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ TDG + +I AK+ I++ I I Sbjct: 185 SVVLFTDG----GNKEDFSKEIA-FAKKRGIQVHIIGIGT 219 >gi|262165918|ref|ZP_06033655.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] gi|262025634|gb|EEY44302.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] Length = 406 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/207 (10%), Positives = 57/207 (27%), Gaps = 21/207 (10%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + G ++ ++ +++S ++L I V + +++A +AA LA A + Sbjct: 7 RNGRRKQQGLVLVLVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADAT 66 Query: 72 LSRLGDRFESISNHAK--RALIDDAKRFIKNHIKESLSGY-----------SAVFYNTEI 118 + S A A + + + S + Y Sbjct: 67 HDTATATDVAKSTLASMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVS 126 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF-------IEHLLNQRYNQ 171 ++ + M + + S + + + + + + + Sbjct: 127 VADLSLTEFFMQALGMAK-QVSAAAVAGPSSSINTIGNVVPIGVCKGSESDGIYGYHSSD 185 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVD 198 + MG L++ Sbjct: 186 VYALKVGDSSMTAMGSGNYNLLDFGSG 212 >gi|153876590|ref|ZP_02003832.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152066946|gb|EDN66168.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 305 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 47/163 (28%), Gaps = 15/163 (9%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + LL VKE Y L+ + + + + + + T A+ A Sbjct: 152 LQLLEIVKEADYFSLLSFNHSLNWIAKNIQVKSQQKWLKRQFNYQFPGGGTALYDAIFNA 211 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Y L + S F L+ N+ S Sbjct: 212 YTFLQKNSFPDKIAVMIVLSDGGDSHSELNFKDLLSKIPFNSDTSP-------------- 257 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 I+I + + + +RL + + Y+ D + +F+ Sbjct: 258 IRIFAVGYGSITDKKRLNEIAKMTQGKFYDGAMVD-VDKIFKK 299 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/378 (10%), Positives = 94/378 (24%), Gaps = 68/378 (17%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 + KK A + + G + E+I + +A +D I Sbjct: 48 VATDIERLLAKKRK---ALERLASEAERLQKEHRWQDGIKEENIEYYNSKAEMDYDGEDI 104 Query: 99 KNHIKESLSGY---SAVFYNTEIQNIVNSSRISMTHMANNRLDSS---NNTIFYNMDVMT 152 + + L S V + + + + Sbjct: 105 DSQMSLKLDFVYDPSFKNQVNYSHTAVQIPTDIYKGAPVILNELNWTQALERVFIENSRD 164 Query: 153 SYDYRLQFIEHL----------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 Q + ++ + + G + ++VD+SGS Sbjct: 165 DPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGASSPKDMVILVDVSGS 224 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + +K ++ LD++ +V + + + Sbjct: 225 V--------------------SGLTLKLIKASVTEMLDTLSDDDYVN------VARFNEK 258 Query: 263 VEKNIE-----PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 E + + + TD A+ L + Sbjct: 259 AEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNKTN------- 311 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 +P K I+ TDG + + +K +++ T S+ Sbjct: 312 ------VPRANCNKIIMLFTDGGEDRAQDIFEQYNWPNKT----VRVFTFSVGQHNYDVT 361 Query: 378 LLKTCV-SSPEYHYNVVN 394 L+ S+ Y++ + + Sbjct: 362 PLQWIACSNKGYYFEIRS 379 >gi|163751746|ref|ZP_02158964.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] gi|161328398|gb|EDP99557.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] Length = 447 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 11/153 (7%), Positives = 41/153 (26%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F+ + + + ++F + + S L + + K +++A +++ L A Sbjct: 13 FKQFSLAKLGKQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSALYAAK 72 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + S R + + ++ + S + Sbjct: 73 ILQNGGSLFDAREAATLILMQNLGFEENSELNTSVDLSSPDYNATQVTANIFIEFSLWPD 132 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + ++ ++ + Sbjct: 133 PFIPVFDDSSKYVRIRVENVGLNNFFAQILNFD 165 >gi|2159|emb|CAA78125.1| VLA-2 [Sus scrofa] Length = 191 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 59/193 (30%), Gaps = 30/193 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDS 286 A+KN L F+ + +GLI Y + K T Sbjct: 1 WDAVKNFLEKFVQGL---DIGPTKTQVGLIQYANNPRVVFNLNTFKTKAEMVEATSHTTQ 57 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A S K ++ +TDGE+++ Sbjct: 58 YGGDLTNTFKAIQYARDSAYSAAAG-------------GRPGATKVMVVVTDGESHDGSM 104 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNAD 396 I C+ + I I++ N L +K S + Y +NV + Sbjct: 105 LKAVIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEA 161 Query: 397 SLIHVFQNISQLM 409 L+ + + + Sbjct: 162 DLLEKAGTLGEQI 174 >gi|327270792|ref|XP_003220172.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 921 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 66/194 (34%), Gaps = 32/194 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 Q K +++ L A LFL I +++ ++G++ + ++ + Sbjct: 321 ASAQMGKDNRLSRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQAGLQRIFSDIE 375 Query: 278 QY-VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + +T + + + + A+Q+ + + I+ L Sbjct: 376 REGLTSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE----------------IVLL 419 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV--- 393 T+GE ++ ++ + I I TI+ + L K + + Sbjct: 420 TNGEGSDLSPCLSKNQ------SQEIIIHTIAFGSKA-SNELEKLADMTGGKTFYATDSL 472 Query: 394 NADSLIHVFQNISQ 407 +++ LI F IS Sbjct: 473 DSNGLIDAFGGISS 486 >gi|307180277|gb|EFN68310.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Camponotus floridanus] Length = 2214 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 38/131 (29%), Gaps = 29/131 (22%) Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + A+ +A IL K + +TDG +N Sbjct: 173 NITYSGGGTYTRGALLEALTILEKS-----------------REKANKVVFLITDGFSNG 215 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV 401 D K + T I + L S P + Y + + Sbjct: 216 GD----PRPAADLLKNTGATVFTFGIRTGNVEE--LHDIASPPGYTHSYLLDSFAE---- 265 Query: 402 FQNISQLMVHR 412 F+ +++ +HR Sbjct: 266 FEALARRALHR 276 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 57/221 (25%), Gaps = 58/221 (26%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-- 242 + +V+D+SGSM + ++ K A ++++ Sbjct: 108 AASSGPKDVVIVLDVSGSM-------------------SQYGRLDLAKEAAETVINTLGA 148 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D +V + T ++ V L T+ A + + Sbjct: 149 DSFVNVVTFSETARVLLTNSTTLVRATEDNLGELVSLVQNLEFDLANVGTNFGAAFETTF 208 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 IL + + S Q I+FLTDG N Sbjct: 209 DILEASRTSE-----------ETSSNCQTAIVFLTDGNTN-------------------- 237 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + S + + + + +V + L Sbjct: 238 ------VGLSTDEVTSKQIACDTGGIYEHVEDGGDLSQAMA 272 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 54/191 (28%), Gaps = 14/191 (7%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 Q + T K + + + + + +++ N Sbjct: 10 QGRTMTGFQISKAFVKAL---LSEVRVASNATRIAIGTFSSDHRINFNYILNPSYANTKC 66 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + D +K + D F + K +I LTDG Sbjct: 67 KFNDDFKKIK----FDGFMTNIKGSLQDAYNVFRSLDSDPVTHSRRPRSNKVVILLTDGV 122 Query: 341 NNNFKSNVNTIKI-----CDKAKENF-IKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 N + V++ + K+ +++ T+ + Q +LK + P + Sbjct: 123 GNMVGNRVDSAGADGAPEALRLKQTGYVELYTVGV-THATDQNMLKKIATDPSLFLYSKD 181 Query: 395 ADSLIHVFQNI 405 L ++ NI Sbjct: 182 FTDLGNLAANI 192 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 54/162 (33%), Gaps = 30/162 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESINPSLISKQGTAIGEAINLAARSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 II +TDGEN+ + E I++ + + Sbjct: 192 AIIVITDGENHEGGAVEAAKAA----AEKGIQVNVLGVGMPD 229 >gi|319649593|ref|ZP_08003749.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] gi|317398755|gb|EFV79437.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] Length = 461 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 39/116 (33%), Gaps = 21/116 (18%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ +++QAY L + +K II LTDG++ Sbjct: 9 PGGGTEIFTSLEQAYSELEELQ------------------LQRKHIILLTDGQSATNGDY 50 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I + KE I + T+++ + L Y+V ++ + + Sbjct: 51 ELLI---EGGKEKNITLSTVALGQDADRGLLEDLAGMGLGRFYDVTDSSIIPSILS 103 >gi|218693343|gb|ACL01175.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGETPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|293402468|ref|ZP_06646604.1| putative porin-like protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304131|gb|EFE45384.1| putative porin-like protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 1191 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 71/226 (31%), Gaps = 33/226 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I +S S + ++ C + + L +A+ F+ ++ S+V Sbjct: 610 KYIDIMKNGTVSTSGAKTLTNEIVAKYPIVYC-----DRSSVLIDAVSSFVKGMNADSNV 664 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM---DSLILKPTDSTPAMKQAYQIL 305 +I + + + ++ + + + + T+ + A + + Sbjct: 665 S------IITFGNNATDVTKGFVNVGENKEELLQTILKCSGTPYQGTNMADGLTMAKKQI 718 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + + I +DG + ++ D AK + Sbjct: 719 DNTPSSN-----------------NCYSIVFSDGAP-SLPKDIAEQAAVDAAKALKKVSL 760 Query: 366 TISINASPNGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLMV 410 T ++ A + +K SSP+ + N + + Q IS + Sbjct: 761 TYAVCAGEADKDFMKNKIASSPDQFLSTDNMTDIYDLLQIISDEVG 806 >gi|260801245|ref|XP_002595506.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] gi|229280753|gb|EEN51518.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] Length = 1641 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 49/183 (26%), Gaps = 26/183 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSLI 288 N L L +G+ Y + V + + + Sbjct: 602 FPNVLKFILKLAAGFEIGPNAAKLGVYQYGSDVRTEFPIGQ-YNTREDVLNAVLNIQYMN 660 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+++ Y+ +K +I +TDG+ + + Sbjct: 661 QWGTFTGKALEEVYKTF------------------PAGDDAKKVVIIITDGKAMDEEV-- 700 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K K + I + + + L SS + + + D + + + Sbjct: 701 -LRKASQDVKADGAMICAVGVG--GFRLKELSLLASSQDLVFTATDFDKMDAIRDTVLDA 757 Query: 409 MVH 411 + Sbjct: 758 VCE 760 >gi|297465782|ref|XP_609132.4| PREDICTED: collagen, type VI, alpha 3-like isoform 1, partial [Bos taurus] Length = 1803 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 98/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + +++ A+ + A I++ + + L + +I Sbjct: 68 ASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSRIEDGVPQHLVLFLGGKSQDDI 127 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + L + I + + E +++V+ Sbjct: 128 SRYSQV----IKSAGIASLGVGDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNS 183 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + + + + + L Sbjct: 184 FGSSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLR 231 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D + + + +GL+ Y + K +Q + + ++ Sbjct: 232 FVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTK-QQIIDAINKVVYKGGRHANT 290 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ + +F + + +T G+ + + Sbjct: 291 KVGLEHLRR-----------NHFVPEAGSRLDQRVPQIAFVITGGK---SVEDAQEASMA 336 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 337 --LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 387 >gi|297473452|ref|XP_002686619.1| PREDICTED: collagen, type VI, alpha 3-like isoform 4 [Bos taurus] gi|296488814|gb|DAA30927.1| collagen, type VI, alpha 3-like isoform 4 [Bos taurus] Length = 2555 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 98/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + +++ A+ + A I++ + + L + +I Sbjct: 889 ASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSRIEDGVPQHLVLFLGGKSQDDI 948 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + L + I + + E +++V+ Sbjct: 949 SRYSQV----IKSAGIASLGVGDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNS 1004 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + + + + + L Sbjct: 1005 FGSSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLR 1052 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D + + + +GL+ Y + K +Q + + ++ Sbjct: 1053 FVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTK-QQIIDAINKVVYKGGRHANT 1111 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ + +F + + +T G+ + + Sbjct: 1112 KVGLEHLRR-----------NHFVPEAGSRLDQRVPQIAFVITGGK---SVEDAQEASMA 1157 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1158 --LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1208 >gi|260800505|ref|XP_002595170.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] gi|229280413|gb|EEN51181.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] Length = 183 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 44/173 (25%), Gaps = 19/173 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + + + L Y+ V + D Sbjct: 20 HDNFQRMKSFVKRVAGSFAIGPSSAQISLFQYSNFVVQEFALDTYGSIGEINQAVDAVMY 79 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A+ + Q S ++ I LTDG +++ Sbjct: 80 QGGGTATGLALYEMRQYGFSFANG-------------GRPGTRRVAILLTDGMSSDTVDK 126 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I + + I ++ + LL +P+ +++ N L Sbjct: 127 HAMAAW-----QAGISLYVVGIGSNVDMNELLAI-GGTPDNVFSLGNFGQLQD 173 >gi|168209268|ref|ZP_02634893.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712663|gb|EDT24845.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] Length = 928 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/286 (11%), Positives = 69/286 (24%), Gaps = 39/286 (13%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR-IEMGERPIFLIELVVD--LSGSMHCAMNS 209 + L +L KI I ++ ++ L+ H S Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLMDGS 200 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + N ++ + + + + Y G + K IE Sbjct: 201 IGNNQYTVYDLYTSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLKAIEL 260 Query: 270 SWGTEKVRQYVTRDMDSLI---------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + + D L + Y + + + + Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEFGGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKT 320 Query: 321 GVKI---PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISIN----- 370 + E T++ + K+ I I TI I Sbjct: 321 QKPYIYSREEVYSALNRSKNKYEAIWNNGFPTTLE-AENIKKENPDINIYTIGIELKKRV 379 Query: 371 ---------ASPNG-------QRLLKTCVSSPEYHYNVVNADSLIH 400 G ++ L++ SSP + N D + Sbjct: 380 YKWTDNRQYYDAEGIVELPEIRKFLESISSSPAEAFLNENVDDIDE 425 >gi|297473448|ref|XP_002686617.1| PREDICTED: collagen, type VI, alpha 3-like isoform 2 [Bos taurus] gi|296488812|gb|DAA30925.1| collagen, type VI, alpha 3-like isoform 2 [Bos taurus] Length = 2956 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 98/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + +++ A+ + A I++ + + L + +I Sbjct: 1290 ASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSRIEDGVPQHLVLFLGGKSQDDI 1349 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + L + I + + E +++V+ Sbjct: 1350 SRYSQV----IKSAGIASLGVGDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNS 1405 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + + + + + L Sbjct: 1406 FGSSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLR 1453 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D + + + +GL+ Y + K +Q + + ++ Sbjct: 1454 FVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTK-QQIIDAINKVVYKGGRHANT 1512 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ + +F + + +T G+ + + Sbjct: 1513 KVGLEHLRR-----------NHFVPEAGSRLDQRVPQIAFVITGGK---SVEDAQEASMA 1558 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1559 --LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1609 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 52/167 (31%), Gaps = 17/167 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + + K Q + + + Sbjct: 55 LDFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSLNSYSTKA-QVLDAVKALGFIGGELA 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G +++ + Sbjct: 114 NVGLALDF---------VVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDG------ 158 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + A + L+ ++ + V SL V Sbjct: 159 -VIALKQASVFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 204 >gi|297473450|ref|XP_002686618.1| PREDICTED: collagen, type VI, alpha 3-like isoform 3 [Bos taurus] gi|296488813|gb|DAA30926.1| collagen, type VI, alpha 3-like isoform 3 [Bos taurus] Length = 2962 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 98/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + +++ A+ + A I++ + + L + +I Sbjct: 1296 ASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSRIEDGVPQHLVLFLGGKSQDDI 1355 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + L + I + + E +++V+ Sbjct: 1356 SRYSQV----IKSAGIASLGVGDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNS 1411 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + + + + + L Sbjct: 1412 FGSSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLR 1459 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D + + + +GL+ Y + K +Q + + ++ Sbjct: 1460 FVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTK-QQIIDAINKVVYKGGRHANT 1518 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ + +F + + +T G+ + + Sbjct: 1519 KVGLEHLRR-----------NHFVPEAGSRLDQRVPQIAFVITGGK---SVEDAQEASMA 1564 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1565 --LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1615 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 52/167 (31%), Gaps = 17/167 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + + K Q + + + Sbjct: 261 LDFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSLNSYSTKA-QVLDAVKALGFIGGELA 319 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G +++ + Sbjct: 320 NVGLALDF---------VVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDG------ 364 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + A + L+ ++ + V SL V Sbjct: 365 -VIALKQASVFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|297473446|ref|XP_002686616.1| PREDICTED: collagen, type VI, alpha 3-like isoform 1 [Bos taurus] gi|296488811|gb|DAA30924.1| collagen, type VI, alpha 3-like isoform 1 [Bos taurus] Length = 3162 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/355 (7%), Positives = 98/355 (27%), Gaps = 39/355 (10%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A++L ++ + +++ A+ + A I++ + + L + +I Sbjct: 1496 ASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSRIEDGVPQHLVLFLGGKSQDDI 1555 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + L + I + + E +++V+ Sbjct: 1556 SRYSQV----IKSAGIASLGVGDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNS 1611 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + + + + + + L Sbjct: 1612 FGSSGVTPAPPGVDTPSP------------SRPEKKKADIVFLLDGSINFRRDSFQEVLR 1659 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDS 294 + +D + + + +GL+ Y + K +Q + + ++ Sbjct: 1660 FVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTK-QQIIDAINKVVYKGGRHANT 1718 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 ++ + +F + + +T G+ + + Sbjct: 1719 KVGLEHLRR-----------NHFVPEAGSRLDQRVPQIAFVITGGK---SVEDAQEASMA 1764 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +K+ + + + + + + K S+ + V N L + + + + + Sbjct: 1765 --LTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQVLETL 1815 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 52/167 (31%), Gaps = 17/167 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + + K Q + + + Sbjct: 261 LDFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSLNSYSTKA-QVLDAVKALGFIGGELA 319 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G +++ + Sbjct: 320 NVGLALDF---------VVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDG------ 364 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + A + L+ ++ + V SL V Sbjct: 365 -VIALKQASVFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDYLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L + I ++FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNK----------YVAHSDIGDRSVS-LVVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|291221810|ref|XP_002730913.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 858 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 62/194 (31%), Gaps = 28/194 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 ++ L F+ L+ + +D ++G+I +++ E R Sbjct: 365 TSGSMAGNRIERLHRDTTHFI-----LNVIDDDSFVGIIQFSSDATVLSEMKKIDYASRP 419 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ ++ A Q L + + ++ +TD Sbjct: 420 QIAASVPYSANGGTNFGAGIRAALQELKESQLSLKGAS----------------LLIITD 463 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G+ + D+ + +++ TI+ L + +Y V + ++ Sbjct: 464 GQFSYTSDVT------DEVYASGVRVDTIAY-TQAAEDSLRVLSDRTGGSYYYVSDDETS 516 Query: 399 IHVFQNISQLMVHR 412 + +++ + R Sbjct: 517 TELLDSLTSTITDR 530 >gi|254480742|ref|ZP_05093989.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214039325|gb|EEB79985.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 493 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 55/192 (28%), Gaps = 21/192 (10%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ D + + +AL + L D G+ + E + Sbjct: 37 DVSGGMKTSDPRNRRASALVLLVNLLPDGAKA----------GIWVFGKEAELLVPHGLV 86 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 RQ + + +L T+ A++QA L + K+ Sbjct: 87 DAAWRQRAQQAIATLKAAGQRTNIPAALEQATADLEQPTSGYRTSVLLLTAGKVDVAESP 146 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 N +S + E + + TI+ +A + L + Sbjct: 147 ---------IINVNESRKLLNGRAVELGELGVPVHTIAFSAQADAMLLRSLARQTNGTSR 197 Query: 391 NVVNADSLIHVF 402 +AD L +F Sbjct: 198 QADSADELSVMF 209 >gi|194205647|ref|XP_001498059.2| PREDICTED: similar to von Willebrand factor A domain-containing protein 2 precursor (A domain-containing protein similar to matrilin and collagen) (AMACO) (Colon cancer secreted protein 2) (CCSP-2) [Equus caballus] Length = 784 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L + V +G +++ + + T++ A+K Sbjct: 79 DALDINLKRVRVGAFQFSSAPHLEFPLDSFSTQQEVKAKIKRMVFKGGRTETGLALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + + +I +TDG + + + KE I Sbjct: 139 RKGFPGGR---------------NASVPQILIIITDGRSQGHVALP-----AKQLKERGI 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|168214300|ref|ZP_02639925.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] gi|170714242|gb|EDT26424.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] Length = 928 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/286 (11%), Positives = 69/286 (24%), Gaps = 39/286 (13%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLR-IEMGERPIFLIELVVD--LSGSMHCAMNS 209 + L +L KI I ++ ++ L+ H S Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLMDGS 200 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + + + + N ++ + + + + Y G + K IE Sbjct: 201 IGNNQYTVYDLYTSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLKAIEL 260 Query: 270 SWGTEKVRQYVTRDMDSLI---------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + + D L + Y + + + + Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEFGGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKT 320 Query: 321 GVKI---PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISIN----- 370 + E T++ + K+ I I TI I Sbjct: 321 QKPYIYSREEVYSALNRSKNKYEAIWNNGFPTTLE-AENIKKENPDINIYTIGIELKKRV 379 Query: 371 ---------ASPNG-------QRLLKTCVSSPEYHYNVVNADSLIH 400 G ++ L++ SSP + N D + Sbjct: 380 YKWTDNRQYYDAEGIVKLPEIRKFLESISSSPAEAFLNENVDDIDE 425 >gi|294139780|ref|YP_003555758.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326249|dbj|BAJ00980.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1195 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 31/121 (25%), Gaps = 8/121 (6%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 T + Q S K + + V N+ +N + Sbjct: 376 YITDGVPTVDQSANSKVKALPGVSSSDKFVNSSPNFTSYLPALAGWMNTNDVNTNAAFPE 435 Query: 353 ICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 E + TI + + L +T +Y+ +A L + ++ Sbjct: 436 ------EQNVTTFTIGFSDGADDAAPLLTRTAELGGGEYYSAKSATQLQAALSQVFSQIL 489 Query: 411 H 411 Sbjct: 490 E 490 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 70/234 (29%), Gaps = 43/234 (18%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS---HVK 249 I V+D SGSM K+ L A+ L + V+ Sbjct: 304 IVFVLDHSGSM--------------------SGRKIDQLIEAMQNILTDLKETDLFNIVR 343 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + + + + ++ + T ++ A +I+ Sbjct: 344 FGDLAMVWDVSQNQFTQLPNFNEYGNLEPHLREINLPRAVNGT--EENIEAAKKIIEDKS 401 Query: 310 KRSFFTNFFR---------QGVKIPSLPFQKFIIFLTDGENNNFKS----NVNTIKICDK 356 + + + + +Q I+FLTDG N S NT+ + Sbjct: 402 RLGMTNMMYGLEVGLFLIKRTQEETPDKYQPMIVFLTDGHPNAGMSGRDEITNTVTSLNS 461 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNIS 406 K+ I ++S + + L K + + Y +A L ++ IS Sbjct: 462 GKKK-ASIFSLSFGDFADKRFLRKISSKNSGFSRHIYESSDASLQLQDFYRAIS 514 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 22/63 (34%), Gaps = 1/63 (1%) Query: 359 ENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 ++ I TI A+ L Y N+ L+ FQ+I + + + + Sbjct: 638 DSRIITHTIGFAANTEANSFLNQIALQGGGGFYQADNSQELLGAFQSILKTVKDVNATFV 697 Query: 418 LKG 420 G Sbjct: 698 SPG 700 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 67/194 (34%), Gaps = 28/194 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 K + + F+ ++ LI ++ R ++ +E Q V Sbjct: 154 GSGSINKADFER--MKQFVSETTKRLSGRD-TRFALIQFSDRYLEHFNF--NSEDPEQLV 208 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T++ A+++ + L + +K S + +I +TDG Sbjct: 209 --LHIHQVGGWTETATAIRRVVRELFTSQKGS-------------RNGATRILIVITDGV 253 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNA 395 + I +AK+ I I + SP+ +R L S P E+ + V N Sbjct: 254 KTDRLQYSQVIP---EAKQAGIIRYAIGVGDAFSSPDAKRELDDIASEPKAEHIFTVYNF 310 Query: 396 DSLIHVFQNISQLM 409 ++L + + + Sbjct: 311 NALRGIQDQLKDKI 324 >gi|260462605|ref|ZP_05810811.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259031511|gb|EEW32781.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 495 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 3/48 (6%), Positives = 17/48 (35%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME 54 + ++ I+ ++++ + + I + + N ++ Sbjct: 1 MLRTVRAFWHDQRGIALILVSVTLPALIGFSLLAIDMSRVNNLHNDLQ 48 >gi|268578717|ref|XP_002644341.1| Hypothetical protein CBG14163 [Caenorhabditis briggsae] gi|187027709|emb|CAP32779.1| hypothetical protein CBG_14163 [Caenorhabditis briggsae AF16] Length = 2222 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 59/180 (32%), Gaps = 7/180 (3%) Query: 233 NALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + F+ + ++GL+ +E P+ + + + +L+ Sbjct: 2037 DTFTKFIQTFMINYDVGISGAHVGLVVVAPDIEDQAPPAANLNSITSQASLISNLGLLR- 2095 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 D+ A Q+LT + + + + ++++T + + Sbjct: 2096 -DNYADFDHAGQVLTYNLQVVSSAEYMSPTAGYRTNINNHLLVYITTTTSFYTDPTPSAQ 2154 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + K+ I TI + + +LL + + SL + + I QL+ Sbjct: 2155 TIIAQ-KQYGII--TIGYGPTVDAGKLLTIS-GGAACSFTANDVASLNNQIKPIQQLIAT 2210 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 33/131 (25%), Gaps = 23/131 (17%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAY 302 L H MGL+ + + Q + ++ T A+ A Sbjct: 121 DLMHRARTDRMGLVAFAGTAFLQCPLTLDDAAFSQSIDSLDTRTISEGGTALAEAINTAR 180 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + K ++ TDGE+ + + K + Sbjct: 181 ETF------------------KNEKDNHKVLVLFTDGEDQDMGAVSAAEKA----AAEGM 218 Query: 363 KIVTISINASP 373 I TI I Sbjct: 219 LIFTIGIGTPD 229 >gi|47226573|emb|CAG08589.1| unnamed protein product [Tetraodon nigroviridis] Length = 960 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 61/187 (32%), Gaps = 28/187 (14%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PT 292 + F+ + L + + +I Y + + K + + T Sbjct: 161 MTAFIQKLIPSLDIGPQATQLSIIQYGVDPKFEFRLNQYRTKEDALAAASRITQMYGHST 220 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ A++ A Q + K ++ +TDGE+++ + + Sbjct: 221 NTFQAIQYASQ-------------WGFHQNNGGRPEAAKVMVVVTDGESHDVAFRESVLD 267 Query: 353 ICDKAKENFIKIVTISI-NA----SPNGQRLLKTCVS-----SPEYHYNVVNADSLIHVF 402 CD + I I++ S + + L+ S + Y +NV +L ++ Sbjct: 268 ECD---KKGITRFGIAVLGYYIRNSIDTENLIAEIKSIASKPTANYFFNVSEEAALSNIA 324 Query: 403 QNISQLM 409 + + Sbjct: 325 GTLGDRI 331 >gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens] Length = 550 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 55/193 (28%), Gaps = 38/193 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + + H Sbjct: 263 NIVFVIDVSGSMW--------------------GVKMKQTVEAMQAILDDLTIDDHFSII 302 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + E S ++Y+ T+ A+ +A Q+L + Sbjct: 303 DFNHNV-RCWSEELVAGSSIQIADAKKYIQNIK---PNGGTNINEALMRAVQMLLKASNQ 358 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 II ++DG+ K + + +E + ++ I Sbjct: 359 GLI-----------DPRSVSMIILVSDGDPTVGEIKLSTIQKNVKRVMREE-FSLFSLGI 406 Query: 370 NASPNGQRLLKTC 382 + L + Sbjct: 407 GFDVDYDFLERIA 419 >gi|297669807|ref|XP_002813080.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 3 [Pongo abelii] Length = 2575 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTRVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1204 Query: 405 ISQLM 409 + + + Sbjct: 1205 VLETL 1209 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 728 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 729 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 780 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 781 VFSVSTFRELPSLEQKLLTPITTL 804 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 680 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 731 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 732 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 786 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 787 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 837 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 838 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 896 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 897 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 945 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 946 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 998 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 999 EERIMNSFGPSAATPAPPG 1017 >gi|297669805|ref|XP_002813079.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 2 [Pongo abelii] Length = 2976 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1445 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1503 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1504 VYKGGRHANTRVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1553 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1605 Query: 405 ISQLM 409 + + + Sbjct: 1606 VLETL 1610 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1129 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1130 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1181 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1182 VFSVSTFRELPSLEQKLLTPITTL 1205 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1081 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1132 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1133 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1187 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1188 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1238 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1239 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1297 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1298 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1346 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1347 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1399 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1400 EERIMNSFGPSAATPAPPG 1418 >gi|297669803|ref|XP_002813078.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 1 [Pongo abelii] Length = 3182 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTRVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1335 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1336 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1387 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1388 VFSVSTFRELPSLEQKLLTPITTL 1411 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|262198733|ref|YP_003269942.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082080|gb|ACY18049.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 684 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/256 (8%), Positives = 62/256 (24%), Gaps = 45/256 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D + + D + + A FL + + + ++ Sbjct: 67 DPRCGLDAVIVLDASSSVRNYNNPPDANGAVDLIAGAGNAFLGAFADT-----NSRVAVV 121 Query: 258 GYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLILK-------------------PTDSTP 296 Y ++ + T T+ Sbjct: 122 SYNADPRLQLDLTAVTTDSLAAGGAHGIAMGDPGGPQGPMSPTTGYSEHARNGSGTNWEA 181 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + A +L ++ + P+ +G + ++ D+ Sbjct: 182 GLVYAQNVLENNGRADVPKLVIHVTDGRPTRHLTPDGTVTDEGGMAVHVA--EAAEVADQ 239 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYH--------YN--------VVNADSLIH 400 K + + I + + +P L+ S P+ ++ + D L Sbjct: 240 LKASGVHIFAVGVGRAPQFSEELQ-ATSGPDVFDQTQPGDAFDVVNDDVILAADFDQLEE 298 Query: 401 VFQNISQLMVHRKYSV 416 + + ++ + ++ Sbjct: 299 LLRGVADQICGASLTI 314 >gi|241667422|ref|ZP_04755000.1| hypothetical protein FphipA2_01540 [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254875971|ref|ZP_05248681.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841992|gb|EET20406.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 332 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 52/191 (27%), Gaps = 17/191 (8%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D T++ K L L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPTRLERAKYKTLDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V V D + ++ + A++++ Q++T + K Sbjct: 151 ANTVENLVPVINSDIVPVQGNNIYKAIEKSAQLITQAGAKKGQIILITDSTPSADAIA-K 209 Query: 332 FIIFLTDGENNNFKSNVNTI------KICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 G + + + + K++ I N L S Sbjct: 210 AKQLAEQGIDTDVYAIGTPKGGIAKDDKGNYIKDDGGNIQ--YFGIDLNQLEALAKAGSG 267 Query: 386 PEYHYNVVNAD 396 N D Sbjct: 268 KLVTLTANNTD 278 >gi|148675425|gb|EDL07372.1| mCG142757 [Mus musculus] Length = 1320 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 55/188 (29%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ L + + V +GL Y+ + + + K + Sbjct: 191 NHFQQVKDFLASIITQFA---IGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHL 247 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + K +I +TDG++ + Sbjct: 248 RYKGGNTFTGLALTH-------------VLEQNLKPAAGVRPEAAKVLILVTDGKSQDD- 293 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K+ I + + + + LK S P +NV++ L + Sbjct: 294 ----VRTAARILKDQDIDVFVVGVKNVDEAE--LKLLASQPLDITVHNVLDFPQLDTLAP 347 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 348 LLSRLICQ 355 >gi|123231311|emb|CAM23231.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938197|emb|CAX15853.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1280 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 55/188 (29%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ L + + V +GL Y+ + + + K + Sbjct: 183 NHFQQVKDFLASIITQFA---IGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHL 239 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + K +I +TDG++ + Sbjct: 240 RYKGGNTFTGLALTH-------------VLEQNLKPAAGVRPEAAKVLILVTDGKSQDD- 285 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K+ I + + + + LK S P +NV++ L + Sbjct: 286 ----VRTAARILKDQDIDVFVVGVKNVDEAE--LKLLASQPLDITVHNVLDFPQLDTLAP 339 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 340 LLSRLICQ 347 >gi|258679414|ref|NP_082794.1| collagen alpha-1(XX) chain [Mus musculus] gi|292495082|sp|Q923P0|COKA1_MOUSE RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|123231310|emb|CAM23230.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938196|emb|CAX15852.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1320 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 55/188 (29%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ L + + V +GL Y+ + + + K + Sbjct: 191 NHFQQVKDFLASIITQFA---IGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHL 247 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + K +I +TDG++ + Sbjct: 248 RYKGGNTFTGLALTH-------------VLEQNLKPAAGVRPEAAKVLILVTDGKSQDD- 293 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K+ I + + + + LK S P +NV++ L + Sbjct: 294 ----VRTAARILKDQDIDVFVVGVKNVDEAE--LKLLASQPLDITVHNVLDFPQLDTLAP 347 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 348 LLSRLICQ 355 >gi|313242832|emb|CBY39591.1| unnamed protein product [Oikopleura dioica] Length = 674 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGEN---NNFKSNV 348 +S K A+ + + +G + F K T G N S Sbjct: 458 NSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGTVVLLTNAASTD 517 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 ++ D+ KE +++ + + GQ L SSP E Y + L + + I+ Sbjct: 518 EVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENLYTAEESSDLAGLVRTIA 576 Query: 407 QLMVHRKYS 415 + + S Sbjct: 577 DEICATELS 585 >gi|218693363|gb|ACL01185.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|89054212|ref|YP_509663.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88863761|gb|ABD54638.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 1356 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 12/110 (10%), Positives = 29/110 (26%), Gaps = 8/110 (7%) Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + + + S +I ++DG N N + I + + Sbjct: 101 AVNAINPRGRTPMTDAVIAAAQSLRSTEEAATVILVSDGIENC---NPDPCAIAAELEAT 157 Query: 361 FI--KIVTISINA---SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I + ++ ++ NA L + + Sbjct: 158 GVDFTAHVIGFDVASEPEARAQMQCIADNTGGQFLTADNATELSQALEQV 207 >gi|386831|gb|AAA51620.1|AAA51620 integrin alpha subunit precursor [Homo sapiens] Length = 1163 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 50/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + ++ + T + A++ L Sbjct: 183 RPSTQFSLMQFSNKFQTHLTFEEFRRTSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 242 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 243 SYG-------------ARRDATKILIVITDGKKEGDTLDYKDVIPMADAA---GIIRYAI 286 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 287 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQTQLREKI 333 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K I +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVIFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|309364361|emb|CAR98722.1| hypothetical protein CBG_25517 [Caenorhabditis briggsae AF16] Length = 342 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 53/188 (28%), Gaps = 25/188 (13%) Query: 228 MAALKNALLLFLD---SIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTR 282 + +L D I +G I Y + + ++ + R Sbjct: 17 TNLVYETILSLFDKNLQIGTGYADPRSTRVGFITYNYNATDVADFYKLQSYDDLKSQIQR 76 Query: 283 DMDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + T A+ A ++ S ++K +I T+ Sbjct: 77 LKMTPLTTTTVSRMDTALYAAMNMINSTAGF--------------RDNYKKVVIVFTNVH 122 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH--YNVVNADSL 398 + + I ++T++ +S + Q LK S+ Y+ + Sbjct: 123 GTYKS--NPPKDVSKSLQMKGIPVITVNTGSSSDTQSWLKNIASTNMAFAIYDGNVTQEI 180 Query: 399 IHVFQNIS 406 +I+ Sbjct: 181 QKAMTDIN 188 >gi|296190554|ref|XP_002806559.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Callithrix jacchus] Length = 3582 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 58/175 (33%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRD------MDSLILKPTDS 294 + V + ++ ++++ + T + RQ+ S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTSRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVLFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|291448990|ref|ZP_06588380.1| secreted protein [Streptomyces roseosporus NRRL 15998] gi|291351937|gb|EFE78841.1| secreted protein [Streptomyces roseosporus NRRL 15998] Length = 420 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 64/208 (30%), Gaps = 30/208 (14%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++MAA K A LD++ + Sbjct: 35 VELVLDVSGSMRTRD--------------IDGQSRMAAAKQAFNDVLDAVPEQVQLGIRT 80 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 R + + P D T A K A L Sbjct: 81 LGADYPGEDR--------------KVGCKDTRQLYPVGPLDRTEA-KTAVATLAPTGFTP 125 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + + I+ +TDGE+ + + A+ + + T+ + + Sbjct: 126 IGPALLGAADDLEGGEGSRRIVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPN 185 Query: 373 PNGQRLLK-TCVSSPEYHYNVVNADSLI 399 ++ L ++ + V + + L Sbjct: 186 AKIRQQLTCIAEATGGTYTAVQHKEELS 213 >gi|281339018|gb|EFB14602.1| hypothetical protein PANDA_010506 [Ailuropoda melanoleuca] Length = 1096 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/285 (12%), Positives = 82/285 (28%), Gaps = 37/285 (12%) Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI--ELVV 197 + +I ++ T+ L ++ + + + + + Sbjct: 78 TATSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAKQCGSQYYTTGVCSDVSADFQL 137 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI-DLLSHVKEDVYMGL 256 S + DV + A+ FL+ L + +GL Sbjct: 138 LTSFAPAAQACPSLIDVVVVCDESNSIY-----PWEAVKNFLEKFVQSLDIGPKKTQVGL 192 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFF 314 I Y + + K + + T++ A++ A Sbjct: 193 IQYANKPRVIFNLNTFKTKAE-MIEATSQTYQYGGDLTNTFKAIQYAKD----------- 240 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 F L K ++ +TDGE+++ I CD + I I++ N Sbjct: 241 --FAYAAGAGGRLGAAKVMVVVTDGESHDGSMLKAVIDQCD---NDNILRFGIAVLGYLN 295 Query: 375 GQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + + +NV + +L+ + + + Sbjct: 296 RNALDTKNLIKEIKAIASIPTETFFFNVSDEAALLEKAGTLGEQI 340 >gi|218693409|gb|ACL01208.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|218693373|gb|ACL01190.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|218693353|gb|ACL01180.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|55958063|emb|CAI12958.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 935 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 299 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 338 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 339 DFNQNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 393 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 394 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 442 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 443 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 484 >gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 [Homo sapiens] gi|229462889|sp|P19823|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; AltName: Full=Inter-alpha-trypsin inhibitor complex component II; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|55958062|emb|CAI12957.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 946 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNQNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens] gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens] gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens] Length = 946 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNQNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|91788739|ref|YP_549691.1| putative type 4 fimbrial biogenesis PilY1-like protein signal peptide [Polaromonas sp. JS666] gi|91697964|gb|ABE44793.1| putative type 4 fimbrial biogenesis PilY1-related protein signal peptide [Polaromonas sp. JS666] Length = 1025 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 79/300 (26%), Gaps = 59/300 (19%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + L N + + + + G M Sbjct: 37 TAETPNVLIVLDNT-------ANWNTAFVNEKAALVSV----FQGMPVDKFRVGLMMFNE 85 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 N + RT ++ K A +DS+D+ + Sbjct: 86 TGGGNSGNDGAYVRAAARTITSSNKTAYANLIDSLDITGDKSNGGKVAKAMMEAYYY--- 142 Query: 268 EPSWGTEKVRQYVTRDMDSLILKPTDSTPAM-KQAYQILTSDKKRSFFTNFFRQGVKIPS 326 S GT + +L S AM A L S ++ Sbjct: 143 -LSGGTPHAGNQKNKTDYALNTAGNASDDAMHALAGNALASKNSTTYVKPGSGD------ 195 Query: 327 LPFQKFIIFLTDG---------------------------ENNNFKSNVNTIKICDKAKE 359 + +II++++G N + + + K+ Sbjct: 196 -CVKNYIIYISNGPAQDNSSDIETATTALAAAGGKTTAISLNPSGSQDNPADEWARFLKK 254 Query: 360 NFIK--IVTISINASPNGQ-----RLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + IK + T+ +N +GQ LLK+ S S +++V + + +I + Sbjct: 255 SAIKGVVYTLDVNKGTSGQGPGWTALLKSMASQSGGRYFDV-STSGIADAINSILSEIQS 313 >gi|260837139|ref|XP_002613563.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] gi|229298948|gb|EEN69572.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] Length = 195 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/147 (11%), Positives = 46/147 (31%), Gaps = 18/147 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + +G++ Y + + + + S T + Sbjct: 66 NFVATTTSDFQIGPNNAQVGIVQYANWLYEEVSLNQYKTLDELLPAIYNISYWGGGTYTG 125 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ ++ + K +I +TDG++ + + Sbjct: 126 WAIDYVVNATLTESRG-------------ARQDVPKVVIVVTDGQSADD-----VRQPAL 167 Query: 356 KAKENFIKIVTISINASPNGQRLLKTC 382 +AK++ I +V I + + +G L++ Sbjct: 168 RAKQSGIIMVAIGVGSIYDGTELVEIA 194 >gi|27806781|ref|NP_776396.1| vitrin precursor [Bos taurus] gi|75054534|sp|Q95LI2|VITRN_BOVIN RecName: Full=Vitrin; Flags: Precursor gi|17941422|gb|AAL18262.2| vitrin [Bos taurus] Length = 652 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 50/401 (12%), Positives = 108/401 (26%), Gaps = 52/401 (12%) Query: 39 FLIYVLDWHYKK---------NSMESANN---AAILAGASKMVSNL-SRLGDRFESISNH 85 FLI K+ + + A L G + N ++ + S Sbjct: 270 FLIDGSSSIGKRRFRIQKQFLTDVAQTLDIGPAGPLMGVVQYGDNPATQFNLKTHMNSQD 329 Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 K A+ ++R ++ ++S + F++ N + +++ + D Sbjct: 330 VKAAIEKISQRGGLSNAGRAISFVTKNFFSKFNGNRGGAPNVAVVMVDGWPTDKVEEASR 389 Query: 146 YNMDVMTSYDYRLQFIEH------LLNQRYNQKIVSFIPALLRIEMGE------RPIFLI 193 + + + +L + K V + + L+ Sbjct: 390 LARESGVNIFFITIEGASENEKQYMLEPNFANKAVCRTNGFYSLTVQNWFSLHKTVQPLV 449 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + V D + D+ T + + L + D Sbjct: 450 KRVCDTDRLACSKTCLNSADIGFVIDGSSSVGT--SNFRTVLQFVANLSREFEISDMDTR 507 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +G + YT + K T + A+ A + L Sbjct: 508 IGAMQYTYEQRLEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF------- 560 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K +K +I +TDG + + + A + I + + Sbjct: 561 ---------KKSKPNKRKLMILITDGRSYDDIRIPAML-----AHHKGVITYAIGV--AW 604 Query: 374 NGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 Q L + P ++ + V D+L V + Q + Sbjct: 605 AAQDELDIIATHPARDHAFFVDEFDNLYKVVPKVIQNICTE 645 >gi|332815788|ref|XP_003309587.1| PREDICTED: collagen alpha-3(VI) chain [Pan troglodytes] Length = 2570 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1204 Query: 405 ISQLM 409 + + + Sbjct: 1205 VLETL 1209 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 728 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 729 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 780 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 781 VFSVSTFRELPSLEQKLLTPITTL 804 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 680 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 731 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 732 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 786 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 787 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 837 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 838 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 896 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 897 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 945 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 946 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 998 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 999 EERIMNSFGPSAATPAPPG 1017 >gi|240255535|ref|NP_476507.3| collagen alpha-3(VI) chain isoform 4 precursor [Homo sapiens] Length = 2570 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1204 Query: 405 ISQLM 409 + + + Sbjct: 1205 VLETL 1209 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 728 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 729 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 778 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 779 EYVFSVSTFRELPSLEQKLLTPITTL 804 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 680 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 731 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 732 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 786 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 787 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 837 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 838 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 896 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 897 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 945 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 946 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 998 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 999 EERIMNSFGPSAATPAPPG 1017 >gi|219841772|gb|AAI44596.1| COL6A3 protein [Homo sapiens] gi|223462379|gb|AAI50626.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1204 Query: 405 ISQLM 409 + + + Sbjct: 1205 VLETL 1209 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 728 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 729 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 778 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 779 EYVFSVSTFRELPSLEQKLLTPITTL 804 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 680 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 731 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 732 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 786 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 787 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 837 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 838 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 896 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 897 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 945 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 946 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 998 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 999 EERIMNSFGPSAATPAPPG 1017 >gi|218693397|gb|ACL01202.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|218693357|gb|ACL01182.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693425|gb|ACL01216.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693427|gb|ACL01217.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|219521324|gb|AAI71790.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1204 Query: 405 ISQLM 409 + + + Sbjct: 1205 VLETL 1209 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 728 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 729 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 778 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 779 EYVFSVSTFRELPSLEQKLLTPITTL 804 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 680 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 731 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 732 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 786 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 787 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 837 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 838 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 896 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 897 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 945 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 946 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 998 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 999 EERIMNSFGPSAATPAPPG 1017 >gi|119591515|gb|EAW71109.1| collagen, type VI, alpha 3, isoform CRA_g [Homo sapiens] Length = 2205 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1445 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1503 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1504 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1553 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1605 Query: 405 ISQLM 409 + + + Sbjct: 1606 VLETL 1610 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1081 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1132 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1133 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1187 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1188 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1238 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1239 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1297 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1298 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1346 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1347 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1399 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1400 EERIMNSFGPSAATPAPPG 1418 >gi|119591516|gb|EAW71110.1| collagen, type VI, alpha 3, isoform CRA_h [Homo sapiens] Length = 2977 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1451 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1509 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1510 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1559 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1611 Query: 405 ISQLM 409 + + + Sbjct: 1612 VLETL 1616 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1029 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1085 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1086 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1135 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1136 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1185 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1186 EYVFSVSTFRELPSLEQKLLTPITTL 1211 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1087 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1138 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1139 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1193 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1194 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1244 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1245 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1303 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1304 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1352 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1353 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1405 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1406 EERIMNSFGPSAATPAPPG 1424 >gi|119591510|gb|EAW71104.1| collagen, type VI, alpha 3, isoform CRA_b [Homo sapiens] Length = 2210 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1450 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1508 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1509 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1557 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1558 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1610 Query: 405 ISQLM 409 + + + Sbjct: 1611 VLETL 1615 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1028 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1084 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1085 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1134 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1135 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1184 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1185 EYVFSVSTFRELPSLEQKLLTPITTL 1210 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1086 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1137 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1138 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1192 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1193 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1243 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1244 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1302 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1303 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1351 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1352 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1404 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1405 EERIMNSFGPSAATPAPPG 1423 >gi|119591509|gb|EAW71103.1| collagen, type VI, alpha 3, isoform CRA_a [Homo sapiens] Length = 2211 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1451 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1509 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1510 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1559 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1611 Query: 405 ISQLM 409 + + + Sbjct: 1612 VLETL 1616 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1029 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1085 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1086 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1135 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1136 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1185 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1186 EYVFSVSTFRELPSLEQKLLTPITTL 1211 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1087 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1138 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1139 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1193 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1194 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1244 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1245 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1303 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1304 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1352 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1353 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1405 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1406 EERIMNSFGPSAATPAPPG 1424 >gi|119591514|gb|EAW71108.1| collagen, type VI, alpha 3, isoform CRA_f [Homo sapiens] Length = 2244 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1484 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1542 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1543 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1591 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1592 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1644 Query: 405 ISQLM 409 + + + Sbjct: 1645 VLETL 1649 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 >gi|119591511|gb|EAW71105.1| collagen, type VI, alpha 3, isoform CRA_c [Homo sapiens] Length = 2971 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1445 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1503 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1504 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1553 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1605 Query: 405 ISQLM 409 + + + Sbjct: 1606 VLETL 1610 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1081 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1132 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1133 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1187 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1188 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1238 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1239 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1297 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1298 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1346 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1347 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1399 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1400 EERIMNSFGPSAATPAPPG 1418 >gi|119591512|gb|EAW71106.1| collagen, type VI, alpha 3, isoform CRA_d [Homo sapiens] Length = 2411 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|119591513|gb|EAW71107.1| collagen, type VI, alpha 3, isoform CRA_e [Homo sapiens] Length = 3177 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|114584071|ref|XP_001153479.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Pan troglodytes] Length = 2971 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1445 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1503 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1504 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1553 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1605 Query: 405 ISQLM 409 + + + Sbjct: 1606 VLETL 1610 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1129 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1130 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1181 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1182 VFSVSTFRELPSLEQKLLTPITTL 1205 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1081 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1132 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1133 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1187 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1188 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1238 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1239 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1297 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1298 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1346 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1347 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1399 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1400 EERIMNSFGPSAATPAPPG 1418 >gi|114584073|ref|XP_001153410.1| PREDICTED: alpha 3 type VI collagen isoform 2 [Pan troglodytes] Length = 2977 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1451 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1509 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1510 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1558 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1559 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1611 Query: 405 ISQLM 409 + + + Sbjct: 1612 VLETL 1616 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1029 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1085 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1086 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1135 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1136 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1187 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1188 VFSVSTFRELPSLEQKLLTPITTL 1211 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1087 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1138 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1139 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1193 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1194 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1244 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1245 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1303 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1304 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1352 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1353 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1405 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1406 EERIMNSFGPSAATPAPPG 1424 >gi|114584069|ref|XP_001153544.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Pan troglodytes] Length = 3177 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1335 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1336 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1387 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1388 VFSVSTFRELPSLEQKLLTPITTL 1411 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|114584077|ref|XP_001153230.1| PREDICTED: alpha 3 type VI collagen isoform 1 [Pan troglodytes] Length = 3010 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1484 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1542 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1543 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1591 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1592 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1644 Query: 405 ISQLM 409 + + + Sbjct: 1645 VLETL 1649 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1335 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1336 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1387 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1388 VFSVSTFRELPSLEQKLLTPITTL 1411 >gi|114584075|ref|XP_516178.2| PREDICTED: alpha 3 type VI collagen isoform 5 [Pan troglodytes] Length = 2976 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1450 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1508 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1509 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1557 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1558 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1610 Query: 405 ISQLM 409 + + + Sbjct: 1611 VLETL 1615 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 63/204 (30%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1028 GGKRDVVFLIDGSQSAGPEFQYIRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1084 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K A + ++ + + + +G Sbjct: 1085 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG--SALEYVSRNIFKRPLGSRIEEG------- 1134 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 1135 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 1186 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 1187 VFSVSTFRELPSLEQKLLTPITTL 1210 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1086 NAHSSKDEVQNAVQR--------LRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQ 1137 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1138 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1192 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1193 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1243 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1244 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1302 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1303 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1351 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1352 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1404 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1405 EERIMNSFGPSAATPAPPG 1423 >gi|62088852|dbj|BAD92873.1| alpha 3 type VI collagen isoform 5 precursor variant [Homo sapiens] Length = 1702 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 176 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 234 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 235 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 283 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 284 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 336 Query: 405 ISQLM 409 + + + Sbjct: 337 VLETL 341 >gi|55743106|ref|NP_476508.2| collagen alpha-3(VI) chain isoform 5 precursor [Homo sapiens] Length = 2971 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1445 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1503 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1504 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1552 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1553 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1605 Query: 405 ISQLM 409 + + + Sbjct: 1606 VLETL 1610 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1081 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1132 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1133 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1187 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1188 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1238 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1239 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1297 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1298 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1346 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1347 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1399 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1400 EERIMNSFGPSAATPAPPG 1418 >gi|55743098|ref|NP_004360.2| collagen alpha-3(VI) chain isoform 1 precursor [Homo sapiens] gi|311033499|sp|P12111|CO6A3_HUMAN RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|225000446|gb|AAI72233.1| Collagen, type VI, alpha 3 [synthetic construct] gi|302313173|gb|ADL14511.1| collagen, type VI, alpha 3 [Homo sapiens] Length = 3177 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|62988748|gb|AAY24135.1| unknown [Homo sapiens] Length = 2588 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 100/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V+ + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVDDPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|3127926|emb|CAA36267.1| collagen type VI, alpha 3 chain [Homo sapiens] Length = 3176 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 58/185 (31%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1651 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1710 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1758 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L + + Sbjct: 1759 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 262 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 320 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 321 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 413 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 62/206 (30%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1229 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKAEFL 1285 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1286 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1335 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + + K+ + TI+ + + + L+K SP Sbjct: 1336 --VPQFLVLISSG-----KSDDEVVVPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1385 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1386 EYVFSVSTFRELPSLEQKLLTPITTL 1411 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 33/379 (8%), Positives = 99/379 (26%), Gaps = 44/379 (11%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 + H K+ +++A + R + ++ ++ I+ + + Sbjct: 1287 NAHSSKDEVQNAVQR--------LRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 1338 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 L S+ + + V + + + N + V S F Sbjct: 1339 FLVLISSG----KSDDEVVVPAVELKQFGVAPFTIARNADQEEL-VKISLSPEYVFSVST 1393 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + P + + S + D D Sbjct: 1394 FRELPSLEQKLLTPITTLTSEQIQKLL---------ASTRYPPPAVESDAADIVFLIDSS 1444 Query: 225 RT-KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + + + L+ V +G++ ++ V + + Sbjct: 1445 EGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAP-VLDAI 1503 Query: 284 MDSLILKPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + ++ A++ + L F + + ++ + G++ Sbjct: 1504 RRLRLRGGSPLNTGKALEFVARNL-----------FVKSAGSRIEDGVPQHLVLVLGGKS 1552 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + S + + I ++ + + L+T + P + V L ++ Sbjct: 1553 QDDVSR-----FAQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNI 1605 Query: 402 FQNISQLMVHRKYSVILKG 420 + I + G Sbjct: 1606 EERIMNSFGPSAATPAPPG 1624 >gi|77407764|ref|ZP_00784518.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|77173630|gb|EAO76745.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|218693241|gb|ACL01124.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693247|gb|ACL01127.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693329|gb|ACL01168.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693351|gb|ACL01179.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693361|gb|ACL01184.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693365|gb|ACL01186.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693375|gb|ACL01191.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693413|gb|ACL01210.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693417|gb|ACL01212.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693437|gb|ACL01222.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693441|gb|ACL01224.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693443|gb|ACL01225.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693445|gb|ACL01226.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693449|gb|ACL01228.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693451|gb|ACL01229.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 47/165 (28%), Gaps = 26/165 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT---DSTPAMK 299 + + + +GL+ Y + + + T + Sbjct: 72 TSIPNQPKTTRLGLVTYNWNATIQAGLDKFQSQQDVFENIFNALNSVSSTSESYLANGLV 131 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A +L +QK I+ ++ S+ N I I D+ K+ Sbjct: 132 AAENVL--------------ARGPNRGNNYQKVIVLFAA----SYSSHSNPIAIADRLKQ 173 Query: 360 NFIKIVTIS---INASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I I+T+ + Q L K SP + + + + Sbjct: 174 AGITIITMGYNNVGDPNFYQNLAKIA--SPNKSFTEKSLSQIGDI 216 >gi|218693287|gb|ACL01147.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|22536818|ref|NP_687669.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|25010692|ref|NP_735087.1| hypothetical protein gbs0632 [Streptococcus agalactiae NEM316] gi|76788219|ref|YP_329407.1| Cna B domain-containing protein [Streptococcus agalactiae A909] gi|77411235|ref|ZP_00787585.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|22533664|gb|AAM99541.1|AE014220_13 cell wall surface anchor family protein, putative [Streptococcus agalactiae 2603V/R] gi|23095046|emb|CAD46276.1| Unknown [Streptococcus agalactiae NEM316] gi|76563276|gb|ABA45860.1| cna B-type domain protein [Streptococcus agalactiae A909] gi|77162661|gb|EAO73622.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|218693223|gb|ACL01115.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693225|gb|ACL01116.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693227|gb|ACL01117.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693229|gb|ACL01118.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693231|gb|ACL01119.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693233|gb|ACL01120.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693237|gb|ACL01122.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693239|gb|ACL01123.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693243|gb|ACL01125.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693245|gb|ACL01126.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693249|gb|ACL01128.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693251|gb|ACL01129.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693253|gb|ACL01130.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693255|gb|ACL01131.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693257|gb|ACL01132.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693259|gb|ACL01133.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693261|gb|ACL01134.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693263|gb|ACL01135.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693265|gb|ACL01136.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693267|gb|ACL01137.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693269|gb|ACL01138.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693271|gb|ACL01139.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693273|gb|ACL01140.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693275|gb|ACL01141.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693277|gb|ACL01142.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693279|gb|ACL01143.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693281|gb|ACL01144.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693283|gb|ACL01145.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693285|gb|ACL01146.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693289|gb|ACL01148.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693291|gb|ACL01149.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693293|gb|ACL01150.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693295|gb|ACL01151.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693297|gb|ACL01152.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693299|gb|ACL01153.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693301|gb|ACL01154.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693303|gb|ACL01155.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693305|gb|ACL01156.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693307|gb|ACL01157.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693309|gb|ACL01158.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693311|gb|ACL01159.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693313|gb|ACL01160.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693315|gb|ACL01161.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693317|gb|ACL01162.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693319|gb|ACL01163.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693321|gb|ACL01164.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693323|gb|ACL01165.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693325|gb|ACL01166.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693327|gb|ACL01167.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693331|gb|ACL01169.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693333|gb|ACL01170.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693337|gb|ACL01172.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693339|gb|ACL01173.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693341|gb|ACL01174.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693345|gb|ACL01176.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693347|gb|ACL01177.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693349|gb|ACL01178.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693355|gb|ACL01181.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693359|gb|ACL01183.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693369|gb|ACL01188.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693371|gb|ACL01189.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693377|gb|ACL01192.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693379|gb|ACL01193.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693381|gb|ACL01194.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693383|gb|ACL01195.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693387|gb|ACL01197.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693389|gb|ACL01198.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693391|gb|ACL01199.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693393|gb|ACL01200.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693395|gb|ACL01201.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693399|gb|ACL01203.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693401|gb|ACL01204.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693403|gb|ACL01205.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693405|gb|ACL01206.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693407|gb|ACL01207.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693411|gb|ACL01209.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693415|gb|ACL01211.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693419|gb|ACL01213.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693421|gb|ACL01214.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693433|gb|ACL01220.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693435|gb|ACL01221.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693439|gb|ACL01223.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693447|gb|ACL01227.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693453|gb|ACL01230.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693455|gb|ACL01231.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693457|gb|ACL01232.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693459|gb|ACL01233.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693461|gb|ACL01234.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693463|gb|ACL01235.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693465|gb|ACL01236.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693467|gb|ACL01237.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693471|gb|ACL01239.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693473|gb|ACL01240.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693475|gb|ACL01241.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/201 (9%), Positives = 58/201 (28%), Gaps = 38/201 (18%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--------NIEPSWGTE 274 + ++ LK A F+ S + +G++ Y + Sbjct: 342 ETPKRIERLKVAAKNFV------SLAENGTELGIVSYASDAAVASGRTEVAIAPLGANRA 395 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + T+ +++A ++T+ + T +I+ Sbjct: 396 AWNNAIDGL---GPSTRTNIGAGLQKARDLITAAGGVTANT----------------YIV 436 Query: 335 FLTDGENNNF----KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYH 389 ++DG NN ++ + + I + + + + ++ Sbjct: 437 LMSDGLNNEPAPQANADADLNGKIAMLLADGIPVYVTCTGSDLGLASQCSEIGTGTGGHY 496 Query: 390 YNVVNADSLIHVFQNISQLMV 410 + ++ L F + + +V Sbjct: 497 VDSADSARLPEAFADFHERIV 517 >gi|218693367|gb|ACL01187.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693469|gb|ACL01238.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 62/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 62/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + +R Sbjct: 54 SSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLISVTP-----DSIR 108 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 109 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 157 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 158 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 215 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 216 EEDAGSQLIGFYDEI 230 >gi|332290703|ref|YP_004429312.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332168789|gb|AEE18044.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 351 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 57/187 (30%), Gaps = 15/187 (8%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 T + LK A F+ ++ + + + + +++ + Sbjct: 126 STSLQELKQASSSFISNVMPAIPTDSFKMAIYWFDGEDVLHELQPLTTSAAMLQEAIDGI 185 Query: 284 M-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D TD A+ +A +I + S + F ++ TDG + Sbjct: 186 TEDISNDPSTDLYGAVIKAAEIADNIIDVSENEDLFAAA----------SVVIFTDGTDQ 235 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ + TI + + + +L + NA L VF Sbjct: 236 AARYTEEEAVDAVSNADDDVSFFTIGLGSEIDEG-ILSKIGQTESVF--ASNAAELESVF 292 Query: 403 QNISQLM 409 +IS + Sbjct: 293 NDISSGV 299 >gi|289524039|ref|ZP_06440893.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289502695|gb|EFD23859.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 1071 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 39/406 (9%), Positives = 82/406 (20%), Gaps = 105/406 (25%) Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSN------------NTIFYNMDVMTSYDYRL 158 + + + + + L + Y V + L Sbjct: 6 SNINVIKRVFVAAVVLAFIAFVPLEVLPQAAGEAPVLPSCNKPLPEGYAQSVQPNVLLLL 65 Query: 159 QFIEHL------------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + + + R + Sbjct: 66 DTSGSMMYLSGSETSTYGDGSKPYKYGNKIYRYFGRDWESSNNNPDGPYNYHPNLVYIDD 125 Query: 207 --------MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID-------------LL 245 D S ++M LK L L+ Sbjct: 126 ADLTGISQRYRDRYFAKSGSRYLYPNDSRMYKLKLVLWEILNDPGLVSGLRMGLATYYQQ 185 Query: 246 SHVKEDVYMGLIGYTTRVEK---------NIEPSWGTEKVRQYVTRDMD----------- 285 + +V + Y E SW +R++ + Sbjct: 186 ENDPPEVPSIGVYYQWPPETYGSAQALSWEGSSSWNRALLRRFFKSTDEPGSIEEITKWV 245 Query: 286 ----------SLILKPTDSTPAMKQAYQILTSDKKRSFFT-NFFRQGVKIPSLPFQKFII 334 T ++ A +FF I + Q ++I Sbjct: 246 DGIEASDNPELRAHGATPLAASIYGADAHPDKKNPYRGDVRDFFLMDEAINAWCQQNWLI 305 Query: 335 FLTDGENNNFKS-NVNTIKICDKA---------------KENFIKIVTISI------NAS 372 LTDG + N + + K ++ + I + S Sbjct: 306 VLTDGADTQTWPYNTSPVTAVKNLYDEHAKTSWPSFYGKKAQPVRTMVIGLINPNASGVS 365 Query: 373 PNGQRLLKTC-------VSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L + + + Y + D L+ F++I + + Sbjct: 366 TLKNTLNRMADMGDDGQENGSSHAYFATDVDELMQAFKDIFKQIQS 411 >gi|218693423|gb|ACL01215.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693429|gb|ACL01218.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693431|gb|ACL01219.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 109/361 (30%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V ++ S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|189238315|ref|XP_972124.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 79/261 (30%), Gaps = 49/261 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 L + +++ Q + + L + + ++ D+S + D Sbjct: 294 VVLVLDHSASMRGRKHEQLMQAMDKILSDLN-PDDLFHVVCFSEDVSV---WNLEKKQFD 349 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +D ++ + D + + V+ + Sbjct: 350 ------------------------LIDFMEKFDYENLDSCLTELNLGNAVQ------FTE 379 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + +++ D + + T+ + ++ K+++ N Q I Sbjct: 380 DNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVE--------TKHQPMI 431 Query: 334 IFLTDGENNNFKSNVNTI-KICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPE--- 387 I LTDG N S+ I KI K + N I ++S + L K + Sbjct: 432 ILLTDGLPNVGLSDPVEITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 491 Query: 388 YHYNVVNAD-SLIHVFQNISQ 407 + Y +A L + ++ + Sbjct: 492 HIYEAADAALQLQNFYRTVFS 512 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 59/185 (31%), Gaps = 21/185 (11%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MDSL 287 A K +L+ + + + + + + + + + + + + Sbjct: 258 AAKASLISYENDTKTSRLLPGQQQVAIYSFDSEITLLTDYTSDINLLEAAIDTIPNSVLE 317 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ AM+ A +G + + LTDGE+N + Sbjct: 318 RGNSTNLLGAMEIAA----ERWNDQIDLIAVERG----------YAVLLTDGEHNFDSRS 363 Query: 348 VNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 I+ D + + I++ + N + L SS + + +A+ L VF Sbjct: 364 PADIE-ADLTNFFGTRKKVYAIAVGNNVNLENLEAITASSEQVLTVNSFESAEELEAVFT 422 Query: 404 NISQL 408 ++ Sbjct: 423 EVATT 427 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 70/204 (34%), Gaps = 38/204 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++ L A LFL L +++ + G++ + + T+ R + Sbjct: 76 GDRLNRLNQASQLFL-----LQIIEKGSWTGMVTFDSSATIQSALIQIETDAQRNSLISR 130 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + T ++ A+ ++ + I+ LTDGE++ Sbjct: 131 LPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE-----------------IVLLTDGEDS- 172 Query: 344 FKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNAD--SLI 399 TI C D+ K++ I T+++ S + L + + NA LI Sbjct: 173 ------TISSCFDEVKQSGAIIHTVALGPSADPG-LEELAKMTGGMKTSPTDNAQNNGLI 225 Query: 400 HVFQNISQ---LMVHRKYSVILKG 420 F +S + R + KG Sbjct: 226 DAFSALSSGNGAITQRSIQLESKG 249 >gi|91791025|ref|YP_551976.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91700905|gb|ABE47078.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 753 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/276 (12%), Positives = 84/276 (30%), Gaps = 46/276 (16%) Query: 155 DYRLQFIEHLLNQ-----------RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 L + +++ + A ++ ++ ++ + D + Sbjct: 508 AIPLDTSARMSGAVSRLVRIFTKELQDKRRRTVKLASAGGQVASNRVWRLKAMGDTNVFK 567 Query: 204 HCAMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + ++ + A L +++ +S VK + M GY Sbjct: 568 VTSSVCGIDAAATILLDRSGSMSRCIVEAAGAALSCSQALERISKVKTSIEM-FPGYAKC 626 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 V + + RQ R + T A+++ L + + + Sbjct: 627 VGNTVALQAFGQSARQVARRVNEVDAEGGTPLAEALQEVMPRLLAQRVK----------- 675 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL--- 379 ++ + +TDG NN + I +K ++ V I I L Sbjct: 676 -------KRIVFLVTDGIPNNRPGALEEIGKAEKL---GVEFVGIGIGVHGRAIEGLTPF 725 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 C + +A L F+ + + + K + Sbjct: 726 SIC---------INDASELPDAFEKLFRGNIALKLA 752 >gi|76808650|ref|YP_333444.1| hypothetical protein BURPS1710b_2049 [Burkholderia pseudomallei 1710b] gi|126452631|ref|YP_001066157.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|167738664|ref|ZP_02411438.1| TadE family protein [Burkholderia pseudomallei 14] gi|167824258|ref|ZP_02455729.1| TadE family protein [Burkholderia pseudomallei 9] gi|167894376|ref|ZP_02481778.1| TadE family protein [Burkholderia pseudomallei 7894] gi|167919038|ref|ZP_02506129.1| TadE family protein [Burkholderia pseudomallei BCC215] gi|226196406|ref|ZP_03791988.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237812172|ref|YP_002896623.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|242315614|ref|ZP_04814630.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254188727|ref|ZP_04895238.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|254197381|ref|ZP_04903803.1| TadE family protein [Burkholderia pseudomallei S13] gi|254262100|ref|ZP_04953154.1| TadE family protein [Burkholderia pseudomallei 1710a] gi|254297708|ref|ZP_04965161.1| TadE family protein [Burkholderia pseudomallei 406e] gi|76578103|gb|ABA47578.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|126226273|gb|ABN89813.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|157807056|gb|EDO84226.1| TadE family protein [Burkholderia pseudomallei 406e] gi|157936406|gb|EDO92076.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|169654122|gb|EDS86815.1| TadE family protein [Burkholderia pseudomallei S13] gi|225931623|gb|EEH27628.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237504336|gb|ACQ96654.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|242138853|gb|EES25255.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254220789|gb|EET10173.1| TadE family protein [Burkholderia pseudomallei 1710a] Length = 142 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 37/120 (30%), Gaps = 4/120 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + E+ S+ F L+ +L++ ++ K + +A+ A A Sbjct: 1 MKRLA----RLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARA 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G V L+ ++S + ++ +S + Sbjct: 57 GVVVRVPQLAATDITNIALSYAQGSLVSGGTVGTPVVNVDQSAGTSPGSPLKVTVSYTYQ 116 >gi|121600676|ref|YP_992999.1| hypothetical protein BMASAVP1_A1673 [Burkholderia mallei SAVP1] gi|124385110|ref|YP_001026331.1| TadE-like protein [Burkholderia mallei NCTC 10229] gi|126440021|ref|YP_001058915.1| TadE family protein [Burkholderia pseudomallei 668] gi|126448781|ref|YP_001080395.1| hypothetical protein BMA10247_0832 [Burkholderia mallei NCTC 10247] gi|167815888|ref|ZP_02447568.1| hypothetical protein Bpse9_12144 [Burkholderia pseudomallei 91] gi|167845805|ref|ZP_02471313.1| hypothetical protein BpseB_10993 [Burkholderia pseudomallei B7210] gi|167902778|ref|ZP_02489983.1| hypothetical protein BpseN_11002 [Burkholderia pseudomallei NCTC 13177] gi|167911023|ref|ZP_02498114.1| hypothetical protein Bpse112_11050 [Burkholderia pseudomallei 112] gi|254177893|ref|ZP_04884548.1| putative membrane protein [Burkholderia mallei ATCC 10399] gi|254179868|ref|ZP_04886467.1| TadE family protein [Burkholderia pseudomallei 1655] gi|254358537|ref|ZP_04974810.1| putative membrane protein [Burkholderia mallei 2002721280] gi|121229486|gb|ABM52004.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293130|gb|ABN02399.1| TadE-like protein [Burkholderia mallei NCTC 10229] gi|126219514|gb|ABN83020.1| TadE family protein [Burkholderia pseudomallei 668] gi|126241651|gb|ABO04744.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|148027664|gb|EDK85685.1| putative membrane protein [Burkholderia mallei 2002721280] gi|160698932|gb|EDP88902.1| putative membrane protein [Burkholderia mallei ATCC 10399] gi|184210408|gb|EDU07451.1| TadE family protein [Burkholderia pseudomallei 1655] Length = 142 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 37/120 (30%), Gaps = 4/120 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + E+ S+ F L+ +L++ ++ K + +A+ A A Sbjct: 1 MKRLA----RLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARA 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G V L+ ++S + ++ +S + Sbjct: 57 GVVVRVPQLAATDITNIALSYAQGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYTYQ 116 >gi|109039132|ref|XP_001085463.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Macaca mulatta] Length = 891 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ + +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEAKTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|332222722|ref|XP_003260519.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Nomascus leucogenys] Length = 3535 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|297685094|ref|XP_002820135.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Pongo abelii] Length = 3553 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 95 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 154 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 155 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 193 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 194 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 242 >gi|239990323|ref|ZP_04710987.1| hypothetical protein SrosN1_23653 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 67/248 (27%), Gaps = 37/248 (14%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLS----GSMHCAMNSDPEDVNSAPICQDKKRT 226 + +V+ + +R ++ S + V K Sbjct: 300 RPVVAAARPADPLAPEQRRELPFPGTRSVADGLLSSYEHRLRRPSRTVYVLDTSGSMKGR 359 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ----YVTR 282 ++A LK+AL +E + L+ + + V++ + + Sbjct: 360 RLAQLKSALNGLTGDF------REREQVTLLPFGSTVKQVRTHTVDPADPKAGPAAIRAD 413 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T ++ AY L D + +F + I+ +TDGEN Sbjct: 414 AAALSAEGDTAIYSSLAAAYDHLGPDTESAFTS-----------------IVLMTDGENT 456 Query: 343 NFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---S 397 +S + + + L + ++ + S Sbjct: 457 AGRSAAEFGAFYRALPEARRVTPVFPVVFG-DSDRSELEAIAALTGGRLFDGTKEEGPGS 515 Query: 398 LIHVFQNI 405 L F+ I Sbjct: 516 LDGAFEEI 523 >gi|193783708|dbj|BAG53619.1| unnamed protein product [Homo sapiens] Length = 868 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|219883045|ref|YP_002478209.1| hypothetical protein Achl_4441 [Arthrobacter chlorophenolicus A6] gi|219862051|gb|ACL42392.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 327 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 39/138 (28%), Gaps = 2/138 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + E+ ++I A +++ L I + ++ ++SA +AA LA + + + Sbjct: 11 SDERGAVAVIVAFAMVGILGFAAVAIDTGALYSERAQLQSAADAAALAISQECSKTRTCT 70 Query: 76 GDRFE--SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + A D A + S + + + Sbjct: 71 NPALKLPAAQELANANSNDGASTVSSLTYPTANSVKVQTTTKDGKTKAGSLALTFAPILG 130 Query: 134 NNRLDSSNNTIFYNMDVM 151 S + Sbjct: 131 IETDTVSATSTARWGSPA 148 >gi|55662683|emb|CAH74138.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665761|emb|CAH73557.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 845 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 87 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 146 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 147 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 185 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 186 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 234 >gi|55662684|emb|CAH74139.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665762|emb|CAH73558.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55957947|emb|CAI14068.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 3548 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 87 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 146 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 147 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 185 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 186 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 234 >gi|148886654|ref|NP_699197.3| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Homo sapiens] gi|296452942|sp|Q4LDE5|SVEP1_HUMAN RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=CCP module-containing protein 22; AltName: Full=Polydom; AltName: Full=Selectin-like osteoblast-derived protein; Short=SEL-OB; AltName: Full=Serologically defined breast cancer antigen NY-BR-38; Flags: Precursor Length = 3571 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|119579467|gb|EAW59063.1| hCG1794476, isoform CRA_b [Homo sapiens] Length = 1196 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|112180424|gb|AAH30816.1| SVEP1 protein [Homo sapiens] Length = 868 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|37222213|gb|AAQ89957.1| selectin-like protein [Homo sapiens] gi|68655017|emb|CAF04067.1| SEL-OB protein [Homo sapiens] Length = 3574 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|85374105|ref|YP_458167.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] gi|84787188|gb|ABC63370.1| hypothetical protein ELI_06390 [Erythrobacter litoralis HTCC2594] Length = 195 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/173 (9%), Positives = 47/173 (27%), Gaps = 4/173 (2%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA----ILAG 64 + + + ++ + F + + ++L+ + + Y + ++ A L G Sbjct: 4 SFLPRLLDDKRGVTIVEFGMVAPTLIVLLLGVFDITYNMYSSSMLQGTVQQAARNSALEG 63 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A M N + + N F + ES + + + ++ Sbjct: 64 ADSMAQNAAVETAVRQVAPNATFTHKRIAYTTFTEVGSAESFDDVNDDGICADGELFEDA 123 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + + + S V Y + + V + Sbjct: 124 NGNGIWDSDRGVVGSGGARDAVVYTVSIEYPRVVPIATFIGLDPNYSLDVQTV 176 >gi|85374479|ref|YP_458541.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] gi|84787562|gb|ABC63744.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] Length = 202 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 45/176 (25%), Gaps = 8/176 (4%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA-- 63 R K+ E I FA ++ F +++ L + Y ++ ++ A A A Sbjct: 2 RRTSLLKRIARREDGVTIIEFAFAMPVFAVILMALFDLGFQIYAQSIVQGAVQEAARAST 61 Query: 64 ----GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 G++ + + + I + F I E + S Sbjct: 62 LESGGSNSAALDDTVRKNVQTVIPGATLTFTRKNYANFEDVGIPEDFTDTSGSEDGICNN 121 Query: 120 NIVNSSRISMTHMANNRLD--SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 +R S++ F + + I Sbjct: 122 GEPFDDVNGNGVWDADRGADGLGGARDAVLYGASASFERVFPFHSFVPGMSKDVVI 177 >gi|47219688|emb|CAG12610.1| unnamed protein product [Tetraodon nigroviridis] Length = 717 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 52/173 (30%), Gaps = 22/173 (12%) Query: 234 ALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 L F+ S+ ++ + L+ Y+ R S + Sbjct: 475 TLCDFVRSLSVQFDINRDVAQLALVAYSRRATTVFNLDTHDSGSAVLTAIGEASYMGGVA 534 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + K L K ++ +TDG Sbjct: 535 STGTALLH-------------VHSDVLTVDKGARLGVNKAVVVVTDGSGGTDAVVP---- 577 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 K ++N + + + I ++LL+ S E+ V + + L F+++ Sbjct: 578 -AQKLRDNGVSVFVVGIG-DMQREKLLQI-AGSEEHLILVPSYEDL-KYFEDV 626 >gi|15963888|ref|NP_384241.1| hypothetical protein SMc04117 [Sinorhizobium meliloti 1021] gi|307315736|ref|ZP_07595255.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307320421|ref|ZP_07599838.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15073063|emb|CAC41522.1| Conserved hypothetical transmembrane protein [Sinorhizobium meliloti 1021] gi|306893987|gb|EFN24756.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306898627|gb|EFN29295.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 204 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 48/150 (32%), Gaps = 5/150 (3%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R F++ I K +I FA+ + F +++ I + + +A + LA + Sbjct: 16 RGLFRRLIGDRKGATAIEFAILALPFFIVVFASIETFIAFAGEQLLANATD--TLARKIR 73 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + +G + ++A D+ + E+ ++ +++ Sbjct: 74 TGEITTDIGKPGFTTETQFRQAFCDEIAIMMTCSATEAEQASKLHLDVRKLPADLSA--- 130 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + N D + + Y Sbjct: 131 FPKAVPRNGSDLDTSGFTFAPGGPNDYTMV 160 >gi|156399648|ref|XP_001638613.1| predicted protein [Nematostella vectensis] gi|156225735|gb|EDO46550.1| predicted protein [Nematostella vectensis] Length = 1841 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/335 (9%), Positives = 90/335 (26%), Gaps = 39/335 (11%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + + AI + + + + + + K ++ ++ + Sbjct: 1512 QDTMDKAIDSASYLNME-------IKTGKVLNFAQEHLFSRVPAGKQNVLLLITDGVSYD 1564 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 T+ + + + + + + + +L Q Sbjct: 1565 DVTKPAVQLQHKGVEIFCVGVGDNVVRKELETIASSPQSDHVIMTSYN--MLEATLEQIK 1622 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + + E R + D + + ++ + + T + +KN Sbjct: 1623 QKVCSSAAKKENTHRASNRMPHDCDQAADIGFLIDGSRTIE---VMGKGNFGTIIEFVKN 1679 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 +F ++ +G + Y + I G+ + + + + + T Sbjct: 1680 VTKMF-------PLSRDAFSVGAVVYGSEPSLEIPL--GSHNTSKGLLKALSKIKYPGTA 1730 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + Y L G + K ++ LT G S + + Sbjct: 1731 TKTGKALEYARLN------------LFGSRNARRKVPKILVVLTKG-----SSRDDIREA 1773 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + + + +V++ + N + L S P Sbjct: 1774 SEDLHRDGVHVVSVGMG-PVNDRMELANMASLPNK 1807 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 45/186 (24%), Gaps = 24/186 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + LD + L+ + L+ Y D S Sbjct: 634 DIPQMYFSHVLDYLKALARGFNASQTQVSLMAYGDDATAAFNFGSVNTTSEIDDGIDAVS 693 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + A+ +A + L ++ LT G ++ Sbjct: 694 YMGGEPKTGNALYKAKKGLFEVS---------------SRPGVSHALVLLTSGSASDEIG 738 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + +K+ I + N + L++ + + D+L I Sbjct: 739 PPALE-----LRNAGVKVTAIGLGKLANVKELIEIASEPKSSHVFTAF-LDTLPERMDEI 792 Query: 406 SQLMVH 411 + Sbjct: 793 VTGVCS 798 Score = 36.4 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 36/274 (13%), Positives = 82/274 (29%), Gaps = 28/274 (10%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE-MGERPIFLIELVVDLSGS 202 Y S + + +Q+ + A + E PI L+ S Sbjct: 1376 QSYPTVQPISNQDTFEQTPSMRDQQGFTAEQAQELAGMGANEQQETPIATEMLMSKQPIS 1435 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDL-LSHVKEDVYMGLIGY 259 + + + D K T ++ NA +L F + E +G++ Y Sbjct: 1436 SSGPTVKREDCSVNLGVVLDSKTTAYSSDDNARKMLQFAKKVTHLFPVSPEGNRVGMMVY 1495 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA--MKQAYQILTSDKKRSFFTNF 317 + + + + + + + +DS + + A + L S Sbjct: 1496 SGSPKLKV-VHFDHFLDQDTMDKAIDSASYLNMEIKTGKVLNFAQEHLFSRVPAGK---- 1550 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 Q ++ +TDG + + K + + ++I + + + + Sbjct: 1551 ------------QNVLLLITDGVSYDD-----VTKPAVQLQHKGVEIFCVGVGDNVVRKE 1593 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L S H + + + L + I Q + Sbjct: 1594 LETIASSPQSDHVIMTSYNMLEATLEQIKQKVCS 1627 >gi|86132310|ref|ZP_01050905.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817229|gb|EAQ38412.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 351 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 15/183 (8%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK A F+ ++ + + + + + ++++ + D + Sbjct: 130 QELKQASTSFITNVMPAVPTESFKMAIYWFDGEDVLHELQPLTTSATQLQEAIDGVTDDI 189 Query: 288 IL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A+ +A + + + F ++ TDG + + Sbjct: 190 SNDPSTDLYGAVIKAATNAENIVETLENEDLFAAA----------SVVIFTDGTDQAARY 239 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I TI + + + + +L + N++ L VF +IS Sbjct: 240 SEQEALDAVSNAGEEISFFTIGLGSEIDEE-VLTAIGKTGSAF--AENSNELEAVFNDIS 296 Query: 407 QLM 409 + Sbjct: 297 NGV 299 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/259 (11%), Positives = 78/259 (30%), Gaps = 31/259 (11%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + +FI +L G + + Sbjct: 224 DSFMYDTPEEVSAAFISSLTETIEGADADDEYSPGGQQNRKIVLDPAGSMNIYLVLDASD 283 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVR 277 + K L +D + ++ Y T + ++ S + V+ Sbjct: 284 SIGKNNFTGAKKCLSSLIDKVASYGVEPRY---AVVTYATEAKAVVKLSDKESSNADWVK 340 Query: 278 QYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 Q + + D + T++ A+ Y+++ + ++ + I+ Sbjct: 341 QELEKIKYSDHRLKAGTNTKKALTMLYEMMILQESQNDINWNKTRH----------VIVL 390 Query: 336 LTDGENNNFKSNVNTIKICDKA----------KENFIKIVTISINASPNGQRLLKTCV-- 383 +TDG N V I+ + +EN++ + I + +++ Sbjct: 391 MTDGNYNMGGDPVAAIEQIREFLDIGRNRKNPRENYLDVYVFGIGPLVDQEKINALASKK 450 Query: 384 SSPEYHYNVVNADSLIHVF 402 ++ + V + + L +VF Sbjct: 451 DGEKHVFKVKDMEDLENVF 469 >gi|325066442|ref|ZP_08125115.1| von Willebrand factor type A [Actinomyces oris K20] Length = 370 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 57/186 (30%), Gaps = 27/186 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + + + +G++ + + + + + E +R +T D L + P + T Sbjct: 133 SSVLTTFSEILEDFDGERVGIVAWNSAAQTIVPLTDDYELLRDQLTELGDVLDIDPENVT 192 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL---------PFQKFIIFLTDGENNNFKS 346 + AYQ + G + S + II TD + + + Sbjct: 193 YKQQLAYQEAFGGTVNTSINGSSLAGDGLASCAQAFDNQGLERSRSIILATDNQVIDPDN 252 Query: 347 NV--NTIKICDKAKENFIKIVTISINASPN-------------GQRLLKTC--VSSPEYH 389 E I++ +I A + + LKT Sbjct: 253 EQIYPLPDAAKLLAERKIRLFSIY-GADEDQPYQNLLDKTPEESREELKTVTEEQGKGRF 311 Query: 390 YNVVNA 395 Y+V ++ Sbjct: 312 YDVEDS 317 >gi|302869502|ref|YP_003838139.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504036|ref|YP_004082923.1| von willebrand factor type a [Micromonospora sp. L5] gi|302572361|gb|ADL48563.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315410655|gb|ADU08772.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 572 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 103/361 (28%), Gaps = 35/361 (9%) Query: 51 NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 ++ + AA L+ + N ++ R ++ +D + S Sbjct: 235 TAIANGLGAAALSEEDVIAYNSTKPPIRLAALYLEPAPIPLDYPFAVLPGIEPTKASAAR 294 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 +F + + ++ + Sbjct: 295 VLFEVLRTPGFKDRLASQALRAPDGNWGRGFQAPTGAPSPANGGASQVPPSGQGGAADLD 354 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +S + + + +D+SGSM + + +++ +AA Sbjct: 355 PGAISTATTTWSVATQSGRMLCV---IDVSGSMKKPVATAN--------GASREQVTVAA 403 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L LF DS + G + + + S +++ + S Sbjct: 404 ASQGLGLFDDSWSIGLWTFSTNLQGSQDW-SELVGIKPLSSNRGSLQRGLASIKPS--SG 460 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNV 348 T M AY+ + D + + I+ TDG E+++ S Sbjct: 461 NTGLYDTMLAAYKKVQQDWEPGKVNS----------------IVLFTDGKNEDDDGISQK 504 Query: 349 NTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNI 405 + +K K++ ++++ I I N L + + + + +F + I Sbjct: 505 ALLDQLNKLKDDEQPVQVIIIGIGTEVNRAELESITKVTGGGAFVTTDPSKIGEIFLRAI 564 Query: 406 S 406 + Sbjct: 565 A 565 >gi|299132281|ref|ZP_07025476.1| TadE family protein [Afipia sp. 1NLS2] gi|298592418|gb|EFI52618.1| TadE family protein [Afipia sp. 1NLS2] Length = 181 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 10/76 (13%), Positives = 28/76 (36%), Gaps = 1/76 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAILAGA 65 R F++ + + + ++ FA+ F L+ ++ V + +E+ + A L Sbjct: 11 LRDVFRRFGMNRRGSAAVQFAMVAPLFFALLFAIVEVAMMFFATQVLETGTQDTARLLLT 70 Query: 66 SKMVSNLSRLGDRFES 81 + ++ Sbjct: 71 HQAQDQQMTAEQIHDN 86 >gi|295395241|ref|ZP_06805449.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] gi|294972003|gb|EFG47870.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] Length = 324 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 16/174 (9%), Positives = 47/174 (27%), Gaps = 5/174 (2%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-TRDMDSLILKPTDSTPA 297 LD+ L + +G+ + + + E +++ + Sbjct: 112 LDTYADLIKSFKGERIGMTVFNSAAVSVFPLTTDYEMASEFLEDAQLGFESNGLRGINFY 171 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + + +IF TD ++ + A Sbjct: 172 QGTVDRSIDGSSLIGDGLASCLNNFDRNDEERSRSVIFATD-NQLAGNPIYELMEAAELA 230 Query: 358 KENFIKIVTI---SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K++ +++ + A+ L + + + + + V Q + Q Sbjct: 231 KKHKVRVYALAPKGFFAASKLDELKQAAELTGGEMFTMGSGGGARGVIQKVQQE 284 >gi|269929102|ref|YP_003321423.1| hypothetical protein Sthe_3201 [Sphaerobacter thermophilus DSM 20745] gi|269788459|gb|ACZ40601.1| Protein of unknown function DUF2134, membrane [Sphaerobacter thermophilus DSM 20745] Length = 341 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 5/48 (10%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 10 YFKK-GIASE-KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 ++ + + ++FA++++ + + G I + +K ++++ Sbjct: 5 TMRRWLWHARARGQVLVLFAVALVGMIGMTGLAIDLGYTFSQKRAIQN 52 >gi|238063244|ref|ZP_04607953.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885055|gb|EEP73883.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 265 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 62/191 (32%), Gaps = 41/191 (21%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +V F + + R + + VD+S SM + D K +++A Sbjct: 68 LVVGFARPSAEVRV-PRERATVMVAVDVSTSM---------------LASDVKPDRLSAA 111 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-K 290 K+A F+D + ++ +GL+ + + P E + + R + + Sbjct: 112 KDAARRFVDGL------PDEFNVGLVAFAGSAAVLVPPGTDREALHDGIERLAEGSTGVQ 165 Query: 291 PTDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A+ A + L S + II L+DG N + Sbjct: 166 GTAIGEAINTSLGAVRGLDSQAAKDLP---------------PARIILLSDGANTSGMDP 210 Query: 348 VNTIKICDKAK 358 + AK Sbjct: 211 MEAAAEAVDAK 221 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/248 (8%), Positives = 57/248 (22%), Gaps = 39/248 (15%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +K+ + + + + + + D + + + + K Sbjct: 4 KKVFTLVTLAVLTLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVEGERKFDL 63 Query: 231 LKNALLLFLDS-----------IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K + F S + K V + + Sbjct: 64 AKQEVFKFAQSLPKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPY-----EKESF 118 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T A++ A Q + + + + +TDG Sbjct: 119 ENSLNGLGPNGWTPIARALEHAKQT-----------------DEQLNNGTKHIVYLITDG 161 Query: 340 ENNNFKSNVNTIKICDKAKENF--IKIVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNAD 396 E +K+ + + + I ++ +L + + ++Y Sbjct: 162 EETCGGD---PVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTGK 218 Query: 397 SLIHVFQN 404 + + Sbjct: 219 EMGSILSA 226 >gi|302767296|ref|XP_002967068.1| hypothetical protein SELMODRAFT_408444 [Selaginella moellendorffii] gi|300165059|gb|EFJ31667.1| hypothetical protein SELMODRAFT_408444 [Selaginella moellendorffii] Length = 2121 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 62/209 (29%), Gaps = 35/209 (16%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V+D SGSM P +++ + A++ F+ + Sbjct: 1895 DAANHVVFVIDKSGSMGSPDAK--------PRMASFPPSRLGCVYEAMVRFIRTRQAA-- 1944 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +M ++ + + + V + T + ++ A +IL Sbjct: 1945 -GIRDFMSVVLFESSAVIAM---EKVAMAESQVYALLGHDSSGGTVYSAGLQLAEEIL-- 1998 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--ENFIKIV 365 + IIFL+DG N + K K E + + Sbjct: 1999 --------------GRSAGDAKAPAIIFLSDGGNAGGSD---PVAFVRKIKSMEPRLVVH 2041 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVN 394 TI + + +++L + + N Sbjct: 2042 TIVFGSDLSPRKVLVDMAREGGGVFQITN 2070 >gi|239833242|ref|ZP_04681571.1| TadE family protein [Ochrobactrum intermedium LMG 3301] gi|239825509|gb|EEQ97077.1| TadE family protein [Ochrobactrum intermedium LMG 3301] Length = 215 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 24/55 (43%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 L+R ++ ++ ++ F L ++ FLL++ I + + + +A Sbjct: 32 LARVGVCLRRFPRAQNGVAAVEFVLLIVPFLLIVFATIEIGVSFAARQVIANATE 86 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 63/184 (34%), Gaps = 36/184 (19%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVTRDMDSL 287 N L + V + ++ ++++ + + + R++ ++ Sbjct: 128 NELKFVKKLLSDFPVVPSATRVAIVTFSSKNNVVPRVDYISSSRAHQHKCSLLNREIPNI 187 Query: 288 I--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A +QA QIL ++ S K I +TDG +N Sbjct: 188 TYRGGGTYTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD 230 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQ 403 I ++ ++I T I R L S+P E+ Y + + + F+ Sbjct: 231 ----PRPIAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FE 280 Query: 404 NISQ 407 +++ Sbjct: 281 ALAR 284 >gi|119716454|ref|YP_923419.1| TadE family protein [Nocardioides sp. JS614] gi|119537115|gb|ABL81732.1| TadE family protein [Nocardioides sp. JS614] Length = 131 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 39/133 (29%), Gaps = 5/133 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F +++ E+ + +I + + LL+ + +++ K + A AA Sbjct: 2 FASLYRRS-RDERGSVTIQMVFLMPALFLLMFLGLQGALYYHAK---QVALAAAQEGARE 57 Query: 67 KMVSNLSRLGDRFESISNHAKRALID-DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +R + + D + + + + + + + + Sbjct: 58 AGSETGTRDAGVATANTFLQDAGGSDVMTSTSVSGSRTTTTATITVTGKSMSVIPGWHVT 117 Query: 126 RISMTHMANNRLD 138 + RL Sbjct: 118 VTQSASVPVERLT 130 >gi|332216203|ref|XP_003257234.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Nomascus leucogenys] Length = 890 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/249 (12%), Positives = 74/249 (29%), Gaps = 23/249 (9%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI---CQDKKRTK 227 + + ++ +++V N A + K Sbjct: 240 TDSLLDGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPVVPKNVAFVIDISGSMAGRK 299 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K ALL L+ + ++ ++ G + T + ++ R +V D Sbjct: 300 LEQTKEALLRILEDMKEEDYLNFILFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK- 357 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + + +L ++ +I LTDG+ N +S Sbjct: 358 --GMTNINDGLLRGISMLNKAREEHRVPE-----------RSTSIVIMLTDGDANVGESR 404 Query: 348 VNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVF 402 I+ A + + + N L + + + Y +AD L + Sbjct: 405 PEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLQLQGFY 464 Query: 403 QNISQLMVH 411 + ++ ++ Sbjct: 465 EEVANPLLT 473 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 53/185 (28%), Gaps = 26/185 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + K + Sbjct: 358 SNFRLMLEFISNVAKSFEITDIGAKIAAVQFTYDQRTEFSFTDYITKENVLAALRGIRYM 417 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + + + + F++ LTDG++ + Sbjct: 418 SGGTATGEAISHTTRNVF---------------GPVRDGGNKNFLVILTDGQSYDD---- 458 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 A++ I I +I I A + LK S+P+ + + L + ++ Sbjct: 459 -VRGPAVAAQQAGITIYSIGIAWAPLDD---LKDMASAPKETHTFFTREFTGLEQIVPDV 514 Query: 406 SQLMV 410 + + Sbjct: 515 VRGIC 519 >gi|207029558|ref|NP_001125590.1| inter-alpha-trypsin inhibitor heavy chain H3 [Pongo abelii] Length = 879 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKKEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|39963539|ref|XP_364917.1| hypothetical protein MGG_09762 [Magnaporthe oryzae 70-15] gi|145015458|gb|EDJ99994.1| hypothetical protein MGG_09762 [Magnaporthe oryzae 70-15] Length = 777 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/224 (11%), Positives = 60/224 (26%), Gaps = 32/224 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + LV+D+S SM M ED N + + +A L L+++ Sbjct: 49 TPTDLVLVIDVSPSMQTEMVVPTEDENQVRERFGFTV--LDLVGHACLTILETLTERD-- 104 Query: 249 KEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSL---------ILKPTDSTPAM 298 +G++ + R + + + D +L D + Sbjct: 105 ----RLGIVMFKGRATVLQGLTLQDPQAKERSAKYLGDLRRLSEKWHCNMLGERDVMDGL 160 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + Q+ + ++ +TD + + + + Sbjct: 161 QVGLQLFN-------------EVRDHSLPYRVPAVMLVTD-SHLDSTEYTKPVASLKETN 206 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 +I T + S + + ++ + F Sbjct: 207 PEKAQIHTFGFGYNSEAGVFKAFSEISGGRYTFIPDSSMIGTAF 250 >gi|75070765|sp|Q5RB37|ITIH3_PONAB RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|55728562|emb|CAH91023.1| hypothetical protein [Pongo abelii] Length = 876 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 281 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKKEDYLNFI 320 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 321 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 376 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 377 HRVPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 425 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 426 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 470 >gi|53719424|ref|YP_108410.1| hypothetical protein BPSL1811 [Burkholderia pseudomallei K96243] gi|134282410|ref|ZP_01769115.1| TadE family protein [Burkholderia pseudomallei 305] gi|217421983|ref|ZP_03453487.1| TadE family protein [Burkholderia pseudomallei 576] gi|52209838|emb|CAH35810.1| putative membrane protein [Burkholderia pseudomallei K96243] gi|134246448|gb|EBA46537.1| TadE family protein [Burkholderia pseudomallei 305] gi|217395725|gb|EEC35743.1| TadE family protein [Burkholderia pseudomallei 576] Length = 142 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 37/120 (30%), Gaps = 4/120 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + E+ S+ F L+ +L++ ++ K + +A+ A A Sbjct: 1 MKRLA----RLVHDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARA 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G V L+ ++S + ++ +S + Sbjct: 57 GVVVRVPQLAATDITNIALSYARGSLVSGGTVGAPVVNVDQSAGTSPGSPLKVTVSYTYQ 116 >gi|32473964|ref|NP_866958.1| hypothetical protein RB5901 [Rhodopirellula baltica SH 1] gi|32444501|emb|CAD74500.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 700 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 96/350 (27%), Gaps = 45/350 (12%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN-AAILAGASKMVSNLSRLGDRFESIS 83 + ++ ++++ + +++ + A++ A + +S Sbjct: 57 MVGSMLVHLVVILSLAL---------VQLQNPIDDEAVVMIAPPPEYEETVDLIEEIVVS 107 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + + + DA + + + + + D N Sbjct: 108 DQPQVEIGSDALAEFDMAEASAATFAEIANMVSPVD-----------LEPTDLGDIMVNK 156 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 +F + ++ E LI + D SGS+ Sbjct: 157 MFSQPVAPQDR--LTDQKGRVGQGTAGASGAVDQITFEVMQAAEERPTLIVWLFDQSGSL 214 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 D+ ++ L+ L D + V +IG+ +V Sbjct: 215 -------TRQRQDIRDRFDRIYEELGMLREQLDAKTAGEDPSDPSEARVLTSIIGFGEKV 267 Query: 264 EKNIE-PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + E P+ + ++Q V A++ A + S ++ + Sbjct: 268 QLFTEEPTADLDLIKQTVADIPV-DNSGTERVFTAIESAAKQYNSLRRSNGP-------- 318 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + +TD ++ ++I C ++ I + + + A Sbjct: 319 --RGPKRNVMFVVVTDERGDDAHLLESSIGSC---RKWGIPVYVVGVPAP 363 >gi|12655866|gb|AAK00631.1| complement factor B [Halocynthia roretzi] Length = 1084 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 79/247 (31%), Gaps = 23/247 (9%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 L N +S + + + + + D+S S+ + + Sbjct: 505 SLSNLPDQSLTISRTISASEVNI----FHYVIFIFDVSKSV-------TKKYQDFSSGIE 553 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + LKN + SI + + ++T V++ I+ + + Sbjct: 554 FAKRLIDRLKNFGGVLKYSIIAYASSNKTQLEITDRFSTNVKEVIKRLDNLDSQVKEAVS 613 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 ++ T + A+K ++ F ++ + + TDG +N Sbjct: 614 ELIEETRSGTATAKALKSLRDMML----------FMEHDIRNDQTNDKCHVFLFTDGMHN 663 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC--VSSPEYHYNVVNADSLIH 400 K+ V K K + I+ +IS P+ + + S PE + + + L Sbjct: 664 EGKNPVEVRKEMQKIFGSNIEFYSISAQEDPSPEAFEELIGLASEPENYIYIEDIHLLSS 723 Query: 401 VFQNISQ 407 ++ Sbjct: 724 YLDKVTD 730 >gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6] gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6] Length = 688 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 48/363 (13%), Positives = 99/363 (27%), Gaps = 60/363 (16%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRAL--IDDAKRFIKNHIKESLSGYSAVFY 114 +AA A S+ E S + L I + + + A Sbjct: 180 VDAAADAFWSRNNHVEQAFSFNLEVRSAYPIDGLRLPAHPNAIISQNPQGEHLVWQASVT 239 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 N + + + + + + + + + Sbjct: 240 NQQNAEEQSPNAKQTAFTLDKDITVYWR-LQEGLPGRLEAVSYRDPQQS----ERGTIKL 294 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +F P + + V+D SGSM K A L Sbjct: 295 TFTPGDDLSAIQQ--GRDWVFVLDKSGSM---------------------SGKHATLTEG 331 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPT 292 + L + + ++ + RV++ + V Q + T Sbjct: 332 VKRGLGKLPSGDRFR------ILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGT 385 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+++A L SD+ II +TDG N + Sbjct: 386 NLYDALERAVSGLDSDRTTG--------------------IILVTDGVANVGV--TEKKQ 423 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +++ T + S N L S + ++ N+D ++ N++ + H+ Sbjct: 424 FLKLMQRYDVRLYTFIMGNSANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQ 483 Query: 413 KYS 415 Y Sbjct: 484 AYR 486 >gi|149176271|ref|ZP_01854886.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] gi|148844873|gb|EDL59221.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] Length = 715 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 63/196 (32%), Gaps = 35/196 (17%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +A + A L FL+ +DL MG+I + ++ K+ + Sbjct: 528 SGSMSGEPLAESQKAALAFLEQVDLT-----HCSMGVIAVADSTQTVLDACQNASKIEKA 582 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V ++ +S A + L + +F+I L DG Sbjct: 583 VKSLSIGMVGCG-NSAQPFDTAMKKLKKVEG-------------------PRFVITLADG 622 Query: 340 ENNNFKSNVNTIKICDKAKE-NFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADS 397 + ++AK + +I I+I + L+ S + + Sbjct: 623 VWAD------QPHAVNRAKSLHSAEIDVIAIGFGDADKNFLRDIASCDEGSFF--TSLSG 674 Query: 398 LIHVFQNISQLMVHRK 413 L F +I+Q++ + Sbjct: 675 LSATFSSIAQVITKTQ 690 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/279 (9%), Positives = 74/279 (26%), Gaps = 43/279 (15%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + IP ++ + L ++ + ++ Sbjct: 414 PLDVLVVVDRSASM----KEGISQNDIPRDQAVKNALTGAGGL-LQKFININAENKLSVI 468 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + ++ K +++ +++ +K + K+ Sbjct: 469 GFQGSLNYNSREGKPERISWRSIIYQPSINNNKDADVLKNWESSSALNRDDLSYKDKN-- 526 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL----TSDKKRSFFTNFFRQGVKIPS 326 T+ A+ +A ++L ++ Sbjct: 527 --------------------GTNYHAALVKADEMLNKVADDGHRKIMVFVSDGVPTFYFG 566 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQR------L 378 + +D N S T D K+ + I ++ ++ N + Sbjct: 567 SDHYRAGNGTSDAS-NIKSSQDGTRAAIDDFKKKHPNLSIYSLGVSKDINSDTASSSPVV 625 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQL--MVHRKYS 415 LK S +++Y + N L + I + + H + S Sbjct: 626 LKYL-SGEDHYYGITNTVELEKIANKIVEDSKVSHLEIS 663 >gi|156358451|ref|XP_001624532.1| predicted protein [Nematostella vectensis] gi|156211319|gb|EDO32432.1| predicted protein [Nematostella vectensis] Length = 180 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 54/173 (31%), Gaps = 30/173 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + + + Y+ E +++ V K T + A+K Sbjct: 34 AESFPVSATNTQVATVVYSEEPELIFNFGKYNDINEIKTAVDNMP--YHGKTTHTGKALK 91 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + K + +I LTDG ++ K ++ Sbjct: 92 FTLEEVF----------------KKARKNVKNVLIALTDGH-----ASDLVKKPAQAVRD 130 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 I++ + + + L+ + P ++ +NV + D + ++ + + Sbjct: 131 YGIEVFAVGVGSPD--IAELEEIATDPDKDHVFNV-DFDEVSNITGGLENQIC 180 >gi|110833087|ref|YP_691946.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] gi|110646198|emb|CAL15674.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] Length = 556 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 41/123 (33%), Gaps = 3/123 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI--LAGASKMVSNLS 73 + + + F S+ L++ ++ + + +K +++A NA LA + + Sbjct: 8 KKQHGSALVWFV-SLTFIFLMVALVVDGSNLYNEKRRLQAAANAVASELAASGQTCFGSE 66 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 D + +N D R + ++ + ++ + ++ Sbjct: 67 LASDATDLSNNAETILARDYPGRDFVVTSAQVVTVDGVDGRYQVSPVVFDARESNGVAVS 126 Query: 134 NNR 136 + Sbjct: 127 LSS 129 >gi|255948704|ref|XP_002565119.1| Pc22g11730 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592136|emb|CAP98461.1| Pc22g11730 [Penicillium chrysogenum Wisconsin 54-1255] Length = 1029 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 70/237 (29%), Gaps = 23/237 (9%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED-VNSA 217 + + I S A I + + P D V Sbjct: 462 DYRGS-RGIQRQASINSSYGAGKSINTAITDYTNPDYASPDAEFPAINTVHIPLDLVVVI 520 Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 P+ + K+ L++AL + ++ MGL+ + + T K Sbjct: 521 PVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSW 573 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 ++ ++S+ + L +D + + P I+ ++ Sbjct: 574 AGWSKILESIRPVGQ----------KSLRADVVEGANVAMDLLMQRKFNNPVS-TILLIS 622 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 D ++ +S + + AK + I + + + +++ + + V + Sbjct: 623 DSSISDPESVDFVVSRAEAAK---VTIHSFGLGLTHKPDTMIELSTRTKGSYSYVKD 676 >gi|162452306|ref|YP_001614673.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] gi|161162888|emb|CAN94193.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] Length = 408 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 63/230 (27%), Gaps = 32/230 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-- 270 D + + + + +K AL LD I V ++ +++ + Sbjct: 103 DRSDSMTKDWEGSDRWTMMKGALGTALDVIRDRMSVGLQLFPSDEHCGMPAGEDLSVAVA 162 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS------------------ 312 G V T + T + A+ +A LT + Sbjct: 163 PGATSVPAIKTLLDGTDPGGATPTADALARALGYLTVGAGGALEGDKYVLLATDGGPNCN 222 Query: 313 ----FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + G + +V T++ + IK + Sbjct: 223 QDPAMTCEAATCTTNMDGDCPSGVPNCCVAGLTDVCLDDVRTVQRVKDLRAAGIKTFVVG 282 Query: 369 I-NASPNGQRLLKTCVSSPE-------YHYNVVNADSLIHVFQNISQLMV 410 I A+ L + V ++ VV+A SL I++ +V Sbjct: 283 IPGATAYAHVLDQLAVEGDTATSETSPRYFEVVDAASLGDTLTGITRDLV 332 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 48/158 (30%), Gaps = 20/158 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + I S ++ + +GL Y++ E S + + + Sbjct: 275 NNFKLIKEFLSNLISPFSIAEDKIRVGLSQYSSDPRTEWELSAYSTREQVLEAVRNLRYK 334 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + L +K +I LTDG++ + + Sbjct: 335 GGNTFTGLALTH-------------VLEQNLKPDAGARLEAEKLVILLTDGKSQDDANL- 380 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K I+I I + + + L+ S P Sbjct: 381 ----AAQTLKNLGIEIFAIGVKNADEAE--LRQVASEP 412 >gi|118093660|ref|XP_001235190.1| PREDICTED: similar to Bardet-Biedl syndrome 5 [Gallus gallus] Length = 1556 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 69/180 (38%), Gaps = 31/180 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +K+ + +D + + +GL+ Y+ V+ + T + V++ + + Sbjct: 761 FEIIKDFVTALVDRVT---VGRNATRIGLVLYSLEVQLEFGLNKHTTQQDVKRAIRKMQ- 816 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T + A+++A Q +G +K I LTDG+ + + Sbjct: 817 -YMGEGTYTGTAIRKATQ----------------EGFLGARTGVRKVAIVLTDGQADKRE 859 Query: 346 SNVNTIKICDKAKENFIKIVTISI--NASPNGQRL---LKTCVSSPE--YHYNVVNADSL 398 + V + +A I++ I I + P L S P+ + Y + + ++L Sbjct: 860 A-VKLDIVVREAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDDFNTL 918 >gi|195539501|ref|NP_001124213.1| inter-alpha (globulin) inhibitor H2 [Gallus gallus] gi|190576833|gb|ACE79193.1| inter-alpha inhibitor heavy chain 2 precursor [Gallus gallus] Length = 948 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 56/223 (25%), Gaps = 44/223 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-E 250 I V+D+SGSM KM A+ L + Sbjct: 312 NILFVIDVSGSMW--------------------GLKMKQTIEAMKAILSELRAADQFSLI 351 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D + + + +V T+ A+ +A IL + Sbjct: 352 DFNHNVRCWRDNLVSATP-----AQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAQN 406 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTIS 368 + + I+ ++DG+ K + K+ + + Sbjct: 407 -----------LGMLDPNSVSMIVLVSDGDPTVGELKLTTIQKNVKQSIKDE-YSLFCLG 454 Query: 369 INASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 I + L + + + + + + +S Sbjct: 455 IGFDVDYDFLQRIATDNRGMAQRIFGNQETSAQMKRFYNQVST 497 >gi|270008950|gb|EFA05398.1| hypothetical protein TcasGA2_TC015570 [Tribolium castaneum] Length = 873 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 71/263 (26%), Gaps = 49/263 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + L + + ++ +S Sbjct: 262 VVFVLDHSGSMGGRKYEQLKQAMDKILSDLN-PDDLFHIVRFSEIVSV------------ 308 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K ++ + + + + E Sbjct: 309 -------WNLEKNKFDKIR---------FKQMPDYENFDSVLAEFNLRDAIQVTE--DNI 350 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 +K + +D T+ + ++ ++ + N Q I Sbjct: 351 KKAKSIKDDIVDMAC---TNIIGGLVVGLYLVRRTLQKFYEKNIE--------TKHQPMI 399 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK---IVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N + I I ++S + L K + Sbjct: 400 IFLTDGLPNVGLIIRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 459 Query: 388 YHYNVVNAD-SLIHVFQNISQLM 409 + Y +A L + ++ +S L+ Sbjct: 460 HIYEAADAALQLQNFYRTVSSLL 482 >gi|91084773|ref|XP_972278.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 698 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/263 (11%), Positives = 71/263 (26%), Gaps = 49/263 (18%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + ++Y Q + L + + ++ +S Sbjct: 262 VVFVLDHSGSMGGRKYEQLKQAMDKILSDLN-PDDLFHIVRFSEIVSV------------ 308 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + ++ K ++ + + + + E Sbjct: 309 -------WNLEKNKFDKIR---------FKQMPDYENFDSVLAEFNLRDAIQVTE--DNI 350 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 +K + +D T+ + ++ ++ + N Q I Sbjct: 351 KKAKSIKDDIVDMAC---TNIIGGLVVGLYLVRRTLQKFYEKNIE--------TKHQPMI 399 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK---IVTISINASPNGQRLLKTCVSSPE--- 387 IFLTDG N + I I ++S + L K + Sbjct: 400 IFLTDGLPNVGLIIRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSR 459 Query: 388 YHYNVVNAD-SLIHVFQNISQLM 409 + Y +A L + ++ +S L+ Sbjct: 460 HIYEAADAALQLQNFYRTVSSLL 482 >gi|320159019|ref|YP_004191397.1| hypothetical protein VVM_02412 [Vibrio vulnificus MO6-24/O] gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O] Length = 688 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 48/363 (13%), Positives = 99/363 (27%), Gaps = 60/363 (16%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRAL--IDDAKRFIKNHIKESLSGYSAVFY 114 +AA A S+ E S + L I + + + A Sbjct: 180 VDAAADAFWSRNNHVEQAFSFNLEVRSAYPIDGLRLPAHPNAIISQNPQGEHLVWQASVT 239 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 N + + + + + + + + + Sbjct: 240 NQQNAEEQSPNAKQTAFTLDKDITVYWR-LQEGLPGRLEAVSYRDPQQS----ERGTIKL 294 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +F P + + V+D SGSM K A L Sbjct: 295 TFTPGDDLSAIQQ--GRDWVFVLDKSGSM---------------------SGKHATLTEG 331 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPT 292 + L + + ++ + RV++ + V Q + T Sbjct: 332 VKRGLGKLPSGDRFR------ILMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGT 385 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+++A L SD+ II +TDG N + Sbjct: 386 NLYDALERAVSGLDSDRTTG--------------------IILVTDGVANVGV--TEKKQ 423 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +++ T + S N L S + ++ N+D ++ N++ + H+ Sbjct: 424 FLKLMQRYDVRLYTFIMGNSANTPLLEPMTQVSNGFATSISNSDDILGHIMNVTSKLTHQ 483 Query: 413 KYS 415 Y Sbjct: 484 AYR 486 >gi|290976446|ref|XP_002670951.1| predicted protein [Naegleria gruberi] gi|284084515|gb|EFC38207.1| predicted protein [Naegleria gruberi] Length = 1082 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 54/182 (29%), Gaps = 27/182 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L S + ++ Y + ++ ++ + + TD Sbjct: 149 VCTSLKHFAQQSFNNPAIICRMVAYESSAKEI----DMKGTLQSIIRNIETAFTGGGTDF 204 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN----FKSNVNT 350 A + A I+T + + F I FLTDGE+ + Sbjct: 205 ASAFQLACTIITRESGQDRENLPFGNV----------VITFLTDGEDFSKVGKPGGLQYL 254 Query: 351 IKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADS-------LIHV 401 + ++ I I T+ + N + L ++ + + D+ L + Sbjct: 255 SEEINRVYRGDITIHTVGFGSHHNLELLDNIRKVGTIEGAYRYANYDDNNDVICGKLTSI 314 Query: 402 FQ 403 F Sbjct: 315 FD 316 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 73/225 (32%), Gaps = 39/225 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A KN L F++ + + G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGAGN--------FTRAKNCLRDFIEKVASYGVKPKY---G 310 Query: 256 LIGYTTRV----EKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + S + V + + + D ++ T++ A++ Y ++ Sbjct: 311 LVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMM---- 366 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF---- 361 II +TDG N V+ D +++ Sbjct: 367 ----NWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPRE 420 Query: 362 --IKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVF 402 + + + N + + ++ + V + ++L VF Sbjct: 421 DYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 465 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 73/225 (32%), Gaps = 39/225 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A KN L F++ + + G Sbjct: 607 VLDPSGSMNIYLVLDGSDSIGAGN--------FTRAKNCLRDFIEKVASYGVKPKY---G 655 Query: 256 LIGYTTRV----EKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + S + V + + + D ++ T++ A++ Y ++ Sbjct: 656 LVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMM---- 711 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF---- 361 II +TDG N V+ D +++ Sbjct: 712 ----NWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPRE 765 Query: 362 --IKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVF 402 + + + N + + ++ + V + ++L VF Sbjct: 766 DYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 810 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 73/225 (32%), Gaps = 39/225 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A KN L F++ + + G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGAGN--------FTRAKNCLRDFIEKVASYGVKPKY---G 310 Query: 256 LIGYTTRV----EKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + S + V + + + D ++ T++ A++ Y ++ Sbjct: 311 LVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSMM---- 366 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF---- 361 II +TDG N V+ D +++ Sbjct: 367 ----NWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPRE 420 Query: 362 --IKIVTISINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVF 402 + + + N + + ++ + V + ++L VF Sbjct: 421 DYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVF 465 >gi|327260860|ref|XP_003215251.1| PREDICTED: collagen alpha-2(VI) chain-like [Anolis carolinensis] Length = 1019 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/162 (11%), Positives = 50/162 (30%), Gaps = 17/162 (10%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + + L+ Y + E ++ ++ + + ++ A+ Sbjct: 861 TLARSNSDNMNARIALLQYGSENEHVVAFPLTYNITEISDALAQIRYLDS--SSNLGSAI 918 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L R + + +F+TDG N + + D K Sbjct: 919 IYAVNNL---------VINPRDRQRAARRNAELSFVFITDGITGNKNLD----EAIDSMK 965 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + + L K + + + SL Sbjct: 966 KQNVMPTVVALGSDVDMDVLHKISLGDQTAIFREKDYASLSE 1007 >gi|253701737|ref|YP_003022926.1| hypothetical protein GM21_3141 [Geobacter sp. M21] gi|251776587|gb|ACT19168.1| conserved hypothetical protein [Geobacter sp. M21] Length = 383 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/201 (10%), Positives = 51/201 (25%), Gaps = 1/201 (0%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAK 87 + ++ FL+L G I + + ++ + A L GA + L Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKKRLLLQAQHSPGKLAQVL 67 Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 + A R + SA+ + +T + + S + T Sbjct: 68 ADPLQSAARSVAVDTATGKHSASALVALMNDNGNALTENNDIT-VGFWNMSSRSYTPGGT 126 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 R + V+ I + + + ++ A Sbjct: 127 PVNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAICAAA 186 Query: 208 NSDPEDVNSAPICQDKKRTKM 228 +++ + + Sbjct: 187 CEPSCGYPDVCSIPERRMSHL 207 >gi|326798073|ref|YP_004315892.1| von Willebrand factor type A [Sphingobacterium sp. 21] gi|326548837|gb|ADZ77222.1| von Willebrand factor type A [Sphingobacterium sp. 21] Length = 622 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/333 (12%), Positives = 92/333 (27%), Gaps = 54/333 (16%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L D + N + F + S S N ++ M + Sbjct: 141 LPSTEDYQKIKENGFVSPKKEPLSTFAVDVDGASYSNIRRFINNGQLPPKDAIRLEEMIN 200 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 L + N + S + +++ I+ + P Sbjct: 201 YFQYNLPAPQNNEPVAIHTELSQAPW----------NPHHRLLRIALKAKAIDAAKLPPA 250 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + ++D+SGSM ++ +K++L + +D + Sbjct: 251 NLVFLIDVSGSM-------------------DGPNRLPLVKSSLKMLVDQLRKED----- 286 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + ++ Y + P W EK+R + T +K AY + K+ Sbjct: 287 -RVAIVTYAGTARIKLAPVWANEKMRIK-NAIDELDAGGSTAGGAGLKMAYDLAREHFKK 344 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISIN 370 II +DG+ N SN + + +K +++ + + + Sbjct: 345 DGNN----------------RIILASDGDFNVGPSSNEDMETLIEKERQSGVSLSVLGFG 388 Query: 371 ASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVF 402 ++ + + + N Sbjct: 389 MVNLKDSKMELLANKGHGNYAYIDNLMEAKKAM 421 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 66/221 (29%), Gaps = 40/221 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L + + Sbjct: 283 NVVFVIDVSGSM--------------------SGRKLVQTKEALLKILSDVKKDDFLNFI 322 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ + T E + + E ++ T+ + + +++ ++ Sbjct: 323 LFSSDV--RTWKENLVPATP--ENLKAAEEFVHQIQATGGTNINDGLLRGIEMVNKAREM 378 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 II LTDGE N +S V I+ A + + Sbjct: 379 GTVL-----------DRSTSIIIMLTDGEANVGESRVEKIQENVRNAIGGKYPLYNLGFG 427 Query: 371 ASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 N L + + + Y +A+ + + ++ Sbjct: 428 YDVNYNFLERMALENNGVARRIYEDSDANLQMQGFYDEVAN 468 >gi|126723497|ref|NP_001075477.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Oryctolagus cuniculus] gi|75056157|sp|Q9GLY5|ITIH3_RABIT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|11041718|dbj|BAB17302.1| inter-alpha-trypsin inhibitor heavy chain3 [Oryctolagus cuniculus] Length = 903 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 74/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K+ALL L+ + H+ Sbjct: 284 NVVFVIDVSGSM--------------------YGRKLEQTKDALLKILEDMREEDHLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ + T + ++ R +V D T+ + + +L + ++ Sbjct: 324 LFSSDVT-TWKEHLVQATPENLQEARAFVKSIQDQ---GSTNLNDGLLRGISMLNTAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N+ +S I+ A + + Sbjct: 380 HRVPE-----------RSTSIVIMLTDGDANSGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYHYNV-VNAD---SLIHVFQNISQLMVH 411 + N L + + + + ++D L ++ ++ ++ Sbjct: 429 NNLNYNFLESLALENDGFARRIYEDSDANLQLHGFYEEVANPLLT 473 >gi|256599878|pdb|3IBS|A Chain A, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron gi|256599879|pdb|3IBS|B Chain B, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron Length = 218 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/225 (12%), Positives = 66/225 (29%), Gaps = 58/225 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D +D + +G I + + + Sbjct: 19 DISNSXLAQDVQPSRLEKAKRLISRLVDELD-------NDKVGXIVFAGDAFTQLPITSD 71 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 72 YISAKXFLESISPSLISKQGTAIGEAINLATRSFTP------------------QEGVGR 113 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 II +TDGEN+ + +K I++ + + Sbjct: 114 AIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGXPEGAPIPVEGTNDYRRDREG 169 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 N + + V N++S ISQ + Sbjct: 170 NVIVTRLNEGXCQEIAKDGKGIYVRVDNSNSAQKA---ISQEISK 211 >gi|194391096|dbj|BAG60666.1| unnamed protein product [Homo sapiens] Length = 698 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFT 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|307591436|ref|YP_003900235.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986290|gb|ADN18169.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 441 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/240 (14%), Positives = 69/240 (28%), Gaps = 46/240 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + + + RP V+D SGSM + Sbjct: 15 ADTADQRLFIMLKLRPTKEVSNSRPSTSFAFVIDTSGSMDEVVTGGKSK----------- 63 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTR 282 K+ ++ L + + +D ++ +I + + I + T+ ++ + R Sbjct: 64 -------KSIVIESLYQLVRSGRLTQDDHIAIIEFHDQASTLIGLTPATQVFQLENAIAR 116 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D T M +A +LT + ++ TDGE Sbjct: 117 LNDF--SGGTCMGKGMNEALVLLT------------------NQSMTSRRVLIFTDGETF 156 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL-LKTCVSSPEYHYN--VVNADSLI 399 + + I + I I + + N L K + Y + NA L Sbjct: 157 DEED---CEIIAQQFSNQGISITAMGVGDEFNEDLLIHKISDPTGGRLYPLVIGNATGLQ 213 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 56/195 (28%), Gaps = 23/195 (11%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL-IGYTTRVEKNIEPSW 271 D + + + + K + AL+ ++ + ++V Y + E + Sbjct: 177 DASGSMLARLDGTPKTVIARQALIALINRLPETTNVALRTYGHRRADDCSDTELIQALAP 236 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + T ++ Q L + Sbjct: 237 LQRDALIARINAIRPVNGGRTPIAQSLADMAQDL-------------------AGIEGNV 277 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYH 389 I+ ++DG+ V T + A ++I I + +RL V+ + Sbjct: 278 LIVLVSDGDETCGGDPVATASMLRAANSQ-LRISVIGFDVEQEEWRRRLEGIAVAGGGAY 336 Query: 390 YNVVNADSLIHVFQN 404 ++ NA+ L Sbjct: 337 FDASNAEQLADALDQ 351 >gi|260803822|ref|XP_002596788.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] gi|229282048|gb|EEN52800.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] Length = 547 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 54/168 (32%), Gaps = 27/168 (16%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 AL +D + + +D+ +G+I Y I T Sbjct: 62 ALQFIIDFLQCFTD--QDIGIGVILYNCVPRTGIPL-------------------GMYTI 100 Query: 294 STPAMKQAYQILTSDKK--RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN---NFKSNV 348 S P + A LT + R+ F + I L+DG N ++ Sbjct: 101 SNPGLPFAISNLTQEGGLSRTGHALSFMTDTSKFRTGIPRTAILLSDGFPQSDANAQAMD 160 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEYHYNVVNA 395 + + A++ I + + + A+ +L+T S + + N Sbjct: 161 DYEAQAEAARDAGIDLYAVGVGAAGLVNWDVLETITGSSDRVFRSDNP 208 >gi|194381994|dbj|BAG64366.1| unnamed protein product [Homo sapiens] Length = 1207 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 55/183 (30%), Gaps = 22/183 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + RQ + Sbjct: 1044 RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLK-DFSTKRQIIDAINKV 1102 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G++ Sbjct: 1103 VYKGGRHANTKVGLEH-----------LRVNHFVPEAGSRLDQRVPQIAFVITGGKSVED 1151 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +V+ + +K+ + + + + + + K S+ + V N L Sbjct: 1152 AQDVSL-----ALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSE-LSE 1203 Query: 405 ISQ 407 IS Sbjct: 1204 IST 1206 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 58/204 (28%), Gaps = 21/204 (10%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 622 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 678 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + K + + F R Sbjct: 679 LNAHSSKDEVQ-NAVQRLRPKGGRQINVG---------NALEYVSGNIFKRPLGSRIEEG 728 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 +F++ ++ G KS+ + K+ + TI+ + + + L+K SPEY Sbjct: 729 VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSPEY 780 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 ++V L + Q + + Sbjct: 781 VFSVSTFRELPSLEQKLLTPITTL 804 >gi|163752614|ref|ZP_02159791.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161327493|gb|EDP98702.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 564 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 31/121 (25%), Gaps = 8/121 (6%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 T + Q S K + + V N+ +N + Sbjct: 192 YITDGVPTVDQSANSKIKGLPGVSSSDKFVNTSPNFTSYLPALAGWMNTNDVNTNAAFPE 251 Query: 353 ICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + TI + + L KT +Y+ +A L + ++ Sbjct: 252 M------QSVTTFTIGFSDGADDAAPLLTKTAELGGGEYYSAKSATQLQAALSQVFSQIL 305 Query: 411 H 411 Sbjct: 306 E 306 >gi|260800517|ref|XP_002595175.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] gi|229280419|gb|EEN51187.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] Length = 286 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/167 (10%), Positives = 48/167 (28%), Gaps = 20/167 (11%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ L + V + ++ Y + + D + T Sbjct: 20 MKTFIRQLFSYLDIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDDIEYMGGYTL 79 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ A + +K K + +TDG + + + Sbjct: 80 TGKAIDFATNLHFDLRKG-------------ARADVTKIAVVITDGRSYDDVNRP----- 121 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + ++ I + + + + + + L P+ ++ D L Sbjct: 122 ARRMRQAGIVTIAVGVGNNLDRDQ-LTAIAGDPKTLLSLDGFDRLQD 167 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 35/100 (35%), Gaps = 7/100 (7%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQ 278 TK+A LK A+ + + + +I +++ V + + +Q Sbjct: 276 SGSMAGTKLALLKRAMGFVIQHLGPSD------RLSVIAFSSTVRRLFHLRRMSHSGRQQ 329 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 + T+ A+K+A +++ ++ + Sbjct: 330 ALQAVNSLGAGGGTNIADALKKAAKVIEDRSYKNPVCSII 369 >gi|218673728|ref|ZP_03523397.1| hypothetical protein RetlG_20318 [Rhizobium etli GR56] Length = 176 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 57/157 (36%), Gaps = 19/157 (12%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M RF + ++ + K +I FA+ + ++I +I V + + A +A+ Sbjct: 1 MTARKRFAPF-RRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDAS 55 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + + + + + + K I + + + S S + + + Sbjct: 56 VHKISRMIRTG-----------EVASSKITLAGFKSKICDDMLLAFSCSSGLVVKVSVLS 104 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 ++S+ + + +L + Y++ + Y Sbjct: 105 DLSSATSADPIDNSGKLTVT---ETYDIGKGSDYILV 138 >gi|260800507|ref|XP_002595171.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] gi|229280414|gb|EEN51182.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] Length = 352 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/143 (11%), Positives = 41/143 (28%), Gaps = 18/143 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ + + +GL+ Y+ E + + T++ Sbjct: 25 QFLKKTVGDFNIGYDATQIGLVQYSDEAETIFALDSFSSPTSLRDAIETIQYTGGATNTG 84 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ Q + + + K I LT G S+ + Sbjct: 85 NALDYMVQYMFASRNG-------------ARQDSTKIAIVLTGGA-----SSDDIKAAAQ 126 Query: 356 KAKENFIKIVTISINASPNGQRL 378 + +++ + I I + + +L Sbjct: 127 RMRKSSVITYAIGIGSELDYDQL 149 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 40/137 (29%), Gaps = 18/137 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I E +G++ Y+ R I ++ + T + A+ Sbjct: 229 ISDFDLGPEATRIGVVVYSNRASLEISLDAYDDQEALQDAVADIAYPGGYTLTGAAIDYT 288 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S + + +K + LTDG + + + ++ Sbjct: 289 TTFAFSTRNGARDG-------------VRKVAVILTDGVSYDD-----PAEPAQSMRKAA 330 Query: 362 IKIVTISINASPNGQRL 378 I + I ++ + +L Sbjct: 331 IITYAVGIGSNLDRDQL 347 >gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens] Length = 543 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 72 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 111 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T R + ++Y+ + T+ A+ +A IL Sbjct: 112 DFNQNI-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEANN- 166 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 167 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 215 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 216 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 257 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 88 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 142 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 143 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 191 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 192 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 249 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 250 EEDAGSQLIGFYDEI 264 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 88 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 142 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 143 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 191 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 192 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 249 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 250 EEDAGSQLIGFYDEI 264 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|300021847|ref|YP_003754458.1| hypothetical protein Hden_0312 [Hyphomicrobium denitrificans ATCC 51888] gi|299523668|gb|ADJ22137.1| hypothetical protein Hden_0312 [Hyphomicrobium denitrificans ATCC 51888] Length = 210 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 14/134 (10%), Positives = 30/134 (22%), Gaps = 4/134 (2%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L +R R YF+ + A ++ FAL +L+ V + A Sbjct: 7 LAARIRLYFRTFASDTSAVAAVEFALIAPLLMLMTFGTFEVSRALVAHKRFQ----RATA 62 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + + + + ++ T+ Sbjct: 63 MVGDLVSREKQIGSSLSTANTALDGMLVSAQHAMEPFSATPLQIAITQLRASATDASATK 122 Query: 123 NSSRISMTHMANNR 136 S Sbjct: 123 VEWSWSYHSAPIKS 136 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 108 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 162 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 163 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 211 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 212 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 269 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 270 EEDAGSQLIGFYDEI 284 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 177 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 231 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 232 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 280 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 281 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 338 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 339 EEDAGSQLIGFYDEI 353 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 88 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 142 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 143 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 191 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 192 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 249 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 250 EEDAGSQLIGFYDEI 264 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + I + + + +R Sbjct: 184 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIR 238 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 239 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 287 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 288 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 345 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 346 EEDAGSQLIGFYDEI 360 >gi|302381763|ref|YP_003817586.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] gi|302192391|gb|ADK99962.1| TadE family protein [Brevundimonas subvibrioides ATCC 15264] Length = 181 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 19/55 (34%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 +++R R ++ E ++ FAL + L + M+ A Sbjct: 1 MIARARVGLRRFWRDESGVSAVEFALLAPVMIALYFGSAEFCQGFMAQKRMDHAT 55 >gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus leucogenys] Length = 946 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 64/222 (28%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDRFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 350 DFNQNV-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|296158802|ref|ZP_06841631.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] gi|295891007|gb|EFG70796.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] Length = 597 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 14/141 (9%), Positives = 45/141 (31%), Gaps = 4/141 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN--- 71 + +++ A+ + + +G + V + ++ + ++ + A +A + S Sbjct: 25 ARRQHGAIAMLAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQMIGSTGGC 83 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + ++ + + A S S Y N V ++++ Sbjct: 84 ATATTAAQQNAAANGFTAGSTTTISTTCGRWDTSSSTYFGTSGNPLNAVQVKTTQVVPYF 143 Query: 132 MANNRLDSSNNTIFYNMDVMT 152 + S + ++ T Sbjct: 144 FVGPSRNVSATATAFASNIDT 164 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 60/195 (30%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + + + + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFPNRIKVWKDHLISVTP-----DSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L S I+FLT Sbjct: 357 DGKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 464 EEDAGSQLIGFYDEI 478 >gi|332140408|ref|YP_004426146.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327550430|gb|AEA97148.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 44/401 (10%), Positives = 91/401 (22%), Gaps = 49/401 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE-IQNI 121 A A + RF +I + A T Sbjct: 301 ADAKDNSEEDVNGYPNTGGELSFNSSYNDYIGVRFRNVNIPQGAVITDAYLEFTAYSTRT 360 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI----VSFI 177 + + + +A++ + Y + + + + N +S I Sbjct: 361 YGNPSMRIRGVADDDASDFHPNRRYRLRNLPKTSG---ITWSMPDFYNNYVYRTPDISGI 417 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMH-CAMNSDPEDVNSAPICQDKKRT---------- 226 + G + + V+D S + P + + T Sbjct: 418 VKQIVDRSGWQSGNDMAFVMDNFVSYRGAHTYNSPSKAPKLIVKFNGAATPGASATVREH 477 Query: 227 ---KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-----TRVEKNIEPSWGTEKVRQ 278 K+ L + F +D L + Y V ++ S Sbjct: 478 LVSKIDEL--SANGFTPIVDTLLEAANYYGGRSVDYGRKRGENDVNWSVRRSTRVSHRSS 535 Query: 279 YVTRDMDS-------LILKPTDSTPAMKQAYQILTSDK----KRSFFTNFFRQGVKIPSL 327 Y+ D + T + ++ + + G + Sbjct: 536 YLGADSILPWGCTEDNLSDSDCITEQIPTPATYISPVSDLQCQTNNHIVLLSDGEANNNH 595 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI-------VTISINASPNGQRLLK 380 K L N + + + + TI A+ L Sbjct: 596 SVSKIQTLLGQSCTGNG-GEKCGLDLVRNISDTSTSVIGPRVITHTIGFAANNTANNFLN 654 Query: 381 TCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 S Y N+ L+ F I + + + + G Sbjct: 655 QLALQSGGGFYQADNSTDLLEAFNTILRSVKDVNATFVSPG 695 >gi|329898072|ref|ZP_08272282.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] gi|328920971|gb|EGG28395.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] Length = 490 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 46/158 (29%), Gaps = 27/158 (17%) Query: 255 GLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRS 312 G+ + V + + V T+ A+ A L Sbjct: 66 GVWLFGDGVRTLLPHVVIDEAWRDKAVESVRVIDNSGQRTNIPAALDAALYDLN------ 119 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV--------NTIKICDKAKENFIKI 364 ++ ++ LTDG+ + S V + + + + Sbjct: 120 -----------RLDPDYRTSVVLLTDGKVDVSSSAVANANAARMLLEEQAPILGQTGVPV 168 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 TI+++ + L K S+ +ADSL VF Sbjct: 169 HTIALSNEADWAFLRKLAESTQGLAEKAESADSLTRVF 206 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/248 (8%), Positives = 58/248 (23%), Gaps = 39/248 (15%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 +K+ + + + + + + D + + + + K Sbjct: 4 KKVFTLVTLAVLTLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVEGERKFDL 63 Query: 231 LKNALLLFLDS-----------IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K + F S + K V + + Sbjct: 64 AKQEVFKFAQSLPKDAKIRMSLFGSEGNNKNSGKAQSCEVIRGVYGVQPY-----EKESF 118 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + T A++ A Q + + + + +TDG Sbjct: 119 ENSLNELGPNGWTPIARALEHAKQA-----------------DEQLNNGTKHIVYLITDG 161 Query: 340 ENNNFKSNVNTIKICDKAKENF--IKIVTISIN-ASPNGQRLLKTCVSSPEYHYNVVNAD 396 E +K+ + + + I ++ +L + + ++Y Sbjct: 162 EETCGGD---PVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTGK 218 Query: 397 SLIHVFQN 404 + + Sbjct: 219 EMGSILSA 226 >gi|126314598|ref|XP_001371349.1| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 [Monodelphis domestica] Length = 3274 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/341 (8%), Positives = 96/341 (28%), Gaps = 37/341 (10%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 +A + + + I NH A A+ + I G S + Sbjct: 103 IANMTYLGGDSKTGRGLRYVIQNHLTPAAGSRARDGVPQVIVVLTDGRSQDDVAGPSAEL 162 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 +++ + + + + + +V+ + + + Sbjct: 163 KSAAAVDVVAIGVQDAEEG--ELKEMATEPLDLH-VFNLENFTALHDIVGNLVACVRSSM 219 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 ++ P ++ S + ++ + ++ AA+++ ++ Sbjct: 220 VADIAGAPEIPKDITAQESADLIFLIDGSDKTGSA----------HFAAIRD---FLVNF 266 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMK 299 ++ L + +G++ Y+ + + K + ++ T+ A+ Sbjct: 267 LERLPIGARQIQVGVVQYSNGPRTAFSLNSYSTKA-DVLDAVKALRVIGGEETNVGAALD 325 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +F + G + ++ ++ ++ + + K+ Sbjct: 326 F-----------VVENHFTQAGGSRVEEGVPQVLVLISGSQSTDDIRDGVV-----ALKQ 369 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I + + A + L+ + + + SL Sbjct: 370 --ASIFSFGLGAQGADRAELQHIATDENFVFTTPEFRSLGD 408 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 14/183 (7%), Positives = 53/183 (28%), Gaps = 21/183 (11%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + L +D + + + +GL+ Y + + + + Sbjct: 1650 DNFQEVLRFVSGIVDTIYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKEEILDAINKVVY 1708 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 ++ ++ + +F + + +T G S Sbjct: 1709 KGGRQANTLVGLEHLRK-----------NHFVPEAGSRIDQRIPQIAFVITGG-----SS 1752 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + + +K+ + + + + + + + S+ + V N L + + + Sbjct: 1753 VEDVEEATRALSQKGVKVFAVGV-RNVDLREVSRI-ASNSAIAFRVSNVQELSELSEQVL 1810 Query: 407 QLM 409 + + Sbjct: 1811 ETL 1813 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 36/355 (10%), Positives = 97/355 (27%), Gaps = 34/355 (9%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A+L ++ ++ + L++ A + + + L SA Sbjct: 1088 RAVLQAVRQLSPLGGASLHTGSALDFVLRNVLVESAGSRMAEGVPQVLIVLSAG----RS 1143 Query: 119 QNIVNSSRISMTHMANNRLDSS-NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 Q+ + + + ++ M Y F + + + + Sbjct: 1144 QDDIQGPAVLLKRSGAVPFGIGIGHSDDAEMRT---ISYVPDFAAVIPTFQQLAHVQQLV 1200 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + + E + + + DV + + ++ + Sbjct: 1201 SEKVILLSQEELQK-----IRPALPLPPPGAGSKRDVVFLIDGSQNTVQEFSYIRALIES 1255 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + + +I ++ + + K V Sbjct: 1256 LIEEL---DVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDDVQV-AVRRLRPKGGRQINVG 1311 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 A ++ + R + +G +F++ ++ G S+ + Sbjct: 1312 --SALDYVSKNIFRRPLGSRLEEG-------VPQFLVLISSGR-----SSDDVEDAAAHI 1357 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 KE + T++ N P L SPEY ++V L + Q + + Sbjct: 1358 KETGVAPFTVARNVDPEE---LVRISLSPEYVFSVSTFRELPSLEQKLLTPITTL 1409 >gi|110833078|ref|YP_691937.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] gi|110646189|emb|CAL15665.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] Length = 151 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 31/105 (29%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R ++ K ++ F L + L+ + M+ + + A A + Sbjct: 1 MRRLIRRVKKIVKGVVALEFLLVSPLVIGLVYAAATYGVLFSWQMRMQVSVDRAAAAATT 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + S G S++N A + + ++ Sbjct: 61 LDRNTTSDPGVMAASLANGALANNVPTFMGSVPADACVTVGAEVV 105 >gi|91977980|ref|YP_570639.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684436|gb|ABE40738.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 181 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 1 MHLL-SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-N 58 M L SR ++ + + + +I FAL F L+ +I V + +E+A + Sbjct: 2 MSLFVSRAAVLMRRFGRNRRGSAAIEFALIAPIFFGLLFAIIEVALMFFAGQVLETAVQD 61 Query: 59 AAIL 62 +A L Sbjct: 62 SARL 65 >gi|91215374|ref|ZP_01252345.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] gi|91186326|gb|EAS72698.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] Length = 689 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 55/193 (28%), Gaps = 24/193 (12%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEK 275 + KMA K A ++++++ + +I + + + + Sbjct: 284 SSGSMRGGNKMAQAKEASEFIVNNLNIGDNFN------VIDFDNNIVLFQPELVEYNIQN 337 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + L T+ + ++ A + + F Sbjct: 338 SNAALDFIENIVALGATNISESLVTAINQFEAGAEDKANIIVFFTDGG------------ 385 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVN 394 T+GE N ++ E I + T I L V + + + N Sbjct: 386 ATEGETNTQNILQLAEDTVNQI-ETEIFLFTFGIGEDVTTDLLTLLAVQNNGFVTFLGDN 444 Query: 395 A--DSLIHVFQNI 405 D + + + I Sbjct: 445 EIVDIISNFYLTI 457 >gi|282897345|ref|ZP_06305347.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] gi|281197997|gb|EFA72891.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] Length = 449 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 63/199 (31%), Gaps = 34/199 (17%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I +++D SGSM+ + K+A NA+ F + S Sbjct: 124 PRAWIIVLLDFSGSMNQIDSGGS--------------KKIAGAINAIREFT---KVSSDR 166 Query: 249 KEDVYMGLIGYT-----------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 D + ++ + + + S K++ + T+ Sbjct: 167 GGDTQISVVPFGEAGKNCPEYTVNKDTLDKFLSASDFKLQNSLEYLSGLNPCGSTNLYQP 226 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +K+A + L + F + II L+DG +N + ++ Sbjct: 227 LKKALEFL-----GNPEDPRFTLPENSSEPNPRLSIILLSDGYHNAMNEFQDFNELKSLL 281 Query: 358 KE-NFIKIVTISINASPNG 375 + I + T+ +P+ Sbjct: 282 QSYENITVHTLGYGLTPSQ 300 >gi|260425606|ref|ZP_05779586.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423546|gb|EEX16796.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 178 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 22/55 (40%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + ++ ++ + I FAL + + I + ++ ++E A + + Sbjct: 5 LKNALRRFRDDDEGSAVIPFALWTPLMVGMALSAIEMGALTVRQTALERALDQTV 59 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 64/242 (26%), Gaps = 32/242 (13%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 NTI + L + R + + + + E +E + Sbjct: 521 ASVVANTIAEENCKGLMSKLWTTYSHQLPSYRSSSSQIVQFTDSRKQQSSETTENKVEKL 580 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 + ++ + + + N L + V + + Sbjct: 581 GQYFKKNIRRLREKSLSLDLVFLVDESSSVGHSNFVNELRFVKKLLSDFPVVPSATRVAI 640 Query: 257 IGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 I ++++ + + + + + T + A +QA QIL Sbjct: 641 ITFSSKTNVQTRVDYISSSEPHQHKCSLLNREIPAIT--YKGGGTFTKGAFQQAAQIL-- 696 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + K I +TDG +N I ++ ++I T+ Sbjct: 697 ---------------RYSRSNSTKVIFLITDGYSNGGD----PRPIAANLRDLGVEIFTV 737 Query: 368 SI 369 I Sbjct: 738 GI 739 >gi|330503959|ref|YP_004380828.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] gi|328918245|gb|AEB59076.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] Length = 3977 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/297 (11%), Positives = 81/297 (27%), Gaps = 31/297 (10%) Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY- 156 + + + + + N + + T + LD +++ + + Sbjct: 3055 VSGNGQVLTAQTVGSHQPVFTVTLTNPATGAFTFELHAPLDHPAPA-SGSVENNLDFQFA 3113 Query: 157 --RLQFIEHLLNQRYNQKIVSFIP-----ALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + + + P + + +V+DLSGSM Sbjct: 3114 YQMVDGNGSTAQGALHISVDDDSPAQPKDIAKSASEPQGIHTNLMVVLDLSGSM------ 3167 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 + AP TK+A K+A+ +DS D L V + ++ + Sbjct: 3168 -----DDAPSGVSGFSTKLALAKDAVQRLIDSYDNLGD----VMVRIVTFANTASAVGNV 3218 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS-------DKKRSFFTNFFRQGV 322 + ++T ++ T+ A+ +A S + + Q Sbjct: 3219 WMTASDAKAWLTALANNAGNGSTNYDDALIKAMNAYDSTGKLTGTGVQSVSYFLSDGQPT 3278 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 + P + E + IK + + + + L Sbjct: 3279 LSNANPGSNNSGSQYNPELGDGIGAGEEADWIAFLTAKGIKSYALGMGLGSDLDKTL 3335 >gi|149725809|ref|XP_001495640.1| PREDICTED: similar to leukocyte adhesion glycoprotein [Equus caballus] Length = 1140 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/189 (10%), Positives = 57/189 (30%), Gaps = 25/189 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K + + ++ + + T K + + + + + Sbjct: 164 NDFQKMKEFVTIVMNQFKKSKTLFSLMQYSDTFQTHFTFKEFANNPNPGSLVRPINQL-- 221 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + +++ + L + K ++ +TDGE Sbjct: 222 ---GGRTHTATGIRKVVRELFHSRNG-------------ARKNALKILVVITDGEKFG-- 263 Query: 346 SNVNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIH 400 + + +A + I I + + + L T S P ++ + V N ++L Sbjct: 264 DRLEYEDVIPEADQEGIIRYVIGVGIAFSIEKSREELNTIASKPARDHVFRVNNFEALKT 323 Query: 401 VFQNISQLM 409 + + + + Sbjct: 324 IQNQLQEKI 332 >gi|241113142|ref|YP_002972977.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861350|gb|ACS59016.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 176 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 58/181 (32%), Gaps = 16/181 (8%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 RF ++ K +I FA+ + ++I +I V + + A +A++ Sbjct: 5 KRFAS-LRRLFGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFVNS----ALDASVHKI 59 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + + + + D K I N + S S S + + + ++S Sbjct: 60 SRMIRTG-----------EVASSKITLADFKARICNDMLLSFSCSSGLLVKVIVLSDLSS 108 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + + + L + L + + + +S LL Sbjct: 109 AASTDPIDDSGNLTVTETYDIGKGSDYILVQTFLPWTAVVNFFSLSSAKLSDGRYLLGSS 168 Query: 185 M 185 + Sbjct: 169 V 169 >gi|114683021|ref|XP_001148451.1| PREDICTED: collagen alpha-1(XX) chain [Pan troglodytes] Length = 1284 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 25/189 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 193 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 249 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q V K +I +TDG++ + Sbjct: 250 RYKGGNTFTGLALTH-------------VLGQNLQPVAGLRPEAAKVVILVTDGKSQDDV 296 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K+ + + + + + + L+ S P ++V++ L + Sbjct: 297 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPPRDITVHSVLDFLQLGTLAG 349 Query: 404 NISQLMVHR 412 +S+L+ R Sbjct: 350 LLSRLICQR 358 >gi|302536534|ref|ZP_07288876.1| VWA domain-containing protein [Streptomyces sp. C] gi|302445429|gb|EFL17245.1| VWA domain-containing protein [Streptomyces sp. C] Length = 532 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 61/209 (29%), Gaps = 25/209 (11%) Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS--IDLLSHVKEDVYMGLIGYTT 261 + ++ ++ L++AL + ++ + L+ + Sbjct: 338 NELRRPSRTVYVLDTSGSMEEGDRIGRLRSALTDLTGTGSSGTGQRFRDREEVTLLPFGD 397 Query: 262 RV----EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 +V +EP + T ++K AYQ L +F + Sbjct: 398 KVKKVLTHVVEPGNPGPALDAIRGDVKSLRPEGGTAVYASLKAAYQHLGEGNADAFTS-- 455 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 I+ +TDG++ + + ++ + + + + + Sbjct: 456 ---------------IVLMTDGQSGDKVKDFDSFYAGLPEAQKRTPVFAVLFG-DSDRKE 499 Query: 378 LLKTCVSSPEYHYNVVNA-DSLIHVFQNI 405 L + ++ + SL F+ I Sbjct: 500 LTHITELTGGRLFDATDGNSSLAGAFEEI 528 >gi|299067814|emb|CBJ39025.1| conserved protein of unknown function [Ralstonia solanacearum CMR15] Length = 348 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 11/298 (3%), Positives = 49/298 (16%), Gaps = 32/298 (10%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +I +++ + + + +N +++ Sbjct: 11 ARRQRGAVGLIMPAMLIAIFSVGALAVDIARLIVVRNELQN------------------- 51 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS-RISMTHMA 133 + A + ++ ++ + N+ S + + Sbjct: 52 ------AADAAALAGAAGLYPTNPTPNWSNGVTQGTSAVKLNKSSNVQLVSGTVQAGYWN 105 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + + ++ P + + Sbjct: 106 LTGTPAGLQSQSITPGANDVPAVQVTISRS-AGNNGGPVATLLAPIFGALSANSSATAVA 164 Query: 194 ELVVDLSGSMHCAMNSDPEDV-NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + S + K+ Sbjct: 165 VIAAPGSAGPGALFPIAISKCLYDLYWNYSTGQPKID-PSTGKPYVFQINTSYPTSSSSC 223 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G + + + T +K + + Sbjct: 224 TSGEWTGFNGPTDASTLKGLVQNGNTSTLSIGNMI---NTALATGVKSSVYMAIPSTP 278 >gi|108758819|ref|YP_634592.1| hypothetical protein MXAN_6470 [Myxococcus xanthus DK 1622] gi|108462699|gb|ABF87884.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 914 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 20/196 (10%), Positives = 53/196 (27%), Gaps = 33/196 (16%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 RTKM ++ L ++ V + T + S + + Sbjct: 415 RTKMELAAEGVVAALTLLNPKDEVSVHMV------DTAAHEIFPLSPVEAGLP--LDAVA 466 Query: 285 DSLILKPT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 A++ + S + ++ +D ++ Sbjct: 467 RGFSGGGGIYVGEALRAGRTEIL------------------RSEKPTRHVLLFSDA-ADS 507 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + +E + + I + P+ L + Y +A SL + Sbjct: 508 EEPDDYQRTLAH-LREQEVTVSVIGLGVPSDPDADLLREVAHRGGGRVYFAEDAMSLPRI 566 Query: 402 FQNISQLMVHRKYSVI 417 F + + + + + Sbjct: 567 FSQ--ETLTVARATFV 580 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 26/190 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTR 282 TK+ K+AL L + + I + + + +R Sbjct: 2 VGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIRDGKVY 56 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 TD A+++A ++L S I+FLTDG+ Sbjct: 57 IHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLTDGKPT 105 Query: 343 NFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNA 395 + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 106 VGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAG 163 Query: 396 DSLIHVFQNI 405 LI + I Sbjct: 164 SQLIGFYDEI 173 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 26/190 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVRQYVTR 282 TK+ K+AL L + + I + + + +R Sbjct: 2 VGTKLRQTKDALFTILHDLRPQDRFSIIGFSNRIKVWKDHLISVTP-----DSIRDGKVY 56 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 TD A+++A ++L S I+FLTDG+ Sbjct: 57 IHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLTDGKPT 105 Query: 343 NFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHYNVVNA 395 + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 106 VGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAG 163 Query: 396 DSLIHVFQNI 405 LI + I Sbjct: 164 SQLIGFYDEI 173 >gi|240172543|ref|ZP_04751202.1| hypothetical protein MkanA1_24733 [Mycobacterium kansasii ATCC 12478] Length = 963 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 60/189 (31%), Gaps = 33/189 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-----QYVT 281 K+ A + +D++ ++ + R+++ + G + + + Sbjct: 311 KIVAARRVSTRIVDTLSSTD------RFAVLTFGDRIDRAGGLADGLVEASDRHRYRAIE 364 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T+ ++Q +L + R ++ +TDG+ Sbjct: 365 HLARVDARGDTELLAPLRQGLSLLRGSQGRDA------------------VLVLITDGQ- 405 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + ++ D+ +++ T+ ++ + N L + + V + D L Sbjct: 406 -VGNEDQLLRELSDEL--QHVRVHTVGVDQAVNAGFLDRLANIGGGHCELVESEDRLDEA 462 Query: 402 FQNISQLMV 410 + + + Sbjct: 463 MHAMQRRIG 471 >gi|225012027|ref|ZP_03702464.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003582|gb|EEG41555.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 346 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 24/239 (10%), Positives = 66/239 (27%), Gaps = 73/239 (30%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I +D+S SM + +D ++ K + L+ + Sbjct: 91 DIVFAIDVSKSM---------------LAEDIAPNRLEKSKRLVSAILNQLAS------- 128 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKK 310 +G+I Y + + + + ++ L + T A+ + + + Sbjct: 129 DRVGIIAYAAQAVPQLPITTDYSAAKMFLQALNTEMLSSQGTALDSAIDLSGTFFDDEDQ 188 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + I ++DGE+++ ++ + IKI T + Sbjct: 189 TN------------------RVIFLISDGEDHSEDASNAASRA----AAMGIKIFTFGVG 226 Query: 371 ASP----------------------------NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N + L ++ + + + +++ Sbjct: 227 TEAGAPIPIKRNNVVESYKKDFNGEVVITKRNQETLEAIANATDGQYQDGNDTQAVLDF 285 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 43/137 (31%), Gaps = 6/137 (4%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLGD 77 +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 1 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAA 59 Query: 78 RFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + SN A + + + Y V +++ Sbjct: 60 AANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLG 119 Query: 135 NRLDSSNNTIFYNMDVM 151 S + +V Sbjct: 120 PSRTVSATSTAKATNVD 136 >gi|149732066|ref|XP_001492602.1| PREDICTED: similar to complement factor B [Equus caballus] Length = 768 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 72/225 (32%), Gaps = 39/225 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ M D D A KN L F++ + G Sbjct: 262 VLDPSGSMNIYMVLDGSDSIGAHN--------FTRAKNCLRDFIEKVASYGVKPRY---G 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + I S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATVPKVLIRVSQERSSDADWVTEKLNEISYEDHKLKTGTNTRKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 I+ +TDG +N V I +E Sbjct: 368 -----WEGNAPPEGWNRTRH--VILLMTDGLHNMGGDPVPVIHEIRDLLDIGRDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF 402 +++ I + N + + + + V + ++L VF Sbjct: 421 DYLDIYVFGVGPLVNQENINALASKKDGEQHVFKVKDMENLEDVF 465 >gi|332141921|ref|YP_004427659.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327551943|gb|AEA98661.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 44/401 (10%), Positives = 91/401 (22%), Gaps = 49/401 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE-IQNI 121 A A + RF +I + A T Sbjct: 301 ADAKDNSEEDVNGYPNTGGELSFNSSYNDYIGVRFRNVNIPQGAVITDAYLEFTAYSTRT 360 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI----VSFI 177 + + + +A++ + Y + + + + N +S I Sbjct: 361 YGNPSMRIRGVADDDASDFHPNRRYRLRNLPKTSG---ITWSMPDFYNNYVYRTPDISGI 417 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMH-CAMNSDPEDVNSAPICQDKKRT---------- 226 + G + + V+D S + P + + T Sbjct: 418 VKQIVDRSGWQSGNDMAFVMDNFVSYRGAHTYNSPSKAPKLIVKFNGAATPGASATVREH 477 Query: 227 ---KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT-----TRVEKNIEPSWGTEKVRQ 278 K+ L + F +D L + Y V ++ S Sbjct: 478 LVSKIDEL--SANGFTPIVDTLLEAANYYGGRSVDYGRKRGENDVNWSVRRSTRVSHRSS 535 Query: 279 YVTRDMDS-------LILKPTDSTPAMKQAYQILTSDK----KRSFFTNFFRQGVKIPSL 327 Y+ D + T + ++ + + G + Sbjct: 536 YLGADSILPWGCTEDNLSDSDCITEQIPTPATYISPVSDLQCQTNNHIVLLSDGEANNNH 595 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI-------VTISINASPNGQRLLK 380 K L N + + + + TI A+ L Sbjct: 596 SVSKIQTLLGQSCTGNG-GEKCGLDLVRNISDTSTSVIGPRVITHTIGFAANNTANNFLN 654 Query: 381 TCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 S Y N+ L+ F I + + + + G Sbjct: 655 QLALQSGGGFYQADNSTDLLEAFNTILRSVKDVNATFVSPG 695 >gi|299134237|ref|ZP_07027430.1| Protein of unknown function DUF2134, membrane [Afipia sp. 1NLS2] gi|298590984|gb|EFI51186.1| Protein of unknown function DUF2134, membrane [Afipia sp. 1NLS2] Length = 547 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 37/122 (30%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R ++ ++ E+ N + I A ++ + V K +SA + A + AS Sbjct: 9 RHLGRQFVSDERGNIAFIAAACMLLVTGCAALGVDVGSVFTDKRRAQSAADLAAIVAASD 68 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + + + + + ++A + ++ Sbjct: 69 LTRAPRAAAATVAKNNYPPDSLVAVETGVYKADTSLVPQQRFTAGATPANAVRVTMQTKT 128 Query: 128 SM 129 + Sbjct: 129 PL 130 >gi|119585666|gb|EAW65262.1| inter-alpha (globulin) inhibitor H3, isoform CRA_a [Homo sapiens] Length = 670 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|119585667|gb|EAW65263.1| inter-alpha (globulin) inhibitor H3, isoform CRA_b [Homo sapiens] Length = 890 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|114587338|ref|XP_516521.2| PREDICTED: inter-alpha (globulin) inhibitor H3 isoform 2 [Pan troglodytes] Length = 865 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 259 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 298 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 299 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 354 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 355 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 403 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 404 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 448 >gi|114587336|ref|XP_001172570.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1 [Pan troglodytes] Length = 890 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|288563|emb|CAA47439.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 885 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 280 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMKEEDYLNFI 319 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 320 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 375 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 376 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 424 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 425 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 469 >gi|296284151|ref|ZP_06862149.1| TadE-like protein [Citromicrobium bathyomarinum JL354] Length = 257 Score = 47.2 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 7/67 (10%) Query: 5 SRFRFY---FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES----AN 57 +R R + K+ + I FA+S+ FL L F I + ++ ++ Sbjct: 10 ARLRQFGRRLKRLRRDNRGLALIEFAISLPIFLGLGMFGIELANYAVTSMNVSQISLTIA 69 Query: 58 NAAILAG 64 + A G Sbjct: 70 DNAARMG 76 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 44/134 (32%), Gaps = 5/134 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAILAGASKMV 69 F++ A + + ++ FAL + FLL+I +I + Y+K + +A+ A L Sbjct: 4 FRRLAADGRGSVAVEFALFLPVFLLVIFSIIELGAAWYQKQMLVNASREGARLGALFSTS 63 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 L+ + + + + + + V N + Sbjct: 64 GGLTAQEVQERVNQYLSDS---GFPSQAVVQAVGVDGASGDPVTVNVSADYEFPVLSAFI 120 Query: 130 THMANNRLDSSNNT 143 + + S T Sbjct: 121 GAVP-GTISLSATT 133 >gi|254381319|ref|ZP_04996684.1| hypothetical protein SSAG_00986 [Streptomyces sp. Mg1] gi|194340229|gb|EDX21195.1| hypothetical protein SSAG_00986 [Streptomyces sp. Mg1] Length = 148 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 37/134 (27%), Gaps = 4/134 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ ++ A+ ++ L +IG + +A + A Sbjct: 9 RSRLREDRGGIAVYTAIVTIALLGIIGLALDGAGKLRATER----ADAVAMEAARAAGQA 64 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + A A+ ++ + SA + + ++ Sbjct: 65 IDPTQAVTGNGLRADPEAAAAVARAYLARAGTQGSVSLSADGTQLTVTVSDTYATKFLSV 124 Query: 132 MANNRLDSSNNTIF 145 N + S + Sbjct: 125 AGINSMGVSGHGSA 138 >gi|326501022|dbj|BAJ98742.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 284 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 63/180 (35%), Gaps = 28/180 (15%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + K+ L +++ + M + + + + I + + Sbjct: 76 SAFQYAKSFLYAYVNQTYD-----DLSIMSIHFFDSTFDPYIYYGNNRATILNMIQ--SK 128 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ + ++ + + K ++ LTDG + + Sbjct: 129 AYRGAGTATGNAINNSVALIKNKNFPNG---------------VPKILVILTDGGSYDS- 172 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 I+ + A++N I + + I ++ N +L++ S+ Y + + SL ++ I Sbjct: 173 ----VIEAANNARKNGIMLFCVGIGSNVNTAQLIQIAGSTSNIVY-ISSYSSLTNLVNLI 227 >gi|224065915|ref|XP_002191423.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain H3, partial [Taeniopygia guttata] Length = 869 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/283 (13%), Positives = 74/283 (26%), Gaps = 20/283 (7%) Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 N L ++ F S+ L N + F Sbjct: 189 GTFITNELQNTIKKTFSGKKGHISFKPTLDQQRTCANCSESVLDGDFTVRYDVKRTTPDN 248 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + ++ + + ++ K ALL LD I Sbjct: 249 LQIVNGYFVHFFAPTNLPKLSKNIIFVLDTSGSMSGREIEQTKEALLKILDDIKEDDFFN 308 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 ++ I T + ++ R++V T+ + + IL + Sbjct: 309 IILFDSEIS-TWKETLIKATPENLDEARKFVQHIS---AQGLTNLHGGLMRGIDILNAAH 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-IKICDKAKENFIKIVTIS 368 + + II LTDG+ N SN + KA + + + Sbjct: 365 EENLV-----------PKRSASIIIMLTDGQPNVGLSNTHEIENAVKKAIDGRYTLYNLG 413 Query: 369 INASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 + + L + + + Y +A L + +S Sbjct: 414 FGSGVDYGFLERMALENKGLARRIYPDSDAALQLQGFYDEVSN 456 >gi|322418525|ref|YP_004197748.1| hypothetical protein GM18_0996 [Geobacter sp. M18] gi|320124912|gb|ADW12472.1| hypothetical protein GM18_0996 [Geobacter sp. M18] Length = 389 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 43/178 (24%), Gaps = 17/178 (9%) Query: 28 LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA-----SKMVSNLSRLGDRFESI 82 + ++ FL++ G I + + + ++ + A L GA + + Sbjct: 15 IMLVVFLVVTGLAIDIGYMYVSEEDLQHSAEMAALTGAQTIKQRYLYQAQTDPARLPAIS 74 Query: 83 SNHAKRALIDDAKRFIKN--HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 S+ + + A + + + N + +M+ + + Sbjct: 75 SDPVQAPARNAAVDLVTGKHDAAALVGLLNNNGNALTGDNDITVGFWNMSSRSYTPGGTP 134 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 N + G P+ L+ Sbjct: 135 VNAMQVRTRRTAESSSV----------GLGTVGTFIAKISGTENFGSTPVATAALIPG 182 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/378 (10%), Positives = 94/378 (24%), Gaps = 69/378 (18%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 + KK A + + G + E+I + +A +D I Sbjct: 84 VATDIERLLAKKRK---ALERLASEAERLQKEHRWQDGIKEENIEYYNSKAEMDYDGEDI 140 Query: 99 KNHIKESLSGY---SAVFYNTEIQNIVNSSRISMTHMANNRLDSS---NNTIFYNMDVMT 152 + + L S V + + + + Sbjct: 141 DSQMSLKLDFVYDPSFKNQVNYSHTAVQIPTDIYKGAPVILNELNWTQALERVFIENSRD 200 Query: 153 SYDYRLQFIEHL----------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 Q + ++ + + G + ++VD+SGS Sbjct: 201 DPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGASSPKDMVILVDVSGS 260 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + +K ++ LD++ +V + + + Sbjct: 261 V--------------------SGLTLKLIKASVTEMLDTLSDDDYVN------VARFNEK 294 Query: 263 VEKNIE-----PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 E + + + TD A+ L + Sbjct: 295 AEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNT--------- 345 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 +P K I+ TDG + + +K +++ T S+ Sbjct: 346 -----NVPRANCNKIIMLFTDGGEDRAQDIFEQYNWPNKT----VRVFTFSVGQHNYDVT 396 Query: 378 LLKTCV-SSPEYHYNVVN 394 L+ S+ Y++ + + Sbjct: 397 PLQWIACSNKGYYFEIRS 414 >gi|240140255|ref|YP_002964733.1| hypothetical protein MexAM1_META1p3746 [Methylobacterium extorquens AM1] gi|240010230|gb|ACS41456.1| hypothetical protein MexAM1_META1p3746 [Methylobacterium extorquens AM1] Length = 204 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 13/126 (10%), Positives = 29/126 (23%), Gaps = 3/126 (2%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-AILAGASKMVSNL 72 +E +I FA ++L I + N + A A A LA + Sbjct: 22 FGRAEGGVSAIEFAFIAPILVILFIAAIEIPRAIATNNRLAQATIAMADLASKNDYADIN 81 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + ++ + + V + + + + Sbjct: 82 DVFAAAQVVAAPYSLAGAGIVLTAGGVYQVGNDF--VARVCSSVQSGDKARIVGSDIGPP 139 Query: 133 ANNRLD 138 Sbjct: 140 PAGTAS 145 >gi|242207819|ref|XP_002469762.1| predicted protein [Postia placenta Mad-698-R] gi|220731182|gb|EED85029.1| predicted protein [Postia placenta Mad-698-R] Length = 1006 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 55/188 (29%), Gaps = 29/188 (15%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 N + + +++ K A ++ + V +GL+ + V + Sbjct: 370 WNYRQRLKSRSMSRLDVAKQAFEALVNKSEAFD---HPVGIGLVTFGREVREV------- 419 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP----SLPF 329 T ++ +L K ++ + Sbjct: 420 -------------QTDGDTPLYDSLVVGRSMLERFKAQASSYQILLLTDLTDPMEKNPSA 466 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 II LTDGE+ + + + + + I + +I + S + V++ Y Sbjct: 467 ALRIICLTDGEDVGSRVT--AAAVAAQLQRSGIILDSIVVQTSALVNAIHPIAVATGGYS 524 Query: 390 YNVVNADS 397 + V + Sbjct: 525 FKVDTVEQ 532 >gi|167581951|ref|ZP_02374825.1| hypothetical protein BthaT_27659 [Burkholderia thailandensis TXDOH] Length = 142 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 34/111 (30%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + E+ S+ F L+ +L++ ++ K + +A+ A AG V L Sbjct: 6 DLVRDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGVVVRVPQL 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + ++S + ++ +S + Sbjct: 66 AAADIENVALSYAQGSLVSGGTVGAPVVYVDQSAGTSPGSPLKVTVSYTYQ 116 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 47/352 (13%), Positives = 94/352 (26%), Gaps = 44/352 (12%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + D ++ +A I I ++ S ++ V Sbjct: 252 GAPGEEDPVLKADIKVNLGLGVDKAAILSPYHAITLDEQQGGLTASLANRVRAERDFVLQ 311 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 R+ + + N + N +V +P + Sbjct: 312 WRLEQGVSPVAWVFNQNGKTHQTQAASDDGSMVNTAASTSASNIDNYSLVMVLPPKVEAS 371 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + LV+D SGSM + KNAL L + Sbjct: 372 GQLNLPRELILVIDTSGSMA--------------------GDSIIQAKNALRYALRGLKA 411 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + + V E + T+ + A++ A Sbjct: 412 QDSFNIIEFNSDVSLLSPV----PLPATAENLAIARQFVNRLQADGGTEMSLALEAALPK 467 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + S + +IF+TDG I + +N ++ Sbjct: 468 QRPSRAASENN-------------VLQQVIFMTDGS--VGNEEALFELIRHQIGDN--RL 510 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 T+ I ++PN + + + + D + Q I+QL+ +Y V Sbjct: 511 FTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVE---QKINQLLTKIQYPV 559 >gi|300692338|ref|YP_003753333.1| hypothetical protein RPSI07_2704 [Ralstonia solanacearum PSI07] gi|299079398|emb|CBM10247.1| conserved hypothethical protein [Ralstonia solanacearum PSI07] Length = 361 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 2/40 (5%), Positives = 16/40 (40%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 ++++ ++ ++ + I + +N +++ Sbjct: 12 SNQRGAVGVMMPFLLIIIFAIGVLAIDIARLLVVRNELQN 51 >gi|297683547|ref|XP_002819437.1| PREDICTED: collagen alpha-1(XIV) chain-like [Pongo abelii] Length = 1761 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|162447313|ref|YP_001620445.1| surface-anchored VWFA domain-containing protein [Acholeplasma laidlawii PG-8A] gi|161985420|gb|ABX81069.1| surface-anchored VWFA domain protein [Acholeplasma laidlawii PG-8A] Length = 486 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 72/237 (30%), Gaps = 45/237 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ ++++G+ P I +++D+SGSM K Sbjct: 123 NNETHLLLIGLETKQVDLGDIP-SNIVILLDVSGSM-------------------SATNK 162 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ K A+ L ++ + + Y + + + ++ T ++R Sbjct: 163 LSLAKKAMELLIEQMKPNDVISLVTYSSGEKVVFKGKSIDDMAYMTSQIR-------LLK 215 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T + AY++ II TDG+ N S Sbjct: 216 ASGSTAGKKGLDMAYKVAEEYFIEGGNN----------------RIILATDGDFNVGISS 259 Query: 347 NVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I+ + +E+ I ++L + + ++ + + S F Sbjct: 260 TDMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAF 316 >gi|158259621|dbj|BAF85769.1| unnamed protein product [Homo sapiens] Length = 1105 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|159040448|ref|YP_001539701.1| hypothetical protein Sare_4964 [Salinispora arenicola CNS-205] gi|157919283|gb|ABW00711.1| conserved hypothetical protein [Salinispora arenicola CNS-205] Length = 144 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 11/139 (7%), Positives = 34/139 (24%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + S+ A++ + L ++ ++ + A AG + + Sbjct: 6 LGRGDSGRVSVFLAIAALGILTVVALAFDGAGQLRALQRADNLASEAARAGGQSIDLVSA 65 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 G A+RA+ + S + + Sbjct: 66 IEGGPKRIDEPQARRAVAGYLAAAGASTHTVSFPVVDGETVVRVRVTVTYERSLLGLVGF 125 Query: 134 NNRLDSSNNTIFYNMDVMT 152 ++ ++ + + Sbjct: 126 DDTVNVTGEATARALTGGV 144 >gi|118364357|ref|XP_001015400.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89297167|gb|EAR95155.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 2720 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 15/200 (7%), Positives = 51/200 (25%), Gaps = 20/200 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR------VEKNIEPSWGTEKVRQYVTR 282 ++ D + + + + + + + + + +K+R ++ + Sbjct: 2509 NTIQTTFEQTRDILTEKGYDPQCFQIMISCFRSYNSKWEEIFQTSTWENNPDKLRSFLQK 2568 Query: 283 DMDSLILKPTDSTP----AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 T + A + D + + + + Sbjct: 2569 IT---ASGGTWPGESVEVGLWWANKQNDEDPISQVIILGDQPAHLMNEAQKFRNQFGQSY 2625 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 + K + C K K I + T + + + ++ + Sbjct: 2626 WDTTPLKGLTYYVPECQKLKNKNIPVNTFYLKQGAKS-TFENIAKLTNGISQYLDINSGQ 2684 Query: 397 ---SLIHVF-QNISQLMVHR 412 L +F + I + + + Sbjct: 2685 SSKQLTTLFVEQILKDIGKQ 2704 >gi|71051532|gb|AAH36192.1| COL14A1 protein [Homo sapiens] Length = 534 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 22/173 (12%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + E +GL Y+ + + K T + A+ Sbjct: 185 VTAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY- 243 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + K I +TDG++ + + +E+ Sbjct: 244 ------------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESG 286 Query: 362 IKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 +++ I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 287 VELFAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 337 >gi|326922791|ref|XP_003207628.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1224 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 69/180 (38%), Gaps = 31/180 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMD 285 +K+ + +D + + +GL+ Y+ V+ + T + V++ + + Sbjct: 789 FEIIKDFVTALVDRVT---VGRNATRIGLVLYSLEVQLEFGLNKHTTQQDVKRAIRKMQ- 844 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T + A+++A Q +G +K + LTDG+ + + Sbjct: 845 -YMGEGTYTGTAIRKATQ----------------EGFLGARTGVRKVAVVLTDGQADKRE 887 Query: 346 SNVNTIKICDKAKENFIKIVTISI--NASPNGQRL---LKTCVSSPE--YHYNVVNADSL 398 + V + +A I++ I I + P L S P+ + Y + + ++L Sbjct: 888 A-VKLDVVVREAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDDFNTL 946 >gi|238060728|ref|ZP_04605437.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237882539|gb|EEP71367.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 580 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 66/219 (30%), Gaps = 32/219 (14%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V+D+SGSM + S AA L LF DS + Sbjct: 382 MLCVIDVSGSMKQPVPSANNATREQVTV--------AAASRGLNLFDDSWSIGLWTFSTE 433 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G + Y + S ++ Q + T M AY+ + D + Sbjct: 434 LVGTLDYR-ELVPINLLSSNRSRLEQGLATI--RPSSGDTGLYDTMLAAYKTVQEDWEPG 490 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTIS 368 + ++ TDG+N + + + + +++V I Sbjct: 491 RVNS----------------VVLFTDGKNEDANGISQQKLLAELKQAADPERPVQVVIIG 534 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNIS 406 I + L + + + + +F + I+ Sbjct: 535 IGNDVSKSELDSITKVTGGGSFITEDPTKIGDIFLKAIA 573 >gi|70734188|ref|YP_257828.1| hypothetical protein PFL_0686 [Pseudomonas fluorescens Pf-5] gi|68348487|gb|AAY96093.1| conserved hypothetical protein [Pseudomonas fluorescens Pf-5] Length = 664 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/277 (9%), Positives = 76/277 (27%), Gaps = 8/277 (2%) Query: 11 FKKGI--ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 K+ A ++ ++ A++ LLL+ ++ + ++ ++ + A L S+ Sbjct: 4 LKRFYGPARQRGAIGLMAAVTFGLALLLMLLVVDSGRLYMEQRKLQRVADNAALEAVSRG 63 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + + L + A + + + S + + ++ Sbjct: 64 GTCQAGLTAAA----YAGQNATRNGFTVATGSSLSTSCGSLTTGANGLRTFTANPAQAVA 119 Query: 129 MTHMANNRLDSSNNT--IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + +A + + S + + +L +S L + Sbjct: 120 IRVIATHTVPISVASGVAALFTPGPINLTTQLSATAVAAAPTPTVAQLSIRSTLGTVSTA 179 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I + L GS+ + +N+ L A + ++ + Sbjct: 180 QSSILNPLVGGMLGGSLSLSAVGWNGLLNTNINLLSYLNQLAINLGVAAGNYTQLLNTTT 239 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 V + + + + + Sbjct: 240 SVTQLIQAAITVVQANGATADILTALGNLQVAAINAA 276 >gi|324508324|gb|ADY43516.1| C-type lectin protein 160 [Ascaris suum] Length = 696 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 61/200 (30%), Gaps = 26/200 (13%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S + +T + A+ +A + + +++D + L+ + T Sbjct: 36 DGSQSVGVTNFQTTLDAIVHAT-----QLTNIGPLEKDSRVALVVFDETAHIEFGLMNTT 90 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + R + M A + + + P+ K + Sbjct: 91 NADVEAAIRRSKYPNTNGVNIAAGMDAATRKIFN-------------HAYPPNRNAPKLL 137 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---NGQRLLKTCVSSPEYHY 390 I +T +N+ +S+V A+E I I+ + L+ P+ Y Sbjct: 138 IIIT---SNSDQSDVRAENS--LAREVGAIPAAIGIHNGNGIGASREQLEIATGDPKRVY 192 Query: 391 NVVNADSLIHVFQNISQLMV 410 + SL + Q + + Sbjct: 193 TTTDYSSLDDILQKLLCEIG 212 >gi|75812635|ref|YP_320253.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705391|gb|ABA25064.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 592 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 51/186 (27%), Gaps = 33/186 (17%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ +K A F++ +L + ++ + ++ + + + Sbjct: 58 ASSSMSDGKLTEVKTAATKFVERRNLTQD-----KLAVVSFGLDIQTATPLTDNADTLES 112 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + ++ T + A L + + I+ TD Sbjct: 113 AIASLSEA---GGTPMAQGLDAAIGELQA-------------------TFLSRNILLFTD 150 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G ++ + A+ I + I++ L + P + + Sbjct: 151 GVPDSQALASLS---AQSARSQRINL--IAVATGDADTNYLAQLTADPSLVFYANSG-QF 204 Query: 399 IHVFQN 404 F+N Sbjct: 205 DQAFRN 210 >gi|47227140|emb|CAG00502.1| unnamed protein product [Tetraodon nigroviridis] Length = 762 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 78/294 (26%), Gaps = 47/294 (15%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPALLRIEMGERP 189 + + D + + ++ + +I + + Sbjct: 198 ESGQWTGNEPACYFKHTYDTPLEVSQAF---GSAIKESLTTLESINDVQGERKIRISKNG 254 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I + VD+S S+ + K A+L + I S Sbjct: 255 TLNIYIAVDISESIQKDHV--------------------ESAKKAILKLITKISSFSVSP 294 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 L+ +++ + + + E + + + + + A++ + Sbjct: 295 NYE---LLFFSSELSEVVNILDFFENQPVDIKGRLTKFKVNEEHTGTDLNLAFKTILVRM 351 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT--------------IKICD 355 ++ + + II TDG N S + T + Sbjct: 352 -----ALIKQRVGEKAFEEHRHAIIVFTDGVYNMGGSPLPTVAKIKHMVYMNKIDEETGQ 406 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 ++ ++ I I A L + + + ++ +L F NI Sbjct: 407 NPRDEYLDIYIFGIGAEIYESDLRPLTAGTGGEHFFKLMEIQNLQETFDNIIDE 460 >gi|293344916|ref|XP_001079629.2| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|293356747|ref|XP_215375.5| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|149043683|gb|EDL97134.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1034 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE G ++++ + Sbjct: 68 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLMRMPSGRDELKASIDA 127 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 128 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 170 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADS 397 ++AK IK+ +++I RL + +++ + + + Sbjct: 171 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRL--SIIATDHTYRRNFTAADWGH 228 Query: 398 LIHVFQNISQLM 409 + ISQ + Sbjct: 229 SRDAEETISQTI 240 >gi|323474472|gb|ADX85078.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477209|gb|ADX82447.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 356 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 34/83 (40%), Gaps = 11/83 (13%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKT 381 II LTDG+ + ++ + K+ I I+TI I N + L K Sbjct: 122 QSQVPTKIIMLTDGKPTDKRNVKDYEKL-------DIPPNTQIITIGIGNDYNERILKKL 174 Query: 382 CVSSPEYHYNVVNADSLIHVFQN 404 S Y++ + L ++F++ Sbjct: 175 ADRSSGKFYHIKDISELPNIFES 197 >gi|322433071|ref|YP_004210320.1| hypothetical protein AciX9_4225 [Acidobacterium sp. MP5ACTX9] gi|321165298|gb|ADW71002.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 457 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 46/165 (27%), Gaps = 18/165 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 +++E + A+S++ L +G I V K ++SA + A A A ++ S Sbjct: 7 LVSNESGQVLVAAAVSLLVILGFLGISIDVGHRQLSKLRLQSATDTAATAAALEIRVCGS 66 Query: 74 RLGDRF---ESISNHAKRALIDDAKRFIKNHIKESLSGYS--------------AVFYNT 116 + S+ + + L+ Sbjct: 67 LVSCPAMQSAVQSSFIENGYPATPLLLNCATSSDDLTLVLNSPPCAMGSADPNYGKRGYV 126 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 E+Q + M + ++ + S + L Sbjct: 127 EVQVAQHVPTYFMKMLGISQFNISARSEAARNPGGACIY-ALDPT 170 >gi|302142112|emb|CBI19315.3| unnamed protein product [Vitis vinifera] Length = 626 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 52/168 (30%), Gaps = 22/168 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K+ LK A+ L + S+ + ++ ++ + + T ++ R Sbjct: 300 TGSKLQMLKRAMRLVISSLGPSD------RLAIVAFSASPRRLLPLRRMTAHGQRSARRI 353 Query: 284 MDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D L+ + + A+++A ++L ++R+ + K + Sbjct: 354 IDRLVCSQGSSVGEALRKATKVLEDRRERNPVASIMLLSDGQDDRVHSKAPNQRHVPAHV 413 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKTCVSS 385 + + I + + S + CV Sbjct: 414 SSTRFSHIE----------IPVHSFGFGESGGYSQEPAEDAFAKCVGG 451 >gi|225458992|ref|XP_002285586.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 738 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 52/168 (30%), Gaps = 22/168 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K+ LK A+ L + S+ + ++ ++ + + T ++ R Sbjct: 350 TGSKLQMLKRAMRLVISSLGPSD------RLAIVAFSASPRRLLPLRRMTAHGQRSARRI 403 Query: 284 MDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D L+ + + A+++A ++L ++R+ + K + Sbjct: 404 IDRLVCSQGSSVGEALRKATKVLEDRRERNPVASIMLLSDGQDDRVHSKAPNQRHVPAHV 463 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKTCVSS 385 + + I + + S + CV Sbjct: 464 SSTRFSHIE----------IPVHSFGFGESGGYSQEPAEDAFAKCVGG 501 >gi|260837103|ref|XP_002613545.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] gi|229298930|gb|EEN69554.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] Length = 184 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 14/136 (10%), Positives = 39/136 (28%), Gaps = 20/136 (14%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDS 294 + ++ + ++ Y + + + ++ T + Sbjct: 68 NFVATTTSDFQIGPDNAQVSVVQYESSPTEEFPLDR-YATLEDLLSAINLIPYRGGGTRT 126 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + K II +TDG++ + + Sbjct: 127 GRAIDYVVTTTLTVSRG-------------ARQGVPKVIIVVTDGQSGDD-----VREPA 168 Query: 355 DKAKENFIKIVTISIN 370 +AK++ I +V I + Sbjct: 169 RRAKQSGIIMVAIGVG 184 >gi|147816411|emb|CAN77438.1| hypothetical protein VITISV_007401 [Vitis vinifera] Length = 757 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 52/168 (30%), Gaps = 22/168 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K+ LK A+ L + S+ + ++ ++ + + T ++ R Sbjct: 369 TGSKLQMLKRAMRLVISSLGPSD------RLAIVAFSASPRRLLPLRRMTAHGQRSARRI 422 Query: 284 MDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 +D L+ + + A+++A ++L ++R+ + K + Sbjct: 423 IDRLVCSQGSSVGEALRKATKVLEDRRERNPVASIMLLSDGQDDRVHSKAPNQRHVPAHV 482 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLKTCVSS 385 + + I + + S + CV Sbjct: 483 SSTRFSHIE----------IPVHSFGFGESGGYSQEPAEDAFAKCVGG 520 >gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 710 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/229 (12%), Positives = 69/229 (30%), Gaps = 31/229 (13%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + ++ + G + + + ++ E +++ K Sbjct: 204 IIIETSIKDEDKGIAISSDGYIAISTYPYFEGKVENNSEFYFLIDCSGSMYGSRIENAKF 263 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLIL-KP 291 L L + S+ D +I + T E+ T K V+ + + D Sbjct: 264 CLNLLIHSL------PIDSRFSIIKFGTSYEEIFPICDYTNKNVKIAMRQIKDLDADMDG 317 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD ++ Y + + I LTDG+ +N Sbjct: 318 TDILSPLEYVYTQ-------------------TTKNGYHRKIFLLTDGQVHNSDVIC--- 355 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + ++N +I I + + + + + S + + + D++ Sbjct: 356 SLAQEKRDNN-RIYAIGLGSGADPGLIKNVSLKSWGNYVLIADKDNMNE 403 >gi|73958318|ref|XP_547049.2| PREDICTED: similar to integrin alpha X precursor [Canis familiaris] Length = 1149 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 51/167 (30%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + T D S + T + A++ L S Sbjct: 173 RPSTQFSLMQFSNNFRVHFTFEVFTYSSNPLALLDSVSQLGGLTHTATAIRIVTNELFSA 232 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFIKIVTI 367 K K +I +TDG+ + + I + + A I + Sbjct: 233 SKG-------------ARKDASKILIVITDGQKKGDSLGYEDVIPMAEAA---GIIRYAV 276 Query: 368 SINASPNGQRLLK---TCVSSPEY--HYNVVNADSLIHVFQNISQLM 409 + + + K S P + + V N D+L + + + + Sbjct: 277 GVGTAFQKMQSWKELNDIASKPSHEYIFKVENFDALRDIQNQLKEKI 323 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/170 (11%), Positives = 48/170 (28%), Gaps = 20/170 (11%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LI + V+ + + ++++ + + T + T + Sbjct: 113 LISFDINVDLLSDYTNSPREIKRSIDKATI-----NTGAGTGSVTGNS--TPKGTLLYDA 165 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---- 371 + K+ +K ++ LTDG + + + A++ + I I Sbjct: 166 VYLAAHDKLRQEAGRKILVMLTDGGDQGSQ--ETLKTATEAAQKANAIVYVILIADRGFY 223 Query: 372 ------SPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHRKY 414 + + + NV N L F I + + Sbjct: 224 SGGGFSFGGDRDMESLAHDTGGRVINVGNNGRKLEDAFDQIQDELRTQYL 273 >gi|209515958|ref|ZP_03264819.1| TadE family protein [Burkholderia sp. H160] gi|209503616|gb|EEA03611.1| TadE family protein [Burkholderia sp. H160] Length = 157 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 36/106 (33%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 +E+ ++ F L ++++ +I V K + +A+ A AG + L+ Sbjct: 25 AERGVVALEFVLVFPFLMMVLFGIIDVSLLLCDKAVITNASREAARAGVIVRIPQLTATQ 84 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 +++ K + + +S + ++ Sbjct: 85 ITNVALNYTQKNLVTGGTATTPTVTVDQSSGTSTGSPLTVKVTYTY 130 >gi|218693385|gb|ACL01196.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 108/361 (29%), Gaps = 27/361 (7%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 K++ ++ D ++ +A K ++ + + Sbjct: 251 AVEKLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFT 310 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 ++ + +++ + N + + + + + QK + +L Sbjct: 311 ATTHNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILET 369 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + LI V D +M A+N +P S + K+ + + + Sbjct: 370 QSSNARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFI 426 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + ++ + + GT + V + S++ + A+ Y Sbjct: 427 INGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPHNQLSVMS---NEGYAINSGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N + Sbjct: 484 YLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + ++ +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|91783018|ref|YP_558224.1| hypothetical protein Bxe_A2791 [Burkholderia xenovorans LB400] gi|91686972|gb|ABE30172.1| Predicted membrane protein [Burkholderia xenovorans LB400] Length = 595 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 12/140 (8%), Positives = 37/140 (26%), Gaps = 2/140 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + +++ A+ + + +G + V + ++ + ++ + A +A + S Sbjct: 25 ARRQHGAIAMLAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQLIGSTGGC 83 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKE-SLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + N A + I S + + N V + Sbjct: 84 ATATTAAQQNAAANGFTAGSTTTISTTCGRWDTSSNTYFGTSGNPLNAVQVKTTQVVPYF 143 Query: 134 NNRLDSSNNTIFYNMDVMTS 153 + + Sbjct: 144 FVGPSRNVSATATAFASNID 163 >gi|62897073|dbj|BAD96477.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] gi|62898698|dbj|BAD97203.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] Length = 890 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMQEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|133925809|ref|NP_002208.3| inter-alpha-trypsin inhibitor heavy chain H3 preproprotein [Homo sapiens] gi|166203665|sp|Q06033|ITIH3_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|77748471|gb|AAI07605.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|77748473|gb|AAI07606.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|78070482|gb|AAI07815.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] Length = 890 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 284 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMQEEDYLNFI 323 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 324 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 379 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 380 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 428 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 429 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 473 >gi|39652254|emb|CAC79611.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 886 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 72/225 (32%), Gaps = 40/225 (17%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 + V+D+SGSM K+ K ALL L+ + ++ Sbjct: 280 NVAFVIDISGSMA--------------------GRKLEQTKEALLRILEDMQEEDYLNFI 319 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++ G + T + ++ R +V D T+ + + +L ++ Sbjct: 320 LFSGDVS-TWKEHLVQATPENLQEARTFVKSMEDK---GMTNINDGLLRGISMLNKAREE 375 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISIN 370 +I LTDG+ N +S I+ A + + Sbjct: 376 HRIPE-----------RSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFG 424 Query: 371 ASPNGQRLLKTCVSSPEYH---YNVVNAD-SLIHVFQNISQLMVH 411 + N L + + + Y +AD L ++ ++ ++ Sbjct: 425 NNLNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLT 469 >gi|317403331|gb|EFV83845.1| hemolysin-type calcium-binding region [Achromobacter xylosoxidans C54] Length = 1141 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/329 (7%), Positives = 60/329 (18%), Gaps = 63/329 (19%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNH-AKRALIDDAKRFIKNHIKESLSGYSAVF 113 SA + A ++ F+ + D + + + Sbjct: 537 SAADIAAGKLVYTPSASGQDTSFGFQVRDDGGTANGGKDTSGDYNFAIKTNNFISGDNDG 596 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 T + +N + + + + + + + Sbjct: 597 SGTGTKPPINGGSGDDVILGDK----GGTVTTVEPGKNYNIAIVVDTSGSMSEASGTKGL 652 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 T+M + Sbjct: 653 ----------------------------------------------------TRMQLTID 660 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 AL +++ + +G + V + T+ Sbjct: 661 ALKNLANTLKGHDGIVNVALIGFESTASTKYTINGL--NASNVGDLIKAIEKLSASGGTN 718 Query: 294 STPAMKQAYQILTSDKKRSFFTNFF--RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 A +A + S F + F + + + Sbjct: 719 YEGAFDEAVKWFNKQPTSSNGQAFENVTYFLTDGDPTFSNRGSNGDWWSGGSTTNYYDMK 778 Query: 352 KICDKAK--ENFIKIVTISINASPNGQRL 378 DK K + I I N L Sbjct: 779 DAVDKFKGLSGKSTVHAIGIGTGVNEAYL 807 >gi|119595693|gb|EAW75287.1| collagen, type XX, alpha 1, isoform CRA_c [Homo sapiens] Length = 1329 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 200 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 256 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 257 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 303 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 304 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 337 >gi|66347353|emb|CAI95066.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595691|gb|EAW75285.1| collagen, type XX, alpha 1, isoform CRA_a [Homo sapiens] Length = 1297 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 200 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 256 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 257 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 303 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 304 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 337 >gi|112734845|ref|NP_065933.2| collagen alpha-1(XX) chain [Homo sapiens] gi|292495087|sp|Q9P218|COKA1_HUMAN RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|66347352|emb|CAI95065.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595695|gb|EAW75289.1| collagen, type XX, alpha 1, isoform CRA_e [Homo sapiens] Length = 1284 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 193 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 249 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 250 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 296 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 297 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 330 >gi|45946128|gb|AAH43183.1| Collagen, type XX, alpha 1 [Homo sapiens] Length = 1284 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 193 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 249 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 250 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 296 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 297 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 330 >gi|119595694|gb|EAW75288.1| collagen, type XX, alpha 1, isoform CRA_d [Homo sapiens] Length = 637 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 193 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 249 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 250 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 296 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 297 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 330 >gi|7959281|dbj|BAA96034.1| KIAA1510 protein [Homo sapiens] Length = 1140 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 46/161 (28%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L + + V +GL Y+ + + + + K + Sbjct: 43 SHFQQVKDFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRL 99 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 100 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 146 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 147 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 180 >gi|269926137|ref|YP_003322760.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] gi|269789797|gb|ACZ41938.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] Length = 490 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 37/374 (9%), Positives = 90/374 (24%), Gaps = 35/374 (9%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++FAL + + I I + + ++ M++A +AA LA +++ NL D Sbjct: 21 RSGQSLVLFALLSVVLIGFIAMGIDLGMAYSQRRFMQNAVDAAALAATNQLADNLQGTSD 80 Query: 78 ----RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + N S +N ++ + Sbjct: 81 GSWTFLVNDQNIRDTVRRYIDANSGITPPGSSYGAAGGCSSGPFCIEYLNVNKSLLARSP 140 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + + T +++ +Y F+ +L + + + + + P + Sbjct: 141 TSNGQVPSGTAIVRVNIKRTYS---TFLATVLGRNKMSVGATAAAQIFPVSKPKYPPTGV 197 Query: 194 ---------------ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + +D A ++ Sbjct: 198 WPMVRKYTGNALTEFPTNGACPPPVVFWSPNDSGSTVGDFKGLFYVGKYSAYVQELNYDP 257 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNI---EPSWGTEKV-RQYVTRDMDSLILKPTDS 294 + + + + + + V V + L +K T Sbjct: 258 SNKVAGTIPNHVQMITEFLSDNDDPYGDTEGPPANLNGSDVHDSLVQWFRNGLSIKLT-- 315 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A + K+ S + + G+ + I Sbjct: 316 ------AGSYIAPPSDADNEPTIPSYSGKLSSWQINRTSTYSNYGDKVEVITGNLGNNIS 369 Query: 355 DKAKENFIKIVTIS 368 D KE + + Sbjct: 370 DAMKEY-VHNHVVG 382 >gi|313215187|emb|CBY42862.1| unnamed protein product [Oikopleura dioica] Length = 289 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGEN---NNFKSNV 348 +S K A+ + + +G + F K T G N S Sbjct: 161 NSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPEFGKGRRIDTVGTVVLLTNAASTD 220 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 ++ D+ KE +++ + + GQ L SSP E Y + L + + I+ Sbjct: 221 EVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENLYIAEESSDLAGLVKTIA 279 Query: 407 QLMVHRKYS 415 + + S Sbjct: 280 DEICATELS 288 >gi|260832994|ref|XP_002611442.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] gi|229296813|gb|EEN67452.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] Length = 1121 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 51/186 (27%), Gaps = 30/186 (16%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY-TTRVEKNIEPSWGTEKVRQY 279 + ++N + ++ + +G+I + + + I + Sbjct: 949 SGSVSGQFDTVRNFIRGVVNCLT---IGGSHARVGVIKFAGSNANRQISLT-DYNNKNDL 1004 Query: 280 VTRDMDSLILKPTDST--PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + R + + + ++ +K I LT Sbjct: 1005 LVRIRNLDRSLGSAVGSVAGLSVMRNEFSTS----------------GRPTARKIGIVLT 1048 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNA 395 DG + + V + + I ++ + + + L+ S P E + Sbjct: 1049 DGRDTSSSDAVIPD--AETLRNEGTTIFSVGV--ANARRETLENMASRPVNENVFTA-TF 1103 Query: 396 DSLIHV 401 SL + Sbjct: 1104 ASLQTI 1109 >gi|83718955|ref|YP_442973.1| hypothetical protein BTH_I2452 [Burkholderia thailandensis E264] gi|167620114|ref|ZP_02388745.1| hypothetical protein BthaB_27657 [Burkholderia thailandensis Bt4] gi|257139197|ref|ZP_05587459.1| hypothetical protein BthaA_08316 [Burkholderia thailandensis E264] gi|83652780|gb|ABC36843.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 142 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 34/111 (30%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + E+ S+ F L+ +L++ ++ K + +A+ A AG V L Sbjct: 6 DLVRDERGVVSLEFVLAFPFMMLVLFGIVDTSLLLCDKAVITNASREAARAGVVVRVPQL 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + ++S + ++ +S + Sbjct: 66 AAADIENIALSYAQGSLVSGGTVGAPVVYVDQSAGTSPGSALKVTVSYTYQ 116 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 44/128 (34%), Gaps = 10/128 (7%) Query: 300 QAYQILTSD--KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + +L+ + N + F II LTDG + T + + Sbjct: 453 EGPDMLSDGRCAQAGSNVNEQIEEANSGGKKFPSMIIALTDG-TLMPEPYEETKIEAENS 511 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQ----LMVHR 412 ++ I +I + + L + SP++ + V N L + + ++ + + Sbjct: 512 RQLGATIYSIGVMD--YRRDQLLSIADSPDHVFGVDNGFKGLQDIVEPLTAKSCIEITNL 569 Query: 413 KYSVILKG 420 + S G Sbjct: 570 EPSTFCAG 577 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 50/184 (27%), Gaps = 9/184 (4%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D T + K+ + + K + + I Y+T ++ + K+R + Sbjct: 59 DTSETAKDSWKDIYTFVEKMVKKYPNPK--LRVSFITYSTEGHTLMKLTSDKNKIRDGLA 116 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + + T + + + F G + + Sbjct: 117 ELQNVVPTGATHLQEGFIKVNEQIEEANSGGELPGDFTVGRPK--YFQPDITVTTAPIPS 174 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + K + ++ + L+ V +P Y SL + Sbjct: 175 TSLPMINFEVNKAVRMKS---TVYFVA--YKDYKKEELRNIVQTPSQVYKAERFQSLNRM 229 Query: 402 FQNI 405 ++ Sbjct: 230 LNSL 233 >gi|75812639|ref|YP_320257.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] gi|75705395|gb|ABA25068.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] Length = 405 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 11/188 (5%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P I +++D SGSM+ + + + A + + LK+ + ++ Sbjct: 78 PPAWIIVLLDFSGSMNKLDSRGTKKIEGAIKAIRQLTS---VLKDRGENT--QVAIVPFG 132 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + + + K++ + T+ +K+A + L + Sbjct: 133 EAGANCPQGYPVNKDTLDKFFAANDFKLQNNLDYLASLTPCASTNLYEPLKKAVKFLANT 192 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVTI 367 F+ P P + II L+DG +N + + K N I + T+ Sbjct: 193 SDSRFY-----LPKDSPQTPPRLSIILLSDGYHNFANEAQDFQSLTTLLKRNTNIIVHTL 247 Query: 368 SINASPNG 375 +P Sbjct: 248 GYGLTPEQ 255 >gi|120554526|ref|YP_958877.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324375|gb|ABM18690.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 715 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 46/137 (33%), Gaps = 10/137 (7%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI--PSLPFQKFIIF 335 QYV + ++ + A++++ QI + + + +I Sbjct: 78 QYVNMLVPYGVVDQSWRDTAIERSEQINSVALHTNLGLAMEKAANDWLSGGTLENTHLIV 137 Query: 336 LTDGENNNFKSNVNTIKICDKA--------KENFIKIVTISINASPNGQRLLKTCVSSPE 387 L+DG+ + + + + K+ I T+ ++ + + L + Sbjct: 138 LSDGKVDVPGGDDASQAEEKRIVDSLLPALKDKGATIHTVGLSEKADIRFLRNLARETGG 197 Query: 388 YHYNVVNADSLIHVFQN 404 +A++L F N Sbjct: 198 SFQLAQSAEALNLAFAN 214 >gi|329954839|ref|ZP_08295856.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526943|gb|EGF53954.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 342 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 65/223 (29%), Gaps = 58/223 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD + +++ K + +D E+ +G+I + + + Sbjct: 97 DISNSMLAQDVQPSRLQKAKRLVAQLVDK-------MENDKVGMIVFAGDAFTQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + T A+ A + T + Sbjct: 150 YISAKMFLESIDPSLISKQGTAIGAAINLASRSFTP------------------QEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I +TDGEN+ + E I++ + + Sbjct: 192 AVIVITDGENHEGGAVEAAKAA----AEKGIQVNVLGVGMPEGAPIPAEGTNDYRRDRDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 N Q + + + V N + ISQ + Sbjct: 248 NVIVTRLNEQMCQEIAKAGNGIYVRVDNTNGAQKA---ISQEV 287 >gi|260837445|ref|XP_002613714.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] gi|229299103|gb|EEN69723.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] Length = 1875 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 32/78 (41%), Gaps = 6/78 (7%) Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +I +TDG++ + ++ + + + I + + + +LL+ + + Sbjct: 1 MIVVTDGKSGDSVASS-----ANDLASQGVDVYAIGVG-NYDATQLLEIAAGNQNNVIEL 54 Query: 393 VNADSLIHVFQNISQLMV 410 + ++L I+Q + Sbjct: 55 TDFNALSAEINQIAQTVC 72 >gi|146351309|ref|YP_001210536.1| hypothetical protein pAL1.089 [Arthrobacter nitroguajacolicus] gi|146218873|emb|CAL09944.1| putative membrane protein [Arthrobacter nitroguajacolicus] Length = 133 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 36/135 (26%), Gaps = 9/135 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E+ + + + + L+++G ++ K+ S +A A + Sbjct: 7 AQQERGSATPFILILAVGLLIMVGLVVDGGGQVNAKDQANSVAQSAARAAVVNISDATGG 66 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + A + I + N + N + ++ Sbjct: 67 TPVIDTASAQAAAQTYIA---------AAGKTGTVTINGNNVTVTVTSNYQTVFLSLAGI 117 Query: 135 NRLDSSNNTIFYNMD 149 N L S + Sbjct: 118 NTLTGSATSTAQLAQ 132 >gi|326911070|ref|XP_003201885.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Meleagris gallopavo] Length = 948 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 56/223 (25%), Gaps = 44/223 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-E 250 I V+D+SGSM KM A+ L + Sbjct: 312 NILFVIDVSGSMW--------------------GLKMKQTIEAMKAILSELRAADQFSLI 351 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D + + + +V T+ A+ +A IL + Sbjct: 352 DFNHNVRCWRDNLVSATP-----AQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAQN 406 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTIS 368 + + I+ ++DG+ K + K+ + + Sbjct: 407 -----------LGMLDPNSVSMIVLVSDGDPTVGELKLTTIQKNVKQSIKDE-FSLFCLG 454 Query: 369 INASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 I + L + + + + + + +S Sbjct: 455 IGFDVDYDFLQRIATDNRGMAQRIFGNQETSAQMKRFYNQVST 497 >gi|42521785|ref|NP_967165.1| hypothetical protein Bd0148 [Bdellovibrio bacteriovorus HD100] gi|39574315|emb|CAE77819.1| hypothetical protein predicted by Glimmer/Critica [Bdellovibrio bacteriovorus HD100] Length = 739 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 40/387 (10%), Positives = 100/387 (25%), Gaps = 42/387 (10%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R F + + +++ ++ AL L +I V + K +++++ + A GA K Sbjct: 5 RSTFIQNLNNKRGQIALFVALIFQILFLFFAMVINVGLLVHHKINLQNSVDLAAYYGAMK 64 Query: 68 MVSNLSRLGDR--------FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 ++ + + D + + + + Sbjct: 65 QAEGMNVIAHTNYQIRQSWKLLAWRYRMLGTAGDFEEHPFDKVGGGRIRTTDDDSINPGA 124 Query: 120 NIVNSSRIS----MTHMANNRLDSSNNTIFYNMDVMT-SYDYRLQFIEHLLNQRYNQKIV 174 S + +++ + + + L + + + + + Sbjct: 125 QNFYDSPSFCITYIPFKPMPDGENTCKALSSHTGINVFKIPPVLIDLPSVSGRTRSLALN 184 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + R GS + M V + I Q + ++ L A Sbjct: 185 LRASLIER--------------CKDFGSFNYLMLGG--FVVAFNIDQGNRALLISYLSKA 228 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + V +G+ + + S Sbjct: 229 MSPSSPDADFFDIDGKSVKLGMENTLKNNLTAANNTSDLK--MSIYNSLGHSACNGSQG- 285 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A + + L K F+ + P K + + N ++ Sbjct: 286 --AKGEPARWLKQIKTYPGFSYIDTICTETSITPKPKMLSNVESDLPNATVADGRFRTEV 343 Query: 355 DKAKE--------NFIKIVTISINASP 373 D+ K + I +I + +P Sbjct: 344 DELKNYIGIREPISDIYNYSIGVEKNP 370 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 43/145 (29%), Gaps = 20/145 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + + +GL Y++ + + + T + A+ Sbjct: 274 VAPFNVAMDKIRVGLTQYSSDPRTEWDLNTYATRDEVLEALRSLRYKGGNTFTGLALTH- 332 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +L + K K II LTDG++ + + K Sbjct: 333 --VLEHNLKADTGA----------RSEAPKLIILLTDGKSQDDANPP-----AQVLKNMG 375 Query: 362 IKIVTISINASPNGQRLLKTCVSSP 386 I+I + + + L+ S P Sbjct: 376 IQIFAVGV--KNADETELRQVASDP 398 >gi|309364927|emb|CAP23537.2| CBR-CLEC-143 protein [Caenorhabditis briggsae AF16] Length = 666 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 66/220 (30%), Gaps = 15/220 (6%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V N + V Q ++ ++N L L SI + VK Sbjct: 289 PSISDDVYSDPICPGNIKNLWLDVVVVVDKSQLMTNAQLWQVRNTLTQVLGSISKIGPVK 348 Query: 250 EDVY-----MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +G++ Y + Y + + T++ Sbjct: 349 YPADPRSTCVGIVTYDDNATTQSQLDASKSFSDLYNVIQSSLISVDNTNT--------SY 400 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + + + I F D + + + I ++ K+N + I Sbjct: 401 LSLALLAAEKALKDGRNRTYRFNYKKVIIAFAADYQGHGTALDAMP--IANRLKDNAVTI 458 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 +T++ ++ + Q ++ S + ++ L F Sbjct: 459 ITVACTSNSDKQTAIQGIASPGFDLVDEMDTPKLPTAFAQ 498 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 42/146 (28%), Gaps = 19/146 (13%) Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 +P G+ + + T A++ AY+ + Sbjct: 453 FTVNDPQPGSADLTAISAAADGLTLGSGTAIYSALEAAYRYVADSAAAPADGVAPLT--- 509 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENFIKIVTISINASPNGQRLL 379 I+ +TDGENN + + + D A+ ++ T+ + Sbjct: 510 --------SIVLMTDGENNQGTTADAFHSSYLALPDAARS--VRTFTVVFG-DARVDEMR 558 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNI 405 + ++ SL F+ I Sbjct: 559 TIADWTGGAMFDART-SSLSEAFREI 583 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/239 (12%), Positives = 65/239 (27%), Gaps = 31/239 (12%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V + DL H ++ +S+ + + + Sbjct: 819 LVIAGGPTQMTCQSDSTWSSQPPCCDLKCPPHALVDLMFLLDSSSSVGRSNWNLLINFTV 878 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 L F+ ++ V + E I ++RQ + R + T Sbjct: 879 ALLDKFV---ISPDDMRVGVARFNRHFDRDSEILIGNYSNISELRQKLRRMP--YRGRGT 933 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ + N I+ +TDG ++ ++ Sbjct: 934 LTGNAL--------------WHMNNHSLHAPGNRPGVPDVIVVITDG-----LASDEVLR 974 Query: 353 ICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN-----ADSLIHVFQN 404 + KE +K+ + + + L+ S EY + N AD L Sbjct: 975 AANALKEQDVKMYVVGLINRMNRMNLAQLQDISSGTEYLQIIDNGYERLADELSDTLTQ 1033 >gi|313233701|emb|CBY09871.1| unnamed protein product [Oikopleura dioica] Length = 663 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 17/182 (9%), Positives = 51/182 (28%), Gaps = 24/182 (13%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 ++ F ++ V V + + ++ ++ E +++D + + Sbjct: 481 QDTFDFFQALLNEFDTV-NQVQLSITSFSDDAVVDLPMGHYNEPDLFGAVKNVD-WVGRL 538 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD ++ A + + +IF++DG + Sbjct: 539 TDINEGLQTALSTMN------------------TTDDVPDIMIFVSDGF--DSFDPGAIG 578 Query: 352 KICDKAKENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + +V++ + L+ + + D L+ I + + Sbjct: 579 DNAADISNAGVDVVSVGFGLNGFVNFMALVTVADNEGANVFTASTGDELLSQTTAILEAL 638 Query: 410 VH 411 Sbjct: 639 CT 640 >gi|313200801|ref|YP_004039459.1| outer membrane adhesin-like protein [Methylovorus sp. MP688] gi|312440117|gb|ADQ84223.1| outer membrane adhesin like proteiin [Methylovorus sp. MP688] Length = 1543 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 36/288 (12%), Positives = 73/288 (25%), Gaps = 28/288 (9%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 S T + +++T D + + D T L Sbjct: 943 SGSGTGTLTGSAGGQPVLTITINNGGEYDVNLLRPLDHPDTKTEDVLTLNVGVSATANGV 1002 Query: 170 NQKIVSFIPALLRIEMGE-------RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + + + +D+SGSM Sbjct: 1003 TSVGNLTVNVEDDSPVANPITANLSTTNTNLLITLDISGSMRTHD-------------GV 1049 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 T++A+ ++ + LD D L + + L+ ++T + ++ + + + Sbjct: 1050 GDTTRLASAIQSIKVLLDKYDALGDTR----ISLVVFSTTAAQVGTDWMTIDQAKAQLDQ 1105 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + T+ A+ A K + N LT G N Sbjct: 1106 ILVKGPGGNTNYDSALANAMDAFDDPGKLTNAQNVAYFISDGEPNTGSGSNTSLTGGTNT 1165 Query: 343 NFKS----NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 N + + N IK I + + N L Sbjct: 1166 NSSDAGIQTQEELAWKTFLEANQIKSYAIGVGSDINSVNALNPVAYDG 1213 >gi|156382210|ref|XP_001632447.1| predicted protein [Nematostella vectensis] gi|156219503|gb|EDO40384.1| predicted protein [Nematostella vectensis] Length = 8745 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 +K+ + ++ + + +++ LI Y + R Sbjct: 1442 SATSAPQHFQVMKDTVKSIIEK-----YGIDRIHLSLIVYGDLPYTRVSFGQDIPDARTL 1496 Query: 280 VTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 S + +D A+ +A ++ + +K ++ +TD Sbjct: 1497 KNLVEALSRVSGGSDLGKALAEATKVFRN---------------DDVRPNAKKVLVVITD 1541 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNAD 396 +++ ++ +K + + I+++ ++I A + L + + + Sbjct: 1542 SKSSYGVAS--LVKAAEPLEHAGIRVLAVAIGADADASELRSIVQHQGDVIKASSSEDPS 1599 Query: 397 SLIH 400 L Sbjct: 1600 GLAD 1603 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/310 (10%), Positives = 82/310 (26%), Gaps = 24/310 (7%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + I + + +A + L A + V + + + + S Sbjct: 1488 QDIPDARTLKNLVEALSRVSG--GSDLGKALAEATKVFRNDDVRPNAKKVLVVITDSKSS 1545 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + ++ S + + L L Sbjct: 1546 YGVASLVKAAEPLEHAGIRVLAVAIGADADASELRSIVQHQGDVIKASSSEDPSGLADRL 1605 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +M P+ + + A + + SI + + E + GL+ Y Sbjct: 1606 VSAMLKETRKIPKVDLGFAVSAGSSNS--DATLRHMKDIIKSI-VTRYGAERMRYGLMSY 1662 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFF 318 V I + G + + A+ ++ Sbjct: 1663 GDDVTTTIFFTDGITNPNSLLPFIDILQKTPSGSSLEKALVGGKKLFEHGG--------- 1713 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 KF++ +TD + + D AK + ++ I++ + + + L Sbjct: 1714 ------TRPGVDKFLVVITD-RKSTGDAAQTENAKADLAKA-GVNVLVIAVGSEVDHKEL 1765 Query: 379 LKTCVSSPEY 388 +T ++P + Sbjct: 1766 NET-ATTPSH 1774 >gi|219847650|ref|YP_002462083.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541909|gb|ACL23647.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 419 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 75/242 (30%), Gaps = 61/242 (25%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 R + V+D SGSM K K+ L+ A++ ++ +D Sbjct: 37 MTQVRMPVNVCFVLDRSGSM--------------------KGEKIERLRQAVVKAIELLD 76 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + ++ + R E + P + V R D+ T PA+++ Sbjct: 77 QQDS------LAIVIFDHRTEVLVPAQPVRNRAMILDLVHRIRDA---GGTRIAPAVEKG 127 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Q L + +I LTDG+ + + D A Sbjct: 128 LQELQKMPPGV------------------RRLILLTDGQT---EHENECLLRADDAGRLG 166 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN---------ISQLMVHR 412 + I + I N L++ S + +++ FQ+ I ++ Sbjct: 167 VPITALGIGKDWNEDLLIEMANRSKGVADYIAQPGEIVNYFQHTVQRAQQTVIQNSVLTL 226 Query: 413 KY 414 ++ Sbjct: 227 RF 228 >gi|327266510|ref|XP_003218048.1| PREDICTED: complement C2-like [Anolis carolinensis] Length = 735 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 65/180 (36%), Gaps = 26/180 (14%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-------TRDMDSLILKPTDS 294 I+ ++ V ++ + ++ + ++ + + V D T++ Sbjct: 285 INRVASFDIAVKFSILSFASQPKTIVDIHEDIAEDPELVLEKMKSDMNYKDHGNATGTNT 344 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ Y+++ +D+ + II LTDG++N + + ++ Sbjct: 345 YAALDAIYRMMINDRANVLEKWDKVRHA----------IILLTDGKSNLGRPPKDAVRSI 394 Query: 355 DKA------KENFIKIVTISINA-SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNI 405 + +++++ I + + + + P + + + ++L F++I Sbjct: 395 EGLVNVGENRKDYLDIYVFGVGNLDVDWSAMNEIASKKPGEKHAFKLEKLETLKEAFEDI 454 >gi|251791983|ref|YP_003006703.1| TadF [Aggregatibacter aphrophilus NJ8700] gi|247533370|gb|ACS96616.1| TadF [Aggregatibacter aphrophilus NJ8700] Length = 200 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 9/56 (16%), Positives = 23/56 (41%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 M++ F + K +++ + +I F +M + FL + +++A Sbjct: 1 MNITDFFSSHSKNFRTNKRGSVTIEFLFMLMLLTFIFAFLADFAIIRITQGKLDNA 56 >gi|313764974|gb|EFS36338.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1] gi|314916261|gb|EFS80092.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4] gi|314917531|gb|EFS81362.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1] gi|314921865|gb|EFS85696.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3] gi|314955334|gb|EFS99739.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1] gi|315102367|gb|EFT74343.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1] gi|315109816|gb|EFT81792.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2] gi|327334607|gb|EGE76318.1| putative membrane protein [Propionibacterium acnes HL097PA1] gi|327454298|gb|EGF00953.1| hypothetical protein HMPREF9581_00471 [Propionibacterium acnes HL087PA3] gi|327456363|gb|EGF03018.1| hypothetical protein HMPREF9586_00740 [Propionibacterium acnes HL083PA2] gi|328756057|gb|EGF69673.1| hypothetical protein HMPREF9579_00544 [Propionibacterium acnes HL087PA1] gi|328758902|gb|EGF72518.1| hypothetical protein HMPREF9588_00650 [Propionibacterium acnes HL025PA2] Length = 169 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 11/117 (9%), Positives = 23/117 (19%), Gaps = 4/117 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVG----EGI 91 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 L+ T + + Sbjct: 92 AVGQRVGLAELAGTRCSNPAIAVDSSDLTLPVGSAGTTSARVSCTIKLSDLLVPGMP 148 >gi|296135127|ref|YP_003642369.1| hypothetical protein Tint_0639 [Thiomonas intermedia K12] gi|295795249|gb|ADG30039.1| hypothetical protein Tint_0639 [Thiomonas intermedia K12] Length = 1169 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 24/260 (9%), Positives = 57/260 (21%), Gaps = 18/260 (6%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 YN+ S G P + + + + + Sbjct: 284 GYNKSAPSANKVTGPTNAGYVPYTDQVVYAERGFGYYVQSLNATSGNQVVSMTNLGTNPS 343 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDS 286 +A+ AL F ++ ++ + + Y + + Sbjct: 344 TSAVNTALTPFTTALKPETNNSSSSEIKALAYQSPTAGLVQGAGNVLSNLIASCAGQYVI 403 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT------- 337 L+ T + + F+ + Sbjct: 404 LVTDGLPTMDLNGKNWPPLGSAAGQGYGVNAAFYGIAGNSNYGINDDSNNLPSGQTQGAL 463 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--------GQRLLKTCVSSPEYH 389 D N N ++ ++TI + IK + + A + Sbjct: 464 DAANTNDQALIDTITAIQALNKKGIKTYVVGLGAGVDANANPAAYAALNAMAIAGGTGQE 523 Query: 390 YNVVNADSLIHVFQNISQLM 409 Y N + +I+ + Sbjct: 524 YPANNVTAFNSALGSIAAQI 543 >gi|260785923|ref|XP_002588009.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] gi|229273165|gb|EEN44020.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] Length = 948 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/285 (11%), Positives = 79/285 (27%), Gaps = 32/285 (11%) Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 + D S + + + ++ + A+ + G+ Sbjct: 395 LAPADTPTDDPSGASPGVIPGAQGVVVLCVDVSGSMGVTTRVPQLQAEWKAMRDKKAGKE 454 Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + +E ++ ++ + D + ++ + + D + Sbjct: 455 YVTRLEC---MTAAIARHLERLELDTPDRQVAIVTFQSHVQLFGDGRYPPTDVTGDTLNS 511 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + GLI + + E + + + D T PA+ ++ Sbjct: 512 YD----GLISTGRHLAEKFELQPIKDSLATLKEKVCDLSAGGTTALGPALAVCAGVVADK 567 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT-----IKICDKAKENFIK 363 + II TDG N +V KI D A+ + I Sbjct: 568 PRSE--------------------IILCTDGAANVGVGDVKKDPGFYKKIGDFARSHKIV 607 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 I I I + ++ N+++ L+ + I+Q Sbjct: 608 ISIIGIEGENVALEQVSAAAATSGGTVNILHPLELVRQIRQIAQN 652 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 54/170 (31%), Gaps = 33/170 (19%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY--- 279 +T++ LK+A+ +D+I V D +I + TR T V+QY Sbjct: 6 GDKTRIQLLKSAVDNMIDNIAEKEDV--DAKWEVIDFATRAAVRGGGWLNTSNVKQYVTT 63 Query: 280 -VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + D + I + T+ M A + + S +K ++FLTD Sbjct: 64 AINEDNNVDIGRGTNYQAGMDLAQKEFEKKQPES------------DRPNAKKIVLFLTD 111 Query: 339 GENNN------------FKSNVNTIKICDKAKENFIK---IVTISINASP 373 G+ + I+ +K I I Sbjct: 112 GQPTYYGSGVGNDLHGNGSYFTSQIENASYTAAENLKCDYFYAIGIGLES 161 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 77/252 (30%), Gaps = 47/252 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++V + P ++D+SGSM T+ Sbjct: 189 NSQNRLVKIGLKARSMAGENLPASNFVFLIDVSGSM-------------------YGATR 229 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L +++ + ++ Y + + + G K ++ + Sbjct: 230 LDLVKSSLKLLTNNLREKD------RVAIVVYAGSAGEVLPSTSGANK-QKIKEALDNLN 282 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AY+I + + II TDG+ N S Sbjct: 283 AGGSTTGGAGIQLAYKIAKQNFIKGGNN----------------RIILCTDGDFNVGVSS 326 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQNI 405 N + + ++ +++ + + + ++ + H + N L + + Sbjct: 327 NDGLLALIEQERKSGVFLSILGYGMGNYKDSKMQTLAQAGNGNHAYIDN---LQEANKVL 383 Query: 406 SQLMVHRKYSVI 417 Y+V Sbjct: 384 VNEFGATMYTVA 395 >gi|332520546|ref|ZP_08397008.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332043899|gb|EGI80094.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 698 Score = 46.8 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 59/190 (31%), Gaps = 33/190 (17%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 LK+A L +D + + ++ Y +EP+ G +K ++ + Sbjct: 361 LLKSAFKLLVDQLRDKD------KVSIVVYAGAAGVVLEPTSGKDK-KKILKALNKLQSG 413 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNV 348 T + AY++ + ++ +I TDG+ N SN Sbjct: 414 GSTAGGAGINLAYKLAEENFNKNGNN----------------RVILATDGDFNVGASSNQ 457 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQN--- 404 + + +++ + + + L+T H + +F Sbjct: 458 AMEDLIIEKRKSGVFLSVLGFGYGNYKDDKLETLADKGNGNHAYIDTMQEAKLIFGKEFG 517 Query: 405 -----ISQLM 409 I++ + Sbjct: 518 GTLFTIAKDV 527 >gi|116618630|ref|YP_819001.1| von Willebrand factor domain-containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] gi|116097477|gb|ABJ62628.1| von Willebrand factor (vWF) domain containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] Length = 920 Score = 46.8 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 19/355 (5%), Positives = 76/355 (21%), Gaps = 29/355 (8%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 D + + S S + + + + + Sbjct: 77 ASNGWDKNSSWNGDSSDTSKSYLKFGTDTSNPDYQIRKYAKETSTPGLYDVYLNAKGNEV 136 Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + + ++ + I + ++ + Sbjct: 137 KNIKPIDIVLVVDMSGSMEPANNPSGSNRAQATRDGVKQFLQAIKDAGISQYVNVGLVGF 196 Query: 207 MNSDPEDV--NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 + N + + + + + + + G Sbjct: 197 SSPGNYVTGSNGYLEVGMQSLSTTGQTEQINKTLSPTFSGGTFTQLGIRRGQKMLDDDKN 256 Query: 265 KNIE----PSWGTEKVR-QYVTRDMDSLILKPTD------STPAMKQAYQILTSDKKRSF 313 + + + G + S T P + ++ + Sbjct: 257 DHKKMMILLTDGVPTFSYKVTQATTISGTTYGTAFSSSDQDQPGSTSQFVSGRNNWESRK 316 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINAS 372 + Q + ++ G N + T+ ++++ +I + I Sbjct: 317 NRYISGTDPNGATTSAQNYGLYYNTGNNVKIWNTWAATLGEAKISRDSGTEIHALGIQLG 376 Query: 373 PNGQRLLKTC---------------VSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + ++S + + +A + +N ++ ++ + Sbjct: 377 DDNNNNYWDSPLYLSSNNVRSRASLIASSGLYQDANSASDVETYLKNQAKDVLSQ 431 >gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016] gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016] Length = 688 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/251 (15%), Positives = 75/251 (29%), Gaps = 53/251 (21%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++F P + + V+D SGSM Sbjct: 287 SERGTIKLTFTPGDDLSAIQQ--GRDWVFVLDKSGSM---------------------SG 323 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDM 284 K A L + L + + ++ + RV++ + V Q + Sbjct: 324 KHATLTEGVKRGLGKLPSGDRFR------ILMFDNRVQEITNGFIAVNQNNVTQAIETIN 377 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ A+++A L SD+ II +TDG N Sbjct: 378 QIATGGGTNLYDALERAVSGLDSDRTTG--------------------IILVTDGVANVG 417 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + + +++ T + S N L S + ++ N+D ++ N Sbjct: 418 V--TEKKQFLKLMQRYDVRLYTFIMGNSANTPLLEPMTQVSNGFATSISNSDDILGHIMN 475 Query: 405 ISQLMVHRKYS 415 ++ + H+ Y Sbjct: 476 VTSKLTHQAYR 486 >gi|39936736|ref|NP_949012.1| hypothetical protein RPA3674 [Rhodopseudomonas palustris CGA009] gi|192292562|ref|YP_001993167.1| TadE family protein [Rhodopseudomonas palustris TIE-1] gi|39650592|emb|CAE29115.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286311|gb|ACF02692.1| TadE family protein [Rhodopseudomonas palustris TIE-1] Length = 177 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 39/128 (30%), Gaps = 1/128 (0%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAILAGAS 66 R +K + K + ++ FA+ F L+ +I V + +E+A +++ L Sbjct: 9 RAMARKFRRNRKGSAAVEFAIVAPIFFALLFAIIEVAMIFFASQVLETAVQDSSRLIFTR 68 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + R + S + V++ + Sbjct: 69 QAQDASMTQDQFKTEVCKRLISLFDCSIVRVDVQNYGSDFGTVSITTPIDSNKKFVDNMQ 128 Query: 127 ISMTHMAN 134 ++ + Sbjct: 129 YNIGKAGD 136 >gi|309791117|ref|ZP_07685650.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226815|gb|EFO80510.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 454 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/262 (9%), Positives = 67/262 (25%), Gaps = 43/262 (16%) Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + T + ++ + +++ + E + + + Sbjct: 27 ITLATDAGGVGAPVNWALVADASRSMRIPIVSEEQFRELVRAGG--AQEVLVDGVPV--- 81 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + + + + AL ++ +D L+ Sbjct: 82 -----------WQIAGPVPDAIRAQSPSALDYTVRALHSVVERLDRQD------RFSLVA 124 Query: 259 YTTRVEKNIEPSWG--TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + + G ++ +TR + + TD + M+ L Sbjct: 125 CAEQALTLVPSMSGERRAELVAGITRLHSLRLGEATDLSSGMRLGLAELAHAAA------ 178 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + +I LTDG N + + + +A + + I T+ + Sbjct: 179 ----------PGTVRRLILLTDGFTENADA---CMALARQAAQAGVSISTLGLGGEFQDD 225 Query: 377 RLLKTCVSSPEYHYNVVNADSL 398 L S + AD + Sbjct: 226 LLTGLADVSGGRASFMRRADQI 247 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/197 (10%), Positives = 60/197 (30%), Gaps = 35/197 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K + +A L + + + + +I ++ + + +++ + + Sbjct: 99 SNKRDRVNSAALAMVKASNPED------EVFVISFSEEAFITQDFTSDVKQLESSLRKL- 151 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T A+ L + ++ +K ++ +TDGE+N Sbjct: 152 --GSKGETAMRDALSLGLDHLRAPARKD-----------------KKVLVVITDGEDN-- 190 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-------SPEYHYNVVNADS 397 S + A + + I I + A+ + S + + N Sbjct: 191 SSIQKQENLIRAAHLSNVIIYGIGLLAAEAPASAQRAKASLDVLTLATGGRSWYPENVAD 250 Query: 398 LIHVFQNISQLMVHRKY 414 + + I+ + ++ Sbjct: 251 IEKITPEIAHEIRNQYV 267 >gi|21234171|ref|NP_639753.1| hypothetical protein SCP1.175 [Streptomyces coelicolor A3(2)] gi|13620657|emb|CAC36696.1| putative membrane protein [Streptomyces coelicolor A3(2)] Length = 147 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 13/129 (10%), Positives = 40/129 (31%), Gaps = 4/129 (3%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ + S+ A+S ++ L+++G L+ N A + A A + Sbjct: 15 DRGSMSLFMAMSALAILMVLGLLVDGGGALNASNR---ATSLAQEAARTAGQQLDPAQAV 71 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 +I+ + A +++ + T + + + + + Sbjct: 72 EGTAITIDPEAARAAALDYLAAANVQGDVQITDGGQSLTVTVRD-SYTTHFASLLGVGTI 130 Query: 138 DSSNNTIFY 146 + + Sbjct: 131 NVDGTATAH 139 >gi|89100236|ref|ZP_01173103.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085086|gb|EAR64220.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 476 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 58/198 (29%), Gaps = 20/198 (10%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +S+ + + KM K+A+ F +I + V VY + ++ Sbjct: 178 DASSSMLLDVDGKQKMEIAKSAVRSFAKTIGEENDVSLYVYGHAGTQEDKDKQIS----- 232 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T + L+ + + +A + + + Sbjct: 233 -------CTTIDEVYPLQSYN-EESFFKAVEGVEAKGWTPLAGAIKAAREASMDYEGDIT 284 Query: 333 IIFLTDGENN-NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK-TCVSSPEYHY 390 + ++DG + + + + I I NA + LK + + Sbjct: 285 LYIVSDGAETCDGNPVEEARLFAETNESRMVNI--IGFNADAKAEDQLKKVAEAGKGEYI 342 Query: 391 NVVNADSLIHVFQNISQL 408 NAD L +IS Sbjct: 343 GADNADQLN---SSISNE 357 >gi|222082656|ref|YP_002542021.1| hypothetical protein Arad_9366 [Agrobacterium radiobacter K84] gi|221727335|gb|ACM30424.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 176 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 59/155 (38%), Gaps = 18/155 (11%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + +R F++ +A K +I FAL + +LI +I + + + A +A++ Sbjct: 2 MRARPFASFRRLLADRKGVAAIEFALLALPLFMLIFAIIEISVMFFVNS----AMDASVQ 57 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + + + + D K I + + + S S + +I + + Sbjct: 58 KISRMIRTG-----------EVASSKISQADFKAKICDDMLLAFSCSSNLLVKVDILSDL 106 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 +S+ + A+ L + YN+ + + Sbjct: 107 SSATSANPINASGNLAVT---ETYNIGKGSDFVLV 138 >gi|313827551|gb|EFS65265.1| conserved hypothetical protein [Propionibacterium acnes HL063PA2] gi|314918056|gb|EFS81887.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1] gi|314967895|gb|EFT11994.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1] gi|315080579|gb|EFT52555.1| conserved hypothetical protein [Propionibacterium acnes HL078PA1] Length = 169 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 13/142 (9%), Positives = 39/142 (27%), Gaps = 10/142 (7%) Query: 4 LSRFRFYFK-KGIASEKA-NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA- 60 LSR+RF + ++ + S++ + + L+ G + +++ A A Sbjct: 27 LSRWRFPVSPQRRRDDRGVSMSVLVTVLLPILLISAGVAVDGA----ERSR---AVRVAH 79 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +A + + + + +A S + Sbjct: 80 TVAAEAARAGCEEGSAAQLVGQDGSSAARMAAEASARRAKADGLSQLVIDVNGDAVSVAT 139 Query: 121 IVNSSRISMTHMANNRLDSSNN 142 + + + + + Sbjct: 140 EITRPTRLLALIGLTEVHGRAS 161 >gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] Length = 865 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 52/175 (29%), Gaps = 31/175 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDS 286 + K AL L L + G+I +++ + T+ + +D Sbjct: 308 LDQAKRALRLCLRHLAEGD------RFGVIAFSSDFRALEPSLAPFTQATLKAADAFVDG 361 Query: 287 L-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L T+ + A +L ++ + ++ LTDG+ Sbjct: 362 LRADGGTEMLNPLLAAVGMLGDAERD-------------------RVVVLLTDGQ--VGN 400 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + K ++I T I + + + + + + + Sbjct: 401 EAQIVDRVVQRGK--GVRIYTFGIGTNVSDVLVNDLARRTEGAAEFIHPGERIDE 453 >gi|296284150|ref|ZP_06862148.1| hypothetical protein CbatJ_11026 [Citromicrobium bathyomarinum JL354] Length = 198 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 43/152 (28%), Gaps = 3/152 (1%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + + FA+ SF++L+ + + Y + M A A A S Sbjct: 2 LARALNRLRRDTRGSTIVEFAIIAPSFMILLMGVFDLGQAVYLRAVMNGAMQEA--ARDS 59 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFI-KNHIKESLSGYSAVFYNTEIQNIVNSS 125 + S + + + L F K++ + + + ++ Sbjct: 60 TLESGPTAEAAIDGMVETRVQHVLRSAELSFDRKSYYDFTDIERAEAINDDNANGECDAG 119 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 N DS + + + Sbjct: 120 ETFEDENGNGSWDSDVGSGGFGGARDITMYTV 151 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/329 (10%), Positives = 72/329 (21%), Gaps = 71/329 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM----- 203 + L + + AL+ L + +D+ M Sbjct: 377 ISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLDIRIGMVNFYH 436 Query: 204 -----------HCAMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLDSIDL 244 + D+N ++ + LKN Sbjct: 437 NSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGG 496 Query: 245 LSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + E + + G + + T T D + Sbjct: 497 ENRNPEKILIVVGDGTPTF-SYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDVLFRNFEE 555 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFK 345 T + + N + K ++ + N N Sbjct: 556 FSGNTS-NAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGS 614 Query: 346 SNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLKTCVS 384 +K +EN I +I + N G+ +LK Sbjct: 615 PTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIAD 674 Query: 385 SPE-----YHYNVVNADSLIHVFQNISQL 408 E +YN N + ++ ++IS Sbjct: 675 LEEDGVTPRYYNANNKNDIVTALEDISST 703 Score = 44.1 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 74/300 (24%), Gaps = 22/300 (7%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + D + + + E V + + + Sbjct: 262 SNERYQGDKGSPYDGENLIDDSDYTNGGANWRNYDYATNNGSTAGEEDRNVPAQTVQLWG 321 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N +S + + L+ + + ++ E E Sbjct: 322 NERNFENSYLDYNGAYIKKWVEPVLPSS-TASDLHPEDATTLYNVYLDVIGGEKKEISPI 380 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM +TK AAL A+ + + Sbjct: 381 DIVFVLDKSASMSELTAGTN------------SQTKNAALIEAVNEMSKDLLSDPSLDIR 428 Query: 252 VYMGLIGYTTRV--------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + M + + + ++ ++ + T T +K Y+ Sbjct: 429 IGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYE 488 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFI 362 L D V P + + N + T+ A++N + Sbjct: 489 TLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDV 548 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/329 (10%), Positives = 72/329 (21%), Gaps = 71/329 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM----- 203 + L + + AL+ L + +D+ M Sbjct: 219 ISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLDIRIGMVNFYH 278 Query: 204 -----------HCAMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLDSIDL 244 + D+N ++ + LKN Sbjct: 279 NSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGG 338 Query: 245 LSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + E + + G + + T T D + Sbjct: 339 ENRNPEKILIVVGDGTPTF-SYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDVLFRNFEE 397 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFK 345 T + + N + K ++ + N N Sbjct: 398 FSGNTS-NAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGS 456 Query: 346 SNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLKTCVS 384 +K +EN I +I + N G+ +LK Sbjct: 457 PTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIAD 516 Query: 385 SPE-----YHYNVVNADSLIHVFQNISQL 408 E +YN N + ++ ++IS Sbjct: 517 LEEDGVTPRYYNANNKNDIVTALEDISST 545 Score = 44.1 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 74/300 (24%), Gaps = 22/300 (7%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + D + + + E V + + + Sbjct: 104 SNERYQGDKGSPYDGENLIDDSDYTNGGANWRNYDYATNNGSTAGEEDRNVPAQTVQLWG 163 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N +S + + L+ + + ++ E E Sbjct: 164 NERNFENSYLDYNGAYIKKWVEPVLPSS-TASDLHPEDATTLYNVYLDVIGGEKKEISPI 222 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM +TK AAL A+ + + Sbjct: 223 DIVFVLDKSASMSELTAGTN------------SQTKNAALIEAVNEMSKDLLSDPSLDIR 270 Query: 252 VYMGLIGYTTRV--------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + M + + + ++ ++ + T T +K Y+ Sbjct: 271 IGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYE 330 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFI 362 L D V P + + N + T+ A++N + Sbjct: 331 TLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDV 390 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/329 (10%), Positives = 72/329 (21%), Gaps = 71/329 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM----- 203 + L + + AL+ L + +D+ M Sbjct: 350 ISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLDIRIGMVNFYH 409 Query: 204 -----------HCAMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLDSIDL 244 + D+N ++ + LKN Sbjct: 410 NSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGG 469 Query: 245 LSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + E + + G + + T T D + Sbjct: 470 ENRNPEKILIVVGDGTPTF-SYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNGVLFRNFEE 528 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFK 345 T + + N + K ++ + N N Sbjct: 529 FSGNTS-NAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGS 587 Query: 346 SNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLKTCVS 384 +K +EN I +I + N G+ +LK Sbjct: 588 PTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIAD 647 Query: 385 SPE-----YHYNVVNADSLIHVFQNISQL 408 E +YN N + ++ ++IS Sbjct: 648 LEEDGVTPRYYNANNKNDIVTALEDISST 676 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 74/300 (24%), Gaps = 22/300 (7%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + D + + + E V + + + Sbjct: 235 SNERYQGDKGSPYDGENLIDDSDYTNGGANWRNYDYATNNGSTAGEEDRNVPAQTVQLWG 294 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N +S + + L+ + + ++ E E Sbjct: 295 NERNFENSYLDYNGAYIKKWVEPVLPSS-TASDLHPEDATTLYNVYLDVIGGEKKEISPI 353 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM +TK AAL A+ + + Sbjct: 354 DIVFVLDKSASMSELTAGTN------------SQTKNAALIEAVNEMSKDLLSDPSLDIR 401 Query: 252 VYMGLIGYTTRV--------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + M + + + ++ ++ + T T +K Y+ Sbjct: 402 IGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYE 461 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFI 362 L D V P + + N + T+ A++N + Sbjct: 462 TLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNGV 521 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/329 (10%), Positives = 72/329 (21%), Gaps = 71/329 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM----- 203 + L + + AL+ L + +D+ M Sbjct: 133 ISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLDIRIGMVNFYH 192 Query: 204 -----------HCAMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLDSIDL 244 + D+N ++ + LKN Sbjct: 193 NSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGG 252 Query: 245 LSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + E + + G + + T T D + Sbjct: 253 ENRNPEKILIVVGDGTPTF-SYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDVLFRNFEE 311 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFK 345 T + + N + K ++ + N N Sbjct: 312 FSGNTS-NAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGS 370 Query: 346 SNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLKTCVS 384 +K +EN I +I + N G+ +LK Sbjct: 371 PTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIAD 430 Query: 385 SPE-----YHYNVVNADSLIHVFQNISQL 408 E +YN N + ++ ++IS Sbjct: 431 LEEDGVTPRYYNANNKNDIVTALEDISST 459 Score = 44.1 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 74/300 (24%), Gaps = 22/300 (7%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + D + + + E V + + + Sbjct: 18 SNERYQGDKGSPYDGENLIDDSDYTNGGANWRNYDYATNNGSTAGEEDRNVPAQTVQLWG 77 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N +S + + L+ + + ++ E E Sbjct: 78 NERNFENSYLDYNGAYIKKWVEPVLPSS-TASDLHPEDATTLYNVYLDVIGGEKKEISPI 136 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM +TK AAL A+ + + Sbjct: 137 DIVFVLDKSASMSELTAGTN------------SQTKNAALIEAVNEMSKDLLSDPSLDIR 184 Query: 252 VYMGLIGYTTRV--------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + M + + + ++ ++ + T T +K Y+ Sbjct: 185 IGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYE 244 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFI 362 L D V P + + N + T+ A++N + Sbjct: 245 TLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDV 304 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/329 (10%), Positives = 72/329 (21%), Gaps = 71/329 (21%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM----- 203 + L + + AL+ L + +D+ M Sbjct: 220 ISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLDIRIGMVNFYH 279 Query: 204 -----------HCAMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLDSIDL 244 + D+N ++ + LKN Sbjct: 280 NSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYKDNGG 339 Query: 245 LSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI------------ 288 + E + + G + + T T D + Sbjct: 340 ENRNPEKILIVVGDGTPTF-SYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDVLFRNFEE 398 Query: 289 -LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFK 345 T + + N + K ++ + N N Sbjct: 399 FSGNTS-NAGFTHPVTYASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGS 457 Query: 346 SNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLKTCVS 384 +K +EN I +I + N G+ +LK Sbjct: 458 PTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIAD 517 Query: 385 SPE-----YHYNVVNADSLIHVFQNISQL 408 E +YN N + ++ ++IS Sbjct: 518 LEEDGVTPRYYNANNKNDIVTALEDISST 546 Score = 44.1 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/300 (11%), Positives = 74/300 (24%), Gaps = 22/300 (7%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + D + + + E V + + + Sbjct: 105 SNERYQGDKGSPYDGENLIDDSDYTNGGANWRNYDYATNNGSTAGEEDRNVPAQTVQLWG 164 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 N +S + + L+ + + ++ E E Sbjct: 165 NERNFENSYLDYNGAYIKKWVEPVLPSS-TASDLHPEDATTLYNVYLDVIGGEKKEISPI 223 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D S SM +TK AAL A+ + + Sbjct: 224 DIVFVLDKSASMSELTAGTN------------SQTKNAALIEAVNEMSKDLLSDPSLDIR 271 Query: 252 VYMGLIGYTTRV--------EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + M + + + ++ ++ + T T +K Y+ Sbjct: 272 IGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYE 331 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFI 362 L D V P + + N + T+ A++N + Sbjct: 332 TLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQSSYRTSTNGAWSNWTVMEDKIAEDNDV 391 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 36/113 (31%), Gaps = 16/113 (14%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + L P + F+TDG N+ ++ Sbjct: 259 NWGTATGAALNYIRKYL-------------LVPSAGNRDPADTIVYFITDG--NSQEALS 303 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 D ++V I I + + Q L+ SSP+ V + L V Sbjct: 304 FVQDAADNIHATGARVVAIGITDAID-QSQLEIIASSPDDVIIVEDFADLDEV 355 >gi|325116955|emb|CBZ52508.1| unnamed protein product [Neospora caninum Liverpool] Length = 765 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 56/158 (35%), Gaps = 21/158 (13%) Query: 235 LLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILKP 291 + FL + L ++V L+ ++T V + ++ Sbjct: 96 VKQFLHAFLSKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGT 155 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ ++ Q+L + T K +I +TDGE++ S+ +T+ Sbjct: 156 TNTAAGLQTCNQMLFDYPREERQT-------------VPKLVIAMTDGESD---SDFHTV 199 Query: 352 KICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPE 387 +E I ++++ + + N R + C + Sbjct: 200 NEAKVIRERGGIITVLSVGMYVNHNECRSMCGCRNDSS 237 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 66/186 (35%), Gaps = 45/186 (24%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 ++ + + +VD+S SM+ QD +++ KN ++ + + Sbjct: 86 QQKMNNVMFLVDISNSMNA---------------QDVAPDRLSLAKNIVISSMQKMT--- 127 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQIL 305 + +GL + + + +V+ S++ + TD AM+ A Sbjct: 128 ----NDRVGLAVFAGEAFSVMPLTTDYLAAESFVSGLETSVVSTQGTDFYKAMQVAVSKF 183 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K S I+ ++DGE+N +A+ N I+++ Sbjct: 184 KAVSKGSG------------------RIVLISDGEDNEGNE----AAAIKEAQSNGIQVI 221 Query: 366 TISINA 371 T+ + Sbjct: 222 TVGVGT 227 >gi|6017001|gb|AAF01565.1|AF061273_1 thrombospondin-related adhesive protein homolog [Neospora caninum] Length = 765 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 56/158 (35%), Gaps = 21/158 (13%) Query: 235 LLLFL-DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV--TRDMDSLILKP 291 + FL + L ++V L+ ++T V + ++ Sbjct: 96 VKQFLHAFLSKLPIGNDEVNTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGT 155 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ ++ Q+L + T K +I +TDGE++ S+ +T+ Sbjct: 156 TNTAAGLQTCNQMLFDYPREERQT-------------VPKLVIAMTDGESD---SDFHTV 199 Query: 352 KICDKAKENF--IKIVTISINASPNGQRLLKTCVSSPE 387 +E I ++++ + + N R + C + Sbjct: 200 NEAKVIRERGGIITVLSVGMYVNHNECRSMCGCRNDSS 237 >gi|56675030|gb|AAW19657.1| matrilin-3 [Cervus elaphus] Length = 146 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 50/149 (33%), Gaps = 19/149 (12%) Query: 254 MGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + ++ Y + V+ + +++ V R T S A++ A Sbjct: 1 VAVVNYASTVKIEFHLQTHSDKQSLKRAVARITPLST--GTMSGLAIQTAMD-------- 50 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 G + PS K I +TDG + + ++ +A+ + I++ + ++ Sbjct: 51 --EAFTVEAGARGPSSNIPKVAIIVTDGRPQDQVN-----EVAARARASGIELYAVGVDR 103 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + E+ + V + Sbjct: 104 ADMESLKMMASEPLDEHVFYVETYGVIEK 132 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/358 (10%), Positives = 92/358 (25%), Gaps = 60/358 (16%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + N A F + S S + ++ Sbjct: 109 RNFSLAPTTNDKFESVVQNGNMVAGETPVSTFSIDVDTGSYSTTRRLINQGQLPTKNTVR 168 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + + + N+ V T + +++ + Sbjct: 169 VEELVNYFSYDYPVPTNSE-QPFSVNTELAPSP--------YNADTQLLRIGLKGFDVAP 219 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + L++D+SGSM K+ LK A+L+ + Sbjct: 220 DKLSASNLVLLLDVSGSM-------------------SSADKLPLLKQAMLMLSQQLSAQ 260 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V VY G + V + ++ +++ T+ + ++ AYQ+ Sbjct: 261 DKVSIVVYAGA----SGVVLDGVAGNDFTAIKTALSQLN---AQGGTNGSQGIQLAYQLA 313 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKI 364 + +I TDG+ N ++ + + I + Sbjct: 314 QKHFIENG----------------SNRVILATDGDFNLGMTDHQQLVDFVASRSKKGIGL 357 Query: 365 VTISINASP-----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 T+ N L + + + + L + + + ++ Sbjct: 358 STLGFGLGSGSASYNDHLLEQLSNKANGQYAFIDT---LNEARKVLVDQLSATLLTIA 412 >gi|24375029|ref|NP_719072.1| type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] gi|24349772|gb|AAN56516.1|AE015789_3 type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] Length = 1168 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/283 (10%), Positives = 69/283 (24%), Gaps = 66/283 (23%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV--KEDV 252 V + + ++ K ++ K A+ + S + + Sbjct: 215 FGVGQPVTFYTDNYLRWYWLSKEGRLPTIKVPRIDIAKKAISNIIRSTPTVDFGLAVFNY 274 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQIL------ 305 G + T+ R + +D L K T M +AY+ Sbjct: 275 NYPNEGNRDGGRIVSGITQMTDTSRASLLSTIDGLPAKTNTPLCETMYEAYRYFAGKGVV 334 Query: 306 -----TSDKKRSFFTNFFRQGVKIPSLPFQK--------FIIFLTDGENN-NFKSNVNTI 351 T + V+ ++I++TDG + +N N Sbjct: 335 FGHADTDYGSYVGNRPPYDNSVETNGTYVSPFKVCTDIAYVIYVTDGSPTVDGSANNNVK 394 Query: 352 KICDKAKENF-----------------------------------------IKIVTISI- 369 + A ++ ++ TI Sbjct: 395 TLTAAASKSGNYSSFSQGLSTPSYLPALASYMFNNDLINKPDSSNTEQMQNVRTYTIGFS 454 Query: 370 -NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 A L +T ++ N+ L + + +++ Sbjct: 455 KGADDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|326673940|ref|XP_001921532.3| PREDICTED: complement factor B [Danio rerio] Length = 862 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 67/203 (33%), Gaps = 30/203 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDM 284 + A ++ + L + ++ ++ Y + + + + + V + ++ Sbjct: 386 DTFQAAKKAIIELVRKLDSYEVNMKFDIVSYASEPREIVSITSFNSHDVDFVLRKLSEFS 445 Query: 285 D--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D + TD + A+++ Y L R+ K Q +I TDG +N Sbjct: 446 DEVHENRRGTDLSKALERVYGQL----------ALLRENKKSHFNETQNILIIATDGHSN 495 Query: 343 NFKSNVNTIK-----------ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YH 389 + + D +E + + ++ N + L S + Sbjct: 496 MGPNPQIMLNKIRSLLGYKPSSVDHTQEELLDVYVFAVGKDVNRKDLTSFASSKKGEKHV 555 Query: 390 YNVVNADSLIHVFQN-ISQLMVH 411 + + + L +VF IS V Sbjct: 556 FVLQDYQQLGYVFNQMISDSAVT 578 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + VR Sbjct: 259 SSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLVSVTP-----DNVR 313 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L + I IIFLT Sbjct: 314 DGKVYIHHMSPTGGTDINGALQRAIKLLND----------YVAHNDIEDRSVS-LIIFLT 362 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPN----GQRLLKTCVSSPEYHY 390 DG+ + +T+KI + KE I I TI I + + L+ C + Sbjct: 363 DGKPTVGE--THTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTRRVLE 420 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 421 EDDAGAQLIGFYDEI 435 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + + VR Sbjct: 257 SSASMVGTKLRQTKDALFTILHDLRPQDHFSIIGFSNRIKVWKDHLVSVTP-----DNVR 311 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++A ++L + I IIFLT Sbjct: 312 DGKVYIHHMSPTGGTDINGALQRAIKLLND----------YVAHNDIEDRSVS-LIIFLT 360 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPN----GQRLLKTCVSSPEYHY 390 DG+ + +T+KI + KE I I TI I + + L+ C + Sbjct: 361 DGKPTVGE--THTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTRRVLE 418 Query: 391 NVVNADSLIHVFQNI 405 LI + I Sbjct: 419 EDDAGAQLIGFYDEI 433 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 62/166 (37%), Gaps = 25/166 (15%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 ++ + V + LI Y+ + + + + V +++ T + A+ Sbjct: 221 NINVHPDAVRLALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIK--GTTSTHIALH 278 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 QAY++LT + +K II TDG + + + K+ Sbjct: 279 QAYKLLTDTDNENGV-----------REGVKKMIIIFTDGHSQRS-----PQDMALRLKD 322 Query: 360 NFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++I I++ + + LL ++ ++ + VN LI ++ Sbjct: 323 KGVEIFAITLTPAPYADEGELLSITQNT-DHIFTPVNLKVLITTYK 367 >gi|141795362|gb|AAI39710.1| LOC563828 protein [Danio rerio] Length = 835 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 67/203 (33%), Gaps = 30/203 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDM 284 + A ++ + L + ++ ++ Y + + + + + V + ++ Sbjct: 359 DTFQAAKKAIIELVRKLDSYEVNMKFDIVSYASEPREIVSITSFNSHDVDFVLRKLSEFS 418 Query: 285 D--SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 D + TD + A+++ Y L R+ K Q +I TDG +N Sbjct: 419 DEVHENRRGTDLSKALERVYGQL----------ALLRENKKSHFNETQNILIIATDGHSN 468 Query: 343 NFKSNVNTIK-----------ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YH 389 + + D +E + + ++ N + L S + Sbjct: 469 MGPNPQIMLNKIRSLLGYKPSSVDHTQEELLDVYVFAVGKDVNRKDLTSFASSKKGEKHV 528 Query: 390 YNVVNADSLIHVFQN-ISQLMVH 411 + + + L +VF IS V Sbjct: 529 FVLQDYQQLGYVFNQMISDSAVT 551 >gi|21616104|emb|CAD21938.1| putative complement factor Bf/C2 [Tetraodon nigroviridis] Length = 760 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 78/294 (26%), Gaps = 47/294 (15%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPALLRIEMGERP 189 + + D + + ++ + +I + + Sbjct: 196 ESGQWTGNEPACYFKHTYDTPLEVSQAF---GSAIKESLTTLESINDVQGERKIRISKNG 252 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I + VD+S S+ + K A+L + I S Sbjct: 253 TLNIYIAVDISESIQKDHV--------------------ESAKKAILKLITKISSFSVSP 292 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 L+ +++ + + + E + + + + + A++ + Sbjct: 293 NYE---LLFFSSELSEVVNILDFFENQPVDIKGRLTKFKVNAEHTGTDLNLAFKTILERM 349 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT--------------IKICD 355 ++ + + II TDG N S + T + Sbjct: 350 -----ALIKQRVGEKAFEEHRHAIIVFTDGVYNMGGSPLPTVAKIKHMVYMNKIDEETGQ 404 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNVVNADSLIHVFQNISQL 408 ++ ++ I I A L + + + ++ +L F NI Sbjct: 405 NPRDEYLDIYIFGIGAEIYDSDLRPLTAGTGGEHFFKLLEIQNLQETFDNIIAD 458 >gi|50842569|ref|YP_055796.1| hypothetical protein PPA1091 [Propionibacterium acnes KPA171202] gi|50840171|gb|AAT82838.1| hypothetical membrane associated protein [Propionibacterium acnes KPA171202] gi|314976099|gb|EFT20194.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1] gi|314989870|gb|EFT33961.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3] gi|315077955|gb|EFT50006.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2] gi|315095966|gb|EFT67942.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1] gi|315101403|gb|EFT73379.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1] gi|315108619|gb|EFT80595.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2] gi|327331877|gb|EGE73614.1| hypothetical membrane associated protein [Propionibacterium acnes HL096PA3] gi|327445861|gb|EGE92515.1| hypothetical protein HMPREF9571_01597 [Propionibacterium acnes HL043PA2] gi|327448157|gb|EGE94811.1| hypothetical protein HMPREF9570_00996 [Propionibacterium acnes HL043PA1] gi|327453937|gb|EGF00592.1| hypothetical protein HMPREF9586_01901 [Propionibacterium acnes HL083PA2] gi|328754611|gb|EGF68227.1| hypothetical protein HMPREF9588_02098 [Propionibacterium acnes HL025PA2] gi|328760793|gb|EGF74359.1| hypothetical membrane associated protein [Propionibacterium acnes HL099PA1] Length = 182 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 13/142 (9%), Positives = 39/142 (27%), Gaps = 10/142 (7%) Query: 4 LSRFRFYFK-KGIASEKA-NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA- 60 LSR+RF + ++ + S++ + + L+ G + +++ A A Sbjct: 40 LSRWRFPVSPQRRRDDRGVSMSVLVTVLLPILLISAGVAVDGA----ERSR---AVRVAH 92 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +A + + + + +A S + Sbjct: 93 TVAAEAARAGCEEGSAAQLVGQDGSSAARMAAEASARRAKADGLSQLVIDVNGDAVSVAT 152 Query: 121 IVNSSRISMTHMANNRLDSSNN 142 + + + + + Sbjct: 153 EITRPTRLLALIGLTEVHGRAS 174 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 64/201 (31%), Gaps = 21/201 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV 276 +A K+A+L L + +I + + V S + Sbjct: 410 TSGSMTGDSIAQAKSAILNALAGLGSQDTFN------VIAFDSSVRSLSPVALSATAANL 463 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + T+ PA+ +A S K ++F+ Sbjct: 464 GKANLFVQSLEADGGTEMAPALLRALSQPESGVSSISSAV---------KPERLKQVVFI 514 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG N ++++ + + ++ ++ T+ I A+PNG + + + + V Sbjct: 515 TDGAVGN-EASLFALIAANIGRQ---RLFTVGIGAAPNGYFMERAARAGRGTYTYVGKIS 570 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 + + + + + S + Sbjct: 571 EVDAKIGELLEKIESPQISDV 591 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 49/170 (28%), Gaps = 18/170 (10%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYV 280 R + + + + LS + V GL+ Y+ K+ +++ Sbjct: 659 FSRVRQPNFQQVVNFLKTIVSSLSIRPDTVRFGLVFYSEEPRLEFSLDAFQNPAKILEHL 718 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T + A+ F ++ + Q+ + +T+G Sbjct: 719 DKLTYRERKGRTKTGAALDF-----------LRNEVFIQEKGSWSNHGVQQIAVVITEGF 767 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + + S + + + I + + + L K P + Sbjct: 768 SQDRVSRP-----ASRLRRAGVTIYAVGTHNVSESKDLEKIASYPPWKYS 812 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 69/221 (31%), Gaps = 27/221 (12%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V +S + A +D + + +KN L + L + V Sbjct: 185 PHVDHVSPACREAALADIVFLVDNSTSIGPQN--FQKVKNFLYSVI---LGLDISSDRVR 239 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKR 311 +GL Y + + ++ + + +L T++ A++ Sbjct: 240 VGLAQYNDNIYPAFQL--NQHPLKSMILEQIQNLPYRTGGTNTGSALEF----------- 286 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + + +I +TD +SN ++ D+ KE+ + + + Sbjct: 287 IRTNYLTEESGSRAKDRVPQIVILVTD-----RESNDEVQEVADRLKEDGVVVYVVG-VN 340 Query: 372 SPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVH 411 + Q L K E +N N + L +I Q + Sbjct: 341 VQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCS 381 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 63/195 (32%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGFSNRIKVWKDHLVSVTP-----NSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++ Q+L + I ++FLT Sbjct: 357 DGKVYIHHMSPSGGTDINGALQRGIQLLND----------YVAHNDIEDRSVS-LVVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T+KI + +E + I T+ I A + + L L+ C + H Sbjct: 406 DGKPTVGE--THTLKILNNTREAARGRVCIFTVGIGADVDFKLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 + LI + I Sbjct: 464 DHDARAQLIGFYDEI 478 >gi|46362531|gb|AAH68979.1| Slc35c2 protein [Danio rerio] Length = 1816 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 54/165 (32%), Gaps = 22/165 (13%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 E V + L Y+ + T K T + A+ A L + Sbjct: 609 GPEGVQIALSQYSGDPRTEWHLNNFTSKEPLLEAVRNFRYKGGNTFTGQALIHA---LEN 665 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + K +F++ LTDG++ + I ++ K ++I I Sbjct: 666 NLKEEVGA----------RPNTPQFLLLLTDGKSQDD-----AIAAANRLKNAGVEI--I 708 Query: 368 SINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMV 410 +I + L+ S P Y V + L + +++++ Sbjct: 709 AIGVKNADEAELRQVASEPLELNVYKVNDFPLLSKLVGKLARILC 753 >gi|90020471|ref|YP_526298.1| arginine biosynthesis bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase [Saccharophagus degradans 2-40] gi|89950071|gb|ABD80086.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 708 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 44/366 (12%), Positives = 113/366 (30%), Gaps = 75/366 (20%) Query: 54 ESANNAAILA---------GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKE 104 SA +A L+ ++ ++ L D+ +++ + D + + Sbjct: 177 SSAAESAKLSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGMPVDGDTIVAPAPQG 236 Query: 105 SLSGYSAVFYNTEIQNIVNSSRISMTH-----------MANNRLDSSNNTIFYNMDVMTS 153 + + + S S+ + + L + + Sbjct: 237 NDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEKDAIRAEELINYFD 296 Query: 154 YDYRLQFIEHLLNQRYN-----------QKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 Y+Y L + + N +K+V I ++P + ++D+SGS Sbjct: 297 YNYPLP-SDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDIAPDQKPRTNLVFLLDVSGS 355 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 M+ + K+ +K ++ + L +++ + ++ Y Sbjct: 356 MN-------------------SQDKLPLVKQSMEMLLSTLNPDD------TVAIVVYAGA 390 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 +EP+ +K ++ ++ T + AY + ++ Sbjct: 391 AGTVLEPTPAKDK-QKILSAMQRLQAGGSTAGGAGIALAYDLAEANF------------- 436 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIK-ICDKAKENFIKIVTISINASPNGQRLLKT 381 +I TDG+ N +N T++ ++ +E I + + L++T Sbjct: 437 ---DKKAVNRVILATDGDFNVGSTNNETLQGFVERKREKGIFLSVLGFGQGNYNDHLMQT 493 Query: 382 CVSSPE 387 + Sbjct: 494 LAQNGN 499 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 63/194 (32%), Gaps = 24/194 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 TK+ K+AL L + + +V K+ S + +R Sbjct: 85 SSASMVGTKLRQTKDALFTILHDLRPQD----RFSIIGFSNRIKVRKDHLISVTPDSIRD 140 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 TD A+++A ++L S I+FLTD Sbjct: 141 GKVYIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI-----------GDRSVSLIVFLTD 189 Query: 339 GENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHYN 391 G+ + +T+KI + +E + I TI I + + L L+ C + H Sbjct: 190 GKPTVGE--THTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEE 247 Query: 392 VVNADSLIHVFQNI 405 LI + I Sbjct: 248 EDAGSQLIGFYDEI 261 >gi|42524203|ref|NP_969583.1| hypothetical protein Bd2793 [Bdellovibrio bacteriovorus HD100] gi|39576411|emb|CAE80576.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 354 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 57/160 (35%), Gaps = 23/160 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + + +D K +++A K L +D + +G++ + + Sbjct: 97 DVSESMMAEDVKPSRLAQAKAELSRLVD-------LMPGNKVGIVAFAGSAALLSPLTND 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 ++ Y+ S+ + T+ T A+K + + + T + Sbjct: 150 PGAIKMYLESLEPSSVSSQGTNFTEALKISKEAFERGGVSTDETV-----------KVTR 198 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 I+ +DGE++ + K ++I +++ Sbjct: 199 VILIASDGEDH----EQGALDEAKKMAGEGVRIFSLAYGT 234 >gi|302540662|ref|ZP_07293004.1| von Willebrand factor, type A [Streptomyces hygroscopicus ATCC 53653] gi|302458280|gb|EFL21373.1| von Willebrand factor, type A [Streptomyces himastatinicus ATCC 53653] Length = 340 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 63/237 (26%), Gaps = 24/237 (10%) Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + E L + + S D S P+ ++M K +L Sbjct: 105 AVTPSPGPSAREVEAALGMWTITV-QSARLNTVVDASASMSDPVPGRPGESRMDVTKASL 163 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + +GL ++T + TR + + T Sbjct: 164 RQALSRFNAGD------EIGLWEFSTE------LDGDRDYRELVATRRLGARTPDGTGQR 211 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-----KFIIFLTDGENNNFKSNVNT 350 + A+ L + S + ++ LTDG N + S Sbjct: 212 AELAAAFDALKPLPEGSTGLYDTTLAAYKKAQETFVRGKFNAVVMLTDGANQDPGSISR- 270 Query: 351 IKICDKAK-----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + K + + ++ I++ + + ++ V + + Sbjct: 271 GALVKELKRLVDPDRPVPLIAIAVGPDADQAACREIAQATGGSAQQVNDPAQINTAM 327 >gi|258543794|ref|ZP_05704028.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] gi|258520969|gb|EEV89828.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] Length = 1128 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/345 (11%), Positives = 70/345 (20%), Gaps = 63/345 (18%) Query: 120 NIVNSSRISMTHMANNRLDSSNNT---IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV-- 174 N + S S+ + V + L +L Sbjct: 26 NTLAQSTPSINDTPFANRPLHLQSESKTTSAGGVKPNVMLFLDDSGSMLENANTGSYRIP 85 Query: 175 SFIPALLRIEMGERPIFLIELVV------DLSGSMHCAMNSDPEDVNSAPIC-------- 220 L + LI + + SM + D+ Sbjct: 86 DLWNTYLGNCTYNQNNLLISRNLLPGNGQQTNPSMGRCLYYYNADIIYYRNKGLPPGPKS 145 Query: 221 --QDKKRTKMAALKNALLLFLDSIDLLSHVK---------EDVYMGLIGYTTRVEKNIEP 269 + + +M A NA+ L + + G + ++ Sbjct: 146 FLRPAAQVRMDANINAINEVLTKTGDSVNWHLLTLWGSEFRHLNGGFYKFDNIGTASVGL 205 Query: 270 S--------------WGTEKVRQYVTRDMDSL--------------ILKPTDSTPAMKQ- 300 S GT +Y+ + + Sbjct: 206 SAEKARKLVNNMSPIAGTPATERYLKAANVLDHQIKYRCQKNYIVFMSDGEANGGGFPWP 265 Query: 301 -AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 ++ G + ++ D + K Sbjct: 266 GGVYGTQPQWWSQHDSSGPNSGDPGKGVSVFSSKLYNMDMRVGGTDVEGGSWDDPKYPK- 324 Query: 360 NFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I+ +TI P G+ LK E Y NA L F Sbjct: 325 QNIETITIGYGNGLTPQGRNYLKNAAQPSEGAYFANNASELADAF 369 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 76/220 (34%), Gaps = 55/220 (25%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LV+D SGSM ++ + A+ LFL L ++ Sbjct: 307 VCLVLDKSGSMAAMD-------------------RLNRMNQAVKLFL-----LQITEKGS 342 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 ++G++ + R + R Y+ + T ++ A+Q + + Sbjct: 343 WVGIVLFDERAIIRNPLIQIISEDDRNYLMTRLPEAAGGGTSICSGVQAAFQAIKQKFQT 402 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISIN 370 + + I+ LTDGE+ T+ C ++ K++ I T+++ Sbjct: 403 TDGSE----------------IVLLTDGED-------VTVSSCFEEVKQSGATIHTVALG 439 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADS----LIHVFQNIS 406 S Q L + + + ++ LI F +S Sbjct: 440 TSA-AQELERLSDMTGG-ISTAPSDEAQNNGLIDAFSGLS 477 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 26/186 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + L + + + +T + T K + + Sbjct: 378 SNFRLMLEFISNVAKAFEISDIGSKIATVQFTYDQRTEFSFTDYTTKEKVLSAIRNIRYM 437 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ + G + F++ LTDG++ + Sbjct: 438 SGGTATGDAI---------SFTTRNVFGPVKDGAN------KNFLVILTDGQSYDD---- 478 Query: 349 NTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNI 405 A++ I + ++ + A + LK S P + + L + ++ Sbjct: 479 -VRGPAVAAQKAGITVFSVGVAWAPLDD---LKDMASEPRESHTFFTREFTGLEQMVPDV 534 Query: 406 SQLMVH 411 + + Sbjct: 535 IRGICK 540 >gi|296221263|ref|XP_002756657.1| PREDICTED: von Willebrand factor A domain-containing protein 2 [Callithrix jacchus] Length = 725 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 14/155 (9%), Positives = 41/155 (26%), Gaps = 20/155 (12%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 79 DALDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGRTETGLALKNLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + KE + Sbjct: 139 HRGFPGGR---------------NASVPQILIIITDGKSQGHVALP-----AKQLKEKGV 178 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + + + + ++ + Sbjct: 179 TVFAVGVRFPRWEELHMLASEPREQHVLLAEQVED 213 >gi|163816348|ref|ZP_02207714.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] gi|158448345|gb|EDP25340.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] Length = 612 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 71/233 (30%), Gaps = 45/233 (19%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + K++ I+ G++ + ++D SGSM K Sbjct: 205 NKDTKLMMVGINTDEIDFGDKKPSNLVFLIDTSGSM-------------------YDDNK 245 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++ + + +++D V ++ Y + + G+E+ + Sbjct: 246 LPLVQQSFAMLAENLDENDRVS------IVTYAGEDTVVLSGTPGSEQ-YTISEALSNMT 298 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--K 345 T+ A+ AY++ + +I TDG+ N Sbjct: 299 AEGCTNGGDAIITAYELAEKNFINGGNN----------------RVILATDGDLNVGLTS 342 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 + I ++ KEN I + + L+ + + + +A Sbjct: 343 ESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAYE 395 >gi|75906479|ref|YP_320775.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75700204|gb|ABA19880.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 615 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 44/159 (27%), Gaps = 18/159 (11%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 ++ + ++ Y V+ + P T+K T+ + + + + Sbjct: 68 DQLEPKDILSVVVYDDAVDTVVSPQPVTDK-PALKKSIRQVRAGGITNLSGGWLKGCEYV 126 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKI 364 ++ LTDG N + + K E I Sbjct: 127 KHQL----------------DPQKINRVLLLTDGHANMGIQDPKILTATSAQKAEEGITT 170 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 T+ N L+ ++ Y + + D VF Sbjct: 171 TTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|314969033|gb|EFT13131.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1] Length = 169 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 11/117 (9%), Positives = 23/117 (19%), Gaps = 4/117 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVG----EGI 91 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 L+ T + + Sbjct: 92 AVGQRVGLAELAGTRCSNPAIAVDSSDLTLPVGSAGTTSARVSCTIKLSDLLVPGMP 148 >gi|300788919|ref|YP_003769210.1| hypothetical protein AMED_7090 [Amycolatopsis mediterranei U32] gi|299798433|gb|ADJ48808.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 134 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 11/139 (7%), Positives = 35/139 (25%), Gaps = 6/139 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + E+ + S++ ++ L+ +GF + + S A AG + Sbjct: 1 MRSVRGERGSASVLVVGLSLAVLIGVGFAVDGSRKGLAYSEATSIAEEAARAGDQALRVP 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 G + + + A + + + + Sbjct: 61 ALAAGTDADVVPDLAAAEARRYVVSAGA------TGTVQVDNGRIVVDTTIVRQTVFLGV 114 Query: 132 MANNRLDSSNNTIFYNMDV 150 + + + + Sbjct: 115 IGIGEFTVHGHGVAVLVSA 133 >gi|163852925|ref|YP_001640968.1| TadE family protein [Methylobacterium extorquens PA1] gi|163664530|gb|ABY31897.1| TadE family protein [Methylobacterium extorquens PA1] Length = 202 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 11/124 (8%), Positives = 32/124 (25%), Gaps = 1/124 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 +E +I FA ++L I + N + A A + ++++ Sbjct: 20 FGRAEGGVSAIEFAFIAPVLVILFIAAIEIPRAIATNNRLAQATIAMADLASKNDYADIN 79 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + ++ A + + V + + + + Sbjct: 80 DVYAAAQVVAAPYSLAGTGIVLTAG-GVYQVGNDFVARVCSSVQSGDKARIVGSDIGPPP 138 Query: 134 NNRL 137 Sbjct: 139 AGTA 142 >gi|149202125|ref|ZP_01879098.1| hypothetical protein RTM1035_12398 [Roseovarius sp. TM1035] gi|149144223|gb|EDM32254.1| hypothetical protein RTM1035_12398 [Roseovarius sp. TM1035] Length = 178 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 10/59 (16%), Positives = 26/59 (44%), Gaps = 1/59 (1%) Query: 4 LSR-FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++R F+ + ++ + S+ F L F+ ++ F I + + +E + A+ Sbjct: 1 MTRVFKNFLRRFRQDDTGTASLEFVLVAPFFIGIMIFSIEMGFVTMRATLLERGLDMAV 59 >gi|227830111|ref|YP_002831890.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|227456558|gb|ACP35245.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] Length = 356 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 33/82 (40%), Gaps = 11/82 (13%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKT 381 II LTDG+ + ++ + K+ I I+TI I N + L K Sbjct: 122 QSQVPTKIIMLTDGKPTDKRNVKDYEKL-------DIPPNTQIITIGIGNDYNERILKKL 174 Query: 382 CVSSPEYHYNVVNADSLIHVFQ 403 S Y++ + L ++F+ Sbjct: 175 ADRSSGKFYHIKDISELPNIFE 196 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 36/327 (11%), Positives = 85/327 (25%), Gaps = 42/327 (12%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + +++ + + A N + + Sbjct: 1937 VASTDQDVTVYEKALDLTQDGQDLAPGTVTGSDPGNTGETATGTLVGSVSGGSGAITYTL 1996 Query: 125 SRISMTHMANNRLDSSNNTI------------FYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + +L++ + S + + + L N + Sbjct: 1997 VGSATGTYGQIQLNADGTYTYTLTSAPKTTPNANDGPNTLSESFTYKATDALGNSTTSTI 2056 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V+ + + + ER + +E +D + + ++ D + P +++ K Sbjct: 2057 VVNIVDDVPKAVASERSVAAVE--IDSNILIVLDISGSMADASGVPGL-----SRLELAK 2109 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A+ LD D L VK + L+ +++ + + T Sbjct: 2110 QAISALLDKYDDLGDVK----VQLVTFSSNATDRTSVWVDVATAKTLLAGLS---AGGGT 2162 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ Y + K Q F +DG+ N Sbjct: 2163 NYDAAVATMYNAFNTSGKL---------------TGAQNVGYFFSDGKPNEGDIGTADEA 2207 Query: 353 ICDK-AKENFIKIVTISINASPNGQRL 378 N IK I + + + L Sbjct: 2208 TLKAFLDANNIKNYAIGLGSGVSNANL 2234 >gi|327268126|ref|XP_003218849.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Anolis carolinensis] Length = 1325 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 39/126 (30%), Gaps = 33/126 (26%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E ++ + T++ A++ A+ P + Sbjct: 626 NLEDALSWIEELQVRKVGGSTNTLQALQVAF-----------------------CDPDTQ 662 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISINASPNGQR--LLKTCVSSPE- 387 I LTDG + I + K + I + TIS N G L + + Sbjct: 663 AIYLLTDGRP-----DQPPNTILAQLKLQRNIPVHTISFNCDDTGANKFLHEVASETGGR 717 Query: 388 -YHYNV 392 ++Y++ Sbjct: 718 FHYYHI 723 >gi|313837215|gb|EFS74929.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314971986|gb|EFT16084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 322 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D ++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDASPNRLAAAKAKAKDFVNSLPTG------FNVAVVSISAHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + + T A+ ++ Q + S Sbjct: 155 RPTVLRALDGIELQD---GTALGEAIDKSLQAVKMAPGGSKD-------------RVPAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-----ASPNGQ--------RLL 379 I+ L+DG N S + AK + + TI+ +GQ +LL Sbjct: 199 IVMLSDGGNTQGGSPLVAATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLL 255 Query: 380 KTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V++ + + Sbjct: 256 SDVADRTDAKSWTADSADKLQEVYKQVHSSVG 287 >gi|227827418|ref|YP_002829197.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584633|ref|YP_002843134.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619574|ref|YP_002914399.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459213|gb|ACP37899.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019682|gb|ACP55089.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380643|gb|ACR41731.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 356 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 33/82 (40%), Gaps = 11/82 (13%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKT 381 II LTDG+ + ++ + K+ I I+TI I N + L K Sbjct: 122 QSQVPTKIIMLTDGKPTDKRNVKDYEKL-------DIPPNTQIITIGIGNDYNERILKKL 174 Query: 382 CVSSPEYHYNVVNADSLIHVFQ 403 S Y++ + L ++F+ Sbjct: 175 ADRSSGKFYHIKDISELPNIFE 196 >gi|293354104|ref|XP_001061161.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] Length = 550 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 62/200 (31%), Gaps = 23/200 (11%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSWG 272 + K T L + ++ + + +M I + TR Sbjct: 76 YFVFDKSKAVTSWNDLYTSWEDLVEKYVKYACTLQLLPKQHMSFILFDTRASIKTPLISE 135 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K+RQ +T+ ++ T+ +++A + + + V + Sbjct: 136 RWKIRQELTKLDSQVLEGATNLQKGLEKANEQIQN--------------VNSGATKRSSL 181 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 II GE + + DKA+ + + +N+S + L + E + + Sbjct: 182 IIAAVWGELT-PQVLKESKDEADKARSMGAYVYCVGLNSSERQK--LNEITDNEENVFIL 238 Query: 393 VNADSLIHVFQNISQLMVHR 412 + F +I + Sbjct: 239 DGQS---NRFHSIVDEIGTM 255 >gi|293342304|ref|XP_002725205.1| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] Length = 603 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 62/200 (31%), Gaps = 23/200 (11%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSWG 272 + K T L + ++ + + +M I + TR Sbjct: 76 YFVFDKSKAVTSWNDLYTSWEDLVEKYVKYACTLQLLPKQHMSFILFDTRASIKTPLISE 135 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K+RQ +T+ ++ T+ +++A + + + V + Sbjct: 136 RWKIRQELTKLDSQVLEGATNLQKGLEKANEQIQN--------------VNSGATKRSSL 181 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 II GE + + DKA+ + + +N+S + L + E + + Sbjct: 182 IIAAVWGELT-PQVLKESKDEADKARSMGAYVYCVGLNSSERQK--LNEITDNEENVFIL 238 Query: 393 VNADSLIHVFQNISQLMVHR 412 + F +I + Sbjct: 239 DGQS---NRFHSIVDEIGTM 255 >gi|145596959|ref|YP_001161256.1| hypothetical protein Strop_4450 [Salinispora tropica CNB-440] gi|145306296|gb|ABP56878.1| hypothetical protein Strop_4450 [Salinispora tropica CNB-440] Length = 161 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 11/135 (8%), Positives = 36/135 (26%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ S+ A++ + L ++ ++ + A AG + + Sbjct: 25 RTDSGRVSVFLAIAALGILTVVALAFDGAGQLRALQRADNLASEAARAGGQSIDLVSAIE 84 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G A+RA+ + + S + + ++ Sbjct: 85 GGPKRIDEPQARRAVTEYLAAAGASTHTVSFPVVDGETVVRVRVTVTYERSLLGLVGFDD 144 Query: 136 RLDSSNNTIFYNMDV 150 ++ + + Sbjct: 145 TVNVTGEATARALTG 159 >gi|29349873|ref|NP_813376.1| putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341784|gb|AAO79570.1| conserved hypothetical protein, putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] Length = 621 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/338 (11%), Positives = 85/338 (25%), Gaps = 64/338 (18%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-----GYSAVFYNTEIQNI 121 + + + N K+ F + S S S + Sbjct: 138 SVRMDAVNTEEYKSISENGFKQVGESPLSTFSIDVDAASYSNMRRMINSGTLPVPDAIRT 197 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 A + ++V Sbjct: 198 EELVNYFSYDYAKPTGSDPVKITMEAGVCP---------------WNADHRLVRIGLKAR 242 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I + P + ++D+SGSM T++ +K++L L +++ Sbjct: 243 EIPTDKLPESNLVFLIDVSGSMW-------------------GPTRLDLVKSSLKLLVNN 283 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + V VY G V+ P +K+R + ++ A Sbjct: 284 LREKDKVAIVVYAG----NASVKLESTPGSDKQKIRDAIDELTSGGST---AGGAGIQLA 336 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKEN 360 Y++ + II +DG+ N S+V ++ +K +++ Sbjct: 337 YKVAKQNFL----------------PKGNNRIILCSDGDFNVGVSSVEGLEQLIEKERKS 380 Query: 361 FIKIVTISINASPN-GQRLLKTCVSSPEYHYNVVNADS 397 + + + + H + N Sbjct: 381 GVFLSVLGYGMGNYKDNKGQALAEKGNGNHAYIDNLQE 418 >gi|103486591|ref|YP_616152.1| hypothetical protein Sala_1102 [Sphingopyxis alaskensis RB2256] gi|98976668|gb|ABF52819.1| conserved hypothetical protein [Sphingopyxis alaskensis RB2256] Length = 542 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 19/54 (35%), Gaps = 4/54 (7%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME 54 M + SR I +A SI A+ + + + V + + ++ Sbjct: 1 MAIRSRLAS----LIRCRRAGISIAAAIGMPMLIGAAALAVDVGSLYLDRRKLQ 50 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 29/200 (14%) Query: 229 AALKNALLLFLDSIDL---LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K + F++ ++ + V G + Y+ VE + + + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAINY 128 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ Q S R KF + +TDG Sbjct: 129 L--GRGTFTDCAISNMTQQFQSQTARDV-----------------KFAVVITDGHVT-GS 168 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL---I 399 ++A++ IK+ ++ + Q L + + + Y + D+L Sbjct: 169 PCGGMKMQAERARDMGIKLFAVAPSEDVYEQGLREIASPPHDLYRSNYTITPKDALHIDE 228 Query: 400 HVFQNISQLMVHRKYSVILK 419 + + I + M H Y+ K Sbjct: 229 NTIERIIKAMKHEAYAECYK 248 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + + L+ Y + E ++ ++ + + ++ A+ Sbjct: 864 TLARRNDDNMNARIALLQYGSESEQNVVFPLTYNLTEISNALAQIKYLDS--SSNIGSAI 921 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + G ++ + +F+TDG + + + K Sbjct: 922 IHAINNI---------VLSPGNGQRVARRNAELSFVFITDGITGSKN----LEEAINSMK 968 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + + LLK + + + +SL Sbjct: 969 KQDVMPTVVALGSDVDMDVLLKLGLGDRAAIFREKDYESLSQ 1010 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 56/182 (30%), Gaps = 33/182 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V+D S S+ + ++ + + K+ + ++ + Sbjct: 613 ALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGS-----IAKDPKSETGARVGVVQYSH 667 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E + + E+ S E V++ I T + A++ AY L + Sbjct: 668 EGTFEAIKLDD---ERINSLSSFKEAVKRL------EWIAGGTWTPSALQFAYNKLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG + + N +C + + TI I Sbjct: 719 RREKAQVFA---------------VVITDGRYDPRDDDKNLGALC----GRDVLVNTIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|218531749|ref|YP_002422565.1| TadE family protein [Methylobacterium chloromethanicum CM4] gi|254562693|ref|YP_003069788.1| hypothetical protein METDI4318 [Methylobacterium extorquens DM4] gi|218524052|gb|ACK84637.1| TadE family protein [Methylobacterium chloromethanicum CM4] gi|254269971|emb|CAX25949.1| hypothetical protein METDI4318 [Methylobacterium extorquens DM4] Length = 202 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 13/124 (10%), Positives = 27/124 (21%), Gaps = 1/124 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 +E +I FA ++L I + N + A A+ ASK Sbjct: 20 FGRAEGGVSAIEFAFIAPVLVILFIAAIEIPRAIATNNRLAQAT-IAMADLASKNDYADI 78 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + V + + + + Sbjct: 79 NDVFAAAQVVAAPYSLAGTGIVLTAGGVYQVGNDFVARVCSSVQSGDKARIVGSDIGPPP 138 Query: 134 NNRL 137 Sbjct: 139 AGTA 142 >gi|94498563|ref|ZP_01305118.1| TadE-like protein [Sphingomonas sp. SKA58] gi|94422006|gb|EAT07052.1| TadE-like protein [Sphingomonas sp. SKA58] Length = 193 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 31/86 (36%), Gaps = 1/86 (1%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA-NNAAILAGASK 67 +F ++ FA+ + + L L+ I + + +E A AA ++ AS Sbjct: 2 RHFAPLAKDRTGVSTVEFAIILPALLTLMCGAIELGHMLLARVVLEGAMTEAARISTASL 61 Query: 68 MVSNLSRLGDRFESISNHAKRALIDD 93 + R ESI + D Sbjct: 62 ETAEAQRTTLMEESIEQAMGNFPLAD 87 >gi|84687514|ref|ZP_01015390.1| von Willebrand factor type A domain protein [Maritimibacter alkaliphilus HTCC2654] gi|84664423|gb|EAQ10911.1| von Willebrand factor type A domain protein [Rhodobacterales bacterium HTCC2654] Length = 760 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 35/121 (28%), Gaps = 8/121 (6%) Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A+ A L K + + +I ++DG + + Sbjct: 16 PGTDR-QAIADAVNGLNPKGKTPLSAAVLQAAETLKYTEEAATVILVSDGIETC---DFD 71 Query: 350 TIKICDKAKENFI--KIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + +E + I + +L + + + NA L + + Sbjct: 72 PCALGRQLEETGVGFTAHVIGFDVTEPEAQAQLQCLAEETGGMYRSASNASELSGALEEV 131 Query: 406 S 406 + Sbjct: 132 A 132 >gi|239934065|ref|ZP_04691018.1| hypothetical protein SghaA1_38057 [Streptomyces ghanaensis ATCC 14672] gi|291442504|ref|ZP_06581894.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291345399|gb|EFE72355.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 141 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 46/144 (31%), Gaps = 7/144 (4%) Query: 1 MHLLSRFRFYFK-KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 MH +R + +++ + A ++ + +I A+ LL+ +I W+Y + + A A Sbjct: 1 MHHRARLKAWWRGRRWADDRGDAAIQMAIVYPVVLLVAIAVIQASMWYYAR---QIALTA 57 Query: 60 A--ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 A +A A ++ + R + + E + + Sbjct: 58 AREGVAAARAYQASPADGAARAREVL-GRTAGDSLHSVSVTAGSTGERVRVQVTGTAQSM 116 Query: 118 IQNIVNSSRISMTHMANNRLDSSN 141 I I + R Sbjct: 117 IPGIAGLTITQSASGPVERWTVPG 140 >gi|323525753|ref|YP_004227906.1| hypothetical protein BC1001_1410 [Burkholderia sp. CCGE1001] gi|323382755|gb|ADX54846.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 613 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 41/140 (29%), Gaps = 6/140 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +++ A +S ++ + + ++Y + ++ + A AG + S Sbjct: 40 ARRQRGAIALLAA-VWVSLAVIALGAVDIGHFYYARRDLQRTADLAAAAGVQLIGSAGGC 98 Query: 75 LGDRFESISNHAKRALIDDAKRFI-----KNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + N L D S ++ +V S + Sbjct: 99 AAAASSAQLNATANGLPGDGAVQTTCGRWDPSANAGQSYFAVTGTPLNAVQVVVSRPVPF 158 Query: 130 THMANNRLDSSNNTIFYNMD 149 + + S + + + Sbjct: 159 LFLIGQTRELSASAVAQATN 178 >gi|253998722|ref|YP_003050785.1| outer membrane adhesin-like protein [Methylovorus sp. SIP3-4] gi|253985401|gb|ACT50258.1| outer membrane adhesin like proteiin [Methylovorus sp. SIP3-4] Length = 1536 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/281 (11%), Positives = 72/281 (25%), Gaps = 14/281 (4%) Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 S T + +++T D + + D T L Sbjct: 936 SGSGTGTLTGSAGGQPVLTITINNGGEYDVNLLRPLDHPDTKTEDVLTLNVGVSATANGV 995 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + + + +LS + + + + T++A Sbjct: 996 TSVGNL------TVNVEDDSPVANPISANLSTTDTNLLITLDISGSMRTQDGVGGTTRLA 1049 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + ++ LD D L + + L+ ++T + ++ + + + + + Sbjct: 1050 SAIQSIKTLLDKYDALGDTR----ISLVVFSTTAAQVGTDWMTIDQAKAQLDQILVNGPK 1105 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS--- 346 T+ A+ A K + N LT N N Sbjct: 1106 GNTNYDSALANAMDAFDDAGKLTNAQNVAYFISDGEPNTGSGSNTSLTGSTNTNGSDAGI 1165 Query: 347 -NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + + N IK I + + N L Sbjct: 1166 QTQEELAWKTFLEANQIKSYAIGVGSDINSVNALNPVAYDG 1206 >gi|340082|gb|AAA36794.1| undulin 1 [Homo sapiens] Length = 843 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 53/170 (31%), Gaps = 22/170 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 E +GL Y+ + + K T + A+ Sbjct: 1 FDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNY---- 56 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + K I +TDG++ + + +E+ +++ Sbjct: 57 ---------IFENSFKPEAGSRTGVSKIGILITDGKSQDDIIPPS-----RNLRESGVEL 102 Query: 365 VTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I + + + L+ S P+ + YNV D + V +++++ + R Sbjct: 103 FAIGV-KNADVNE-LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSR 150 >gi|307318081|ref|ZP_07597517.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306896122|gb|EFN26872.1| TadE family protein [Sinorhizobium meliloti AK83] Length = 141 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 37/103 (35%), Gaps = 1/103 (0%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASK 67 F ++ S+ ++ FAL + LLL+ +I Y +N + A + A + Sbjct: 6 FILRRLFRSQSGATAVEFALVCLPLLLLVFGIIEFGRAFYVRNELSHAVDVAARRVLIGQ 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 + + + + + D KE++ G + Sbjct: 66 IARDATDSEALTKLAGAVRESFHSGDPTLLTIAVTKETVDGIA 108 >gi|123491786|ref|XP_001325914.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908821|gb|EAY13691.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 753 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 72/241 (29%), Gaps = 55/241 (22%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + +S ++D SGSM Sbjct: 213 DKDKSIAISSDGYISISTYTFFEGKVQANTEF---YFIIDCSGSM--------------- 254 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 +++ K+ L + L S+ +I + T+ E +EP T++ Sbjct: 255 -----YGSRIKNAKSCLNVLLHSL------PIGCRFSIIKFGTKFEVALEPCDYTDENMS 303 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +D + M + ++ ++ + K + LTD Sbjct: 304 KAMHQLDLIDAD--MCGNDMISPLKYISEHPQK---------------KDYIKQVFLLTD 346 Query: 339 GENNNFKSNVNTIKICD--KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 GE++ I IC +A + ++ TI I + + ++ + + + + D Sbjct: 347 GEDDR-------ISICAMVQANRDNFRVFTIGIGSDADRNLIIDVARNGSGRYIFIDDED 399 Query: 397 S 397 Sbjct: 400 E 400 >gi|300788143|ref|YP_003768434.1| hypothetical protein AMED_6298 [Amycolatopsis mediterranei U32] gi|299797657|gb|ADJ48032.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 534 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 65/224 (29%), Gaps = 34/224 (15%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM--AALKNALLLFLDSI 242 G +++++D+SGSM+ + + + K + A + +L F + Sbjct: 329 AGVNLSARVQVLIDVSGSMNAQVPGTNLNRMQVTMEAAAKAMHLFKPATQLRMLAFSTRL 388 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D +E + M + + T ++ Y Sbjct: 389 DGDKDYRELLPMASVAQHLASGALEKLGQVKAT------------PDGGTGLYDSVLDTY 436 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----K 358 + + + +I +TDG N + + + + A Sbjct: 437 RTARREWE----------------PGRLNLVIVMTDGRNEDPRGISRADLLTELAGLQDA 480 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I ++ + I + L + ++ + + + VF Sbjct: 481 RRPIPLIGVGIGPDADKAELDQLTAATGGQAFLAPDPAKITDVF 524 >gi|258515700|ref|YP_003191922.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] gi|257779405|gb|ACV63299.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] Length = 140 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 13/89 (14%), Positives = 31/89 (34%), Gaps = 1/89 (1%) Query: 33 FLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL-SRLGDRFESISNHAKRALI 91 L+ + + K+ +++A +AA LAG+S + + L + + + Sbjct: 1 MLMFSALGLDYGRAYVLKHQLQAACDAASLAGSSAVSAKLITDGTGSVTNKKLLLDPIIA 60 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + N S+ + N Sbjct: 61 EARATDVWNQNVSSMKFIDKGVTIVDTSN 89 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 46/156 (29%), Gaps = 19/156 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + L K+DV + +I Y + +L T + Sbjct: 90 INWTTQLTEALLTDKDDVRIAMIQYAETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRT 149 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A L S++K K I+ TDG S + IK Sbjct: 150 GKALLAAKGELFSEEKG-------------ARKNASKIIVLFTDG-----LSVDDPIKHA 191 Query: 355 DKAKE-NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +E IKI + + + Q + + Sbjct: 192 QQLREIEKIKIYVVYVGSDGFEQEMNRIAGGRSNVF 227 >gi|172062966|ref|YP_001810617.1| TadE family protein [Burkholderia ambifaria MC40-6] gi|171995483|gb|ACB66401.1| TadE family protein [Burkholderia ambifaria MC40-6] Length = 142 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 41/132 (31%), Gaps = 3/132 (2%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +G + S+ FAL + L+++ +I V K + +A+ A AG V L Sbjct: 6 RGARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPML 65 Query: 73 SRLGDRFESISNHAKRA---LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + ++S + SG + T + + Sbjct: 66 TPTQIANVALSYTQNSLVSGGTGTVPTVAVTQANGTTSGTALTVTVTYTYSGLVLGTALS 125 Query: 130 THMANNRLDSSN 141 + +S+ Sbjct: 126 VLTGPITISASS 137 >gi|258651507|ref|YP_003200663.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554732|gb|ACV77674.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 593 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 53/194 (27%), Gaps = 28/194 (14%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW------GTE 274 ++ K+A L S + + + T+ + I S G+ Sbjct: 403 GTNGADRIDLAKDAAARGLGLYRADSDIGLWEFSTRLSPTSDHRELIPISSLGPDGQGST 462 Query: 275 KVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T + A + + + Sbjct: 463 GAARLAAALNGLQAIPDGGTGLYDTVLDATRTVRAGY----------------DPDRVNV 506 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEY 388 ++ LTDG N++ S + A E + +++I+ + L + ++ Sbjct: 507 VLLLTDGMNDDVNSITMDQLLSTLAAEQDPARPVPVISIAFGPDSDVAALQQISRATGGA 566 Query: 389 HYNVVNADSLIHVF 402 Y + + +F Sbjct: 567 TYLSQDPRQIGEIF 580 >gi|313244510|emb|CBY15285.1| unnamed protein product [Oikopleura dioica] Length = 486 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 44/129 (34%), Gaps = 7/129 (5%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGEN---NNFKSNV 348 +S K A+ + + +G + F K T G N S Sbjct: 319 NSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPEFGKGRRIDTVGTVVLLTNAASTD 378 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNIS 406 ++ D+ KE +++ + + GQ L S+P E Y + L + + I+ Sbjct: 379 EVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASNPTKENLYIAEESSDLAGLVKTIA 437 Query: 407 QLMVHRKYS 415 + + S Sbjct: 438 DEICATELS 446 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 64/236 (27%), Gaps = 49/236 (20%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D SGSM K+ + AL+ LD + Sbjct: 269 TTMPKNVVFVIDKSGSM--------------------SGRKIQQTREALIKILDDLSPRD 308 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + K R + L T+ A+ A Q+L Sbjct: 309 Q-FNLITFSSEATQWSPSLVPASAENVNKARSFAAAI---HALGGTNINDAVLMAVQLLD 364 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKI- 364 + + II LTDG+ + N+ C Sbjct: 365 RSNR-----------EERLPTRSVSLIILLTDGDPTVGEGPASNSKTRCTGENVREAVSG 413 Query: 365 -VT----ISINASPNGQRLLKTCVSSPEY----HYNVVNADSLIHVFQNISQLMVH 411 + + + S ++L + + + + +A L +Q ++ ++ Sbjct: 414 QYSSLPGLGFDVSYAPEKL---ALDTGGLARRIYEDSDSAMQLQDFYQEVANPLLT 466 >gi|296412728|ref|XP_002836073.1| hypothetical protein [Tuber melanosporum Mel28] gi|295629876|emb|CAZ80230.1| unnamed protein product [Tuber melanosporum] Length = 997 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 23/210 (10%), Positives = 60/210 (28%), Gaps = 36/210 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE--KNIEPS-WGTE 274 P+ + K+ ++++L + ++ MGL+ + + S Sbjct: 499 PVSSSMQGLKINLIRDSLKFLIHNLGERD------RMGLVTFGSSSGGVALTPLSVKSWS 552 Query: 275 KVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + V TD A +L K + + I Sbjct: 553 GWAKVVNSIRPVGQKSLRTDVVDGANVAMDLLMQRKSSNPIAS----------------I 596 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++D ++ ++ + + AK I I + + + +++ + V Sbjct: 597 LLISDSSTSDTENVDFVVSRAEAAK---ITIHSFGLGLTHKPDTMIELSTRTKGQFTYVK 653 Query: 394 NADSLIH-------VFQNISQLMVHRKYSV 416 + L Q +S + + Sbjct: 654 DWMMLRECVAGCIGALQTLSHQNAKLRLKL 683 >gi|218693335|gb|ACL01171.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 40/358 (11%), Positives = 109/358 (30%), Gaps = 27/358 (7%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 K++ ++ D ++ +A K ++ + + ++ Sbjct: 254 KLIDKITSNKDNRVALVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATT 313 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + +++ + N + + + + + QK + +L + Sbjct: 314 HNYSYLNLTNDANEVNILKSRIPKEAEH-INGDRTLYQFGATFTQKALMKANEILETQSS 372 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 LI V D +M A+N +P S + K+ + + + + Sbjct: 373 NARKKLIFHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKIP---DRSGILQEDFIING 429 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + V + ++ + + GT + V ++ S++ + A+ Y L Sbjct: 430 DDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMS---NEGYAINSGYIYLY 486 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + T GE N + I T Sbjct: 487 ------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG-------NIRPKGYDIFT 533 Query: 367 ISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + I + + ++ +++ S E + NV + + + + +V K+S++ Sbjct: 534 VGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSIV 591 >gi|219883044|ref|YP_002478208.1| TadE family protein [Arthrobacter chlorophenolicus A6] gi|219862050|gb|ACL42391.1| TadE family protein [Arthrobacter chlorophenolicus A6] Length = 126 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 36/114 (31%), Gaps = 7/114 (6%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + SE+ + ++ FAL + + ++ ++ + + AA GA M Sbjct: 1 MKHLRSERGSVAVEFALILPILIAVLLGIMEFGRAYNA----QITVTAAAREGARVMSIQ 56 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 S + A ++ + + + A T + + Sbjct: 57 GSPALAKTAVQ---AASPALNPQLSTGQIQVSPTTCTAGANVTVTVTYRLNFIT 107 >gi|4139906|pdb|1AOX|A Chain A, I Domain From Integrin Alpha2-Beta1 gi|4139907|pdb|1AOX|B Chain B, I Domain From Integrin Alpha2-Beta1 Length = 203 Score = 46.4 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 61/192 (31%), Gaps = 28/192 (14%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +A+ FL+ L +GLI Y + K V S Sbjct: 21 YPWDAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQY 80 Query: 289 LKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A++ A + S K ++ +TDGE+++ Sbjct: 81 GGDLTNTFGAIQYARKYAYSAASG-------------GRRSATKVMVVVTDGESHDGSML 127 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADS 397 I C+ + I I++ N L +K S + Y +NV + + Sbjct: 128 KAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAA 184 Query: 398 LIHVFQNISQLM 409 L+ + + + Sbjct: 185 LLEKAGTLGEQI 196 >gi|157151714|ref|NP_001096685.1| collagen alpha-3(VI) chain [Canis lupus familiaris] gi|70571939|tpe|CAI77244.1| TPA: collagen, type VI, alpha 3 [Canis lupus familiaris] Length = 3169 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 57/185 (30%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D L + + +GL+ Y + +Q + Sbjct: 1651 RRDSFQEVLRFVSEIVDTLYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKQQIIDAINKV 1709 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ +F + + +T G Sbjct: 1710 VYKGGRHANTKVGIEH-----------LRLNHFVPEAGSRLDQRVPQIAFVITGG----- 1753 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 KS + + + +K+ + + + + + + K S+ + V N L + + Sbjct: 1754 KSVEDAQEASLALTQKGVKVFAVGV-KNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1811 Query: 405 ISQLM 409 + + + Sbjct: 1812 VLETL 1816 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 57/187 (30%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ + +D + + + +I ++ + + K Sbjct: 1248 FQYIRTLIERLVDYL---DVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDEVQ-NAVRRLR 1303 Query: 288 ILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + A++ + + S +F++ ++ G K Sbjct: 1304 PKGGRQINIGSALEYVSRNIFKRPLGSRIEEG-----------VPQFLVLISSG-----K 1347 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 S+ + K+ + TI+ + ++L+K SPEY ++V L + Q + Sbjct: 1348 SDDEVDDSAAELKQFGVAPFTIA--RDADQEQLVKIS-LSPEYVFSVNTFRELPSLEQKL 1404 Query: 406 SQLMVHR 412 + Sbjct: 1405 LTPITTL 1411 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 48/168 (28%), Gaps = 20/168 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 ++ ++ LS + + +G++ Y+ Q + + Sbjct: 263 DFLVNLLERLSVGAQQIRVGVVQYSDEPRTVFSL-DTYSTKAQVLDAVKALAFTGGELAN 321 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + +F R G + ++ ++ G +++ + Sbjct: 322 VGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRDGVV----- 367 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ ++ + V + L Sbjct: 368 ALKQ--ASVFSFGLGAQAASRAELQHIATNDNLVFTVPEFRSFGDLQE 413 >gi|329894134|ref|ZP_08270119.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] gi|328923306|gb|EGG30626.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] Length = 460 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 44/144 (30%), Gaps = 10/144 (6%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ---ILTSDKKRSFFTNFFRQGVKIPSLP 328 + ++ A+ A + L +D F Sbjct: 139 DAFNIVEFNHEASALFAQPYPAENYALAHAIRFIRSLEADGGTEIEAAFDLTLALPTDAQ 198 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + IIF+TDG + KI + ++ ++ T+ I +SPN + + + Sbjct: 199 KLRQIIFITDGS--VSNESELLAKINRELEDR--RLFTVGIGSSPNRYFMEEAARAGRGT 254 Query: 389 HYNVVNADSLIHVFQNISQLMVHR 412 + N + I +L+ Sbjct: 255 FSYIANVSDVED---EIGRLLGKL 275 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 59/160 (36%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ QD ++ ++ KN ++ + + + +G+I + + + Sbjct: 97 DVSNSMNAQDVEQNRLQQAKNLIINAMGKMT-------NDKVGIIVFAGEASSIMPLTTD 149 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V YV S++ + TD AM+ A + K + Sbjct: 150 FTAVETYVGGVETSIVKMQGTDFLKAMQTAADKFRNVAKG------------------SR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ L+DGE+N A I+++++ I + Sbjct: 192 KVVLLSDGEDNEGNE----KAAAKLANREGIRVISVGIGS 227 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/260 (12%), Positives = 73/260 (28%), Gaps = 45/260 (17%) Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 S + Q ++ + RP + +++D+SGSM + +S Sbjct: 87 SLAAAPDPLLGAGGDNATQLYMAVGLDSGKASDFRRPRLNLLVLLDVSGSMGESFSSYYY 146 Query: 213 DVNSA----PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 D P +++R+KM K L + + G++ + + Sbjct: 147 DQLGQQVAQPSGSEEQRSKMDVAKEVLAGVVGKLGPDDS------FGVVLFRQMGRDIAD 200 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 S T+ + A + R ++ Sbjct: 201 TS--------------------GTNMQAGLDAA----------TGEMRACRTCMEADRAA 230 Query: 329 FQKFIIFLTDGENNNFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I+ +TD + N S+ + + I I + N + + Sbjct: 231 TENRIVLITDAQPNQGDISDEGLLARLKANAADGIHTTIIGVGLDFNTELVEGISKVRGA 290 Query: 388 YHYNVVNADS----LIHVFQ 403 +++V + L F Sbjct: 291 NYFSVHSPGEFRRRLTDEFD 310 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/365 (10%), Positives = 99/365 (27%), Gaps = 45/365 (12%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 + + KK A + +L + G + ++ + +A +D Sbjct: 52 VASDIENLLAKKRR---ALDRLANEAERLQREHLWQDGIKELDMAYYDSKADLDYYSMDG 108 Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + ++ Y+ +N VN S ++ + I ++ + + Sbjct: 109 EGEMENPSHIKLEFVYDPNFKNNVNYSYTAVQIP--TDIYKGAPVILNELNWTQALEKVF 166 Query: 159 QFIEHLLNQRYNQKIV--SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + N + + ++ +F A + +DL Sbjct: 167 -----MENSQEDPSLLWQAFGSATGVTRYYPATPWKSPDKIDLYDVRRRPWYIQGASSPK 221 Query: 217 API-----CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--- 268 + + +K +++ LD++ +V + + + E + Sbjct: 222 DMVILVDVSGSVSGLTLKLIKASVMEMLDTLSDDDYVN------VARFNEKAEAVVPCFK 275 Query: 269 --PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + TD A+ L + +P Sbjct: 276 HLVQANVRNKKIFKDAVQQMQAKGTTDYKSGFHFAFNQLLNKTN-------------VPR 322 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K I+ TDG + + +K +++ T S+ L+ + Sbjct: 323 ANCNKIIMLFTDGGEDRAQDVFMQYNWPNKT----VRVFTFSVGQHNYDVTPLQWIACTN 378 Query: 387 EYHYN 391 + +Y Sbjct: 379 KGYYF 383 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 57/195 (29%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 K+ K+AL L + H + I + + VR Sbjct: 302 SSASMVGAKLRQTKDALFTILHDLRPQDHFNIIGFSNRIKVWKDHLVSVTP-----NNVR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A++ +L + I+FLT Sbjct: 357 DGKVYIHHMSPSGGTDINGALQTGIALLHDS---------VARHDLEDRSV--SLIVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKENF---IKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T KI + +E + I T+ I + + L L+ C + H Sbjct: 406 DGKPTVGE--THTPKILNNTREAARGRVCIFTVGIGDDVDFRLLEKLSLENCGFTRHVHE 463 Query: 391 NVVNADSLIHVFQNI 405 + LI + I Sbjct: 464 DEDAGAQLIGFYDEI 478 >gi|325678275|ref|ZP_08157903.1| hypothetical protein CUS_5520 [Ruminococcus albus 8] gi|324110038|gb|EGC04226.1| hypothetical protein CUS_5520 [Ruminococcus albus 8] Length = 1201 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/288 (8%), Positives = 73/288 (25%), Gaps = 20/288 (6%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 SI+ L ++ ++ +I + K ++ LA + + L Sbjct: 10 RRVSGTISIMLTLLLLPIYSVLSIIIEGARYQSAKQQLDELTYLGQLAI---LADYMDFL 66 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + ++ S ++ + + T++ + N T Sbjct: 67 EENYDLYSFYSASDAEGKQMLDESFQYYVESATSAGGIDTTKLNKLFNLPMDCCTVEGMY 126 Query: 136 RL-----------DSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVSFIPALLRI 183 L + ++ S+ +E L +V+ ++ Sbjct: 127 SLADPEILKYQIKQYGKYRMPADILSDFSFKMIFDLLESKLGGVAKKLSVVNAASKQMKD 186 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--- 240 + S + + + S D + A + + + D Sbjct: 187 ASSIMNNAKAVFSAEESLTSNISTFSGKYDEFQNKKSEVTDWNIDQATYDIIKNYTDADM 246 Query: 241 --SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + +++ + + Y + + + + DS Sbjct: 247 NAQTNTANNLYSRLKSCMSSYQSTANTYLNYVEQIKDIESQSNWSSDS 294 >gi|260800525|ref|XP_002595179.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] gi|229280423|gb|EEN51191.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] Length = 284 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 57/185 (30%), Gaps = 20/185 (10%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ L + V + ++ Y +V + D + T Sbjct: 20 MKTFIRQLFAYLDIGENAVRVSIVQYAAQVRTEFFLDQYYDLQEAQDAVDGIEYMGGFTL 79 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ A + +K K + +TDG + + + Sbjct: 80 TGKAIDFATNLHFDLRKG-------------ARADVTKIAVVITDGRSYDDVNRP----- 121 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + ++ I + + + + + + L P+ ++ D L + ++ ++ K Sbjct: 122 ARRMRQAGIVTIAVGVGNNLD-RDQLTAIAGDPKTLLSLDGFDRLQDLTTSLPTMLCDGK 180 Query: 414 YSVIL 418 +L Sbjct: 181 IRTLL 185 >gi|194223901|ref|XP_001494668.2| PREDICTED: similar to integrin alpha 2 subunit [Equus caballus] Length = 1194 Score = 46.0 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 39/284 (13%), Positives = 75/284 (26%), Gaps = 47/284 (16%) Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 LQ + N + +S L +G + DV+ Sbjct: 108 LNLQTSTSIPNVTEMKTNMSLGL-TLTRNVGTGGFLTCGPLWAQQCGSQYYTTGVCSDVS 166 Query: 216 -------SAPICQDKKRTKMAAL-----------KNALLLFLDSI-DLLSHVKEDVYMGL 256 S + + + A+ FL+ L +GL Sbjct: 167 PDFQLLASFSPAAQTCPSLIDVVVVCDESNSIYPWEAVKNFLEKFVQGLDIGPTKTQVGL 226 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFT 315 I Y + K T++ A++ A S Sbjct: 227 IQYANNPRVVFNLNTFKTKAEMVEATSQTFQFGGDLTNTFKAIQYARDSAYSAAAG---- 282 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 K ++ +TDGE+++ I C+ + I I++ N Sbjct: 283 ---------GRRGATKVMVVVTDGESHDGSMLKAVIDQCN---SDNILRFGIAVLGYLNR 330 Query: 376 QRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLM 409 L +K S + Y +NV + +L+ + + + Sbjct: 331 NALDTKNLIKEIKAIASIPTERYFFNVSDEVALLEKAGTLGEQI 374 >gi|17533679|ref|NP_496742.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876680|emb|CAB03054.1| C. elegans protein F35C5.5b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 387 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/213 (8%), Positives = 60/213 (28%), Gaps = 18/213 (8%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVYM 254 D + A V + + +F I K + Sbjct: 21 TDRNCGHDLANLWLDVVVVVDNSAPMTQEGLTEVAAQIVTVFGAGTRIGTNYIDKRTTRV 80 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ Y T + + D + + + + + Sbjct: 81 GLVTYNTEATIQADLNRFQSP--------DDLFSTVFQILPNDLSTSEDVFLAKGIGAAE 132 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 +++ +I N+ + + I ++ K + + I T++ + + + Sbjct: 133 QLLAAGRKNNTRKNYKQMVIVYASAYNDEGEEDPRP--IAERLKASGVSIATVAFDQTGD 190 Query: 375 GQRLLKTC--VSSPEYHYNVVNAD---SLIHVF 402 + ++K +++P +++ + + + Sbjct: 191 EE-MIKLIGEIATPGFNFTNEDENLVKEIQTAM 222 >gi|17533681|ref|NP_496741.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876681|emb|CAB03055.1| C. elegans protein F35C5.5a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 389 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/213 (8%), Positives = 60/213 (28%), Gaps = 18/213 (8%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVYM 254 D + A V + + +F I K + Sbjct: 21 TDRNCGHDLANLWLDVVVVVDNSAPMTQEGLTEVAAQIVTVFGAGTRIGTNYIDKRTTRV 80 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 GL+ Y T + + D + + + + + Sbjct: 81 GLVTYNTEATIQADLNRFQSP--------DDLFSTVFQILPNDLSTSEDVFLAKGIGAAE 132 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 +++ +I N+ + + I ++ K + + I T++ + + + Sbjct: 133 QLLAAGRKNNTRKNYKQMVIVYASAYNDEGEEDPRP--IAERLKASGVSIATVAFDQTGD 190 Query: 375 GQRLLKTC--VSSPEYHYNVVNAD---SLIHVF 402 + ++K +++P +++ + + + Sbjct: 191 EE-MIKLIGEIATPGFNFTNEDENLVKEIQTAM 222 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 25/195 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 TK+ K+AL L + H + I + ++ + + +R Sbjct: 314 SSASMVGTKLRQTKDALFTILQDLRPEDHFNIIGFSNRIK-VWQHDQLVPVTPN--NIRD 370 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ A++ + +IL ++ IIFLTD Sbjct: 371 AKVYIHNMSPSGGTNINGALQISTKILNDYIAQNDIEA-----------RSVSLIIFLTD 419 Query: 339 GENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKTCVSSPEYHYNV-- 392 G + KI + KE I+ + TI I + + L + + + V Sbjct: 420 GRPTFGEI--EPAKIINNTKEA-IRNKFCLFTIGIGNDVDYKLLERLALENCGMMRRVRE 476 Query: 393 --VNADSLIHVFQNI 405 A+ L + I Sbjct: 477 EEDAAEQLKGFYYEI 491 >gi|226487570|emb|CAX74655.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 837 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 94/315 (29%), Gaps = 55/315 (17%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 F + +S +++ + N+ + +A+ + + + ++ Sbjct: 200 YTITFEAHLWMQSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDPL 259 Query: 153 SYDYRLQFIEH--LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L++ + N +V +P + + + ++D SGSM Sbjct: 260 TSLVSLEYGDSNQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEGDN--- 316 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--E 268 ++ K +LLLFL S+ + +IG+ + Sbjct: 317 -----------------ISYAKTSLLLFLKSL------PVNCRFQIIGFGSNFAALFSEP 353 Query: 269 PSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + + D T++ A+K A Sbjct: 354 TDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL------------------HSSPSGE 395 Query: 328 PFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + K IIFLTDG+ N + DKA+ + TI + + + Sbjct: 396 GWFKQIIFLTDGDVGNADEVIGLVRMNVDKAR-----VFTIGLGQGVSTALIGGVARVGN 450 Query: 387 EYHYNVVNADSLIHV 401 V +A L Sbjct: 451 GTAAYVRDASQLQSA 465 >gi|226487568|emb|CAX74654.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 94/315 (29%), Gaps = 55/315 (17%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 F + +S +++ + N+ + +A+ + + + ++ Sbjct: 195 YTITFEAHLWMQSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDPL 254 Query: 153 SYDYRLQFIEH--LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L++ + N +V +P + + + ++D SGSM Sbjct: 255 TSLVSLEYGDSNQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEGDN--- 311 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--E 268 ++ K +LLLFL S+ + +IG+ + Sbjct: 312 -----------------ISYAKTSLLLFLKSL------PVNCRFQIIGFGSNFAALFSEP 348 Query: 269 PSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + + D T++ A+K A Sbjct: 349 TDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL------------------HSSPSGE 390 Query: 328 PFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + K IIFLTDG+ N + DKA+ + TI + + + Sbjct: 391 GWFKQIIFLTDGDVGNADEVIGLVRMNVDKAR-----VFTIGLGQGVSTALIGGVARVGN 445 Query: 387 EYHYNVVNADSLIHV 401 V +A L Sbjct: 446 GTAAYVRDASQLQSA 460 >gi|226487566|emb|CAX74653.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 94/315 (29%), Gaps = 55/315 (17%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 F + +S +++ + N+ + +A+ + + + ++ Sbjct: 195 YTITFEAHLWMQSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDPL 254 Query: 153 SYDYRLQFIEH--LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + L++ + N +V +P + + + ++D SGSM Sbjct: 255 TSLVSLEYGDSNQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEGDN--- 311 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--E 268 ++ K +LLLFL S+ + +IG+ + Sbjct: 312 -----------------ISYAKTSLLLFLKSL------PVNCRFQIIGFGSNFAALFSEP 348 Query: 269 PSWGTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 + + + + D T++ A+K A Sbjct: 349 TDYSEDSLNAAMNYQKDLNADMGGTEAYNALKSAL------------------HSSPSGE 390 Query: 328 PFQKFIIFLTDGE-NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + K IIFLTDG+ N + DKA+ + TI + + + Sbjct: 391 GWFKQIIFLTDGDVGNADEVIGLVRMNVDKAR-----VFTIGLGQGVSTALIGGVARVGN 445 Query: 387 EYHYNVVNADSLIHV 401 V +A L Sbjct: 446 GTAAYVRDASQLQSA 460 >gi|293341619|ref|XP_001069109.2| PREDICTED: similar to Antxr2 protein [Rattus norvegicus] Length = 463 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 44/151 (29%), Gaps = 24/151 (15%) Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K+ + + + T +K A + + + L II Sbjct: 77 KISKGLEDLKAVQPVGETYIHEGLKLANEQIQNA----------------GGLKTSSIII 120 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV- 393 LTDG + K K++ + + + Q L+ S + + V Sbjct: 121 ALTDG-KLDGLVPSYAEKEAKKSRSLGASVYCVGVLD--FEQAQLERIADSKDQVFPVKG 177 Query: 394 NADSLIHVFQNIS----QLMVHRKYSVILKG 420 +L + +I ++ S + G Sbjct: 178 GFQALKGIINSILAQSCTEILELSPSSVCVG 208 >gi|297270401|ref|XP_002800091.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Macaca mulatta] Length = 3386 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 56/175 (32%), Gaps = 36/175 (20%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD-------MDSLILKPTDS 294 + V + ++ ++++ + + + + S T + Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNYVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYT 169 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A +QA QIL K + +TDG +N I Sbjct: 170 KGAFQQAAQILL-----------------HARENSTKVVFLITDGYSNGGD----PRPIA 208 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 +++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 209 ASLRDSGVEIFTFGIWQGN--IRELNDMASAPKEEHCYLLHSFEE----FEALAR 257 >gi|253569758|ref|ZP_04847167.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840139|gb|EES68221.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 621 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 72/232 (31%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + ++V I + P + ++D+SGSM T+ Sbjct: 229 NADHRLVRIGLKAREIPTDKLPESNLVFLIDVSGSMW-------------------GPTR 269 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + +K++L L ++++ V VY G V+ P +K+R + Sbjct: 270 LDLVKSSLKLLVNNLREKDKVAIVVYAG----NASVKLESTPGSDKQKIRDAIDELTSGG 325 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 ++ AY++ + II +DG+ N S+ Sbjct: 326 ST---AGGAGIQLAYKVAKHNFL----------------PKGNNRIILCSDGDFNVGVSS 366 Query: 348 VN-TIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEYHYNVVNADS 397 V ++ +K +++ + + + + H + N Sbjct: 367 VEGLEQLIEKERKSGVFLSVLGYGMGNYKDNKGQALAEKGNGNHAYIDNLQE 418 >gi|220905850|ref|YP_002481161.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219862461|gb|ACL42800.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 236 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 56/177 (31%), Gaps = 22/177 (12%) Query: 224 KRTKMAALKNALLLFLDSIDLL--SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + K+ +L A+ + + + V + I ++ E +I ++ R Sbjct: 34 SQGKIQSLNAAIKETIPMLQQTAADNPNAQVLVRAIKFSDGAEWHIPTPTPVDQFR---- 89 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D TD A++ + L + + ++ ++DG+ Sbjct: 90 -WTDLTAGGVTDLGMALEMVAEQL--------------RVPPMSERALPPVLVLISDGQP 134 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + + +++ I++ N + L K S N D L Sbjct: 135 TDDFGSGLKALMAQPWGQKAVRV-AIAVGQDANHEVLQKFIGPSELRVLQANNPDQL 190 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTMGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTMGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTMGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|299134236|ref|ZP_07027429.1| TadE family protein [Afipia sp. 1NLS2] gi|298590983|gb|EFI51185.1| TadE family protein [Afipia sp. 1NLS2] Length = 145 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 14/124 (11%), Positives = 42/124 (33%), Gaps = 6/124 (4%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F+ + ++ FAL + F+LL+ ++ + + ++ LA + S Sbjct: 8 FRDFRRACSGASAVEFALVLPVFMLLVFGIVMFGAYLAMVHDVQQ------LAAEAARTS 61 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 +S++ + + H+ + + A + ++S + Sbjct: 62 VAGLNETERKSLATNYVTQNAASYPLIVPAHLSVNAATSGADPNVFIVTVNYDASNTFIY 121 Query: 131 HMAN 134 + + Sbjct: 122 SLPS 125 >gi|224092755|ref|XP_002190101.1| PREDICTED: inter-alpha (globulin) inhibitor H2 [Taeniopygia guttata] Length = 948 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 58/223 (26%), Gaps = 44/223 (19%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK-E 250 I V+D+SGSM KM A+ L + Sbjct: 312 NILFVIDVSGSMW--------------------GLKMKQTIEAMKAILSELRAADQFSLI 351 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D + + + +V T+ A+ +A IL K Sbjct: 352 DFNHNVRCWRDNLVSATP-----SQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAK- 405 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTIS 368 + + I+ ++DG+ K + K+ + + Sbjct: 406 ----------SLGMLDPNSVSMIVLVSDGDPTVGELKLTTIQKNVKQSIKDE-FSLFCLG 454 Query: 369 INASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 I + L + + + + + + + + +S Sbjct: 455 IGFDVDYDFLQRIATDNRGMAHRIFGNQETSLQMKNFYNQVST 497 >gi|111022920|ref|YP_705892.1| hypothetical protein RHA1_ro05957 [Rhodococcus jostii RHA1] gi|110822450|gb|ABG97734.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 548 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 58/211 (27%), Gaps = 26/211 (12%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +D+SGSM + + ++ L F + ++ Sbjct: 350 AIDVSGSMETPAGNRSRMDLTVDAALAGNAMFPDSVSAGLWAFSQGLGGGPQDYKE---- 405 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ + ++ +D LI T A++ + Sbjct: 406 MVPIRRYDTVVDGLTQ--RQLMAQQAGKVDGLIGGGTGLYDTTLAAFRTVQETY------ 457 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINA 371 +I LTDG N + S + +E + IVTI I Sbjct: 458 ----------DPRAVNSVIILTDGANEDPDSITKEQLLSILQRETDPARPVIIVTIGITG 507 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + + Y + + +VF Sbjct: 508 DADAATLAEISRVTGGSSYVAKDPADIANVF 538 >gi|47224105|emb|CAG13025.1| unnamed protein product [Tetraodon nigroviridis] Length = 983 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/170 (10%), Positives = 55/170 (32%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ +V + + T + ++ + + Sbjct: 96 LIRTSVTEMLETLSDDDYVNVVYFNTQVKKTACFDHLVQ--ANVRNKKLLKDAVQNITAK 153 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ T ++ A++ L+ K I+ TDG ++ + Sbjct: 154 GITNYTKGLEFAFEQLSVTNVSRANC--------------NKIIMLFTDGGEERAQAILE 199 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 + ++I T S+ + + + S+ Y Y + + ++ Sbjct: 200 KYNA-----DKKVRIFTFSVGQHNYDKGPIQWMACSNKGYFYEIPSIGAI 244 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 77/244 (31%), Gaps = 43/244 (17%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 GS + S DV +K+ + + KE Sbjct: 184 TDYGGNWRKEGSRNTECPSSGVDVLFVLDGSGSVGKNFDKVKDWVKNITAKL---DIGKE 240 Query: 251 DVYMGLIGYTTRV---------EKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMK 299 V +G++ Y+ V E S G K+ +D + L+ T + A++ Sbjct: 241 IVRVGVVQYSHYVEGKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQ 300 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + + + ++ ++ LTDG+ + K + ++ + Sbjct: 301 KVIRDFDDAYIGN-----------------KQVLLLLTDGQAKDNKLILPN---ANRLRN 340 Query: 360 NFIKIVTISINASPNGQRLLKTCVS---SPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T ++ LK S S + + V + L + +++ + S Sbjct: 341 KG--IATFAVGVGEYDISELKLIASGTDSTDRVFTVTDFGELDSIVKSLQTEIQ----SF 394 Query: 417 ILKG 420 +L+G Sbjct: 395 VLEG 398 >gi|296268733|ref|YP_003651365.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] gi|296091520|gb|ADG87472.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] Length = 587 Score = 46.0 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 34/91 (37%), Gaps = 8/91 (8%) Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-----ENFIKIVTISINASPNGQRLLKT 381 ++ LT G+++ + ++ + + + ++IV + A + L + Sbjct: 493 PMMSNALLVLTYGQDDGRGISR--QELAEALRKEWNPDRPVQIVVVMFGAGRDRAALEEA 550 Query: 382 CVSSPEYHYNVVNADSLIHVF-QNISQLMVH 411 + Y +I VF I++ + H Sbjct: 551 AAITNGEVYVARQPGEIIDVFLSAIARRLCH 581 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 36/376 (9%), Positives = 101/376 (26%), Gaps = 45/376 (11%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 L + + + A + S S++S L + + Sbjct: 125 LSAGGVQGMIAMDTTAAAPQALSRNAQPLGSVAASSMSDLAIVAPQPTTKQYPNAEQNPL 184 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + + + S +++ ++ +++ + + + Y Sbjct: 185 KIASDEPVSTFSIDVDAASYAVVRSSLSRAQLPPSGAVRIEEMINYFSYDYLATDAGDVP 244 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 +R + +++ ++ E P + ++D SGSM+ Sbjct: 245 FRATTSVFETPWNADTQLLHIGIQGTLPDVSEHPPLNLVFLIDTSGSMNQPD-------- 296 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 K+ L +A L L + + +I Y + + P+ +++ Sbjct: 297 -----------KLPLLISAFRLMLSELRPED------EVSIITYAGSAGQVLAPTPASDR 339 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + T ++QAY I + + +I Sbjct: 340 A-TILAALNRLSAGGSTAGQAGLRQAYAIAAAMSEDGEIA----------------RVIL 382 Query: 336 LTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 TDG+ N +++ + + ++++ + Sbjct: 383 ATDGDFNVGLNDPDTLKDYITTRRDSGTYLSVLGFGRGNLNDAVMQSLAQNGNG--IAAY 440 Query: 395 ADSLIHVFQNISQLMV 410 D+L + + + Sbjct: 441 IDTLSEAQKVLVDQLS 456 >gi|119476218|ref|ZP_01616569.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] gi|119450082|gb|EAW31317.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] Length = 750 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 51/158 (32%), Gaps = 27/158 (17%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRS 312 G+ + V + + T R + + + T+ A+++A Sbjct: 79 GVWTFGQWVNNLVPSNTVTTAWRANASAQAEKISSVALRTNIPAALEKAM---------- 128 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA--------KENFIKI 364 VK + +I LTDG + S+++ + K + I Sbjct: 129 -------ADVKSKGTDYSIHLILLTDGMVDVSLSSIDNETARQRIVAEILPALKGAGVTI 181 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 T++++ + + + + + A+ L VF Sbjct: 182 HTVALSQNADLELMELLAAETGGLSAVAETAEDLSRVF 219 >gi|114800018|ref|YP_760837.1| hypothetical protein HNE_2140 [Hyphomonas neptunium ATCC 15444] gi|114740192|gb|ABI78317.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 583 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/188 (10%), Positives = 47/188 (25%), Gaps = 26/188 (13%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 TK+ ++ + + + + + + G + + G + +R Sbjct: 58 GGTTKIEVARDVMSEYFRTRNAAEPLG-VIAYGHRRRGDCADIEVIAETGVQNAATLSSR 116 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T T A++ A + + +R+ II +TDG Sbjct: 117 VNQIRPNGMTPLTDALRMAQKQIPKTAERAD-------------------IILVTDGLET 157 Query: 343 NFKSNVNTIKICDKAKENFIKI--VTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLI 399 I+I + + L ++ L Sbjct: 158 CKADPCALAAELAA---EGIEIRAHVVGFGLTEQEAASLSCIPEATGGLLLRPQTGQELS 214 Query: 400 HVFQNISQ 407 I++ Sbjct: 215 DALGQIAE 222 >gi|226947175|ref|YP_002802248.1| hypothetical protein Avin_51980 [Azotobacter vinelandii DJ] gi|226722102|gb|ACO81273.1| hypothetical protein Avin_51980 [Azotobacter vinelandii DJ] Length = 581 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 24/260 (9%), Positives = 67/260 (25%), Gaps = 47/260 (18%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + + + + I + Sbjct: 360 APELAIVFDASGSM------DISLLATKQEIDRAVMVQGMTDI----------AARVLLG 403 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 ++ ++ A K A + + V + + + Sbjct: 404 GNPGIDTTGHLYREPKRITAAKQATTAVVQRL-PSDVSAGLVLIERCPASRSMGFFTPVQ 462 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 G + R ++ T + +A Q+L + + Sbjct: 463 RGG-----LLARLNSIQPVEGTPLADGVAKAGQMLDGVNR-------------------E 498 Query: 331 KFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 ++ ++DGE + + + ++ +KI + I + G L ++ Sbjct: 499 SVMVVVSDGEESCHQDPCAVARDLAR--RKPHLKINVVDIAGTGAGNCL---AQATGGRV 553 Query: 390 YNVVNADSLIHVFQNISQLM 409 +N NA L + + +Q + Sbjct: 554 FNARNAGELAAMTRQAAQDV 573 >gi|153836830|ref|ZP_01989497.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|260361572|ref|ZP_05774599.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|260876729|ref|ZP_05889084.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|260900400|ref|ZP_05908795.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308095540|ref|ZP_07663324.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|149749976|gb|EDM60721.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|308089064|gb|EFO38759.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091406|gb|EFO41101.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109136|gb|EFO46676.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308113987|gb|EFO51527.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 154 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 34/154 (22%), Gaps = 6/154 (3%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L +F+ ++ ++ I F L++ L+L +I V + + A Sbjct: 2 LYLKFKTQMRR-GRHQEGLAIIEFILALPVLLMLTVLVIDVCRAFIQYTEVNKALQNGAR 60 Query: 63 AGASKMVSNLS-----RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 L + + A +I T Sbjct: 61 YAVVDTYGTLDFEGIADETKIKNVVVYGSPTASTTPVIDYIGVDDIVITQPTGTNKVVTL 120 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 S +N L S Sbjct: 121 SATYNYVPIFSTLPFSNTSLQFSIGATTSMRTGP 154 >gi|332227198|ref|XP_003262778.1| PREDICTED: vitrin isoform 2 [Nomascus leucogenys] Length = 679 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/333 (8%), Positives = 81/333 (24%), Gaps = 47/333 (14%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN--NRLDSSNNTIFYNMDVMTSYDYRLQ 159 + + + + + I + + + Sbjct: 175 PGTTAQPVTLMQLLAVTVAVATPTTILPRPFPSAASTTSIPRPEPVGHRSQEMDLWSTTT 234 Query: 160 FIEHLL--NQRYNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + A + +G + + L+ + S+ DP Sbjct: 235 YTSSQNRPGADPGIQRQDPSGAAFQKPVGVDVSLGLVPKEELSTQSLEPVSLGDPNCKID 294 Query: 217 APICQDKKRT------KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + ++ L D L MG++ Y + Sbjct: 295 LSFLIDGSTSIGKRRFRIQK-----QLLADVAQALDIGPAGPLMGVVQYGDNPATHFNLK 349 Query: 271 W--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + ++ + + ++ A+ + S + Sbjct: 350 THMNSRDLKTAIEKITQR--GGLSNVGRAISFVTKNFFSKANGN-------------RSG 394 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL--------- 379 ++ + DG + + A+E+ I I I+I + ++ Sbjct: 395 APNVVVVMVDGWPTD-----KVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANK 449 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 C ++ Y ++V + L Q + + + Sbjct: 450 AVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 482 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 64/186 (34%), Gaps = 22/186 (11%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLIL 289 L A+ L ++ +K ++++ I + + + + D Sbjct: 30 LNTAIRDMLGTLKEQESLKAEIHISFITFGNGGANLHTALTPVSN------IEFNDFTEG 83 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T A++ A +++ + + IPS + I+ L+DG N+ Sbjct: 84 GMTPLGGALRIAKEMVENREI-------------IPSKSYAPIILLLSDGAPNDNGWENE 130 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + + +++ I + L SS Y ++ ++I F+ ++ + Sbjct: 131 MYRFINDGRSKKCMRMSLGIGRDYDYDVLKGF--SSNGEVYEAKDSMNIIDFFKFMTMTI 188 Query: 410 VHRKYS 415 + S Sbjct: 189 KEKTLS 194 >gi|299138555|ref|ZP_07031734.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599801|gb|EFI55960.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 381 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 17/173 (9%), Positives = 56/173 (32%), Gaps = 33/173 (19%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+ +++ + + + K++Q + T A+ + L+ + Sbjct: 180 LVDFSSEAYIDQDFTNSIAKLQQGLAYI---HTSGGTALYDAVVASADYLSKNA------ 230 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-NFIKIVTIS--INAS 372 ++ ++ +TDGE+N ++ + ++ + I I Sbjct: 231 -----------KHPKQVLLIVTDGEDNASSAS--LESAIRRVQDLDGPAIYCIGLLFGDD 277 Query: 373 -------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 + L + + Y + + + + ++Q + + Y++ Sbjct: 278 VSRSEAKHAREVLQELAQQTGGQAYFPKSLKDVDGLTREVAQDIRTQ-YTIEY 329 >gi|269126104|ref|YP_003299474.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311062|gb|ACY97436.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 583 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 32/221 (14%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + +VD+SGSM + RT+M + AL +D + Sbjct: 388 RKRANVLFLVDVSGSMAEPL-------------PGTGRTRMQQAQRALRQAVDDFVAVDR 434 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V + +G + + + + T + A++ + Sbjct: 435 VGLWEFSTDLGGGRDYRPLVPIRPMSTPGHRERLREQIAALRPRGDTGLYDSTLAAFRHV 494 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKI 364 + ++ LTDG N++ ++ ++ + + ++I Sbjct: 495 ----------------RAVRQDGAINSVVVLTDGRNDDPGGGLSLEELLKELDGADGVRI 538 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 TI+ A +G L K ++ Y+ + +L V + Sbjct: 539 FTIAYGAGADGGALKKISEATDAAAYDSRDPATLDKVLTQV 579 >gi|149184583|ref|ZP_01862901.1| hypothetical protein ED21_27733 [Erythrobacter sp. SD-21] gi|148831903|gb|EDL50336.1| hypothetical protein ED21_27733 [Erythrobacter sp. SD-21] Length = 144 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 14/118 (11%), Positives = 30/118 (25%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 F + E+ I FAL L L LI + NS+ + + + Sbjct: 5 LFHRLRRDERGTAIIEFALLAPVILGLFFGLIQIGISMQAYNSLRGVASDTARYAVVEYM 64 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + E+ + + I + + + + Sbjct: 65 NGDTIDDTTIENRAKAIATGAPYLLNNSVTATITPVATPRVHGTHEKTLLISYTPPDV 122 >gi|110798896|ref|YP_694951.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens ATCC 13124] gi|110673543|gb|ABG82530.1| von Willebrand factor type A domain protein [Clostridium perfringens ATCC 13124] Length = 580 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 71/260 (27%), Gaps = 56/260 (21%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 I +V D SGSM M + + + + +L+ F + Sbjct: 72 SIVNPVDIIMVADKSGSMEYEMPTLKRAMKNFLDDIESSFG--DRANISLIEFSGE-NKT 128 Query: 246 SHVKEDVYMGLIGYT--------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 + + G Y + + + V+ + + TD A Sbjct: 129 YIGRYCDFNGFNCYEKYFIKGSIDDAKVLCDYTSEYSTVKSKIDKIT---AYGRTDIEAA 185 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN---------NNFKSNV 348 ++ + L + +K+++F TDG + Sbjct: 186 LELVKKKLDDR-----------------NSNNKKYVVFFTDGLPIQLLNIETREYPSLDY 228 Query: 349 NTIKICDKAKE----------NFIKIVTISINASPNGQRLLKTCVS------SPEYHYNV 392 I KE N + +I + K + ++ Sbjct: 229 IEKYIIPHTKEYFYEKGFLDKNKVNFYSIGLFTGRRFDSEKKIAKDFIKSINNSGSYFIT 288 Query: 393 VNADSLIHVFQNISQLMVHR 412 +++ L V+ +I+ +++ Sbjct: 289 DDSNRLDSVYNDIAMNIINE 308 >gi|254460998|ref|ZP_05074414.1| von Willebrand factor, type A [Rhodobacterales bacterium HTCC2083] gi|206677587|gb|EDZ42074.1| von Willebrand factor, type A [Rhodobacteraceae bacterium HTCC2083] Length = 627 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 41/381 (10%), Positives = 106/381 (27%), Gaps = 70/381 (18%) Query: 57 NNAAILAGASKMV--------SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + A+ A AS++ + + S + + E+ Sbjct: 123 ADMAVGAIASELAPSSVPVLRMSAKSQSRTLVAGSIADASVVAVEVDTEAFASASENTLK 182 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY--------RLQF 160 + + V+++ ++ + +++ + Y Sbjct: 183 VTRETSVSTFSVDVDTASYAIVRSSLMNGQLPPAAAVRIEEMVNYFPYDYAAPTQGVFAT 242 Query: 161 IEHLLNQ--RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 ++ N +++ E+ RP + ++D SGSM+ Sbjct: 243 SLAMMATPWNENTQLLRIALQGQMPEVEARPPLNLVFLIDTSGSMN-------------- 288 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + K+ LK +L L L + + ++ Y + +EP+ E+ Sbjct: 289 -----QPNKLPLLKQSLRLLLGQLRAED------QVAIVAYAGSAGQVLEPTAAHERD-T 336 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + T+ ++QAY + ++ ++ TD Sbjct: 337 ILGALDNLAAGGETNGQAGLQQAYALAKQMQEEGEV----------------SRVLLATD 380 Query: 339 GENNNFKSNVN-TIKICDKAKENFIKIVTISINASP-NGQRLLKTCVSSPEYHYNVVN-- 394 G+ N SN +++ + + + + + + + N Sbjct: 381 GDFNVGLSNAEALKGYIADKRDSGTFLSVLGFGRGNLDDETMQALAQNGNGQANYIDNLP 440 Query: 395 ------ADSLIHVFQNISQLM 409 D L I+ + Sbjct: 441 EAQKVLVDQLTGALFPIASDV 461 >gi|300704937|ref|YP_003746540.1| hypothetical protein RCFBP_20766 [Ralstonia solanacearum CFBP2957] gi|299072601|emb|CBJ43951.1| conserved exported protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 340 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 26/272 (9%), Positives = 69/272 (25%), Gaps = 22/272 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ ++ + ++ FL + I + +N +++A A+ + Sbjct: 6 RRQRGAVGVLSPILLIIFLAIGAMAIDIAHLFVVRNELQNAA-----DAAALAGAAGLYP 60 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + SN + N + + + T + S I+ Sbjct: 61 ANPKPNWSNGVAQGTSAIKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVA 120 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE-----MGERPI 190 + + N S F + + V P + + Sbjct: 121 GVQVTVTRSPGNNGGPVSGWLTWVFNGGAASIQATAVAVIAAPGSANPGSLFPVALNKCL 180 Query: 191 FLIELVV-------DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL--LLFLDS 241 F + D S ++ + S C + T +A + S Sbjct: 181 FDLYWNYTTGQPLNDPSTGQPYVIDINTSYPPSGGTCASGEWTGFDGPTDASTEKNLVSS 240 Query: 242 IDLLSHV---KEDVYMGLIGYTTRVEKNIEPS 270 + + ++ G+ + + Sbjct: 241 GNPTNISIGENINISTGVKTSVYNAIPPLPLT 272 >gi|260800509|ref|XP_002595172.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] gi|229280415|gb|EEN51183.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] Length = 299 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 49/170 (28%), Gaps = 19/170 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I E +G++ Y+ R I ++ + T + A+ Sbjct: 46 ISDFELGPEATRIGVVVYSNRASLEISLDAFEDQESLQDAVAGIAYPGGYTLTGAAIDYT 105 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 S + +K + LTDG + + + ++ Sbjct: 106 TTFAFSTRNG-------------AREGVRKVAVILTDGVSYDD-----PAEPAQSMRKAA 147 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I + I ++ + + L P+ + + + L ++ + + Sbjct: 148 IITYAVGIGSNLDRDQ-LDVIAGVPDNLFVLDDFSMLDNLRTTLPTQVCD 196 >gi|332796610|ref|YP_004458110.1| von Willebrand factor type A [Acidianus hospitalis W1] gi|332694345|gb|AEE93812.1| von Willebrand factor type A [Acidianus hospitalis W1] Length = 381 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 59/212 (27%), Gaps = 40/212 (18%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 I L+ + ++ H + D + K+ K+ + I Sbjct: 24 KILLVPEKISVATGFHYIILLDTS--------GSMEGLKIEKAKSGAIELFKRI------ 69 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + I + + VE E S + + T A+ A++I Sbjct: 70 PQGNKVTFIKFASTVEVVREFSDPLD----LTNEIQNIAANGQTSLFTALLTAFKI---- 121 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 +I+ LTDG + K+ ++I++ Sbjct: 122 ---------------AVKYSMPAYILLLTDGNPTDVTDTRTYEKMSIP---QGVQIISFG 163 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I N + L + Y++ A + Sbjct: 164 IGDDYNEELLKILSDRTGSTFYHIQEASEIPE 195 >gi|327260894|ref|XP_003215268.1| PREDICTED: collagen alpha-3(VI) chain-like [Anolis carolinensis] Length = 3053 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 47/173 (27%), Gaps = 19/173 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 + + ID + + + +GL+ Y + V T+K + + Sbjct: 1647 DNFQEVVDFVYGIIDAIYEEGDSIKVGLVQYNSDVSDEFFLKDFTDKEQILEAVKRIAYK 1706 Query: 289 LKPT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T ++ A+K +F ++ + +T G + Sbjct: 1707 GGRTANTGTAIKH-----------IKAKHFVKEAGSRVDQKVPQIAFIITGGRPEDDG-- 1753 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + +K+ + + G + S+ + A L Sbjct: 1754 ---QTAALALAQQGVKVFAVGVRNIDLGDIAKLSSDSTTG--FRAATAQELSE 1801 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 57/195 (29%), Gaps = 20/195 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 D R+ LK+ + +DS+ V + ++ Y+ + ++K Sbjct: 1034 IDGSDATRSSFPELKSFVQRVVDSL---DVGPGKVRVAVVQYSNDANTEFNLNEYSDKA- 1089 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 +T + A L F R+ +F+I LT Sbjct: 1090 SVITAVQRMTAMGGYAVNTG--AALNYL-------ISNVFTREAGSRVQEGVPQFVILLT 1140 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 +S + + + K + I L+T PE+ V Sbjct: 1141 -----AERSRDDVRRPALELKTRGAV--PLGIGFGNADITQLQTISFVPEFAVFVSGVSE 1193 Query: 398 LIHVFQNISQLMVHR 412 L + Q I++ + Sbjct: 1194 LGRIQQLIAERVTRL 1208 >gi|194679013|ref|XP_608702.4| PREDICTED: matrilin 4-like [Bos taurus] gi|297490994|ref|XP_002698558.1| PREDICTED: matrilin 4-like [Bos taurus] gi|296472628|gb|DAA14743.1| matrilin 4-like [Bos taurus] Length = 788 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 43/129 (33%), Gaps = 24/129 (18%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQ 300 D L V +G + +++ + + V+ + R + T+++ A+K Sbjct: 79 DALDINPRKVRVGAVQFSSVPRLEFPLDAFSTQQGVKGKIKRMV--FKGGHTETSLALKY 136 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + + +I +TDG + + + K+ Sbjct: 137 LLRKGFPGGR---------------NASVPQVLIIITDGRSQGHVALP-----AKQLKQR 176 Query: 361 FIKIVTISI 369 I + ++ I Sbjct: 177 GITVFSVGI 185 >gi|156546182|ref|XP_001603829.1| PREDICTED: similar to voltage-gated calcium channel alpha2-delta subunit 1 [Nasonia vitripennis] Length = 1179 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 43/370 (11%), Positives = 96/370 (25%), Gaps = 28/370 (7%) Query: 47 HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI-KNHIKES 105 + A ++ + + + + L A+ + Sbjct: 89 MNAVMKLMENAEQAAVSASREGAVSPPKYYASHRINLYGDDGKLSSGAREVNLTANRHFD 148 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF-YNMDVMTSYDYRLQFIEHL 164 + + V + + + Y D S+ Y L Sbjct: 149 HLAVNLSLSAVLMPPGVKDTESDVASGLQWSEYLDPLFVNNYESDPTLSWQYFGSTSGFL 208 Query: 165 L------NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN-SDPEDVNSA 217 N P + + V D S + P+D+ Sbjct: 209 RRFPEKEYNLANLTPAISWPPVDASPSQSGKPVRVRDVDDFRLSEWFVGGATSPKDLAIL 268 Query: 218 PICQDKKRTKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 +K+ + LDS+ V Y + K+ + E V Sbjct: 269 VDGTSLSSSKIRKLVVATTNAVLDSLGPNDFVNVYRYAESAEEIVKCFKDSMLAASPENV 328 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + P++ + AM A++IL + S I Sbjct: 329 KELKNAMNNLKSESPSNISAAMSTAFEILHKYNRSS---------QGSQCNQAIMLITSR 379 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 ++G + N+ + ++I T + + Q L ++ ++ + N + Sbjct: 380 SEGPPVDLIKRYNSPHM-------PVRIFTYLVG-GDDSQELHDMSCNNKGFYARITNME 431 Query: 397 SLI-HVFQNI 405 + VF+ I Sbjct: 432 EIKSKVFEYI 441 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 54/154 (35%), Gaps = 21/154 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT-DSTPAMKQAYQILTS 307 ++ + ++ + + V+ P G+E+ + + ++ A +L Sbjct: 445 RDFDEITVLPFDSAVQNQYGPVAGSEREVAQGEIIARGVTGGGGINVHDSLVAAGNVL-- 502 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + II L DG ++ + N +++ D+ + I TI Sbjct: 503 ----------------KGRNAPIRHIILLADGSDSQQQEN--AVRLTDEHRRLGITTSTI 544 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +I + L V+ H+ V +A SL + Sbjct: 545 AIGNGGDVGFLNNVAVAGGGRHFLVEDALSLPDI 578 >gi|319956033|ref|YP_004167296.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418437|gb|ADV45547.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 335 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/231 (12%), Positives = 69/231 (29%), Gaps = 58/231 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + +D+ ++ K L FLD++ +GL+ + + Sbjct: 93 DISGSMRSKDRYPNRLEFAKVKLKQFLDAL-------PGDQVGLMAFARNAFVLAPFTGD 145 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T ++Q V + + A + + ++ + +K Sbjct: 146 TATLKQIVDGVNE----------DYINMAATDF---------VSLAEEAARLLAKKKEKI 186 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-------SPNGQRLL------ 379 ++ TDG + + KE I + + + G+ + Sbjct: 187 LVVFTDGGD-----PQALKGFKEALKERGITLYAVLVGTKKGAPVLDRRGRPMTKRDGTI 241 Query: 380 ----------KTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 K + + N + +++ + HRK+S +G Sbjct: 242 AITQLNEALGKIARETGGDYVIAGNGRE---DMRRLAEEI-HRKFSSEKQG 288 >gi|313245449|emb|CBY40179.1| unnamed protein product [Oikopleura dioica] Length = 377 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 46/172 (26%), Gaps = 21/172 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ED+ + Y+ S + + ++ + + ++ + + A K Sbjct: 185 GNFGSEDLQFCVYQYSESAANVFSLSESADFDSLDLKTALEQGIEIPEDRGANIGAAFKT 244 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 F + +I LTD N + Sbjct: 245 -----------IHDEGFNAINGWRKNDQIPSVLIVLTDNLNTVDFYDDL-----QYVHNK 288 Query: 361 FIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 ++V + I + L S E + V + L +V + + Sbjct: 289 AYRVVAVGIGENVENSSLSSIASLPSDENVFTVRDFTELSNVVDEVGYDICQ 340 >gi|94970371|ref|YP_592419.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94552421|gb|ABF42345.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 356 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 47/127 (37%), Gaps = 11/127 (8%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI-----PSLPFQKFIIFLTDGENNNFKS 346 TD T + + ++L + + + + +K +I L+DGE+N + Sbjct: 188 TDDTDLLGKGVRMLRPGGGTAMYDAIYYACRDKLLKENGNTAMRKAMILLSDGEDNQSRV 247 Query: 347 NVNTIKICDKAKENFIKIVTISINASP---NGQRLLKT-CVSSPEYHYNVVNADSLIHVF 402 + + A+ + I IS N S G ++L+ ++ + + + F Sbjct: 248 TR--EEAVEMAQRAEVIIYAISTNTSGLKLRGDKVLERFAEATGGRAFFPFKISDVANAF 305 Query: 403 QNISQLM 409 I + Sbjct: 306 SEIQDEL 312 >gi|196231437|ref|ZP_03130295.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224290|gb|EDY18802.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 725 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 26/231 (11%), Positives = 63/231 (27%), Gaps = 63/231 (27%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + D +++ K A + + +GLI + + Sbjct: 96 DCSRSMLSTDLAPSRLGRAKLATQDLISQLT-------GDRVGLIAFAGTAFLQAPLTID 148 Query: 273 TEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V V+ ++ T+ A+ +A + + + Sbjct: 149 YGAVLDSVSELDTNIIPRGGTNIAAAITEADAAFGKGESDN------------------R 190 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 +I TDGE + D + I T+ + ++ Sbjct: 191 CLIIFTDGEELESDAVAAAAAEKDHMR-----IFTVGLGSADGSLIPVPGENGGTEFVKD 245 Query: 374 ----------NGQRLLKTCVSSPEYHYNVVN-ADSLIHVFQNISQLMVHRK 413 + RL K S+ ++ ++ + + + I + K Sbjct: 246 ENGQNVKSKLDEDRLRKIAESTGGFYVHLQSGPAEMKQI---IDDGLGKMK 293 >gi|115453449|ref|NP_001050325.1| Os03g0403600 [Oryza sativa Japonica Group] gi|113548796|dbj|BAF12239.1| Os03g0403600 [Oryza sativa Japonica Group] Length = 416 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 14/134 (10%), Positives = 37/134 (27%), Gaps = 25/134 (18%) Query: 274 EKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + R + +D L T+ ++ + + + Sbjct: 3 DAARADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAV---------------N 47 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYN 391 ++ ++DG+ N + + + T + A + L S + Sbjct: 48 VMLMSDGQQNRGDATRLDPG--------GVPVHTFGLGADHDPAVLQAIAGKSREGMFHY 99 Query: 392 VVNADSLIHVFQNI 405 V + +L F + Sbjct: 100 VADGVNLTAPFSQL 113 >gi|315652227|ref|ZP_07905222.1| hypothetical protein HMPREF0381_2216 [Eubacterium saburreum DSM 3986] gi|315485533|gb|EFU75920.1| hypothetical protein HMPREF0381_2216 [Eubacterium saburreum DSM 3986] Length = 1268 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 16/151 (10%), Positives = 39/151 (25%), Gaps = 9/151 (5%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ + +I +L + ++ LI +N + A + A+ AG + S L Sbjct: 6 NKHGSLTIFMSLILTMVMIFTTVLIDGGRIILARNVVSGAGDMALNAGLTYYNSVLQDTY 65 Query: 77 DRFESISNHAKRALIDDAK------RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 F N + + + I + ++ + Sbjct: 66 GLFAVSKNMDDLKENLEVYFEATLKSSGLHDQGLVKELVDIALSGSNGDEISDIMKMKLA 125 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 S + L ++ Sbjct: 126 DGGFE---VSQAAGANLSNANILRAQVLDYM 153 >gi|47215752|emb|CAG05763.1| unnamed protein product [Tetraodon nigroviridis] Length = 565 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 40/128 (31%), Gaps = 14/128 (10%) Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + T + + + + + T + K I +TDG + Sbjct: 136 ALASRMKGAATWISDDLVEFVRFFLKAHIPTASTPQQIL--RTSRTNATKVIFLITDGYS 193 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 N + +E ++I T+ I + L S P+ + Y V N Sbjct: 194 NGGD----PRPVAAALRERGVEIFTLGIWQGN--IKELHDMASEPKEQHCYLVPNFAE-- 245 Query: 400 HVFQNISQ 407 F+ +++ Sbjct: 246 --FEALAR 251 >gi|13476804|ref|NP_108373.1| hypothetical protein mll8236 [Mesorhizobium loti MAFF303099] gi|14027565|dbj|BAB53834.1| mll8236 [Mesorhizobium loti MAFF303099] Length = 379 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 20/243 (8%), Positives = 56/243 (23%), Gaps = 7/243 (2%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 ++ I + ++++ +AA LA +K + +++ ++ A + D + Sbjct: 1 MVAGAIDLTGTSDDAAQLQNSLDAAGLAIGTKYLPSMAASDVAALGLTFFAANLSLADQQ 60 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 + S + + + + A S Sbjct: 61 ENADSVSAFSATASGDPSAYYISLSSSINRPSFINGAAPWPAHRSATVKMNPGAQAC--V 118 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS-----D 210 L V+ ++ +S + Sbjct: 119 LALDPHASAAVSLQGSTDVAMSSCVIAANSDASDAVRRGGSAQISAGCVSTVGGTYGLSP 178 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P + + + L + + + + K + NI + Sbjct: 179 PSANLACGAPLEHQYASFDPLADVVPPAYTLCLPVPNGKNYTLSPGTYCDKTLSGNITLN 238 Query: 271 WGT 273 G Sbjct: 239 PGV 241 >gi|189347764|ref|YP_001944293.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341911|gb|ACD91314.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 340 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 26/230 (11%), Positives = 72/230 (31%), Gaps = 49/230 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D ++ A ++A L S+ L+ + ++ Sbjct: 96 DVSNSMLARDVLPDRLEAARDAALRIGGSVREG-------RRALLLFAGSPLVQCPLTYD 148 Query: 273 TEKVRQYV-TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + + + T PA++ A ++ T + + Sbjct: 149 REAFSALLGMATPALIEEQGTSFLPAVELALKLFTGSVPLDSDGTAEGE----------R 198 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA--SPN--------------- 374 ++ L+DGE++ + K + N + + + + + Sbjct: 199 IVVLLSDGEDHEGATAAAAA----KLRRNGVSLFVLGFGSRNGADIPDPLRPGLKKLDGA 254 Query: 375 ---------GQRLLKTCVSSPEYHYNVVNADSL-IHVFQNISQLMVHRKY 414 Q L + +S +++ D++ V I +++ + Sbjct: 255 GRVVTTRFSPQTLRQLASASGGFYFEGAGGDAVYDEVALRIDRIVSRSRL 304 >gi|328474273|gb|EGF45078.1| hypothetical protein VP10329_16240 [Vibrio parahaemolyticus 10329] Length = 154 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 17/154 (11%), Positives = 34/154 (22%), Gaps = 6/154 (3%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L +F+ ++ ++ I F L++ L+L +I V + + A Sbjct: 2 LYLKFKTQMRR-GRHQEGLAIIEFILALPVLLMLTVLVIDVCRAFIQYTEVNKALQNGAR 60 Query: 63 AGASKMVSNLS-----RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 L + + A +I T Sbjct: 61 YAVVDTYGTLDFEGIADETKIKNVVVYGSPTASTTPVIDYIGVDDIVITQPTGTNKVVTL 120 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 S ++ L S Sbjct: 121 SATYNYVPIFSTLPFSSTSLQFSIGATTSMRTGP 154 >gi|198424466|ref|XP_002124191.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 47/361 (13%), Positives = 98/361 (27%), Gaps = 47/361 (13%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LAGA K + ++ + + + + K +V NT Sbjct: 65 LAGAPKFSGLYNVGSTGDGAVIKCSAN-VQPFSSTTKAGNTKWMTDHGCSVVLNTNAVPT 123 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY---DYRLQ----FIEHLLNQRYNQKIV 174 ++ S + + N RL + L + Sbjct: 124 GDNQNKSADAIGMTMIPLGGNKTAVCSPGRRKICGRSLRLSPGACYTGTLGAAMTMKTGW 183 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD-PEDVNSAPICQDKKRTKMAALKN 233 IP + + VVD S S+ A + + + +A + + Sbjct: 184 GEIPCF-------KNYLDLVYVVDSSNSISDANFTIMKQIIVNASEAFEASIGDTTQVAV 236 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 LDS H Y + + + +M L T Sbjct: 237 LQYGNLDSAA-FDHTDSKYYKSPTKLGDC--------NDIDCFNRAIKANMTHLNAANTF 287 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 ++ A+++A F +K ++ +TDG+ N I Sbjct: 288 TSLAIRRAV--------------EFDFAQSKNKDKAKKILVLITDGQANFQS---QLIVS 330 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSP---EYHYNVVNADSLIHVFQNISQLMV 410 + + I + I + + + L+ + E + N L +N+++ + Sbjct: 331 YRLTQSHNITVYAIGVALKSDAE--LRISANGGVSKERVLDANNYSELSKALRNLTETIA 388 Query: 411 H 411 Sbjct: 389 Q 389 >gi|145219383|ref|YP_001130092.1| TadE family protein [Prosthecochloris vibrioformis DSM 265] gi|145205547|gb|ABP36590.1| TadE family protein [Chlorobium phaeovibrioides DSM 265] Length = 144 Score = 46.0 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 39/124 (31%), Gaps = 2/124 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 S+K + + FA + L+L+ ++Y Y K + A AG + Sbjct: 20 RSQKGSVLVEFAFILPVLLMLLFGVVYFSVALYNKTVLTMATREGARAGVLYVAGQTDYS 79 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G+ ++ A + L + F + + + S I + + Sbjct: 80 GNISNAV--AAAQPLCNSVISFGSDATPVAQASVSGDILTVSASGNYTGLYIFSGLVLSA 137 Query: 136 RLDS 139 + Sbjct: 138 QTSM 141 >gi|332227196|ref|XP_003262777.1| PREDICTED: vitrin isoform 1 [Nomascus leucogenys] Length = 694 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 31/186 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDST 295 D L MG++ Y + + ++ + + ++ Sbjct: 332 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVG 389 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + S + ++ + DG + + Sbjct: 390 RAISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASR 431 Query: 356 KAKENFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVVNADSLIHVFQNIS 406 A+E+ I I I+I + ++ C ++ Y ++V + L Q + Sbjct: 432 LARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLV 491 Query: 407 QLMVHR 412 + + Sbjct: 492 KRVCDT 497 >gi|198436415|ref|XP_002121394.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 904 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 75/246 (30%), Gaps = 27/246 (10%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + I+ + + ++ +S+ + + R + Sbjct: 676 DDYILRPSTSQTIRCQSNKNWTHAPPCCARQCPASPKIDIVMVLDSSSSVTEPGWRKMIN 735 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 +K AL F + + V Y I ++ ++G E++ + + R Sbjct: 736 FVKTAL-GFYEMGPNSTSVSVFRYNAEIDEANKISFQYTQTYGKEQLLRRIGRLP--YNG 792 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + T + A+ A ILT++ I ++ LTDG++ + Sbjct: 793 QGTRTGQALSYALHILTNE---------------INRPDAVDVVLVLTDGKSQD-----A 832 Query: 350 TIKICDKAKENFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN-ADSLIHVF-QNI 405 + + N + I+I L +P + + + S F + I Sbjct: 833 VKAPAEALRRNGVLTYAIAIQPERGVLNMNQLNDIAGTPHNLFLLRSGFSSFTEDFAREI 892 Query: 406 SQLMVH 411 + H Sbjct: 893 RNSVCH 898 >gi|332262298|ref|XP_003280198.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Nomascus leucogenys] Length = 1284 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 44/161 (27%), Gaps = 23/161 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K L + + V +GL Y+ + + + K + Sbjct: 193 SHFQQVKEFLASVIA---PFEIGPDKVQVGLTQYSGDAQTEWDLNSLGTKEQVLAAVHRL 249 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ Q K +I +TDG++ + Sbjct: 250 RYKGGNTFTGLALTH-------------VLGQNLQPAAGLRPEAAKVVILVTDGKSQDDV 296 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 K+ + + + + + + L+ S P Sbjct: 297 H-----TAARVLKDLGVNVFAVGVKNADEAE--LRLLASPP 330 >gi|198426626|ref|XP_002122822.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1823 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 25/244 (10%), Positives = 61/244 (25%), Gaps = 34/244 (13%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDV---------NSAPICQDKKRTKMAALKNA 234 G E +++ S + + E V + + RT+ + + Sbjct: 871 SDGYNLGSRCEFACEVTPSQSFTLVGNTESVCLSEDGLTASWSHPAPTCIRTQCSQVPPI 930 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKVRQYVTRDMDSLILKPT 292 + +D + V + + T Sbjct: 931 PNGRMTCLDGNFVNSVCTFHCETSNGYSVYPETNNEITCLSNGTWDKQVPCCSKEYHLGT 990 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A++ A ++ S++ + ++ +TDG + + Sbjct: 991 LTGQALRHAKDVMFSEENGN-------------RPGVVDVVLTITDGRSQDH-----VRN 1032 Query: 353 ICDKAKENFIKIVTISINASP----NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ-NISQ 407 + + N + I I + LL S +L F +S+ Sbjct: 1033 VSRALRANGVLTYVIGIQPGNNKKLDKNELLDIAGSRENMFLVTGGFSALDQQFAYKLSR 1092 Query: 408 LMVH 411 + Sbjct: 1093 SICT 1096 >gi|55670411|pdb|1V7P|C Chain C, Structure Of Ems16-Alpha2-I Domain Complex Length = 200 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 61/192 (31%), Gaps = 28/192 (14%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +A+ FL+ L +GLI Y + K V S Sbjct: 20 YPWDAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQY 79 Query: 289 LKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A++ A + S K ++ +TDGE+++ Sbjct: 80 GGDLTNTFGAIQYARKYAYSAASG-------------GRRSATKVMVVVTDGESHDGSML 126 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--------LKTCVS--SPEYHYNVVNADS 397 I C+ + I I++ N L +K S + Y +NV + + Sbjct: 127 KAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAA 183 Query: 398 LIHVFQNISQLM 409 L+ + + + Sbjct: 184 LLEKAGTLGEQI 195 >gi|3182932|sp|Q28902|COCA1_RABIT RecName: Full=Collagen alpha-1(XII) chain gi|13195730|gb|AAB34889.2| type XII collagen [Oryctolagus cuniculus] Length = 639 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 49/138 (35%), Gaps = 23/138 (16%) Query: 277 RQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + T + A+ NF Q +K + Sbjct: 3 KSLLQAVANLPYKGGNTLTGMAL-----------NFIRQQNFKTQAG--MRPRARKIGVL 49 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVV 393 +TDG++ + K K+ +++ I I + + LK + P+ + YNV Sbjct: 50 ITDGKSQDD-----VEAPSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVA 102 Query: 394 NADSLIHVFQNISQLMVH 411 + DSL + +++ + + Sbjct: 103 DFDSLSKIVDDLTINLCN 120 >gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3] gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. ANA-3] Length = 751 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 75/252 (29%), Gaps = 44/252 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N +V +P + + LV+D SGSM Sbjct: 347 SSNTDNYSLVMVLPPKVEASEQLNLPRELILVIDTSGSMA-------------------- 386 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KNAL L + + + + + RQ+V R Sbjct: 387 GDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLLS-PTPLPATASNLAMARQFVNRL- 444 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ A+ A + +IF+TDG Sbjct: 445 --QADGGTEMAQALNAAL-------------PRQAFNAASAEDKSLRQVIFMTDGS--VG 487 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I ++ +N ++ T+ I ++PN + + + + D + Q Sbjct: 488 NESALFELIRNQIGDN--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVE---QK 542 Query: 405 ISQLMVHRKYSV 416 ISQL+ +Y V Sbjct: 543 ISQLLAKIQYPV 554 >gi|294140776|ref|YP_003556754.1| hypothetical protein SVI_2005 [Shewanella violacea DSS12] gi|293327245|dbj|BAJ01976.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 438 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 11/103 (10%), Positives = 32/103 (31%), Gaps = 3/103 (2%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 + ++F + + S L + + K +++A ++++L A + ++ S Sbjct: 18 KQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSVLYAAKILQNDGSLFQ 77 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 R + + + + LS Sbjct: 78 AREAATLILIQNLG---FEENSELDTSIDLSSPDYNSTLVTSN 117 >gi|257892784|ref|ZP_05672437.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] gi|257829163|gb|EEV55770.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] Length = 677 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/374 (9%), Positives = 90/374 (24%), Gaps = 45/374 (12%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI--KNHIKESL 106 K ++ A++L +V+ E+ + H + + + Sbjct: 6 TKTNL-----ASMLMIVFSLVAVFFGGETFAETNAFHYQNNAEGEFLTNGVKEGDTYNYD 60 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMA------NNRLDSSNNTIFYNMDVMTSYDYRLQF 160 G + E I + + D + + + + Sbjct: 61 YGNAPEANFDEENYINYENSAFVKKSVKKAEGIQGLFDVTLAIKGNQLKKPIDLVMVIDY 120 Query: 161 IEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + ++ + + +E G I ++ + + + + + Sbjct: 121 SSSMTGEKLSNALKGLQEFGEELDDSLESGNIRIGIVAYNRFVYSTDDFLTDINQLEYFL 180 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + T M L+ K V++G Sbjct: 181 RNTAESHTGTFMQKGLLEGQSLLEEKSRPEAEKMLVHIG----------------DDSAN 224 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R Y+ ++ + + Sbjct: 225 RSYLPKENAQVFHNSGEIVD-----------YNGYHTDQYVTEFQTDSEKYQTSGSSTDP 273 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNA 395 ++ N T+ K IK +++ S G+ + + SSP + ++ N Sbjct: 274 NAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSIDENL 333 Query: 396 DSLIHVFQNISQLM 409 L + + I+ + Sbjct: 334 TGLGNALKEIANGI 347 >gi|221115448|ref|XP_002154505.1| PREDICTED: similar to polydom [Hydra magnipapillata] Length = 2514 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 68/227 (29%), Gaps = 39/227 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 I ++D+SGS+ + + S+ + + + +A + A++ F I+ Sbjct: 159 QYNNSKADIIFLIDVSGSISDDGFNTEREFVSSLLSKISVQ--PSAARIAVVTFGRDINK 216 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 I Y + R T+ A+++A + Sbjct: 217 D-----------IDYIDYGY----LDKNKCTFNEEFKRVK-HRKEGWTNINGALQKAKAL 260 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE--NFI 362 L S ++ F + LTDG N S + +++ Sbjct: 261 LDSANEKKFK-----------RHNVNTVAVLLTDGGWNYGGS---PYDTATNLRTGFHYV 306 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I +I + + ++ LK E + F ++ + Sbjct: 307 DIFSIGVGHWLD-RKQLKNIAGKEENVIIAKDFSD----FSGLATTI 348 >gi|148699894|gb|EDL31841.1| procollagen, type VI, alpha 1, isoform CRA_c [Mus musculus] Length = 227 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 51/153 (33%), Gaps = 25/153 (16%) Query: 229 AALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTR 282 +K+ F+D++ + + G + Y+ VE + G ++++ V Sbjct: 59 DKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDA 118 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 K T + A+K+ + L K++I +TDG Sbjct: 119 VK--YFGKGTYTDCAIKKGLEELLIGGSHLKEN---------------KYLIVVTDGHPL 161 Query: 343 NFKSNVN--TIKICDKAKENFIKIVTISINASP 373 ++AK IK+ +++I Sbjct: 162 EGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDH 194 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 45.6 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 21/204 (10%), Positives = 57/204 (27%), Gaps = 12/204 (5%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + D S D + +NAL F+ + + T Sbjct: 100 DNSGSMYDTLSGDPTGDPALMRTTYARNALRTFV--------GSSFHAGDSVSFATFNTN 151 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + + TD + + Y L + R + + Sbjct: 152 VV-LHADEAGDPVVMDMLLSGIRRPGTD--ESYTELYHALADMALPVGERSGRRAVIVLS 208 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 + + ++ ++ ++ N + + I + Q L + + + Sbjct: 209 DGEDYSYATHSGNPHPIYGNQQLSPDEVVEEYIRNGVTLYAIHFGLEKD-QYLGEMALKT 267 Query: 386 PEYHYNVVNADSLIHVFQNISQLM 409 Y+ + + L ++ +I Q + Sbjct: 268 GGAVYDAKDQEELTGIYHDIRQKI 291 >gi|189347157|ref|YP_001943686.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] gi|189341304|gb|ACD90707.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] Length = 2825 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/307 (10%), Positives = 68/307 (22%), Gaps = 38/307 (12%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN----- 147 + ++G Y + Y Sbjct: 2030 PVLTSAGETVVYDVTGDRLTAYVDSGSSTGALDGTDRVVFTFEITPIRPTCAAYEFTPVL 2089 Query: 148 --------MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV--- 196 + + + + L I + + + Sbjct: 2090 TQIDHPDAAADNIETLKTIDLSGSVYGYDNDGDRIGLGTVTLVITVQDDVPIPTVVSESF 2149 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 + + D + I +T++ K A+ ++ D L VK + + Sbjct: 2150 TAKPIDTNLTIILDVSGSMNDVIPNSGGKTRLQFAKEAIASLINQYDALGDVK----VQI 2205 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 I ++ + I +T+ T+ A+ Sbjct: 2206 ITFSASAQPIIVSGQVWLDPGTAITQINGLSASGSTNYDDAL-------------VDVMT 2252 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNN-----FKSNVNTIKICDKAKENFIKIVTISINA 371 + K+ + Q FL+DG N ++ + N I I + A Sbjct: 2253 TYNNAGKLSTPQSQNVGYFLSDGVPNTPATSPGINSPEEQQWESFLTTNNINNYAIGVGA 2312 Query: 372 SPNGQRL 378 L Sbjct: 2313 GATQATL 2319 >gi|222478564|ref|YP_002564801.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451466|gb|ACM55731.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 571 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 70/272 (25%), Gaps = 22/272 (8%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 L F +L +Y + + + GE + + + Sbjct: 221 AGPIDIMLTLDFSGSMLYDQYGGVVSTDPIQVDGQSYGETTKIDLVELGTRQFIDYLQAQ 280 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNAL-LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 + DV + D + + + + ++ + ++ L + + Sbjct: 281 NA--DVQVGVVYFDGEGSGENTPRTGILEPLTTNLSAVDTALSNLRQKLANVVSDAAPST 338 Query: 268 EPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 T + + A L S+ + + + Sbjct: 339 PFDNDGNPDPYSNADGIAT-----GTYISEGLDDAQTELASNGR---------ASAEKRN 384 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + F DG N + + TI++N S + + + Sbjct: 385 IVLSDGESFNGDGNTNYAPPASAAANARAASPAPATDVYTINVNGSASTLQAMAGPAGGS 444 Query: 387 E----YHYNVVNADSLIHVFQNISQLMVHRKY 414 + ++ + ++ VF N++ V K Sbjct: 445 GGDPVFFNDINDPLNIPTVFGNLAAQTVAEKV 476 >gi|330834639|ref|YP_004409367.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566778|gb|AEB94883.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 383 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 52/174 (29%), Gaps = 32/174 (18%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + K + L I + + I +++RV E + ++ Sbjct: 54 KIESAKKGAIELLKRI------PQGNKVSFITFSSRVNVVREFVDPED----LTNEIVNL 103 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A+ A+ + ++I LTDG + + Sbjct: 104 NAGGQTALFTALLTAFNL-------------------HNKHGVPSYVILLTDGNPTDDTN 144 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + + ++ V+ + N L S Y+V +A + Sbjct: 145 IETYRRL--QIPSS-VQAVSFGLGDDYNESILKSLADKSGGVFYHVSDAMEIPE 195 >gi|295690805|ref|YP_003594498.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295432708|gb|ADG11880.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 185 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 10/123 (8%), Positives = 29/123 (23%), Gaps = 10/123 (8%) Query: 13 KGIASEKANFSIIFALSVMSFLLL-IGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ FAL +++ G + + + S + Sbjct: 10 RFWRDRRGASAVEFALIAPVLIVMYCGMA-EFTQAMMAQRRLTN--------ITSSIGDL 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ + + F +K L+ + +S M Sbjct: 61 TAQASQTGPARTTDIFTIGAIIMSPFPTGGLKMCLASVVSDANGKATVAWSQASAAGMAE 120 Query: 132 MAN 134 Sbjct: 121 CPT 123 >gi|182434868|ref|YP_001822587.1| hypothetical protein SGR_1075 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463384|dbj|BAG17904.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 424 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 65/217 (29%), Gaps = 48/217 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++MAA K A LD+ H+ Sbjct: 41 VELVLDVSGSMRTRD--------------IDGQSRMAAAKQAFNDVLDAAPEEVHLGIRT 86 Query: 253 YMGLIGYTT---------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ + + V T PA+ A Sbjct: 87 LGADYPGEDRKVGCKDTKQLYPVGPL--DRTEAKAAVATLA---PTGFTPIGPALLGAAD 141 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + I+ +TDGE+ + + A+ + Sbjct: 142 DLEGGEGS-------------------RRIVLITDGEDTCGPLDPCEVAREIAARGTHLV 182 Query: 364 IVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 + T+ + + ++ L ++ + V + + L Sbjct: 183 VDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 219 >gi|4996347|dbj|BAA78416.1| complement B/C2-A2 [Cyprinus carpio] Length = 750 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 53/213 (24%), Gaps = 13/213 (6%) Query: 206 AMNSDPEDVNSAPICQDKKRTK-MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE 264 D+ A + K K + + +D I + TR+ Sbjct: 253 MNQGGKLDIYIAVDASESIDKKDFENAKTTIKMLIDKISYYPVSPNYEILMFATDVTRIT 312 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + R + D +K I + Sbjct: 313 SMRDFKNNEDA-RNLMNIFKDLDDFNYERV--GVKTGTNIAKAYTAILESIKLEELNNAA 369 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-------KENFIKIVTISINASPNGQR 377 Q +I TDG+ N + ++ +E + + + N + Sbjct: 370 IFNETQHIVILFTDGQANMGGNPRPKVEQIKHLVTKNHPNREKNLDLYVFGVGDDVNQED 429 Query: 378 LLKTCV--SSPEYHYNVVNADSLIHVFQNISQL 408 + +Y + + + + +F ++ Sbjct: 430 INGLVSQRDQEKYFFKLKDLTEVQKMFDDMIDE 462 >gi|326775386|ref|ZP_08234651.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326655719|gb|EGE40565.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 424 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 65/217 (29%), Gaps = 48/217 (22%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 +ELV+D+SGSM +++MAA K A LD+ H+ Sbjct: 41 VELVLDVSGSMRTRD--------------IDGQSRMAAAKQAFNDVLDAAPEEVHLGIRT 86 Query: 253 YMGLIGYTT---------RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ + + V T PA+ A Sbjct: 87 LGADYPGEDRKVGCKDTKQLYPVGPL--DRTEAKAAVATLA---PTGFTPIGPALLGAAD 141 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + I+ +TDGE+ + + A+ + Sbjct: 142 DLEGGEGS-------------------RRIVLITDGEDTCGPLDPCEVAREIAARGTHLV 182 Query: 364 IVTISINASPNGQRLLK-TCVSSPEYHYNVVNADSLI 399 + T+ + + ++ L ++ + V + + L Sbjct: 183 VDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 219 >gi|269128710|ref|YP_003302080.1| Vault protein inter-alpha-trypsin domain-containing protein [Thermomonospora curvata DSM 43183] gi|268313668|gb|ACZ00043.1| Vault protein inter-alpha-trypsin domain protein [Thermomonospora curvata DSM 43183] Length = 795 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 46/149 (30%), Gaps = 24/149 (16%) Query: 262 RVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + S T++ R + V T+ +++ +L + Sbjct: 349 PAGLDGGLSPATDRNRFRAVEHLAGLQARGGTELAAPLREGAALLDDAGRD--------- 399 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 + ++ +TDG+ + I N ++I + I+ + N L + Sbjct: 400 ----------RVLVLITDGQ--VGNEDQLLALIDPFL--NGLRIHAVGIDQAVNAGFLGR 445 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + V + D L ++I + Sbjct: 446 LATAGQGRLELVESEDRLDEAMEHIHHRI 474 >gi|162455534|ref|YP_001617901.1| hypothetical protein sce7252 [Sorangium cellulosum 'So ce 56'] gi|161166116|emb|CAN97421.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 402 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/252 (9%), Positives = 65/252 (25%), Gaps = 36/252 (14%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-----------LSHVKE 250 + + D +P T+ + +A + + ++ + + Sbjct: 87 APNIYFVLDASGSMLSPASSSSNATRYDRVHDAAVDLVRNLGPLINVGAAVFPLDATDEA 146 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 G + + L T + + L Sbjct: 147 PCRPGGEVFAVTPGAPFAGGQEGSTTTSRFRAATRAAPLGGTPTAATLAALTPKLVDLPG 206 Query: 311 RSFFTNFFRQGVKIPSL-------PFQKFIIFLTDGENNNFKS-----------NVNTIK 352 ++ G S D E N +T+ Sbjct: 207 KTIVVLATDGGPNCNSSASCEADGCIANIEQQCPDPEENCCSPLGPSGPEGCLDRDDTVA 266 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE-------YHYNVVNADSLIHVFQNI 405 ++ + I++ + I S +L + +++ V + D+L +V I Sbjct: 267 AIERLAASGIEVYVVGIPGSEFYGDVLDQMALAGGAAQFVSPFYFKVDDLDTLGNVLSKI 326 Query: 406 SQLMVHRKYSVI 417 + ++V ++ ++ Sbjct: 327 ASIVVSCEFDLV 338 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 65/203 (32%), Gaps = 36/203 (17%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD- 283 A N + ++D+ K +G + Y+ R + + + Sbjct: 87 SRDYQACANYATSIVRNLDMP---KTGARVGTVIYSKRSIVLFDFKDNKTDIILQLNTIK 143 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + A++ A + L + +R+ +K I +T GE + Sbjct: 144 YREKPNRALSTGQALELAQKRLFQNARRNS----------------KKIAILVT-GEKSQ 186 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN--VVNADSLIHV 401 + + K++ + I T+ + L++ SSP Y Y V L Sbjct: 187 DDVIIPS----KLMKDSGVIIYTVGVGEG-YFLPQLESIASSPSYVYTEKVKGIGELATR 241 Query: 402 F--------QNISQLMVHRKYSV 416 +NI+ +K +V Sbjct: 242 MVEGVCKEKENIATKEGSKKVNV 264 >gi|156602970|ref|XP_001618750.1| hypothetical protein NEMVEDRAFT_v1g153509 [Nematostella vectensis] gi|156200182|gb|EDO26650.1| predicted protein [Nematostella vectensis] Length = 133 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 17/150 (11%), Positives = 48/150 (32%), Gaps = 24/150 (16%) Query: 265 KNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++G ++ V +D + T + A++ + Sbjct: 1 PQTIFTFGKYNNKKDVLEAVDKMPYPKGSTYTGRALQYMNDEIY---------------R 45 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 K + +I LTDG+ ++ + ++ I+I +I + S + L Sbjct: 46 KATRVGVPNILIVLTDGKAHDSVAEP-----AKALRDIGIEIYSIGVGESYDKAELDAIA 100 Query: 383 VS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + ++V + ++ + + + Sbjct: 101 TDPDASHVFSV-DFKNMNSIVSTLDARICT 129 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 57/159 (35%), Gaps = 16/159 (10%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L+ + F + + + ++ + V+ ++ + T + Sbjct: 37 ITELQEGVSTFFAQLLADDVAEYSAEVAVVTFGGNVDMAVDFAAVTRQT------VPSLT 90 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T A++ A ++L + K+ + R GV +Q +++ +TDG + + Sbjct: 91 AGGMTPMGEAVETALELLHTRKE-----EYKRAGVDY----YQPWLVIMTDGAPTD-NIS 140 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + + D +E + + I I + L K P Sbjct: 141 KASRLVDDLVREKKLAVFAIGIGKDADMNELAKLSGGRP 179 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 57/165 (34%), Gaps = 24/165 (14%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 ++ Y R K + T + A+K A L + K+ S Sbjct: 1 MVLYNDRPSAEFYLDTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQKRGSRKA 60 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 ++ I +TDGE S + + + + + + + + Sbjct: 61 LG-----------VKQIAIVMTDGE-----SEDDVTTTAAELRRSGVTVYALGV--KNAS 102 Query: 376 QRLLKTCVSSPEY--HYNVVNA---DSLIHVF-QNISQLMVHRKY 414 LK S PE+ +NV + SL +++ +++V+R++ Sbjct: 103 VEELKKIGSYPEHEFVFNVGSFLMLSSLEKSLRKSLCKVVVNRRF 147 >gi|221136524|ref|XP_002171216.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 126 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 1/72 (1%) Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 K II LTDG + +I D+ + + + I+ + I ++ N + L + Sbjct: 23 VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVN-ETELSHITGGKKN 81 Query: 389 HYNVVNADSLIH 400 Y DSL Sbjct: 82 AYTAATFDSLTD 93 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 54/177 (30%), Gaps = 22/177 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L ++ +G+I Y+TR + + K + + + T + Sbjct: 24 QFTLKTVIGFDIGPTATQVGIIQYSTRPRQEFSMNSFQTKESLSTAIENVNYMAGGTLTG 83 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + + K +I +TDG S+ + + Sbjct: 84 RAIRYVTKYGFGESDG-------------ARPGIPKIVILVTDGV-----SSDDIEQPAL 125 Query: 356 KAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +A++ I + I + + L+ S+ N + L + + + Sbjct: 126 EAQQKGISLYAIGVSGYDMDQ---LERIASNNRTLAVAENFNLLDSLRNTLLTGICD 179 >gi|154493525|ref|ZP_02032845.1| hypothetical protein PARMER_02864 [Parabacteroides merdae ATCC 43184] gi|154086735|gb|EDN85780.1| hypothetical protein PARMER_02864 [Parabacteroides merdae ATCC 43184] Length = 1299 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 32/330 (9%), Positives = 88/330 (26%), Gaps = 18/330 (5%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANNAA-ILA 63 R + Y ++ + I + ++ L + G + V ++++ +A + ++ Sbjct: 11 RNKKYKERRMKRNSGLIGWI--VLIVGLLCSVRGGAVDVGSAAALRDALGNATVSGNVVM 68 Query: 64 GASKMVSNLSRLGDRFESISN---HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + + I + + A+ + +L F + + Sbjct: 69 LTGDVSLSSTLNITGGTMILDLNGMQISITKNKAEAKCISVTGGTLEITGGGFISAQTTG 128 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S + ++ + +N Y L + Sbjct: 129 TEWFSDRAAIALSYDGGTVRIYRATFNAIASDGTAYTLDPNNDYTVDNMIPAGAYMTNSS 188 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 G + + + + S + + + + + T KN F D Sbjct: 189 DYGSTGLVSSSITVALTNYNVSYNTSGGTTTNPGTPSYTIETPDFTLPTVTKNG-YTFAD 247 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM-- 298 + V T + +W ++ + + T + Sbjct: 248 WTYNGNPVNPTALPTTADRVTSKDMAFGATW------TLISYKVVYDVAGGTAIQDGLYN 301 Query: 299 -KQAYQIL-TSDKKRSFFTNFFRQGVKIPS 326 + L T ++ F ++R K+ S Sbjct: 302 IETGISSLPTPKREGYVFNGWYRDNQKVTS 331 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 63/194 (32%), Gaps = 33/194 (17%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL- 287 + + + + + +I ++++ + ++ V + + Sbjct: 150 QNFASEIKFVKKLLSDFNVSYNYTRVAVITFSSQKKIFRHIDQISQSVEDNDKCLLLNYQ 209 Query: 288 ------ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A+K+A +I + + S +K I +TDG + Sbjct: 210 VPRIAFSGGGTYTYGALKEAEEIFKNARLDS-----------------KKIIFLITDGFS 252 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIH 400 N K+N + I +I I + + L ++ Y + + D Sbjct: 253 NGRDPIPL---AGRLKKDNNVVIYSIGIQSGNYAE-LHAIASAPEGDHCYLLDSFDH--- 305 Query: 401 VFQNISQLMVHRKY 414 F+ +++ +H Y Sbjct: 306 -FETLARKALHSDY 318 >gi|88800837|ref|ZP_01116392.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] gi|88776410|gb|EAR07630.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] Length = 716 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 13/135 (9%), Positives = 40/135 (29%), Gaps = 28/135 (20%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN-NFKSNVNT 350 T+ A+ +A + ++ +TDG+ + ++VN Sbjct: 90 TNLGRALDEAAYDF-----------------AYSTYTGPTHVVLITDGQVDIAPNADVNQ 132 Query: 351 IKICDKA-------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ +I TI+++ + L + + + L+ + Sbjct: 133 VERGRILSQLVPRYNSANARIHTIALSDDADHALLRQLSEQTGGQYLRANQGADLLPLLT 192 Query: 404 NISQL---MVHRKYS 415 ++ + + Sbjct: 193 SLGNEVAPVSQLRVR 207 >gi|332227202|ref|XP_003262780.1| PREDICTED: vitrin isoform 4 [Nomascus leucogenys] Length = 657 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 31/186 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDST 295 D L MG++ Y + + ++ + + ++ Sbjct: 295 LADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQR--GGLSNVG 352 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + S + ++ + DG + + Sbjct: 353 RAISFVTKNFFSKANGN-------------RSGAPNVVVVMVDGWPTD-----KVEEASR 394 Query: 356 KAKENFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVVNADSLIHVFQNIS 406 A+E+ I I I+I + ++ C ++ Y ++V + L Q + Sbjct: 395 LARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLV 454 Query: 407 QLMVHR 412 + + Sbjct: 455 KRVCDT 460 >gi|314927767|gb|EFS91598.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] Length = 322 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D ++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDASPNRLAAAKAKAKDFVNSLPTG------FNVAVVSISAHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + + T A+ ++ Q + S Sbjct: 155 RPTVLRALDGIELQD---GTALGEAIDKSLQAVKMAPGGSKD-------------RVPAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-----ASPNGQ--------RLL 379 I+ L+DG N S + AK + + TI+ +GQ +LL Sbjct: 199 IVMLSDGGNTQGGSPLVAATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLL 255 Query: 380 KTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V++ + + Sbjct: 256 SDVADRTDAKSWTADSADKLQEVYKQVHSSVG 287 >gi|260824071|ref|XP_002606991.1| hypothetical protein BRAFLDRAFT_64975 [Branchiostoma floridae] gi|229292337|gb|EEN63001.1| hypothetical protein BRAFLDRAFT_64975 [Branchiostoma floridae] Length = 179 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 10/97 (10%), Positives = 28/97 (28%), Gaps = 6/97 (6%) Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 N + ++ LTDG + + ++ I + Sbjct: 18 VANTMLLPLNGNRPDAPDVVVVLTDGFSGDD-----VTGPASLLHSMGVQTFVIGVG-GC 71 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 L + ++ Y + + + L + +I + + Sbjct: 72 VSDAQLGNIANCEDHVYKLSDFNGLDGITDSIHKQIC 108 >gi|220922774|ref|YP_002498076.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219947381|gb|ACL57773.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 193 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 16/169 (9%), Positives = 49/169 (28%), Gaps = 10/169 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + +++ ++ F+L + F L+ +I + +AA+ + Sbjct: 9 LTRSLRAFGRAQQGATAVEFSLVAIPFFGLLAAIIETAIAFFA----GQLLDAAVSNASR 64 Query: 67 KMVSNLSRLG------DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ + + ++++ + F + +K + I Sbjct: 65 QIYTGAFQTQTGVSATTSEQALTAFRNNLCANRVTIFNCSAVKVDIRTLDDNASFAAISP 124 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 + S++ T LD+ V+ ++N Sbjct: 125 VDPSTKGWATGFGTRYLDAGGKPPGPGKIVIVQAAVAFPIFFSMINPAT 173 >gi|330904068|gb|EGH34640.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 84 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 20/97 (20%) Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 VR ++ + K T A+ A + L + ++ Sbjct: 8 TVRVWLDEARIGIAGKNTALGDAIGLALKRL------------------RMRPATSRALV 49 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +TDG NN + + I A E +KI I I + Sbjct: 50 LVTDGANNAGQID--PITAARLAAEEGVKIYPIGIGS 84 >gi|326666584|ref|XP_687953.4| PREDICTED: collagen alpha-1(VII) chain [Danio rerio] Length = 2001 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/321 (9%), Positives = 76/321 (23%), Gaps = 58/321 (18%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + S + V NT+ ++ + T + + + + Sbjct: 882 VPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASATGQTASALIRMTTVNDQS 941 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 I+S G I +VD S S+ Sbjct: 942 -----------ILSAETPPPGPVCGRVK-ADIVFLVDESWSIGTNNFGK----------- 978 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 LK+ L + + + ++ Y+ + + ++ Sbjct: 979 ---------LKDFLFRTVTYFPSIGPKG--TQIAVVHYSDQPRIEFNFNTHKDRNSVLRA 1027 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + + + + + ++ LTDG Sbjct: 1028 LREVRYGGGNTKTGRGISYVLREMFQESLGM-------------RQEAPHVLVLLTDGRA 1074 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL 398 + + A + ++ I I A + + + SP + + + D L Sbjct: 1075 QDDVEPPS-----RIAHALGVSVLVIGI-AHADMEEVRTIA--SPTTYKNIFYASDFDDL 1126 Query: 399 IHVFQNISQLMVHRKYSVILK 419 + + + K Sbjct: 1127 PSIEREFIGSICSEALQSEFK 1147 >gi|300704936|ref|YP_003746539.1| pilus related protein, tade-like [Ralstonia solanacearum CFBP2957] gi|299072600|emb|CBJ43950.1| putative pilus related protein, TadE-like [Ralstonia solanacearum CFBP2957] Length = 151 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 44/138 (31%), Gaps = 2/138 (1%) Query: 13 KGIASEK--ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + + + + + FAL LL++ ++ Y K+ + +A+ A AG Sbjct: 12 RLVRTRRMQGAAGVEFALIFPLLLLVVFGIVEFGAAWYDKSVITNASREAARAGVVFSSP 71 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 S + + + R + + + S + + + + + + Sbjct: 72 APSTTKIQSVATNYCQNRLVTFGSATNCTASVSPSATCTAGTGNQLTVTVAYTFTGLVLG 131 Query: 131 HMANNRLDSSNNTIFYNM 148 +A + + + Sbjct: 132 KLAPFAGSLALSAQTTML 149 >gi|260827154|ref|XP_002608530.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] gi|229293881|gb|EEN64540.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] Length = 306 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 62/188 (32%), Gaps = 17/188 (9%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T +LK + L I + + + ++ +++ V N T + + Sbjct: 118 STDFTSLKTYVAELLSYICPENEMGTFHRVAVVTFSSSVVLNFNFHEATSLGQIQASIHS 177 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T + A+ T F+ G ++ +TDG N Sbjct: 178 LPYEGGSTRTADAI------------NFVRTQVFQTGNYRDEPDVDLEVLLITDGHPNGA 225 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRL---LKTCVSSP--EYHYNVVNADSLI 399 ++ +++ +A I + ++ + LK V++P ++ +N+ + Sbjct: 226 GNSPQDVELAAEALGERANIFALGYGSAYSSASDFDHLKNLVTAPWYKHIFNLEHVSDFE 285 Query: 400 HVFQNISQ 407 + ++ Sbjct: 286 TMLDTLAN 293 >gi|94732541|emb|CAK05117.1| novel protein similar to vertebrate collagen family [Danio rerio] Length = 1721 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/321 (9%), Positives = 76/321 (23%), Gaps = 58/321 (18%) Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + S + V NT+ ++ + T + + + + Sbjct: 771 VPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASATGQTASALIRMTTVNDQS 830 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 I+S G I +VD S S+ Sbjct: 831 -----------ILSAETPPPGPVCGRVK-ADIVFLVDESWSIGTNNFGK----------- 867 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 LK+ L + + + ++ Y+ + + ++ Sbjct: 868 ---------LKDFLFRTVTYFPSIGPKG--TQIAVVHYSDQPRIEFNFNTHKDRNSVLRA 916 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + + + + + ++ LTDG Sbjct: 917 LREVRYGGGNTKTGRGISYVLREMFQESLGM-------------RQEAPHVLVLLTDGRA 963 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL 398 + + A + ++ I I A + + + SP + + + D L Sbjct: 964 QDDVEPPS-----RIAHALGVSVLVIGI-AHADMEEVRTIA--SPTTYKNIFYASDFDDL 1015 Query: 399 IHVFQNISQLMVHRKYSVILK 419 + + + K Sbjct: 1016 PSIEREFIGSICSEALQSEFK 1036 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 58/185 (31%), Gaps = 27/185 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 A A + ++ + V + V +GL+ ++ + N V R L Sbjct: 208 AADLATVFQWMNVSSGTDVGQFVRVGLVTFSNQAFVNGNL-DDFTSYNSLVKRLFQMPYL 266 Query: 290 KPT--DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + A++ A IL S + I+ T Sbjct: 267 GGSELNIESALQSASDILQSS-----------------RYYARTAILLYTSAYGEGG--F 307 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQNIS 406 + I ++ KE+ KI+T++ P G + K ++SP + I Sbjct: 308 TDPKAIANQIKESGTKIITVAFRQQPEGSLVEKLSHLASPG-FSFASRQSIITD---EIL 363 Query: 407 QLMVH 411 + + Sbjct: 364 RALCQ 368 >gi|182439948|ref|YP_001827667.1| hypothetical protein SGR_6155 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178468464|dbj|BAG22984.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 596 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 46/406 (11%), Positives = 105/406 (25%), Gaps = 55/406 (13%) Query: 33 FLLLIGFL---------IYVLDWHYKKN---SMESANNAAIL---AGASKMVSNLSRLGD 77 L L G K M +A +L A ++ + + Sbjct: 201 LLALAGIGASSARQGGDSDTRVAQTAKALAERMSDG-DAQVLGTLARSTSGAEEGNPKRN 259 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI-------SMT 130 + +S A A +A + + G + Y + N + S MT Sbjct: 260 QAVLVSEQAAFAHNAEATGGGELDLFYPRDGAPLLDYPYTLVNEADLSVAESRAALRFMT 319 Query: 131 HMANNRL-DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERP 189 + + F D + PA E Sbjct: 320 LLGDRDARAVFAEHGFRAGDGSAEDSLVAAAGG-------RKPQPYATPAAEAPSAKELR 372 Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 L + + + + A + +++M K +L+ L+ + Sbjct: 373 ETLGMWTITVQSAR--LTTVVDASGSMATPVPGRGQSRMDVTKESLIQALEQFTPNDEIG 430 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS-- 307 + + + + R D T + A+ L Sbjct: 431 LWEFATTLDGDRDYRRLMP-----------TKRLGDPAEGGGTHREE-LTAAFAGLRPVP 478 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQ---KFIIFLTDGENNNFKSNVNTIKICD--KAKE--N 360 + + S + ++ LTDG N + +S + + + ++ Sbjct: 479 GGATGLYDTTLASYKEARSTFVKGKFNALVILTDGSNQDDRSISRSGLVAELKALRDPER 538 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH-VFQNI 405 + ++ I++ + + + + Y V + + + Q I Sbjct: 539 PVPVIAIAVGPDADRDEVAEIARVTGGDGYEVSDPAEIRAVILQAI 584 >gi|104782921|ref|YP_609419.1| RTX toxin [Pseudomonas entomophila L48] gi|95111908|emb|CAK16632.1| putative RTX toxin [Pseudomonas entomophila L48] Length = 2350 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 43/322 (13%), Positives = 82/322 (25%), Gaps = 51/322 (15%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LA S S G+ + + + + + Sbjct: 1442 LAAGSVTGSEPGSTGETASGSLVGSVSGGFGALTYSLVGNATGQYGQIQLNADGSYTYTL 1501 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 ++ + N ++ F + + + IV +P Sbjct: 1502 TSAPKTPG---GTNDGANTVVEQFTYKAT--------DALGNSVISTIAINIVDDVPKAE 1550 Query: 182 RIEMGERP---IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 P I LVVD+S SM+ T++ K A+ Sbjct: 1551 SAVRSITPGEVDSNILLVVDVSSSMN-------------SGSGVPGLTRLELAKQAINTL 1597 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 LD D + +K + ++ ++T + + + T A Sbjct: 1598 LDKYDDMGDIK----VQIVTFSTGATMQTPVWVSISEAKSLIAGLT---AGGSTYYDSAA 1650 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC--DK 356 +A + S K Q F +DGE + S + Sbjct: 1651 TKAQEAFVSAGKLVG---------------AQNVSYFFSDGEPSGGHSITAVRETTWETF 1695 Query: 357 AKENFIKIVTISINASPNGQRL 378 +N IK + + + N L Sbjct: 1696 LDDNGIKSFAVGMGSGVNAGNL 1717 >gi|283457760|ref|YP_003362348.1| hypothetical protein RMDY18_06960 [Rothia mucilaginosa DY-18] gi|283133763|dbj|BAI64528.1| hypothetical protein RMDY18_06960 [Rothia mucilaginosa DY-18] Length = 197 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 11/155 (7%), Positives = 46/155 (29%), Gaps = 8/155 (5%) Query: 3 LLSRFRFYFKKGIAS------EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 L +R R + + E+ + + + + LL+ + + + + ++ Sbjct: 42 LQARARAQSRALASDPLASDPEEGSIAPLIVGMLALLLLIGSVTVAITGAYLQTQHLQDV 101 Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +A + + + + + + + + E++S S + Sbjct: 102 ADAQANSITRTIRT--PEEANGSAAWEYASAYLAEIQPGQDFQALRLENVSVDSDQSVHV 159 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + +S+ + + ++ Sbjct: 160 YLSARIRPPLLSILVPDGIEVTAHGSSRLKMSQGP 194 >gi|311271483|ref|XP_003133150.1| PREDICTED: anthrax toxin receptor-like [Sus scrofa] Length = 728 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 27/259 (10%), Positives = 61/259 (23%), Gaps = 39/259 (15%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + L + N + + ++ + P + + Sbjct: 82 QSTYDLYFILDMSGSINNNWMDIYALVE----DLVKKFDNPKIRMSFIT----------- 126 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 Q K+ + K+ ++ + + + + + Sbjct: 127 ---------YSTQGHILMKLTSDKDLSEFEVEENESVGCYHSGGRKRGRVGCGPLVEELP 177 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + +A++ L D N + Sbjct: 178 IREKKTTASSKAGIQNLKQ--------ESFGEAFEGL-PDPLYRVEANEQIEQANSGENK 228 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 II LTDG S T + D+A++ + I + L S ++ Sbjct: 229 VPSMIIALTDG-TLESISLQETKQQADRARKLGANVYCIGVKD--YETEQLSEIADSSDH 285 Query: 389 HYNVVNADSLIHVFQNISQ 407 + V D +NI Sbjct: 286 VFGV---DQGFKALKNIID 301 >gi|148657452|ref|YP_001277657.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] gi|148569562|gb|ABQ91707.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] Length = 392 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 12/160 (7%), Positives = 36/160 (22%), Gaps = 13/160 (8%) Query: 34 LLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF---ESISNHAKRAL 90 + G I + ++ ++ + A L ++ + + + Sbjct: 26 IAFAGLAIDGALAFAWRRNVMNSADGAALIATRALIVDRGSVNGIAITNAVRTYLQAELG 85 Query: 91 IDD-----AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS--RISMTHMANNRLDSSNNT 143 +D+ ++ A I +V + M + L + Sbjct: 86 VDNPDFELTYVNGAGQSMGTVDSSPAPVNARGIGIVVRHTFDTYLMGILGQPTLTVRGVS 145 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + I + + P + Sbjct: 146 AARFGNGGM---LIGDLIAPMGLLPEAAEFFRSNPTSALV 182 >gi|315186713|gb|EFU20471.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 331 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 66/223 (29%), Gaps = 56/223 (25%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D++ + + +D +++ K L+ + I G++ + + E + + Sbjct: 94 FDISRSMLVRDVPPSRLDVAKEIALMLVSRI-------SGARWGVVAFKGKGELLLPLTP 146 Query: 272 GTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + L TD + +A + R Q Sbjct: 147 DLLGLEDAIGLLSPVLLRSPGTDVASGLSRALEAFPEQSNR------------------Q 188 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------------- 373 + +I L+DGE + + A+ + + T+ I Sbjct: 189 RLVILLSDGEALTGEIGPVL----ELARNLGVAVHTVGIGTESGGPVPLEGEEVLKKPSG 244 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L + + ++V NA+ FQ++ + Sbjct: 245 EPVISRLDASLLKRIAEITGGRFFSVENAEG--QTFQHVVSTI 285 >gi|268324283|emb|CBH37871.1| hypothetical secreted protein, CARDB family [uncultured archaeon] Length = 1149 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 21/237 (8%), Positives = 61/237 (25%), Gaps = 46/237 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 I V D++GSM +++ +K+ F D + Sbjct: 82 TTKNADIVFVFDITGSM---------------------GEEISEMKDISKNFADGLAAAG 120 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY-QIL 305 + ++ + ++ +M +++ L Sbjct: 121 ID--------YRFGLTEFRDFPVTCDGTVCGDANDFAYKVYNGGVLSNSSSMFKSWIDSL 172 Query: 306 TSDKKRSFFTNFFRQ------GVKIPSLPFQKFIIFLTDGENNNF-----KSNVNTIKIC 354 + K K I+ ++D ++ + + Sbjct: 173 NPSGGGDLPESTLAALMHTVKDQKWRGGDASKIIVLISDAYPHSDEHCCNQEKNTFDGVI 232 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD-SLIHVFQNISQLMV 410 + + + + P+ + S+ +++ SL V + I+ ++ Sbjct: 233 SALTRSGMTVYVVG----PDEASMEMIANSTGGKFFHIRAEGVSLKPVLEEIAGVLT 285 >gi|297559547|ref|YP_003678521.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843995|gb|ADH66015.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 588 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/203 (11%), Positives = 59/203 (29%), Gaps = 24/203 (11%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 N + ++ MGL + E ++ +++ D Sbjct: 384 ADTSANMAEDLDGGPSRMEVAQQAALMGLSLFPD--ETDMGLWLMSDENASGRVEAADMH 441 Query: 288 ILKP-----TD------STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 L T A + A + S + + R +I L Sbjct: 442 PLGGAEQGDTATRRRELIGVAEEIAVRGGGSRLYDNILAAYDRVQDDYDEDKI-NSVILL 500 Query: 337 TDGENNNFKSNVNTIKICDKAKEN-----FIKIVTISINA---SPNGQRLLKTCVSSPEY 388 T G+ + S++ + ++ + + I+ + + L + ++ Sbjct: 501 TAGQ-DEGSSDIAHADLVAALQDRFDPERPVSMFIIAFGSREQQVAEEELRRIAAATSGS 559 Query: 389 HYNVVNADSLIHVF-QNISQLMV 410 + + D + +F +IS+ + Sbjct: 560 LFVTDDPDEIGDIFLSSISRRLC 582 >gi|288919483|ref|ZP_06413814.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349086|gb|EFC83332.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 533 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 21/197 (10%), Positives = 57/197 (28%), Gaps = 27/197 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDS----IDLLSHVKEDVYMGLIGYTTRVEKN-----IEPS 270 + +++ L++AL+ + + + + L+ + + V + +PS Sbjct: 349 SGSMEGDRISDLRSALIGLTGADSSLTARFTSFRAREKITLVPFDSGVNRISDFAVTDPS 408 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + ++++ T + Sbjct: 409 PDSPELKELRRAVEGFNAGGDTAIYS--------------ALRAAYDRAAADLARDGSYY 454 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKA--KENFIKIVTISINASPNGQRLLKTCVSSPEY 388 ++ LTDGEN S + + + T+ + L + + Sbjct: 455 TSVVLLTDGENTTGASADDFLAHYRSLSPAARAVPTFTVLFG-DADPDALRQIADVTGGT 513 Query: 389 HYNVVNADSLIHVFQNI 405 ++ + L VF++I Sbjct: 514 VFDAGSTS-LPDVFKDI 529 >gi|119963136|ref|YP_948083.1| von Willebrand factor type A domain-containing protein [Arthrobacter aurescens TC1] gi|119949995|gb|ABM08906.1| putative von Willebrand factor type A domain protein [Arthrobacter aurescens TC1] Length = 467 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 18/200 (9%), Positives = 57/200 (28%), Gaps = 27/200 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 T A ++ L+ + +G++ + + + + + + +T Sbjct: 242 SGSMTSTDAALASV------FQGLAKEFDGERIGMVIFDSSSVQLFPLTDDYQYAAEQLT 295 Query: 282 RDMDSLILK------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 R ++L T + L S + T + ++ Sbjct: 296 RAKEALDSGAGSFFDGTWNGEGSSLIGDGLASCIQSFPDT-------DTGDAKRSRSVVL 348 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTI-------SINASPNGQRLLKTCVSSPEY 388 TD + + + A +K+ + G +L + Sbjct: 349 ATD-NFLSGEPIFTLDEATALATGKSVKVYALNPGDMDYGEQPDQPGVQLRAAAERTGGS 407 Query: 389 HYNVVNADSLIHVFQNISQL 408 ++ + + D++ + + + + Sbjct: 408 YFTLDSPDAVPGIVRKVQES 427 >gi|83594486|ref|YP_428238.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] gi|83577400|gb|ABC23951.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] Length = 575 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 62/187 (33%), Gaps = 27/187 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++++A K ++ ++++ D+ +G++ ++ + S Sbjct: 405 SEGIAGAPSRISAAKRSIRDTVNALHK------DIRVGMVSFSDCMATQ--NSKYYSAAE 456 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + PA +T ++ S R+G + + + ++ ++ Sbjct: 457 R-----------------PAFLAGVDAITPERATSL-AASIRRGGALATRRSETVMMVVS 498 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DGE+ + KA+++ + I I ++ N + + +A Sbjct: 499 DGEDTCG-GDPCAAARAVKAEKSNVIIHVIDLSGGGNSGVARCIASAGGGRVFTPGSAAQ 557 Query: 398 LIHVFQN 404 + + Sbjct: 558 VTSSLRT 564 >gi|190894969|ref|YP_001985262.1| hypothetical protein RHECIAT_PC0000635 [Rhizobium etli CIAT 652] gi|190700630|gb|ACE94712.1| hypothetical conserved protein [Rhizobium etli CIAT 652] gi|327193255|gb|EGE60161.1| hypothetical protein RHECNPAF_1700074 [Rhizobium etli CNPAF512] Length = 176 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 47/121 (38%), Gaps = 7/121 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + + +I FA+ + +++ +I V + + A +A++ + + + Sbjct: 10 LRRLLGDRQGVAAIEFAILALPLFIMLFGIIEVSLMFFVNS----ALDASVHKISRMIRT 65 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFI-KNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + +++ + D F + + ++ S + T I NS +++ Sbjct: 66 G--EVASSKITLAGFKAKICDDMLLSFNCSSDLVVKVNVLSDLSSATSTDPIDNSGNLAI 123 Query: 130 T 130 T Sbjct: 124 T 124 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 78/229 (34%), Gaps = 34/229 (14%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D +GSM+ + D D A K L+ ++ + + Sbjct: 262 VLDPAGSMNIYLVLDGSDSIGASN--------FTGAKRCLVNLIEKVASYGVRPRYGLVT 313 Query: 256 LIGYTTRVEKNIEP-SWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKRS 312 Y + + +P S V + + + D + T++ A+ + Y +++ Sbjct: 314 YATYPNVLVRVSDPKSSDANWVTEKLNQISYEDHKLKTGTNTKRALVEVYNMMS------ 367 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KENFI 362 + + +G II +TDG +N V I +E+++ Sbjct: 368 WPGDVPPEGWNRTRH----VIILMTDGLHNMGGDPVTVINEIRDLLNIGKDRKNPREDYL 423 Query: 363 KIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV-FQNISQL 408 + + P L + + ++ + V + + L V FQ I + Sbjct: 424 DVYVFGVGPLVEPANINALASKKENEQHVFRVKDMEHLEDVFFQMIDES 472 >gi|194043712|ref|XP_001928130.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Sus scrofa] Length = 2972 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 56/185 (30%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + +Q + Sbjct: 1444 RRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKQQIIDAINKV 1502 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ F + + +T G Sbjct: 1503 VYKGGRHANTKVGLEH-----------LRLNQFVPEAGSRLEQRVPQIAFVITGG----- 1546 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 KS + + + +K+ + + + + + + K S+ + V N L + + Sbjct: 1547 KSVEDAQEASLALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1604 Query: 405 ISQLM 409 + + + Sbjct: 1605 VLETL 1609 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 16/169 (9%), Positives = 52/169 (30%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + K Q + L + Sbjct: 55 LDFLVNLLERLSVGAQQIRVGVVQYSDEPRTMFSLNSFATKA-QVLDAVKALGFLGGELA 113 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ +++ + Sbjct: 114 NVGLALDF---------VVDHHFTRAGGSRVEEGVPQVLVLISAAPSSDKIRDAVL---- 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ ++ + + V + L Sbjct: 161 -ALKQ--ASVFSFGLGAQAASKAELQHIATNDNFVFTVPEFRSFGDLQD 206 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 60/181 (33%), Gaps = 19/181 (10%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + ++ + D L + + +I ++ + + K Sbjct: 1042 QYIRTLIERLVDSLDVGFDTSRVAVIQFSEDPRVEFLLNAHSSKDEVQ-NAVRRLRPKGG 1100 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 A + ++ + + + +G +F+I ++ G KS+ Sbjct: 1101 RRINIG--GALEYVSKNIFKRPLGSRIEEG-------VPQFLILISSG-----KSDDEVD 1146 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + K+ + +TI+ + + + L+K SPEY ++V L + Q + + Sbjct: 1147 DSAAELKQFGVAPLTIA--RNADPEELVKIS-LSPEYVFSVSTFRELPSLEQKLLTPITT 1203 Query: 412 R 412 Sbjct: 1204 L 1204 >gi|194043710|ref|XP_001928122.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Sus scrofa] Length = 3178 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 17/185 (9%), Positives = 56/185 (30%), Gaps = 21/185 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + + L + +D + + + +GL+ Y + +Q + Sbjct: 1650 RRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLK-DFSTKQQIIDAINKV 1708 Query: 287 LILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + ++ ++ F + + +T G Sbjct: 1709 VYKGGRHANTKVGLEH-----------LRLNQFVPEAGSRLEQRVPQIAFVITGG----- 1752 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 KS + + + +K+ + + + + + + K S+ + V N L + + Sbjct: 1753 KSVEDAQEASLALTQRGVKVFAVGV-RNIDSEEVGKI-ASNSATAFRVGNVQELSELSEQ 1810 Query: 405 ISQLM 409 + + + Sbjct: 1811 VLETL 1815 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 16/169 (9%), Positives = 52/169 (30%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + K Q + L + Sbjct: 261 LDFLVNLLERLSVGAQQIRVGVVQYSDEPRTMFSLNSFATKA-QVLDAVKALGFLGGELA 319 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ +++ + Sbjct: 320 NVGLALDF---------VVDHHFTRAGGSRVEEGVPQVLVLISAAPSSDKIRDAVL---- 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ ++ + + V + L Sbjct: 367 -ALKQ--ASVFSFGLGAQAASKAELQHIATNDNFVFTVPEFRSFGDLQD 412 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 60/181 (33%), Gaps = 19/181 (10%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + ++ + D L + + +I ++ + + K Sbjct: 1248 QYIRTLIERLVDSLDVGFDTSRVAVIQFSEDPRVEFLLNAHSSKDEVQ-NAVRRLRPKGG 1306 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 A + ++ + + + +G +F+I ++ G KS+ Sbjct: 1307 RRINIG--GALEYVSKNIFKRPLGSRIEEG-------VPQFLILISSG-----KSDDEVD 1352 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + K+ + +TI+ + + + L+K SPEY ++V L + Q + + Sbjct: 1353 DSAAELKQFGVAPLTIA--RNADPEELVKIS-LSPEYVFSVSTFRELPSLEQKLLTPITT 1409 Query: 412 R 412 Sbjct: 1410 L 1410 >gi|119572368|gb|EAW51983.1| hCG2001479, isoform CRA_a [Homo sapiens] Length = 2537 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 27/336 (8%), Positives = 73/336 (21%), Gaps = 15/336 (4%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL 90 S I +S+ + + + + + + ++ + Sbjct: 2084 PSLTSTADATISTTVLIATTSSLTGTTDVST--STTINNISTPVQTNTTNASTSTNVANI 2141 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI---FYN 147 + + + + +A+ S+ S+ + + ++ NT Sbjct: 2142 TATSHTSTDDTVPNNTVPVTAIPSLANTGVDTTSNSFSIMTTSFSESTNAMNTTVIMATT 2201 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 T + N I + + V S S Sbjct: 2202 SPTSTDVAST-NNDASMTNFLLATMSAGNITSNSISITTTSFGNSVPFVTTPSPSTDATT 2260 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY--TTRVEK 265 S+ + Q L + + + + + T Sbjct: 2261 TSNNTNPGMTTYYQTSPTIPTHTLTSIPSSITSILSMFPTSNTFTTDKITNFTTPTNANT 2320 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT------SDKKRSFFTNFFR 319 I + T+ + T S + T + + T Sbjct: 2321 IIFNTLDTKSTMVIDATVTTTSTKDNTMSPDTTVTSIDKFTTHITQFATPHSATTTTLAL 2380 Query: 320 QGVKIPSLPFQKFIIF-LTDGENNNFKSNVNTIKIC 354 + +TD +N++ ++ Sbjct: 2381 SHTSLAPTNLSNLGTMDITDADNSSSVTDDTVEHGA 2416 >gi|83749631|ref|ZP_00946614.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|207728099|ref|YP_002256493.1| hypothetical protein RSMK04469 [Ralstonia solanacearum MolK2] gi|207744165|ref|YP_002260557.1| hypothetical protein RSIPO_02350 [Ralstonia solanacearum IPO1609] gi|83723692|gb|EAP70887.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|206591344|emb|CAQ56956.1| conserved hypothetical protein [Ralstonia solanacearum MolK2] gi|206595569|emb|CAQ62496.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609] Length = 345 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 19/288 (6%), Positives = 63/288 (21%), Gaps = 34/288 (11%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN----------------- 58 ++ ++ + ++ FL + + + +N +++A + Sbjct: 11 RRQRGAVGVLSPILLIIFLAIGAMAVDIAHLFVVRNELQNAADAAALAGAAGLYPANPKP 70 Query: 59 ------AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 A + S+ ++L + + G Sbjct: 71 NWSNGVAQGTSAVKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVPGVQVT 130 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + N + ++ L N+ Sbjct: 131 VTRSPGNNGGPVKGWLTWVFNGGAASIQATAVAVIAAPGSA------NPGSLFPVALNKC 184 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + + P V+D++ S + + + T+ + + Sbjct: 185 LFDLYWNYTTGQPLNDPSTGQPYVIDINTSYPPSSMTCASGEWTGFNGPTDASTEKSLVS 244 Query: 233 NALLLFLD-----SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + + +I + + T + + G Sbjct: 245 SGNPTNISIGQNINISTGVKTSVYNAIPALPLTVTMPVVSPLTPGATS 292 >gi|310825488|ref|YP_003957846.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309398560|gb|ADO76019.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 414 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 26/176 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDS 286 + A +A + + +GL+ + E+ + + ++ R Sbjct: 52 LVAAVDAAQSLVAQAGPRDY------IGLLAFDGVPEQLLPVRAMEPDAKTELSERLSSL 105 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A++ L ++ ++ ++ LTDGE + + Sbjct: 106 ETGSGTALHEAVELGSSSL----------------HRVLIPGARRKLLLLTDGEPSVGPA 149 Query: 347 -NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIH 400 + + K E+ + + + + +L+ S + +A++L Sbjct: 150 ALADFKTLGAKVAESGVTLHALGLGR-HYIPEMLEALSSPSGTGFAHADDAEALPT 204 >gi|115380522|ref|ZP_01467490.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] gi|115362473|gb|EAU61740.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] Length = 424 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 26/176 (14%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDS 286 + A +A + + +GL+ + E+ + + ++ R Sbjct: 62 LVAAVDAAQSLVAQAGPRDY------IGLLAFDGVPEQLLPVRAMEPDAKTELSERLSSL 115 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A++ L ++ ++ ++ LTDGE + + Sbjct: 116 ETGSGTALHEAVELGSSSL----------------HRVLIPGARRKLLLLTDGEPSVGPA 159 Query: 347 -NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIH 400 + + K E+ + + + + +L+ S + +A++L Sbjct: 160 ALADFKTLGAKVAESGVTLHALGLGR-HYIPEMLEALSSPSGTGFAHADDAEALPT 214 >gi|162448738|ref|YP_001611105.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] gi|161159320|emb|CAN90625.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] Length = 1041 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 42/134 (31%), Gaps = 6/134 (4%) Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ--GVKIPSLPFQKFIIFLTDGENNN 343 S+ L P A+++A + + + K ++ DG + Sbjct: 496 SVPLGPVADGAAIEKAIRAVGPGGGGIYVDITLEAAYQALDRDTSSLKHVLLFADGS--D 553 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 ++ + + A +++ + L Y V +A L VF Sbjct: 554 AENMGPCRAMVEAAMRRGTTTSVVALGQGSDVPELEALSRLGGGRFYLVEDATRLPAVFT 613 Query: 404 NISQLMVHRKYSVI 417 + ++ K +++ Sbjct: 614 Q--ETIMAAKSAIV 625 >gi|119572528|gb|EAW52143.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_a [Homo sapiens] Length = 1153 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|119572529|gb|EAW52144.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_b [Homo sapiens] Length = 1152 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|64654595|gb|AAH96346.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|224831239|ref|NP_001139280.1| integrin alpha-M isoform 1 precursor [Homo sapiens] gi|307148|gb|AAA59544.1| glycoprotein Mac-1 [Homo sapiens] Length = 1153 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|386975|gb|AAA59903.1| neutrophil adherence receptor alpha-M subunit [Homo sapiens] Length = 1145 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 10 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 69 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 70 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 128 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 129 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 187 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 188 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 242 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 243 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 290 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 291 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 324 >gi|307114|gb|AAA59491.1| leukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1152 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|88501734|ref|NP_000623.2| integrin alpha-M isoform 2 precursor [Homo sapiens] gi|1708572|sp|P11215|ITAM_HUMAN RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: Full=Neutrophil adherence receptor; AltName: CD_antigen=CD11b; Flags: Precursor gi|263049|gb|AAB24821.1| leukocyte integrin alpha chain [Homo sapiens] gi|64653358|gb|AAH96348.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|68563402|gb|AAH99660.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|168275740|dbj|BAG10590.1| integrin alpha-M precursor [synthetic construct] Length = 1152 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 94/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|198421146|ref|XP_002121270.1| PREDICTED: similar to cubilin [Ciona intestinalis] Length = 728 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 33/339 (9%), Positives = 91/339 (26%), Gaps = 15/339 (4%) Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + ++ + +++ + + ++++ Sbjct: 400 AVATSTSQLITSPNYPNNYPNNVDCTWTITASDGMRVQLNLIRFSTERSYDYLTISSDGT 459 Query: 138 DSSNNTIFYNMDVMTSY--DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N+ Y+ + TS L+F R + + P+ + + Sbjct: 460 QLARNSGNYSNRIYTSIGNTLTLRFTSDRSVTRPGFNATFVAVPGVEVTTEPTPVEITQA 519 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + + NS + + + N + F+ IDL S + Sbjct: 520 IFQDQTTAKVCQNSHTDLMFLLDSSGSVTSSDFQLAANFVKDFITGIDLTSFQVGVMQYS 579 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFF 314 ++ + + + T Y T+ Sbjct: 580 HYLLNRELDDQPYIT-------TEINIGEYTEADPFKTAMDTIQPHGYTTYTAHAVLKAI 632 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 F + + + K I+ +TDG + + +AK + I I + + Sbjct: 633 RVDFPRSTRFNNSCTSKIIVLITDG---SSSDSSMLRDAALEAKNLGVDIYAIGVGDANT 689 Query: 375 GQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L + + + + + SL + Q + ++ Sbjct: 690 QELVVLTRPESGTKDKIFQIDQYSSLPSILQGLRTKILS 728 >gi|121582838|ref|YP_973280.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596100|gb|ABM39538.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 240 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 65/199 (32%), Gaps = 20/199 (10%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKV 276 + K+ AL AL + S S ++ ++ +GLI + R +++ Sbjct: 37 VSGSMSENGKIDALNVALKEMILSFGKESGLRAEIQVGLITFGGREAHEHLPL-----VA 91 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T A A ++L ++ IPS ++ +I + Sbjct: 92 AKVIGGVEAFKANGGTPMGSAFALARKLLEDKEQ-------------IPSRAYRPVLILV 138 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG + + + ++I A + L + + A Sbjct: 139 SDGAPTDAWEAPLADLKASE-RGQKATRFAMAIGADADLDMLAQFPNDREAPVFKTHEAR 197 Query: 397 SLIHVFQNISQLMVHRKYS 415 + F+ ++ +V R S Sbjct: 198 DIGRFFRAVTMSVVSRSTS 216 >gi|288574994|ref|ZP_06393351.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] gi|288570735|gb|EFC92292.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] Length = 1057 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 64/240 (26%), Gaps = 39/240 (16%) Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 S + + + A+K L +D T Sbjct: 184 VSYEFPSYAWLGWYWKRQKLSWQPTGENKAVKRLSLDVIDPFFYAPSSFSGGVPDDKLGT 243 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + G E + + + T + + S R F ++ Sbjct: 244 SHWYDLLALIDGVETSKN-----DELRAVGATPLE-------KSIYSKGARDCAYEFIKE 291 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA---------------KENFIKIV 365 + ++I LTDGE + S+ + K K ++ Sbjct: 292 EIDY--PCQDNWLIVLTDGE--DSSSDADPPAAVKKLYEANLDDTWPKPYGKKAQPVRTF 347 Query: 366 TISINAS--------PNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 I + S + R + S + Y + +SL+ F+ I + + + S Sbjct: 348 VIGLVDSQSDTLDAMADEGRAWEVDESIKKTAYYATDTESLLEAFRTIFRTIQKNRSSSA 407 >gi|221134029|ref|ZP_03560334.1| vault protein inter-alpha-trypsin [Glaciecola sp. HTCC2999] Length = 757 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 28/246 (11%), Positives = 69/246 (28%), Gaps = 50/246 (20%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++S P+ + + V+D SGSMH + A+ Sbjct: 371 PVISHDPSSMASTVTPSIQQNTIFVLDSSGSMH----------------GTALTQAIDAI 414 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY--VTRDMDSLIL 289 + + + ++ + + S ++V + + Sbjct: 415 REGVSYLTEH----------DTFNIVDFDSEARALWRQSQFADEVSKAEAMRFLRHVDSD 464 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ A+ + L +IF+TDG NN Sbjct: 465 GGTNMQDALALSLTQLLDSSTGLTQ------------------VIFVTDGSINN--EREL 504 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 +I ++ + ++ T+ I A+PN + + + + + + + + Sbjct: 505 LKQIAEQLGDK--RLFTVGIGAAPNSHFMEYAAMLGKGTYTYIDDLTEIQPKMAYLFSQL 562 Query: 410 VHRKYS 415 + Sbjct: 563 RSPMIT 568 >gi|115358178|ref|YP_775316.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170703648|ref|ZP_02894385.1| TadE family protein [Burkholderia ambifaria IOP40-10] gi|115283466|gb|ABI88982.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170131447|gb|EDT00038.1| TadE family protein [Burkholderia ambifaria IOP40-10] Length = 142 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 41/132 (31%), Gaps = 3/132 (2%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 +G + S+ FAL + L+++ +I V K + +A+ A AG V L Sbjct: 6 RGARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPML 65 Query: 73 SRLGDRFESISNHAKRA---LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + ++S + +G + T + + Sbjct: 66 TPTQIANVALSYTQNSLVSGGTGTVPTVAVTQANGTTAGTALTVTVTYTYSGLVLGTALS 125 Query: 130 THMANNRLDSSN 141 + +S+ Sbjct: 126 VLTGPITISASS 137 >gi|297626392|ref|YP_003688155.1| hypothetical protein PFREUD_12030 [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296922157|emb|CBL56725.1| Hypothetical protein PFREUD_12030 [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 140 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 41/131 (31%), Gaps = 9/131 (6%) Query: 17 SEKA-NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ + S++ A + +F+L+ G + + + A A++ S Sbjct: 14 DERGASVSVLIAACIPAFILICGLAVDGAH--------QVSAQRAATVTAAQAARVGSDS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 A++ + A+ + + + ++ + ++ + N Sbjct: 66 AATGRLNGLDARQEAMRAAREHVGTQPGMACTVGIDANDRVHVEVSTRADTAFLSIVGIN 125 Query: 136 RLDSSNNTIFY 146 L++ Sbjct: 126 HLNARGAATAE 136 >gi|190889875|ref|YP_001976417.1| hypothetical protein RHECIAT_CH0000244 [Rhizobium etli CIAT 652] gi|190695154|gb|ACE89239.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 193 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 12/114 (10%), Positives = 32/114 (28%), Gaps = 8/114 (7%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ I K +I FA+ ++L + A Sbjct: 5 NPFTRLALTARRLIRERKGAGAIEFAILFPVLIMLYIGAFEITIGLSVSKR-------AT 57 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A S + +++ +S + + + ++G + Sbjct: 58 RAAGS-IADLVTQQQSITKSTLTEMRSVATSIFVPYNSTSLTLKITGVTVDASA 110 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 17/171 (9%), Positives = 45/171 (26%), Gaps = 22/171 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 T + L+ + L V + +T P+ Sbjct: 227 TSGSMAGTPIELLRETSRAIAAQLKLGDTVSICEWDTSNDWTLAGYAVTGPNDEL----- 281 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + D + T+ ++ Y + ++ ++D Sbjct: 282 LLEKINDVVHGGGTNLYGGLESGY----------------ELAQMVYDPDAINRLVLISD 325 Query: 339 GENNNFKSNVNTIKI-CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 G N ++++ I + I +V + ++ + L V+ Sbjct: 326 GGANAGITDLDLIAENAAYGGSDGIYLVGVGVDDPDDYNDELMDAVTDAGK 376 >gi|209886526|ref|YP_002290383.1| TadE family protein [Oligotropha carboxidovorans OM5] gi|209874722|gb|ACI94518.1| TadE family protein [Oligotropha carboxidovorans OM5] Length = 187 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 50/181 (27%), Gaps = 9/181 (4%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAIL 62 R ++ A+ K + ++ FAL F LI ++ V + +E+ + A L Sbjct: 8 RLRLSNALRRFGANRKGSAAVQFALVAPLFFALIFAIVEVALVFFANQILETGTQDTARL 67 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + +N R + A + ++ +G Sbjct: 68 LFTH---QAQDQNLTGEQVRTNLCNRVSVLLACDGVILDVRSYPAGQPFTVPPFFDGGGN 124 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 S + + ++ + + + L K + R Sbjct: 125 PISSNFLYQPPDPN-----SSNIVVVRAFYKWPLIFTNLGFSLINIGTDKRLLTSTVAFR 179 Query: 183 I 183 + Sbjct: 180 V 180 >gi|149608697|ref|XP_001516114.1| PREDICTED: similar to hCG2002731, partial [Ornithorhynchus anatinus] Length = 180 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 63/171 (36%), Gaps = 29/171 (16%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKR 311 +G++ Y V + + +T + + T + + A + ++ + Sbjct: 1 VGIVQYGENVTHEFNLNK-YTTTEEVLTASNEIIQRGGRQTMTALGIDTARKEAFTEPRG 59 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +K ++ +TDGE+++ I+ C+ I+ I+I Sbjct: 60 -------------ARRGVKKVMVIVTDGESHDNHDLEKVIQDCE---NENIQRFAIAILG 103 Query: 372 SPNGQRL--------LKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVHR 412 S N L +K+ S + ++ +NV + +L+ + + + + + Sbjct: 104 SYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALLTIVEALGERIFAL 154 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 78/232 (33%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVE----KNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDHKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|88857524|ref|ZP_01132167.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] gi|88820721|gb|EAR30533.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] Length = 1220 Score = 45.2 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 41/193 (21%), Gaps = 12/193 (6%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 L A F + + Y G + T YV+ + Sbjct: 319 CETLTEAFRFFSGRSVKYAFEDTNCYQRECGSFSYTGNTPPYDASTMSGSSYVSPFKEG- 377 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + SD T + + + G N Sbjct: 378 -CGSNAYIILITDGVPTVDSDANSYIATLIATK-TDPNNQTTSSSPFYFKVGNKTNSSYL 435 Query: 348 VNTIKICDK-------AKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSL 398 N + + + + TI LL+ ++ ++ L Sbjct: 436 PNLAHWMNNNDVNTNVSGNQNVSLYTIGFGEGAEDAEELLLEAATKGGGKYFGASSSIEL 495 Query: 399 IHVFQNISQLMVH 411 F ++ Sbjct: 496 SQAFSKALSKILE 508 >gi|254456154|ref|ZP_05069583.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] gi|207083156|gb|EDZ60582.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] Length = 1384 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 38/152 (25%), Gaps = 13/152 (8%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + Q TD MK A + ++ + + S + + Sbjct: 415 KDGAAQINKIISSVNARGGTDPYTFMKIA---------QDYYNHGSLSPIDKKSPCQKSY 465 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHY 390 +I + DG+ I + IK ++ + L + Sbjct: 466 VIVIGDGDFFPSMGWNTHINMAKALNNKGIKTFGVAFGTGISPGGLKRFNDLAKAGGTTK 525 Query: 391 N--VVNADSLIHVFQNISQLMVHRKYSVILKG 420 A +L Q ++ K S Sbjct: 526 AIIAPTAAALKTQLQAAISQIIASKLSFTAPA 557 >gi|104779911|ref|YP_606409.1| hypothetical protein PSEEN0657 [Pseudomonas entomophila L48] gi|95108898|emb|CAK13594.1| conserved hypothetical protein [Pseudomonas entomophila L48] Length = 635 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 12/167 (7%), Positives = 38/167 (22%), Gaps = 9/167 (5%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSRLGDRFE 80 ++ L+++ L + + ++ ++ + A L A N S Sbjct: 2 AALTMGLALLFML----LSVDSGRLYLEQRKLQRIADMAALEAAEHSASCNGSGPQAFAL 57 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + + L + + + I + + Sbjct: 58 ARNAATRNGLAVTDPLIA----SCGYLRTGNDNLRRFVADNTRNEAIKVEVSNVVVTSVA 113 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 L + ++S +L + + Sbjct: 114 AGVFAMVQGNTVPRTTTLHASAVAASPMPPLAMLSIRSSLATVNASQ 160 >gi|108760924|ref|YP_631268.1| adventurous gliding motility protein CglB [Myxococcus xanthus DK 1622] gi|108464804|gb|ABF89989.1| adventurous gliding motility protein CglB [Myxococcus xanthus DK 1622] Length = 416 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/310 (11%), Positives = 76/310 (24%), Gaps = 75/310 (24%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV-----------NSAPIC 220 + + + + I L+VD SGSM +N P AP Sbjct: 30 PLAIAQTTVEEVIKARQSKPNIMLLVDTSGSMTYPVNPSPACDVEFKGSIVPCGGEAPCN 89 Query: 221 QDKKRTKMAALKNALLLFLDS--------IDLLSHVKEDVYMGLIGYTTRVEKNI---EP 269 D T+ L++ + FL++ + + + + + + Sbjct: 90 VDVCPTRWTTLQSVVPQFLENSGRFVRFALTTYPETRGGESIPDLCRASTAGALLKTLPE 149 Query: 270 SWGTEKVRQYVTRDMDSL----------ILKPTDSTPAMK-----QAYQ---------IL 305 + + + L L T ++ ++ + Q +L Sbjct: 150 QEDDDSLLAHANEINTLLQGIPNGGEGRPLGGTPTSGSLNFVGGLEGLQDPDRENFVILL 209 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T N + + N ++ + E I+ + Sbjct: 210 TDGLPNCNAANPNSGANPELCKCTIEGNQCQSGYLNRGCLDTDASVTAVRELHEKGIQTI 269 Query: 366 TISINASP---NGQRLLKTCVSSPEY--------------------------HYNVVNAD 396 I A +G +L+ + + + N Sbjct: 270 VIGFGAETAVGDGPAVLEAMARAGGFKRTCSAERPCGEGDTCNPTTGLCNRSFFQAGNQA 329 Query: 397 SLIHVFQNIS 406 L + IS Sbjct: 330 ELASALEEIS 339 >gi|29169160|gb|AAO66330.1| hypothetical conditional gliding protein B [Myxococcus xanthus] Length = 408 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/310 (11%), Positives = 76/310 (24%), Gaps = 75/310 (24%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV-----------NSAPIC 220 + + + + I L+VD SGSM +N P AP Sbjct: 22 PLAIAQTTVEEVIKARQSKPNIMLLVDTSGSMTYPVNPSPACDVEFKGSIVPCGGEAPCN 81 Query: 221 QDKKRTKMAALKNALLLFLDS--------IDLLSHVKEDVYMGLIGYTTRVEKNI---EP 269 D T+ L++ + FL++ + + + + + + Sbjct: 82 VDVCPTRWTTLQSVVPQFLENSGRFVRFALTTYPETRGGESIPDLCRASTAGALLKTLPE 141 Query: 270 SWGTEKVRQYVTRDMDSL----------ILKPTDSTPAMK-----QAYQ---------IL 305 + + + L L T ++ ++ + Q +L Sbjct: 142 QEDDDSLLAHANEINTLLQGIPNGGEGRPLGGTPTSGSLNFVGGLEGLQDLDRENFVILL 201 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T N + + N ++ + E I+ + Sbjct: 202 TDGLPNCNAANPNSGANPELCKCTIEGNQCQSGYLNRGCLDTDASVTAVRELHEKGIQTI 261 Query: 366 TISINASP---NGQRLLKTCVSSPEY--------------------------HYNVVNAD 396 I A +G +L+ + + + N Sbjct: 262 VIGFGAETAVGDGPAVLEAMARAGGFKRTCSAERPCGEGDTCNPTTGLCNRSFFQAGNQA 321 Query: 397 SLIHVFQNIS 406 L + IS Sbjct: 322 ELASALEEIS 331 >gi|2625141|gb|AAB86642.1| CglB [Myxococcus xanthus DK 1622] Length = 416 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/310 (11%), Positives = 76/310 (24%), Gaps = 75/310 (24%) Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV-----------NSAPIC 220 + + + + I L+VD SGSM +N P AP Sbjct: 30 PLAIAQTTVEEVIKARQSKPNIMLLVDTSGSMTYPVNPSPACDVEFKGSIVPCGGEAPCN 89 Query: 221 QDKKRTKMAALKNALLLFLDS--------IDLLSHVKEDVYMGLIGYTTRVEKNI---EP 269 D T+ L++ + FL++ + + + + + + Sbjct: 90 VDVCPTRWTTLQSVVPQFLENSGRFVRFALTTYPETRGGESIPDLCRASTAGALLKTLPE 149 Query: 270 SWGTEKVRQYVTRDMDSL----------ILKPTDSTPAMK-----QAYQ---------IL 305 + + + L L T ++ ++ + Q +L Sbjct: 150 QEDDDSLLAHANEINTLLQGIPNGGEGRPLGGTPTSGSLNFVGGLEGLQDLDRENFVILL 209 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 T N + + N ++ + E I+ + Sbjct: 210 TDGLPNCNAANPNSGANPELCKCTIEGNQCQSGYLNRGCLDTDASVTAVRELHEKGIQTI 269 Query: 366 TISINASP---NGQRLLKTCVSSPEY--------------------------HYNVVNAD 396 I A +G +L+ + + + N Sbjct: 270 VIGFGAETAVGDGPAVLEAMARAGGFKRTCSAERPCGEGDTCNPTTGLCNRSFFQAGNQA 329 Query: 397 SLIHVFQNIS 406 L + IS Sbjct: 330 ELASALEEIS 339 >gi|328907234|gb|EGG27000.1| aerotolerance protein BatA [Propionibacterium sp. P08] Length = 307 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D ++AA K F++S+ + ++ + E + PS Sbjct: 86 DSSLSMKADDASPNRLAAAKAKAKDFVNSLPTG------FNVAVVSISAHPEIRMPPSTD 139 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + + T A+ ++ Q + S Sbjct: 140 RPTVLRALDGIELQD---GTALGEAIDKSLQAVKMAPGGSKD-------------RVPAA 183 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN-----ASPNGQ--------RLL 379 I+ L+DG N S + AK + + TI+ +GQ +LL Sbjct: 184 IVMLSDGGNTQGGSPLVAATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLL 240 Query: 380 KTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V++ + + Sbjct: 241 SDVADRTDAKSWTADSADKLQEVYKQVHSSVG 272 >gi|328865987|gb|EGG14373.1| hypothetical protein DFA_12145 [Dictyostelium fasciculatum] Length = 781 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 48/195 (24%), Gaps = 27/195 (13%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTE 274 + + +K + + + MGL Y V + + Sbjct: 223 CFDTTGSMSSIIQNVKTQVENTVKRLMKDIPSIRIGIMGLGDYCDGKLVLTTQDLTQDVT 282 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + +++ T A ++ + S K + Sbjct: 283 TLVKFIKNV------PSTSGGDA------------PEAYEFALMKAKELTWSEHTSKAFV 324 Query: 335 FLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTI-SINASPNGQRLLKTCVSSPEYHY 390 + D + I CD E IKI + ++ +S + Sbjct: 325 MIGDSNPHQPTFTDLNINWFKECDDLHERGIKIYGVKALGSSIFYND---IAERTGGICI 381 Query: 391 NVVNADSLIHVFQNI 405 + D + +F I Sbjct: 382 DFKKFDLITEMFLAI 396 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 78/232 (33%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVE----KNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATHPKIWVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDIPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|254460888|ref|ZP_05074304.1| TadE-like protein [Rhodobacterales bacterium HTCC2083] gi|206677477|gb|EDZ41964.1| TadE-like protein [Rhodobacteraceae bacterium HTCC2083] Length = 181 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 9/57 (15%), Positives = 24/57 (42%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 + + + + E+ N ++ FAL F +++ I + ++ +E + A Sbjct: 1 MKKLLNTVRNFRSGERGNVTVEFALVFPVFAMILTSSIEMGIITVRQTLLERGLDIA 57 >gi|222055527|ref|YP_002537889.1| von Willebrand factor A [Geobacter sp. FRC-32] gi|221564816|gb|ACM20788.1| von Willebrand factor type A [Geobacter sp. FRC-32] Length = 569 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 42/113 (37%), Gaps = 8/113 (7%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + + + + + + FII LTDG N + I+ C K Sbjct: 464 FYSVMPHGRTASGEAIIATALNLKQSSRKPFIIHLTDGAANWGCGVSDAIRYC---KARN 520 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 ++++TI + P + L E V + D L VF+ + + KY Sbjct: 521 VRLLTIGLECDPMNKTAL--AEEHGEMIQFVNDVDQLPTVFKQL---LSSTKY 568 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 29/200 (14%) Query: 229 AALKNALLLFLDSIDL---LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K + F++ ++ + V G + Y+ VE + + + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRY 128 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ Q S R KF + +TDG Sbjct: 129 L--GRGTFTDCAISNMTQQFQSQTARDV-----------------KFAVVITDGHVT-GS 168 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL---I 399 ++A++ IK+ ++ + Q L + + + Y + D+L Sbjct: 169 PCGGMKMQAERARDMGIKLFAVAPSEDVYEQGLREIASPPHDLYRSNYTITPKDALHIDE 228 Query: 400 HVFQNISQLMVHRKYSVILK 419 + + I + M H Y+ K Sbjct: 229 NTIERIIKAMKHEAYAECYK 248 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 56/182 (30%), Gaps = 33/182 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V+D S S+ + ++ + + K+ + ++ + Sbjct: 613 ALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGS-----IAKDPKSETGARVGVVQYSH 667 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E + + E+ S E V++ I T + A++ AY L + Sbjct: 668 EGTFEAIKLDD---ERINSLSSFKEAVKRL------EWIAGGTWTPSALQFAYNKLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG + + N +C + + TI I Sbjct: 719 RREKAQVFA---------------VVITDGRYDPRDDDKNLGALC----GRDVLVNTIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 29/200 (14%) Query: 229 AALKNALLLFLDSIDL---LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K + F++ ++ + V G + Y+ VE + + + Sbjct: 69 DQIKQFIPRFIEKLENEVYQNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRY 128 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + T + A+ Q S R KF + +TDG Sbjct: 129 L--GRGTFTDCAISNMTQQFQSQTARDV-----------------KFAVVITDGHVT-GS 168 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL---I 399 ++A++ IK+ ++ + Q L + + + Y + D+L Sbjct: 169 PCGGMKMQAERARDMGIKLFAVAPSEDVYEQGLREIASPPHDLYRSNYTITPKDALHIDE 228 Query: 400 HVFQNISQLMVHRKYSVILK 419 + + I + M H Y+ K Sbjct: 229 NTIERIIKAMKHEAYAECYK 248 Score = 44.8 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 15/162 (9%), Positives = 51/162 (31%), Gaps = 17/162 (10%) Query: 241 SIDLLSHVKEDVYMGLIGYTTR--VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++ + + + L+ Y + ++ ++ + + ++ A+ Sbjct: 864 TLARRNDDNMNARIALLQYGSEREQNVVFPLTYNLTEISNALAQIKYLDS--SSNIGSAI 921 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + G ++ + +F+TDG + + + K Sbjct: 922 IHAINNI---------VLSPGNGQRVARRNAELSFVFITDGITGSKN----LEEAINSMK 968 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + + LLK + + + +SL Sbjct: 969 KQDVMPTVVALGSDVDMDVLLKLGLGDRAAIFREKDYESLSQ 1010 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 56/182 (30%), Gaps = 33/182 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V+D S S+ + ++ + + K+ + ++ + Sbjct: 613 ALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGS-----IAKDPKSETGARVGVVQYSH 667 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E + + E+ S E V++ I T + A++ AY L + Sbjct: 668 EGTFEAIKLDD---ERINSLSSFKEAVKRL------EWIAGGTWTPSALQFAYNKLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG + + N +C + + TI I Sbjct: 719 RREKAQVFA---------------VVITDGRYDPRDDDKNLGALC----GRDVLVNTIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 62/195 (31%), Gaps = 26/195 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG-YTTRVEKNIEPSWGTEKVR 277 TK+ K+AL L + H + I + + +R Sbjct: 302 SSASMVGTKLRQTKDALFTILHDLRPQDHFNIVGFSNRIKVWKDHLVSVTP-----NSIR 356 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 TD A+++ Q+L + I ++FLT Sbjct: 357 DGKVYIHHMSPSGGTDINGALQRGIQLLND----------YVAHNDIEDRSVS-LVVFLT 405 Query: 338 DGENNNFKSNVNTIKICDKAKEN---FIKIVTISINASPNGQRL----LKTCVSSPEYHY 390 DG+ + +T KI + +E + I T+ I A + + L L+ C + H Sbjct: 406 DGKPTVGE--THTFKILNNTREATRGRVCIFTVGIGADVDFKLLEKLSLENCGLTRRVHE 463 Query: 391 NVVNADSLIHVFQNI 405 + LI + I Sbjct: 464 DHDARAQLIGFYDEI 478 >gi|302539262|ref|ZP_07291604.1| membrane protein [Streptomyces sp. C] gi|302448157|gb|EFL19973.1| membrane protein [Streptomyces sp. C] Length = 141 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 16/135 (11%), Positives = 39/135 (28%), Gaps = 6/135 (4%) Query: 11 FKKGIASE--KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + I + + + S +ALS + ++++G L+ N A A ++ Sbjct: 1 MRDRIRHDGDRGSLSPFYALSAIGIIMIMGLLVDGGGALNATNKATGLAQEAARAAGQQL 60 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + G + A A D + + E+ + + Sbjct: 61 NVPAAVQGTEITVDPDAAVAAAQDYLATQNVS----GSVTVTDGGQRLEVTVHDTYNTLF 116 Query: 129 MTHMANNRLDSSNNT 143 + + + S Sbjct: 117 AQFVGRSTISVSGTA 131 >gi|224024928|ref|ZP_03643294.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] gi|224018164|gb|EEF76162.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] Length = 339 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 20/226 (8%), Positives = 66/226 (29%), Gaps = 58/226 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + QD +++ K + +++ + + +I + + + Sbjct: 97 DISNSMLAQDVTPSRLEKSKKLISRLVETF-------NNDKVAMIVFAGEAFAQLPITSD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ + + TD A+ A + T + + Sbjct: 150 YISAKMFLESISPSLIATQGTDIRGAIDLAMKSFTP------------------NEGVGR 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGEN+ + +K ++ + + + Sbjct: 192 AIVLITDGENHEGGAVEAAKAAAEKGV----RVFVLGVGSPDGSPIPVEGTNEFRRDKDG 247 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N + + + + V N + + ++ + Sbjct: 248 NVVVTRLNEEMCREIAQAGNGMYVRVDNT---NNAEKALNAEINKL 290 >gi|209546921|ref|YP_002278839.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538165|gb|ACI58099.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 176 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 10/60 (16%), Positives = 27/60 (45%), Gaps = 6/60 (10%) Query: 4 LSRFRFY--FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + R + + ++ I K +I FA+ + +++ +I V + + A +A++ Sbjct: 1 MRRGKRFAPLRRLIGDRKGAAAIEFAILALPLFIILFGIIEVSLMFFVNS----ALDASV 56 >gi|85090839|ref|XP_958610.1| hypothetical protein NCU07565 [Neurospora crassa OR74A] gi|28919985|gb|EAA29374.1| hypothetical protein NCU07565 [Neurospora crassa OR74A] Length = 1161 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 58/197 (29%), Gaps = 28/197 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +++AL + S+ MGL+ + + T+ + Sbjct: 638 KINLVRDALRFMVSSLGDRD------RMGLVTFGSSGGGVPVVGMTTKAWPGWSNVLASV 691 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + + G K + I+ ++D ++ S Sbjct: 692 KPVGQKSHRADVVEGANVAMD----------LLMGRKYNNPIAT--IMLISDASTSDADS 739 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH------ 400 + + AK I I + + + +++ + + V + L Sbjct: 740 VDFVVSRAEAAK---ITIHSFGLGTTHKPDTMIELSTRTKASYTYVKDWMMLRECLAGCL 796 Query: 401 -VFQNISQLMVHRKYSV 416 Q++S V K + Sbjct: 797 GAMQSLSHQNVKLKLKL 813 >gi|319785613|ref|YP_004145089.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171501|gb|ADV15039.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 193 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 20/167 (11%), Positives = 50/167 (29%), Gaps = 11/167 (6%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 +F + + + +I FA+ + F LL+ ++ + M + + A K+ Sbjct: 17 RFFARFLRDRRGATAIEFAILSVPFALLVFAILESCISLAAQEVMANITDD----VARKL 72 Query: 69 VSNLSRLGDRF-ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + R D ++ + L + N + L Y++ + + + Sbjct: 73 RTGQLRAVDVAGTNLRDMICTKLQIIVSQDCPNQLLVDLRQYTSFADAATAGFKIQNGDV 132 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE-----HLLNQRY 169 +T S M Y+ + + + Sbjct: 133 VLTK-GTATQSFSVTAGAAESRNMLRVFYKWPIMTDLLAKSMGGNKT 178 >gi|254477813|ref|ZP_05091199.1| TadE-like protein [Ruegeria sp. R11] gi|214032056|gb|EEB72891.1| TadE-like protein [Ruegeria sp. R11] Length = 186 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 24/51 (47%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++ E +I FA+ + +FL L+ + + +++ +E A + + Sbjct: 17 LRRFRQREDGTATIEFAIVIPAFLFLLMNTVELGLITIQQSMLERALDQTV 67 >gi|78066138|ref|YP_368907.1| membrane protein [Burkholderia sp. 383] gi|77966883|gb|ABB08263.1| membrane protein [Burkholderia sp. 383] Length = 603 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 48/147 (32%), Gaps = 8/147 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + ++ FS++ A+ + + +I V + Y++ ++ + A LA +M Sbjct: 11 RFSSRQRGAFSVM-AIVALLIAVTTLGVIGVGNLFYQRRDVQRIADMAALAAVQRMDDTC 69 Query: 73 SRLGDRFESI-SNHAKRALIDDAKRF------IKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ S ++ A D+ + K S + V + Sbjct: 70 TQPTATATSNAQSNGLNASNGDSISIECGRWDTAVNPKPSYYAAATPGTTQLNAAKVTVT 129 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMT 152 R S + + ++ T Sbjct: 130 RQVPFFFVGPPQPVSAVSTARSTNIDT 156 >gi|196233140|ref|ZP_03131987.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] gi|196222784|gb|EDY17307.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] Length = 992 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 20/181 (11%), Positives = 48/181 (26%), Gaps = 28/181 (15%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K + + V + G + + + V + Sbjct: 471 AKRIAKEAIRVLGAQDEVG--LIDYAWGAQNGYNWVFPLTP-VSEYDKLVPLINAATPGD 527 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 D M+ A L K +I ++DG+ + T Sbjct: 528 MPDFHTPMQLALTGLQQSDAAL------------------KHLIVISDGDP-----SPPT 564 Query: 351 IKICDKAKENFIKIVTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + ++ I + I+IN + + + +Y + +L +F ++ Sbjct: 565 PALVQSFVDSKISVSMIAINPHGGRDISIMQAISQQTGGRYYFPEDPAALPSIFIKEAKT 624 Query: 409 M 409 + Sbjct: 625 L 625 >gi|3183039|sp|Q62469|ITA2_MOUSE RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|473099|emb|CAA82877.1| VLA-2 homologue [Mus musculus] Length = 1178 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 59/186 (31%), Gaps = 29/186 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 + + L + + LI Y E V ++ T+ Sbjct: 190 NFLVKFVTGLDIGPKKTQVALIQYANEPRIIFNL-NDFETKEDMVQATSETRQHGGDLTN 248 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A S K ++ +TDGE+++ I+ Sbjct: 249 TFRAIEFARDYAYSQTSG-------------GRPGATKVMVVVTDGESHDGSKLKTVIQQ 295 Query: 354 CDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLIHVFQ 403 C+ ++ I I++ N L +K S+P Y +NV + +L+ Sbjct: 296 CN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAG 352 Query: 404 NISQLM 409 + + + Sbjct: 353 TLGEQI 358 >gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] Length = 666 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 93/361 (25%), Gaps = 61/361 (16%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A K+ + + + S+ + + + + + S S +V Sbjct: 185 ATTGKLNEEQDKRTVKSDHTSSFTPQGWLSPIYT-KQTNEAANASDLSLTINMDIGLELV 243 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF------------IEHLLNQRYN 170 + + + +I + D+ L+F Sbjct: 244 DINAKYHQ-VNIENSAFGQYSIVLKPQQAINKDFVLEFKPLQKAQAQAALFTQQAKNGEQ 302 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 + +P + ER + VVD SGSMH M Sbjct: 303 FALAMLMPPSDQFTQSERLPREMVFVVDTSGSMH--------------------GQSMEQ 342 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K AL L +D + + + +R+ + Sbjct: 343 AKKALFYALSLLDSDDSFNIIGFNNQVT----AMSDTPLVASDFNLRRARRFIYNLQADG 398 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A+ F + ++FLTDG + Sbjct: 399 GTEIQGALDAVLN-------------------GAQFEGFVRQVVFLTDGS--VSNEDELF 437 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I ++ ++ T+ I ++PN + + + + + + Q + + Sbjct: 438 KSIARTLGDS--RLFTVGIGSAPNRFFMRRAADIGKGSYTFIGSTFDVQPKMQQLFDKLA 495 Query: 411 H 411 H Sbjct: 496 H 496 >gi|320108233|ref|YP_004183823.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319926754|gb|ADV83829.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 342 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 59/205 (28%), Gaps = 23/205 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + K+A FL ++ LI + V+ + + ++++ Sbjct: 110 DTSGSQMNVLPLEKDAGAQFLK-----DVLQPKDEAFLIKFDINVDMLADYTNSPRELKR 164 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ T + + K+ +K ++ LTD Sbjct: 165 ALDKAEI-------NTGAGTGSVTGNGTPRGTLLYDAVYLGAHDKLRQEAGRKVLVVLTD 217 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINA--------SPNGQRLLKTCVSSPEYHY 390 G + + + + A++ I I I + + + Sbjct: 218 GGDQGSQ--LKLQDAIEAAQKANAIIYVILIADRGGFGNFSFGGSGDMDRLSRETGGRMI 275 Query: 391 NV-VNADSLIHVFQNISQLMVHRKY 414 NV N L F IS + + Sbjct: 276 NVGNNGKKLEDAFAQISDELRTQYL 300 >gi|229578924|ref|YP_002837322.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|284997528|ref|YP_003419295.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|228009638|gb|ACP45400.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|284445423|gb|ADB86925.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 356 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 11/82 (13%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKT 381 II LTDG+ + ++ + K+ I I+TI I + N + L K Sbjct: 122 QSQVPTKIIMLTDGKPTDKRNVKDYEKL-------DIPPNTQIITIGIGNNYNERILKKL 174 Query: 382 CVSSPEYHYNVVNADSLIHVFQ 403 S Y++ + L ++F+ Sbjct: 175 ADRSSGKFYHIKDISELPNIFE 196 >gi|220941747|emb|CAX15448.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 429 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 57/179 (31%), Gaps = 20/179 (11%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PT 292 + F++ + + L + V +GL+ + + K SL+ Sbjct: 63 VKDFINGVVNKLDVRPDRVRIGLMQFAEGQRTEFYLNSHNTKQNVLSAIAQLSLMGGRAL 122 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ A+K A +F Q+ ++ +T G + + + Sbjct: 123 NTGAALKYAL-----------DNHFQTSSGSRRRQGVQQVLVLITGGPSQDD-----AKR 166 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 I D+ + T ++ A LKT + Y N L V + I ++ Sbjct: 167 IADRLALEGVL--TFAVGAGQVDLTFLKTAAFVEDLAYYRRNFADLSSVVEKIMTPLIT 223 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 43/136 (31%), Gaps = 6/136 (4%) Query: 21 NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLGDR 78 +F+++ A+ ++ + +G + + + + + ++ + A LAGA +M Sbjct: 1 SFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAA 59 Query: 79 FESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + SN A + + + Y V +++ Sbjct: 60 ANARSNGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLGP 119 Query: 136 RLDSSNNTIFYNMDVM 151 S + +V Sbjct: 120 SRTVSATSTAKATNVD 135 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 37/279 (13%), Positives = 76/279 (27%), Gaps = 60/279 (21%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPED 213 F + + + F P ++ GE I L++D+SGSM Sbjct: 238 YLLPIFNGFI-WNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSME--------- 287 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + KN +L LD + + + + ++ + S Sbjct: 288 ------------KDFSVYKNNILKILDKLAEIP----NWQINIVVFNDES-TARSFSNQE 330 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + T +K+A + + + Sbjct: 331 NNIEDIKVYINNLKANGYTKLYGTIKEALESF------------------KGKIDESSTL 372 Query: 334 IFLTDGENNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 I TDG++ SNV + D K + T+ Q + + + Sbjct: 373 IVFTDGKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQ-YYNQEFFEQVATRGGFT 431 Query: 390 Y----NVVNADSLIHVFQNISQ-----LMVHRKYSVILK 419 + + L NI Q ++ + +I + Sbjct: 432 HVSLNDPTGMHQLQQYIDNIEQKVMTFEIITEELKLITR 470 >gi|332884780|gb|EGK05036.1| hypothetical protein HMPREF9456_03189 [Dysgonomonas mossii DSM 22836] Length = 342 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 17/228 (7%), Positives = 65/228 (28%), Gaps = 59/228 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D +++ K L +D +++ + ++ + + + Sbjct: 99 DVSNSMMAEDISPSRLVKAKQILTRIIDE-------RKNDKVAIVVFAGEAFIQLPLTPD 151 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + ++ + ++ T A+ + + K Sbjct: 152 NQSAKLFLETINPSLVPVQGTAIGSAIDMSMSCFS------------------NDADIDK 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP------------------ 373 I+ +TDGE + + + + + + I + Sbjct: 194 AIVLITDGEGHEGNAEEAAARA----ASKGVHVNVVGIGTAEGAMIPEAENSRDIKRDTQ 249 Query: 374 --------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 N + + + + + N+ +++ + + Sbjct: 250 GQPVVTKLNEEMCRQIAKAGEGLYAHADNS---NSALKSLQAELDKLQ 294 >gi|41054731|ref|NP_032422.2| integrin alpha-2 precursor [Mus musculus] gi|40781666|gb|AAH65139.1| Integrin alpha 2 [Mus musculus] gi|148686423|gb|EDL18370.1| integrin alpha 2 [Mus musculus] Length = 1178 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 59/186 (31%), Gaps = 29/186 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP--TD 293 + + L + + LI Y E V ++ T+ Sbjct: 190 NFLVKFVTGLDIGPKKTQVALIQYANEPRIIFNL-NDFETKEDMVQATSETRQHGGDLTN 248 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ A S K ++ +TDGE+++ I+ Sbjct: 249 TFRAIEFARDYAYSQTSG-------------GRPGATKVMVVVTDGESHDGSKLKTVIQQ 295 Query: 354 CDKAKENFIKIVTISINASPNGQRL--------LKTCVSSPE--YHYNVVNADSLIHVFQ 403 C+ ++ I I++ N L +K S+P Y +NV + +L+ Sbjct: 296 CN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAG 352 Query: 404 NISQLM 409 + + + Sbjct: 353 TLGEQI 358 >gi|256833661|ref|YP_003162388.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] gi|256687192|gb|ACV10085.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] Length = 349 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 53/193 (27%), Gaps = 25/193 (12%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL------ 289 +D L+ + + L + + + V++ +T+ ++L Sbjct: 123 QEIVDLFSTLTDNFQGERIALSVFNSTSRLVFPLTDDYALVKEQLTQAKEALDPRVLTTT 182 Query: 290 ---------KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + L D + F + IIF TD Sbjct: 183 RQDVIDNYLYFTAGANGSLNGWSSLIGDGLANCALLFDEDP--YQDEDRSRSIIFATD-N 239 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKTC-VSSPEYHYNVVNA 395 + + + A I + I + ++ + + + ++ + Sbjct: 240 DLQGEPIYTLQDATNLALSRNISL--IGLYGAGGGDAQSEQEFRDIFLDADGLYFFSDDP 297 Query: 396 DSLIHVFQNISQL 408 D++ + +I Sbjct: 298 DTIPEIVADIQAQ 310 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 39/409 (9%), Positives = 96/409 (23%), Gaps = 72/409 (17%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSS 125 ++ + L D + F +++ + + V + Sbjct: 296 DYAADNGSTAGEEDVNVPAQTVQLWGDERNFENSYLDYNGAYIKKWVEPVLSNNPTSDLH 355 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIP 178 T + N LD + + L + Q N ++ + Sbjct: 356 PEDATTLYNVYLDVIGSEKQEIS--PIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVN 413 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHC-----------AMNSDPEDVNSAPICQDKKRTK 227 + + P I + ++ + + D+N ++ + Sbjct: 414 E-ISENLLSDPNMDIR-IGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNR 471 Query: 228 MAA--------LKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 LKN + K + +G T + Sbjct: 472 TPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNW 531 Query: 278 QYVTRDMDSLIL----------KPTDSTPAMKQA--YQILTSDKKRSFFTNFFRQGVKIP 325 + + T Y + + ++ N+ VK Sbjct: 532 AVMNNMIARDTGDLFKNFETYSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 590 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKIC-----------DKAKENFIKIVTISINASPN 374 ++ + + +K +EN I +I + + Sbjct: 591 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGS 650 Query: 375 ----------GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 G+ +LK + +Y+ N + +I ++IS Sbjct: 651 VAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 699 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 39/409 (9%), Positives = 96/409 (23%), Gaps = 72/409 (17%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSS 125 ++ + L D + F +++ + + V + Sbjct: 316 DYAADNGSTAGEEDVNVPAQTVQLWGDERNFENSYLDYNGAYIKKWVEPVLSNNPTSDLH 375 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIP 178 T + N LD + + L + Q N ++ + Sbjct: 376 PEDATTLYNVYLDVIGSEKQEIS--PIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVN 433 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHC-----------AMNSDPEDVNSAPICQDKKRTK 227 + + P I + ++ + + D+N ++ + Sbjct: 434 E-ISENLLSDPNMDIR-IGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNR 491 Query: 228 MAA--------LKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 LKN + K + +G T + Sbjct: 492 TPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNW 551 Query: 278 QYVTRDMDSLIL----------KPTDSTPAMKQA--YQILTSDKKRSFFTNFFRQGVKIP 325 + + T Y + + ++ N+ VK Sbjct: 552 AVMNNMIARDTGDLFKNFETYSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 610 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKIC-----------DKAKENFIKIVTISINASPN 374 ++ + + +K +EN I +I + + Sbjct: 611 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTAAYHHWLKNKYQENPPSIFSIGLGIDGS 670 Query: 375 ----------GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 G+ +LK + +Y+ N + +I ++IS Sbjct: 671 VAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 719 >gi|332262981|ref|XP_003280535.1| PREDICTED: integrin alpha-M-like [Nomascus leucogenys] Length = 997 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 60/191 (31%), Gaps = 25/191 (13%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +K + ++ + K L+ Y+ + + Sbjct: 162 NPNDFQKMKEFVSTVMEQLK-----KSKTLFSLMQYSEEFWTHFTFKEFQDNPNPRSLVK 216 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + +++ + L + + + F K ++ +TDGE Sbjct: 217 PIMQLFGRTHTATGIRKVVRELFNITQGARKNAF-------------KILVVITDGEKFG 263 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSL 398 + + +A + I + S N ++ L T S P ++ + V N ++L Sbjct: 264 --DPLGYEDVIPEADREGVIRYVIGVGDAFHSMNSRQELNTIASKPPRDHVFQVNNFEAL 321 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 322 KTIQNQLREKI 332 >gi|229593697|ref|XP_001027162.3| Sec23/Sec24 trunk domain containing protein [Tetrahymena thermophila] gi|225567370|gb|EAS06920.3| Sec23/Sec24 trunk domain containing protein [Tetrahymena thermophila SB210] Length = 558 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 31/275 (11%), Positives = 71/275 (25%), Gaps = 78/275 (28%) Query: 157 RLQFIEHLLN-------------QRYNQKIVSFIPALLRIEM-GERPIFLIELVVDLSGS 202 L + + R I S + + I LI ++ Sbjct: 161 VLDNSGSMSSVESYVKNGGVVQLSRKQSVIQSIQEMSNQGNSLQNKEIQLIVFSDEVQIY 220 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + +S L ++ + + + + + + + + + Sbjct: 221 TNIQDKPHNVSSSSLNN--------WDQLIASVQGIQEQMVGVPAEQVNDLLTIYRKSNK 272 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 + T PA+ A +I S++ Sbjct: 273 M--------------------------GSTALGPAILSAVEIAKSNRGSQ---------- 296 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTI--------KICDKAKENFIKIVTISINASPN 374 II TDG N +++ + AK+ +++ I I Sbjct: 297 ----------IILCTDGMANQGLGSISQQQQSTQFYQDVAQYAKQKGVQVSIIGIGDDKM 346 Query: 375 GQRLL-KTCVSSPEYHYNVVNADSLIHVFQNISQL 408 G +++ K + + + + +L NI Q Sbjct: 347 GLQVIGKLAEKTNGSVFKI-DLSNLNDKLDNILQD 380 >gi|119717474|ref|YP_924439.1| membrane protein-like [Nocardioides sp. JS614] gi|119538135|gb|ABL82752.1| membrane protein-like protein [Nocardioides sp. JS614] Length = 573 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 8/145 (5%), Positives = 46/145 (31%), Gaps = 4/145 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + E+ A+++ + + F + + + M++ + L + +V+ Sbjct: 1 MRRQRDERGAVVPFVAVTLTLLIAMAAFAVDLGMQRVVRRDMQALADVVALDLSR-LVNG 59 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + ++ + ++ + + ++ +S ++ Sbjct: 60 RTAAQIEAGYNGFPTISTELGRSVARNRDTLGDAPVVTYKLVHLDAATGLLETSGGAVKA 119 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDY 156 + + N ++ + + + Sbjct: 120 YSGTEVP---NAVWVQAEGSVDFAF 141 >gi|253576546|ref|ZP_04853874.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251843960|gb|EES71980.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 737 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 31/119 (26%), Gaps = 7/119 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 SE+ ++ F + + + I + +E LA A+ + Sbjct: 8 RSEQGAVTVFFIVIFAAVFAFVALFIDFARMFALQAQVE------TLAHAASRSVLSAYD 61 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + E A + + L F + +S+ M + Sbjct: 62 RELAERYGIFAYGQTDGNYIMSKVLQDQFELVRQKDGFPLLNA-RLDSSTASFMHPIGT 119 >gi|114705523|ref|ZP_01438426.1| hypothetical protein FP2506_13694 [Fulvimarina pelagi HTCC2506] gi|114538369|gb|EAU41490.1| hypothetical protein FP2506_13694 [Fulvimarina pelagi HTCC2506] Length = 180 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 41/149 (27%), Gaps = 5/149 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ I FAL +LL + K + +A AA + +L++ Sbjct: 10 RRDDRGAAGIEFALIFPVLILLAISAADAIHAVSIKARLNNA--AASVGDLISREESLTK 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 +D + I + + G V E + + Sbjct: 68 SSVGDIMSVLDDLLLPLDGDRAKISSAAFDIGKGSDPVLKWYEGEASHGRAS---KISIP 124 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 +++ S + + V + L Sbjct: 125 DQMMSKDRATVIQVQVSYDFSPTLSLSAF 153 >gi|111220185|ref|YP_710979.1| hypothetical protein FRAAL0704 [Frankia alni ACN14a] gi|111147717|emb|CAJ59375.1| hypothetical protein FRAAL0704 [Frankia alni ACN14a] Length = 711 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 33/359 (9%), Positives = 77/359 (21%), Gaps = 21/359 (5%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES-LSGYSAVFYNTEIQNI 121 A S + + D R A + ++ + + + Sbjct: 348 AAGSAPSAFPALESDVIAYNRTLTARTGPLSATTLVASYPADGAFTATVPYIVLKRTTDS 407 Query: 122 VNSSRISMTHMANNRLDSSNNT-IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + A R D+ R + S P L Sbjct: 408 APRHAAADAFAAYLRGDAGRRALTAAGFRTPAELAGRDDPGGLAATLNATDGVRSTPPKL 467 Query: 181 LRIEMGERPIFLIELVVDLSG-SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + + S D + P+ D RT++ A +A + L Sbjct: 468 ASTDASDSVLGTARRFFRHSRERGATLAVLDTSGSMALPLTSDAGRTRLQAATDAAMTGL 527 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-----------WGTEKVRQYVTRDMDSLI 288 + ++ + + + Sbjct: 528 RLF--APDSQVGLWQSSAELPDGHRELAPLARLNAAGHAGAGHAGTQRDDLTASLAALKP 585 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A + LT G + Sbjct: 586 GGSTDLYSTAVAAVRELTDRYAADRLNRVVLFTDGDADTGGDDHPDGEQPGRRATASAEP 645 Query: 349 NTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + ++++ I + + L + ++ + Y +AD+L ++ Sbjct: 646 TLDQAVSALRAAADPHRPVQLIVIGCDPGADLASLQRLAGATGGHAYLAPDADALFDIY 704 >gi|21675085|ref|NP_663150.1| hypothetical protein CT2279 [Chlorobium tepidum TLS] gi|21648325|gb|AAM73492.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 335 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 47/159 (29%), Gaps = 27/159 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++++ + +D ++ K LL + L+ + + Sbjct: 96 DISNSMLARDTAPDRLTHAKTELLQISRRLGDG-------RKALLLFAGTPVVQCPLTDD 148 Query: 273 TEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 E + + + + TD A A ++ S + S Sbjct: 149 EEDFEILLDMAAPELITTQGTDYRRAFDAALKLTNSGGELSSNET--------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 ++ +DGE++ + I K + + I + Sbjct: 194 VLVLASDGEDHG----NDLGDIATAMKTRGVHLHVIGVG 228 >gi|307729999|ref|YP_003907223.1| hypothetical protein BC1003_1969 [Burkholderia sp. CCGE1003] gi|307584534|gb|ADN57932.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 602 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 16/163 (9%), Positives = 45/163 (27%), Gaps = 11/163 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +++ A +S ++ + + ++Y + ++ + A AG + S Sbjct: 29 ARRQRGAIALLAA-VWVSLAVIALGAVDIGHFYYARRDLQRTADLAAAAGVQLIASAGGC 87 Query: 75 LGDRFESISNHAKRALIDDAKRFI-----KNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + N + L D S ++A +V S + Sbjct: 88 AAAANSAQLNASANGLPADGTVQSSCGRWDPSANAGQSYFAASGAPINALQVVVSRPVPF 147 Query: 130 THMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK 172 + + S + + +V + Sbjct: 148 LFLIGQTRELSASAVAQATNVRA-----FSLSTSIAGLSGGAL 185 >gi|308457577|ref|XP_003091161.1| hypothetical protein CRE_15032 [Caenorhabditis remanei] gi|308258115|gb|EFP02068.1| hypothetical protein CRE_15032 [Caenorhabditis remanei] Length = 439 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 49/152 (32%), Gaps = 14/152 (9%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + T++V + D TD + AY + K+ Sbjct: 293 WAADDVPTNANVTQEVLTMLDYIPDVTQDNNTDLECLFRYAYDGIGDSKEFD-------- 344 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISINASPNGQRLL 379 +K +I N+ + + + K + E K++ + + + +L Sbjct: 345 ----ERYGIEKVVIVFVAANANDDQDYNESFEFAHKIRTEQDAKVIVVGMGTGLDQAKLS 400 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 K +S + + + D+L + I+ + Sbjct: 401 KLAYAS-GFAFFSTSYDNLQLLIPQINNAICS 431 >gi|13474654|ref|NP_106223.1| hypothetical protein mll5590 [Mesorhizobium loti MAFF303099] gi|14025409|dbj|BAB52009.1| mll5590 [Mesorhizobium loti MAFF303099] Length = 421 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 12/151 (7%), Positives = 45/151 (29%), Gaps = 8/151 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + K ++ FAL V L++ + + S + ++ Sbjct: 234 FWSDRKGVAAVEFALIVPILLIMYFMTMEASQAIETSKKVSR--------IGSMVADLVT 285 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + + + + ++ +++ +V S +++ + Sbjct: 286 QQPTIVKADLDAIMKIGTSTIQPYNRSTPNITITAIQVTTDTPPKVLVVWSRQVANGVYS 345 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + T+ + V ++ R+ Sbjct: 346 AAAAAGTTTTVPATLKVAGTFLIRVDSNLSY 376 >gi|313238562|emb|CBY13611.1| unnamed protein product [Oikopleura dioica] Length = 445 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 45/172 (26%), Gaps = 21/172 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ED+ + Y+ S + + ++ + ++ + + A K Sbjct: 253 GNFGSEDLQFCVYQYSESAANVFSLSESADFDSLDLKTALEEGIEIPEDRGANIGAAFKT 312 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 F + +I LTD N + Sbjct: 313 -----------IHDEGFNAINGWRKNDQIPSVLIVLTDNLNTVDFYDDL-----QYVHNK 356 Query: 361 FIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 ++V + I + L S E + V + L +V + + Sbjct: 357 AYRVVAVGIGENVENSSLSSIASLPSDENVFTVRDFTELGNVVDEVGYDICQ 408 >gi|254491600|ref|ZP_05104779.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224463078|gb|EEF79348.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 338 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 70/237 (29%), Gaps = 48/237 (20%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I L++D S SM +A + +K + ++ L F+ + + Sbjct: 84 AEIVLLLDRSRSMDAPFA---VKTQAAAVSVGGDNSKRSVARDYLTEFVKN-------RP 133 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 D G + ++T+ + ++ E + + + L T+ A+ + + Sbjct: 134 DDRFGFVFFSTKAINLLPLTYDKESILATINANALGKGLSDTNMAEALVSSAGMFEG--- 190 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS-- 368 + + ++ ++DG S +I KE + I I Sbjct: 191 --------------QTYRGSRIVLLVSDGGQQ--LSAEAKEQISRLYKEMNLSIYWIYLK 234 Query: 369 ------INASPNGQRLLKTC-----------VSSPEYHYNVVNADSLIHVFQNISQL 408 ++ + L + P + + ++ I + Sbjct: 235 SNQEMSLDEGEDDNLLWADIPERKLHKFFQTIGVPYRAFEAGSLEAFAEAMAEIDRQ 291 >gi|229582324|ref|YP_002840723.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013040|gb|ACP48801.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 356 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 11/82 (13%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK----IVTISINASPNGQRLLKT 381 II LTDG+ + ++ + K+ I I+TI I + N + L K Sbjct: 122 QSQVPTKIIMLTDGKPTDKRNVKDYEKL-------DIPPNTQIITIGIGNNYNERILKKL 174 Query: 382 CVSSPEYHYNVVNADSLIHVFQ 403 S Y++ + L ++F+ Sbjct: 175 ADRSSGKFYHIKDISELPNIFE 196 >gi|64654539|gb|AAH96347.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 36/334 (10%), Positives = 95/334 (28%), Gaps = 31/334 (9%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N S + + S+ + + Sbjct: 18 NLDTENAMTFQENARGFGQSVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIH 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS-----GS 202 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 A+ P++ + D + + + F+ ++ K L+ Y+ Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEE 195 Query: 263 VEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + R V L T + +++ + L + + F Sbjct: 196 FRIHFTFKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--- 250 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQR 377 K ++ +TDGE + + +A + I + S ++ Sbjct: 251 ----------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQ 298 Query: 378 LLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 L T S P ++ + V N ++L + + + + Sbjct: 299 ELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|260892685|ref|YP_003238782.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] gi|260864826|gb|ACX51932.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] Length = 209 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 16/42 (38%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM 53 + + EK + FA + +++ + + K+ + Sbjct: 1 MRVLKGEKGYALLYFAFLLPVLVIVAAWALDFTRIRLVKDQL 42 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 59/186 (31%), Gaps = 26/186 (13%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE---------KNIEPSWGTEKVRQYVTRD 283 + ++ + + V +G++ Y+T I + + D Sbjct: 223 EVVKHWIQQVTSGFDISSSVQVGVVQYSTYQYRKVVQPFIKTEIRLGEYKDHILFDAAVD 282 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + A+++ +G ++ ++ LTDG++ + Sbjct: 283 KIKYHDRSTFTAYAIRKTVN-------------EDFKGNMSRYPDSRRVMVLLTDGQSTD 329 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + +AK+ ++ + + + L SP+ V + + L+ + Sbjct: 330 KED---LSSAAAEAKQEGVETFAVGVGSKI-ILSELVLIAGSPDKVITVNDFNELLGIVN 385 Query: 404 NISQLM 409 + + Sbjct: 386 QLQGDI 391 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 47/164 (28%), Gaps = 27/164 (16%) Query: 248 VKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVTRDMDSLILKPT-DSTPAMKQAYQI 304 + I +++ + + + + +Q + T ++ A Q Sbjct: 275 GPRFTRVAFITFSSVGKSRTHFNLNR-YDNAQQIIEAIRRVESTGGTTAVGEGIRIATQQ 333 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 ++ P +K ++ TDG +N ++ +AK + Sbjct: 334 ------------QEKRMGGRPISIAKKAMLIFTDGWSNKGPD---PEEMSKEAKAAGFVL 378 Query: 365 VTISINASPN--------GQRLLKTCVSSPEYHYNVVNADSLIH 400 +I + Q + + + Y+ N L+ Sbjct: 379 YSIGYEGNGRPDAEFAGLNQYTMDAIADTMHHVYSERNFSELVE 422 >gi|171686222|ref|XP_001908052.1| hypothetical protein [Podospora anserina S mat+] gi|170943072|emb|CAP68725.1| unnamed protein product [Podospora anserina S mat+] Length = 1046 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 60/197 (30%), Gaps = 28/197 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +++AL + ++ MGL+ + + T K ++S Sbjct: 519 KINLVRDALRFMVHTLGDRD------RMGLVTFGSSGGGV-PIVGMTTKAWPGWGNVLNS 571 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + S A + + G K + I+ ++D ++ S Sbjct: 572 IKPVGQKSHRA---------DVVEGANVAMDLLMGRKYNNPIAT--IMLISDASTSDADS 620 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH------ 400 + + AK I I + + + +++ + + V + L Sbjct: 621 VDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKDWMMLRECLAGCL 677 Query: 401 -VFQNISQLMVHRKYSV 416 Q +S V K + Sbjct: 678 GAMQTLSHQNVKLKLKL 694 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 33/255 (12%), Positives = 76/255 (29%), Gaps = 37/255 (14%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + AL G V+D SGSM+ + + + Sbjct: 380 RMTPERTLDATVLTALQGSFAGVHQRGNTLAVLDTSGSMNEEV--------PGSAGRSRL 431 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + A K+A+ LF + DL G + E+ + E++ Sbjct: 432 SVALDAAKSAIPLFAEDSDLGLWQFSTRLRGDQDW----EELVPLGPMGERL-------- 479 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF---IIFLTDGEN 341 T + A+ A + + + ++ LTDG Sbjct: 480 ----GAGTR-SQAVMDAVNRIEPRGDTGLYDTALAAFRYMNQHYVPGRPNQVVLLTDG-K 533 Query: 342 NNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 N+ ++ ++ + ++++TI A + L + ++ Y ++ Sbjct: 534 NSDPGSIALDELVRILRREYSPQRPVQVITIGYGADTDLAALSRISAATGAETYPALDP- 592 Query: 397 SLIHVFQNISQLMVH 411 +F+ + + Sbjct: 593 --NTIFEVLVDALTE 605 >gi|291525766|emb|CBK91353.1| hypothetical protein EUR_23650 [Eubacterium rectale DSM 17629] Length = 182 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 7/69 (10%), Positives = 25/69 (36%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + R + K N + + L++I + + ++ A +A+++ Sbjct: 1 MRKLRKKTAALLKDRKGNMMPLVVAVTICMLIIILGVAEYMRLVITAAGIKDAMESAVIS 60 Query: 64 GASKMVSNL 72 + + + Sbjct: 61 TVNDNYNEV 69 >gi|146338130|ref|YP_001203178.1| hypothetical protein BRADO1027 [Bradyrhizobium sp. ORS278] gi|146190936|emb|CAL74941.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 192 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 11/116 (9%), Positives = 32/116 (27%), Gaps = 8/116 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ A + FA V L++ + A + + A Sbjct: 13 FRLLRRFRADHSGVAATEFAFIVPLMLVMFFGTVEFCSAI--------AVDRKVTLMART 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + S+ +S ++ A + + + ++S + + + Sbjct: 65 LSDLTSQSTSVGDSDMSNFFAASTGIMYPYSTSPVNATISEIVVDSTGKQATVVWS 120 >gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] Length = 684 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 71/244 (29%), Gaps = 47/244 (19%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +V +P +R + V+D SGSMH Sbjct: 307 NDDYALVMLMPPSDEFIAAQRLPREVIFVIDTSGSMH--------------------GES 346 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K+AL L ++D + + +R+ Sbjct: 347 LEQAKSALFFALANLDPQDSFNIIEFNSKV----NALNAQALPANDFNIRRARNFVYGLK 402 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A + Q+L + + + + I+FLTDG Sbjct: 403 ADGGTEIGLAFE---QVLDNSEHAD----------------YLRQIVFLTDGSI--SNET 441 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +I ++ +I TI I ++PN + + + + + +N+ Sbjct: 442 EVFAQIKGSLGDS--RIFTIGIGSAPNSYFMTRAATLGRGTFTFIGDVTDVQRTMKNLFV 499 Query: 408 LMVH 411 + + Sbjct: 500 QLAN 503 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/314 (9%), Positives = 84/314 (26%), Gaps = 29/314 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + ++ + A + + + ++ ++ T Sbjct: 614 ASGSGTFTITASAGLKSLTLDGPDNADATLTLDRLADLVNNPVTLTTSKGTLTLTGYDAT 673 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + ++S T+ + + + + + + +++ L+ Sbjct: 674 TGKVSYTYQTSGQQAHTGDDTNVQDHFQITVE---DKFGG-KATGDLGVLITDTAPSLKP 729 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + M S + S + +++A LK+++ LD Sbjct: 730 IAESSALSSHGTNI-----MLTLDTSGSMNYGSGVYNGWTQLSRLAVLKSSVNNLLDKYG 784 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V+ + ++ + T + + + V+ T+ A+ A Sbjct: 785 EAGDVR----VMIVEFNTSASQKGGGWMSLAEAKALVSGLGY---GGGTNYQTALDTAMN 837 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC--DKAKENF 361 + Q F TDGE ++ +S + + +N Sbjct: 838 AWNNSGTGKLEGG-----------NVQNISYFFTDGEPDSNRSVNSAQQATWEKFLADNH 886 Query: 362 IKIVTISINASPNG 375 I I + G Sbjct: 887 INSYGIGLGTGATG 900 >gi|260434112|ref|ZP_05788083.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417940|gb|EEX11199.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 187 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 7/56 (12%), Positives = 21/56 (37%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 ++ R R + +E + +I F+ + + + + ++E A + Sbjct: 1 MIGRLREAARLFRRTEDGSATIELLFWFPFFMWVTYSGVDLGMMSFHHANLERALD 56 >gi|218507575|ref|ZP_03505453.1| hypothetical protein RetlB5_08145 [Rhizobium etli Brasil 5] Length = 161 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 12/114 (10%), Positives = 32/114 (28%), Gaps = 8/114 (7%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ I K +I FA+ ++L + A Sbjct: 5 NPFTRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKR-------AT 57 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A S + +++ +S + + + ++G + Sbjct: 58 RAAGS-IADLITQQQSITKSTLTEMRSVATAIFVPYNSTSLTLKITGITVDASA 110 >gi|156402481|ref|XP_001639619.1| predicted protein [Nematostella vectensis] gi|156226748|gb|EDO47556.1| predicted protein [Nematostella vectensis] Length = 154 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 41/152 (26%), Gaps = 20/152 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K L + K+ ++G+I Y + E T+K + Sbjct: 23 FKKVLDFVGGIVKEFEVSKQGTHVGIIRYDHKAEILKPFGQVTDKQGVLDKISKITFTGG 82 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A+ A L +I LTDG + + + Sbjct: 83 GTKTGQALTLAMDGLYQ---------------IDNRKEVPDVLIVLTDGASKDSVVDP-- 125 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTC 382 +K K + + + + I L Sbjct: 126 ---ANKLKNSGVTVFAVGIGKEFKEDELNLIA 154 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|53733375|gb|AAH83523.1| Cfb protein [Danio rerio] Length = 653 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 57/219 (26%), Gaps = 13/219 (5%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKK-RTKMAALKNALLLFLDSIDLLSHVKEDVY 253 D G D+ A D K + ++ I Sbjct: 237 FEDDQHGKKISLDRGGKLDIYIAVDASDSIDPKDFDKAKKIIKTLIEKISYYEVSPNYEI 296 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + ++ K + EK R+ + D + I K Sbjct: 297 LMFATDVDQIVKMRDFKTN-EKARKILKIFEDLDNFNYDKKGD--RTGTNIAKLYLKILD 353 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-------KENFIKIVT 366 + + K L Q II TDG+ N + + + +EN + + Sbjct: 354 SMSLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYV 413 Query: 367 ISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 + + + + + + + + D + + F Sbjct: 414 FGVGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFD 452 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|18858247|ref|NP_571413.1| complement factor B [Danio rerio] gi|1015970|gb|AAB19093.1| complement factor B [Danio rerio] Length = 737 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 57/219 (26%), Gaps = 13/219 (5%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKK-RTKMAALKNALLLFLDSIDLLSHVKEDVY 253 D G D+ A D K + ++ I Sbjct: 240 FEDDQHGKKISLDRGGKLDIYIAVDASDSIDPKDFDKAKKIIKTLIEKISYYEVSPNYEI 299 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + ++ K + EK R+ + D + I K Sbjct: 300 LMFATDVDQIVKMRDFKTN-EKARKILKIFEDLDNFNYDKKGD--RTGTNIAKLYLKILD 356 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-------KENFIKIVT 366 + + K L Q II TDG+ N + + + +EN + + Sbjct: 357 SMSLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYV 416 Query: 367 ISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 + + + + + + + + D + + F Sbjct: 417 FGVGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFD 455 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + T Sbjct: 111 LSDFPVVSTATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAIT--YRGGGT 168 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 169 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 207 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 208 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 258 >gi|320161334|ref|YP_004174558.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] gi|319995187|dbj|BAJ63958.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] Length = 652 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 17/271 (6%), Positives = 63/271 (23%), Gaps = 48/271 (17%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIY-----VLDWHYKKN----------------- 51 + +IFA+++++ LL +G + V K+ Sbjct: 1 MAQRQHGQVLVIFAVALVALLLFVGLAVDSGVLYVTYGQLKRASDAAAVAAANNFKRGQT 60 Query: 52 --SMESAN---------NAAIL-------AGASKMVSNLSRLGDRFESISNHAKRALIDD 93 M++A + + L ++ + + Sbjct: 61 ELEMKAAAMEIYRLHNVDTSTLDLNVQICDDDGDGFTDGTYKSGSDVLFDAGLLDSDEQK 120 Query: 94 AKRFIKNHIKESLSGYSAVFYNTE------IQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 F + + SA + ++ ++ + + ++ Sbjct: 121 VANFFEKICPRTDPTVSAGQQTASPRKLVYVSATQRANFYFLSLLGITHANLRTYSVGE- 179 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + E + ++ + + + + + Sbjct: 180 -AASVDLVIVIDTSESMASECQSYDASGVCAFFKSPGYNNTTLDDYDPYAPGNCNTIPTT 238 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + NS + + ++ ++ + Sbjct: 239 YAGITVQNSCYPLVQAEVSAISLVRTLYEGY 269 Score = 41.4 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 18/143 (12%), Positives = 32/143 (22%), Gaps = 33/143 (23%) Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + + + N T I Sbjct: 512 IDTNSNTCPPGSIHVNIPDPRYSAADYARDMADAAALTVNQSSNPLDSRYNPTEPI---- 567 Query: 358 KENFIKIVTISIN-----------------------ASPNGQRLLKTCVSSP-----EYH 389 N I I TI + +G R+ C S + Sbjct: 568 -GNDIAIYTIGLGAVTPDNPPAWLDTGEKILRYMAAVGDDGDRVTDPCASVGPKRSCGNY 626 Query: 390 YNVVNADSLIHVFQNISQLMVHR 412 Y + D+L +F +I+ + R Sbjct: 627 YYAPSGDALRAIFDDIASRIYTR 649 >gi|315081597|gb|EFT53573.1| TadE-like protein [Propionibacterium acnes HL078PA1] Length = 169 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 39/126 (30%), Gaps = 1/126 (0%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ + Sbjct: 36 RGAVAVEAALILPALLMIAAAATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVGQ 95 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 ++ A + A + + + G++ I S + + ++ Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPV-GFAGTASARVSCTIKLSDLLVPGMPGSFHIE 154 Query: 139 SSNNTI 144 S ++ Sbjct: 155 SVAHST 160 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 52/157 (33%), Gaps = 20/157 (12%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ Y+G++ Y++ V ++E + + ++ T + A+ QA Sbjct: 417 NSIKYINSSNYIGVVSYSSNVNVDLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQA 476 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +L + + L+DG++N+ + A+ Sbjct: 477 MIMLRDF--------------TKDNPNVSPMVFLLSDGQSNSGSEFSDIDGAIATAQ--- 519 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 I I TI + N L + N D + Sbjct: 520 IPIYTIG--YNANLNELKAISEINEAATINADTDDVI 554 >gi|16263307|ref|NP_436100.1| hypothetical protein SMa1572 [Sinorhizobium meliloti 1021] gi|307304374|ref|ZP_07584125.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|14523986|gb|AAK65512.1| conserved hypothetical protein [Sinorhizobium meliloti 1021] gi|306902576|gb|EFN33170.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 141 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 36/101 (35%), Gaps = 1/101 (0%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASKMV 69 ++ S+ ++ FAL + LLL+ +I Y +N + A + A ++ Sbjct: 8 LRRLFRSQSGATAVEFALVCLPLLLLVFGIIEFGRAFYVRNELSHAVDVAARRVLIGQIA 67 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 + + + + D KE++ G + Sbjct: 68 RDATDSEALTKLAGAVRESFHSGDPTLLTIAVTKETVDGIA 108 >gi|295093271|emb|CBK82362.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Coprococcus sp. ART55/1] Length = 612 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 39/361 (10%), Positives = 105/361 (29%), Gaps = 56/361 (15%) Query: 42 YVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 V+ A + A +AG + MV++ S + ++ ++ + Sbjct: 86 DVMYAE-------EACDVATVAGDTAMVTDTSNSMYTEIAYDTREYDSVAENGFVSTADR 138 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSR--ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + + + +++ + S L+ + + + Sbjct: 139 PLSTFAADRDTASYSNVRSYIESGCLPPDGAVRIEEMLNYFTYDYRRKPEDGEKFSIYTE 198 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + + K++ I+ G++ + ++D SGSM+ Sbjct: 199 YSD--CPWNKATKLMMVGINTDEIDFGDKKPSNLVFLIDTSGSMYED------------- 243 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 K+ ++ + + +++D + ++ Y + + G+E+ Sbjct: 244 ------NKLPLVQQSFAMLAENLDEND------KVSIVTYAGEDTVVLSGTSGSEQYTIN 291 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ A+ AY++ + +I TDG Sbjct: 292 -EALSSMTAEGCTNGGDAIITAYELAEKNFIEGGNN----------------RVILATDG 334 Query: 340 ENNNF--KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNAD 396 + N + I ++ KEN I + + L+ + + + +A Sbjct: 335 DLNVGLTSESDLVDLITEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAY 394 Query: 397 S 397 Sbjct: 395 E 395 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 60/219 (27%), Gaps = 40/219 (18%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 L+VD SGSM + D + K+ ++ L L + Sbjct: 26 DDAPTAMLIVDGSGSMWGRLAPDNKP-------------KIDVVREKLATILQTPSSTRV 72 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 G VE P + + + T A++ A + + Sbjct: 73 GLVSFGHRRRGDCNDVELIASPDSERAALLGPLAKLNPRGPGPVTA---ALEIAADAIGT 129 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-FIKIVT 366 + II + DG +N + + D AK + + Sbjct: 130 SRPAQ--------------------IIIVGDGADNCQQDSCAAAN--DFAKSAPGVAVQV 167 Query: 367 ISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN 404 I I + + ++ +Y+V +A L Sbjct: 168 IGIGVPATERPRIACVAQATGGRYYDVTDAAGLNAALDE 206 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 50/142 (35%), Gaps = 25/142 (17%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQ 376 + + I TI++ S + Sbjct: 426 EAVSRSGAIIHTIALGPSAARE 447 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 50/142 (35%), Gaps = 25/142 (17%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 343 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 402 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 403 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 439 Query: 355 DKAKENFIKIVTISINASPNGQ 376 + + I TI++ S + Sbjct: 440 EAVSRSGAIIHTIALGPSAARE 461 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 50/142 (35%), Gaps = 25/142 (17%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + Q +T ++ T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSAAHIQNYLIKITSSSDYQKITANLPQQASGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE+N +S C Sbjct: 389 HGLQAGFQAITSSDQSTSGSE----------------IVLLTDGEDNGIRS-------CF 425 Query: 355 DKAKENFIKIVTISINASPNGQ 376 + + I TI++ S + Sbjct: 426 EAVSRSGAIIHTIALGPSAARE 447 >gi|27365108|ref|NP_760636.1| hypothetical protein VV1_1751 [Vibrio vulnificus CMCP6] gi|37680842|ref|NP_935451.1| hypothetical protein VV2658 [Vibrio vulnificus YJ016] gi|27361254|gb|AAO10163.1| hypothetical protein VV1_1751 [Vibrio vulnificus CMCP6] gi|37199591|dbj|BAC95422.1| hypothetical protein [Vibrio vulnificus YJ016] Length = 406 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 59/206 (28%), Gaps = 21/206 (10%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 KG ++ ++ ++++SF+++ I V + +++A +AA LA A + Sbjct: 8 KGKRKQQGLVLVLVTVAMLSFVIMAALAIDVTHQVVNRTKLQNAVDAAALAAAMVADATH 67 Query: 73 SRLGDRFESIS--NHAKRALIDDAKRFIKNHIKESLS-----------GYSAVFYNTEIQ 119 + + N A + S Y Sbjct: 68 DTPTATAAAKTTLNSMHSASGNSELDIDSATFSIDYSNDPLTFPDSSFNSDEDIYVRISI 127 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-------QFIEHLLNQRYNQK 172 + ++ S M + ++ S + + + + + + + Sbjct: 128 DDLSLSEFFMQALGLSK-TVSASAVAGPSSSINTISNVVPIGVCKGDESGGIYGYNPAEV 186 Query: 173 IVSFIPALLRIEMGERPIFLIELVVD 198 V + MG L++ Sbjct: 187 YVLKVGDSTMTTMGSGNYHLLDFGSG 212 >gi|305666869|ref|YP_003863156.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] gi|88709093|gb|EAR01327.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] Length = 365 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 59/185 (31%), Gaps = 17/185 (9%) Query: 231 LKNALLLFLDSIDLLSHVKEDVY-MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 LK A + F++++ L+ + + + + +++ + D + Sbjct: 146 LKTASVSFVNNVMPLTTEDSYKMAIYWFDGEDELHLLNDLTSSKQELVSAINGITDDISN 205 Query: 290 -KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A+ ++ +I S ++ TDG + + Sbjct: 206 DPSTDLYGAVIKSTEIAEELLADSIKEEIIGAA----------SVVIFTDGTDQASRYTE 255 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY-NVVNADSLIHVFQNISQ 407 T + I +I + A + Q L + NA+ L + F +IS Sbjct: 256 QTALKKVEDAAANISFFSIGLGAEIDTQVLTDI----GKTFSVFAGNAEELENTFNDISI 311 Query: 408 LMVHR 412 + R Sbjct: 312 KISER 316 >gi|284997525|ref|YP_003419292.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|284445420|gb|ADB86922.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 380 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 49/196 (25%), Gaps = 43/196 (21%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ K + I + + I +++ V E + Sbjct: 46 TSGSMSGYKIELAKQGAIELFKRIPKGN------KVSFITFSSNVNVIKEFVDPLD---- 95 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T A+ A + +++ LTD Sbjct: 96 LTNEILQIAAGGQTALYTAILTANSLAKK-------------------YQMPTYLLLLTD 136 Query: 339 GENNNFKSNVNTIKICDKAKE---NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 G + + + K +++ + I N Q L + Y++ +A Sbjct: 137 GNPTDET------NVGNYLKLPYFEKMQVYSFGIGDDYNEQLLQNISDKTSGVMYHISDA 190 Query: 396 DSL-----IHVFQNIS 406 + + I+ Sbjct: 191 NEIPQKLPQKAVTQIA 206 >gi|260461955|ref|ZP_05810200.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259032202|gb|EEW33468.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 207 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 16/181 (8%), Positives = 50/181 (27%), Gaps = 12/181 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R ++ + ++ FAL V L++ + + S Sbjct: 13 IRGKAVGFWSNRRGVAAVEFALIVPILLVMYFMTMEASQAIETSKKVSR--------IGS 64 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + +++ + + + + + ++ ++ N+V S + Sbjct: 65 MVADLVTQQPTIVAADLDAIMKIGTSTIQPYNRSTPSIIITAIQVTTDTPPKVNVVWSRK 124 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVM----TSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + + T+ + V L + + Q Q+ + +L Sbjct: 125 LVNGVSSIATTLPATTTVPTTLRVAGTFLIRVQSNLSYSPIINWQPDTQQKLGLTQSLST 184 Query: 183 I 183 Sbjct: 185 T 185 >gi|220922038|ref|YP_002497339.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219946644|gb|ACL57036.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 138 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 11/125 (8%), Positives = 31/125 (24%), Gaps = 4/125 (3%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + ++ S+ FAL ++L +I Y +A +A ++ Sbjct: 2 RHRRSFPHNQDGTNSVEFALLAPVLIVLGFGIIEFGIMIYT----LNAAESAARDVTRRL 57 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 +N ++ + + S + + + Sbjct: 58 ATNRISAAQASSAVIQQLPSWVAAGTTVNVTQTAPTDPSSNRFTTEVAFSAKVATPTTLL 117 Query: 129 MTHMA 133 Sbjct: 118 SWAYG 122 >gi|291411005|ref|XP_002721795.1| PREDICTED: integrin alpha M [Oryctolagus cuniculus] Length = 1155 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/228 (13%), Positives = 67/228 (29%), Gaps = 45/228 (19%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + I ++D SGS+ ++ S + Q K + AL F Sbjct: 145 PQQESDIAFLIDGSGSIDSTDFQRMKEFVSTVMEQFTKSNSLFALMQYSEEFRTHFTFSD 204 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + L+ +RQ + T + + + L Sbjct: 205 FKRNPNPRALVKP----------------IRQLL---------GRTHTATGILKVVTELF 239 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + +K ++ +TDGE + + +A+ + Sbjct: 240 HSSSGARANA-------------RKVLVVITDGEKFGDT--LEYEDVIPRAEREGVIRYV 284 Query: 367 ISINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + S ++ L T S P + + V N ++L + + + + Sbjct: 285 VGVGDAFNSEQSRQELNTIASKPSREHVFRVNNFEALNTIRNQLQEKI 332 >gi|282853088|ref|ZP_06262425.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|282582541|gb|EFB87921.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|314922731|gb|EFS86562.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1] gi|314965815|gb|EFT09914.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2] gi|314982958|gb|EFT27050.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3] gi|315091262|gb|EFT63238.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4] gi|315094498|gb|EFT66474.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1] gi|315105219|gb|EFT77195.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2] gi|327328992|gb|EGE70752.1| putative membrane protein [Propionibacterium acnes HL103PA1] Length = 169 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 11/117 (9%), Positives = 23/117 (19%), Gaps = 4/117 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVG----ESI 91 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 L+ T + + Sbjct: 92 AVGQRVGLAELAGTRCSNPAIAVDSSDLTLPVGSAGTTSARVSCIIKLSDLLVPGMP 148 >gi|149375210|ref|ZP_01892982.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] gi|149360574|gb|EDM49026.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] Length = 658 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 28/264 (10%), Positives = 75/264 (28%), Gaps = 40/264 (15%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGER----PIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + ++ +P + + ++VD+SGSM ++ Sbjct: 1 MRLLSRLFVVLLALPTTALAQQAPTLQLPDSADVRIIVDISGSMKTNDPNNLR------- 53 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + A+ L + ++ G+ + V + T+ R Sbjct: 54 ------------RPAVRLLARMLPGQANA------GVWTFGQYVNMLVPHGKVTDDWRGL 95 Query: 280 VTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 D + T+ A++ A T+F Sbjct: 96 AVERSDEINSVALRTNLGEAIQVASDDYLLGSDSLDNTDFILLTDGKVD--------ISD 147 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + N+ + + D+ + T++++ + L + + +AD+ Sbjct: 148 NESANDRERERILGALLDELSSRGATLHTVALSEEADLALLKSLAERTGGRYALASSADA 207 Query: 398 LIHVF-QNISQLMVHRKYSVILKG 420 L F + ++ + ++ + G Sbjct: 208 LTLAFLEALNTAVPQQQIPIEDNG 231 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 50/171 (29%), Gaps = 29/171 (16%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-----PSWGTEKVRQYVTRDM 284 +K ++ LD++ +V + + + E + + + Sbjct: 218 LIKASVTEMLDTLSDDDYVN------VARFNEKAEAVVPCFDHLVQANVRNKKIFKEAVQ 271 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 TD A+ L + +P K I+ TDG + Sbjct: 272 QMQAKGTTDYKSGFHFAFNQLLNKTN-------------VPRANCNKIIMLFTDGGEDRA 318 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVN 394 + +K +++ T S+ L+ S+ Y++ + + Sbjct: 319 QDIFEQYNWPNKT----VRVFTFSVGQHNYDVTPLQWIACSNKGYYFEIRS 365 >gi|323474469|gb|ADX85075.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477206|gb|ADX82444.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 380 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 58/226 (25%), Gaps = 51/226 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L+ + + H + D + K+ K + I + Sbjct: 24 KVLLVPEKLGSATGFHYIVALDTSGSMTGY--------KIELAKQGAIELFKRIPKGN-- 73 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + I +++ V E + + T A+ A + Sbjct: 74 ----KVSFITFSSNVTVIKEFVDPLD----LTNEILQIAAGGQTALYTAILTANSLAKK- 124 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 +++ LTDG + + + K +++ Sbjct: 125 ------------------YQMPTYLLLLTDGNPTDET------NVGNYLKLPYFEKMQVY 160 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSL-----IHVFQNIS 406 + I N Q L + Y++ +A+ + I+ Sbjct: 161 SFGIGDDYNEQLLQSISDKTSGVMYHISDANEIPQKLPQKAVTQIA 206 >gi|121709994|ref|XP_001272613.1| von Willebrand and RING finger domain protein [Aspergillus clavatus NRRL 1] gi|119400763|gb|EAW11187.1| von Willebrand and RING finger domain protein [Aspergillus clavatus NRRL 1] Length = 1007 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 55/168 (32%), Gaps = 21/168 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L++AL + ++ MGL+ + + T K + ++S Sbjct: 505 KITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSWGGWNKILNS 557 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + L +D + + P I+ ++D ++ +S Sbjct: 558 IRPVGQ----------KSLRADVVEGANVAMDLLMQRKSNNPVS-TILLISDSSTSDPES 606 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK I I + + + +++ + + V + Sbjct: 607 VDFVVSRAEAAK---ISIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 651 >gi|114566000|ref|YP_753154.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114336935|gb|ABI67783.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 592 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 43/146 (29%), Gaps = 29/146 (19%) Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 E + + + + ++ T + A ++ Sbjct: 454 SSEVVVPFTRNQNILNKGLSTIS---PAGLTPMADGIMTAVNLI---------------- 494 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFIKIVTISINASPNGQR 377 + ++ ++DG N ++ +EN KI I I N Sbjct: 495 --KNNRVRNPLLVLISDGIPNIPLWTLDAQADALEAATHIREN--KIHFICIGLESNRFY 550 Query: 378 LLKTCVSSPEYHYNVV--NADSLIHV 401 L K ++ Y V N D+LI++ Sbjct: 551 LEKLSANAGGALYLVDDLNKDNLINI 576 >gi|301609308|ref|XP_002934204.1| PREDICTED: epithelial chloride channel protein [Xenopus (Silurana) tropicalis] Length = 906 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 64/192 (33%), Gaps = 32/192 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++ L A +++ + V++ Y+G++ ++ E + T+ + Sbjct: 315 ISGSMTNANRITRLYQASEVYI-----MQIVEQGAYVGIVTFSNVAEIKSQLVKITDTFQ 369 Query: 278 QYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + ++ T+ ++Q Q+ + I+ L Sbjct: 370 RESLKLKLPTVATGGTNICAGVQQGLQV----------------NRNLDQSTHGTEIVLL 413 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV--- 393 TDGE++ S ++ I TI++ + + L K + Sbjct: 414 TDGEDSGISSCFP------DITKSGAIIHTIALGNNADPG-LEKLADLTGGLKLYASDKV 466 Query: 394 NADSLIHVFQNI 405 +A+ LI F I Sbjct: 467 DANGLIDSFSGI 478 >gi|297714304|ref|XP_002833596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like, partial [Pongo abelii] Length = 384 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 21 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 60 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + + T R + ++Y+ + T+ A+ +A IL Sbjct: 61 DFNQNV-RTWRNDLISATKTQVADAKRYIEKI---QPSGGTNINEALLRAIFILNEASN- 115 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 116 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 164 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 165 GFDVDYDFLKRLSSENRGIAQRIYGNQDTSSQLKKFYNQVST 206 >gi|255530103|ref|YP_003090475.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255343087|gb|ACU02413.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 613 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 31/232 (13%), Positives = 70/232 (30%), Gaps = 44/232 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N K+V +I P + ++D+SGSM+ + K Sbjct: 218 NANHKLVQIGLQGKKIPTDNLPASNLVFLIDVSGSMN-------------------QPNK 258 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L + L + + + ++ Y + + G EK + Sbjct: 259 LPLLIASFKLLTEQLRPED------KVAIVVYAGNSGLVLPSTPGNEKTKIK-EALNKLS 311 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KS 346 T ++ AYQ+ T + + II TDG+ N S Sbjct: 312 AGGSTAGGAGIQLAYQVATDNFIKGGNN----------------RIILATDGDFNVGASS 355 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 + + + ++ +++ + + + ++T + + N Sbjct: 356 DKDMESLIEEKRKSGVFLTVLGYGMGNMKDSKMETLADKGNGNYAYIDNISE 407 >gi|78186536|ref|YP_374579.1| hypothetical protein Plut_0658 [Chlorobium luteolum DSM 273] gi|78166438|gb|ABB23536.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 140 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 32/125 (25%), Gaps = 6/125 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 +K + + FA + L+L+ ++Y Y K + A AG Sbjct: 19 ARDQKGSVLVEFAFILPVLLMLLFGVVYFSVALYNKTVLTMATREGARAGVLY------D 72 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + +A L F + + S I + Sbjct: 73 ADGNNTANAVNAALPLCSSVISFGSDATPVPQASVSGDILTVTASGNYTGFYIFSGLTTS 132 Query: 135 NRLDS 139 + Sbjct: 133 AQTSM 137 >gi|296448099|ref|ZP_06890001.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296254413|gb|EFH01538.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 618 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 29/300 (9%), Positives = 66/300 (22%), Gaps = 10/300 (3%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 +FA + + + ++ + K + + A +GA S S++ Sbjct: 1 MFAFVLPLVIGVCSLVVEFGNALLIKARYQRVADIASFSGALAYSGTSSTTTMTNASVAV 60 Query: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 A + A S S + + + + + L + + Sbjct: 61 AALNQVPAAAVSANLTVSPASSSNSAVLVAISAAHPLFLTPV----LNGPRSLQIAASAY 116 Query: 145 FYNMDVMTSYDYRLQ-----FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 +S L V + + + G Sbjct: 117 SQIAVSASSCLLALDSSKSGVTLSGGTSVTATSCVVASNSSVTVPCGTSIAAKAVDYYSS 176 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S V + + + L + GY Sbjct: 177 SAPSAPCSGISGSIVKAYTTDPVAGTSGLTTATARLTTVAAQTAPSAPSVSTGSNVSFGY 236 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 T +I Y + + T + A+ + + + + T F Sbjct: 237 -TAASMSIGNGCTATTSSAYSGKWTVTCPAGGTYTFGAITTSATLSFATSGTAATTYNFS 295 >gi|254481635|ref|ZP_05094879.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] gi|214038263|gb|EEB78926.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] Length = 1306 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 32/300 (10%), Positives = 61/300 (20%), Gaps = 77/300 (25%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + Y Q G+ S M+ + Sbjct: 313 GSGVGDGYPQSNAMDGNEYGGSVDGDISWGNTAYTFYTSHYMNWWHD----------DSL 362 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHV---KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 R K+ + + +D+ + G + + Sbjct: 363 VTPRPKIDIAADVITTIIDTNTSVDFGLLEFNYREGGRVT----QRIIQNMTA--ANRTN 416 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQIL---------------TSDKKRSFFTNFFRQG-- 321 + T + +AY+ L + + + Sbjct: 417 LINLVDALEPAGWTPLCESTYEAYRYLAGESPVYSGSASDGIYAGWRYDIAASDSAAMSG 476 Query: 322 ---VKIPSLPFQKFIIFLTDGENNN-----------------------FKSNVNTIKICD 355 V + +II +TDG S ++ + Sbjct: 477 GNYVSPNTDCAYTYIIIMTDGLPGRDGGANDAIETLTQKDCDMYDSEEGNSKNCMPQLAE 536 Query: 356 KAKEN------------FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 N I TI + + Q L T +Y +A L FQ Sbjct: 537 YMATNDLDGDTTNGSQFGIT-YTIGF--ATDQQLLQDTAEKGKGEYYTADSAAELTEAFQ 593 >gi|320155491|ref|YP_004187870.1| von Willebrand factor type A domain-containing protein, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319930803|gb|ADV85667.1| von Willebrand factor type A domain protein, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 406 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 59/206 (28%), Gaps = 21/206 (10%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 KG ++ ++ ++++SF+++ I V + +++A +AA LA A + Sbjct: 8 KGKRKQQGLVLVLVTVAMLSFVIMAALAIDVTHQVVNRTKLQNAVDAAALAAAMVADATH 67 Query: 73 SRLGDRFESIS--NHAKRALIDDAKRFIKNHIKESLS-----------GYSAVFYNTEIQ 119 + + N A + S Y Sbjct: 68 DTPTATAAAKTTLNSMHSASGNSELDIDSATFSIDYSNDPLTFPDSSFNSDEDIYVRISI 127 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-------QFIEHLLNQRYNQK 172 + ++ S M + ++ S + + + + + + + Sbjct: 128 DDLSLSEFFMQALGLSK-TVSASAVAGPSSSINTISNVVPIGVCKGDESGGIYGYNPAEV 186 Query: 173 IVSFIPALLRIEMGERPIFLIELVVD 198 V + MG L++ Sbjct: 187 YVLKVGDSTMTTMGSGNYHLLDFGSG 212 >gi|239906053|ref|YP_002952792.1| hypothetical protein DMR_14150 [Desulfovibrio magneticus RS-1] gi|239795917|dbj|BAH74906.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 391 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 19/141 (13%), Positives = 43/141 (30%), Gaps = 12/141 (8%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR-L 75 +E + + AL ++ L + YK++ +++A +AA LAGA+ ++ N Sbjct: 12 AESGSVVVFSALIMIVLAGLATLAVDYGFLQYKRSQLQTAADAAALAGAADLLRNGDDFD 71 Query: 76 GDRFESISNHAKRALIDDAKRFIKN----------HIKESLSGYSAVFYNTEIQNIVNSS 125 R ++ + D + + N Sbjct: 72 AVRATAVDFGQRNLGEQDTVASAVTTGDVELLKGETPAAGATPDTVRVTAGRTAQRGNPV 131 Query: 126 RISMT-HMANNRLDSSNNTIF 145 + + + N D + Sbjct: 132 DMFLGPVLGWNTQDLTATASA 152 >gi|50841536|ref|YP_054763.1| hypothetical protein PPA0046 [Propionibacterium acnes KPA171202] gi|289424429|ref|ZP_06426212.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289427366|ref|ZP_06429079.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|295129575|ref|YP_003580238.1| hypothetical protein HMPREF0675_3046 [Propionibacterium acnes SK137] gi|50839138|gb|AAT81805.1| putative membrane protein [Propionibacterium acnes KPA171202] gi|289155126|gb|EFD03808.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289159296|gb|EFD07487.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|291376464|gb|ADE00319.1| conserved hypothetical protein [Propionibacterium acnes SK137] gi|313771156|gb|EFS37122.1| conserved hypothetical protein [Propionibacterium acnes HL074PA1] gi|313792519|gb|EFS40605.1| conserved hypothetical protein [Propionibacterium acnes HL110PA1] gi|313803520|gb|EFS44702.1| conserved hypothetical protein [Propionibacterium acnes HL110PA2] gi|313806906|gb|EFS45404.1| conserved hypothetical protein [Propionibacterium acnes HL087PA2] gi|313811818|gb|EFS49532.1| conserved hypothetical protein [Propionibacterium acnes HL083PA1] gi|313817689|gb|EFS55403.1| conserved hypothetical protein [Propionibacterium acnes HL046PA2] gi|313821484|gb|EFS59198.1| conserved hypothetical protein [Propionibacterium acnes HL036PA1] gi|313824571|gb|EFS62285.1| conserved hypothetical protein [Propionibacterium acnes HL036PA2] gi|313826242|gb|EFS63956.1| conserved hypothetical protein [Propionibacterium acnes HL063PA1] gi|313832354|gb|EFS70068.1| conserved hypothetical protein [Propionibacterium acnes HL007PA1] gi|313832813|gb|EFS70527.1| conserved hypothetical protein [Propionibacterium acnes HL056PA1] gi|313839673|gb|EFS77387.1| conserved hypothetical protein [Propionibacterium acnes HL086PA1] gi|314926286|gb|EFS90117.1| conserved hypothetical protein [Propionibacterium acnes HL036PA3] gi|314961709|gb|EFT05810.1| conserved hypothetical protein [Propionibacterium acnes HL002PA2] gi|314964231|gb|EFT08331.1| conserved hypothetical protein [Propionibacterium acnes HL082PA1] gi|314975246|gb|EFT19341.1| conserved hypothetical protein [Propionibacterium acnes HL053PA1] gi|314977661|gb|EFT21756.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1] gi|314980207|gb|EFT24301.1| conserved hypothetical protein [Propionibacterium acnes HL072PA2] gi|314985154|gb|EFT29246.1| conserved hypothetical protein [Propionibacterium acnes HL005PA1] gi|314987063|gb|EFT31155.1| conserved hypothetical protein [Propionibacterium acnes HL005PA2] gi|314990444|gb|EFT34535.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3] gi|315078862|gb|EFT50880.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2] gi|315083131|gb|EFT55107.1| conserved hypothetical protein [Propionibacterium acnes HL027PA2] gi|315086659|gb|EFT58635.1| conserved hypothetical protein [Propionibacterium acnes HL002PA3] gi|315088063|gb|EFT60039.1| conserved hypothetical protein [Propionibacterium acnes HL072PA1] gi|315097114|gb|EFT69090.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1] gi|315107450|gb|EFT79426.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1] gi|327332549|gb|EGE74284.1| putative membrane protein [Propionibacterium acnes HL096PA2] gi|327333722|gb|EGE75439.1| putative membrane protein [Propionibacterium acnes HL096PA3] gi|327444419|gb|EGE91073.1| hypothetical protein HMPREF9568_01683 [Propionibacterium acnes HL013PA2] gi|327446672|gb|EGE93326.1| hypothetical protein HMPREF9571_01238 [Propionibacterium acnes HL043PA2] gi|327448885|gb|EGE95539.1| hypothetical protein HMPREF9570_00450 [Propionibacterium acnes HL043PA1] gi|327457365|gb|EGF04020.1| hypothetical protein HMPREF9584_00563 [Propionibacterium acnes HL092PA1] gi|328757926|gb|EGF71542.1| hypothetical protein HMPREF9563_00560 [Propionibacterium acnes HL020PA1] gi|328759748|gb|EGF73344.1| putative membrane protein [Propionibacterium acnes HL099PA1] gi|332674444|gb|AEE71260.1| hypothetical protein PAZ_c00470 [Propionibacterium acnes 266] Length = 169 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 39/126 (30%), Gaps = 1/126 (0%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ + Sbjct: 36 RGAVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVGQ 95 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 ++ A + A + + + G++ I S + + ++ Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPV-GFAGTASARVSCTIKLSDLLVPGMPGSFHIE 154 Query: 139 SSNNTI 144 S ++ Sbjct: 155 SVAHST 160 >gi|326916561|ref|XP_003204575.1| PREDICTED: matrilin-3-like [Meleagris gallopavo] Length = 363 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 27/77 (35%), Gaps = 9/77 (11%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 F K +I +TDG + + A+ I+I + + + L+ S Sbjct: 66 RTYFDKVVIVVTDGRPQDQ-----VENVAANARTAGIEIYAVGVGRADMQS--LRIMASE 118 Query: 386 P--EYHYNVVNADSLIH 400 P E+ + V + Sbjct: 119 PLDEHVFYVETYGVIEK 135 >gi|324508230|gb|ADY43477.1| C-type lectin protein 160 [Ascaris suum] Length = 599 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 61/200 (30%), Gaps = 26/200 (13%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S + +T + A+ +A + + +++D + L+ + T Sbjct: 36 DGSQSVGVTNFQTTLDAIVHAT-----QLTNIGPLEKDSRVALVVFDETAHIEFGLMNTT 90 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + R + M A + + + P+ K + Sbjct: 91 NADVEAAIRRSKYPNTNGVNIAAGMDAATRKIFN-------------HAYPPNRNAPKLL 137 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP---NGQRLLKTCVSSPEYHY 390 I +T +N+ +S+V A+E I I+ + L+ P+ Y Sbjct: 138 IIIT---SNSDQSDVRAENS--LAREVGAIPAAIGIHNGNGIGASREQLEIATGDPKRVY 192 Query: 391 NVVNADSLIHVFQNISQLMV 410 + SL + Q + + Sbjct: 193 TTTDYSSLDDILQKLLCEIG 212 >gi|264680611|ref|YP_003280521.1| membrane protein-like protein [Comamonas testosteroni CNB-2] gi|262211127|gb|ACY35225.1| membrane protein-like protein [Comamonas testosteroni CNB-2] Length = 521 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 51/170 (30%), Gaps = 16/170 (9%) Query: 8 RFYFKKGIASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + I + + + + AL +++ + L I + + ++K ++ + A LAGAS Sbjct: 6 KDLSRVRIRRQSGSVATLGALWLMIAVICLA--TIDIGNVFWQKRELQKMADLAALAGAS 63 Query: 67 KMVSNLSRLGDRFESI--------SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 + + +SI + + + + + ++ Sbjct: 64 GTPRSGACQSVAADSIPRNGGIPSELVSTAEGRWEVRTGLTSEQYFVAGQGPLNACKVQV 123 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD-----YRLQFIEH 163 +V + + + + + + L + Sbjct: 124 ARVVPYLFLFGADLDAGGRQVTASATAFRAPRLARLSVRSTLLELDTTKS 173 >gi|170748744|ref|YP_001755004.1| hypothetical protein Mrad2831_2326 [Methylobacterium radiotolerans JCM 2831] gi|170655266|gb|ACB24321.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 546 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 28/268 (10%), Positives = 65/268 (24%), Gaps = 11/268 (4%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + A+ N +I+ AL + I + + + A + A + A Sbjct: 10 RFGAARGGNVAILVALGSAMLMGAGAVGIDLGLVFQARRKAQGAADIAAMLAAVDPSQAD 69 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + A A + +SA + S+ + +T Sbjct: 70 TVARRSLSDNGYGAATATVAPGSYDASTRGVAPGGRFSAGGSPANAVRVGLSTSVPVTFG 129 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLL----------NQRYNQKIVSFIPALLR 182 L ++ + L ++ + Sbjct: 130 RAIGLPAAVPLRVTGTAASAQFAAFTIGSGTLRLEGGIANALLGALLGARLSLSVSDYDA 189 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK-NALLLFLDS 241 + + + S +M A +D N++ A+ +A Sbjct: 190 LASARVDGLRVLDSLGASLNMQAANYTDIVQANASVGQLLMAIRAAASDNGSAASALSGI 249 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 ++ L + V +G + P Sbjct: 250 LNALPNAGNLVAIGQVDGLGDAAALAPP 277 >gi|167626844|ref|YP_001677344.1| hypothetical protein Fphi_0624 [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596845|gb|ABZ86843.1| conserved hypothetical membrane protein with von Willebrand factor type A domain [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 332 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 53/193 (27%), Gaps = 21/193 (10%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D T++ K L L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPTRLERAKYKTLDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V V D + ++ + A++++ Q++T + K Sbjct: 151 ANTVENLVPVINSDIVPVQGNNIYKAIEKSAQLITQAGAKKGQIILITDSTPSADAIA-K 209 Query: 332 FIIFLTDGENNNFKSNVNTI------KICDKAKEN--FIKIVTISINASPNGQRLLKTCV 383 G + + + + K++ I+ N L Sbjct: 210 AKQLAEQGIDTDVYAIGTPKGGIAKDDKGNYIKDDRGNIQ----YFGIDLNQLEALAKAG 265 Query: 384 SSPEYHYNVVNAD 396 S N D Sbjct: 266 SGKLVTLTANNTD 278 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 12/146 (8%), Positives = 40/146 (27%), Gaps = 22/146 (15%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTD 293 D + L +G++ Y ++ ++ + + ++ Sbjct: 303 QFLADVVQALDIGPAGPLVGVVQYGDNPATQFNLKTHMNSQDLKTAIEKITQR--GGLSN 360 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + S + + + DG + ++ Sbjct: 361 VGRAISFVTKTFFSKANGNRGGA-------------PNVAVVMVDGWPTD-----KVEEV 402 Query: 354 CDKAKENFIKIVTISINASPNGQRLL 379 A+E+ I + I++ + ++ Sbjct: 403 SRVARESGINVFFITVEGAAEREKQH 428 >gi|90403616|ref|NP_001035046.1| integrin alpha M [Bos taurus] gi|74229861|gb|AAX46797.1| integrin alpha M [Bos taurus] Length = 1152 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 50/166 (30%), Gaps = 20/166 (12%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 K L+ Y+ + + + + T + +++ + L Sbjct: 182 KSKTLFSLMQYSDDFQTHFTFNDFKRNPVPEFLVGPIRQLFGRTHTATGIRKVVRELFHS 241 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K +I +TDGE + + +A I I Sbjct: 242 SSGARNHAI-------------KIMIVITDGEK--YLDPLEYSDAIPEADREKIIRYVIG 286 Query: 369 INA---SPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + ++ L T S P ++ + V N ++L + + + + Sbjct: 287 VGDAFRGRKSRQELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 39/409 (9%), Positives = 96/409 (23%), Gaps = 72/409 (17%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSS 125 ++ + L D + F +++ + + V + Sbjct: 139 DYAADNGSTAGEEDVNVPAQTVQLWGDERNFENSYLDYNGAYIKKWVEPVLSNNPTSDLH 198 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIP 178 T + N LD + + L + Q N ++ + Sbjct: 199 PEDATTLYNVYLDVIGSEKQEIS--PIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVN 256 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHC-----------AMNSDPEDVNSAPICQDKKRTK 227 + + P I + ++ + + D+N ++ + Sbjct: 257 E-ISENLLSDPNMDIR-IGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNR 314 Query: 228 MAA--------LKNALLLFLDSID--LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 LKN + K + +G T + Sbjct: 315 TPIGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNW 374 Query: 278 QYVTRDMDSLIL----------KPTDSTPAMKQA--YQILTSDKKRSFFTNFFRQGVKIP 325 + + T Y + + ++ N+ VK Sbjct: 375 AVMNNMIARDTGDLFKNFETYSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNYRYGEVKEG 433 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKIC-----------DKAKENFIKIVTISINASPN 374 ++ + + +K +EN I +I + + Sbjct: 434 DDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGS 493 Query: 375 ----------GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 G+ +LK + +Y+ N + +I ++IS Sbjct: 494 VAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 542 >gi|315605762|ref|ZP_07880794.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] gi|315312460|gb|EFU60545.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] Length = 339 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 45/186 (24%), Gaps = 19/186 (10%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 +S + E + L + + E +T D++ Sbjct: 116 ESFKKIISRFEGERISLQLWNAYSMTMFPLTDDYEMADDVLTEMADTI-DGGLSIYGGRI 174 Query: 300 QAYQILTSDKKRSFFTNFFR-------------QGVKIPSLPFQKFIIFLTDGENNNFKS 346 L + + G + I+ TD E + Sbjct: 175 SVTPTLEKYLRPLLPDDDEEKASLVGDGLASCVLGFDHTDKERSRTILLATDNEV-YGQG 233 Query: 347 NVNTIKICDKAKENFIKIVTISINAS----PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + AK + + + + L S+ Y+ + ++ V Sbjct: 234 IYDLAEAIEFAKSQKVTVSALYPGSEYAMTSEAYELRDQVRSTGGDFYDASSPSAVDEVV 293 Query: 403 QNISQL 408 + I Sbjct: 294 KQIESE 299 >gi|221108025|ref|XP_002157834.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 299 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 37/109 (33%), Gaps = 14/109 (12%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A++ A + + + K II LTDG + Sbjct: 5 TRIDRALRLAQKDMFTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPE 51 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +I D+ + + + I+ + I ++ N + L + Y DSL Sbjct: 52 RIADELRNDGVVILGVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 99 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 48/133 (36%), Gaps = 17/133 (12%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +G+I ++ R E +++ + + V + L+ T A++ A + + Sbjct: 177 SSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIP--LMNFTTRIDRALRLAQKDM 234 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K II LTDG + +I D+ + + + I+ Sbjct: 235 FTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVIL 281 Query: 366 TISINASPNGQRL 378 + I ++ N L Sbjct: 282 GVGIGSAVNETEL 294 >gi|220913380|ref|YP_002488689.1| TadE family protein [Arthrobacter chlorophenolicus A6] gi|219860258|gb|ACL40600.1| TadE family protein [Arthrobacter chlorophenolicus A6] Length = 148 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 40/131 (30%), Gaps = 4/131 (3%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L + + K + ++ FAL L L+ ++ + + + + A Sbjct: 16 LGDQLKKTRKDSNHKSRGAVAVEFALVAPVLLTLVVGIVEFANAYNA----QISVTQAAR 71 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A M S+ + +++ A F + + Y + Sbjct: 72 EAARTMAVKNSQADAKAAAVAGAPGINSAAFAYAFSPAACTPDTTAKVTITYPAATLTGL 131 Query: 123 NSSRISMTHMA 133 + +++T + Sbjct: 132 FGTSVTVTGVG 142 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 78/232 (33%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGAGNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVE----KNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDIPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV-FQNISQL 408 +++ + + N + + + V + ++L V FQ I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDES 472 >gi|283777925|ref|YP_003368680.1| hypothetical protein Psta_0124 [Pirellula staleyi DSM 6068] gi|283436378|gb|ADB14820.1| hypothetical protein Psta_0124 [Pirellula staleyi DSM 6068] Length = 505 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 63/226 (27%), Gaps = 16/226 (7%) Query: 11 FKKGIAS-----EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 K + ++ A+ + S +IG ++ + + A +AA A A Sbjct: 11 SSKFARKTREKQRRGKVLVLLAILLPSLCGMIGLVMDTGMLLSEHRQAQHAADAAATAAA 70 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + E+ + + H + Y+ + E+ Sbjct: 71 MEKQQGSTTAEVLAEATETVRVH--NQISSANVTVHSPPTSGPYAGSAKHVEVIVDYELP 128 Query: 126 RISMTHMA---NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 M + +N + F + S L + + +P L Sbjct: 129 TFFMHVVGGATSNDIRCRAVAGFEPVTEGASIV-VLDPN---PDNFSLGALTPILPPLPA 184 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + G + L V + G+MH ED P K K Sbjct: 185 LLGGVEVLGL--GRVQVHGAMHVNTEWGGEDEYGLPAGVVKPLKKA 228 >gi|325982790|ref|YP_004295192.1| PEP motif putative anchor domain-containing protein [Nitrosomonas sp. AL212] gi|325532309|gb|ADZ27030.1| PEP motif putative anchor domain protein [Nitrosomonas sp. AL212] Length = 333 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 43/153 (28%), Gaps = 20/153 (13%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + +I + + + + ++ T+ + A LT Sbjct: 105 STSVSVIDFDSSAAILQGLTPLSSGSAAVISAINAIDASGGTNIGAGIDSAAAELTGANH 164 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + ++ ++DG ++ ++ + I T+ + Sbjct: 165 TAGS---------------TQMMVVVSDGFSSGDPASSALAALGAGVDA----IHTVGL- 204 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + S + N + SLI +F Sbjct: 205 PGHDAFTMQNIATSGNGIYTNASSLTSLIDLFN 237 >gi|227827415|ref|YP_002829194.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584630|ref|YP_002843131.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619571|ref|YP_002914396.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459210|gb|ACP37896.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019679|gb|ACP55086.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380640|gb|ACR41728.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 380 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 58/226 (25%), Gaps = 51/226 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L+ + + H + D + K+ K + I + Sbjct: 24 KVLLVPEKLGSATGFHYIVALDTSGSMTGY--------KIELAKQGAIELFKRIPKGN-- 73 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + I +++ V E + + T A+ A + Sbjct: 74 ----KVSFITFSSNVNVIKEFVDPLD----LTNEILQIAAGGQTALYTAILTANSLAKK- 124 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 +++ LTDG + + + K +++ Sbjct: 125 ------------------YQMPTYLLLLTDGNPTDET------NVGNYLKLPYFEKMQVY 160 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSL-----IHVFQNIS 406 + I N Q L + Y++ +A+ + I+ Sbjct: 161 SFGIGDDYNEQLLQSISDKTSGVMYHISDANEIPQKLPQKAVTQIA 206 >gi|327191361|gb|EGE58388.1| hypothetical protein RHECNPAF_310002 [Rhizobium etli CNPAF512] Length = 193 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 12/114 (10%), Positives = 32/114 (28%), Gaps = 8/114 (7%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ I K +I FA+ ++L + A Sbjct: 5 NPFTRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKR-------AT 57 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A S + +++ +S + + + ++G + Sbjct: 58 RAAGS-IADLVTQQQSITKSTLTEMRSVATAIFVPYNSTSLTLKITGITVDASA 110 >gi|307947552|ref|ZP_07662881.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769249|gb|EFO28481.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 212 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 22/208 (10%), Positives = 54/208 (25%), Gaps = 26/208 (12%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM----V 69 + K + +I FAL L I I + + K++ + + L + Sbjct: 12 LKGNIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVA 71 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNH---IKESLSGYSAVFYNTEIQNIVNSSR 126 +S+ R + ++ K IK ++ A + Sbjct: 72 DGMSKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHES 131 Query: 127 ISMTHMANNRLDSSNNTIFYNM--DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 M + +D + ++ + + + + I Sbjct: 132 YMTHAMGFDNIDYTADSESTISFGQGKYEFIFLVDVSPSMG-----------------IG 174 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPE 212 R +++ + + H S Sbjct: 175 ASNRDRQIMQRAIGCQFACHEPWYSSVS 202 >gi|221096021|ref|XP_002158471.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 1924 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 100/409 (24%), Gaps = 48/409 (11%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 N I+F + +++ +IG + + +N + + Sbjct: 186 GNVFILF-MLIIAVFAIIG--VQLWQGIL-RNR--------------CFHNLTENVASAT 227 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + A + A + + + N+ S +++ L Sbjct: 228 NLSLSGATNLSLSGATNLSLSGATNLSLSGATNLSLSGATNLSLSGATNLSLSGATNLSL 287 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA-LLRIEMGERPIFLIELVVD 198 S ++ T+ L + + + + L Sbjct: 288 SG-ATNLSLSGATNLSLSDATNLSLSGATNLSLSGATNLSLSGATNLSLSGATNLSLSGA 346 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 + S+ A N + + + L A L L LS Sbjct: 347 KNLSLSGATNLSLSGATNLSLSGATNLS----LSGATNLSLSGATNLSLSGATNLSLSGA 402 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL--TSDKKRSFFTN 316 + S +D + + Y L D + Sbjct: 403 TNLSLSGATNLSLSGATNLSLSDYIIDDYMYTS-NDNNLFISRYYKLPEEDDWSLICSSA 461 Query: 317 FFRQGVKIPSLPFQKF--IIFLTDGENNNFKSNVNTIK------------ICDKAKENFI 362 V S K+ + D + +N+ + IC A + Sbjct: 462 MNSGQVSCNSKFIPKYDNYYIIDDYMYTSNDNNLFISRYYKLPEEDDWSLICSSAMNSGQ 521 Query: 363 KIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQNISQLMV 410 I N L C +S HYNV++ DS+ F I Q + Sbjct: 522 TSDCI------NWNLLYTHCNASNSNPHYNVISFDSVSDAFIAIFQSIT 564 >gi|327265755|ref|XP_003217673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Anolis carolinensis] Length = 1078 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 51/168 (30%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 245 KTSVYDMLDTLSDDDYVNVASFNQKAQAVSCFTHLVQ--ANIRNKKVFKEKVEVMEARGT 302 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD + A++ L + K I+ TDG + + Sbjct: 303 TDYKAGFEFAFEQLQNSNISRANC--------------NKMIMMFTDGGEDRVQDVFEKY 348 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 +K +++ T S+ L+ ++ Y++ + + ++ Sbjct: 349 NWPNKT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 392 >gi|170734865|ref|YP_001773979.1| TadE family protein [Burkholderia cenocepacia MC0-3] gi|169820903|gb|ACA95484.1| TadE family protein [Burkholderia cenocepacia MC0-3] Length = 177 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 10/143 (6%), Positives = 41/143 (28%), Gaps = 2/143 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + ++ FAL +M ++L + Y+ ++ A A + + ++ + Sbjct: 8 RSRTRGVVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAY 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + + ++ A T + + Sbjct: 68 PVSQAQCLVVYGSTTCGSSGTELVPGLTTSMVTICDAQ--RTTGCSDASDPSQFANLPTY 125 Query: 135 NRLDSSNNTIFYNMDVMTSYDYR 157 + +++ + + Sbjct: 126 DSNNNAASGTATGAINVVEVKIT 148 >gi|119593589|gb|EAW73183.1| hCG25234 [Homo sapiens] Length = 195 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 53/142 (37%), Gaps = 25/142 (17%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L+ +++ +GL+ + + + + + Q +T ++ T Sbjct: 63 NQAAELYLIQIIEKGSLVGLVTFDSFAKIQSKLIKIIDDNTYQKITANLPQEADGGTSIC 122 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 +K +Q + + +F + II LTDGE+ I +C Sbjct: 123 RGLKAGFQAIPQSNQSTFGSE----------------IILLTDGED-------YQISLCF 159 Query: 355 DKAKENFIKIVTISINASPNGQ 376 + K++ I TI++ S + + Sbjct: 160 GEVKQSGTVIHTIALGPSADEE 181 >gi|47212619|emb|CAF92825.1| unnamed protein product [Tetraodon nigroviridis] Length = 533 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 19/223 (8%), Positives = 60/223 (26%), Gaps = 26/223 (11%) Query: 205 CAMNSDPEDVNSAPICQ--DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 C +SD ++ +C+ + + + + + M I +++R Sbjct: 16 CIHDSDTSCLHLYFLCRRSGSVKNHWEEIYYFVENLAEKFKS-----PMLRMSFITFSSR 70 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT-SDKKRSFFTNFFRQG 321 ++ + +R+ + + T +++A + + + + G Sbjct: 71 ASTIMKLTENRINIRKGLNALKREIPGGDTIMHLGLQKANEQIKRENFGPASVIIALTDG 130 Query: 322 VKIPSLPFQ---------------KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + A+ + Sbjct: 131 ELQEDELISAQQEVAASRSMSAVAAVSALAAFSIPDVMDPFPVFPQQAETARSLGAIVYC 190 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQL 408 + + + L T + E+ + V +L V +I + Sbjct: 191 VGVKD--FNETQLATIADTIEHVFPVLGGFHALRGVINSIIKK 231 >gi|134291854|ref|YP_001115623.1| TadE family protein [Burkholderia vietnamiensis G4] gi|134135043|gb|ABO59368.1| TadE family protein [Burkholderia vietnamiensis G4] Length = 177 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 40/151 (26%), Gaps = 6/151 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + + ++ FAL ++ ++L + Y+ ++ A A Sbjct: 1 MKRLPMHRSRM----RGVAAVEFALVLIPMIVLATGVAEFGRAIYQYETLTKATRNAARY 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + S+ + + + + + +T + Sbjct: 57 LSVYLPSDAAYPLAAAQCLVVYGNTTCGAAGTELVPGLTTS--MVIVCDAAHTPDCADAS 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +++ + + Sbjct: 115 DPPQFANLPTYDSSNNAPSGTATGSINVVEV 145 >gi|229596006|ref|XP_001013680.3| hypothetical protein TTHERM_00833730 [Tetrahymena thermophila] gi|225565656|gb|EAR93435.3| hypothetical protein TTHERM_00833730 [Tetrahymena thermophila SB210] Length = 557 Score = 44.8 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 48/159 (30%), Gaps = 26/159 (16%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 +V + + T P + G+ S Sbjct: 244 FKNNVSQVNNMPQKYKEIRTNGSTALGPGLA------------------VSLGLASQSPQ 285 Query: 329 FQKFIIFLTDGENN-------NFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLK 380 II TDG N + K K+ AK I I TI++ + + L K Sbjct: 286 SGSSIILCTDGLANEGIGQLEDKKDYETYEKMGQLAKSLGILIHTITMRGNESDVRVLGK 345 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 ++ V AD + + +S +V K ++++ Sbjct: 346 LSDTTGGRTSRVGPADFNSNFVELVSSDLVATKVEIVVQ 384 >gi|17533687|ref|NP_496745.1| C-type LECtin family member (clec-65) [Caenorhabditis elegans] gi|3876683|emb|CAB03057.1| C. elegans protein F35C5.8, confirmed by transcript evidence [Caenorhabditis elegans] Length = 372 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 44/166 (26%), Gaps = 14/166 (8%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 I +G I Y K++ + + L+ T + Sbjct: 62 QIGTGYDDPRSTRVGFITYNWNATDV----ADFYKLQSWADLNSQIQRLQYTPQSS---S 114 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + G ++K +I T + + + K Sbjct: 115 PASRMDTGLNAAIGMIDATAGF---RDNYKKIVIVFT--SVHGSYKSNQPRDVSKILKSR 169 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN--ADSLIHVFQN 404 I +VT++ +S + Q LK S ++ + Sbjct: 170 GIPVVTVNTGSSSDTQAYLKQIASDNMSFAIADGNVTQEILKAMTD 215 >gi|268532320|ref|XP_002631288.1| Hypothetical protein CBG03101 [Caenorhabditis briggsae] Length = 394 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 61/202 (30%), Gaps = 15/202 (7%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + + V N + V Q ++ ++N L L SI + VK Sbjct: 11 PSISDDVYSDPICPGNIKNLWLDVVVVVDKSQLMTNAQLWQVRNTLTQVLGSISKIGPVK 70 Query: 250 EDVY-----MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 +G++ Y + Y + + T++ Sbjct: 71 YPADPRSTCVGIVTYDDNATTQSQLDASKSFSDLYNVIQSSLISVDNTNT--------SY 122 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + + + I F D + + + I ++ K+N + I Sbjct: 123 LSLALLAAEKALKDGRNRTYRFNYKKVIIAFAADYQGHGTALDAMP--IANRLKDNAVTI 180 Query: 365 VTISINASPNGQRLLKTCVSSP 386 +T++ ++ + Q ++ S Sbjct: 181 ITVACTSNSDKQTAIQGIASPG 202 >gi|288940556|ref|YP_003442796.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288895928|gb|ADC61764.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 610 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/256 (11%), Positives = 80/256 (31%), Gaps = 53/256 (20%) Query: 164 LLNQRYNQKIVSFIP-ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + R + + S + L+ ++ + L++D+SGSM Sbjct: 1 MASLRRARLLSSIVLLWLIAVQAPLAAPADVRLLIDVSGSMRQND--------------- 45 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + + AL + + G+ + + E I P+ + ++ + Sbjct: 46 PRNLRAPAL-QLVNELI---------PAGAIAGVWLFAEQTEVLIPPAPVDDAWKKRLAG 95 Query: 283 DMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + + TD A++ A + ++ +I TDG Sbjct: 96 RLARIHSRGLFTDIERAIRTA-----------------TEDWTKTPPEGERHLILFTDGL 138 Query: 341 NNNFKSNVNTIKI--------CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 + K + + K +K+ I ++ + + + + Sbjct: 139 VDVSKDEAESAASRERILSEQIESLKSEGVKVHAIGLSDQIDEPLMRLLATQTDGWLEVA 198 Query: 393 VNADSLIHVFQNISQL 408 +A++L +F + + Sbjct: 199 QDAETLQRLFLRVLEQ 214 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 39/361 (10%), Positives = 96/361 (26%), Gaps = 46/361 (12%) Query: 61 ILAGASKMVSNLSRLGDRFESISNH-AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 LA + + +S+ + A A S + ++ + Sbjct: 16 ALACVTGPQAAISQTTTPQPAAPPVTPSTAPQQPATTPATTPQSASPTQPNSTSPTSPTG 75 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + + T + D+ + T F + D + + N + +F Sbjct: 76 STAAPQSSTPTLQPRSDQDAQDGTTFTLHRSVNEVDLIFTVMGKDGHFVSNLQQQNFGLL 135 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + + + D R + + A FL Sbjct: 136 DDGRPPQHV----LRFAQQTNLPLRVGIMLDTS---------SSIRQRFEFEQQAATDFL 182 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAM 298 + ++ + + I + + T ++ Sbjct: 183 ---------LQVLHPADRAFVEGFDVQINIAQDFTNRIDMLDTGIRRLRPGGGTALFDSL 233 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKA 357 + Q +K I+ ++DG+++ + IK+C +A Sbjct: 234 ----------YRTCKDQMLTLQQDA----EVRKAIVLVSDGDDDYSRVLETEAIKMCQRA 279 Query: 358 KENFIKIVTISINASPNGQR----LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + TIS N P+ + L + ++ + D + F+NI + + + Sbjct: 280 ---ETIVYTISTNVGPSRDKGDDVLQQISDATGGQSFYPQRIDDVAIGFRNIEEELRSQY 336 Query: 414 Y 414 Sbjct: 337 L 337 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|198430849|ref|XP_002120173.1| PREDICTED: similar to cGMP-dependent protein kinase 1, alpha isozyme (CGK 1 alpha) (cGKI-alpha) [Ciona intestinalis] Length = 1896 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 64/180 (35%), Gaps = 24/180 (13%) Query: 228 MAALKNALLLFLDSIDLL--SHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDM 284 +++ K AL ++ + K +V++G++ + + I ++ ++ + Sbjct: 853 LSSYKQALRWVVELLTSFREDVDKGNVHVGVVAFHSWAGTRIALGAFEFSNLQARIIALS 912 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T+ PA+ + + + + QK +I +TDG ++ Sbjct: 913 NGRNYGGTNIAPAIDETLREFNRNGRTGI----------------QKQMILMTDGYSSYP 956 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + +A+ + V + + S Q L + + N + L V ++ Sbjct: 957 NAISP---AAQRARAQGVVTVAVGVGGSSYAQLL--NIAGNQTRVFYATNFNRLGEVVES 1011 >gi|197105074|ref|YP_002130451.1| hypothetical protein PHZ_c1611 [Phenylobacterium zucineum HLK1] gi|196478494|gb|ACG78022.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 164 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 11/71 (15%), Positives = 22/71 (30%), Gaps = 4/71 (5%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 ++ I + +I FAL + LI I + M + + A+ A + Sbjct: 2 AVRRFIDDVRGGAAIEFALISPLLIFLIFATIEMAVMAM----MSAGLDNAVATTARMIR 57 Query: 70 SNLSRLGDRFE 80 + Sbjct: 58 TGQDDGPASAA 68 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 315 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 372 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 373 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 418 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 419 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 462 >gi|119468018|ref|XP_001257815.1| von Willebrand and RING finger domain protein [Neosartorya fischeri NRRL 181] gi|119405967|gb|EAW15918.1| von Willebrand and RING finger domain protein [Neosartorya fischeri NRRL 181] Length = 1013 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 57/177 (32%), Gaps = 21/177 (11%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 P+ + K+ L++AL + ++ MGL+ + + T K Sbjct: 502 PVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSW 554 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + S+ + L +D + + P I+ ++ Sbjct: 555 AGWNKILSSIRPVGQ----------KSLRADVVEGANVAMDLLMQRKSNNPIS-TILLIS 603 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 D ++ +S + + AK + I + + + +++ + + V + Sbjct: 604 DSSTSDPESVDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 657 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 353 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 410 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 411 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 456 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 457 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 500 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 242 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 299 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 300 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 345 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 346 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 389 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 314 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 371 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 372 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 417 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 418 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 262 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 319 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 320 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 365 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 366 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 409 >gi|156390493|ref|XP_001635305.1| predicted protein [Nematostella vectensis] gi|156222397|gb|EDO43242.1| predicted protein [Nematostella vectensis] Length = 229 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 48/167 (28%), Gaps = 10/167 (5%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEKVRQYVTRDM 284 ++ A + + +S +V + L+ Y ++ T V++ Sbjct: 30 EYIRQAQKHVISISETISRTAYNVRLALVEYRDHPPQDKSFVTRVHDFTSDVKEMKVWVD 89 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D ++ A + + + + + + F DG N Sbjct: 90 KMSASGGGDCPESVADAI----FKACKLGYREDATKMCVLIADAPPHGLGFAHDGFPNGC 145 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYH 389 N + + C E I + TI S + L+ + + Sbjct: 146 PDNHDPLASCHVMAEKGITLYTIGCEPSVDSFRDFLVGMAAITGGQY 192 >gi|320161534|ref|YP_004174758.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|320161798|ref|YP_004175023.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] gi|319995387|dbj|BAJ64158.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|319995652|dbj|BAJ64423.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] Length = 150 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 9/47 (19%), Positives = 19/47 (40%), Gaps = 7/47 (14%) Query: 18 EKAN----FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 E+ + + L ++ L+L+G Y + + A +AA Sbjct: 10 ERGSSLAWTAAFLGLVLIPLLMLVG---DGARLLYVRGRLAQATDAA 53 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T ++ S + V + + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPRIWVKVSEQDSSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDIPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV-FQNISQL 408 +++ + + N + + + V + ++L V FQ I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDES 472 >gi|328952507|ref|YP_004369841.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] gi|328452831|gb|AEB08660.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] Length = 465 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 46/154 (29%), Gaps = 16/154 (10%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES--------------ANNA 59 ++ EK + +++ A+ + + + ++GF + + +KN ++ A + Sbjct: 1 MLSDEKGSVAVVVAVCLFALVGVLGFSLDTAFLYVEKNHTQNAVEAAALAGATNLCAGDQ 60 Query: 60 AIL-AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 + A + + G E ++D F SG A Y Sbjct: 61 EDIEAIVRNVAAENGLTGANHEVFVTFGFYDEMNDYGDFPVYKDFVEKSGMPAGEYV-NA 119 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 + + N + + + Sbjct: 120 VAVACLTNNVTLTGLNKSGTVAAMAVAFLRRFDM 153 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 63/176 (35%), Gaps = 18/176 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +K+ L L++ KED I ++T + + ++ + Sbjct: 112 VKDYKNGILALQTLITRAKEDTRYAGITFSTEANITFYFTDPLDAMKGL---GGITYAPG 168 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T++ A+ L +KK F K I+ +TDG++N + T Sbjct: 169 MTNTQAALDICRTQLWLNKKSGF------------RRLSFKRILIVTDGQSN--INMERT 214 Query: 351 IKICDKAKENFIKIVTISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + + K I+I +++ G ++ S+ + Y V N L+ V I Sbjct: 215 LYNAFQLKNMGIEIFVVAVGKYLRGIAEIVGLASSTDAHLYRVRNLRGLLEVVHLI 270 >gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7] gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-7] Length = 755 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 79/252 (31%), Gaps = 44/252 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + N +V +P + + LV+D SGSM Sbjct: 347 SSNTDNYSLVMVLPPKVEASEQLNLPRELILVIDTSGSMA-------------------- 386 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KNAL L + + + + + RQ+V R Sbjct: 387 GDSIIQAKNALRYALRGLRPQDSFNIIEFNSDVSLLS-PTPLPATATNLAMARQFVNRL- 444 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ A+ A + + + + + +IF+TDG Sbjct: 445 --QADGGTEMAQALNAAL----PRQAFNTASGEDKS---------LRQVIFMTDGS--VG 487 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I ++ +N ++ T+ I ++PN + + + + D + Q Sbjct: 488 NESALFELIRNQIGDN--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVE---QK 542 Query: 405 ISQLMVHRKYSV 416 ISQL+ +Y V Sbjct: 543 ISQLLAKIQYPV 554 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 27/234 (11%), Positives = 57/234 (24%), Gaps = 58/234 (24%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + + + N + +++ + V+ S Sbjct: 184 PKNVVIVIDTSGSMANLHSGKSLINIAIDAAITVLDTMNPNDKVGVIAFS---------- 233 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D P L A + + ++ Sbjct: 234 --DELKLPPKIGDASCYANELALATTINIQNLKQF------------------------- 266 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + T A A+ +L + Sbjct: 267 ------------VLSLVARGGTHYGKAFDAAFNLLKESYTLDADNE-------RGKIERD 307 Query: 331 KFIIFLTDGEN-NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + IIFLTDGE ++ S + I+ ++ EN + I+T + +G L+ Sbjct: 308 QVIIFLTDGEPLDDKTSIMRKIRSNNEEMENKVTILTFGLGLD-SGINFLEDIA 360 >gi|255263037|ref|ZP_05342379.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255105372|gb|EET48046.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 203 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 23/52 (44%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 + +E + S+ F + F++L+ I V ++ +E + A+ A Sbjct: 10 RFRKNEDGSSSVEFVILFPVFIVLVATSIEVGLVMTRQVMLERGTDLAVRAI 61 >gi|260808845|ref|XP_002599217.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] gi|229284494|gb|EEN55229.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] Length = 600 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 30/97 (30%), Gaps = 7/97 (7%) Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + +I LTDG + + ++ I + A Sbjct: 309 VASTMLLPGNGNRPDAPDVVIVLTDG------YSSGVVGPASVLHGMGVQTFAIGVGACA 362 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 N +L + S P+Y Y + + +L + + + Sbjct: 363 NSAQLTQI-ASCPDYIYRLPDFSALRSITAGMHDQIC 398 >gi|281209350|gb|EFA83518.1| hypothetical protein PPL_02583 [Polysphondylium pallidum PN500] Length = 461 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 76/210 (36%), Gaps = 18/210 (8%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 +V+DLSGSM A + + ++ + +D ++ Sbjct: 152 MIVLDLSGSMRSAAFKGS------LTPGELEMKRIEIAQALFQTMIDK---YVQLEIAAI 202 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GL+ + R+E P+ + + + + T A+K A + + ++ Sbjct: 203 VGLVCFGERIEVTFPPTRNFDSFSTELGEVVANQSK--TRLYEAIKLAGETIVKYREN-- 258 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 T+ V PS + LTDG++N SN ++ K I + I I Sbjct: 259 PTSLADGFVLAPSDKLICRVFALTDGQDN---SNACPYEVYKYLKSANIILDAIPIGEG- 314 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 G L ++ + ++ + + +F+ Sbjct: 315 -GNTLGSFTKATGGSCFTFNSSKAGVELFE 343 >gi|223462563|gb|AAI50654.1| Von Willebrand factor A domain containing 3B [Homo sapiens] Length = 1294 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 521 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|221040994|dbj|BAH12174.1| unnamed protein product [Homo sapiens] Length = 951 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 178 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 234 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 235 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 267 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 268 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 315 >gi|221052180|ref|XP_002257666.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] gi|193807496|emb|CAQ38002.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] Length = 294 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 56/210 (26%), Gaps = 24/210 (11%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 +G + +C D+ + K + + Sbjct: 71 NIGSESPPGSCNDTGANPGNYCDNY---YDITLLVEESSFIQKDYWK-KGTIPFLESMVR 126 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMK 299 K+ +M ++ + + + +K+ + V D T A++ Sbjct: 127 NARVSKDKAHMSIVLFARDSRVIVPFTDELSQDKDKLIEKVRAINDVATSPDTLYAYALE 186 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A++ + + K ++ G + ++ + + K+ Sbjct: 187 HAFEHVIFGEGTRKDA--------------PKVVVLFYYGFDYGANKSLIP-DVVEDYKK 231 Query: 360 NFIKIVTISINASP-NGQRLLKTCVSSPEY 388 IK+V + I + LL C Sbjct: 232 KNIKLVIVGIALGNRDNAYLLGGCAIGDNN 261 >gi|119622317|gb|EAX01912.1| hypothetical protein MGC26733, isoform CRA_a [Homo sapiens] Length = 1080 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 521 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|118918435|ref|NP_659429.4| von Willebrand factor A domain-containing protein 3B [Homo sapiens] gi|296439299|sp|Q502W6|VWA3B_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3B Length = 1294 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 521 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A K +L+ ++ + G Sbjct: 9 VLDPSGSMNIYLVLDGSDSIGASN--------FTGAKKSLVNLIEKVASYGVKPRY---G 57 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 58 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 114 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 115 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 167 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 168 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 219 >gi|30268323|emb|CAD89964.1| hypothetical protein [Homo sapiens] Length = 1060 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 521 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|21740064|emb|CAD39048.1| hypothetical protein [Homo sapiens] Length = 803 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 30 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 86 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 87 IKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 119 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 120 -DQPPETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 167 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 37/121 (30%), Gaps = 3/121 (2%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + FF + + + + K D +K+ Sbjct: 242 WDSADGYNGKVGGIEFFSRTLSWKDIKTAADGNGKDAAGVSWDGDPSKDPKGVDYSKQL- 300 Query: 362 IKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 ++ T+ S G+ L S ++++N + L + F I L+ S + Sbjct: 301 VQTFTVGFGQGISDAGKAFLTRGASQDDWYFNADKPEDLENAFNKIISLISTDSESPPFE 360 Query: 420 G 420 G Sbjct: 361 G 361 >gi|149921114|ref|ZP_01909572.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] gi|149818001|gb|EDM77460.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] Length = 389 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 22/284 (7%), Positives = 62/284 (21%), Gaps = 58/284 (20%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 T ++ + L + + D G Sbjct: 77 TTTATVNNIPPNVVLVLDKSRSM----------------------------VVNAWDDDG 108 Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 + + ++ + ++ + L L + G Sbjct: 109 NPDTEDVTRWHSLHDTVDTVGHQY------QDGMSLGLTLFPSVDAESSFD--GACPVNE 160 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 E + E + + D+ + T + + A L + + Sbjct: 161 VPEVGVGL-GNAEALLAAIPAADDTDLHGATPAAAGIATALAHLEALEDGRPAAMILVTD 219 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ----- 376 I + + + + + D I + I+ + + Sbjct: 220 GAANCSAGANDITKFSQYDED------LPLVVADAWDRAGIPTYVVGIDIQESSEHPFTN 273 Query: 377 ---RLLKTCVSSP-----EY--HYNVVNADSLIHVFQNISQLMV 410 +L + + Y+ +A +L I+ + Sbjct: 274 PREKLHEVAEAGGVARSDGEVGFYDAGDAQALTAALDEIAASVS 317 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 59/177 (33%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + + T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSREIPAIT--YRGGGT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 168 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 206 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 I ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 207 IAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM+ M D D A K L ++ + GL+ Sbjct: 239 DPSGSMNIYMVLDGSDSIGASN--------FTGAKRCLANLIEKVASYGVKPRY---GLV 287 Query: 258 GYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKR 311 Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 288 TYATVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS----- 342 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF------ 361 II +TDG +N V + D ++ Sbjct: 343 ---WPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDY 397 Query: 362 IKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + P L + ++ ++ + V + + L +VF Sbjct: 398 LDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 440 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM+ M D D A K L ++ + GL+ Sbjct: 263 DPSGSMNIYMVLDGSDSIGASN--------FTGAKRCLANLIEKVASYGVKPRY---GLV 311 Query: 258 GYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKR 311 Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 312 TYATVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS----- 366 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF------ 361 II +TDG +N V + D ++ Sbjct: 367 ---WPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDY 421 Query: 362 IKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + P L + ++ ++ + V + + L +VF Sbjct: 422 LDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 464 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM+ M D D A K L ++ + GL+ Sbjct: 616 DPSGSMNIYMVLDGSDSIGASN--------FTGAKRCLANLIEKVASYGVKPRY---GLV 664 Query: 258 GYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKR 311 Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 665 TYATVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS----- 719 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF------ 361 II +TDG +N V + D ++ Sbjct: 720 ---WPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDY 774 Query: 362 IKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + P L + ++ ++ + V + + L +VF Sbjct: 775 LDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 817 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM+ M D D A K L ++ + GL+ Sbjct: 261 DPSGSMNIYMVLDGSDSIGASN--------FTGAKRCLANLIEKVASYGVKPRY---GLV 309 Query: 258 GYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKR 311 Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 310 TYATVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS----- 364 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF------ 361 II +TDG +N V + D ++ Sbjct: 365 ---WPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDY 419 Query: 362 IKIVTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + P L + ++ ++ + V + + L +VF Sbjct: 420 LDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 462 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 43/156 (27%), Gaps = 19/156 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + L K+ V++ +I Y + T + Sbjct: 91 INWTTQLTEALLTDKDQVHIAMIQYAETPTIEFSLDTYRNPRDITNHIMTINFHSGGTRT 150 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ A L S+KK K I+ TDG S + +K Sbjct: 151 GKALLAAKVELFSEKKG-------------ARKNASKIIVLFTDG-----LSVDDPVKHA 192 Query: 355 DKAKE-NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + +E +KI + + + + + Sbjct: 193 QQLREVEKVKIYVVYVGSDGFEYEMDRIAGGKSNIF 228 >gi|224067090|ref|XP_002302350.1| predicted protein [Populus trichocarpa] gi|222844076|gb|EEE81623.1| predicted protein [Populus trichocarpa] Length = 705 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 59/158 (37%), Gaps = 33/158 (20%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ LK A+ L + S+ + ++ +++ ++ + T ++ R +D Sbjct: 347 KLQMLKRAMRLVISSLGSAD------RLSIVAFSSSPKRLLPLKRMTPNGQRSARRIIDR 400 Query: 287 LILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 L+ + A+++A ++L ++R+ + I+ L+DG++ Sbjct: 401 LVCGQGSSVGEALRKATKVLEDRRERNPVAS----------------IMLLSDGQDERSS 444 Query: 346 SNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTC 382 + I+ I + + + N Q + Sbjct: 445 TRFAHIE---------IPVHSFGFGQSGGNSQEPAEDA 473 >gi|254508395|ref|ZP_05120516.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] gi|219548708|gb|EED25712.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] Length = 696 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 36/275 (13%), Positives = 77/275 (28%), Gaps = 69/275 (25%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + + ++F P + + V+D SGSM Sbjct: 289 SSKRGTVKLTFTPGDDLTRVTQ--GRDWVFVLDKSGSM---------------------S 325 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 K + L + L + + ++ T + + + + Sbjct: 326 GKYSTLVEGVRQGLGKLPSEDRFRVVMFDSNTYDLTGGFVAV----NQSNISKALQAVEQ 381 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ M A + L D+ I+ +TDG N Sbjct: 382 VEPSNGTNLYEGMAAAIRKLDDDRPTG--------------------IVLVTDGVANVGV 421 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI------ 399 + + +++ +++ T + S N L+ S +V NAD +I Sbjct: 422 --TEKRRFFELMEKHDVRLFTFIMGNSANTPLLVPMTKLSNGVATSVSNADDIIGHLMSM 479 Query: 400 ------HVFQNIS--------QLMVHRKYSVILKG 420 ++NI + + + S + +G Sbjct: 480 TSKLTHQAYRNIQLDIDGVKIKDLTPEQISSLYRG 514 >gi|297537911|ref|YP_003673680.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] gi|297257258|gb|ADI29103.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] Length = 1187 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 35/329 (10%), Positives = 83/329 (25%), Gaps = 11/329 (3%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LAG + + + N + + E L+ Sbjct: 538 LAGGLQDTAGNPDTTNASSIDGNLQISNAAGN-LTITLSQPNEVLTSGGVAVTWLGSGTQ 596 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 ++ +A +D+S + +N+ + + A Sbjct: 597 QLTAMAGTHAVATASVDNSGHY-AFNLFAPVDHSGV-NIEDVKSINFGVTVADGVTSATS 654 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I L S +M D +++ + L++A+ ++ Sbjct: 655 TLSVNIEDDAPIALNRSDSFAMIDTNLLITLDTSNSMNDVSGINAET-RLQSAVKS-IER 712 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + ++ + L+ ++ E + + + I T+ A+ A Sbjct: 713 LMDIYDGFGEIRVRLVTFSNNAETQGTEWVTLDTAKTILDGI--LTIGGTTNYDGAIANA 770 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK--- 358 K S N L++ ++ N + K Sbjct: 771 MTAFADPGKISGAQNISYFFSDGNPNRGDGSNTTLSNVGSSVGPDNGIQVAEEAIWKGFL 830 Query: 359 -ENFIKIVTISINASPNGQRLLKTCVSSP 386 +N IK I + L Sbjct: 831 NDNQIKSYAIGMGTGLTDVSYLNPIAYDG 859 >gi|209527388|ref|ZP_03275895.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492179|gb|EDZ92527.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 396 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 60/197 (30%), Gaps = 29/197 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL--LS 246 P I ++VDL+GSM+ S ++ A FL+ + Sbjct: 69 PPAWIVVLVDLTGSMNELDTSGK--------------RRIDGALEATRRFLEQMSDRGGD 114 Query: 247 HVKEDVYMGLIGYTTRVEKNIE-------PSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 V G G + + K ++ + TD + Sbjct: 115 TKVAIVPFGKGGANCPGFEVTQRGIDSRFFPANDIKQTNFLDYLAAQTLCAATDIYGPLS 174 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-K 358 +A ++L + + F+ + +I L+DG +N + + + Sbjct: 175 EAIRVLGNRQDPRFYV-----PEDSGRPEPRLSVILLSDGFHNQPNEQQDFDNLITLLER 229 Query: 359 ENFIKIVTISINASPNG 375 N I + T+ +P Sbjct: 230 NNNIIVHTLGYGLTPQQ 246 >gi|185134784|ref|NP_001117673.1| complement factor B/C2-B [Oncorhynchus mykiss] gi|11990428|dbj|BAB19788.1| complement factor B/C2-B [Oncorhynchus mykiss] Length = 787 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 59/200 (29%), Gaps = 18/200 (9%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 +T + A+ + +D +I Y ++ + ++ S E VR Sbjct: 280 TSGSISKTHFNLARGAIADLIRKLDSYEIELNFQ---VISYASQAKDIVDIVSSNDEHVR 336 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 V + L S K + + + R+ + Q I+ T Sbjct: 337 SNVRSVLKRLAAFDHRSH-GNKTGTNLYAALYRVYERMAILRERNETRFSETQNVILIET 395 Query: 338 DGENNNFKSN-----------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS- 385 DG +N S + D ++ + + + + N L Sbjct: 396 DGYSNTGGSPLAVLAKIRSLLGYSTSALDHTEDKLLDVYVFGVGDNVNKNELNLIASRKR 455 Query: 386 -PEYHYNVVNADSLIHVFQN 404 ++ + + + L VF + Sbjct: 456 HEKHLFILKDYKQLGEVFNS 475 >gi|134299959|ref|YP_001113455.1| hypothetical protein Dred_2113 [Desulfotomaculum reducens MI-1] gi|134052659|gb|ABO50630.1| hypothetical protein Dred_2113 [Desulfotomaculum reducens MI-1] Length = 314 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 14/138 (10%), Positives = 41/138 (29%), Gaps = 7/138 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++ ++ +L++ + + V KN + +A +AA LAGA ++ N + Sbjct: 8 FFSDDRGLAVVLVSLAMTVLIGAAALVTDVGLVTLVKNRLSNAADAASLAGAQELPDNPN 67 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + + I + ++ I + Sbjct: 68 TALSIAHAY-------ALSNGLSEEQIDIHLVTDPTDTYYDGVKVDTHQTVDYIMAKALG 120 Query: 134 NNRLDSSNNTIFYNMDVM 151 + + + Sbjct: 121 FTSANVQATATAKVLPIT 138 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 94/285 (32%), Gaps = 28/285 (9%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 +H ++ F+ KK NF++I A+ + S LL+G ++ ++ K S+ESA+ A Sbjct: 23 IHFFNKLLFFSKK------GNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEA 76 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ ++ + + N AK L D KN+ + + A +I Sbjct: 77 -LSHGMSLICYQNDIE-----RDNLAKIILNDLIVSLKKNNFTKQEADLVAKNSKIDITT 130 Query: 121 IVN-SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF---------IEHLLNQRYN 170 ++N S+ + H + + +L+ Sbjct: 131 LINDSTNVKSYHFYIKSVYKMPLNKITKIFYPKDLTIVTNVNKIVPCPYTSYVMLSNPRA 190 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVD---LSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 ++ S + R + + + + ++GSM + + Sbjct: 191 RQFNSDWDLIHRRTVNAINSIITDKNIKYMIINGSMTNFD--PSHYYTAEVKQFNNVYRH 248 Query: 228 MAALKNALLLFLDSIDLLSHVKE-DVYMGLIGYTTRVEKNIEPSW 271 + + D +D ++ DV Y+ + SW Sbjct: 249 LNVPIFRSIGTRDYVDNNGICRDGDVLTNFSTYSCAFAALNDLSW 293 >gi|226510867|gb|ACO59960.1| matrilin-1 [Salmo salar] Length = 108 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 43/126 (34%), Gaps = 22/126 (17%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + ++GL+ Y++ V++ + ++ V + + + T + A++ Sbjct: 3 DVSETKAHVGLVQYSSSVKQEFPLGRYNNKKDLKDAVKKMA--YMERGTMTGQALRY--- 57 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 T+ K I TDG + ++ + KAKE K Sbjct: 58 ----------LTDSSFAPAGGARPGVAKVGIVFTDGRSQDYIGD-----AAKKAKEQGFK 102 Query: 364 IVTISI 369 + + + Sbjct: 103 MFAVGV 108 >gi|171320626|ref|ZP_02909646.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] gi|171094139|gb|EDT39226.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5] Length = 346 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 42/127 (33%), Gaps = 4/127 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ASE+ + ++ F + ++ L I + +N +++A +A A + Sbjct: 13 ASERGSVALFFVMFLIPLLSFGALAIDIAWVATVRNQLQNAADA----AALAAADAMMSP 68 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S A A A+ + + + + T + + I+ Sbjct: 69 TGGPLNWSQAAPAANGVIAQNSAAGAALSTGTVTTGYWNVTRNPASMQPTTITPGAYDVP 128 Query: 136 RLDSSNN 142 + + + Sbjct: 129 AVQVTVS 135 >gi|126433420|ref|YP_001069111.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126233220|gb|ABN96620.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 233 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 45/151 (29%), Gaps = 15/151 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +AAL+ F + + V + ++ + T + + + Sbjct: 36 IAALERGFAAFTRYLQNEVLASKRVEVAVVTFGTVATVLVPMQEART------LQPVAFT 89 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ + A IL K ++ +I+ LTDG+ N + Sbjct: 90 ASGTTNMAAGIHLALDILEDRKHAYKAAGL---------QYYRPWILLLTDGKPNLDGFD 140 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL 378 ++ + + + + Q+L Sbjct: 141 EAVARLNAVESARGVTVFAVGAGPRVDYQQL 171 >gi|313215532|emb|CBY16229.1| unnamed protein product [Oikopleura dioica] Length = 159 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 14/71 (19%), Positives = 28/71 (39%), Gaps = 3/71 (4%) Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQN 404 ++ D+ KE +++ + + GQ L SSP E Y + L + + Sbjct: 1 TDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENFYTAEESTDLAGLVKT 59 Query: 405 ISQLMVHRKYS 415 I+ + + S Sbjct: 60 IADEICTTELS 70 >gi|306828192|ref|ZP_07461454.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] gi|304429615|gb|EFM32662.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] Length = 1036 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 30/317 (9%), Positives = 79/317 (24%), Gaps = 56/317 (17%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + I ++ + + + + +G+++ Sbjct: 427 MTGKKQPLDILVVVDRSGSMQEG---------IGSIEKYKYWKYKYDEYYHIWRNAGTIY 477 Query: 205 CAMNSDPEDVNSAPICQDKKR--------TKMAALKNAL---LLFLDSIDLLSHVKEDVY 253 P D + + A+K+AL L ++ + Sbjct: 478 FDNYLGPRYQPDTYTYYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDINPQNQLAV 537 Query: 254 MG-----LIGYTTRVEKNIEPSW-----------------GTEKVRQYVTRDMDSLILKP 291 +G Y + + E + Sbjct: 538 VGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYQDRNG 597 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ A+ +A ++L +P+ F +G ++ + Sbjct: 598 TNYHAALLKADEMLQKVANNGHRKIMVFISDGVPTFYFGADNYRSGNGTVSDGNIINSQK 657 Query: 352 K---ICDKAKEN--FIKIVTISINASPNGQR------LLKTCVSSPEYHYNVVNADSLIH 400 D+ K + I ++ ++ N +LK S +Y+ + + + L Sbjct: 658 GSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYL-SGDDYYSGITDTEQLEK 716 Query: 401 VFQNISQL--MVHRKYS 415 I + + + S Sbjct: 717 TANKIVEDSKISNLTIS 733 >gi|120537195|ref|YP_957252.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120327030|gb|ABM21337.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 584 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 37/126 (29%), Gaps = 30/126 (23%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T AM A + LT+ ++ +I +TDG+ NN ++ Sbjct: 487 DSGGGTPLAEAMLYAARELTAS------------------HKARQVMIVITDGDPNNPQA 528 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I I I + + CV + + L +I+ Sbjct: 529 VNYLNGLIKG----HIDTYAIGIGSPAVKRFFENWCV--------ISDVSQLQSALFSIA 576 Query: 407 QLMVHR 412 ++ Sbjct: 577 SNVLEL 582 >gi|313222038|emb|CBY39058.1| unnamed protein product [Oikopleura dioica] Length = 1721 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 71/224 (31%), Gaps = 20/224 (8%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE-DV 252 V + H S D + A +++ F+ ++ D+ Sbjct: 197 PHVDVQNIHDHVGDCSIEHPSADIFFVLDGSYSTKRAGWESIIEFVQRFVKINFEHGGDM 256 Query: 253 YMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILTSD 308 GL+ Y+ VE + + G ++ +++ T + AM+ A + Sbjct: 257 NYGLLQYSDFVEPILSFADKEGQREIAEFIEILSQIKYHSGFSTLTGTAMRYAAET---- 312 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +K +I +TDG + + + + + + ++ Sbjct: 313 ---------EFMSERGARKNVRKIMIVVTDGRAKDI-DDNTVKIVGESLRAAG-DLTVVA 361 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + + + L SSP++ +N + L + + + Sbjct: 362 VGVNKAVESELVEIASSPDFVHNTKKFEDLHNFLSPVFAEICKE 405 >gi|322707155|gb|EFY98734.1| von Willebrand and RING finger domain containing protein [Metarhizium anisopliae ARSEF 23] Length = 1166 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 26/211 (12%), Positives = 63/211 (29%), Gaps = 21/211 (9%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + P V + ++ + V PI + K+ +++AL ++++ Sbjct: 602 SVTTAPTEYTNFAVKTPYHPYSLVHVPIDVVVVVPISSSMQGVKINLVRDALKFMVNTLG 661 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 MGL+ + + T K T + S+ S A Sbjct: 662 ERD------RMGLVTFGSGGGGV-PIVGMTTKAWPGWTNILSSIKPVGQKSHRA------ 708 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + K + I+ ++D + S + + AK I Sbjct: 709 ---DVVEGANVAMDLLMQRKHNNPIAS--IMLISDASTADADSVDFVVSRAEAAK---IT 760 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 I + + + +++ + + V + Sbjct: 761 IHSFGLGMTHKPDTMIELSTKTKASYTYVKD 791 >gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein [Shewanella amazonensis SB2B] gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Shewanella amazonensis SB2B] Length = 713 Score = 44.5 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 75/249 (30%), Gaps = 49/249 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++F P + + V+D SGSM+ Sbjct: 309 STKGTVKLTFTPGDDLGPVTQ--GRDWVFVLDKSGSMN---------------------G 345 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K A L + L + + ++ ++ + + + Q + Sbjct: 346 KYATLVEGVRQGLGKLPAQDRFRIILFDESTQEFSKGFVPV----DSNNINQALAWVEGI 401 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD +K+A L +D+ ++ +TDG N Sbjct: 402 SPGNGTDLYQGLKRALTPLDADRSTG--------------------VVLITDGVANVGV- 440 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + ++ +++ T + S N L+ S +V NAD ++ NI+ Sbjct: 441 -TEKRRFLELMQQQDVRLFTFIMGNSANTPLLVPMTRLSNGVATSVSNADDIVGHLMNIT 499 Query: 407 QLMVHRKYS 415 + H+ Y Sbjct: 500 SKLTHQAYR 508 >gi|322698421|gb|EFY90191.1| hypothetical protein MAC_03706 [Metarhizium acridum CQMa 102] Length = 1056 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 25/211 (11%), Positives = 63/211 (29%), Gaps = 21/211 (9%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + P V + ++ + V PI + K+ +++AL ++++ Sbjct: 493 SVTTAPTEYTNFAVKTPHHPYSLVHVPIDVVVVVPISSSMQGVKINLVRDALKFMVNTLG 552 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 MGL+ + + T K + + S+ S A Sbjct: 553 ERD------RMGLVTFGSGGGGV-PIVGMTTKAWPGWSNILSSIKPVGQKSHRA------ 599 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + K + I+ ++D + S + + AK I Sbjct: 600 ---DVVEGANVAMDLLMQRKHNNPIAT--IMLISDASTADADSVDFVVSRAEAAK---IT 651 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 I + + + +++ + + V + Sbjct: 652 IHSFGLGMTHKPDTMIELSTKTKASYTYVKD 682 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 246 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 303 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 304 TGYKAGFEYAFDQLQNSNITRANC--------------NKVIMMFTDGGEDRVQDVFEKY 349 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 350 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|291541798|emb|CBL14908.1| hypothetical protein RBR_05200 [Ruminococcus bromii L2-63] Length = 1244 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 40/458 (8%), Positives = 105/458 (22%), Gaps = 76/458 (16%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 K SI + ++ + + + K+ ESA + + + Sbjct: 4 FKKTKGAVSIFLVIILVPMMTVSALFVDASKVSLAKSVSESAGD------LTLNTALTDY 57 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHI-----KESLSGYSAVFYNTEIQNIVNS----- 124 + A D+ ++++ +S A Y +I + Sbjct: 58 DTKLKDMYGLFATAQNTDELYASLEDYYRSCITSSGVSNEDADSYVDQIMAQLGLVGDSG 117 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 + +M + N+ + + F+++ +S I + Sbjct: 118 DVSDVLNMELVDFSVNKNSDATLSNASVMKKQIVDFMKYRAPINTGLSFLSSIQTFKTLS 177 Query: 185 MGE---------------------------RPIFLIELVVDLSG----------SMHCAM 207 + L+ D + S Sbjct: 178 KQSELVDKRQDYYKEEESVMKTLKSAWEYINKYNSLPLITDENYFSDMQKNLKDSAGWEN 237 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNA--LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + + + + N + +V+ D++ T Sbjct: 238 DYKFSVNRKTIMDLYDTQNYIDYYCNVQKVNNVESKNAAGDNVRVDLWRFTYSETGVAAP 297 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF------- 318 + + K ++ + + Q + S T+ Sbjct: 298 LQDYTQYYPKRVGGYDKNKLPTADGIKSLMNSFYRELQKMDSYGAEIKKTSAPVDVYDLQ 357 Query: 319 ------RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 R+ + + Q +N + K+ I + + Sbjct: 358 YLVQTNRKNLGPYTSSAQSVYTVYQKLKNAMIWVEAYDDDAKKELKDTQITV-------N 410 Query: 373 PNGQRLLKTCVSSPEYHYNVVNA-DSLIHVFQNISQLM 409 +L + + +N + F +ISQ + Sbjct: 411 SKSDKLYSQFSIIESKYESAMNEVSGIAKTFSDISQEL 448 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 244 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 301 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 302 TGYKAGFEYAFDQLQNSNITRANC--------------NKVIMMFTDGGEDRVQDVFEKY 347 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 348 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 391 >gi|148237705|ref|NP_001079047.1| complement factor B [Xenopus laevis] gi|1030718|dbj|BAA08371.1| Xenopus Bf B [Xenopus laevis] Length = 745 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 29/262 (11%), Positives = 85/262 (32%), Gaps = 48/262 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + +++ + + I +V+D S S+ + ++ + Sbjct: 211 FSSSMLENVDTTNLED--RSDRSVQILKDGLMNIFIVLDTSKSVGEEKFEEAKEAS---- 264 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEK 275 LF++ + +I Y + + S E Sbjct: 265 ----------------KLFIEKMADYDIKPRY---CIISYASVAIAVVSLRDPDSNDAEA 305 Query: 276 VRQYVTRDMDSLILK--PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ Y+ L + + ++G K + I Sbjct: 306 VTKHLEDFQYNNHADKQGTNTRAALHSIYEHLIEQE-----LAYEKEGKKADFMKIHNVI 360 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK----------IVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ + + I+ + + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKLIIRFLDIGIRTENPRLEYLDVYVFGLGSDIDQPEINELAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQ 403 + + +++ N + + F+ Sbjct: 421 KKDKEVHTFHLENVNKMKEFFE 442 >gi|213626825|gb|AAI70192.1| Complement factor B [Xenopus laevis] Length = 745 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 29/262 (11%), Positives = 85/262 (32%), Gaps = 48/262 (18%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 F +L + + +++ + + I +V+D S S+ + ++ + Sbjct: 211 FSSSMLENVDTTNLED--RSDRSVQILKDGLMNIFIVLDTSKSVGEEKFEEAKEAS---- 264 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP----SWGTEK 275 LF++ + +I Y + + S E Sbjct: 265 ----------------KLFIEKMADYDIKPRY---CIISYASVAIAVVSLRDPDSNDAEA 305 Query: 276 VRQYVTRDMDSLILK--PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V +++ + T++ A+ Y+ L + + ++G K + I Sbjct: 306 VTKHLEDFQYNNHADKQGTNTRAALHSIYEHLIEQE-----LAYEKEGKKADFMKIHNVI 360 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIK----------IVTISINASPNGQRLLKTCV 383 + +TDG+ N +K+ + + I+ + + + + + + Sbjct: 361 LLMTDGKFNMGGDPREEMKLIIRFLDIGIRTENPRLEYLDVYVFGLGSDIDQPEINELAS 420 Query: 384 SSPE--YHYNVVNADSLIHVFQ 403 + + +++ N + + F+ Sbjct: 421 KKDKEVHTFHLENVNKMKEFFE 442 >gi|254458848|ref|ZP_05072272.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084614|gb|EDZ61902.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 629 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 54/160 (33%), Gaps = 33/160 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + +C+D ++ K L ++ +G+I + S+ Sbjct: 93 DISDSMLCEDIYPNRLELAKKKALELINKATK-------DRVGVIAFAKNSYLVSPISFD 145 Query: 273 TEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 T+ V +++ S I + T+ + K + +K Sbjct: 146 TKTVSFLLSKLDTSSITQKGTNILTMLGT--------------------VEKTNTSTDKK 185 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 +++ L+DG + + D AKE I + + I Sbjct: 186 YLLILSDGGD-----ETDFSAEIDFAKEKGIIVFVLGIGT 220 >gi|171320625|ref|ZP_02909645.1| TadE family protein [Burkholderia ambifaria MEX-5] gi|171094138|gb|EDT39225.1| TadE family protein [Burkholderia ambifaria MEX-5] Length = 142 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 41/130 (31%), Gaps = 3/130 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + S+ FAL + L+++ +I V K + +A+ A AG V L+ Sbjct: 8 ARHARGVVSLEFALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPMLTT 67 Query: 75 LGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++S + A + +G + T + + Sbjct: 68 TQIANVALSYTQNSLVSGGSATAPTVAVTQANGTTAGTALTVTVTYTYSGLVLGTALSVL 127 Query: 132 MANNRLDSSN 141 + +S+ Sbjct: 128 TGPITISASS 137 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 44.5 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 97/381 (25%), Gaps = 50/381 (13%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A + + E +AK+ L I + + + + N V Sbjct: 235 AKALSDEIDSSVKPAEYAQYYAKKQLSSF---DIHDQVSAPERIFDEAPMAEQHNNRVAL 291 Query: 125 SRISMTHMANN---------RLDSSNNTIFYNMDVMTSYD---YRLQFIEHLLNQRYNQK 172 + M +D + + + + L + + Sbjct: 292 KVAFDSEMPIEHIRSPYHQINVDMAESGGAIVSLNRDAIANKDFVLTWKP-IQGNEPTAA 350 Query: 173 IVSFIPALLRIEMGERPIFL--------------IELVVDLSGSMHCAMNSDPEDVNSAP 218 + S I + + S Sbjct: 351 VFSQIGKTHTSQASSSEASTEPQTASEKYGLVMLMPPQGAEQQPSSIHRELILVIDTSGS 410 Query: 219 ICQDKKRTKMAALKNALLLF--LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + D ALK AL D +++ + + + + Sbjct: 411 MSGDAIIQAKTALKYALAGLRPTDKFNIVQFNSDVDKWSGMAMSATPY----------NL 460 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 Q T+ + A+ A L + T + S ++ + F+ Sbjct: 461 AQAQNYINRLEANGGTEMSIAINAA---LNIETVTDKETGTELDNNDLGSNLLRQVL-FI 516 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG +N I+ + ++ ++ TI I ++PN + + + + D Sbjct: 517 TDGAVSNESMLFELIEA--QLGDS--RLFTIGIGSAPNAHFMQRAAQLGRGTYTYIGKLD 572 Query: 397 SLIHVFQNISQLMVHRKYSVI 417 + ++ + + + + + Sbjct: 573 EVNQKVVSLLKKIEKPQVTDV 593 >gi|107028245|ref|YP_625340.1| TadE-like [Burkholderia cenocepacia AU 1054] gi|116687156|ref|YP_840403.1| TadE family protein [Burkholderia cenocepacia HI2424] gi|105897409|gb|ABF80367.1| TadE-like protein [Burkholderia cenocepacia AU 1054] gi|116652871|gb|ABK13510.1| TadE family protein [Burkholderia cenocepacia HI2424] Length = 177 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 10/142 (7%), Positives = 41/142 (28%), Gaps = 2/142 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + ++ FAL +M ++L + Y+ ++ A A + + ++ + Sbjct: 8 RSRTRGVVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAY 67 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + + ++ A T + + Sbjct: 68 PVSQAQCLVVYGSTTCGSSGTELVPGLTTSMVTICDAQ--RTTGCSDASDPSQFANLPTY 125 Query: 135 NRLDSSNNTIFYNMDVMTSYDY 156 + +++ + + Sbjct: 126 DSNNNAASGTATGAINVVEVKI 147 >gi|227830108|ref|YP_002831887.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|229578921|ref|YP_002837319.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|229582327|ref|YP_002840726.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|227456555|gb|ACP35242.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] gi|228009635|gb|ACP45397.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|228013043|gb|ACP48804.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 380 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 58/226 (25%), Gaps = 51/226 (22%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + L+ + + H + D + K+ K + I + Sbjct: 24 KVLLVPEKLGSATGFHYIVALDTSGSMTGY--------KIELAKQGAIELFKRIPKGN-- 73 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + I +++ V E + + T A+ A + Sbjct: 74 ----KVSFITFSSNVNVIKEFVDPLD----LTNEILQIAAGGQTALYTAILTANSLAKK- 124 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIKIV 365 +++ LTDG + + + K +++ Sbjct: 125 ------------------YQMPTYLLLLTDGNPTDET------NVGNYLKLPYFEKMQVY 160 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSL-----IHVFQNIS 406 + I N Q L + Y++ +A+ + I+ Sbjct: 161 SFGIGDDYNEQLLQNISDKTSGVMYHISDANEIPQKLPQKAVTQIA 206 >gi|242129222|gb|ACS83542.1| factor B/C2B [Larimichthys crocea] Length = 651 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 26/276 (9%), Positives = 67/276 (24%), Gaps = 35/276 (12%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 ++D E + + P R + + Sbjct: 106 QGPNTFDSPSDVAEAMAGSLAG-IMDVVSPESKRQNKNN------------NFGRSFRVA 152 Query: 209 SDPEDVNSAPICQDKKRTK--MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + TK + A + + +D + ++ + + Sbjct: 153 EGSRMNVYILLDTSGSITKEAFDKSREATIALITKLDSYEV---QLKFHVLSFASEAVDI 209 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS-DKKRSFFTNFFRQGVKIP 325 ++ + ++ + + +L+ K + + + + F Sbjct: 210 VDITE--SQISGSIDDVIWNLMEFNYSIH-GRKTGTNLYAALHRVNELISFFKENSATNH 266 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-----------KENFIKIVTISINASPN 374 Q II TDG +N + E + + I Sbjct: 267 FNETQNIIIIETDGYSNMGTKPQIALARIRNMLGYFDTSQDHTDETMLDVYVFGIGEQLK 326 Query: 375 GQRLLKTCVS--SPEYHYNVVNADSLIHVFQNISQL 408 +L ++ + + N ++L VF +I Sbjct: 327 KDQLNALASKKRDEQHVFILKNYETLGEVFNSIISD 362 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 56/190 (29%), Gaps = 29/190 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +S+ DK ++ A L+ ID + +GLI ++ + I + Sbjct: 152 DDSSSMKTSDKNDRRL----TAANELLEHIDG------NRKVGLIRFSKDIHCYIPMDYL 201 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + TD A+ K+ + + Sbjct: 202 KVNKSTLNHELENKAKEGGTDINDAL----------------YAVLNAFDKVGTATGSRS 245 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +I LTDG++ I + I+I IS+ + + + S+ Sbjct: 246 VILLTDGKSTTNVDEEYLINRANSM---NIQINVISLGNHTDKAFIKRITSSTGGKAAKT 302 Query: 393 VNADSLIHVF 402 + L + Sbjct: 303 SSDFYLDAAY 312 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 52/181 (28%), Gaps = 17/181 (9%) Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG 201 N Y++D+ + + N + ++ LL G R + LI D+ Sbjct: 135 NGTDYDVDIAQETMLVIDDSSSMKTSDKNDRRLTAANELLEHIDGNRKVGLIRFSKDIHC 194 Query: 202 S--MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK---------- 249 M + + + T + A+L D + + + Sbjct: 195 YIPMDYLKVNKSTLNHELENKAKEGGTDINDALYAVLNAFDKVGTATGSRSVILLTDGKS 254 Query: 250 --EDVYMGLIGYTTRVEKNIE-PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQI 304 LI + I S G + ++ R S K T S + AY + Sbjct: 255 TTNVDEEYLINRANSMNIQINVISLGNHTDKAFIKRITSSTGGKAAKTSSDFYLDAAYGV 314 Query: 305 L 305 Sbjct: 315 F 315 >gi|90578363|ref|ZP_01234174.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] gi|90441449|gb|EAS66629.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] Length = 259 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 67/254 (26%), Gaps = 37/254 (14%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMG-----ERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 + +K + L+ + + S D Sbjct: 27 TGSKGSSAQIKKTDASSGYGFDALKNDWPALNNDNALVSDNWLATNYLLIFDGSGSMDNT 86 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTE 274 + Q K+ A K A+ F++ I +GL + + + Sbjct: 87 NCGNGQ----KKIVAAKQAIQTFINDI------PNSANVGLYVFDNKDASLRVPLGNNNR 136 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + D T ++ +Y L + + ++ Sbjct: 137 ATLKQ--AIYDVTAGGATPLKSSLDSSYSALERQASKQLGYGEY-------------NVV 181 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +TDG+ + ++ I +++ + I TI K Y+ + N Sbjct: 182 IVTDGDASQGENPQPAIN--RIYRDSPVTIHTIGFCIGEQHALNAKGIT----YYQSANN 235 Query: 395 ADSLIHVFQNISQL 408 + L+ +N+ Sbjct: 236 PEKLLAGLKNVLAE 249 >gi|167836682|ref|ZP_02463565.1| hypothetical protein Bpse38_09361 [Burkholderia thailandensis MSMB43] Length = 137 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 40/134 (29%), Gaps = 5/134 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ ++ ++ F L F+L++ ++ V K + +A+ A AG V Sbjct: 1 MRRFG--QRGVVALEFVLVFPFFMLVLFGIVDVSLLLCDKAIITNASREAARAGVVACVP 58 Query: 71 NLSRLGDRFESISNHAKRALIDDAKR---FIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 ++ +++ + + + SG T + Sbjct: 59 PMTASAISQVALNYAQNNLVTGGTATTPVVAVDQSAGTTSGSPLKVTVTYTYQGLVLGSA 118 Query: 128 SMTHMANNRLDSSN 141 + L + Sbjct: 119 LSSLTGPVTLTAIT 132 >gi|126334644|ref|XP_001371234.1| PREDICTED: similar to integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide) [Monodelphis domestica] Length = 1139 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 49/164 (29%), Gaps = 24/164 (14%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + ++T + S + + + + T++ ++ + + + Sbjct: 138 SNSSFHFAAVQFSTSSKTEFYFSDYIKIRNPHTLLANVRHMQELTNTYQGIRHVVEDVFN 197 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + +I +TDG+ + + + I I Sbjct: 198 MNRG-------------ARPEAVRVMIIITDGDATDTGNIE---------QAKHIIRYVI 235 Query: 368 SINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 I L S+P ++ + D L +F ++ + + Sbjct: 236 GIGGGIQEPEKLDDFASTPKSDFVKILDTYDKLKGLFDDLEKKI 279 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 40/95 (42%), Gaps = 7/95 (7%) Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQ 376 +K +I +TDG+ +K + + +A++ I I + P + Sbjct: 5 HHKNGARKSAKKILIVITDGQK--YKDPLEYRDVIPQAEKAGIIRYAIGVGRAFQEPTAR 62 Query: 377 RLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + L T S+P ++ + V N +L + + + + + Sbjct: 63 QELNTIGSAPPQDHVFKVDNFAALSSIQKQLQEKI 97 >gi|330791224|ref|XP_003283694.1| hypothetical protein DICPUDRAFT_147402 [Dictyostelium purpureum] gi|325086437|gb|EGC39827.1| hypothetical protein DICPUDRAFT_147402 [Dictyostelium purpureum] Length = 885 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 36/298 (12%), Positives = 87/298 (29%), Gaps = 48/298 (16%) Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS-YDYRLQFIEHLLNQRYNQKIVSFI 177 +N S + ++ N L + + K + Sbjct: 210 SFDLNYSLHANLSTPIESVELLNRKNATVTFENEEKTKLTLNYS-SKNGVSDSAKKMLI- 267 Query: 178 PALLRIEMGERPIFLIEL-------VVDLSGSMHCAMNSDPEDVNSAP-----ICQDKKR 225 +L + +P IE + ++ + + +D D + Sbjct: 268 -TVLVPKFSNKPESFIEYCPIEKTHSIGINFYPNFNIPADEVDQKAEYIFVVDCSGSMSG 326 Query: 226 TKMAALKNALLLFLDSID---LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 T ++ K AL + L S++ + + T+++ + E +RQ Sbjct: 327 TPISKAKRALEICLKSLNEKSKFNIYRFGSDFQTYSRTSQLYTDESLEQAMEYIRQTDAN 386 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ P +K IL ++ + + + LTDGE Sbjct: 387 L------GGTELLPPIK---DILKTEY----------------DPEYPRQVFILTDGE-- 419 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + + + N +I T I + + + ++ + Y+ + + + Sbjct: 420 VSERDELIEYVAKE--SNTTRIFTFGIGSGVDKELVIGLSKACKGYYELIDDNKDMEE 475 >gi|310822679|ref|YP_003955037.1| hypothetical protein STAUR_5440 [Stigmatella aurantiaca DW4/3-1] gi|309395751|gb|ADO73210.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 697 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 4/50 (8%), Positives = 21/50 (42%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 F ++ ++ ++ AL V+ + + + + +++ +++ Sbjct: 2 FTRVLRQSFQRQEGQALVMAALMVLVMSIAVITTVNIGHTVHERVRLQNT 51 >gi|332185423|ref|ZP_08387171.1| tadE-like family protein [Sphingomonas sp. S17] gi|332014401|gb|EGI56458.1| tadE-like family protein [Sphingomonas sp. S17] Length = 193 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 44/142 (30%), Gaps = 5/142 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ + + FAL FLLL+ +I + + A +A+ A +V+ ++ Sbjct: 18 LSDRRGATIVEFALVATPFLLLLIAIIQTSLAYLA----QEALESAVQVAARGVVTGQAQ 73 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 D S + L + + + +S +M + Sbjct: 74 ASDVKGSSTGMTSAQLAERFRINGCAALPSFMSCSRLYVDVKSAATGAGLGSNAMPLTFD 133 Query: 135 NRLDSSNNTIFYNMDVMTSYDY 156 +N Y++ S Sbjct: 134 GNGK-PSNAFSYDLGTQGSMVM 154 >gi|308068037|ref|YP_003869642.1| hypothetical protein PPE_01262 [Paenibacillus polymyxa E681] gi|305857316|gb|ADM69104.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 738 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 16/58 (27%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E ++ L + I I E +AA+ + S L + Sbjct: 12 ESGAVTVFLILILAVMFGFIAVFIDYARIAALHVQTERLTHAAVRSVMSAYDPTLQQE 69 >gi|45767838|gb|AAH67716.1| Col14a1 protein [Danio rerio] Length = 747 Score = 44.5 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 32/88 (36%), Gaps = 13/88 (14%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 K ++ LTDG + + + KI + + I I + G+ L S Sbjct: 14 RRGIPKVLVVLTDGRSQDDVN-----KISKEMQMEGYIIFAIGFADADYGE--LVNIASK 66 Query: 386 PE--YHYNVVNADSLIHVFQNISQLMVH 411 P + + V + F+ I + ++ Sbjct: 67 PSERHVFFVDDL----DAFKKIEEQLIT 90 >gi|170592037|ref|XP_001900776.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591928|gb|EDP30531.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 817 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 +DL+ ++ + ++ Y++ S K + I T + A+ +A Sbjct: 69 VDLIEIGPKNTQIAVMQYSSYTRVEFNFSANPNKESLRASLQKIRHISGTTKTGKALDKA 128 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + + G ++ + + +TDG +++ I + ++ Sbjct: 129 LHVF---------CHDSNFGTRLNQDDVAQVAVVVTDGHSHDD-----PIPAAMRLRQAG 174 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV---FQNIS 406 ++I+T+ I A N L++ N+ + SL F+ I+ Sbjct: 175 VEILTLGIGAHINMGELVEITGDQNLAFQNLTSQASLDQFVHQFKKIA 222 >gi|146338399|ref|YP_001203447.1| hypothetical protein BRADO1313 [Bradyrhizobium sp. ORS278] gi|146191205|emb|CAL75210.1| hypothetical protein; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 526 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 13/159 (8%), Positives = 45/159 (28%), Gaps = 20/159 (12%) Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + ++ + V S ++ + + + TD +A + Sbjct: 384 QWSKQDEILVLPFNDHVLWVASASGDEQEQAGLLKQALQLHAGGGTDFYQCGARALAAMK 443 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 I+ +TDG++ ++ D + + + Sbjct: 444 PTLDGG---------------AHLAAIVIMTDGKSYGDRATFEEPWRADGGR---VPVFG 485 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 ++ + ++L + ++ +L F+ + Sbjct: 486 VTFGDEADRKQLDALAKLTGGRVFD--GTKNLTDAFRAV 522 >gi|32394566|gb|AAM93981.1| alpha 1 type VII collagen precursor [Griffithsia japonica] Length = 194 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 52/178 (29%), Gaps = 27/178 (15%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKED---VYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 T+ A L AL+ ++ + + GL E + + Sbjct: 41 SGTEFAQLNAALVEITNTFSTAAPGSSFAAVDFSGLQNGGGVGEVVSPLTSDVTTFLAAL 100 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T S + Q L + K I+ +TDG Sbjct: 101 AANPKR--GGFTSSGTGLNLCDQELDGQAQ-------------------PKVILLITDGV 139 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 +N V+ K N + + I + N L T SSP+ + V + +L Sbjct: 140 DNRNPIGVDVEAA---IKANGNTVAVVGIGDNLNDDDLRDTISSSPDLYTFVDDFSAL 194 >gi|326797335|ref|YP_004315155.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] gi|326548099|gb|ADZ93319.1| hypothetical protein Marme_4119 [Marinomonas mediterranea MMB-1] Length = 187 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 21/45 (46%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ +A+E I FALS+ ++ + + + + ME A Sbjct: 3 RRFLANESGATVIEFALSLPIMFGILLVSTDLYNINRMRGDMEQA 47 >gi|221197788|ref|ZP_03570834.1| membrane protein [Burkholderia multivorans CGD2M] gi|221181720|gb|EEE14121.1| membrane protein [Burkholderia multivorans CGD2M] Length = 634 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 39/101 (38%), Gaps = 3/101 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F + G +++ + +++ A+ + L+++G I V + ++++ ++ + LA Sbjct: 11 FARSQRAGSRAQRGSVAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQ 69 Query: 67 KMVSNLSRLGDRFESISNHA--KRALIDDAKRFIKNHIKES 105 + + + ++A A A + Sbjct: 70 SVNDLCPQTDTTVTASGSNAVVTAAYRGAALNGFDAQASGN 110 >gi|183985704|gb|AAI66222.1| LOC100158554 protein [Xenopus (Silurana) tropicalis] Length = 899 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 63/192 (32%), Gaps = 32/192 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 ++ L A +++ + V++ Y+G++ ++ E + T+ + Sbjct: 315 ISGSMTNANRITRLYQASEVYI-----MQIVEQGAYVGIVTFSNVAEIKSQLVKITDTFQ 369 Query: 278 QYVTRDMD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + ++ T+ ++Q Q+ + I+ L Sbjct: 370 RESLKLKLPTVATGGTNICAGVQQGLQV----------------NRNLDQSTHGTEIVLL 413 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV--- 393 TDGE++ S ++ I TI++ + + L K + Sbjct: 414 TDGEDSGISSCFP------DITKSGAIIHTIALGNNADPG-LEKLADLTGGLKLYASDKV 466 Query: 394 NADSLIHVFQNI 405 + + LI F I Sbjct: 467 DVNGLIDSFSGI 478 >gi|21322624|emb|CAC27151.2| inter-alpha-trypsin inhibitor heavy chain 2 [Platichthys flesus] Length = 304 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 60/199 (30%), Gaps = 23/199 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM A+ LD + + H + + E S ++Y+ Sbjct: 6 KMKQTVEAMQAILDDLTIDDHFIIVDFNHNV-RCWNEELVHGSSIQVTDAKKYIQNIK-- 62 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 T+ A+ +A Q+L + II ++DG+ Sbjct: 63 -PNGGTNINEALMRAVQMLVRASNQGMI-----------DPRSVSMIILVSDGDPTVGEI 110 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV----VNADSLIH 400 K + + +E + ++ I + L + + + + A+ L Sbjct: 111 KLSTIQKNVKRVMREE-FSLFSLGIGFDVDYDFLERIAMENRGMAQRIYANHDAAEQLRA 169 Query: 401 VFQNISQLMVHRKYSVILK 419 + +S ++ R+ + Sbjct: 170 FYSQVSSPLL-RRIDIQFP 187 >gi|156377912|ref|XP_001630889.1| predicted protein [Nematostella vectensis] gi|156217919|gb|EDO38826.1| predicted protein [Nematostella vectensis] Length = 372 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 57/191 (29%), Gaps = 34/191 (17%) Query: 235 LLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLIL 289 + FLD + L ++GL+ Y + + + + ++ + + Sbjct: 192 VKEFLDKLVSELEIGPSKSHVGLVRYNQVADTLWDFNGAENNNLKSLKDAIEKIEYLP-- 249 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-- 347 T + A+K+ + + ++ +TDG+ N Sbjct: 250 GGTRTDLALKK-------------VNEDIFSPMGGARNDVPQVLVVITDGKTNQRSEPYS 296 Query: 348 ----------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + ++ + +KIV + + S + L + + + D Sbjct: 297 SVLQPLKVRILYPRDVTEQ--DKDVKIVAVGVGHSIGKKELDIIALGDASNVFMLETFDD 354 Query: 398 LIHVFQNISQL 408 L+ I Sbjct: 355 LVRRLNAIMNS 365 >gi|307152391|ref|YP_003887775.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306982619|gb|ADN14500.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 305 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 72/199 (36%), Gaps = 35/199 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSW 271 ++ + T MA A+ ++D L+ +K+ + ++G+ +V+ + Sbjct: 51 SSSTYQSEFNAPGTTMAQEVQAVKDYVD--KNLAILKQPNQIKILGFADQVKPLTVNFTT 108 Query: 272 GTEKVRQYVTRDMDSLI------LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 +++++Q + +D L T+ A+ + L + R Sbjct: 109 DSQQIKQQLENTLDVLKSNPEDLGGGTNIDLAISEGTDALAAISDRC------------- 155 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 + ++ +TDG+ + +AKE +KI + I P+ + + + Sbjct: 156 -----RELLIVTDGQAT------INSETITQAKEKNVKINAVVIG--PDSPDIQQATQDT 202 Query: 386 PEYHYNVVNADSLIHVFQN 404 + A +L +F Sbjct: 203 GGVYLISNQASNLSPLFTE 221 >gi|224051386|ref|XP_002199700.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 565 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 51/180 (28%), Gaps = 26/180 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + + + +T K + K + + T + Sbjct: 402 LEFVSNVAKAFEISDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLSAIHNIQYMSGGTAT 461 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + + P + F+I LTDG++ + + Sbjct: 462 GDAISFTTRTVF---------------GPVKDGPNKNFLIVLTDGQSYDDVTGPAAAAK- 505 Query: 355 DKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + I + ++ + A + LK S P + + L + +I + + Sbjct: 506 ----KAGITVFSVGVAWAPLDD---LKEMASEPRESHTFFTREFTGLEQMVPDIIRGICK 558 >gi|323492793|ref|ZP_08097935.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] gi|323312864|gb|EGA65986.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] Length = 694 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 37/249 (14%), Positives = 74/249 (29%), Gaps = 49/249 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + ++F P + + V+D SGSM Sbjct: 290 SKRGTVKLTFTPGDDLTRVTQ--GRDWVFVLDKSGSM---------------------SG 326 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K + L + L + + ++ T + P+ V + + Sbjct: 327 KYSTLVEGVRQGLGKLPAEDRFRVVMFDNNTYDLTGGFVAVNPT----NVSKALQSVEQV 382 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ M A + L D+ I+ +TDG N Sbjct: 383 EPSNGTNLYEGMSAAIRRLDDDRPTG--------------------IVLVTDGVANVGV- 421 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +++ +++ T + S N L+ S +V NAD +I NI+ Sbjct: 422 -TEKRRFFELMEKHDVRLFTFIMGNSANTPLLVPMTKLSNGVATSVSNADDIIGHLMNIT 480 Query: 407 QLMVHRKYS 415 + ++ Y Sbjct: 481 SKLTYQAYR 489 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 41/411 (9%), Positives = 92/411 (22%), Gaps = 72/411 (17%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-VFYNTEIQNIVNSS 125 ++ + L D + F +++ + + V + Sbjct: 316 DYAADNGSTAGEEDVNVPAQTVQLWGDERNFENSYLDYNGAYIKKWVEPVLSNNPTSDLH 375 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN--------QRYNQKIVSFI 177 T + N LD + + L + + I S Sbjct: 376 PEDATTLYNVYLDVIGSEKQEIS--PIDIVFVLDKSASMNEGTLEGGGQSKNAALIESVN 433 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAM--------NSDPEDVNSAPICQDKKRTKMA 229 + I +V S D+N ++ + Sbjct: 434 EISENLLSDPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTP 493 Query: 230 A--------LKNALLLFLDSIDLLSHVKEDVYM----GLIGYTTRVEKNIEPS------- 270 LKN + E + + G ++ + + Sbjct: 494 IGGTPLTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSIWGAWS 553 Query: 271 -WGTEKVRQYVTRDMDSLI----LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--K 323 W + + R T + + TN + K Sbjct: 554 SWSVMGDKIAIDRGDLFKNFETFSGNTS-NAGFTYPVTYASDFDRPVNGTNVEYRYGEVK 612 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKIC-----------DKAKENFIKIVTISINAS 372 ++ + + N +K +EN I +I + Sbjct: 613 EGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGID 672 Query: 373 PN----------GQRLLKTCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 + G+ +LK + +Y+ N + +I ++IS Sbjct: 673 GSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 723 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 59/177 (33%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + Q + T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSQEIPAIT--YRGGGT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 168 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 206 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 + ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 207 VAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|163857684|ref|YP_001631982.1| hypothetical protein Bpet3372 [Bordetella petrii DSM 12804] gi|163261412|emb|CAP43714.1| putative membrane protein [Bordetella petrii] Length = 706 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 14/172 (8%), Positives = 43/172 (25%), Gaps = 12/172 (6%) Query: 18 EKANFSIIFALSVMSFLLLIGF-LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ + ++ A + + L+ + Y K ++ A + A L+ + + Sbjct: 14 QRGSIAVSTAFA--VLIGLVMLLSAQIGYLFYMKRELQKAADLAALSAVQVLAPTGAASD 71 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 + A+ ++ + +A + + + S Sbjct: 72 CAAG-------SPVAVAAQTSAVANVPAFVDSIAAANVTVDCKFWDPARADSTGMHLFEP 124 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 +S + V ++ VS + Sbjct: 125 DAASG-GRVNAVRVRIDKTLTALI-PSVIGNWMGGTEVSVVAVASNTAPTAA 174 >gi|261819810|ref|YP_003257916.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261603823|gb|ACX86309.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 403 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 72/220 (32%), Gaps = 39/220 (17%) Query: 193 IELVVDLSGSMHCAMNSDPED------VNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + +++D SGSM +M++ P++ + +++ +++ + + +D + Sbjct: 204 VVIILDASGSMALSMDATPDELERWLEGDELVRDIEREPRRISLARQSTSNIIDKLPK-- 261 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + P K + + T A++QA +++ Sbjct: 262 DMNISLVAAADCRKVTASTPFPP----SKRTALKNQINRIEPIGKTALAEALEQAGKLVD 317 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKI 364 ++ + I+ +TDG+ ++ K K++ ++I Sbjct: 318 GVERDAI-------------------ILLVTDGDETCGGDPC---EVASKLKKSKPRLQI 355 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I N ++ + V N ++ Sbjct: 356 NVVDI---LNTGAGNCIASNTGGSVFAVNNTQEFSNIMNK 392 >gi|171695786|ref|XP_001912817.1| hypothetical protein [Podospora anserina S mat+] gi|170948135|emb|CAP60299.1| unnamed protein product [Podospora anserina S mat+] Length = 345 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 51/179 (28%), Gaps = 25/179 (13%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D ++ A K + + S + + + ++ + E + Sbjct: 5 DPSNLRLQAAKLLVGKLIPSSGTTT-GQTADQVTIVQFNETAEVLYPL-GDPSGAGATID 62 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 T + A LT + I LTDG + Sbjct: 63 GI---PANGGTFIGGGISAAVDELTRSGNKPDAAG----------------IFVLTDGAD 103 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLKTCVSSPEYHYNVVNADS 397 + +TI ++A++ I++ + +A R+ +SS V A+ Sbjct: 104 DPPSLISDTIDSINRAQQAGIRV-SFGFLSVDAEEQDSRITSAILSSGGTFTTVNTAED 161 >gi|162451829|ref|YP_001614196.1| cell wall anchor domain-containing protein [Sorangium cellulosum 'So ce 56'] gi|161162411|emb|CAN93716.1| LPXTG-motif cell wall anchor domain protein [Sorangium cellulosum 'So ce 56'] Length = 406 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 51/249 (20%), Gaps = 27/249 (10%) Query: 180 LLRIEMGERPIFLIELVVDLSGSM---HCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + D+SGSM + + + L + Sbjct: 84 ATQSASADLQPVYLAFAFDVSGSMGMGDEPWYDKELKWDPVVAATKQFFS--DPLSEGIN 141 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 L ++ Y T S E+ VT + T + Sbjct: 142 ASLRFFPSDDEDRD-EKCSAETYETPEVPMTPLPSPRFEEAITAVTPATSADWRGGTPT- 199 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + + ++ G Sbjct: 200 ----------LFVMQGTISYVEEQRRATPGRYAIVLVTDGYPAGCGTESNRIQRVSDAAA 249 Query: 356 KAKENFIKIVTISI------NASPNGQRLLKTC-VSSPEYHYNVV--NADSLIHVFQNIS 406 A E+ + I + A + L + E Y + N + F+ Sbjct: 250 GALEDGVPTYVIGVDNPPIEGAPHDLDNLHQIADAGGTERAYLIDTGNPGATSAAFKAAI 309 Query: 407 QLMVHRKYS 415 + S Sbjct: 310 DAIRGASVS 318 >gi|255327041|ref|ZP_05368117.1| conserved hypothetical protein [Rothia mucilaginosa ATCC 25296] gi|255296258|gb|EET75599.1| conserved hypothetical protein [Rothia mucilaginosa ATCC 25296] Length = 197 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 14/157 (8%), Positives = 46/157 (29%), Gaps = 8/157 (5%) Query: 3 LLSRFRFYFKKGIAS------EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 L R R + A E+ + + + + LL+ + + + + ++ Sbjct: 42 LQVRARVQSRALAADPVASNPEEGSIAPLIVGMLALLLLIGSVTVAITGAYLQTQHLQDV 101 Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +A + M + + + + + E++S S + Sbjct: 102 ADAQANSITRTMRT--PDEASGSAAWEYASAYLAEVQPGQDFQALRLENVSVDSDHSVHV 159 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + + +S+ + + ++ +S Sbjct: 160 YLSARIRPPLLSILVPDGIEVTAHGSSRLKTSQGPSS 196 >gi|15807991|ref|NP_285654.1| hypothetical protein DR_A0331 [Deinococcus radiodurans R1] gi|6460745|gb|AAF12450.1|AE001863_75 conserved hypothetical protein [Deinococcus radiodurans R1] Length = 494 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 21/255 (8%), Positives = 59/255 (23%), Gaps = 62/255 (24%) Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 L S + + + R + + Sbjct: 59 TLTVKPTAEARQARPDLSVVFVVDTSGSMRE--------VVTEPTERTGRTTQVDGQVYE 110 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VV + KM + AL + + ++ + Sbjct: 111 VV------------------------KGGKNKMQLMIEALRGIV----TSTLIRPGDRLA 142 Query: 256 LIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 L+ + E + + ++ V + T M+ +LT ++ Sbjct: 143 LVKFDDTAEVLVPFITASNQAQLAAAVDKLD--WYSGGTHMGAGMRAGAGLLTGERGS-- 198 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 + ++ ++DG+ + + I + + + +++ Sbjct: 199 -----------------RRMVLISDGQTFDAPLVEDQIGALAGLQ---VPVTVVAVGDEV 238 Query: 374 NGQRLLKTCVSSPEY 388 N L + Sbjct: 239 NADLLTAIADRTQGQ 253 >gi|307825380|ref|ZP_07655599.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733555|gb|EFO04413.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 600 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 20/167 (11%), Positives = 52/167 (31%), Gaps = 13/167 (7%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTS 307 ++D L+ Y + TE + ++ D + + +++ A+++A ++L Sbjct: 129 RKDGQTALLVYAGDAFTVTPLTDDTETIDSQLSALNTDIMPSQGSNTASALEKAVELLKQ 188 Query: 308 DKKRSFFTNFFRQGVKIPSLPF-----QKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + GV + K+ + + ++ K+ Sbjct: 189 AGLQKGRILLVTDGVDMDKTLAAVKTLDKYQLSILGVGTDDGAPIALPEG--GFLKDERG 246 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVV-NADSLIHVFQNISQL 408 IV N L K + + + N + V ++ + Sbjct: 247 NIVV----PKLNSSDLEKLAQAGNGVYQTITANDADIQTVLASMDRS 289 >gi|303241714|ref|ZP_07328211.1| hypothetical protein AceceDRAFT_3559 [Acetivibrio cellulolyticus CD2] gi|302590715|gb|EFL60466.1| hypothetical protein AceceDRAFT_3559 [Acetivibrio cellulolyticus CD2] Length = 318 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 14/138 (10%), Positives = 42/138 (30%), Gaps = 12/138 (8%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + + S+IF S + +LL + ++ M+ + GA+ ++ + + Sbjct: 10 LLKNRSGSTSVIFLCSAVIIILLSAIATDIGYIAIERYKMDRVLDQIAKIGATALIVDKN 69 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + ++I + S I I + ++ Sbjct: 70 ECTKVIKENAVKR------------IDNITKLEINVSDNNREMSINIERKLDYIFLKYIG 117 Query: 134 NNRLDSSNNTIFYNMDVM 151 + ++ +V Sbjct: 118 FKDKNINSRVTAKVSNVT 135 >gi|224051388|ref|XP_002199708.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 417 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 51/180 (28%), Gaps = 26/180 (14%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L + + + +T K + K + + T + Sbjct: 254 LEFVSNVAKAFEISDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLSAIHNIQYMSGGTAT 313 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ + + + P + F+I LTDG++ + + Sbjct: 314 GDAISFTTRTVF---------------GPVKDGPNKNFLIVLTDGQSYDDVTGPAAAAK- 357 Query: 355 DKAKENFIKIVTISIN-ASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVH 411 + I + ++ + A + LK S P + + L + +I + + Sbjct: 358 ----KAGITVFSVGVAWAPLDD---LKEMASEPRESHTFFTREFTGLEQMVPDIIRGICK 410 >gi|325525566|gb|EGD03356.1| TadE family protein [Burkholderia sp. TJI49] Length = 136 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 40/131 (30%), Gaps = 4/131 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++K + FA + LL++ ++ + Y K + +A+ AG + + Sbjct: 3 RKNQKGVAVVEFAFVLPVLLLIMFGIVEFGLFLYDKAVITNASREGARAGIV-LSNPPPT 61 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN---SSRISMTH 131 + N+ L+ + + G +Q + ++ Sbjct: 62 SSSVQTVVQNYTSNHLLTFGMQNTPTVASSGVGGSFGTPATVTVQYQFSGLVLGQLFTAL 121 Query: 132 MANNRLDSSNN 142 L ++ Sbjct: 122 NGPITLSATTT 132 >gi|218509982|ref|ZP_03507860.1| hypothetical protein RetlB5_22280 [Rhizobium etli Brasil 5] Length = 168 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 47/121 (38%), Gaps = 7/121 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + + +I FA+ + +++ +I V + + A +A++ + + + Sbjct: 10 LRRLLGDRQGVAAIEFAILALPLFIMLFGIIEVSLMFFVNS----ALDASVHKISRMIRT 65 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFI-KNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + +++ + D F + + ++ S + T I NS +++ Sbjct: 66 G--EVASSKITLAGFKAKICDDMLLSFNCSSDLVVKVNVLSDLSSATSTDPIDNSGNLAV 123 Query: 130 T 130 T Sbjct: 124 T 124 >gi|291569126|dbj|BAI91398.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 396 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 64/197 (32%), Gaps = 29/197 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL--LS 246 P I ++VDLSGSM+ S ++ +A FL+ I Sbjct: 69 PPAWIVVLVDLSGSMNELDTSGK--------------RRIDGALDATRRFLEQISDRGGD 114 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTE-------KVRQYVTRDMDSLILKPTDSTPAMK 299 V G G + + ++ K ++ + TD + Sbjct: 115 TKVAIVPFGEGGRNCPGFEVTQRGINSKFFPANDIKQTNFLDYLAAQTLCAATDIYGPLS 174 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-K 358 +A ++L + + F+ L + +I L+DG +N + + + Sbjct: 175 EAIRVLGNRQDPRFYV-----PEDSGRLEPRLSVILLSDGFHNQPNEQQDFDNLITLLER 229 Query: 359 ENFIKIVTISINASPNG 375 N I + T+ +P Sbjct: 230 NNNIIVHTLGYGLTPQQ 246 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 62/186 (33%), Gaps = 24/186 (12%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 KNAL L + + + + + RQ+V R Sbjct: 397 AKNALRYALRGLRPQDSFNIIEFNSDVSLLSSTPLPATAT-NLAMARQFVNRL---QADG 452 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T+ A+ A + + + + + +IF+TDG + Sbjct: 453 GTEMAQALNSAL----PRQAFNTASGEDKS---------LRQVIFMTDGS--VGNESALF 497 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I ++ +N ++ T+ I ++PN + + + + D + Q IS+L+ Sbjct: 498 ELIRNQIGDN--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGDVDEVE---QKISKLLA 552 Query: 411 HRKYSV 416 +Y V Sbjct: 553 KIQYPV 558 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 359 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 416 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 417 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 462 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 463 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 506 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 246 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 303 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 304 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 349 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 350 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|305663060|ref|YP_003859348.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377629|gb|ADM27468.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 411 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 58/188 (30%), Gaps = 18/188 (9%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 K+ K+A+L + +D + + L + +V K ++ + R Sbjct: 46 SSWSMDGEKIFFAKDAILNMVRMLDPDDY------ISLYSFCGKVHKVLDFIRIRDMDR- 98 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V + T++ + + + +I +TD Sbjct: 99 IVKAVAGIKLGGGTNTY-GVLEQIYMDIPSVLDRVKKEES-------DKIPSIRMIMVTD 150 Query: 339 GENNNFKSN-VNTIKICDKA-KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 G + I I ++ K I + I + N + L K + + + ++ N Sbjct: 151 GNPTVGIRDEDRIIDIAERLGKYLSISL-IIGVGDDYNERLLAKIALKTKGFFEHLDNPA 209 Query: 397 SLIHVFQN 404 + +N Sbjct: 210 KTSSILEN 217 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 242 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 299 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 300 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 345 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 346 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 242 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 299 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 300 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 345 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 346 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 578 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 635 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 636 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 681 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 682 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 725 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 281 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 338 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 339 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 384 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 385 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 428 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 317 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 374 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 375 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 420 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 421 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 464 >gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella putrefaciens 200] Length = 757 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 37/204 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV 276 + K+ALL L+ + +I + + + + + Sbjct: 384 TSGSMAGDSIVQAKSALLYALNGLKAEDSFN------IIEFNSELTQLSPTSLPANQTHL 437 Query: 277 RQYVTRDMDSLILKPT----DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T A+ + L+ + Sbjct: 438 ARARQFIHRLQADGGTEMALALNAALPRGINRLSESSQSLRQ------------------ 479 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +IF+TDG N ++ + I+ + E+ ++ T+ I ++PN + + + Sbjct: 480 VIFMTDGSVGNEQALFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYI 535 Query: 393 VNADSLIHVFQNISQLMVHRKYSV 416 N D + Q ISQL+ +Y V Sbjct: 536 GNVDEVE---QKISQLLSKIQYPV 556 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 242 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 299 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 300 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 345 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 346 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 311 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 368 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 369 TGYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 ++ +++ T S+ L+ ++ Y++ + + ++ Sbjct: 415 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|67677889|gb|AAH97100.1| Cfb protein [Danio rerio] Length = 761 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 56/219 (25%), Gaps = 13/219 (5%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKK-RTKMAALKNALLLFLDSIDLLSHVKEDVY 253 D G D+ A D K + ++ I Sbjct: 240 FEDDQHGKKISLDRGGKLDIYIAVDASDSIDPKDFDKAKKIIKTLIEKISYYEVSPNYEI 299 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + ++ K + EK R V D + I K Sbjct: 300 LMFATDVDQIVKMRDFKTN-EKARNIVKVFEDLDNFNYDKKGD--RTGTNIAKLYLKILD 356 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-------KENFIKIVT 366 + + K L Q II TDG+ N + + + +EN + + Sbjct: 357 SMSLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYV 416 Query: 367 ISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 + + + + + + + + D + + F Sbjct: 417 FGVGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFD 455 >gi|32471259|ref|NP_864252.1| signal peptide [Rhodopirellula baltica SH 1] gi|32396961|emb|CAD71931.1| hypothetical protein-signal peptide prediction [Rhodopirellula baltica SH 1] Length = 154 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 39/131 (29%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 + FA+ + FLLL+ + + + +A GA + + Sbjct: 22 KRAGATATEFAIVLPMFLLLVFACCDFARVIHFRQLVANAARVGATHGALNRFTAATESD 81 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 R + ++ + + + I S+ E + + + + Sbjct: 82 WRNDVVNVMREELAHLTSTDPNDSVIDVHFRDLSSGVRVVETEVTLPFRTVVQWPVLPTE 141 Query: 137 LDSSNNTIFYN 147 + ++ F+ Sbjct: 142 IQVHHHAEFHQ 152 >gi|242007523|ref|XP_002424589.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508032|gb|EEB11851.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1003 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 50/168 (29%), Gaps = 22/168 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE----PSWGTEKVRQYV-TR 282 + +K L+S+ V + V+ + T +++ + Sbjct: 253 LDIMKITARALLNSLGENDFVNVAW------FNNDVKWVVPCLKTLVQATTQIKNLLADA 306 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T A+ AY+ ++ K ++FL+DG Sbjct: 307 IERLTESNLTSYVTALDFAYEEFRKFEEIKKPWIGSNCH---------KIVMFLSDGGTE 357 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 +V C+ + IKI T + P +LK S ++ Sbjct: 358 WPT-DVINRH-CNNSNNENIKIFTFACGPHPIPTVILKEMACSTGGYF 403 >gi|111026119|ref|YP_708402.1| hypothetical protein RHA1_ro10051 [Rhodococcus jostii RHA1] gi|110824962|gb|ABH00244.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 153 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 36/151 (23%), Gaps = 17/151 (11%) Query: 7 FRFYFKKGIASE-------KANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANN 58 +K +AS K +++ + + + G ++ ++ Sbjct: 1 MTNALRKRLASRLPEEHRDKGAVTLMTVVFAPVLIFFVWGLIVDGGGMLTADQRADNVAE 60 Query: 59 AAILAG---------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 A A A + + + + + + + D + + Sbjct: 61 DAARAAGQQIIGSVSARGIDTVVDPVRATAAAKRYLFEAGVDGDVIPTGPRTLLITTRIV 120 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + ++ N + Sbjct: 121 YNSKLLPYPRTKLLVGTAAVNLNRTNAGEVY 151 >gi|148656025|ref|YP_001276230.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568135|gb|ABQ90280.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 932 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 27/248 (10%), Positives = 76/248 (30%), Gaps = 42/248 (16%) Query: 165 LNQRYNQKIVSFIPALLRIEMG-ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + + +P + I + P + +V+D+SGSM + P Sbjct: 383 PGGWRGTPVEAALPVTMDIPIYRTMPPVSVVIVIDISGSMAMTEDGIP------------ 430 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 K++ + + + ++ + R + P G+++ + Sbjct: 431 ---KLSLALDGARRIASLLRDED------ELTILPFDDRPGVVVGPLPGSQRDKAIEQMS 481 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 L + A+ A + + S + II +TDG + Sbjct: 482 QVRLGGSGINIHDALVAAARYV------------------RASDRPIRHIITITDGNDTV 523 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + + I ++ + + +I++ + + + A L + Sbjct: 524 QQ--EGALDIVRALRDERVTLTSIAVGQGSHVPFIRDMAAVGGGRTFLTERAADLPDLLL 581 Query: 404 NISQLMVH 411 + +++++ Sbjct: 582 DEAEMIIQ 589 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 11/127 (8%), Positives = 34/127 (26%), Gaps = 8/127 (6%) Query: 53 MESA-NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI-KESLSGYS 110 + +A + A++A A+ + L+ ++ N +RA + + + ++ Sbjct: 315 LRAAQIDVAVVAPAAMPDTLLAMSQYDAIALVNVPRRAFSESTLQHLATYVHDRGGGLIM 374 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 + + + + Y S + + Sbjct: 375 VGGPRSFGPGGWRGTPVEAALPVTMDIPI------YRTMPPVSVVIVIDISGSMAMTEDG 428 Query: 171 QKIVSFI 177 +S Sbjct: 429 IPKLSLA 435 >gi|328675376|gb|AEB28051.1| hypothetical protein FN3523_0194 [Francisella cf. novicida 3523] Length = 332 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 50/159 (31%), Gaps = 33/159 (20%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGAQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTIS 368 II +TD + I + + IK + I Sbjct: 194 -IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 28/242 (11%), Positives = 72/242 (29%), Gaps = 33/242 (13%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----KNALLLFLDS 241 ++ + + + A ++ + +L K + F+ Sbjct: 20 AQQQEVIAPGNSERNSCPEKADCPINVYFVLDTSESVTMQSPIDSLLYHMKQFVRQFISQ 79 Query: 242 IDLLSHVKE---DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + +++++ G + ++ V P + + + T + A+ Sbjct: 80 LQDETYLEQVALSWRYGGLHFSDVVRVFSPPDSDRASFTKSLESIVSIRK--GTFTDCAL 137 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 N ++ ++ S F + +TDG ++A+ Sbjct: 138 ----------------ANMTQEIRQLKSKGGVHFAVVITDGYVT-GSPCGGIKLQAERAR 180 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSP-----EYHYNVVNADSL-IHVFQNISQLMVHR 412 E I+I T++ ++ L+ S P + V + I ++M H Sbjct: 181 EEGIRIFTVA-PDQVPNEQGLRDMASMPLELYRNNYATVRPDLEIDQDTINRIIKVMKHE 239 Query: 413 KY 414 Y Sbjct: 240 AY 241 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 599 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 658 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 659 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYNQLIKES 716 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 717 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----NHDVTVTAIGI 757 Query: 370 NA 371 Sbjct: 758 GD 759 >gi|296473248|gb|DAA15363.1| integrin alpha M [Bos taurus] Length = 1152 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 20/166 (12%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 K L+ Y+ + + + + T + +++ + L Sbjct: 182 KSKTLFSLMQYSDDFQTHFTFNDFKRNPVPEFLVGPIRQLFGRTHTATGIRKVVRELFHS 241 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K +I +TDGE + + + +A I I Sbjct: 242 SSGARNHAI-------------KIMIVITDGEK--YLDPLEYSDVIPEADRKKIIRYVIG 286 Query: 369 INA---SPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + S ++ L T S P ++ + V N ++L + + + + Sbjct: 287 VGDAFRSRKSRQELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|149197817|ref|ZP_01874866.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] gi|149139038|gb|EDM27442.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] Length = 1078 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 52/192 (27%), Gaps = 22/192 (11%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTP-- 296 DS S+ G + R+ + K T+ Sbjct: 731 DSQKPGSYTFVIDNSGSMAAENRLPLIQKTLPNMFKAMNQDDEVTILSCEGGVTNLANRI 790 Query: 297 ------AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ---KFIIFLTDGENNNFKSN 347 ++ A + + + + +I L+DG + + Sbjct: 791 TASNHSQLETAVKNIEAGTVANLSVGIEEAYKLAAQNFRSGAVNRVILLSDGIASLGEKE 850 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSLIHV---- 401 +K + ++ I I + + L+T + +Y + + + + Sbjct: 851 AQEVLKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGVYYFGDSKEQMNDILVNN 910 Query: 402 ----FQNISQLM 409 F+ I++ + Sbjct: 911 FEASFKTIARDV 922 >gi|330982726|gb|EGH80829.1| hypothetical protein PSYAP_30018 [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 162 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 11/144 (7%), Positives = 42/144 (29%), Gaps = 4/144 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKR--ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + D + + ++ + + + ++ + Sbjct: 72 ASAPDFANQSATRNGFVPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVAAHPV 131 Query: 133 ANNRLD-SSNNTIFYNMDVMTSYD 155 + + Sbjct: 132 PRSIAAGIGALFEKTPSPPNVTIS 155 >gi|325109822|ref|YP_004270890.1| hypothetical protein Plabr_3271 [Planctomyces brasiliensis DSM 5305] gi|324970090|gb|ADY60868.1| hypothetical protein Plabr_3271 [Planctomyces brasiliensis DSM 5305] Length = 508 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 16/202 (7%), Positives = 52/202 (25%), Gaps = 3/202 (1%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 + I A+ + + + L+G + +AA A ++ S Sbjct: 24 ARSGKVLAIMAIVLPTIVALLGLTFDSSLLLQNYREQQQIADAAATLAARRIADGDSLAE 83 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 R +I + A + + + ++ E+ ++ + ++ Sbjct: 84 ARAAAI--GRVQQTNGWADAAVTVNQGPTSGSFAGDSDFVEVILTRDAPTYFIKAVSLAE 141 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + + + + + I +P+ + + + Sbjct: 142 -TVETRAVAGIRESTADTAVVVLEEDPAGIRLTGLPISLSLPSTNLGGFENIGLGQLSVN 200 Query: 197 VDLSGSMHCAMNSDPEDVNSAP 218 + + + D P Sbjct: 201 GAVIVNTEWGGFDENNDPVGCP 222 >gi|295837154|ref|ZP_06824087.1| membrane protein [Streptomyces sp. SPB74] gi|197699506|gb|EDY46439.1| membrane protein [Streptomyces sp. SPB74] Length = 153 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 16/141 (11%), Positives = 41/141 (29%), Gaps = 2/141 (1%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ +A ++ + + L + FL L F++ ++ A A + Sbjct: 13 RRSLADDRGSGAGAVILFALVFLSLAAFVVDGGLSISQRERAADLAEQAARYAAQDIDRE 72 Query: 72 L--SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 G+ + N R A+ + + A + E++ + + Sbjct: 73 ALYGNEGEAPINEGNCPARVAAFAAESGMSGADAAASGCVEADADHVEVRIQLTYRPVFT 132 Query: 130 THMANNRLDSSNNTIFYNMDV 150 + S + N Sbjct: 133 GIFYGGSIHVSGTAVAENKVG 153 >gi|156358477|ref|XP_001624545.1| predicted protein [Nematostella vectensis] gi|156211332|gb|EDO32445.1| predicted protein [Nematostella vectensis] Length = 1269 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 11/155 (7%), Positives = 32/155 (20%), Gaps = 23/155 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL- 289 K + + + +GL ++ +K + Sbjct: 932 FKQIINFVWNVVQSFQVSAHFTRVGLAIFSLEAFVIFNFHTYYDKNNILDALNNVQYPGT 991 Query: 290 --KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A + L +I + G + + Sbjct: 992 DGPGTYIGRGLHVAKEYLFE---------------ASGRPRVPHALIVVAAGRSLDDVIT 1036 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + I T+ + ++ Sbjct: 1037 PSL-----SLRRMGVDIYTVGVGKFYTKLQMHAMA 1066 >gi|156358475|ref|XP_001624544.1| predicted protein [Nematostella vectensis] gi|156211331|gb|EDO32444.1| predicted protein [Nematostella vectensis] Length = 638 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 11/155 (7%), Positives = 32/155 (20%), Gaps = 23/155 (14%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL- 289 K + + + +GL ++ +K + Sbjct: 138 FKQIINFVWNVVQSFQVSAHFTRVGLAIFSLEAFVIFNFHTYYDKNNILDALNNVQYPGT 197 Query: 290 --KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + A + L +I + G + + Sbjct: 198 DGPGTYIGRGLHVAKEYLFE---------------ASGRPRVPHALIVVAAGRSLDDVIT 242 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 + + + I T+ + ++ Sbjct: 243 PSL-----SLRRMGVDIYTVGVGKFYTKLQMHAMA 272 >gi|283779589|ref|YP_003370344.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438042|gb|ADB16484.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1740 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 44/361 (12%), Positives = 97/361 (26%), Gaps = 39/361 (10%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 S V ++ E+ + I + + + + + S Sbjct: 1068 SAPVPAIAASLPEAEAAIYFTDDQGRVGSTARILDTKSLMTQSNPVAWESVFVAAGLESR 1127 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 M +AN R + + ++ P + I Sbjct: 1128 GPLMAAVANVRGNEFAAPLLLRAAGGKRIEFAKPTYG---------------PPTVTIHG 1172 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 R + V+D S +M + D Q +M K AL L + Sbjct: 1173 FRRRQASVVFVLDCSHTMKALDEVESPDGGVRATSQ-----RMELAKGALRSMLTQLAER 1227 Query: 246 SHVKEDVYMGLIGYTTR-VEKNIEPSWGTEK------VRQYVTRDMDSLILKPTDSTPA- 297 + V VE N ++ Y + + + Sbjct: 1228 GDARVGVRFFGHRVGWSTVEANKLLRQNRYAGEIPPELQPYADVENILPLGRFDSVIAGR 1287 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLP--FQKFIIFLTDGENNNFKSNVNTIKICD 355 + +A + + + + + +K I+ +TDG+N F + + D Sbjct: 1288 VFEAMKTVEPWGESPIYLSITEALRDFAQEDGESEKSIVVITDGKNYQFNAPSQLARTKD 1347 Query: 356 K-AKENF---IKIVTISINASPN-----GQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 IK+ + N P+ + + ++ +A SL+ +++ Sbjct: 1348 DVLAARGSRDIKVHIVGFNIEPSESEVASREFREIAEATGGEFLPATSAGSLVRSLESVL 1407 Query: 407 Q 407 + Sbjct: 1408 R 1408 >gi|167006295|ref|YP_001661518.1| membrane protein [Streptomyces sp. HK1] gi|166162377|gb|ABY83498.1| membrane protein [Streptomyces sp. HK1] Length = 148 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 4/127 (3%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + S+ FA++ ++ L+++G L+ N S A ++ + G Sbjct: 19 GSMSLFFAVTTVAILMVMGLLVDGGGALNAANRATSIAQEAARTAGQQLDPAQAIEGTAI 78 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + A A D + ++ T + + + + Sbjct: 79 TIDPDAATGAAQDYLAT---ADVTGNVEITDGGQSITVTV-HDTYNTLFAQFVGKGTIHV 134 Query: 140 SNNTIFY 146 + Sbjct: 135 TGTATAQ 141 >gi|85706702|ref|ZP_01037794.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] gi|85668760|gb|EAQ23629.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] Length = 240 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 33/126 (26%), Gaps = 15/126 (11%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D+ A+ + LT + + ++ +TDG + Sbjct: 96 DAGAAVVAGIEALTPGGLTPIAASVAAAAEVLGYRTHPGIVVLVTDGNETCG-GTPCALG 154 Query: 353 ICDKAKENFIKIVTISINA-------------SPNGQRLLK-TCVSSPEYHYNVVNADSL 398 A+ + + I +GQ + K ++ + D L Sbjct: 155 TALAAEARDLTVHVIGFRVVHDPFSWNSPEAKGYDGQTVAKCLADATGGLFVSTETVDEL 214 Query: 399 IHVFQN 404 + + Sbjct: 215 VAALRE 220 >gi|88801114|ref|ZP_01116660.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] gi|88776143|gb|EAR07372.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] Length = 553 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 48/152 (31%), Gaps = 24/152 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQ 278 ++ ALK AL + V +GL+ + RV ++ + ++ Q Sbjct: 388 SGSMDGDRIRALKIALDE------SANFVSSRNSIGLVTFNDRVNVDLPIREFDLQQKSQ 441 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 ++ T + A+ A L + K + I L+D Sbjct: 442 FLGAVERMSAGGGTATNDAILVAAHELLNFAKT--------------HPEHKLTIFVLSD 487 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 GE N + K+ I + +I+ Sbjct: 488 GETRNGLPLGDVEKVIQML---NIPVHSIAYG 516 >gi|326500868|dbj|BAJ95100.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 752 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 32/264 (12%), Positives = 79/264 (29%), Gaps = 39/264 (14%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F + + + +R IF I L+ + + + V+++ Sbjct: 280 DFNFSYSVYSGDVSGGVLVQRSTLRDYDDRDIFSIFLLPGNNQNRKIFRKAVVFIVDTSG 339 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG-LIGYTTRVEKNIEPSWGTEKVR 277 Q K + +KNA+ + ++ + + L +++ +EK + Sbjct: 340 SMQGKP---IENVKNAISTAVSELEEGDYFNIVTFNDELHSFSSCLEKV-----NGKTTE 391 Query: 278 QYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + TD + +A +L++ I + Sbjct: 392 SAIDWMNSKFVAQGGTDIMHPLTEALALLSNSHGALPQ------------------IFLV 433 Query: 337 TDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 TDG + ++ +T+K + + I +I T + + N L ++ + Sbjct: 434 TDGSVEDERNICHTVKT-ELTNKGSISPRISTFGLGSYCNHYFLRMLASIGKGHYDAAFD 492 Query: 395 ADSL--------IHVFQNISQLMV 410 S+ I + Sbjct: 493 TGSIEGRMLQWFQKASSTIVADIS 516 >gi|311694769|gb|ADP97642.1| von Willebrand factor, type A [marine bacterium HP15] Length = 581 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 36/130 (27%), Gaps = 10/130 (7%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ A++ A +D S K+ D N + Sbjct: 119 TNLGAAIETASDDYFTDGDLSRTHFILLTDGKVD---------ISDDPAKNTAEETRILD 169 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQLMV 410 I E ++++ + + L S AD+L F Q ++ + Sbjct: 170 TIVADLIERGATFHPVALSEAADTDFLKALATDSGGRFQVADTADALNLAFLQALNTAVP 229 Query: 411 HRKYSVILKG 420 + + G Sbjct: 230 QEQIPIEGNG 239 >gi|326672458|ref|XP_695815.4| PREDICTED: von Willebrand factor A domain-containing protein 2 [Danio rerio] Length = 795 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 42/155 (27%), Gaps = 20/155 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ Y R + + + A+ Sbjct: 562 FDINRDVAQVGLVVYGRRPVTVFDLDKYNSGSAVLRAVGDAAFLGGKASVGSALLH---- 617 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + K K ++ LTDG + K +++ + I Sbjct: 618 ---------VLSQSLTVGKGARPGVNKAVVVLTDG---TGVEDAAVP--AQKIRDSGVSI 663 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 I + + LL+ S ++ V + D L Sbjct: 664 FLIGVG-DIQQELLLRIS-GSEDHMITVPSYDDLK 696 >gi|272938026|gb|ACZ96962.1| vwa2 protein [Danio rerio] Length = 260 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 42/155 (27%), Gaps = 20/155 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 ++ +GL+ Y R + + + A+ Sbjct: 35 FDINRDVAQVGLVVYGRRPVTVFDLDKYNSGSAVLRAVGDAAFLGGKASVGSALLH---- 90 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + K K ++ LTDG + K +++ + I Sbjct: 91 ---------VLSQSLTVGKGARPGVNKAVVVLTDG---TGVEDAAVP--AQKIRDSGVSI 136 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 I + + LL+ S ++ V + D L Sbjct: 137 FLIGVG-DIQQELLLRIS-GSEDHMITVPSYDDLK 169 >gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1] gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1] Length = 757 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 64/204 (31%), Gaps = 37/204 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV 276 + K+ALL L+ + +I + + + + + Sbjct: 384 TSGSMAGDSIVQAKSALLYALNGLKAEDSFN------IIEFNSELTQLSPTSLPANQTHL 437 Query: 277 RQYVTRDMDSLILKPT----DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T A+ + L+ + Sbjct: 438 ARARQFIHRLQADGGTEMALALNAALPRGINRLSESSQSLRQ------------------ 479 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +IF+TDG N ++ + I+ + E+ ++ T+ I ++PN + + + Sbjct: 480 VIFMTDGSVGNEQALFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYI 535 Query: 393 VNADSLIHVFQNISQLMVHRKYSV 416 N D + Q IS+L+ +Y V Sbjct: 536 GNVDEVE---QKISKLLSKIQYPV 556 >gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens CN-32] gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32] Length = 757 Score = 44.1 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 64/204 (31%), Gaps = 37/204 (18%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV 276 + K+ALL L+ + +I + + + + + Sbjct: 384 TSGSMAGDSIVQAKSALLYALNGLKAEDSFN------IIEFNSELTQLSPTSLPANQTHL 437 Query: 277 RQYVTRDMDSLILKPT----DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + T A+ + L+ + Sbjct: 438 ARARQFIHRLQADGGTEMSLALNAALPRGINRLSESSQSLRQ------------------ 479 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +IF+TDG N ++ + I+ + E+ ++ T+ I ++PN + + + Sbjct: 480 VIFMTDGSVGNEQALFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYI 535 Query: 393 VNADSLIHVFQNISQLMVHRKYSV 416 N D + Q IS+L+ +Y V Sbjct: 536 GNVDEVE---QKISKLLSKIQYPV 556 >gi|310640817|ref|YP_003945575.1| protein [Paenibacillus polymyxa SC2] gi|309245767|gb|ADO55334.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 738 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 16/58 (27%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E ++ L + I I E +AA+ + S L + Sbjct: 12 ESGAVTVFLILILAVMFGFIAVFIDYARIAALHVQTERLTHAAVRSVMSAYDPALQQE 69 >gi|94989622|ref|YP_597722.1| collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|94543130|gb|ABF33178.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|198417325|gb|ACH87880.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417327|gb|ACH87881.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417329|gb|ACH87882.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417331|gb|ACH87883.1| ancillary protein 1 [Streptococcus pyogenes] Length = 890 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 90/361 (24%), Gaps = 43/361 (11%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 A ++ G A R ++ K + + + + + N N Sbjct: 267 VALVTYASTIFDGTEATVSKGVADRNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDAN 326 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 I + + + N Y + ++ E L Q N + + Sbjct: 327 EVNILKSRIPKEAEHINGNRTLYQFGATFTQKALMKANEILETQSSNDRKKVIFHVTDGV 386 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 I + S + S + +D + +S Sbjct: 387 PTMSYAINFNPYIST-SYQNQFKSFLNKTPDRSGILQEDFIINGDDY--QIVKGDGESFK 443 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 L S K V G +V + + A+ + Y Sbjct: 444 LFSDRKVPVPGGTTQAAYQVPQ---------------NQLSVMSNEGY-----AINRGYI 483 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + T GE N K Sbjct: 484 YLY------WRDYNWVYPFDPKTKTVSATKQIKTHGEPTTLYFNG-------NIKPKGYD 530 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I T+ I + + + +++ S E + NV + + + + +V K+S+ Sbjct: 531 IFTVGIGVNGDPGATPLEAKEFMQSISSKTENYTNVDDTNKIYDELNKYFKTIVEEKHSI 590 Query: 417 I 417 + Sbjct: 591 V 591 >gi|77552210|gb|ABA95007.1| expressed protein [Oryza sativa Japonica Group] Length = 629 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 64/206 (31%), Gaps = 35/206 (16%) Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 S ++ L+ A+ L + ++ + ++ + + V + + Sbjct: 127 SCGGGGGAPARRLDLLRKAMDLVIGNLGADD------RLAIVPFHSSVVDATGLLEMSVE 180 Query: 276 VRQYVTRDMDSLI-LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 R +R + SL T PA+ A +IL + + R G ++ Sbjct: 181 GRGVASRKVQSLAVAGGTKLFPALNAAVEILEAR---CWEAKRERVGA----------VV 227 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 ++DG++ I +A + + + + + + + + + Sbjct: 228 LISDGDD---------RTIFREAINPRYPVHAFGFRGAHDARAVHHVADHTSGVYGVLDD 278 Query: 395 ADS-LIHVFQNISQLMVHRKYSVILK 419 + F R+ + ++ Sbjct: 279 EHDRVTDAFAA-----CVRRVTSVVA 299 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 27/229 (11%), Positives = 68/229 (29%), Gaps = 44/229 (19%) Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 K+V I + P + ++D+SGSM+ K+ Sbjct: 230 HKLVQIGLQGKTISTAKLPSSNLVFLIDVSGSMN-------------------DSNKLPL 270 Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 L ++ L D + + ++ Y + + G +K Sbjct: 271 LVSSFKLLTDQLRKTD------RVAIVVYAGNSGLVLPSTSGDQKTTIK-DALNKLSAGG 323 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVN 349 T ++ AY++ + + +I TDG+ N S+ + Sbjct: 324 STAGGAGIRLAYEVAAKNYIKGGNN----------------RVILATDGDFNVGASSDED 367 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADS 397 K+ ++ +++ + + + ++ + + N + Sbjct: 368 MEKLIEEKRKSGVFLTVLGFGMGNLKDSKMEVLADKGNGNYAYIDNINE 416 >gi|126340390|ref|XP_001364302.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Monodelphis domestica] Length = 1092 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 49/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 272 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 329 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 330 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 371 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + K C + Y+Y + + ++ Sbjct: 372 -EIFAKYNKDKKVRVFTFSVGQHNYDRDPVKWKACRNC-GYYYEIPSIGAI 420 >gi|145516504|ref|XP_001444144.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411548|emb|CAK76747.1| unnamed protein product [Paramecium tetraurelia] Length = 829 Score = 44.1 bits (102), Expect = 0.047, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 53/184 (28%), Gaps = 29/184 (15%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVR 277 +++ K AL+LFL S+ +D +I + T K S + Sbjct: 334 SGSMSGSRIKKAKEALILFLKSL------PQDSEYNIISFGTNFTKLWNVSQNYSQNTLE 387 Query: 278 QYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T +K Q++ S + L Sbjct: 388 TAIKHVEEMDADMGGTCIIAPLK---QMIYHKNYG-------------ASKNTTLNVFLL 431 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG++ K + + +I T+ I + + + + V + + Sbjct: 432 TDGQDTADPIIDLVQK--NNLAQ--TRIYTLGIGRECSQYLIRRVAEVGNGKYQIVSDKE 487 Query: 397 SLIH 400 + Sbjct: 488 DINE 491 >gi|119890597|ref|XP_001256059.1| PREDICTED: alpha 3 type VI collagen, partial [Bos taurus] Length = 1632 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 52/167 (31%), Gaps = 17/167 (10%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ LS + + +G++ Y+ + + K Q + + + Sbjct: 261 LDFLVNLLERLSIGTQQIRVGVVQYSDEPRTMFSLNSYSTKA-QVLDAVKALGFIGGELA 319 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G +++ + Sbjct: 320 NVGLALDF---------VVENHFTRAGGSRAEEGVPQVLVLISAGPSSDEIRDG------ 364 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + + + A + L+ ++ + V SL V Sbjct: 365 -VIALKQASVFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDV 410 >gi|159900723|ref|YP_001546970.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893762|gb|ABX06842.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 421 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 66/237 (27%), Gaps = 46/237 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 Q + + + V+D SGSM K Sbjct: 20 QTQQVVYLLLDITATPAVAHVQMPVNVSFVLDHSGSM--------------------KGD 59 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM ++ A L + V ++ + ++ V + D+ Sbjct: 60 KMRCVREATQRALGLMGPQDIVSVVIFDHRRETIISAQPV----RNVAALQAEVGKIKDA 115 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T PA++ A + + + II LTDG+ + Sbjct: 116 ---GGTKIAPALEAALNEI----------------RRSQNANTISRIILLTDGQTEGERD 156 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +++ ++ + + + + + N L++ S N + + FQ Sbjct: 157 ---CLRLAEEIGKASVPLTALGVGDDWNEDLLIEMANRSGGVAEYFSNPNDIASFFQ 210 >gi|156383255|ref|XP_001632750.1| predicted protein [Nematostella vectensis] gi|156219810|gb|EDO40687.1| predicted protein [Nematostella vectensis] Length = 174 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 13/150 (8%), Positives = 44/150 (29%), Gaps = 20/150 (13%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A + + ++GL+ ++T I + + V+ ++ ++ Sbjct: 32 ANFNALMKFTTNMAKNFDISVSATHVGLVLFSTTASVKITLNEFYDIVKLTSAINLITIQ 91 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T++ ++ L + +I LT S Sbjct: 92 GGFTNTGKGLQAVKSDLFD---------------ATTRKNVPRVLIVLT-----VASSLD 131 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRL 378 + ++N + + + + + +++ Sbjct: 132 DVRAPSQALRDNSVTVYVVGVGERVDVEQM 161 >gi|87200511|ref|YP_497768.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87136192|gb|ABD26934.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 193 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 29/71 (40%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 K +A + +I FAL + FLL I + + Y ++ A A + A + Sbjct: 6 KLVAEDCGVTTIEFALVLPVFLLAIVGCLDLGQMVYAVGVLDGAVEKAARSAALETGDTT 65 Query: 73 SRLGDRFESIS 83 + + + +S Sbjct: 66 AADAEVEDVMS 76 >gi|242061830|ref|XP_002452204.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] gi|241932035|gb|EES05180.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] Length = 369 Score = 44.1 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 23/73 (31%), Gaps = 8/73 (10%) Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYN 391 ++ ++DG+ N+ + K + T A + L +S Sbjct: 20 VMLMSDGQQNHGGN-------AADVKIGNAPVYTFGFGADYDPTVLNAVARNSMGGTFSV 72 Query: 392 VVNADSLIHVFQN 404 V + D L F Sbjct: 73 VNDVDKLTMAFSQ 85 >gi|78357412|ref|YP_388861.1| hypothetical protein Dde_2369 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219817|gb|ABB39166.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 163 Score = 44.1 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 33/110 (30%), Gaps = 4/110 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + FR + S+ AL++ L+++ LI + + + A L Sbjct: 1 MHAFRLVTGRLRHDTSGLSSLELALTLPVLLMMVFGLIEFGYNLFART----TVDKAALI 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 GA V+ + R L + ++ +A Sbjct: 57 GARYAVTGQGFDDGTRHARIVQEARRLTGVLAGSSPQSVTVTIGSIAAGA 106 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 62/222 (27%), Gaps = 42/222 (18%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 I V+D+SGSM KM A+ LD + H Sbjct: 310 NILFVIDVSGSMW--------------------GVKMKQTVEAMKTILDDLRAEDHFSVI 349 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + I T + +++ T+ A+ +A IL Sbjct: 350 DFNQNI--RTWRNDLFQLQ--KHRLQIAKRYIEKIQPSGGTNINEALLRAIFILNEANN- 404 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENFIKIVTISI 369 + + II ++DG+ K + + + ++ I + ++ + Sbjct: 405 ----------LGLLDPNSVSLIILVSDGDPTVGELKLSKIQKNVKENIQD-NISLFSLGM 453 Query: 370 NASPNGQRLLKTCVSSPEYHYNV----VNADSLIHVFQNISQ 407 + L + + + + L + +S Sbjct: 454 GFDVDYDFLKRLSNENHGIAQRIYGNQDTSSQLKKFYNQVST 495 >gi|123428998|ref|XP_001307617.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121889256|gb|EAX94687.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 642 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 35/317 (11%), Positives = 89/317 (28%), Gaps = 37/317 (11%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + + K + + + ++S ++ M+N ++ + T Sbjct: 97 SFLADINENGYFYKFPLTHKYQQGSVTNDYSDKPESFHFSTSIMTQKEMSNVKVSVNGTT 156 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 + T + N + + + + + + S Sbjct: 157 NVIDSHNAT----------FVTNDTPTKDAIIIETQIKDKDKNIAISSDGYIAITTYPSF 206 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 + S+ E +++ + +F+ S+ +I + + Sbjct: 207 EGPITSNSEFYFIVDCSGSMTGSRILKAIECMRMFIQSL------PVGCRFSIIKFGSNF 260 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 ++ T+K + AM+ Y I + ++ Sbjct: 261 HTILKSCDYTDK-----------------NLANAMQLLYTINSDMGGTDIYSPLKYVSDI 303 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 P F K I LTDGE + + K + N +I +I + + + + Sbjct: 304 KPKKGFVKQIFLLTDGEVQDSDEIC---AMAYKNRSNN-RIFSIGLGSGADPGLIKGIAR 359 Query: 384 SSPEYHYNVVNADSLIH 400 S + + + D+L Sbjct: 360 KSGGNYTIIGDNDNLNQ 376 >gi|145297762|ref|YP_001140603.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] gi|88866552|gb|ABD57328.1| TapY1 [Aeromonas salmonicida subsp. salmonicida A449] gi|142850534|gb|ABO88855.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 1203 Score = 43.7 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 23/257 (8%), Positives = 56/257 (21%), Gaps = 60/257 (23%) Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI 267 + +++ K+A+ + S + T Sbjct: 222 YTANYIRWYYGPTGYAVESRLRIAKDAVKSVISSTPGVDFGLAIFNTNNSTSTNGGRVVR 281 Query: 268 EPSWGTEKV------RQYVTRDMDSLILKPTDSTPAMKQAYQ------ILTSDKKRSFFT 315 + V + + T + +AY+ +L + + Sbjct: 282 RILSNDDMVGTITAEQNLLNTVEGLSATTNTPLCETLYEAYRFYGGQPVLYGKQGGTRDK 341 Query: 316 NFFRQGVKIPSLP----FQKFIIFLTDGENNN---------------------------- 343 S ++I++TDG + Sbjct: 342 LAEDPEGTYTSPYDKCSNNGYVIYVTDGAPTSDTDADSAVKALMDSLSTDERAAYGSTVG 401 Query: 344 ---FKSNVNTIKICDKAKENFI----------KIVTISINASP---NGQRLLKTCVSSPE 387 + + K + T+ G L +T Sbjct: 402 YGSGSAKSYLAAMAGYMKNKDVNTSSPGKQTVTTFTVGFGDEAISGAGNLLAETARRGGG 461 Query: 388 YHYNVVNADSLIHVFQN 404 +Y V+A++L ++ Sbjct: 462 QYYPAVSAEALTQALRS 478 >gi|116671475|ref|YP_832408.1| TadE family protein [Arthrobacter sp. FB24] gi|116611584|gb|ABK04308.1| TadE family protein [Arthrobacter sp. FB24] Length = 129 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 12/109 (11%), Positives = 36/109 (33%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ASE+ ++ FA+ ++L+ ++ + + S+ +A + A + + Sbjct: 3 RASERGAVAVEFAILAPVLVMLLLGIMEFSRAYNAQASLSAAAREGVRVMAISNKISDAN 62 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 R ++S + + + G + + + Sbjct: 63 KAARDTAVSLQPGLKDTNIVFKNLDTGTTSCAPGNRMTITISYNLSTMT 111 >gi|71280467|ref|YP_270055.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146207|gb|AAZ26680.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 786 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 44/162 (27%), Gaps = 21/162 (12%) Query: 256 LIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +I + E + + + T+ + A + Sbjct: 433 IIAFDNDTELLFPVTHMASAHNISKAQQFIDGLSANGGTEMYRPLSNALMMKKD------ 486 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 K S + I+F+TDG N + + ++ T+ I A+P Sbjct: 487 ---------KTQSSKAIRQIVFITDGAVANEFELMQLLNTAQG----DFRLYTVGIGAAP 533 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 NG + K + + N + + + + Sbjct: 534 NGYFMKKAAQFGRGSYVFIQNKSEVQRKMSHFMTKISQPALT 575 >gi|313814173|gb|EFS51887.1| conserved hypothetical protein [Propionibacterium acnes HL025PA1] Length = 169 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 39/126 (30%), Gaps = 1/126 (0%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ AL + + L++ K +SA A AG+ + Sbjct: 36 RGTVAVEAALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVGQ 95 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 ++ A + A + + + G++ I S + + ++ Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPV-GFAGTASARVSCTIKLSDLLVPGMPGSFHIE 154 Query: 139 SSNNTI 144 S ++ Sbjct: 155 SVAHST 160 >gi|291404848|ref|XP_002718766.1| PREDICTED: von Willebrand factor A domain containing 2 [Oryctolagus cuniculus] Length = 787 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 42/129 (32%), Gaps = 24/129 (18%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQ 300 D L +E V +G + + + +V+ V R + T++ A+K+ Sbjct: 79 DALDISQERVRVGAMQFGSAPHLEFPLDAFPTRQEVKAQVKRMI--FKGGRTETGLALKR 136 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + ++ +TDG++ ++ + KE Sbjct: 137 LLHGGFPGGRNGS---------------VPQLLVIVTDGKSQGHVASP-----AKQLKER 176 Query: 361 FIKIVTISI 369 + + + + Sbjct: 177 GVTVFAVGV 185 >gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 722 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 37/305 (12%), Positives = 89/305 (29%), Gaps = 62/305 (20%) Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL-----QFIEHLLNQRY 169 T I+ + + ++ + ++N F + + +++ Sbjct: 166 ATTIKTQRDIQDLKVSVDGTKDVKDAHNATFVTNNPPAKDAIVIETHIKDEDKNVAISSD 225 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ P +VD SGSM C +++ Sbjct: 226 GYIAITTYPYFEG---PIDSNSEFYFIVDCSGSMSC--------------------SRIN 262 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + LF+ S+ ++ + + E + P T++ +D++ Sbjct: 263 NAIKCMRLFIQSL------PVGCRFSILRFGSHFETVLPPCDYTDENVANAMNLLDNISA 316 Query: 290 --KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ ++ L + + K I FLTDGE +N Sbjct: 317 NMGGTNILAPLQH-VSDLQASEGFV------------------KQIFFLTDGEVDNSDII 357 Query: 348 VNTIKICDKAKENFI-KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH-VFQNI 405 K +I +I + + + + S + + + D++ V + + Sbjct: 358 C-----ATALKNRSTNRIFSIGLGSGADPGLIKGMARKSGGNYAIIGDNDNMNEKVIEML 412 Query: 406 SQLMV 410 S + Sbjct: 413 SSAIS 417 >gi|328676286|gb|AEB27156.1| hypothetical protein FNFX1_0208 [Francisella cf. novicida Fx1] Length = 332 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 57/190 (30%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD + I + + IK + I + + + Sbjct: 194 -IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|260462604|ref|ZP_05810810.1| TadE family protein [Mesorhizobium opportunistum WSM2075] gi|259031510|gb|EEW32780.1| TadE family protein [Mesorhizobium opportunistum WSM2075] Length = 181 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 16/179 (8%), Positives = 41/179 (22%), Gaps = 12/179 (6%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M ++ + + + + + L L+L + + + K ME+ Sbjct: 1 MAMIRTLFRHLDRFRRDHRGSVLVEMTLITPLMLILSAGVFEFGNLIHDKLLMEAGL--- 57 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ +R R S D + N + Sbjct: 58 ---------TDGARYAARCNSQLYTDAGLAGIDCTGYAANIAVFGKPIVTQADVPRVSGW 108 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + + T N + + T+ ++ + I Sbjct: 109 QTSGATSFATVTMANSCQDAVSAGVTLYRSTTAQVCVVRAVGSYPYTGVGMLSFIGIGP 167 >gi|260642112|ref|ZP_05414581.2| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] gi|260623548|gb|EEX46419.1| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] Length = 615 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 37/355 (10%), Positives = 89/355 (25%), Gaps = 64/355 (18%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-- 107 K + A+ A + + N K F + S S Sbjct: 117 KTKAMTGAYVAVCPTAMYDMDTRMNTEEYDRIQENGFKSVADTPLSTFSIDVDPASYSNM 176 Query: 108 ---GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL 164 + + + Sbjct: 177 RRFINRGELPPADAIRTEELVNYFSYDYPKPTGNDPVKITVEAGTCTWNTA--------- 227 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 ++V I + P + ++D+SGSM Sbjct: 228 ------HRLVRIGLKAKEIPTEQLPASNLVFLIDISGSMW-------------------G 262 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ +K++L L ++++ + + ++ Y +E + G +K ++ Sbjct: 263 ANRLDLVKSSLKLLVNNL------RNKDKVAIVTYAGSAGVKLEATSGGDK-QKIREAID 315 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + AYQI + II +DG+ N Sbjct: 316 ELTAGGSTAGGAGIHLAYQIAKKNFISDGNN----------------RIILCSDGDFNVG 359 Query: 345 KSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADS 397 S+ ++ +K +++ + + + + ++ H + N Sbjct: 360 VSSAEGLEQLIEKERKSGVHLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNLQE 414 >gi|208780563|ref|ZP_03247902.1| TPR domain protein [Francisella novicida FTG] gi|208743538|gb|EDZ89843.1| TPR domain protein [Francisella novicida FTG] Length = 332 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 57/190 (30%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD + I + + IK + I + + + Sbjct: 194 -IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 28/238 (11%), Positives = 55/238 (23%), Gaps = 22/238 (9%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSG---SMHCAMNSDPEDVNSAPICQ 221 N +F P+ +G + + A + Sbjct: 1474 NNNYIQNPRSAFEPSQCNGNLGNDVCRCCCAGHLCNFNEKPCQAPQACHIRKADVAIVVD 1533 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 K + + V L+ Y RV++ + K Sbjct: 1534 SSSSVKYNNFQKMKSFTSSLFQSFKLGDDRVRAALVRYNRRVDQRFGFADTNTKEEVQDG 1593 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 D T + A++ + QK +I +TDG + Sbjct: 1594 VDAMPYRGGGTLTGQALQH-----------VIDNTLKTENGARDD--AQKIVITITDGRS 1640 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + + + I I A + + P ++V D L Sbjct: 1641 QDDVLTPS-----RALRAAGVITFGIGIGARV-LETQINEIGGDPSRTFSVETFDQLS 1692 >gi|254373669|ref|ZP_04989153.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] gi|151571391|gb|EDN37045.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] Length = 332 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 57/190 (30%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD + I + + IK + I + + + Sbjct: 194 -IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|118496822|ref|YP_897872.1| hypothetical protein FTN_0208 [Francisella tularensis subsp. novicida U112] gi|194324497|ref|ZP_03058269.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|254372186|ref|ZP_04987678.1| TPR domain protein [Francisella tularensis subsp. novicida GA99-3549] gi|118422728|gb|ABK89118.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella novicida U112] gi|151569916|gb|EDN35570.1| TPR domain protein [Francisella novicida GA99-3549] gi|194321332|gb|EDX18818.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 332 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 57/190 (30%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD + I + + IK + I + + + Sbjct: 194 -IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|242034177|ref|XP_002464483.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] gi|241918337|gb|EER91481.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] Length = 222 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 12/72 (16%), Positives = 23/72 (31%), Gaps = 8/72 (11%) Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNV 392 + ++DG+ N+ + K + + T A + L +S V Sbjct: 1 MLMSDGQQNHGGN-------AADVKISNAPVYTFGFGADYDPTVLTAVVGNSMGGTFSVV 53 Query: 393 VNADSLIHVFQN 404 + D L F Sbjct: 54 NDVDKLTMAFSQ 65 >gi|270014558|gb|EFA11006.1| hypothetical protein TcasGA2_TC004591 [Tribolium castaneum] Length = 1235 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 50/175 (28%), Gaps = 5/175 (2%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLI 288 + LD+ ++ Y T T + Sbjct: 252 IASLTVNTILDTFSNNDYINILYYSNQTTNYTIPCFRNLLVQATPENIVLFKEAIRHLGP 311 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A++ A+ IL + ++ N + I+ +TDG + NF V Sbjct: 312 SGKTDFPQALQMAFDILENYREIRGCNNEEIDEEGKSKAC-NQAIMLITDGISRNFSDIV 370 Query: 349 NTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 D K ++I T I + + C + ++ V + + Sbjct: 371 MRNNQLDGGKTIPVRIFTYLIGKEVTNVEEIRWMACA-NRGFYTQVQTLEQVTSA 424 >gi|189233997|ref|XP_971945.2| PREDICTED: similar to AGAP009579-PA [Tribolium castaneum] Length = 1056 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 50/175 (28%), Gaps = 5/175 (2%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLI 288 + LD+ ++ Y T T + Sbjct: 252 IASLTVNTILDTFSNNDYINILYYSNQTTNYTIPCFRNLLVQATPENIVLFKEAIRHLGP 311 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A++ A+ IL + ++ N + I+ +TDG + NF V Sbjct: 312 SGKTDFPQALQMAFDILENYREIRGCNNEEIDEEGKSKAC-NQAIMLITDGISRNFSDIV 370 Query: 349 NTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 D K ++I T I + + C + ++ V + + Sbjct: 371 MRNNQLDGGKTIPVRIFTYLIGKEVTNVEEIRWMACA-NRGFYTQVQTLEQVTSA 424 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 55/191 (28%), Gaps = 22/191 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM A + A +D +D + I + + P+ + + Sbjct: 324 KMVAARRAAGRIVDMLDAGDRFCVLAFDDRIETPPAMPDGLVPASDRNRF-AASSWLGSL 382 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A ++L + Q ++ +TDG+ + Sbjct: 383 RSRGGTVMAQPLTNAVEMLADSGEDR-----------------QASVVLVTDGQISG--E 423 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + +I + ++ + N L + V + D L V ++ Sbjct: 424 DHLLRSLAPVVGR--TRIYCVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARLA 481 Query: 407 QLMVHRKYSVI 417 + + + I Sbjct: 482 RTIGRPALTSI 492 >gi|115358179|ref|YP_775317.1| hypothetical protein Bamb_3429 [Burkholderia ambifaria AMMD] gi|115283467|gb|ABI88983.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 346 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 21/43 (48%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 ASE+ + ++ F + ++ L I + +N +++A + Sbjct: 13 ASERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAAD 55 >gi|321460551|gb|EFX71592.1| hypothetical protein DAPPUDRAFT_326968 [Daphnia pulex] Length = 950 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 65/189 (34%), Gaps = 34/189 (17%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKV 276 K+ ++ L ++ ++ + +G++ +++ E + EK Sbjct: 301 VSGSMKEFNRIGKLGESVRAWIKT-----DFPSGNQLGMVQFSSNAEILSDLRMIADEKS 355 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ + + + T ++ A Q+L I+ + Sbjct: 356 REEMMAKVPKEVFVATCIGCGLQLAMQMLKDGG----------------------IIVLV 393 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG+N+ +++ +K + I+++TI+ + + + Y + + D Sbjct: 394 TDGKNSPGYHDISDVK--KDIVDAKIRVITIAYGSEADKNVEH-LADVTGGKSYFIKDDD 450 Query: 397 ---SLIHVF 402 +L F Sbjct: 451 SSEALQQAF 459 >gi|114705831|ref|ZP_01438734.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] gi|114538677|gb|EAU41798.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] Length = 548 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 17/161 (10%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 7 FRFYFKKGIA-SEKANFSIIFALSVMSFL-LLIGFLIYVLDWHYKKNSMESAN------- 57 I+ S++ N ++ FAL + + GF + + ++ S+++A Sbjct: 1 MIRSLSAFISGSDRGNIAVSFALLL-PLILGFGGFATDLGLAYLERRSLQAATDAAALAA 59 Query: 58 -----NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 +A +A S+ S R + + + + + + Sbjct: 60 IRHPHDAEAIAAKVFAASSQKDATFDVVFGSYEPDRNRDERFRPEGSSAVSV-VRVEARK 118 Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 ++ + ++ + + ++ A+ L++ + + Sbjct: 119 PHHYTLARVLGVNGVELSASADASLNAQVVFQAGSRLMKVD 159 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 3 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 51 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 52 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 108 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 109 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 161 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 162 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 213 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 764 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 812 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 813 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 869 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 870 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 922 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 923 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 974 >gi|170703647|ref|ZP_02894384.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170131446|gb|EDT00037.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 346 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 42/127 (33%), Gaps = 4/127 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ASE+ + ++ F + ++ L I + +N +++A +A A + Sbjct: 13 ASERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAADA----AALAAADAMMSP 68 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S A A A+ + + + + T + + I+ Sbjct: 69 TGGPLNWSQAAPAANGVIAQNSAAGAALATGTVTTGYWNVTRNPATMQPTTITPGAYDVP 128 Query: 136 RLDSSNN 142 + + + Sbjct: 129 AVQVTVS 135 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|153006807|ref|YP_001381132.1| TadE family protein [Anaeromyxobacter sp. Fw109-5] gi|152030380|gb|ABS28148.1| TadE family protein [Anaeromyxobacter sp. Fw109-5] Length = 134 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 38/120 (31%), Gaps = 3/120 (2%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + ++ FAL + +LL I +++ + +A AG+ + Sbjct: 7 MVRHARGAAAVEFALVLPVLMLLCLGAIEWGFHFFQREIIVNAAREGARAGSIADADAET 66 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 DR + A +D + + + S + + + + + M Sbjct: 67 VAEDRALAYLGV---AGLDGTTCDVAPTLTNTGSPPTPAIRVSIDCTGGSFTLLFKDFMP 123 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 247 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 295 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 296 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 352 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 353 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 405 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 406 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 457 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ ++ G Sbjct: 262 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKVASYGVKPRYG 310 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 311 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 367 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 368 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 420 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 421 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 472 >gi|300727144|ref|ZP_07060563.1| BatA protein [Prevotella bryantii B14] gi|299775688|gb|EFI72279.1| BatA protein [Prevotella bryantii B14] Length = 134 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + Q L + ++ Y N L ++++I + + K SV Sbjct: 54 VDIDIQTLKEIAATTEGNFYRATNTAQLKSIYKDIDK-LEKTKMSV 98 >gi|297716667|ref|XP_002834627.1| PREDICTED: collagen alpha-2(VI) chain-like, partial [Pongo abelii] Length = 279 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 58/190 (30%), Gaps = 29/190 (15%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLS 246 + + L + + V + T KN ++ ++ + Sbjct: 52 VSPVPLNDPATPPDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDP 111 Query: 247 HVKEDVYMGLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +G++ Y+ + I+ ++ V I T + A+K A Sbjct: 112 KSETGTRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFA 169 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 Y L + +R F + +TDG ++ ++N +C + Sbjct: 170 YDRLIKESRRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRD 210 Query: 362 IKIVTISINA 371 + + I I Sbjct: 211 VTVTAIGIGD 220 >gi|298249212|ref|ZP_06973016.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297547216|gb|EFH81083.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 420 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + ++ + P+ G E+ + + + T + A+ +++ Sbjct: 72 SMQFMVVTFNDNARIIFGPAAGIEENKNRAIAAIQTVYAASGTRMSTALNTIVDKFGNNQ 131 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 R+ I+FLTDG N + V + + I I + Sbjct: 132 SRAT------------------RILFLTDG-KNEGEPRVALDRAVARCSAANISISAWGV 172 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + LL ++ + + + F + Sbjct: 173 GTDWDAAELLHMAEATRGSADIIPTPNQVEAAFSS 207 >gi|90577458|ref|ZP_01233269.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] gi|90440544|gb|EAS65724.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] Length = 1679 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 39/379 (10%), Positives = 89/379 (23%), Gaps = 61/379 (16%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + D + S+ + + Sbjct: 1065 NGQEYHGEAKDGHFEIDVQGKDLLDDPDRTVDVSVVTSDGANHTASASASHQYQVDGVVI 1124 Query: 132 MANN-----------RLDSSNNTIFYNMDVMTSYDYRLQFIEHL-----------LNQRY 169 N L + T + +Y + + Sbjct: 1125 QGGNGDDVITGGNGSDLLIGDLTPPQADQQPVNINYVMDISGSMYFGRLLTLDAVKGHVA 1184 Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLS-----------GSMHCAMNSDPEDVNSAP 218 + + + E V +S + + D Sbjct: 1185 KSYEIYVGRNSQLTAADGKEVSHTEGWVTVSYDQLKAGLQYDAGSYDNIQIKASDGTEYS 1244 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + + K + SI K + ++ +++ V+ N + + Sbjct: 1245 DNFNSLPSLIDMAKKSYQTLTSSIIDSVEDKSKITFNMVTFSSDVKGNTSFHYDETS-KT 1303 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 +V ++ L + F I + I FL+D Sbjct: 1304 FVNDQHQTINNY-----------IDSLVAGGGTQFEGALSDISRHITDPSMRNVIYFLSD 1352 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS----------PEY 388 G++ + + +IV+I++ S + ++ + P Y Sbjct: 1353 GKDEDKFHPQGIHFL------KGTEIVSIAVGPSADATQINQIAQMGTGYDHNNPNAPSY 1406 Query: 389 HYNVVNADSLIHVFQNISQ 407 + NA+ L VF NI Q Sbjct: 1407 SKIITNANELDGVFHNIGQ 1425 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 49/157 (31%), Gaps = 16/157 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 KMA K AL L + + V+ R + ++ ++ Sbjct: 263 NSYSMYGNKMAQTKEALGTILGELPEDDYFAIIVF-STTFVVWRPYLSKATEENVKEAQE 321 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 YV ++ T+ A ++L + ++ +I LTD Sbjct: 322 YVKTI---EVIGGTELHDATIHGVEMLYAAQRN-----------GTAPKNMVLMMILLTD 367 Query: 339 GENN-NFKSNVNTIKICDKAKENFIKIVTISINASPN 374 G+ N +S + KA + I + ++ + Sbjct: 368 GQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDAD 404 >gi|28948724|pdb|1N3Y|A Chain A, Crystal Structure Of The Alpha-X Beta2 Integrin I Domain Length = 198 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 22/167 (13%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + L+ ++ + + + + T + A++ L Sbjct: 43 RPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHA 102 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTI 367 K +I +TDG+ + + I + D A I I Sbjct: 103 SYG-------------ARRDAAKILIVITDGKKEGDSLDYKDVIPMADAA---GIIRYAI 146 Query: 368 SINA---SPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + + N + L S P + + V + D+L + + + + Sbjct: 147 GVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 193 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 56/169 (33%), Gaps = 22/169 (13%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQ 303 + V +GL Y + + ++ V + +L T++ A++ Sbjct: 266 DISSDRVRVGLAQYNDNIYPAFQL--NQHPLKSMVLEQIQNLPYRPGGTNTGSALEF--- 320 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + +I +TDGE+N+ ++ D+ KE+ + Sbjct: 321 --------IRTNYLTEESGSRAKDRVPQIVILVTDGESND-----EVQEVADRLKEDGVV 367 Query: 364 IVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQLMVH 411 + + + Q L K E +N N + L +I Q + Sbjct: 368 VYVVG-VNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCS 415 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 47/184 (25%), Gaps = 18/184 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ + E + +GL Y+ S K Sbjct: 46 QHARSVRNFLHILVNSFNVSSETIRVGLAQYSDVPHSEFLLSTYHRKGDVLRHIRQFQFK 105 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + A+K +F S + + ++ G + Sbjct: 106 PGGKNMGLALKF-----------ILDHHFQEASGSRASQGVPQIAVVISSGPAEDHVHGP 154 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQ 407 + + I + + + L++ S E + N L ++ Q + Q Sbjct: 155 -----AEALRRAGILLYVVGV-RDAVWAELMEIASSPQENFTSFLPNFSGLSNLAQKLRQ 208 Query: 408 LMVH 411 + Sbjct: 209 ELCD 212 >gi|268553179|ref|XP_002634574.1| Hypothetical protein CBG08382 [Caenorhabditis briggsae] gi|187032742|emb|CAP28216.1| hypothetical protein CBG_08382 [Caenorhabditis briggsae AF16] Length = 369 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 49/152 (32%), Gaps = 14/152 (9%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + T +V + D TD + AY L K+ + Sbjct: 223 WSATDIPNNANVTAEVLTMLDYVPDVTADNSTDLECLFRYAYDGLGDSKEFA-------- 274 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFIKIVTISINASPNGQRLL 379 +K +I N+++ + + K + E K++ + + A + L Sbjct: 275 ----ERYGIEKVVIIFVAANANDYEDYNESFEFAHKIRTEMDAKVIIVGMGAGLDQSSLS 330 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 K +S + + + D+L + I+ + Sbjct: 331 KLAYAS-GFAFFSSSYDNLSSLIPQINNAICS 361 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 60/177 (33%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + +R+ + T Sbjct: 110 LSDFPVVPTATRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLRREIPAIA--YRGGGT 167 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 168 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 206 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 + ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 207 VAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 257 >gi|319940449|ref|ZP_08014794.1| hypothetical protein HMPREF9464_00013 [Sutterella wadsworthensis 3_1_45B] gi|319806075|gb|EFW02824.1| hypothetical protein HMPREF9464_00013 [Sutterella wadsworthensis 3_1_45B] Length = 529 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 13/210 (6%), Positives = 49/210 (23%), Gaps = 25/210 (11%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L R + + + E + ++ ++ + L F++ ++ A Sbjct: 5 LCKRAQLLLARVLREESGSAAVNVVFFLIGGIALTAFVVDATRVSADGARLKQA------ 58 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + + ++ ++ A+R++ ++ S Sbjct: 59 --------TDAAAQAVAMEAAKDSETDVLGMAQRYVAVNLGLDKEQLSRDLSVAVEPVTW 110 Query: 123 N------SSRIS---MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 N S + + + + + + + Sbjct: 111 NDYEGYRVSASFRALPSLLGGPGKTVEVASAAVAIYNPLEIALVVPSTINETQRDMQAIR 170 Query: 174 VSFIPALLRIEMGERP--IFLIELVVDLSG 201 + G + L+ ++ Sbjct: 171 DIGEAFYDEVIDGRADRWMALVPYSDGVNV 200 >gi|308501381|ref|XP_003112875.1| CRE-CLEC-160 protein [Caenorhabditis remanei] gi|308265176|gb|EFP09129.1| CRE-CLEC-160 protein [Caenorhabditis remanei] Length = 632 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 63/183 (34%), Gaps = 29/183 (15%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-----RDM 284 +K + + + L ++++ V +GLI Y+ E +PS ++ R Sbjct: 301 MVKAEINTLVGQMSLDPNIQKHVQVGLIKYSNEAEIVFKPSDYDDEDEFTEDLWSDPRLQ 360 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D + + +++A +++ S ++ + N Sbjct: 361 DVDKVDEVNLHLGLQKAAKMIGSMRRGVRKVVVVYA-------------------ASYND 401 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT--CVSSPEYHYNVVN---ADSLI 399 + N + +I +E+ +I+T++ + ++K S + + + + Sbjct: 402 EGNDDARQIAANIRESGYEIITVAFVEPESSNLVMKIGEIASPRMNFTSFRDDLLVEEME 461 Query: 400 HVF 402 F Sbjct: 462 DAF 464 >gi|270007556|gb|EFA04004.1| hypothetical protein TcasGA2_TC014153 [Tribolium castaneum] Length = 813 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 5/86 (5%) Query: 327 LPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 +Q IIFLTD S I K N I I ++S + + + + Sbjct: 423 NRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNRIPIFSLSFGEDVDKNFMRQLAAKN 482 Query: 386 ---PEYHYNVVNAD-SLIHVFQNISQ 407 + Y ++A +++ +++IS Sbjct: 483 LGFSGHIYEALDASVQILNFYRSISS 508 >gi|189237279|ref|XP_973594.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 5/86 (5%) Query: 327 LPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 +Q IIFLTD S I K N I I ++S + + + + Sbjct: 423 NRYQPMIIFLTDSYPTVGMTSQNEIINTVTKVNNNRIPIFSLSFGEDVDKNFMRQLAAKN 482 Query: 386 ---PEYHYNVVNAD-SLIHVFQNISQ 407 + Y ++A +++ +++IS Sbjct: 483 LGFSGHIYEALDASVQILNFYRSISS 508 >gi|169627579|ref|YP_001701228.1| hypothetical protein MAB_0475 [Mycobacterium abscessus ATCC 19977] gi|169239546|emb|CAM60574.1| Hypothetical protein MAB_0475 [Mycobacterium abscessus] Length = 129 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 10/68 (14%), Positives = 26/68 (38%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ +II + L L V ++ ++A + + LAGA + ++ Sbjct: 14 RADGGAATIIAVAMIAVLLALTAGGAAVGSAVVARHRAQAAADLSALAGAQHALYGVTPA 73 Query: 76 GDRFESIS 83 +++ Sbjct: 74 CAEAGAVA 81 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 40/119 (33%), Gaps = 15/119 (12%) Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + K+ +K +I LTDGE+ + I D K N I + Sbjct: 204 DPKGTVLYDAVAQAANDKLREQTGRKALILLTDGEDL-GSATKPLQAIADAQKANTI-VY 261 Query: 366 TI-----------SINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 I + + + Q + + ++ NV N L F+ I++ + + Sbjct: 262 VILIADRGFYGGYTFGYTGDAQ-MRRLAEATGGRMINVGNNGAKLTAAFKEIARELRTQ 319 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 52/168 (30%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + + ++ + + + Sbjct: 241 KTSVYEMLDTLSDDDYVNVASFNEKAKPVSCFKHLVQ--ANIRNKKVFKEDVQGMVAKGT 298 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD + A+ L + K I+ TDG + + Sbjct: 299 TDYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 344 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 K +K +++ T S+ L+ ++ Y++ + + ++ Sbjct: 345 KWPNKT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|61806576|ref|NP_001013521.1| hypothetical protein LOC541376 [Danio rerio] gi|60649627|gb|AAH90437.1| Zgc:113325 [Danio rerio] Length = 450 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 51/168 (30%), Gaps = 19/168 (11%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K +++ LD++ +V Y + + + M+ Sbjct: 240 KTSVIEMLDTLSDDDYV-NVARFNEKAYAVVPCFTTLVQANIKNKKIFKEAVMNMQAKGT 298 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD + A+ L +D P K I+ TDG + + Sbjct: 299 TDYKTGFQFAFDQLLNDTSA-------------PRANCNKMIMMFTDGGEDRAQDIFEKY 345 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 346 NWPNRT----VRVFTFSVGQHNYDVTPLQWIACFNKGYYFEIPSIGAI 389 >gi|283779853|ref|YP_003370608.1| TadE family protein [Pirellula staleyi DSM 6068] gi|283438306|gb|ADB16748.1| TadE family protein [Pirellula staleyi DSM 6068] Length = 156 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 43/130 (33%), Gaps = 4/130 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + +A ++ FA+ F LL+ +I + A GA V + Sbjct: 23 RSYRRNRRAAATVEFAVVAPIFFLLVLGMIEYGRMVMV----QQVITNASREGARTAVLD 78 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + +++++ I A + + + G V + S S + Sbjct: 79 GTTTAEVTATVNSYLASGSISGATVVVTPNPPTNADGGDPVTVTVSVPFSQVSWLPSPMY 138 Query: 132 MANNRLDSSN 141 + L S+ Sbjct: 139 LGGKTLSSTT 148 >gi|172062967|ref|YP_001810618.1| hypothetical protein BamMC406_3936 [Burkholderia ambifaria MC40-6] gi|171995484|gb|ACB66402.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 346 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 21/43 (48%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 ASE+ + ++ F + ++ L I + +N +++A + Sbjct: 13 ASERGSVALFFLMFLIPLLSFGALAIDIAWVATVRNQLQNAAD 55 >gi|329851852|ref|ZP_08266533.1| hypothetical protein ABI_46220 [Asticcacaulis biprosthecum C19] gi|328839701|gb|EGF89274.1| hypothetical protein ABI_46220 [Asticcacaulis biprosthecum C19] Length = 172 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 42/150 (28%), Gaps = 2/150 (1%) Query: 4 LSRF--RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 LSR + + + + FAL V L ++ + + K+ +M S+ ++A Sbjct: 5 LSRLNVKTFLSRFARERHGAAIVEFALIVPVLLGVLAAVDDLSAVTMKEKAMRSSVSSAA 64 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 S + + A + K + + S + Sbjct: 65 QYVMRGGSDLSVARQIVLSSWTPETEDATVSAVKSCYCGSVAATCSNVCTDASIPQAYVR 124 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 V+ + + + + V Sbjct: 125 VSIVQPYDGWYTDTSIVAQQEVRVRWGGVS 154 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 63/171 (36%), Gaps = 26/171 (15%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ +GL+ Y +RV+ + + Q +++ T A+ Y Sbjct: 564 FNINRDVTQIGLVVYGSRVQTTFALDTHPTSSSLLQAISQAPYMDGAGST--GSALLHVY 621 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +K K ++ +T+G K + N I Sbjct: 622 EEVMTVQKG-------------ARPGVSKAVVVITEGTGTEDAVVP-----AQKLRNNGI 663 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS-QLMVHR 412 ++ I++ + LL+ SP++ +V + + L + +Q++ + + Sbjct: 664 SVLVIAVG-PVLKETLLRL-AGSPDFLIHVASYEDLEN-YQDLFIERICEE 711 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 25/228 (10%), Positives = 68/228 (29%), Gaps = 34/228 (14%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 V + + + I I L D + + +R+K A+K Sbjct: 17 KVLPSLTVQELHANQETIVKISSASQLMQCSAALDVLFALDGSHSIGKGSFERSKYFAIK 76 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILK 290 D L+ + V +G++ +++ + +V++ + + + Sbjct: 77 --------MCDALAIYPDRVRVGVLQFSSVPQLEFPLDSFFTREEVKEKIKKIV--FKGG 126 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T++ A+K + + + +I +TDG++ Sbjct: 127 STETGLALKYLLHKGFPGGR---------------NSSVPQLLIIVTDGKSQGNIDLP-- 169 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH-YNVVNADS 397 ++ KE + + + + + L + + + D Sbjct: 170 ---ANQLKERGVMVFAVGVRFPRWTE-LHTLASEPKDQYVLFAEHVDD 213 >gi|297666856|ref|XP_002811720.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Pongo abelii] Length = 968 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 381 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDSLEQAQSWIRD 437 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 438 MKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 470 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 471 -DQPPETVIDQVKLFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 518 >gi|162451937|ref|YP_001614304.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] gi|161162519|emb|CAN93824.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] Length = 641 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 23/241 (9%), Positives = 61/241 (25%), Gaps = 22/241 (9%) Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + + +D S SM ++ + + ++ +A L Sbjct: 195 TATASTSPFRPVTLFVALDRSTSMRQTLSGVDVSADRWIPATNALKSFFQDPASAGLGVA 254 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + +T ++ + + Sbjct: 255 FRFWPHDNPGPCSDTPSCPPAGGGGCALPLVPFNGAAADRLTAASAAMGDQQ---EIKLM 311 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF-------IIFLTDGENNNFKSNVNTIK 352 QA+ G + ++ ++FLTDG ++ N Sbjct: 312 QAFINF-PTPNGETPMYPALDGATTWANNYKDAHPEEEVAVVFLTDGVPSSC--NTVPNA 368 Query: 353 ICDKAKE----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI----HVFQN 404 I + A+ + +++ + + N + + YN+ +L + Sbjct: 369 IGNLARNAFVNHGVRVHAVGFG-NSNAELINLIADQGGGRAYNLSAGSTLEGSVLDALVS 427 Query: 405 I 405 I Sbjct: 428 I 428 >gi|281358203|ref|ZP_06244686.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281315293|gb|EFA99323.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 232 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 65/209 (31%), Gaps = 38/209 (18%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + R + +VD+SGSM + + L+ A+ + S + Sbjct: 54 VEVRAEKRVVFLVDVSGSMGAVTPEGGS--------RLDVMKR--ELRRAVGSAVASANR 103 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM----DSLILKPTDSTPAMKQ 300 + KE + +++ ++ + + R V R T+ M Sbjct: 104 IGAPKEAGNFRVWAFSSGLQLFPDLEPCGFRDRSAVERLNRFVGALGAGGSTN----MLM 159 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A++ + + + FL+DG+ ++ + T + K+ Sbjct: 160 AWRKILE----------------LTKHGQLDTVYFLSDGDPSDCSAEELTRLLTRLPKD- 202 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYH 389 + + +I + L + + + Sbjct: 203 -VTVHCFAIGLDSS--LLREIAAAHNGNY 228 >gi|90021388|ref|YP_527215.1| TPR domain-containing protein [Saccharophagus degradans 2-40] gi|89950988|gb|ABD81003.1| TPR repeat [Saccharophagus degradans 2-40] Length = 658 Score = 43.7 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 69/208 (33%), Gaps = 25/208 (12%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D++ + + +D K ++ A K +D ++ GLI + + Sbjct: 100 WDLSPSMLAEDLKPSRAARAK---YKLIDLFAK----RQTGLSGLIAFAGEAHIVTPLTD 152 Query: 272 GTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 V+ + D + ++ ++ A++ A ++L F G+ + Sbjct: 153 DARTVKNLLNGLSPDMMPVQGSNPEMALELAVKLLKEGGVARGDIIFVTDGIDPAAFNTL 212 Query: 331 KFIIFLT------DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + +I T G + + + AK +I I N L + +S Sbjct: 213 EHLIKPTQHRVTVWGFGSEQGAPIPLSGNQGFAKTRSGEI----IMTKRNDNELSEAAIS 268 Query: 385 SPEYH--YNVVNADSLIHVFQNISQLMV 410 + ++ NAD NI+ + Sbjct: 269 MGGIYIPFSSTNADV-----DNIAASLS 291 >gi|254255256|ref|ZP_04948572.1| Flp pilus assembly protein TadG [Burkholderia dolosa AUO158] gi|124900993|gb|EAY71743.1| Flp pilus assembly protein TadG [Burkholderia dolosa AUO158] Length = 177 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 10/68 (14%), Positives = 26/68 (38%), Gaps = 1/68 (1%) Query: 14 GIASE-KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 S + ++ FAL +M ++L+ + Y+ ++ A A + + S+ Sbjct: 6 FRRSRTRGVVAVEFALVLMPMIVLVTGVAEFGRAIYQYETLTKATRDAARYLSVYLPSDP 65 Query: 73 SRLGDRFE 80 + + Sbjct: 66 AYPTAAAQ 73 >gi|123233471|emb|CAM28080.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 637 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 49/138 (35%), Gaps = 23/138 (16%) Query: 277 RQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + T + A+ +F + +K + Sbjct: 1 KSLLQAVANLPYKGGNTLTGMAL-------------NFIRQQNFRTQAGMRPRARKIGVL 47 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVV 393 +TDG++ + K K+ +++ I I + + LK + P+ + YNV Sbjct: 48 ITDGKSQDD-----VEAPSKKLKDEGVELFAIGIKNADEVE--LKMIATDPDDTHAYNVA 100 Query: 394 NADSLIHVFQNISQLMVH 411 + +SL + +++ + + Sbjct: 101 DFESLSRIVDDLTINLCN 118 >gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS155] gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS155] Length = 771 Score = 43.7 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 36/248 (14%), Positives = 70/248 (28%), Gaps = 44/248 (17%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 N +V +P + + LV+D SGSM Sbjct: 367 NEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMA--------------------GD 406 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + KNALL L + + + + + + Sbjct: 407 SIVQAKNALLYALKGLKPEDSFNIIEFNSSLS----QFSATPLPATSSNLSRARQFVSRL 462 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+ A + +IF+TDG N ++ Sbjct: 463 QADGGTEMALALDAALPKSLGSAPSDAVQPL-------------RQVIFMTDGSVGNEQA 509 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I+ + E+ ++ T+ I ++PN + + + D + IS Sbjct: 510 LFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVDEVGE---KIS 562 Query: 407 QLMVHRKY 414 L+ +Y Sbjct: 563 ALLSKIQY 570 >gi|32476038|ref|NP_869032.1| hypothetical protein RB9710 [Rhodopirellula baltica SH 1] gi|32446582|emb|CAD76417.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 1631 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 33/313 (10%), Positives = 88/313 (28%), Gaps = 31/313 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + D +++S HA A + + + SL+ + + + + Sbjct: 989 AVMSIHRDGIKTSDAQQAVSIHATDAPTNVELTLAVDQKRPSLAEVNFRGNRYQSSVMSS 1048 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + +T D + TI MD + + + + + + S + Sbjct: 1049 GTAFGVTSRGQTVQDGAEITIRDAMDAGRAITFVMDCSASMNDPLGEEMGRSAL------ 1102 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-- 241 G + E M + P V K++ + Sbjct: 1103 --GAQRASKFEAARSAVYEMMRRLQPGPSQVGLVLYGHRMAIRAGDPAKDSGDGSGQTTL 1160 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + H + + + + T ++ ++ T ++ +A Sbjct: 1161 LQKRYHKRFPFPPTIQPFEDVEVALPTGRFDTAELELARQHFDAAVPWGQTPLYLSIWKA 1220 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKA 357 + ++ +K ++ ++DG N F I ++ +A Sbjct: 1221 MEDIS-----------------RTGDGVRKDVVVISDGRNYQFNPTPEAIFSIGQLVTQA 1263 Query: 358 KENFIKIVTISIN 370 K +++ I Sbjct: 1264 KTLGVQVHVIGYG 1276 >gi|301606955|ref|XP_002933090.1| PREDICTED: integrin alpha-1-like [Xenopus (Silurana) tropicalis] Length = 1147 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 64/170 (37%), Gaps = 27/170 (15%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRS 312 +G++ Y V + + T + + A + ++ + Sbjct: 175 VGIVQYGENVTHEFNLNTYDSTAKVRQASLGIKQRGGSQTMTALGIDTARKEAFTEARG- 233 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +K ++ +TDGE+++ + IK C+ + I+ +I+I S Sbjct: 234 ------------ARKGVKKVMVVVTDGESHDNYRLNSVIKECET---DDIQRFSIAILGS 278 Query: 373 PNGQRL--------LKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 N L +K+ S P ++ +NV + +L+++ +++ + + Sbjct: 279 YNRGNLSTETLVEEIKSIASEPKEKHFFNVSDELALLNIVESLGERIFAL 328 >gi|315497471|ref|YP_004086275.1| tade family protein [Asticcacaulis excentricus CB 48] gi|315415483|gb|ADU12124.1| TadE family protein [Asticcacaulis excentricus CB 48] Length = 187 Score = 43.7 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 51/174 (29%), Gaps = 13/174 (7%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++ FAL F LI I + + S++ LA A + Sbjct: 25 LWRDRRGATAVEFALIAFPFFGLIMGCIELAIVLFAGVSLD-------LATAKVSRELRT 77 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + + + A +I + K L F N ++ + + ++ Sbjct: 78 GTSGKATTSAIFITKVCNEMA--WIGSDCKSKLRVDVRTFTNFQMVSQAPDVIVDGKFVS 135 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + ++ + Y F L + + + + + E Sbjct: 136 M-QYTVGGSSQIQLVRA---YYPWPVFSPFLKPGLGSLSSGETVLSSIIVFKNE 185 >gi|281207766|gb|EFA81946.1| hypothetical protein PPL_05180 [Polysphondylium pallidum PN500] Length = 1990 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 29/237 (12%), Positives = 65/237 (27%), Gaps = 32/237 (13%) Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + + + VVD +GSM + +++ + + ++ A++ + D Sbjct: 1655 NVKDIRKLDLVFVVDDTGSMGSEIAKVKQEIQNIVDDIVSIGSI--EVRVAMVFYNDHTP 1712 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 H K V K + + ++R+ + + + D AM + Sbjct: 1713 NSDHSK------------SVCKVFKFTSDIPELRRGLDSVV---VHGGADHPEAMADGFY 1757 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +T T P F G + ++ + Sbjct: 1758 EVTKLDFAKSSTKVCIVIGDAPPHGFSGSGDSFPQGCPCGHDLIASVRQLVQG----GVT 1813 Query: 364 IVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHV----------FQNISQLM 409 T+ S + L S + NA L + I++ + Sbjct: 1814 FYTVMCRGDSQTYETLNAIADLSEGRFVLLNNASELTEIITGSAKASILLDTIAEEV 1870 >gi|146284567|ref|YP_001165520.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320700|gb|ABP62846.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 401 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 27/266 (10%), Positives = 73/266 (27%), Gaps = 68/266 (25%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + + P LR M +RP+ Sbjct: 198 QQAPEMVLIFDASGSM------SISMDITPDELRRLMQDRPVKNF--------------- 236 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 D++ +++ + +D + +D+ + L+ T + ++ Sbjct: 237 -------------DREPRRISLAHRSAKQLIDEV------PKDMDISLVSAATCQQVSVT 277 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 P+ + + + + T A+++A +++ + + Sbjct: 278 PALSFAQRDELKYAIDNIQPVGKTALAEALEKAGKLVDGVDRDAI--------------- 322 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENF--IKIVTISINASPNGQRLLKTCVSSP 386 I+ +TDGE + + K+ +++ + I + G + Sbjct: 323 ----IVLITDGEETCGGDPCV---VAQQLKQQKPRLQVNVVDIMNTGAGNC---IASQTG 372 Query: 387 EYHYNVVNADSLIHVFQN-ISQLMVH 411 Y V N + I + + Sbjct: 373 GSVYAVNNTHEFNEMMNQAIKEYIPE 398 >gi|220918102|ref|YP_002493406.1| TadE family protein [Anaeromyxobacter dehalogenans 2CP-1] gi|219955956|gb|ACL66340.1| TadE family protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 140 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 29/79 (36%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + SE+ ++ FAL + L ++ I + + + + ++ AG + S Sbjct: 1 MRRARSERGAAAVEFALVLPLLLTIVFGTIEWGYYFFNRQVVINSAREGARAGTLQYASG 60 Query: 72 LSRLGDRFESISNHAKRAL 90 S + N+ A Sbjct: 61 TSASAVAVTTAENYLTSAG 79 >gi|116207766|ref|XP_001229692.1| hypothetical protein CHGG_03176 [Chaetomium globosum CBS 148.51] gi|88183773|gb|EAQ91241.1| hypothetical protein CHGG_03176 [Chaetomium globosum CBS 148.51] Length = 1079 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 60/197 (30%), Gaps = 28/197 (14%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +++AL + ++ MGL+ + + T K + S Sbjct: 543 KINLVRDALRFMVQALGERD------RMGLVTFGSSGGGV-PIVGMTTKAWSGWGNVLSS 595 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + S A + + G K + I+ ++D ++ S Sbjct: 596 IKPVGQKSHRA---------DVVEGANVAMDLLMGRKFNNPIAT--IMLISDASTSDTDS 644 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH------ 400 + + AK I I + + + +++ + + V + L Sbjct: 645 VDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKDWMMLRECLAGCL 701 Query: 401 -VFQNISQLMVHRKYSV 416 Q++S V K + Sbjct: 702 GSMQSLSHQNVKLKLRL 718 >gi|323137419|ref|ZP_08072497.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322397406|gb|EFX99929.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 228 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 20/218 (9%), Positives = 51/218 (23%), Gaps = 11/218 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ ++ FAL + L++ L+ + ++ A LA Sbjct: 10 LAQDDRGIAAVEFALVLPLMLMIYLGLVELSRGMRAAQKLD--LVAHTLA--------DL 59 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D +A T + + S + Sbjct: 60 TAQQLTGGSNTGQAGLTEADITAVFSAATTIMSPLPTANLKMTISEVAITSPSAGVWQAK 119 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + N+ ++T+ D + Y P + + + + Sbjct: 120 TTWTVTRNSATARPCQILTAQDATPVSFTSMP-TSYTTVTNGVNPTVGPVIVADVVYNYS 178 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 V + + AP+ + + Sbjct: 179 PGVHFEIFKWGSPPTWTMQRTSYAPVRNTYVPSHIQYF 216 >gi|317123684|ref|YP_004097796.1| hypothetical protein Intca_0524 [Intrasporangium calvum DSM 43043] gi|315587772|gb|ADU47069.1| hypothetical protein Intca_0524 [Intrasporangium calvum DSM 43043] Length = 152 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 9/77 (11%), Positives = 27/77 (35%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 E+ +++ +++ L++ + + + A + A+LAGA+ + + Sbjct: 39 ERGAATVLVIGVLVALLVVTTGGLLIASAVVASHRARLAADLAVLAGATALRDASTVERA 98 Query: 78 RFESISNHAKRALIDDA 94 + A Sbjct: 99 CAAARRVATLNGAELGA 115 >gi|271963054|ref|YP_003337250.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] gi|270506229|gb|ACZ84507.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] Length = 594 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 59/221 (26%), Gaps = 31/221 (14%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 R + +VVD SGSM E +++ K A + L Sbjct: 404 RKRANVLIVVDKSGSMEEEAAGTGE-------------SRLELAKKAAINALPQFRGDDK 450 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + + + ++ T A + + Sbjct: 451 VGLWAFSTRQDGDRDYRELVPIDSVSKIGPALRDELDGLTAGGGTGLYDTTLAAVERM-- 508 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++FLTDG N + + K +++ TI Sbjct: 509 --------------RGARDAGAINAVVFLTDG-KNEKTGGSDLDNLLGKL-NPDVRLFTI 552 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + L + ++ Y+ AD++ VF ++ Sbjct: 553 GYGEGADQGVLKRIAEATDGAAYDSSRADTIDQVFTSVISN 593 >gi|302632554|ref|NP_001181863.1| integrin alpha-M [Pan troglodytes] Length = 1153 Score = 43.7 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 93/339 (27%), Gaps = 41/339 (12%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N + + + S+ + Sbjct: 18 NLDTENAMTFQENARGFGQTVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 77 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS------- 200 + V + L ++++ P + + + + + + Sbjct: 78 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 136 Query: 201 -----GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + ++ + +M + ++ L L + + Sbjct: 137 FPEALRGCPQEDSDIAFLIDGSGSINPYDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEF 196 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I +T + +N R V L T + +++ + L + + Sbjct: 197 RIHFTFKEFQNNP------NPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKN 248 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---S 372 F K ++ +TDGE + + +A + I + S Sbjct: 249 AF-------------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFHS 293 Query: 373 PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 ++ L T S P ++ + V N ++L + + + + Sbjct: 294 EKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|33340728|gb|AAQ14925.1| Mac-1 alpha subunit [Pan troglodytes] Length = 1144 Score = 43.7 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 93/339 (27%), Gaps = 41/339 (12%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 ++A F +N + + + S+ + Sbjct: 9 NLDTENAMTFQENARGFGQTVVQLQGSRVVVGAPQEIVAANQRGSLYQCDYSTGSCEPIR 68 Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLS------- 200 + V + L ++++ P + + + + + + Sbjct: 69 LQVPVE-AVNMSLGLSLAATTSPPQLLACGPTVHQTCSENTYVKGLCFLFGSNLRQQPQK 127 Query: 201 -----GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 + ++ + +M + ++ L L + + Sbjct: 128 FPEALRGCPQEDSDIAFLIDGSGSINPYDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEF 187 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I +T + +N R V L T + +++ + L + + Sbjct: 188 RIHFTFKEFQNNP------NPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKN 239 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---S 372 F K ++ +TDGE + + +A + I + S Sbjct: 240 AF-------------KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFHS 284 Query: 373 PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 ++ L T S P ++ + V N ++L + + + + Sbjct: 285 EKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 323 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 43.7 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 78/307 (25%), Gaps = 48/307 (15%) Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + + + + ++ D D T+ L + Sbjct: 16 AKDDDDGGNPFSANDEASSITTDGGATEDEAGDDGSTTTPMYFD----LPGEESQPTAEG 71 Query: 176 FIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 +E+ P + L+VD SGSM P V + T + Sbjct: 72 ATTDCANVEVDTSPTTPTVVLLVDQSGSMWDDFGGQPRWVAL-------ENTLFDPVNGV 124 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + D + + + + E + ++ L T + Sbjct: 125 VKPLEDQVRFGLALYSSMNG---SFGGECPLITEFAPSFGNHASLAATFASAMPLDDTPT 181 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--- 351 ++K + L F G K I+ TDGE + Sbjct: 182 GDSIKAVAETL---------AAFPEDG--------PKIIVLATDGEPDTCAVPDPQEGQP 224 Query: 352 ---KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY----------HYNVVNADSL 398 + A ++ I+ IS+ L + + Y +N L Sbjct: 225 LSLEATQAAFDDGIRTFVISVGNQVTDAHLQELANAGVGLPTQGAVENAPFYKTLNPAEL 284 Query: 399 IHVFQNI 405 + F+ + Sbjct: 285 VSAFEAV 291 >gi|34534804|dbj|BAC87116.1| unnamed protein product [Homo sapiens] Length = 725 Score = 43.7 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 79 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETGLALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + + KE + Sbjct: 139 HRGLPGGR---------------NASVPQILIIVTDGKSQGDVALPS-----KQLKERGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|86360430|ref|YP_472318.1| hypothetical protein RHE_PE00154 [Rhizobium etli CFN 42] gi|86284532|gb|ABC93591.1| hypothetical protein RHE_PE00154 [Rhizobium etli CFN 42] Length = 547 Score = 43.7 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 21/54 (38%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + + ++ S I ++ L+I +++ Y K + +AA L Sbjct: 4 SFINRLHRDQRGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|227114884|ref|ZP_03828540.1| hypothetical protein PcarbP_18065 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 204 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 14/73 (19%), Positives = 30/73 (41%), Gaps = 2/73 (2%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES--ANNAAI 61 SRF ++ S +A+ ++ AL+ L + + ++ ME A++ Sbjct: 8 TSRFLSGMRRFWLSRRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQRVGAIASV 67 Query: 62 LAGASKMVSNLSR 74 LA K+ + + Sbjct: 68 LAMQQKLDESGLQ 80 >gi|260786377|ref|XP_002588234.1| hypothetical protein BRAFLDRAFT_124701 [Branchiostoma floridae] gi|229273394|gb|EEN44245.1| hypothetical protein BRAFLDRAFT_124701 [Branchiostoma floridae] Length = 1044 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 53/169 (31%), Gaps = 27/169 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ++ +G++ + + N + G E R + + T +++ Sbjct: 296 TSHVYNLPIGSSLGIVTFNSEATINAPMTVIGNETTRDALVGALPMTTGGKTSIGSGLQE 355 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A +L +D II ++DG+ + + + + Sbjct: 356 ALGLLGNDLG---------------------RIILISDGQEDELPHIADVLPA---LRVA 391 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEY-HYNVVNADSLIHVFQNISQL 408 + T++I A + L + + Y+ + + + + I Sbjct: 392 GHTVHTVAIGADGDP-MLEQLSRDTGGKSFYHTRWSTNFPGILRTIEAE 439 >gi|260813733|ref|XP_002601571.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] gi|229286869|gb|EEN57583.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] Length = 161 Score = 43.3 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 46/159 (28%), Gaps = 23/159 (14%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D ++ +G+ Y+ R +K + S T + A++ Sbjct: 26 DVVNSFDVSPTATRVGVAQYSDRNSLVFNLGDHADKPSTVSAINGISYQRGGTKTGAALE 85 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 F RQ K +I LTDG KS Sbjct: 86 -----------------FVRQNAAWRGGAVPKVMIVLTDG-----KSGDAVAAPSQSLAA 123 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + I + + + L + S + + + ++L Sbjct: 124 DGVAVYAIGVG-NFDHAELQQIANSDQDKVIELTDFNAL 161 >gi|284799403|ref|ZP_06390123.1| PilC protein [Neisseria subflava NJ9703] gi|284797751|gb|EFC53098.1| PilC protein [Neisseria subflava NJ9703] Length = 1126 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 33/76 (43%), Gaps = 10/76 (13%) Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNA 395 DG+ ++ K + ++ ++ T+ + G+ L+ S P++++N Sbjct: 351 DGDPSDPKGIDYSKQL--------VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKK 402 Query: 396 DSLIHVFQNISQLMVH 411 + L+ F+ I + + Sbjct: 403 EDLLEAFKTIVDNIEN 418 >gi|123482632|ref|XP_001323846.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121906718|gb|EAY11623.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 720 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 37/317 (11%), Positives = 93/317 (29%), Gaps = 55/317 (17%) Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 I + + + E+++I ++ +M + ++ ++ Sbjct: 155 TIPNNFEFSTTIMTTKELRDIHVTANETMNIIDSHNATFVTKIPPKEDAIIIE-TLIKDE 213 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + +S P + ++D SGSM Sbjct: 214 DKSIAVSSDGYISISIYPQFEGKVEQK---SEFYFIIDCSGSM----------------- 253 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +++ K L + + S+ +I + ++ + + K +Y Sbjct: 254 ---SGSRIENAKFCLNILIHSL------PIGCRFSIIQFGNSYKEVVSICDYSNKNVKYA 304 Query: 281 TRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + TD ++ ++ K F + I LTD Sbjct: 305 MSAIARINADMGGTDILSPLEYVFK-------------------KKLGKGFIRKIFLLTD 345 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE +N K +EN +I I + + + + S + + + D++ Sbjct: 346 GEVHNSDMIC---SRVQKERENN-RIFAIGLGSGADPGLIKNISAKSGGNYVLIADDDNM 401 Query: 399 IHVFQNISQLMVHRKYS 415 ++ I + + S Sbjct: 402 NNMIVEIMKSALSPSLS 418 >gi|288921206|ref|ZP_06415492.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347413|gb|EFC81704.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 625 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 30/299 (10%), Positives = 73/299 (24%), Gaps = 62/299 (20%) Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 E + M + ++ F + T+ L + Sbjct: 378 PETGLAARPQELGRKPMMAPTVMTALQATFIGVHQRTNTLMVLDSSGSMAEP-------- 429 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + D S + + L Sbjct: 430 -----------------VPASGDRSR-----------LAVALDAAKAALPMFADGSNLGL 461 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 F + E V +G + + + G + + + T Sbjct: 462 WRFSSRLHGADDWDELVGLGPVEEQVGGADGADEAGGVPRRQAVIDEMSRIEPRGDTGLY 521 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ L + + ++ LTDG N+ +++ ++ Sbjct: 522 ETTLAAFRHLNQHYEDGW----------------PNQVVLLTDGR-NSDPGSMSLDELVR 564 Query: 356 KAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS----LIHVFQNI 405 + + ++I+TI + L + ++ Y ++ +S L+ F I Sbjct: 565 TLRREYSALHPVRIITIGYGEDADLGALARISDATGAQSYPALDPNSIFVVLVGAFTEI 623 >gi|254443409|ref|ZP_05056885.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257717|gb|EDY82025.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 257 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 67/185 (36%), Gaps = 35/185 (18%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ A K AL+ F +++ ED +GL+ + V + + G + ++ Sbjct: 93 KIEAGKQALVTFANNL------PEDANLGLLTF-DPVRELLPL--GRGNRQAFIGSVSQI 143 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T ++ Y++LT +R + ++ +TDG +++ Sbjct: 144 RAKGRTPLVESIVTGYRVLTEQAQRQSGYG-------------RYVLVIVTDGASSDGNP 190 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQN 404 +++ +E+ I++ TI + + L P + + +LI Sbjct: 191 AGVAMEV---TRESPIEVQTIGFGVADHALNL-------PGVTQYVTASSPKALIDALNQ 240 Query: 405 -ISQL 408 I+ Sbjct: 241 VIASE 245 >gi|153006057|ref|YP_001380382.1| hypothetical protein Anae109_3202 [Anaeromyxobacter sp. Fw109-5] gi|152029630|gb|ABS27398.1| hypothetical protein Anae109_3202 [Anaeromyxobacter sp. Fw109-5] Length = 370 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 51/183 (27%), Gaps = 12/183 (6%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 F +D + ++ K+ R +D + + Sbjct: 147 TGTFSALLDWFQDEYVCDGSPALPDAAVLDIPTPIPGDATKLFAATARMVDRMKQQYDAI 206 Query: 295 ---TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 P+ + A + T ++F F G + + Sbjct: 207 VAPGPSDQFAMVVFTDGWD---NYSWFDNSAAAAFSLSAAGGTFSCAG-----TAPMTLD 258 Query: 352 KICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + +K + +++ I + + L + + N+D + ++FQ I++ Sbjct: 259 GLLEKLRAFPQLQVHVIGLGNTIKASELSAIAETGHGRFVSNPNSDQVAYLFQEIAREFT 318 Query: 411 HRK 413 + Sbjct: 319 TAR 321 >gi|83645400|ref|YP_433835.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83633443|gb|ABC29410.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 749 Score = 43.3 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 66/222 (29%), Gaps = 48/222 (21%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 P + +++D+SGSM + + AL L + + Sbjct: 30 PPADVRVLIDVSGSMKKNDPKNLR-------------------RPALNLVTELLPEGDSA 70 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 + + + K + T+ A+++A Sbjct: 71 GVWTFGQYVNELAPHQVVDPGWRRLAKDKAREISSTALY----TNIGAALEKA------- 119 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--------ENNNFKSNVNTIKICDKAKEN 360 + F +G + I LTDG N + + ++ + Sbjct: 120 ------SEDFVEGKDYSNT----HFILLTDGVVDISQKPGENVAERDRVLTQVLKRVAGF 169 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 KI TI+++ + + L + + S + N++ L VF Sbjct: 170 GAKIHTIALSRNADQMLLQRLSIGSNGINAIAENSEQLSRVF 211 >gi|291008392|ref|ZP_06566365.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 538 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 17/99 (17%), Positives = 28/99 (28%), Gaps = 4/99 (4%) Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVT 366 I+ +TDGENN S + + + T Sbjct: 438 WSALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQDFLRNHQARDPAAKAVHTYT 497 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N L + ++ + NA SL F+ I Sbjct: 498 VRFG-EANPGELDQAARATGGRMVDA-NATSLSEAFKEI 534 >gi|254497958|ref|ZP_05110722.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] gi|254352852|gb|EET11623.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] Length = 607 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 54/171 (31%), Gaps = 26/171 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V + + ++ +E+ I P T K ++ + + + + A + Sbjct: 64 VNNRAVVSVSTFSNDLEQIIPPQIVTAKFQESFLKKKKQIKSDGAWTN--IDAALNGVNK 121 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-------AKEN 360 ++ +K II LTDG + +N + Sbjct: 122 SWSKN-----------------KKVIILLTDGMLDLGSDALNKKSTQQLNETTIPILQRE 164 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +++ TI ++ + L + + V++A L + I ++ Sbjct: 165 HVQVYTIGLSNEADSTLLSNISLKTNALFQPVISAKDLDNALYAIFSSVIS 215 >gi|144898055|emb|CAM74919.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 174 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 44/170 (25%), Gaps = 11/170 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + ++ FAL + F L+I ++ +K ME A A Sbjct: 5 LTKILHRLRGDSRGSTAVEFALVALPFFLMIAGMVETGYVAFKAAVMEGATREAA----- 59 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + A R + I + F + + Sbjct: 60 -----RQVRTGVVQGAGDAAARFQQEFCPNLIGLFPCQDFYFDVRSFADFATIALPAPVF 114 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-LLNQRYNQKIVS 175 + N + + V+ + + I + ++S Sbjct: 115 DAAGIPTNVQFSPGGANTVVTVRVIHVHSFITPLIGSLMGGGDGTLPLIS 164 >gi|47218505|emb|CAF97239.1| unnamed protein product [Tetraodon nigroviridis] Length = 1060 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 19/187 (10%), Positives = 53/187 (28%), Gaps = 21/187 (11%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + + +K+ + + S E V G+ + + I + + + Sbjct: 475 QNSFSHVKDFISAIITSFKDSVVGTEGVRFGVTVFGDVPKMRIALT-DYSSQEEVLRAIR 533 Query: 285 DSLILKPTD-STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 D + A+ Q + S I K + +T+G Sbjct: 534 DLPYEGRSRRIGDALTFLVQHVFS--------------PVIRRDHGPKIAVLITNGR--- 576 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 S+ ++ I + + + + + E+ + +L ++ Sbjct: 577 --SDDPVDAAARLVADSGISLFAVGVGGADASELRRMVSEPREEHLLLGADYSALENLLA 634 Query: 404 NISQLMV 410 +S+ + Sbjct: 635 RLSRRVC 641 >gi|116255769|ref|YP_771602.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115260417|emb|CAK03521.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 545 Score = 43.3 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 21/53 (39%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 K+ E+ S I ++ L+I +++ Y K + +AA L Sbjct: 5 LIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 43.3 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 51/186 (27%), Gaps = 20/186 (10%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ + E + +GL Y+ S K Sbjct: 45 QHARSVRNFLYILVNSFNVSSETIRVGLAKYSDVPHSEFLLSTYHRKGDVLRHIRQFQFK 104 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A+K +F S + + ++ G + Sbjct: 105 PGGKKMGLALKF-----------ILDHHFQEASGSRASQEVPQIAVVISSGPVEDHVHGP 153 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNIS 406 ++ I + I + + + L+ SSP+ ++ V N L ++ Q + Sbjct: 154 -----AKALRKAGILLYAIGVRDAVWAE--LREIASSPQENFTSFVPNFSGLSNLAQKLR 206 Query: 407 QLMVHR 412 Q + Sbjct: 207 QELCDT 212 >gi|302392065|ref|YP_003827885.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] gi|302204142|gb|ADL12820.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] Length = 230 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 39/113 (34%), Gaps = 5/113 (4%) Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 R + ++ T++ + L K + GV + +L I+ +TDG Sbjct: 102 DRSTNLVVNIATNNKSTLLAKINNLKPAGKSPIGKALSQAGVDLINLDGNNHILLVTDGR 161 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN-ASPNGQRLLKTCV-SSPEYHYN 391 + + D+ +N I I + I + +LK+ ++ Sbjct: 162 DTGNIVPSR---VADRLSKNGIYIHVLKIGAVGNQQRAVLKSIARLGGGRYFT 211 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 40/95 (42%), Gaps = 7/95 (7%) Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---ASPNGQ 376 +K +I +TDG+ +K + + +A++ I I + P + Sbjct: 5 HHKNGARKSAKKILIVITDGQK--YKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTAR 62 Query: 377 RLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + L T S+P ++ + V N +L + + + + + Sbjct: 63 QELNTISSAPPQDHVFKVDNFAALGSIQKQLQEKI 97 >gi|327538530|gb|EGF25193.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1460 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 30/313 (9%), Positives = 86/313 (27%), Gaps = 31/313 (9%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + D +++S A A + + + SL+ + + + + Sbjct: 818 AVMSIHRDGIKTSDAQQAVSIQATDAPTNVELTLAVDQKRPSLAEVNFRGNRYQSSVMSS 877 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + +T + TI MD + + + + + + S + Sbjct: 878 GTAFGVTSRGQTVQKGAGITIRDAMDAGRAITFVMDCSASMNDPLGEEMGRSAL------ 931 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS-- 241 G + E M + P + K++ + Sbjct: 932 --GAQRASKFEAARSAVYEMMRRLQPGPSQIGLVLYGHRMAIRAGDPAKDSGDGSGQTTL 989 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + H + + + + T ++ ++ T ++ +A Sbjct: 990 LQKRYHKRFPFPPTIQPFEDVEVALPTGRFDTAELELARQHFDAAVPWGQTPLYLSIWKA 1049 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI----KICDKA 357 + ++ +K ++ ++DG N F I ++ +A Sbjct: 1050 MEDIS-----------------RTGDGVRKDVVVISDGRNYQFNPTPEAIFSIGQLVTRA 1092 Query: 358 KENFIKIVTISIN 370 K +++ I Sbjct: 1093 KTLGVQVHVIGYG 1105 >gi|114568964|ref|YP_755644.1| hypothetical protein Mmar10_0413 [Maricaulis maris MCS10] gi|114339426|gb|ABI64706.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 183 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 10/91 (10%), Positives = 26/91 (28%), Gaps = 8/91 (8%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + ++ + ++ FAL +LL + V + I + +S Sbjct: 9 LTGFLRRFGGDRRGVSAVEFALIAPFMILLYLGSVEVSLALS--------IDRKITSISS 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRF 97 + +++ + A F Sbjct: 61 ALADLVAQDDVITDDEITDILNAGAVIVAPF 91 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 73/232 (31%), Gaps = 38/232 (16%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + P + ++D SGSM + K A+ Sbjct: 291 STGGAVAQQMPSREVVFLLDTSGSMA--------------------GESIVQAKRAVDFA 330 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 L + +V + + + + + + +++ T+ PA+ Sbjct: 331 LTQLRPEDNVNIIQFNDAPQALWK--RAMPATA--KHIQRARNWVASLHADGGTEMAPAL 386 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A + R + + ++F+TDG +N + ++ I+ K Sbjct: 387 TLALN----------KPSLHRDDSDLLGSHKLRQVVFITDGSVSNEDALMSLIES--KLA 434 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 +N ++ TI I ++PN + + + + + + H + + Sbjct: 435 DN--RLFTIGIGSAPNSYFMTQAAQAGRGTFTYIGDIQQVQHKMTALFNKLT 484 >gi|92116019|ref|YP_575748.1| TadE-like [Nitrobacter hamburgensis X14] gi|91798913|gb|ABE61288.1| TadE-like protein [Nitrobacter hamburgensis X14] Length = 204 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 12/125 (9%), Positives = 34/125 (27%), Gaps = 8/125 (6%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + +I FA+ V L+++ I V K + A + S Sbjct: 15 MRRDSRGVAAIEFAMIVPLMLVMLFGTIDVSSGVAVKRK--------VTLVARTLSDLTS 66 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + A + ++ ++S + + + + Sbjct: 67 QSKVVGDADITNFLAASYGIMWPYPSAPVQATISELYIDPATSVARVQWSQGSSPRGVGS 126 Query: 134 NNRLD 138 + Sbjct: 127 TVSIS 131 >gi|1705853|sp|P54290|CA2D1_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|203955|gb|AAA41088.1| dihydropyridine-sesitive L-type calcium channel alpha-2 subunit [Rattus norvegicus] Length = 1091 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 270 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 327 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 328 GITDYKKGFTFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 369 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 370 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 418 >gi|291223805|ref|XP_002731898.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 885 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 63/188 (33%), Gaps = 27/188 (14%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-GTEKV 276 ++ L A ++ L ++G++ ++T + + ++V Sbjct: 311 VSGSMFTDDRIHLLVQAATRYIRYTLPL-----GSWLGMVEFSTSSRITLPLTQITDDQV 365 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R+ + + ++ T A+ ++L + + + + Sbjct: 366 REDLIDMLPKIVNGHTCIGCALLDGIEVLDRSTEGTEG----------------GRLFLI 409 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA- 395 TDG N + I K + + + T++++ + Q + + V+ +Y + Sbjct: 410 TDGLENYEPFINDVIG--GVIK-SGVVVDTLALSDEADPQLTMLSQVTGGRSYYYSESEI 466 Query: 396 -DSLIHVF 402 +L F Sbjct: 467 STALHDAF 474 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 90/308 (29%), Gaps = 66/308 (21%) Query: 113 FYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRY--- 169 +NS + + + F ++ D L++ + Sbjct: 225 NATLNTGLTLNSVTSRNHRVNWSESTGNYLVTFNQSNIKMDRDIWLEWQPSPSSAPQAAI 284 Query: 170 --------NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + +V +P ++ + + I V+D SGSM Sbjct: 285 FTESKGQHDYALVMLMPPQVKSQDLQDFDRDITFVIDTSGSM------------------ 326 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-WGTEKVRQYV 280 + K +L L +D + ++ + + E S GT + +QY Sbjct: 327 --GGRPIVDAKESLQLAIDRLSEKDRFN------VVAFNNDTTRLFETSVEGTTRNKQYA 378 Query: 281 -TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T+ PA+ A + + + F K ++F+TDG Sbjct: 379 RDFVKHLNAGGGTEMAPALNAALK-------------------RTTTKDFIKQVVFITDG 419 Query: 340 ENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 N + + K ++ T+ I ++PN + + + V N Sbjct: 420 AVGN------EAALFSQIKNELGDARLFTVGIGSAPNSYFMTRAAQFGLGSYVFVRNTAD 473 Query: 398 LIHVFQNI 405 + ++ Sbjct: 474 IKQQMDSL 481 >gi|108763155|ref|YP_629622.1| hypothetical protein MXAN_1365 [Myxococcus xanthus DK 1622] gi|108467035|gb|ABF92220.1| hypothetical protein MXAN_1365 [Myxococcus xanthus DK 1622] Length = 1494 Score = 43.3 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 21/184 (11%), Positives = 55/184 (29%), Gaps = 13/184 (7%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + + + + ++ + + +G++ +R D + T T Sbjct: 298 RFNTGTPLTQAMWGASTYFRSAGSDPFPDWFGSDYLRDSGFNDEAAPGRAATCFTCGFN- 356 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD----- 355 A +LT + + + ++ +L +N+ S+ + +I Sbjct: 357 AMILLTDGEPNEPGGDSAQVPAQVRNLDVPCSNCAAASQGSNSGGSSSHIHRIAKWMWTN 416 Query: 356 KAK-----ENFIKIVTISINASPNGQ-RLLKT-CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + T+ + LL+ + Y N+ L Q I Sbjct: 417 DLRPELSGSQAVATYTVGFALTNTQAINLLRVTADAGGGRFYAATNSSQLKTALQAIVDD 476 Query: 409 MVHR 412 + +R Sbjct: 477 VQNR 480 >gi|319639064|ref|ZP_07993821.1| PilC protein [Neisseria mucosa C102] gi|317399642|gb|EFV80306.1| PilC protein [Neisseria mucosa C102] Length = 1123 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 33/76 (43%), Gaps = 10/76 (13%) Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN--GQRLLKTCVSSPEYHYNVVNA 395 DG+ ++ K + ++ ++ T+ + G+ L+ S P++++N Sbjct: 349 DGDPSDPKGVDYSKQL--------VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKK 400 Query: 396 DSLIHVFQNISQLMVH 411 + L+ F+ I + + Sbjct: 401 EDLLEAFKTIVDNIEN 416 >gi|221119984|ref|XP_002166572.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 5008 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 61/186 (32%), Gaps = 22/186 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + F+ V +G++ + + +++ S +KV+ ++ + Sbjct: 4364 IQQFMGDFISKVMNDSKVSVGVVSFGQFQQMDVQLSKDQQKVQLGISAMKYMGDTG--NL 4421 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 T A+K Q + ++ Q+ I L G + + N +I Sbjct: 4422 TSALKFVDQNIYQTDQK--------------RSNVQQLCIIL-GGIPHY---SQNADEIA 4463 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLMVHR 412 DK K +++ + I LL S P + + LI ++ + Sbjct: 4464 DKLKSQGVEVFAVGIGKMFKTDTLLLQIASIPILGHTLFTDYENLLITATNSLYNKIKKG 4523 Query: 413 KYSVIL 418 Y+ I Sbjct: 4524 WYAYIA 4529 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 774 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 822 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 823 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 879 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 880 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 932 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 933 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 989 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 468 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 516 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 517 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 573 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 574 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 626 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 627 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 683 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 261 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 309 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 310 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 366 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 367 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 419 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 420 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 261 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 309 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 310 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 366 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 367 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 419 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 420 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 258 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 306 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 307 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 363 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 364 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 416 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 417 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 473 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 229 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 277 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 278 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 334 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 335 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 387 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 388 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 444 >gi|148685681|gb|EDL17628.1| integrin alpha M, isoform CRA_c [Mus musculus] Length = 1037 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 167 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 221 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 222 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 268 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 269 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 326 Query: 403 QNISQLM 409 + + + Sbjct: 327 NQLQEKI 333 >gi|148685680|gb|EDL17627.1| integrin alpha M, isoform CRA_b [Mus musculus] Length = 1168 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 167 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 221 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 222 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 268 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 269 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 326 Query: 403 QNISQLM 409 + + + Sbjct: 327 NQLQEKI 333 >gi|132626289|ref|NP_001076429.1| integrin alpha-M isoform 1 precursor [Mus musculus] Length = 1154 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|132626295|ref|NP_032427.2| integrin alpha-M isoform 2 precursor [Mus musculus] gi|148685679|gb|EDL17626.1| integrin alpha M, isoform CRA_a [Mus musculus] gi|162318464|gb|AAI56095.1| Integrin alpha M [synthetic construct] gi|162319590|gb|AAI56992.1| Integrin alpha M [synthetic construct] Length = 1153 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|74222716|dbj|BAE42227.1| unnamed protein product [Mus musculus] Length = 1064 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 76 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 130 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 131 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 177 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 178 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 235 Query: 403 QNISQLM 409 + + + Sbjct: 236 NQLQEKI 242 >gi|74212839|dbj|BAE33379.1| unnamed protein product [Mus musculus] Length = 967 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|74213162|dbj|BAE41718.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|74212905|dbj|BAE33399.1| unnamed protein product [Mus musculus] Length = 1232 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|74191759|dbj|BAE32836.1| unnamed protein product [Mus musculus] Length = 1166 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|74178358|dbj|BAE32446.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|124956|sp|P05555|ITAM_MOUSE RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: CD_antigen=CD11b; Flags: Precursor gi|52983|emb|CAA30479.1| unnamed protein product [Mus musculus] Length = 1153 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 73/237 (30%), Gaps = 39/237 (16%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D + K L ++ + G Sbjct: 259 VLDPSGSMNIYLVLDGSDSIGSSN--------FTGAKRCLTNLIEKVASYGVRPRY---G 307 Query: 256 LIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + + S + V + + + D + T++ A++ Y +++ Sbjct: 308 LLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--- 364 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 II +TDG +N + V I+ +E Sbjct: 365 -----WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPRE 417 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 +++ + + + + + + V + + L +VF + Sbjct: 418 DYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 474 >gi|26333265|dbj|BAC30350.1| unnamed protein product [Mus musculus] Length = 1036 Score = 43.3 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K + ++ K L+ Y+ + + Sbjct: 166 FQKMKEFVSTVMEQFK-----KSKTLFSLMQYSDEFRIHFTFNDFKRNPSPRSHVSPIKQ 220 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + +++ + L K ++ +TDGE + Sbjct: 221 LNGRTKTASGIRKVVRELFHKTNG-------------ARENAAKILVVITDGEKFGDPLD 267 Query: 348 VNTIKICDKAKENFIKIVTISINAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVF 402 + +A + I + + P +R L T S P E+ + V N ++L + Sbjct: 268 --YKDVIPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQ 325 Query: 403 QNISQLM 409 + + + Sbjct: 326 NQLQEKI 332 >gi|332295408|ref|YP_004437331.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] gi|332178511|gb|AEE14200.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] Length = 438 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 77/222 (34%), Gaps = 41/222 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 QK+ + + E+ ER + V+D SGSM +N ++ Sbjct: 16 DTSGQKLFVLLSIEVSKEVEERGKLFVSFVLDTSGSMSETVN---------------DKS 60 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ + +L L+ + +K+D + ++ + V+ + + TEK + + + + Sbjct: 61 KIEIVIESLKKILE----SNILKDDDEISIVTFDDEVKIVLPFTAATEKEKIFSSFEQIR 116 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ MK + +L K ++ LTDG Sbjct: 117 TGTVGTNLGAGMKVSLDLLKDKAGI-------------------KKMVVLTDGNV---FD 154 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 K+ D+ + I ++++ + N L + + Sbjct: 155 LDLVEKVLDELVFSNISVISVGVGDEWNEDLLCRISDRTLGK 196 >gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica] gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica] Length = 1393 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 35/232 (15%), Positives = 69/232 (29%), Gaps = 50/232 (21%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + + V+D SGSM TK+ K+A + S+D Sbjct: 781 SSSSAPYNVVFVMDKSGSM--------------------IGTKLDQTKDAFRSMISSLDR 820 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + R + ++ E+ R +++R T+ A+ A ++ Sbjct: 821 NAKFSIVGFNYATT-AWRNKLVRATNYNVEEARSFISRIS---AGGGTNMHAALLDAIEL 876 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---- 360 S+ + I+F+TDG + + D K Sbjct: 877 CNSESSST----------------VPCMIMFMTDGTATVG-VTEESRILADVTKSRQQGK 919 Query: 361 -FIKIVTISINASPNGQRLLKTCVSSPE---YHYNVVNAD-SLIHVFQNISQ 407 I + I A + L + V + + NA + F + + Sbjct: 920 ANIALNVIGFGAGISYSFLSRLSVLNSGIARQIFEDTNAAFQMQGFFDEVMK 971 >gi|262393420|ref|YP_003285274.1| hypothetical protein VEA_002647 [Vibrio sp. Ex25] gi|262337014|gb|ACY50809.1| hypothetical protein VEA_002647 [Vibrio sp. Ex25] Length = 154 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 35/149 (23%), Gaps = 5/149 (3%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + + I S++ I F L++ L+L +I V + + A Sbjct: 6 RKHQRPNICSQQGLAIIEFILALPVLLMLTVLVIDVSRAFIQYTEVNKALQNGARYAVVD 65 Query: 68 -----MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 +++ + + A I S T Sbjct: 66 TYGTLNFESIADETSIKNVVVYGSPIASTTPVIDHISVDDIVITQPTSTNKVVTLSATYN 125 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVM 151 S ++ L S Sbjct: 126 YVPIFSTLPFSSTSLQFSIGATTSMRTGP 154 >gi|296141040|ref|YP_003648283.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296029174|gb|ADG79944.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 527 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 19/209 (9%), Positives = 52/209 (24%), Gaps = 31/209 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW- 271 D + + T++ L + + I + V + + + + Sbjct: 333 DTSGSMNESAGDTTRIGMLASGFTKVVTQIPDANAVGLWTFSIGSATRPDWTEVVPTARL 392 Query: 272 -----GTEKVRQYVTRDMDSL--ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + + T AY+ + Sbjct: 393 DARRGDKSQRQALLDGVNALPRKVGGATGLYDTTLAAYRRAVENF--------------- 437 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-----ENFIKIVTISINASPNGQRLL 379 + +I LTDG + + + + + I T+ I+ + L Sbjct: 438 -DPAYSNSLILLTDGSDEKPGGMS-LDDLVAQLRTLVDPARPVNIHTVGISKDADLPALK 495 Query: 380 KTCVSSPEYHYNVVNADS-LIHVFQNISQ 407 + ++ + L I++ Sbjct: 496 RIADATGGTAQEADSEQQMLTDFVTAIAK 524 >gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica OS195] gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS195] gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein [Shewanella baltica OS678] Length = 771 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 79/250 (31%), Gaps = 44/250 (17%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 N +V +P + + LV+D SGSM Sbjct: 367 NEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMA--------------------GD 406 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + KNALL L + + + + S + RQ+V+R Sbjct: 407 SIVQAKNALLYALKGLKPEDSFNIIEFNSSLSQFS-ATSLPATSSNLSRARQFVSRL--- 462 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+ A ++ + +IF+TDG N ++ Sbjct: 463 QADGGTEMALALDAAL-------------PKSLGSASPDAVQPLRQVIFMTDGSVGNEQA 509 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I+ + E+ ++ T+ I ++PN + + + D + IS Sbjct: 510 LFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVDEVDE---KIS 562 Query: 407 QLMVHRKYSV 416 L+ +Y V Sbjct: 563 ALLSKIQYPV 572 >gi|84685161|ref|ZP_01013060.1| hypothetical protein 1099457000257_RB2654_09849 [Maritimibacter alkaliphilus HTCC2654] gi|84666893|gb|EAQ13364.1| hypothetical protein RB2654_09849 [Rhodobacterales bacterium HTCC2654] Length = 164 Score = 43.3 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 14/161 (8%), Positives = 43/161 (26%), Gaps = 15/161 (9%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWH-YKKNSMESANNAAILAG 64 R + ++ ++ F L + L+L G I + ++ + A Sbjct: 2 RAFRHIRRFAHRDEGAVLAEFGLVLPLMLILFGVTIEAARTFWAYQATIAGVRD-ATRYV 60 Query: 65 AS--------KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-----GYSA 111 A ++ ++L I +A + + + +L+ + Sbjct: 61 ARVETPTICDEVGADLDDWQATVTDIVRNASDGTLIFPASITVSSVTAALTCASGDYRTG 120 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT 152 + ++N + + + + Sbjct: 121 TVPVATVTAVLNITFPFARVFELAGMTLPTASTQVSDQGRI 161 >gi|301764709|ref|XP_002917776.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Ailuropoda melanoleuca] Length = 1091 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|281338501|gb|EFB14085.1| hypothetical protein PANDA_006133 [Ailuropoda melanoleuca] Length = 984 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 141 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 198 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 199 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 240 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 241 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 289 >gi|194209576|ref|XP_001915257.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Equus caballus] Length = 1016 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 245 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 302 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 303 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 344 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 345 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 393 >gi|167744177|ref|ZP_02416951.1| hypothetical protein Bpse14_39258 [Burkholderia pseudomallei 14] gi|167851185|ref|ZP_02476693.1| hypothetical protein BpseB_38456 [Burkholderia pseudomallei B7210] gi|167916479|ref|ZP_02503570.1| hypothetical protein Bpse112_38762 [Burkholderia pseudomallei 112] Length = 396 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 74/319 (23%), Gaps = 20/319 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I+ + + S I + S I +R N Sbjct: 61 ITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVN 120 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDL 199 I V + + IP + P+ + Sbjct: 121 WFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGD 180 Query: 200 -------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +G+ + +S + A + + D + Sbjct: 181 WLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTN 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DSTPAMKQA 301 + Y + + + D + + T + Sbjct: 241 AYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITG 300 Query: 302 YQILTSDKKRSFFTNFFRQ 320 S + R+ Sbjct: 301 INTGGSYNPGYYAAGADRR 319 >gi|134101426|ref|YP_001107087.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] gi|133914049|emb|CAM04162.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 501 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 17/99 (17%), Positives = 28/99 (28%), Gaps = 4/99 (4%) Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVT 366 I+ +TDGENN S + + + T Sbjct: 401 WSALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQDFLRNHQARDPAAKAVHTYT 460 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + N L + ++ + NA SL F+ I Sbjct: 461 VRFG-EANPGELDQAARATGGRMVDA-NATSLSEAFKEI 497 >gi|150398536|ref|YP_001329003.1| TadE family protein [Sinorhizobium medicae WSM419] gi|150030051|gb|ABR62168.1| TadE family protein [Sinorhizobium medicae WSM419] Length = 204 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 2/56 (3%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R F++ I + +I FA+ + F +++ I + + +A + LA Sbjct: 16 RDPFRRLIGDREGATAIEFAILALPFFIVVFASIETFVAFAGEQLLANATD--TLA 69 >gi|161086904|ref|NP_001104317.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 3 [Rattus norvegicus] gi|17864880|gb|AAL47093.1|AF400662_1 L-type calcium channel alpha2/delta subunit [Rattus norvegicus] Length = 1079 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|31542335|ref|NP_037051.2| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 1 [Rattus norvegicus] gi|11055592|gb|AAG28164.1|AF286488_1 voltage-gated calcium channel alpha2/delta-1 subunit [Rattus norvegicus] Length = 1091 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|161086906|ref|NP_001104318.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Rattus norvegicus] gi|27450704|gb|AAO14652.1|AF486276_1 calcium channel alpha-2 delta-1 subunit isoform e [Rattus norvegicus] Length = 1084 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|47523446|ref|NP_999348.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Sus scrofa] gi|3341749|gb|AAC36289.1| voltage-dependent calcium channel alpha-2 delta subunit precursor [Sus scrofa] Length = 1091 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|161086900|ref|NP_001104315.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform c [Mus musculus] gi|1905821|gb|AAB50140.1| voltage-gated calcium channel alpha2/delta subunit, alpha2c isoform [Mus musculus] gi|109732367|gb|AAI15872.1| Cacna2d1 protein [Mus musculus] gi|148671297|gb|EDL03244.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Mus musculus] Length = 1086 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|161086898|ref|NP_001104314.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform b [Mus musculus] gi|1905819|gb|AAB50139.1| voltage-gated calcium channel alpha2/delta subunit, alpha2b isoform [Mus musculus] Length = 1091 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|161086902|ref|NP_001104316.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform d [Mus musculus] gi|1905823|gb|AAB50141.1| voltage-gated calcium channel alpha2/delta subunit, alpha2d isoform [Mus musculus] gi|148671298|gb|EDL03245.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_e [Mus musculus] Length = 1079 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|73981989|ref|XP_852918.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Canis familiaris] Length = 1147 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 327 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 384 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 385 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 426 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 427 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 475 >gi|6753234|ref|NP_033914.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform e [Mus musculus] gi|1905825|gb|AAB50142.1| voltage-gated calcium channel alpha2/delta subunit, alpha2e isoform [Mus musculus] gi|148671296|gb|EDL03243.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Mus musculus] Length = 1084 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|74189919|dbj|BAE24585.1| unnamed protein product [Mus musculus] Length = 319 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 45/137 (32%), Gaps = 17/137 (12%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-P 291 + D + + + +G++ Y V + + V + Sbjct: 188 SVTAFLNDLLKRMDIGPKQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQ 247 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + + A + ++ + +K ++ +TDGE+++ I Sbjct: 248 TMTALGIDTARKEAFTEARG-------------ARRGVKKVMVIVTDGESHDNYRLKQVI 294 Query: 352 KICDKAKENFIKIVTIS 368 + C+ + I+ +I+ Sbjct: 295 QDCE---DENIQRFSIA 308 >gi|126722583|ref|NP_001075745.1| voltage-dependent calcium channel subunit alpha-2/delta-1 preproprotein [Oryctolagus cuniculus] gi|116409|sp|P13806|CA2D1_RABIT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|164763|gb|AAA81562.1| dihydropryridine-sensitive calcium channel alpha-2 subunit [Oryctolagus cuniculus] Length = 1106 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 273 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 330 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 331 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 372 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 373 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 421 >gi|161086896|ref|NP_001104313.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform a [Mus musculus] gi|46576352|sp|O08532|CA2D1_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|1905817|gb|AAB50138.1| voltage-gated calcium channel alpha2/delta subunit, alpha2a isoform [Mus musculus] gi|148671294|gb|EDL03241.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Mus musculus] Length = 1103 Score = 43.3 bits (100), Expect = 0.073, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|304411390|ref|ZP_07393004.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|307305288|ref|ZP_07585036.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] gi|304350245|gb|EFM14649.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|306911591|gb|EFN42016.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] Length = 771 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 38/250 (15%), Positives = 74/250 (29%), Gaps = 44/250 (17%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 N +V +P + + LV+D SGSM Sbjct: 367 NEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMA--------------------GD 406 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + KNALL L + + + + + + Sbjct: 407 SIVQAKNALLYALKGLKPEDSFNIIEFNSSLS----QFSATPLPATSSNLSRARQFVSRL 462 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+ A V ++ + +IF+TDG N ++ Sbjct: 463 QADGGTEMALALDAAL-------------PKSLGSVSPDAVQPLRQVIFMTDGSVGNEQA 509 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I+ + E+ ++ T+ I ++PN + + + D + IS Sbjct: 510 LFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVDEVD---AKIS 562 Query: 407 QLMVHRKYSV 416 L+ +Y V Sbjct: 563 ALLSKIQYPV 572 >gi|156349150|ref|XP_001621939.1| predicted protein [Nematostella vectensis] gi|156208297|gb|EDO29839.1| predicted protein [Nematostella vectensis] Length = 147 Score = 43.3 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 17/167 (10%), Positives = 47/167 (28%), Gaps = 23/167 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 I+ + + + + ++ D + T A+K Sbjct: 2 IENFEISPSSTRISVATFENFPKMEFSFQKYSDLNSAKFAVDAIQISNGGTRIGEALKL- 60 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + ++ +TDG S+ + + ++ Sbjct: 61 ---------------VKTDMFNTARSNVPRMLLVMTDGR-----SSDDVVAPSRALRDIG 100 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 + I+T+ + + + + LK S + + D L + + + Q Sbjct: 101 VTILTLGLGSDYDLDQ-LKMIASGVDLVFE-SPFDQLPQMVEKVQQK 145 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 32/296 (10%), Positives = 67/296 (22%), Gaps = 23/296 (7%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM---VSNLSRL 75 I AL ++ + ++G + V + K+ ++ NAA LAG S S+ Sbjct: 18 HGQSIPIIALMILILVAMVGLSVDVGNTFSKERQAVASANAASLAGMSAYMARSSSTLDT 77 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA-----------------VFYNTEI 118 ++ L+ + + + Sbjct: 78 TIYQAITASLQSNGLVVGDGTNNTVEVTANYLDSQGNLLAGHPVVGSGGTAPNGAAYIRV 137 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 Q + + + L + + V + + I P Sbjct: 138 QLSGMVNTSFARVVGRDDLPINADAHAGLCQVNSGVYPIA-VDNAYIGNGVFNNIGVTNP 196 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + V D S ++ ++A Sbjct: 197 STEYKVLSNGMVQRRVYVRDGDDSPGQFGWLRWKEDKGELGQAAGSAGELAQSLTGDGNL 256 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPT 292 D + + +WG VT +D I T Sbjct: 257 DWGFDEAPWPSNETAPSDYPNNPHSINIGDWAWGNSGWSNSNAVTSAIDQHIANST 312 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 20/202 (9%), Positives = 56/202 (27%), Gaps = 27/202 (13%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 Y L + Q ++ D+ S Sbjct: 399 SQKPIQYVVVLDASGSMSANFDGQCNN-----SGGVKQCANGPSGFP---DVQVSNTGYD 450 Query: 208 NSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEK 265 + K ++ L N + ++ + M ++ + + Sbjct: 451 YWWTTESQRRIYVAKKALERLVTLSN-----MPGNPGYTNTRPSDQMAVVWFNDGVSSSQ 505 Query: 266 NIEPSWGTEKVRQYVTRDM----DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + ++ Y+T + T+ + +A +L + ++ N Sbjct: 506 TQAFTNNPTTLKNYITTLNNVNGNYRSAGGTNGAGGLYRA-SLLYQNAPKTVSFNG---- 560 Query: 322 VKIPSLPFQKFIIFLTDGENNN 343 ++ +++ ++F+TDG +N Sbjct: 561 ---TNVEYKRVVLFVTDGVSNY 579 >gi|306922629|gb|ADN07507.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 16/139 (11%), Positives = 37/139 (26%), Gaps = 20/139 (14%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + Y+ + S T + A++ Sbjct: 70 SGAASAQGVRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRH--- 126 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + F + K I +TDG++ + K K +K Sbjct: 127 ---------VSDHVFLPHL--TRPGIPKVCILITDGKSQDLVDP-----AAQKLKGQGVK 170 Query: 364 IVTISINASPNGQRLLKTC 382 + + I + + + L + Sbjct: 171 LFAVGI-KNADPEELKRVA 188 >gi|306922621|gb|ADN07500.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 16/139 (11%), Positives = 37/139 (26%), Gaps = 20/139 (14%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + + V + Y+ + S T + A++ Sbjct: 70 SGAASAQGVRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRH--- 126 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + F + K I +TDG++ + K K +K Sbjct: 127 ---------VSDHVFLPHL--TRPGIPKVCILITDGKSQDLVDP-----AAQKLKGQGVK 170 Query: 364 IVTISINASPNGQRLLKTC 382 + + I + + + L + Sbjct: 171 LFAVGI-KNADPEELKRVA 188 >gi|284053489|ref|ZP_06383699.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 396 Score = 43.3 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 64/197 (32%), Gaps = 29/197 (14%) Query: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL--LS 246 P I ++VDL+GSM+ S ++ +A FL+ I Sbjct: 69 PPAWIVVLVDLTGSMNELDTSGK--------------RRIDGALDATRRFLEQISDRGGD 114 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTE-------KVRQYVTRDMDSLILKPTDSTPAMK 299 V G G + + ++ K ++ + TD + Sbjct: 115 TKVAIVPFGKGGANCPGFEVTQRGINSKFFPANDIKQTNFLDYLAAQTLCAATDIYGPLS 174 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-K 358 +A ++L + + F+ L + +I L+DG +N + + + Sbjct: 175 EAIRVLGNRQDPRFYV-----PEDSGRLEPRLSVILLSDGFHNQPNEQQDFDNLITLLER 229 Query: 359 ENFIKIVTISINASPNG 375 N I + T+ +P Sbjct: 230 NNNIIVHTLGYGLTPQQ 246 >gi|296205952|ref|XP_002806985.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Callithrix jacchus] Length = 3176 Score = 43.3 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 14/170 (8%), Positives = 48/170 (28%), Gaps = 22/170 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTD 293 L ++ ++ L + + +G++ ++ + K + + Sbjct: 262 LDFLVNLLEKLPVGTQQIRVGVVQFSDEPRTMFSLDTYSSKAQVLSAVKALGFAGGELAN 321 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ +F R G + ++ ++ G +++ + Sbjct: 322 IGLALDF-----------VVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIGDGVV--- 367 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 368 --ALKQ--ASVFSFGLGAQAASRAELQHVATDDNLVFTVPEFRSFGDLQE 413 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 17/187 (9%), Positives = 56/187 (29%), Gaps = 21/187 (11%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT--D 293 + + L+ V +G++ ++ V +V + + + + Sbjct: 1459 DFVIRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQVP-VLEAIRRLRLRGGSPLN 1517 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A++ + F + + ++ + G++ + S Sbjct: 1518 TGRALEFVARNF-----------FVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----F 1561 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + I ++ + + L+T + P + V + L ++ + I Sbjct: 1562 AQVIRSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVRDFRDLPNIEERIMNSFGPSA 1619 Query: 414 YSVILKG 420 + G Sbjct: 1620 ATPAPPG 1626 >gi|115976464|ref|XP_001187868.1| PREDICTED: similar to Synaptotagmin IX [Strongylocentrotus purpuratus] Length = 1426 Score = 43.3 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 65/186 (34%), Gaps = 28/186 (15%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMD 285 ++ L+ A ++++ + +G+I ++ ++ R+ + + Sbjct: 267 RLVLLQQASANIIENVLPM-----GSRLGIIAFSEYATVRHNIVEIDSQDTRRSLLNSLP 321 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T ++ A Q L + S P +I +TDGE N+ Sbjct: 322 KKADGRTSIGRGVELAVQKL----------------REYESDPAGSTLIVITDGEQNSHP 365 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-IHVFQN 404 + ++ K + + + T+++ +G+ L + + + + + + F+ Sbjct: 366 YIQDVLQ-----KTDNLTVNTVAVGGEADGELELLAVHTGGDSFSHTDKSSEIYLSFFKA 420 Query: 405 ISQLMV 410 + M Sbjct: 421 AAAQMS 426 >gi|53802770|ref|YP_115473.1| TPR domain-containing protein [Methylococcus capsulatus str. Bath] gi|53756531|gb|AAU90822.1| TPR domain protein [Methylococcus capsulatus str. Bath] Length = 579 Score = 43.3 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 19/219 (8%), Positives = 45/219 (20%), Gaps = 60/219 (27%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + A D + +++ L+ A+ L L+ Y + + Sbjct: 83 SPAVDATDLRPSRLTRLRYAVTDLLKRRKDGQ-------TALLVYGGDAFTVTPLTDDVD 135 Query: 275 KVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + D + D+ A+ A Q+L R + + Sbjct: 136 TIDAQLPALSSDLVPATGHDAGRALTLAAQLLRDAGFRHGDILLAAVRADPAAAELAAEL 195 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-------------------- 373 +I + + Sbjct: 196 ------------------------HGEGFRISVLGVGTEAGAPIPLRNGGFRKGASGEIE 231 Query: 374 ----NGQRLLKTCVSSPEYHYN----VVNADSLIHVFQN 404 + L + + D+L F+ Sbjct: 232 LSRLDAPALRELAAQGGGRYLELNSGAATTDALTTFFEQ 270 >gi|84501599|ref|ZP_00999771.1| hypothetical protein OB2597_15395 [Oceanicola batsensis HTCC2597] gi|84390220|gb|EAQ02779.1| hypothetical protein OB2597_15395 [Oceanicola batsensis HTCC2597] Length = 432 Score = 43.3 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 7/49 (14%), Positives = 18/49 (36%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 +F+ A + + +I+ + L + F I +N ++ Sbjct: 1 MTGHFRSFAARAEGSMTILSLYFLAGVLAVSAFAIDFGYLMSARNQLQV 49 >gi|311271865|ref|XP_001927121.2| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A domain-containing protein 2 [Sus scrofa] Length = 769 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 9/124 (7%), Positives = 35/124 (28%), Gaps = 20/124 (16%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V +G + + + + + + T++ A+K + Sbjct: 82 DVDPARVRVGAVQFGSTPRLEFPLDAFSTQQEVKAEIRRMAFKGGRTETGLALKYLLRKG 141 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + + + ++ +TDG + + ++ K+ + + Sbjct: 142 FPGGR---------------NASVPQVLLIVTDGRSQGHVAEP-----AEQLKQRDVTVF 181 Query: 366 TISI 369 + + Sbjct: 182 AVGV 185 >gi|302521148|ref|ZP_07273490.1| conserved hypothetical protein [Streptomyces sp. SPB78] gi|333025165|ref|ZP_08453229.1| hypothetical protein STTU_2669 [Streptomyces sp. Tu6071] gi|302430043|gb|EFL01859.1| conserved hypothetical protein [Streptomyces sp. SPB78] gi|332745017|gb|EGJ75458.1| hypothetical protein STTU_2669 [Streptomyces sp. Tu6071] Length = 154 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 16/150 (10%), Positives = 41/150 (27%), Gaps = 5/150 (3%) Query: 6 RFRFYFK-KG--IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 R + + + ++ + + L + FL L F++ ++ A Sbjct: 5 RLKSLVRERFPARTDDRGSGAGAVILFALVFLSLAAFVVDGGLSISQRERAADLAEQAAR 64 Query: 63 AGASKMVSNL--SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A + G+ + N R A+ + + A + E++ Sbjct: 65 YAAQDIDREALYGNEGEAPINAGNCPARVAAFAAESGMSGADAAASGCVEADADHVEVRI 124 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 + + + S + N Sbjct: 125 QLTYRPVFTGIFYGGSIHVSGTAVAENKVG 154 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 43.3 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 53/164 (32%), Gaps = 27/164 (16%) Query: 251 DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 L+ Y++ V GTE + + I T +T A+ +I + Sbjct: 102 SWRAALLQYSSHVIIEQTLKQWKGTENFKSSIAPMA--YIGHGTYTTYAITNMTKIFVEE 159 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 S K + LTDG + + + D AK +K+ TI Sbjct: 160 S----------------SPERIKIALLLTDGFFHPRNPD-IFSAMAD-AKNQGVKVFTIG 201 Query: 369 I----NASPNGQRLLKTCVSSPEYH-YNVVNADSLIHVFQNISQ 407 I N N L + YN+ + + + V I+Q Sbjct: 202 ITRTANDPVNAANLRLLSSTPASRFLYNLQDTNVMEKVITQIAQ 245 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 79/232 (34%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D D A A ++ I+ L+ G Sbjct: 175 VLDPSGSMNIYLVLDGSDSIGASNFTG-----------AKKCLVNLIEKLASYGVKPRYG 223 Query: 256 LIGYTT----RVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T V+ + S + V + + D + T++ A++ Y +++ Sbjct: 224 LVTYATXXXIWVKVSEAVSSNADWVTKQLNEINYEDHKLKSGTNTEEALQAVYSMMS--- 280 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC----------DKAKE 359 + + +G II +TDG +N + I +E Sbjct: 281 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDXXXXXXYIGKDRKNPRE 333 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 334 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 385 >gi|73980136|ref|XP_851163.1| PREDICTED: similar to vitrin isoform 2 [Canis familiaris] Length = 645 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 26/285 (9%), Positives = 68/285 (23%), Gaps = 30/285 (10%) Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 S S + T+ + + + ++ + + + + + Sbjct: 144 TYSSSKSPTAKTGEATKAYQKPSVPGTTAQPVTLMQVTGATASEASHTSLPKPSPSAVST 203 Query: 161 IEHLLNQRYNQKIVSFIPALL-RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 L +Q + L + + + S+ DP Sbjct: 204 TSSLRSQPMGHRSWELGEMDLWKPGSVLLDAGFVPKEELSTQSLEPVSQGDPNCKIDLSF 263 Query: 220 CQDKKR----TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GT 273 D + K D L MG++ Y + Sbjct: 264 LIDGSSGIGKRRFRIQKQ---FLADVAQTLDIGPAGPLMGVVQYGDNPAAQFNLRTHMNS 320 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 ++ + + +++ A+ + S + Sbjct: 321 RDLKTAIEKITQR--GGLSNAGRAISYVTKNFFSKANGNRGGA-------------PNVA 365 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 + + DG + + A+E+ I I I+I + ++ Sbjct: 366 VVIVDGWPTD-----KVEEASRFARESGINIFFITIEGATENEKQ 405 >gi|53723209|ref|YP_112194.1| hypothetical protein BPSS2192 [Burkholderia pseudomallei K96243] gi|52213623|emb|CAH39677.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 396 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 74/319 (23%), Gaps = 20/319 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I+ + + S I + S I +R N Sbjct: 61 ITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVN 120 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDL 199 I V + + IP + P+ + Sbjct: 121 WFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGD 180 Query: 200 -------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +G+ + +S + A + + D + Sbjct: 181 WLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTN 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DSTPAMKQA 301 + Y + + + D + + T + Sbjct: 241 AYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITG 300 Query: 302 YQILTSDKKRSFFTNFFRQ 320 S + R+ Sbjct: 301 INTGGSYNPSYYAAGADRR 319 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 6/31 (19%), Positives = 12/31 (38%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 + A+ L G ++V D K ++ Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQE 31 >gi|154484357|ref|ZP_02026805.1| hypothetical protein EUBVEN_02070 [Eubacterium ventriosum ATCC 27560] gi|149734834|gb|EDM50751.1| hypothetical protein EUBVEN_02070 [Eubacterium ventriosum ATCC 27560] Length = 731 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 20/211 (9%), Positives = 50/211 (23%), Gaps = 39/211 (18%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG------ERPIFLIELV 196 T + V L + ++ + + A+ + + ++ Sbjct: 105 TTSGIIHVPLDVVLVLDISGSMSVDGDGKRADNLVKAVNKTITYLMKEDENNRVAVVAYN 164 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 + + + + +K + + + +G Sbjct: 165 TKATKILPLGRYYVGSKADYSNTNSYFSVSK---------------SSKNDGYKKLDIGK 209 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 I + T +KQ IL ++K ++ + Sbjct: 210 IK-------------NESTNNLTDKASKSIEVYGATYIQDGIKQGADILINEKTTTYTDS 256 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 Q V I+ L+DG S+ Sbjct: 257 VSGQKVTR-----TPNILLLSDGCPTLGNSD 282 >gi|160874283|ref|YP_001553599.1| type IV pilin biogenesis protein [Shewanella baltica OS195] gi|160859805|gb|ABX48339.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS195] gi|315266516|gb|ADT93369.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS678] Length = 1168 Score = 43.3 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 29/317 (9%), Positives = 71/317 (22%), Gaps = 67/317 (21%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 ++ + V + + ++ A Sbjct: 182 TGS-GRKKKAYPYFISSSPSSTWADALAAAKNTDFGVGQPVTFYADNYLRWYWLSKAGKL 240 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHV--KEDVYMGLIGYTTRVEKNIEPSWGTEKVR- 277 K +++ K A+ + S + + G + T+ R Sbjct: 241 PTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRA 300 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDK---------------KRSFFTNFFRQGV 322 +T + L T M +AY+ + + + +G Sbjct: 301 SLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGG 360 Query: 323 KIPSLPFQ----KFIIFLTDGENN--------------------NFKSNVNTIKICDKAK 358 S ++I++TDG ++ S + Sbjct: 361 SYESPFKVCTDIAYVIYVTDGAPTVDKSANNDVISLTSTGSKDGDYSSFSKNLDTASYLP 420 Query: 359 E----------------------NFIKIVTISI--NASPNGQRLLKTCVSSPEYHYNVVN 394 ++ TI A L +T ++ N Sbjct: 421 ALASYMFNNDLINKLDSSNTEQMQNVRTYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQN 480 Query: 395 ADSLIHVFQNISQLMVH 411 + L + + +++ Sbjct: 481 SIELQNALNDALSNILN 497 >gi|149925200|ref|ZP_01913495.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] gi|149813928|gb|EDM73579.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] Length = 353 Score = 43.3 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 31/243 (12%), Positives = 60/243 (24%), Gaps = 48/243 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + LVVD SGSM T+ AL + + S + + Sbjct: 66 VVLVVDRSGSM--------------SDNPLGDETRWEALHGVISEVV------SGQEASL 105 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +GL + + +W D+ A+ I + Sbjct: 106 QLGLTMF---PAADAGTTWEQGACLAPTQLDVAVGADTGAAILGALPGPNAITQGGTPAA 162 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENN----------NFKSNVNTIKICDKAKENFI 362 ++ K ++ +TDG N + + A + I Sbjct: 163 AAVQLAADHLRARDTQDPKLLVLVTDGAANCAADSADWQASLVYDEALQDAVANASMDGI 222 Query: 363 KIVTISINASPN---------GQRLLKTCVSSPEY------HYNVVNADSLIHVFQNISQ 407 + I ++L + Y V + L +I+ Sbjct: 223 TTHVVGIQIDTELDAQAGVVPAEQLHEVAQLGGAGLDGEYAFYQVEDQAMLSAALSSITA 282 Query: 408 LMV 410 + Sbjct: 283 DIS 285 >gi|107022601|ref|YP_620928.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116689550|ref|YP_835173.1| membrane protein [Burkholderia cenocepacia HI2424] gi|105892790|gb|ABF75955.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116647639|gb|ABK08280.1| membrane protein [Burkholderia cenocepacia HI2424] Length = 603 Score = 43.3 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 15/148 (10%), Positives = 46/148 (31%), Gaps = 12/148 (8%) Query: 13 KGIASEKANFSIIFALS-VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ FS++ ++ +++ L + + +++ ++ + A LA +M Sbjct: 11 RRASRQRGAFSVMAIIATLIAITTLGAIGV--GNLFFQRRDVQRIADMAALAAVQRMDDA 68 Query: 72 LSRLGDRFE--SISNHAKRALIDDAK-------RFIKNHIKESLSGYSAVFYNTEIQNIV 122 S+ + SN + D + + S + +V Sbjct: 69 CSQPTATATSNAQSNGLDASNGDTITIECGRWDTSVNPAPSYYAAATSGTTQLNAAKVVV 128 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDV 150 + + + N+D Sbjct: 129 TRQVPFFFVGPPQTVSAVSTARSTNIDT 156 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 43.3 bits (100), Expect = 0.080, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 83/293 (28%), Gaps = 64/293 (21%) Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN-----------QRYNQKIVSFIPAL 180 + + + I + D+ L++ + + + + P Sbjct: 311 VRIETTNDATREITLDGTAAADRDFVLEWSAVANDAPQVGLFREHVGKDDYVLAYVTPPA 370 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + ++ + V+D SGSM T + K +L L Sbjct: 371 --VASAKKAQREVVFVIDNSGSM--------------------GGTSIEQAKASLDYALS 408 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + +I + + + E S + + M T PA+ Sbjct: 409 HLQPGDRFN------VIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQGGTAMLPAL 462 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 A L + + + I+FLTDGE +N + I A+ Sbjct: 463 HAA---LDDSHQGNGL----------------RQIVFLTDGEISNEQ--QLLDAIA--AR 499 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +I + I +PN + ++ +A + + + + + Sbjct: 500 RGRSRIFMVGIGTAPNSYLMNHAAELGRGTFTHIGSAAEVDERMRALFDKLEN 552 >gi|307288765|ref|ZP_07568743.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|306500234|gb|EFM69573.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|315166099|gb|EFU10116.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1302] Length = 711 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 129 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 188 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 189 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 244 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 303 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 304 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 342 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 343 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 383 >gi|118372347|ref|XP_001019370.1| TPR Domain containing protein [Tetrahymena thermophila] gi|89301137|gb|EAR99125.1| TPR Domain containing protein [Tetrahymena thermophila SB210] Length = 1208 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 61/170 (35%), Gaps = 30/170 (17%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAY 302 +++K+ + LI + V E + + +Y + A+ Sbjct: 1044 NYIKQFDRLALISFNHNVNVCFELQVSGKNDKFIDKYFKDAKNLACTGDKALYNAIYDGI 1103 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK---- 358 ++ K P ++++ +TD +N + + ++ K Sbjct: 1104 KLFK----------------KTEPQPNSRWLVAITDNVDNYSRIDE------EQLKPLFF 1141 Query: 359 ENFIKIVTISINASPNGQRL-LKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 +N +K++ I +N N + + LK C ++ ++ L FQ+I+ Sbjct: 1142 QNNVKLILIGLNLKSNAKEVYLKLCKNTGGTFIENPDSMDLNVAFQSITN 1191 >gi|328947151|ref|YP_004364488.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447475|gb|AEB13191.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 538 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 54/199 (27%), Gaps = 46/199 (23%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +SF + ++ + + L+ D+S SM +++ A+ + Sbjct: 74 ISFGTDSVPVQKNGKAVSLVF---DISYSMEAKDAPGG-------------ISRLQAVSS 117 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPT 292 LD + + + + + VR + + + T Sbjct: 118 YASELLDR-------MNGCAVSAVLAKGDGIIAVPLTEDFQSVRSLIENLSPLLMTSEGT 170 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +K A + FI TD E + N Sbjct: 171 SLGNGIKSALSSFPEQSSAA------------------SFIWLFTDCEETD---NSLQSS 209 Query: 353 ICDKAKENFIKIVTISINA 371 + + K + I +V I + Sbjct: 210 LSECLK-SGIPVVIIGFGS 227 >gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99] gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length = 733 Score = 43.3 bits (100), Expect = 0.082, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 55/191 (28%), Gaps = 22/191 (11%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 KM A + A +D +D + I + + P+ + + Sbjct: 285 KMVAARRAAGRIVDMLDAGDRFCVLAFDDRIETPPAMPDGLVPASDRNRF-AASSWLGSL 343 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A ++L + Q ++ + DG+ + Sbjct: 344 RSRGGTVMAQPLTNAVEMLADSGEDR-----------------QASVVLVADGQISGEDH 386 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + ++ +I + ++ + N L + V + D L V ++ Sbjct: 387 LLRSLAPAVG----RTRIYCVGVDRAVNAGFLERLAGLGSGRAELVESEDRLDEVMARLA 442 Query: 407 QLMVHRKYSVI 417 + + + I Sbjct: 443 RTIGRPALTSI 453 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 43.3 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 51/168 (30%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + + ++ + + + Sbjct: 278 KTSVHEMLDTLSDDDYVNVASFNEKAKPVSCFKHLVQ--ANIRNKKVFKEDVQGMVAKGT 335 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD + A+ L + K I+ TDG + + Sbjct: 336 TDYKAGFEYAFDQLQNSNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 381 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 +K +++ T S+ L+ ++ Y++ + + ++ Sbjct: 382 NWPNKT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 425 >gi|92113788|ref|YP_573716.1| membrane protein-like protein [Chromohalobacter salexigens DSM 3043] gi|91796878|gb|ABE59017.1| membrane protein-like protein [Chromohalobacter salexigens DSM 3043] Length = 604 Score = 43.3 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 13/110 (11%), Positives = 35/110 (31%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 +++ ++ + +L + + V +Y+K ++ + A + A+ + Sbjct: 10 ARAQRGAIGLLAMGVLGLVILCLVLALDVGRLYYEKARLQKLADTAAMEAAAALARTQDM 69 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 R + +N A+ A + I V + V Sbjct: 70 STLREMAAANLARSEGDGAALTLQRGTIAVDDGINVFVAEEAPTSDAVRV 119 >gi|332828717|gb|EGK01409.1| hypothetical protein HMPREF9455_02242 [Dysgonomonas gadei ATCC BAA-286] Length = 340 Score = 42.9 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 21/228 (9%), Positives = 67/228 (29%), Gaps = 59/228 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ + +D K +++A K L +D + D + ++ + + + Sbjct: 97 DVSNSMMAEDIKPSRLAKAKQMLTRIIDE-------RRDDKVAIVVFAGEAFIQLPLTPD 149 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + ++ + ++ T A+ + + K Sbjct: 150 NQSAKLFLETIDPSLVPVQGTAIGSAIDISMSCFS------------------NDTDIDK 191 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------ASPNGQRL----- 378 I+ +TDGE + + + + + + + I SP + Sbjct: 192 AIVLITDGEGHEGDAEGAAARA----ADQGVHVNVVGIGTTEGAMIPVSPGSNNMKRDTQ 247 Query: 379 -------------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + + + + N+ +++ + + Sbjct: 248 GQPVVTKLNEEMCRQIAKAGKGLYAHADNS---NSALKSLQSELEKLQ 292 >gi|257083477|ref|ZP_05577838.1| predicted protein [Enterococcus faecalis Fly1] gi|256991507|gb|EEU78809.1| predicted protein [Enterococcus faecalis Fly1] Length = 711 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 129 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 188 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 189 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 244 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 303 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 304 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 342 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 343 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 383 >gi|241207151|ref|YP_002978247.1| hypothetical protein Rleg_4470 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861041|gb|ACS58708.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 193 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 42/154 (27%), Gaps = 16/154 (10%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ + +I FA+ ++L + Sbjct: 5 NPFTRLVLTVRRLARDRRGAGAIEFAILFPVLVMLYIGAFEITVG--------------- 49 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L+ + ++ + D + K AL + + + + I Sbjct: 50 LSVSKRVTRAAGTVADLVTQQQSVTKSALAQMPSVATAIFVPYNSTSLTLKITGITIDAG 109 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 N+ + + + + + NT N+ Sbjct: 110 ANAK-VLWSWAKDGTVPYAKNTTVSNVPADMKTA 142 >gi|241662158|ref|YP_002980518.1| hypothetical protein Rpic12D_0541 [Ralstonia pickettii 12D] gi|240864185|gb|ACS61846.1| putative transmembrane protein [Ralstonia pickettii 12D] Length = 344 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 13/171 (7%), Positives = 43/171 (25%), Gaps = 5/171 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ ++ + ++ L + I + + +N +++A + A+ + Sbjct: 10 ARRQRGVVGVMAPVLMLVILSIGAIAIDLGYLYVIRNELQNAAD-----AAALAGAAGLY 64 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + SN + + S + + T + + I+ Sbjct: 65 PASPKPNWSNGVSQGTSAIKLNASSGATLKDGSVQAGYWNLTGSPAGLQAQTITPGANDV 124 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + + + F + + V P Sbjct: 125 PAVQVTISRSSGKNGGPVGTWLASIFNGAAASVQTTAVAVIAAPGTANAGA 175 >gi|170700849|ref|ZP_02891838.1| TadE family protein [Burkholderia ambifaria IOP40-10] gi|170134257|gb|EDT02596.1| TadE family protein [Burkholderia ambifaria IOP40-10] Length = 177 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 39/151 (25%), Gaps = 6/151 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + ++ FAL +M ++L + Y+ ++ A A Sbjct: 1 MKRLPIRRSRM----RGVAAVEFALVLMPMIVLATGVAEFGRAIYQYETLTKATRNAARY 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + ++ + + + + + + A + Sbjct: 57 LSVFLPNDSAYPLAAAQCLVVYGSTTCGSAGTELVPGLTTSMVIVCDATHST--DCADAS 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +++ + + Sbjct: 115 DPSQFANLPTYDSTNNAPSGTATGSINLVEV 145 >gi|209546010|ref|YP_002277900.1| hypothetical protein Rleg2_5625 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538867|gb|ACI58800.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 546 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 21/53 (39%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 K+ E+ S I ++ L+I +++ Y K + +AA L Sbjct: 5 LIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|187736264|ref|YP_001878376.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426316|gb|ACD05595.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 754 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 26/227 (11%), Positives = 72/227 (31%), Gaps = 60/227 (26%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D T++ K A LD++ + G+I ++ + + Sbjct: 96 DCSRSMLSKDASPTRLGRAKTAAYDLLDALPGDN-------FGIIIFSGDAVLLMPLTHD 148 Query: 273 TEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 +++ + + + T+ ++ A Q DK+ + Sbjct: 149 HNALKETIEQLQFGWVSQGGTNLENVVRLALQTFKRDKEAD----------------AKN 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKI--------------------------- 364 ++ L+DGE+ T K + A+++ + I Sbjct: 193 ALVILSDGEDTV----NITYKTAEAARQHQLIIVTAGIGTTIGTTIPDEQSPSGLYRDRR 248 Query: 365 --VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + + N + L + + + + +L ++I+ + Sbjct: 249 GQHVV---SKLNPESLQYLARQTEGQYVQLSDGAALNRFVKDIADRL 292 >gi|148555257|ref|YP_001262839.1| TadE family protein [Sphingomonas wittichii RW1] gi|148500447|gb|ABQ68701.1| TadE family protein [Sphingomonas wittichii RW1] Length = 135 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 9/126 (7%), Positives = 31/126 (24%), Gaps = 4/126 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME----SANNAAILAGAS 66 + + + ++ FAL +FL+ + I + +++ ++ A L Sbjct: 1 MTRLLRDARGVTAVEFALVAPAFLMFMFLTIDGARMAWTYQTLQEVAVNSARCAALGVTG 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + + K + + + + Sbjct: 61 CKTAGEVQSYAVARAAAGGVKLTAAAVTLTPAATCSSVAGMTRVTIASTYQGASTKLLPS 120 Query: 127 ISMTHM 132 Sbjct: 121 KLTALS 126 >gi|170732854|ref|YP_001764801.1| membrane protein [Burkholderia cenocepacia MC0-3] gi|169816096|gb|ACA90679.1| membrane protein [Burkholderia cenocepacia MC0-3] Length = 608 Score = 42.9 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 15/148 (10%), Positives = 46/148 (31%), Gaps = 12/148 (8%) Query: 13 KGIASEKANFSIIFALS-VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + ++ FS++ ++ +++ L + + +++ ++ + A LA +M Sbjct: 11 RRASRQRGAFSVMAIIATLIAITTLGAIGV--GNLFFQRRDVQRIADMAALAAVQRMDDA 68 Query: 72 LSRLGDRFE--SISNHAKRALIDDAK-------RFIKNHIKESLSGYSAVFYNTEIQNIV 122 S+ + SN + D + + S + +V Sbjct: 69 CSQPTATATSNAQSNGLDASNGDTINIECGRWDTSVNPAPSYYAAATSGTTQLNAAKVVV 128 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDV 150 + + + N+D Sbjct: 129 TRQVPFFFVGPPQTVSAVSTARSTNIDT 156 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 48/168 (28%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K ++ LD++ +V + + ++ + + + Sbjct: 360 KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQ--ANVRNKKVFKEAVQGMVAKGT 417 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T + A+ L + K I+ TDG + + Sbjct: 418 TGYKAGFEYAFDQLQNPNITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 463 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-EYHYNVVNADSL 398 ++ +++ T S+ L+ + Y++ + + ++ Sbjct: 464 NWPNRT----VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 507 >gi|172065276|ref|YP_001815988.1| TadE family protein [Burkholderia ambifaria MC40-6] gi|171997518|gb|ACB68435.1| TadE family protein [Burkholderia ambifaria MC40-6] Length = 177 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 39/151 (25%), Gaps = 6/151 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + ++ FAL +M ++L + Y+ ++ A A Sbjct: 1 MKRLPIRRSRM----RGVAAVEFALVLMPMIVLATGVAEFGRAIYQYETLTKATRNAARY 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + ++ + + + + + + A + Sbjct: 57 LSVFLPNDSAYPLAAAQCLVVYGSTTCGSAGTELVPGLTTSMVIVCDATHS--SDCADAS 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +++ + + Sbjct: 115 DPSQFANLPTYDSTNNAPSGTATGSINLVEV 145 >gi|114579064|ref|XP_515640.2| PREDICTED: hypothetical protein LOC459430 isoform 2 [Pan troglodytes] Length = 1294 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + ++Q + Sbjct: 521 SKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA---EVNEDNLKQAQSWIRH 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETRAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKLFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|114579066|ref|XP_001156305.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 1059 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + ++Q + Sbjct: 521 SKLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA---EVNEDNLKQAQSWIRH 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 IKIGSSTNTLSALKTAFA-----------------------DKETRAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPETVIDQVKLFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|310830437|ref|YP_003965538.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] gi|309249904|gb|ADO59470.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] Length = 338 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 49/135 (36%), Gaps = 16/135 (11%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFL-LLIGFLIYVLDWHYKKNSMESANNAAI 61 + R + I EK S+ +V L I I + + K+ ++ N+AA Sbjct: 1 MFKRIKK-----IRDEKGFVSVFVIFAVSFILPFFIFHTIEMTYLYGMKDKFQNFNDAAA 55 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 AGA ++ + +S + D+ K +K + E+ G A E I Sbjct: 56 SAGAMQL---------EKKVVSKGDLKFHEDEVKNVVKRILSENY-GLDAHLNPQENSYI 105 Query: 122 VNSSRISMTHMANNR 136 + + ++ + Sbjct: 106 TDVPNVKVSIVNQES 120 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI+++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIALSTVVNEAKLRLISGDSSS 208 >gi|118081932|ref|XP_414993.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 360 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 45/160 (28%), Gaps = 21/160 (13%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 + M + Y + +IE + V S+ A+ A ++ Sbjct: 15 IRMAVALYGEKPRMSIELTDYVTIEEILVAIQEISIKGSSLKVGSALAFAAHAMSQ---- 70 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 K ++ +T G KS+ ++ + + + I Sbjct: 71 ----------PATLRDNAAKVVVLITSG-----KSSDLVEDKAQVLQDAGVTVFAVGI-K 114 Query: 372 SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMV 410 + L K + E+ V + L +S+ + Sbjct: 115 DADKHELNKIASEPTAEHVIYVDDFHLLHSAAPKLSRRLC 154 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI+++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIALSTVVNEAKLRLISGDSSS 208 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHVKE---DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D I L+ + D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 LI + T S A+ A ++L + ++ K ++ +TDG ++ Sbjct: 124 MN--LIGQGTFSYYAISNATRLLKREGRKDGV----------------KVVLLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ + I +TI+++ N +L S Sbjct: 166 PKNPD--VQSISEDARISGISFITIALSTVVNEAKLRLISGDSSS 208 >gi|327330197|gb|EGE71946.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL097PA1] Length = 322 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLNGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAKSWTADSADKLQEVYQQVHSSVG 287 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 34/89 (38%), Gaps = 7/89 (7%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN---ASPNGQRLLKTC 382 K +I +TDGE + + I + + I I + + ++ L + Sbjct: 250 RPGASKVLIVITDGEK-SGDPLQYSDVIPEAERA-GIIRFAIGVGKAFSGGTAKQELISI 307 Query: 383 VSSP--EYHYNVVNADSLIHVFQNISQLM 409 S P ++ + V N D+L + + + Sbjct: 308 ASQPEDDHVFPVDNFDALKDIQNKLQDKI 336 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 40/177 (22%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + + T Sbjct: 68 LSDFPVVPTATRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLSREIPAIT--YRGGGT 125 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 126 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 164 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQ 407 + ++ ++I T I R L S+P E+ Y + + + F+ +++ Sbjct: 165 VAASLRDFGVEIFTFGIWQGN--IRELNDMASTPKEEHCYLLHSFEE----FEALAR 215 >gi|168699404|ref|ZP_02731681.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 367 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 53/181 (29%), Gaps = 22/181 (12%) Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 +V+DLS SM +DP +++ A ++ L L + Sbjct: 112 DTVIVIDLSRSMLAEDMADP-----------GAKSRWEAARSGALDLLA----AMERRGG 156 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ + R + + + R + P + + R Sbjct: 157 HRVGVVLFAARPKLVCPLTTDYKHARAVLRAVNGRFPPPECRPGP------EADATSGTR 210 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 P + I+ ++DG++ + A+ + + T+ + Sbjct: 211 FGAALVAAVAAHDPRFVGAQDIVLISDGDDPEESDREWVRG-ANAARTANVPVHTVGVGN 269 Query: 372 S 372 Sbjct: 270 P 270 >gi|50842461|ref|YP_055688.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427106|ref|ZP_06428822.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130538|ref|YP_003581201.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840063|gb|AAT82730.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159575|gb|EFD07763.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375881|gb|ADD99735.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|313764513|gb|EFS35877.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772104|gb|EFS38070.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313792200|gb|EFS40301.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|313801849|gb|EFS43083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807458|gb|EFS45945.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809968|gb|EFS47689.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313813000|gb|EFS50714.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816054|gb|EFS53768.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818503|gb|EFS56217.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820269|gb|EFS57983.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822922|gb|EFS60636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825146|gb|EFS62860.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827717|gb|EFS65431.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830297|gb|EFS68011.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833671|gb|EFS71385.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|313838673|gb|EFS76387.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] gi|314915507|gb|EFS79338.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314918208|gb|EFS82039.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] gi|314920023|gb|EFS83854.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925156|gb|EFS88987.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932037|gb|EFS95868.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955907|gb|EFT00307.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958219|gb|EFT02322.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960060|gb|EFT04162.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962859|gb|EFT06959.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967773|gb|EFT11872.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973302|gb|EFT17398.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975980|gb|EFT20075.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978483|gb|EFT22577.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314984001|gb|EFT28093.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] gi|314989988|gb|EFT34079.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078074|gb|EFT50125.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315080702|gb|EFT52678.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] gi|315084374|gb|EFT56350.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085715|gb|EFT57691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088865|gb|EFT60841.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096217|gb|EFT68193.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098475|gb|EFT70451.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101165|gb|EFT73141.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315108386|gb|EFT80362.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326129|gb|EGE67919.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA2] gi|327331996|gb|EGE73733.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA3] gi|327443198|gb|EGE89852.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445983|gb|EGE92637.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448037|gb|EGE94691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450841|gb|EGE97495.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453082|gb|EGE99736.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453813|gb|EGF00468.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753529|gb|EGF67145.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754260|gb|EGF67876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754489|gb|EGF68105.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760649|gb|EGF74216.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL099PA1] gi|332675378|gb|AEE72194.1| hypothetical protein PAZ_c10190 [Propionibacterium acnes 266] Length = 322 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLNGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAKSWTADSADKLQEVYQQVHSSVG 287 >gi|115361036|ref|YP_778173.1| TadE family protein [Burkholderia ambifaria AMMD] gi|115286364|gb|ABI91839.1| TadE family protein [Burkholderia ambifaria AMMD] Length = 177 Score = 42.9 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 39/151 (25%), Gaps = 6/151 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R + + ++ FAL +M ++L + Y+ ++ A A Sbjct: 1 MKRLPIRRSRM----RGVAAVEFALVLMPMIVLATGVAEFGRAIYQYETLTKATRNAARY 56 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + ++ + + + + + + A + Sbjct: 57 LSVFLPNDSAYPLAAAQCLVVYGSTTCGSAGTELVPGLTTSMVIVCDATHST--DCGDAS 114 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + +++ + + Sbjct: 115 DPSQFANLPTYDSTNNAPSGTATGSINLVEV 145 >gi|257886572|ref|ZP_05666225.1| predicted protein [Enterococcus faecium 1,141,733] gi|257822626|gb|EEV49558.1| predicted protein [Enterococcus faecium 1,141,733] Length = 689 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 34/372 (9%), Positives = 88/372 (23%), Gaps = 45/372 (12%) Query: 49 KKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI--KNHIKESL 106 K ++ A +L +V+ E+ + H + + + Sbjct: 6 TKTNL-----AGMLMIVFSLVAMFFGGDTFAETNAFHYQNNAEGEFLTNGVKEGDTYNYD 60 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMA------NNRLDSSNNTIFYNMDVMTSYDYRLQF 160 G + E I + + D + + + + Sbjct: 61 YGNAPEANFDEENYINYENSAFVKKSVKKAEGIQGLFDVTLAIKGNQLKKPIDLVMVIDY 120 Query: 161 IEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + ++ + + +E G I ++ + + + + + Sbjct: 121 SSSMTGEKLSNALKGLQEFGEELDDSLESGNIRIGIVAYNRFVYSTDDFLTDINQLEYFL 180 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 + T M L+ K V++G Sbjct: 181 RNTAESHTGTFMQKGLLEGQSLLEEKSRPEAEKMLVHIG----------------DDSAN 224 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 R Y+ ++ + + Sbjct: 225 RSYLPKENAQVFHNSGEIVD-----------YNGYHTDQYVTEFQTDSEKYQTSGSSTDP 273 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV-VNA 395 ++ N T+ K IK +++ S G+ + + SSP + ++ N Sbjct: 274 NAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSIDENL 333 Query: 396 DSLIHVFQNISQ 407 L + + I+ Sbjct: 334 TGLGNALKEIAN 345 >gi|241191500|ref|YP_002968894.1| hypothetical protein Balac_1485 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241196905|ref|YP_002970460.1| hypothetical protein Balat_1485 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|240249892|gb|ACS46832.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis Bl-04] gi|240251459|gb|ACS48398.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295794492|gb|ADG34027.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis V9] Length = 671 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 68/233 (29%), Gaps = 66/233 (28%) Query: 228 MAALKNALLLFLDSIDLLSH-----------------VKEDVYMGLIGYT------TRVE 264 M ALK+A+ FLD ++ + K +G Y + Sbjct: 1 MDALKDAVTYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQ 60 Query: 265 KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 +W E +++ T + ++ A + L S Sbjct: 61 TVHSLAWTPEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSG---------------- 104 Query: 325 PSLPFQKFIIFLTDGENNNFKSN-----VNTIKICDKAKENFIKIVTISINASPN----- 374 QK +F +DG + N IK + K + ++++I + Sbjct: 105 -RPGAQKLTVFYSDGSPTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTD 163 Query: 375 -GQRLLKTCVSS--------------PEYHYNVVNA-DSLIHVFQNISQLMVH 411 + + S+ +Y V+A L +F+ I ++ Sbjct: 164 NANKFMNYVSSNYPKAQSMSEPHDRVEGTYYYAVSARTDLQTIFKEIISIVTS 216 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 11/85 (12%), Positives = 31/85 (36%), Gaps = 1/85 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + + + A+++ +++ + Y K ++ A + A LAGA ++ S Sbjct: 15 RQIRGSILVNTAIALS-LIVITLIGTELGYLFYMKRELQKATDLAALAGAKEISYAGSCP 73 Query: 76 GDRFESISNHAKRALIDDAKRFIKN 100 + + + D + + Sbjct: 74 SAKTAAKLSANGTGSTDRNRNLPIS 98 >gi|224456528|ref|ZP_03665001.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis MA00-2987] gi|254370859|ref|ZP_04986864.1| hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874285|ref|ZP_05246995.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|151569102|gb|EDN34756.1| hypothetical protein FTBG_00674 [Francisella tularensis subsp. tularensis FSC033] gi|254840284|gb|EET18720.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|282158590|gb|ADA77981.1| hypothetical protein NE061598_01655 [Francisella tularensis subsp. tularensis NE061598] Length = 332 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 56/190 (29%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD I + + IK + I + + + Sbjct: 194 -IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|5103|emb|CAA77507.1| ste11 [Schizosaccharomyces pombe] Length = 468 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 31/358 (8%), Positives = 87/358 (24%), Gaps = 32/358 (8%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A++ +L ++ L + AL+ A + K+ K +L S + Sbjct: 105 ASDYVVLQQIAQSSKTLKQTEPEKPVNEEETLAALLAPALSYPKS-GKSNLIETSELSCL 163 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + I + + S++ S YR + + + + + Sbjct: 164 SSSPMIRSHTIPSLSFTDQVSTTISTLDKSEQAPSSLGIYYRSPSSGSPIGRTKSVCLAN 223 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + M + + S+ + P++ ++ + K + Sbjct: 224 KARIVPKRSMSSDGCVDKSYQMSKTPSLEANL---PQNSSNCSARRVPKFDSKGTVSEQS 280 Query: 236 LLFLDSIDLLSHVKEDVYMGL-----IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + + + + + + Sbjct: 281 NSDSPELSADKVLSHCSPIDARPSTPSCPNASISPKTPNTGDHYGFDGAEYLGTPLSVGS 340 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + ++L++ + + R + + NN + Sbjct: 341 TTAYLYGQET--ELLSTPYCHTSYPAMSRLNSSSGYTCVSSSSVTNSGHTENNTWRSDEQ 398 Query: 351 IKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K + + Q L S + ++ L +F I+ Sbjct: 399 SK---------------GFVDINSFSQSLF-----SNGNYEFAAHSQELDDLFSQITD 436 >gi|89255638|ref|YP_512999.1| TPR repeat-containing protein [Francisella tularensis subsp. holarctica LVS] gi|115314142|ref|YP_762865.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|167009920|ref|ZP_02274851.1| hypothetical protein Ftulh_04147 [Francisella tularensis subsp. holarctica FSC200] gi|169656499|ref|YP_001427653.2| hypothetical protein FTA_0220 [Francisella tularensis subsp. holarctica FTNF002-00] gi|254367032|ref|ZP_04983068.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|254368551|ref|ZP_04984567.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|290953464|ref|ZP_06558085.1| hypothetical protein FtulhU_03740 [Francisella tularensis subsp. holarctica URFT1] gi|295313262|ref|ZP_06803899.1| hypothetical protein FtulhU_03725 [Francisella tularensis subsp. holarctica URFT1] gi|89143469|emb|CAJ78645.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica LVS] gi|115129041|gb|ABI82228.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|134252858|gb|EBA51952.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|157121454|gb|EDO65645.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|164551567|gb|ABU60697.2| hypothetical protein with von Willebrand factor type A domain [Francisella tularensis subsp. holarctica FTNF002-00] Length = 332 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 56/190 (29%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVINSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD I + + IK + I + + + Sbjct: 194 -IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|295395230|ref|ZP_06805438.1| conserved hypothetical protein [Brevibacterium mcbrellneri ATCC 49030] gi|294971992|gb|EFG47859.1| conserved hypothetical protein [Brevibacterium mcbrellneri ATCC 49030] Length = 1021 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 35/321 (10%), Positives = 80/321 (24%), Gaps = 60/321 (18%) Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + + + +P Sbjct: 15 FFATFFLLAQSLAGFSTPVHAQPSEDPSQ--DSGDSQQSGGSDLGSAKPLPEQFSACAAA 72 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNS-APICQDKKRTKMAALKNALLLFLDSIDLLS 246 + ++VD SGS+ + ++ + + + K TK L A+ F + Sbjct: 73 GGNIDVLILVDESGSLVDSDPNNARVTSGLHLVSRMSKLTKSGNLSVAVSGFGHEYSTVR 132 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +E G E +R + + TD AM +A + L Sbjct: 133 DWQEVKNDG----------------DVESLRGAINDLSNRTNGIDTDYWTAMDEARKQLA 176 Query: 307 SDKKRSFFTN---------------------FFRQGVKIPSLPFQKFIIFLTDG----EN 341 + + DG E Sbjct: 177 NRAHERGSDAKSCQAIIWFSDGELDYEVRKGNMASKYGETKPFAPDISLKTEDGAKKVEE 236 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSS--------------- 385 + + + D+ + + + + + + + L+++ + Sbjct: 237 KAREDICRSGGLADQLRSSRVAMFGVGLGSKDGKEFDLMRSIATGKDSGGKSCGDLTDPV 296 Query: 386 PEYHYNVVNADSLIHVFQNIS 406 P Y + DSL+ F +IS Sbjct: 297 PGEFYLASDIDSLLIAFDSIS 317 >gi|270159275|ref|ZP_06187931.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|289165901|ref|YP_003456039.1| type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] gi|269987614|gb|EEZ93869.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|288859074|emb|CBJ13003.1| putative type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] Length = 1159 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 35/352 (9%), Positives = 80/352 (22%), Gaps = 26/352 (7%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS-SRISMTHMANNRLDS 139 + + + + S + + + + Sbjct: 183 AGNTYVNNPCYNYTSASSTIQSNCSSIASFYGSSVLSSNAYMQIGATSDNPAINDVLYAG 242 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS---FIPALLRIEMGERPIFLIELV 196 S + S + N ++S +P + + Sbjct: 243 SGFPGVFLTYSGPSPATPYPPNFSISNYNNGNVLISYSKSLPNIGGFGTSPTNAGFVPYS 302 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-LSHVKEDVYMG 255 + S S + S + ++ + L Sbjct: 303 PQVLYSQRGFGYSGTQSATSGNVLVPMTSAGTNPTTTSINNAIGIFTPYLQPETNSTSTK 362 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I + + Y+T T ++ Y IL SD + Sbjct: 363 EIKASAVQSPVAGL---MTTAKTYLTSL------GNTSGNGCPQKKYVILISDGLPTQDL 413 Query: 316 NFFRQG--VKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 N + + F DG ++ N ++ + I K+ I I + A Sbjct: 414 NGKLWPPLGSAAATGYGITATFNPDGSLQSTNCQALTDAISAITALKQQGIPTYIIGLGA 473 Query: 372 SPN------GQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKYS 415 N L + +Y N+ L++ I + + ++ Sbjct: 474 GVNPSLNPQAAASLTAMAVAGGTTNYYPATNSTDLVNDLNAIMISVQNGSFT 525 >gi|254485826|ref|ZP_05099031.1| TadE-like protein [Roseobacter sp. GAI101] gi|214042695|gb|EEB83333.1| TadE-like protein [Roseobacter sp. GAI101] Length = 143 Score = 42.9 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 38/128 (29%), Gaps = 1/128 (0%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAGASK 67 + +I F L L+ ++ + + +S+ A AA + A Sbjct: 7 RHLNNFRRDTSGAVAIEFVLIAPLLFALLFGIMTIGYYVGVSHSVSQLATGAARASVAGL 66 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L + + S ++ L A + G + +++ + Sbjct: 67 DMQERVELAEAYLSRASVNYPLLTQSAVTPDIRTETTNPPGITIRVTYAVDGTVLDLANS 126 Query: 128 SMTHMANN 135 + ++ Sbjct: 127 LLKLNISD 134 >gi|329847247|ref|ZP_08262275.1| tadE-like family protein [Asticcacaulis biprosthecum C19] gi|328842310|gb|EGF91879.1| tadE-like family protein [Asticcacaulis biprosthecum C19] Length = 177 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 44/181 (24%), Gaps = 11/181 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + K ++ FAL + LI I + S+++A + A + Sbjct: 6 VRNRLQSFCRDHKGAAAVEFALIAGPLVFLICACIELALVILLSVSLDNATDVASRQIRT 65 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + K+ + D + + + +EI Sbjct: 66 GIATTGNT-------SLTVFKQKVCDKMGWLSGSCMSSLKIDVTTYNNFSEIPTTDLIKD 118 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 N + + + + F L + + Sbjct: 119 GEFDESKFNF-NIGGASKIQLVRAYYEWPL---FTPFLNAGLTTLSNQDAVITSKVVFRN 174 Query: 187 E 187 E Sbjct: 175 E 175 >gi|312136517|ref|YP_004003854.1| magnesium chelatase [Methanothermus fervidus DSM 2088] gi|311224236|gb|ADP77092.1| Magnesium chelatase [Methanothermus fervidus DSM 2088] Length = 269 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 61/189 (32%), Gaps = 26/189 (13%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + L I L + +D + + + + + + T Sbjct: 94 VKGVLKRIMLDAQRHKDKLAIIGFKGREARVILPSTRRIFSFKDKIENIS---VGGTTPM 150 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 +K+A +IL K S + ++ L+DG N N T + Sbjct: 151 AHGLKKAIEILK--------------REKKKSNEYVPILVLLSDGMPNIALKNSPTRDVL 196 Query: 355 DKAKENF-IKIVTISINASPN----GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + AKE I T+ IN + ++S +Y V N ++ + + ++ Sbjct: 197 ELAKELNRSDIHTVIINFEKRIMRGRNFNFELALTSGGKYYEVGN----RNIVETVGSII 252 Query: 410 VHRKYSVIL 418 H + S+ Sbjct: 253 KHERNSLFY 261 >gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS185] gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS185] Length = 772 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 38/252 (15%), Positives = 75/252 (29%), Gaps = 44/252 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 N +V +P + + LV+D SGSM Sbjct: 366 GMNEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMA-------------------- 405 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KNALL L + + + + + + + Sbjct: 406 GDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSLLS----ATPLPATSSNLSRARQFVS 461 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ A+ A V ++ + +IF+TDG N Sbjct: 462 RLQADGGTEMALALDAAL-------------PKSLGSVSPDAVQPLRQVIFMTDGSVGNE 508 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ + I+ + E+ ++ T+ I ++PN + + + D + Sbjct: 509 QALFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVDEVD---AK 561 Query: 405 ISQLMVHRKYSV 416 IS L+ +Y V Sbjct: 562 ISALLSKIQYPV 573 >gi|17567355|ref|NP_508551.1| hypothetical protein F28B4.3 [Caenorhabditis elegans] gi|1125759|gb|AAA83584.1| Hypothetical protein F28B4.3 [Caenorhabditis elegans] Length = 2229 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 24/255 (9%), Positives = 69/255 (27%), Gaps = 9/255 (3%) Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + + Q + + G + + + + Sbjct: 1970 MVPVTGSMA--PVTQATQQTQGPVTQQTQGPITQATQPPQTVQTQAPVTPTQNPQTGLQF 2027 Query: 217 APICQDKKRTKMAALKNALLLFLDSID-LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK 275 + ++ F+ ++ ++GL+ + P Sbjct: 2028 DIVFLIDGSQAAQQNFDSFTKFIQTMMVSFDVGIAGAHVGLVVVAADLNDQAPPVANLNA 2087 Query: 276 VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 + + + S + D A Q+LT + + T++ + I++ Sbjct: 2088 ITSQ--QMLISYLNGLKDGYTDFDDAGQVLTYNLQVVSSTDYMAATAGYRAGISNHVIVY 2145 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 +T + T + I+T+ + + + L+T + + Sbjct: 2146 IT---STTSFFTDPTPSAKTIIAQKKYGIITVGYGGAVDTGK-LQTISGGSACSFTATDF 2201 Query: 396 DSLIHVFQNISQLMV 410 +L + + I QL+ Sbjct: 2202 TTLNNQIKPIQQLIT 2216 >gi|314923048|gb|EFS86879.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966820|gb|EFT10919.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093260|gb|EFT65236.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103482|gb|EFT75458.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327646|gb|EGE69422.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL103PA1] Length = 322 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAQSWTADSADKLQEVYQQVHSSVG 287 >gi|31615654|pdb|1NA5|A Chain A, Integrin Alpha M I Domain Length = 197 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 65/206 (31%), Gaps = 25/206 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P++ + D + + + F+ ++ K L+ Y+ + Sbjct: 2 PQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEEFRIHFTFK 60 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 R V L T + +++ + L + + F Sbjct: 61 EFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF----------- 107 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSS 385 K ++ +TDGE + + +A + I + S ++ L T S Sbjct: 108 --KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASK 163 Query: 386 P--EYHYNVVNADSLIHVFQNISQLM 409 P ++ + V N ++L + + + + Sbjct: 164 PPRDHVFQVNNFEALKTIQNQLREKI 189 >gi|31615583|pdb|1MF7|A Chain A, Integrin Alpha M I Domain Length = 194 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 65/206 (31%), Gaps = 25/206 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P++ + D + + + F+ ++ K L+ Y+ + Sbjct: 2 PQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEEFRIHFTFK 60 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 R V L T + +++ + L + + F Sbjct: 61 EFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF----------- 107 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSS 385 K ++ +TDGE + + +A + I + S ++ L T S Sbjct: 108 --KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASK 163 Query: 386 P--EYHYNVVNADSLIHVFQNISQLM 409 P ++ + V N ++L + + + + Sbjct: 164 PPRDHVFQVNNFEALKTIQNQLREKI 189 >gi|157831557|pdb|1JLM|A Chain A, I-Domain From Integrin Cr3, Mn2+ Bound Length = 192 Score = 42.9 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 65/206 (31%), Gaps = 25/206 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P++ + D + + + F+ ++ K L+ Y+ + Sbjct: 3 PQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEEFRIHFTFK 61 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 R V L T + +++ + L + + F Sbjct: 62 EFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF----------- 108 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSS 385 K ++ +TDGE + + +A + I + S ++ L T S Sbjct: 109 --KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASK 164 Query: 386 P--EYHYNVVNADSLIHVFQNISQLM 409 P ++ + V N ++L + + + + Sbjct: 165 PPRDHVFQVNNFEALKTIQNQLREKI 190 >gi|269966895|ref|ZP_06180968.1| hypothetical protein VMC_23980 [Vibrio alginolyticus 40B] gi|269828562|gb|EEZ82823.1| hypothetical protein VMC_23980 [Vibrio alginolyticus 40B] Length = 142 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 41/145 (28%), Gaps = 12/145 (8%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS- 73 +A++K I F L++ L+L +I V + + A L Sbjct: 1 MANQKGLAIIEFILALPVLLMLTVLVIDVSRAFIQYTEVNKALQNGARYAVVDTYGTLDF 60 Query: 74 ----RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 D + + +I+ ++ + I N I Sbjct: 61 EGIAEESDIKNVVIYGTPSGVGTPVIDYIEA-ADIIVTQPTDTNKVVTISATYNYEPIF- 118 Query: 130 THMANNRLDSSNNTIFYNMDVMTSY 154 + L + ++ +++ S Sbjct: 119 -----STLPFTGTSLAFSIGSSASM 138 >gi|260799828|ref|XP_002594886.1| hypothetical protein BRAFLDRAFT_124461 [Branchiostoma floridae] gi|229280123|gb|EEN50897.1| hypothetical protein BRAFLDRAFT_124461 [Branchiostoma floridae] Length = 394 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 8/50 (16%), Positives = 16/50 (32%) Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + + D A+++ I + ++ I N LL P Sbjct: 230 RYMRDTPLDDHVAEADAARDDGITLYSVGIGDPVNSAVLLDIAEIPPNVF 279 >gi|293604074|ref|ZP_06686485.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292817556|gb|EFF76626.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 243 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 4/67 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 ++ F F + ++ +I FA++ LL+ I W + + A A A Sbjct: 1 MTAFFFAASR----QRGTAAIEFAVAGAMVLLVGLLGIEAARWQAVRQMAQVALMQAARA 56 Query: 64 GASKMVS 70 G + Sbjct: 57 GITAHAD 63 >gi|149909169|ref|ZP_01897826.1| hypothetical protein PE36_09161 [Moritella sp. PE36] gi|149807693|gb|EDM67639.1| hypothetical protein PE36_09161 [Moritella sp. PE36] Length = 162 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 48/156 (30%), Gaps = 6/156 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + S++ + SI FA++++ F +L+ LI + + + + A+ + A ++V Sbjct: 9 CSQVKRSQRGSVSIEFAVTILPFFVLLLALIEISRFMMVSS----VIDVALTSAARQLVV 64 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++ L + + S S + Sbjct: 65 TSAGEDLTSKLQLTLSELDLPLLDSNKVTVEARYYTSLTSLADGSGLEDFQQQDFAEFTL 124 Query: 131 HMANNRLDSSN--NTIFYNMDVMTSYDYRLQFIEHL 164 L S ++ + L+ + Sbjct: 125 VYPYQALFISGQIDSFSRLTSFKRTILVSLEGSDSY 160 >gi|149046637|gb|EDL99462.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Rattus norvegicus] Length = 939 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|19113448|ref|NP_596656.1| transcription factor Ste11 [Schizosaccharomyces pombe 972h-] gi|12644238|sp|P36631|STE11_SCHPO RecName: Full=Transcription factor ste11 gi|2980826|emb|CAA18162.1| transcription factor Ste11 [Schizosaccharomyces pombe] Length = 468 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 31/358 (8%), Positives = 87/358 (24%), Gaps = 32/358 (8%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A++ +L ++ L + AL+ A + K+ K +L S + Sbjct: 105 ASDYVVLQQIAQSSKTLKQTEPEKPVNEEETLAALLAPALSYPKS-GKSNLIETSELSCL 163 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + I + + S++ S YR + + + + + Sbjct: 164 SSSPMIRSHTIPSLSFTDQVSTTISTLDKSEQAPSSLGIYYRSPSSGSPIGRTKSVCLAN 223 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + + M + + S+ + P++ ++ + K + Sbjct: 224 KARIVPKRSMSSDGCVDKSYQMSKTPSLEANL---PQNSSNCSARRVPKFDSKGTVSEQS 280 Query: 236 LLFLDSIDLLSHVKEDVYMGL-----IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 + + + + + + Sbjct: 281 NSDSPELSADKVLSHCSPIDARPSTPSCPNASISPKTPNTGDHYGFDGAEYLGTPLSVGS 340 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + ++L++ + + R + + NN + Sbjct: 341 TTAYLYGQET--ELLSTPYCHTSYPAMSRLNSSSGYTCVSSSSVTNSGHTENNTWRSDEQ 398 Query: 351 IKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 K + + Q L S + ++ L +F I+ Sbjct: 399 SK---------------GFVDINSFSQSLF-----SNGNYEFAAHSQELDDLFSQITD 436 >gi|330447678|ref|ZP_08311326.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491869|dbj|GAA05823.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 257 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 33/262 (12%), Positives = 77/262 (29%), Gaps = 38/262 (14%) Query: 157 RLQFIEHLLNQRYNQKIVSF--IPALLRIEMGERPIFLIE--LVVDLSGSMHCAMNS--- 209 + + ++ +Q++VS +P + + S S + + Sbjct: 14 VFFINKSMKDKPTSQQLVSAPSLPDVSTGYGFDALKNNWPTLNNTQTSSSDNWLAANYLL 73 Query: 210 --DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN- 266 D C + + K+ A K A+ F++ I +D +GL + Sbjct: 74 IFDGSGSMDNTNCGNG-QRKIVAAKEAMQTFINDI------PQDANVGLYVFDNADSSLR 126 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + G D T ++ Y L ++ + Sbjct: 127 VPL--GINNRATLKQAIYDVKAGGTTPLKSSLTSGYTALEKQAEKQLGYGEY-------- 176 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 ++ +TDG+ + + + I + + + I TI + Sbjct: 177 -----NVVIVTDGDASVGEEPE--VAISRIYQNSPVTIHTIGFCIGNRHALNAEGIT--- 226 Query: 387 EYHYNVVNADSLIHVFQNISQL 408 Y+ + N + L+ Q++ Sbjct: 227 -YYQSANNPEKLLAGLQSVLAE 247 >gi|198417365|gb|ACH87900.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1042 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 32/317 (10%), Positives = 83/317 (26%), Gaps = 56/317 (17%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + + + I ++ + + + + +G+++ Sbjct: 433 MTGKKQPLDILVVVDRSGSMQDG---------IGSIEKYKYWKYKYDEYYHIWRNAGTIY 483 Query: 205 CAMNSDPEDVNSAPICQDKKR--------TKMAALKNAL---LLFLDSIDLLSHVKEDVY 253 P D + + A+K+AL L ++ + Sbjct: 484 FDNYLGPRYQPDTYTYYDYQSKESVPFGIKRDQAVKDALIGSTGLLQKFLDINPQNQLAV 543 Query: 254 MG-----LIGYTTRVEKNIEP-----------SWGTEKVRQYVTRDMDSLIL------KP 291 +G Y + S + ++ + T S Sbjct: 544 VGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYEDRNG 603 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ A+ +A + L +P+ F +G ++ + Sbjct: 604 TNYHAALLKADEKLQKVANNGHRKIMVFISDGVPTFYFGADNYRSGNGTVSDSNIINSQK 663 Query: 352 K---ICDKAKEN--FIKIVTISINASPNGQR------LLKTCVSSPEYHYNVVNADSLIH 400 D+ K + I ++ ++ N +LK S +Y+ + + + L Sbjct: 664 GSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYL-SGDDYYSGITDTEQLEK 722 Query: 401 VFQNISQL--MVHRKYS 415 I + + + S Sbjct: 723 TANKIVEDSKISNLTIS 739 >gi|198436180|ref|XP_002124514.1| PREDICTED: similar to von Willebrand factor A domain containing 3A [Ciona intestinalis] Length = 1107 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 51/208 (24%), Gaps = 54/208 (25%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 G + +++D+SGSM M LK + + Sbjct: 860 GSILESNVVILIDVSGSMSYN---------------------MDELKKEITSLI----WE 894 Query: 246 SHVKEDVYMGLIGYTTRVE-KNIEPS-WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 ++ ++ + V + + A++ A Sbjct: 895 QLNGNKTAFNIVAFSNTSTKWQDSITESNQSACHDAVQWVSALTAHGGSATLKAIQVALA 954 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + I LTDG+ + S T+ + I Sbjct: 955 -----------------------DEEAEAIYLLTDGKPD--SSIKLTLSEASNLNKKNIP 989 Query: 364 IVTISINASPN-GQRLLKTCVS-SPEYH 389 I TIS N LK+ S S Sbjct: 990 IHTISFNCDNREANDFLKSLSSNSGGRF 1017 >gi|227519070|ref|ZP_03949119.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|227073490|gb|EEI11453.1| conserved hypothetical protein [Enterococcus faecalis TX0104] Length = 711 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 129 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 188 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 189 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 244 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 303 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 304 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 342 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 343 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 383 >gi|218779355|ref|YP_002430673.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760739|gb|ACL03205.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 504 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 28/237 (11%), Positives = 69/237 (29%), Gaps = 39/237 (16%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + Q +++ + + LV+D S SM +++ P N+ P Sbjct: 1 MRKGFAWQCLLTIVLFAFIFDPAAAEPPGNMLVLDCSQSMQRTLDTRPGMENARPAALRF 60 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 ++ +N+L + + E + + R ++ Sbjct: 61 VKS---LPQNSLAGIVAYGQNAAKG-----------CDNAEVLVPLAPY--DRRALISAI 104 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN- 342 A+++A++ G+ I +TDG ++ Sbjct: 105 KKVQPQGKAPLAAALRKAWEQ--------------GAGLSQGC-----VITLITDGWDDC 145 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINAS-PNGQRLLKTCVSSPEYHYNVVNADSL 398 + KA+ I + I + + + +L++ +S + L Sbjct: 146 WGDPVSMVEDL--KARGAGIIVNIIGVAPNREDAAKLMRLARASGGAYRAADTRADL 200 >gi|226349296|ref|YP_002776410.1| hypothetical membrane protein [Rhodococcus opacus B4] gi|226245211|dbj|BAH55558.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 153 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 10/151 (6%), Positives = 36/151 (23%), Gaps = 17/151 (11%) Query: 7 FRFYFKKGIASE-------KANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANN 58 +K +AS K +++ + + + G ++ ++ Sbjct: 1 MTNALRKRLASRLPAEHRDKGAVTLMTVVFAPVLIFFVWGLIVDGGGMLTADQRADNVAE 60 Query: 59 AAILAG---------ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 A A A + + + + + + + D + + Sbjct: 61 DAARAAGQQIIGSVSARGIGTVVDPVRATAAAKRYLFEAGVDGDVIPTGPRTLLITTRIV 120 Query: 110 SAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + ++ N + Sbjct: 121 YNSKLLPYPRTKLLIGTAAVNLNRTNAGEVY 151 >gi|241667106|ref|YP_002985190.1| hypothetical protein Rleg_7224 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240862563|gb|ACS60228.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 546 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 21/53 (39%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 K+ E+ S I ++ L+I +++ Y K + +AA L Sbjct: 5 LIKRLHRDERGFLSPIILYMTIALALMIVWILNTGQMIYDKQRTQDTADAAAL 57 >gi|319785614|ref|YP_004145090.1| hypothetical protein Mesci_6033 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171502|gb|ADV15040.1| hypothetical protein Mesci_6033 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 205 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 15/194 (7%), Positives = 52/194 (26%), Gaps = 11/194 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + ++ FAL + L++ + + S + ++ Sbjct: 20 FWSDRRGIAAVEFALIMPILLIMYFLTMEASQAIETSKKVSR--------IGSMVADLVT 71 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + + + + +++ +++ T + + ++ Sbjct: 72 QQTSVLKADVDAIMQIGSVTLQPYNRSNPTITVTAIQVSADATRALVVWSRKLVAGVASP 131 Query: 134 NNRLDSSNNTIFYNMDVM---TSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + L + + +Q+ + A I MGE Sbjct: 132 GAAATTETTIPASLRVANTFLIRVESNLGYTPVIAWSASSQQKLGLTSAFSNITMGETYF 191 Query: 191 FLIELVVDLSGSMH 204 V + S Sbjct: 192 LRPRRSVTIPCSDC 205 >gi|167463706|ref|ZP_02328795.1| hypothetical protein Plarl_14314 [Paenibacillus larvae subsp. larvae BRL-230010] Length = 695 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 8/61 (13%), Positives = 19/61 (31%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + + S+ L ++ LI + E A A + + S ++ Sbjct: 4 FWQNRRGSVSVYIILLLVPVFFFQAVLIDFARVKAAQKESEQALKAGLRSVLSAFQPDVQ 63 Query: 74 R 74 Sbjct: 64 T 64 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 56/181 (30%), Gaps = 26/181 (14%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + V G + Y+ VE + + + + T + A+ Q Sbjct: 12 QNQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYL--GRGTFTDCAISNMTQQ 69 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 S R KF + +TDG ++A++ IK+ Sbjct: 70 FQSQTARDV-----------------KFAVVITDGHVT-GSPCGGMKMQAERARDMGIKL 111 Query: 365 VTISINASPNGQRLLKTCVSSPEYH---YNVVNADSL---IHVFQNISQLMVHRKYSVIL 418 ++ + Q L + + + Y + D+L + + I + M H Y+ Sbjct: 112 FAVAPSEDVYEQGLREIASPPHDLYRSNYTITPKDALHIDENTIERIIKAMKHEAYAECY 171 Query: 419 K 419 K Sbjct: 172 K 172 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 56/182 (30%), Gaps = 33/182 (18%) Query: 190 IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 I V+D S S+ + ++ + + K+ + ++ + Sbjct: 537 ALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGS-----IAKDPKSETGARVGVVQYSH 591 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 E + + E+ S E V++ I T + A++ AY L + Sbjct: 592 EGTFEAIKLDD---ERINSLSSFKEAVKRL------EWIAGGTWTPSALQFAYNKLIKES 642 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG + + N +C + + TI I Sbjct: 643 RREKAQVFA---------------VVITDGRYDPRDDDKNLGALC----GRDVLVNTIGI 683 Query: 370 NA 371 Sbjct: 684 GD 685 >gi|220915780|ref|YP_002491084.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter dehalogenans 2CP-1] gi|219953634|gb|ACL64018.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 1723 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 38/184 (20%), Gaps = 12/184 (6%) Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL--ILKPTDSTP 296 + + I Y + + + G T D T Sbjct: 462 WEKNTSSYESSSNDNQYSICYLCQTSSVVVLTDGEPSPASSSTSPNDGDRLPSGGTTLAD 521 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF-QKFIIFLTDGENNNFKSNVNTIKI-C 354 Y TS + + Q + + + +N + C Sbjct: 522 TTSGKYAGATSTGIKDITSTDCPQCANFSGTAAWKDSLAKVAWYMHNMDLRDNAETTWDC 581 Query: 355 DKAKENFIKI---VTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQL 408 K ++ T+ L + N +L I + Sbjct: 582 K--KNGGKQVLDTYTVGFATSQLPDAKTILHAAAEAGGGKFVPAENPQTLAEGLSYILEK 639 Query: 409 MVHR 412 + R Sbjct: 640 INER 643 >gi|153008591|ref|YP_001369806.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] gi|151560479|gb|ABS13977.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] Length = 165 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 46/149 (30%), Gaps = 14/149 (9%) Query: 5 SRFRFYFKK------GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-AN 57 RF ++ ++ ++ FA+ FLL++ +I + N+++ A Sbjct: 16 DRFSNLHRRIPSLLFFKRNKSGTAAVEFAILAPVFLLILMGMIAFGLYLGVANAVQQLAA 75 Query: 58 NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 +A ++ + + + K A K + + + Sbjct: 76 DA-----TRTALAGIDPPERLTLATTYIQKNAAKYSLIDPAKMQVSVDNAQSDPNQFTVT 130 Query: 118 IQNIVNSSRI--SMTHMANNRLDSSNNTI 144 I+ + I MT + + + Sbjct: 131 IRYNAQNLPIWNLMTGLPLPDTTITRAST 159 >gi|74203017|dbj|BAE26210.1| unnamed protein product [Mus musculus] Length = 791 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 45/155 (29%), Gaps = 21/155 (13%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G + + + + RQ V + ++ K + + A Sbjct: 79 DALDISPGRVRVGALQFGSTPHLEFPL--DSFSTRQEVKESIKGIVFKGGRTETGL--AL 134 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L+ + + +I +TDG++ + + +E I Sbjct: 135 KRLSRGFPGGRNGS------------VPQILIIITDGKSQGPVALP-----AKQLRERGI 177 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + + + L ++ + Sbjct: 178 VVFAVGVRFPRWDELLTLASEPKDQHVLLAEQVED 212 >gi|323498500|ref|ZP_08103493.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] gi|323316389|gb|EGA69407.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] Length = 697 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 73/251 (29%), Gaps = 49/251 (19%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 + + ++F P + + V+D SGSM Sbjct: 288 ASSKRGTVKLTFTPGDDLTRVTQ--GRDWVFVLDKSGSM--------------------- 324 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 K + L + L + + ++ T + V + + Sbjct: 325 SGKYSTLVEGVRQGLGKLPSEDRFRVVMFDSNTYDLTGGFVAV----NAANVTKALQSVE 380 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ M A + L D+ I+ +TDG N Sbjct: 381 QVEPSNGTNLYEGMAAATRKLDDDRPTG--------------------IVLVTDGVANVG 420 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + D +++ +++ T + S N L+ S +V NAD +I + Sbjct: 421 V--TEKRRFFDLMEKHDVRLFTFIMGNSANTPLLVPMTKLSNGIATSVSNADDIIGHLMS 478 Query: 405 ISQLMVHRKYS 415 I+ + ++ Y Sbjct: 479 ITSKLTYQAYR 489 >gi|221126641|ref|XP_002157470.1| PREDICTED: similar to microneme 1 [Hydra magnipapillata] Length = 403 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 20/188 (10%), Positives = 50/188 (26%), Gaps = 24/188 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDM 284 K + D +I ++ + + S W E + ++ Sbjct: 236 NNWKKTIQFVQDFSKEFKFGPTGAQFAVIDFSDKATLQVNISDSRFWSNEGFTKKISNIK 295 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A++ + + K K +I +TDG + Sbjct: 296 YKM--GKTRTDLALRLTRRKVFCRKCGLRV-------------DVSKLVIVVTDGRST-- 338 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 T+ K+ I+ I + + L + + + + + Sbjct: 339 -FPFLTLSEARLIKKQS-SIICIGVGDEVDINELNTI-ATDENHVFLLNGYRYINDNINQ 395 Query: 405 ISQLMVHR 412 I ++ Sbjct: 396 ILKMSCKL 403 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 56/165 (33%), Gaps = 24/165 (14%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V +I Y+ I S W E + V+ T + A+K A + Sbjct: 2 GPTGVRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIE--YSHGKTRTDLALKVARK 59 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK-ENFI 362 + ++ K +I LTDG++ T K EN + Sbjct: 60 HVFCNECSL-------------QHNIPKLLIVLTDGQST---FPKLTQFEAHLIKVENNL 103 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I+++ ++ + + LK+ + ++ + + + L I + Sbjct: 104 TIISVGVSDQVDIEE-LKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|187932171|ref|YP_001892156.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713080|gb|ACD31377.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 332 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 56/190 (29%), Gaps = 41/190 (21%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+ + D +++ K + L I +G+I +++ + Sbjct: 98 DVSQSMDTTDVSPSRLERAKYKIFDILRRIKEGQ-------VGMIVFSSEPFVVSPLTSD 150 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + VT D + ++ + A+K++ Q++ + Sbjct: 151 ANTIENLVTVITSDIVPVQGHNIYKALKKSAQLIEQAGVQQGQ----------------- 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIK--IVTISINASPNGQRLLKTCVSSPEYH 389 II +TD I + + IK + I + + + Sbjct: 194 -IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG--------TPMGGIAKDEKGN 239 Query: 390 YNVVNADSLI 399 Y + ++ Sbjct: 240 YLKDSQGNIQ 249 >gi|169234588|ref|NP_001038425.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Danio rerio] gi|169154233|emb|CAH68946.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 1 (CACNA2D1) [Danio rerio] Length = 1069 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 58/185 (31%), Gaps = 30/185 (16%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVK----EDVYMGLIGYTTRVEKNIEPSWGTE 274 + ++ ++ L+++ +V + + + V+ N+ + Sbjct: 258 ASGSVSGLTLKLIRTSVSEMLETLSDDDYVNIVSFNNSAKSVACFENLVQANV---RNKK 314 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 +++ V + TD K+A+ L S K I+ Sbjct: 315 TLKEAVQKIT---ANGTTDYKIGFKEAFNQLASMNVSRANC--------------NKIIM 357 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 TDG + + + ++I T S+ + ++ + Y+Y + Sbjct: 358 LFTDGGEDKASEIFDEYNS-----DKRVRIFTFSVGQHNYDKAPIQYMACHNKGYYYEIP 412 Query: 394 NADSL 398 + ++ Sbjct: 413 SIGAI 417 >gi|157838288|pdb|1BHQ|1 Chain 1, Mac-1 I Domain Cadmium Complex gi|157838289|pdb|1BHQ|2 Chain 2, Mac-1 I Domain Cadmium Complex Length = 189 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 62/190 (32%), Gaps = 28/190 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +M + ++ L L + + I +T + +N R V Sbjct: 18 FRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNP------NPRSLVKPIT 71 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 L T + +++ + L + + F K ++ +TDGE Sbjct: 72 QLL--GRTHTATGIRKVVRELFNITNGARKNAF-------------KILVVITDGEKFG- 115 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + + +A + I + S ++ L T S P ++ + V N ++L Sbjct: 116 -DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALK 174 Query: 400 HVFQNISQLM 409 + + + + Sbjct: 175 TIQNQLREKI 184 >gi|149046634|gb|EDL99459.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Rattus norvegicus] Length = 927 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|253722212|pdb|1IDN|1 Chain 1, Mac-1 I Domain Metal Free gi|253722213|pdb|1IDN|2 Chain 2, Mac-1 I Domain Metal Free gi|313507148|pdb|1BHO|1 Chain 1, Mac-1 I Domain Magnesium Complex gi|313507149|pdb|1BHO|2 Chain 2, Mac-1 I Domain Magnesium Complex Length = 190 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 62/190 (32%), Gaps = 28/190 (14%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +M + ++ L L + + I +T + +N R V Sbjct: 19 FRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNP------NPRSLVKPIT 72 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 L T + +++ + L + + F K ++ +TDGE Sbjct: 73 QLL--GRTHTATGIRKVVRELFNITNGARKNAF-------------KILVVITDGEKFG- 116 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + + +A + I + S ++ L T S P ++ + V N ++L Sbjct: 117 -DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALK 175 Query: 400 HVFQNISQLM 409 + + + + Sbjct: 176 TIQNQLREKI 185 >gi|283787707|ref|YP_003367572.1| tight adherence protein TadE [Citrobacter rodentium ICC168] gi|282951161|emb|CBG90852.1| putative tight adherence protein TadE [Citrobacter rodentium ICC168] Length = 170 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 28/81 (34%), Gaps = 3/81 (3%) Query: 7 FRFYFKKGIA---SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + +K A E+ ++ AL + +I L + Y +++ A A Sbjct: 8 ITVFRRKLFAGAGDERGAVAVELALVFFPLMFMIFLLFELCRVTYISSALNLATAEASRY 67 Query: 64 GASKMVSNLSRLGDRFESISN 84 A +S+ E + N Sbjct: 68 AAVAKISDKDYERIFREKLQN 88 >gi|260799772|ref|XP_002594858.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] gi|229280095|gb|EEN50869.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] Length = 930 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 37/140 (26%), Gaps = 25/140 (17%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + T + A+ +A + K Sbjct: 247 NMTTTLSAINAIK--YQKGATYTPKALDRARKE-----------------ALWRGEAVPK 287 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 +I L+DG + + + I + I + + LL + Sbjct: 288 VMIVLSDGRSAID-----VTEASKALADAGIIVYAIGVGR-ADHDELLLIANNDLSKVIE 341 Query: 392 VVNADSLIHVFQNISQLMVH 411 + + ++LI +++++ Sbjct: 342 LSDFNALIAEIDLLAEVVCE 361 >gi|77918074|ref|YP_355889.1| hypothetical protein Pcar_0459 [Pelobacter carbinolicus DSM 2380] gi|77544157|gb|ABA87719.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 370 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/230 (10%), Positives = 64/230 (27%), Gaps = 48/230 (20%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 P+ + +T + + F+D + +GL+ +++ + ++ + ++ Sbjct: 101 PVSRFSSQTLGDLALDGIESFIDMRRHEDY------IGLVAFSSYAKLLAPLTFDKDLLK 154 Query: 278 QYVTRDMD---------SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS-- 326 + T+++ A+ + S + + Sbjct: 155 AKLALVRSKNHSRIYRELGAGGGTNASEAVWLSLSAFFSMLPEDNRLSVEQIAGMREFLL 214 Query: 327 -------LPFQKF----------IIFLTDGE-------NNNFKSNVNTIKICDKAKENFI 362 QK +I TDG + N + I K + Sbjct: 215 GEPGALLDIPQKIRNAGLGTGMAVILFTDGRIEPTLRAHVRRGGLPNLVNIITLMKAVGV 274 Query: 363 KIVTISIN--ASPNGQRLLKTCVSSP--EYHYNVV---NADSLIHVFQNI 405 + IS+ Q+ ++ P + + + V++ I Sbjct: 275 RFYIISVGGEVDAAVQKAMEPTAGEPSVGRIFVTSRGLDRQQIEAVYREI 324 >gi|193786519|dbj|BAG51302.1| unnamed protein product [Homo sapiens] Length = 164 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 42/122 (34%), Gaps = 19/122 (15%) Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + A++ + S+ + + + + TDG + + S Sbjct: 4 GTMTGLALRHMVEHSFSEAQGARPRAL----------NVPRVGLVFTDGRSQDDISV--- 50 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQL 408 +AKE I + + + + + L+ S P + + ++ H+ +N+ Sbjct: 51 --WAARAKEEGIVMYAVGVGKAVEAE--LREIASEPAELHVSYAPDFGTMTHLLENLRSS 106 Query: 409 MV 410 + Sbjct: 107 IC 108 >gi|156382093|ref|XP_001632389.1| predicted protein [Nematostella vectensis] gi|156219444|gb|EDO40326.1| predicted protein [Nematostella vectensis] Length = 801 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 52/187 (27%), Gaps = 24/187 (12%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 K F + ++G++ Y+ ++ ++ + + Sbjct: 584 FKKIKNTFTSLLKFFKVNNRKAHLGIVEYSDDARMTKPLDQEYDR-KELIKVIDEIQPSG 642 Query: 291 PT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + AMK+A + T + +P Sbjct: 643 NGFNVAAAMKRASDHAFTIFGGVRQTVPKTFVLFVPRDSTADM---------------AE 687 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 K K + IV +++ N + KT S P + + N F +I+ + Sbjct: 688 VNKQAQALKAMGVHIVLFAVDKVTNLDK-WKTIASQPARRFFIGNT------FDDIAGQI 740 Query: 410 VHRKYSV 416 K ++ Sbjct: 741 YDAKETI 747 >gi|152988351|ref|YP_001350200.1| hypothetical protein PSPA7_4864 [Pseudomonas aeruginosa PA7] gi|150963509|gb|ABR85534.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7] Length = 556 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 38/306 (12%), Positives = 86/306 (28%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQT 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E S Sbjct: 66 SAQATSFAKASASLNGFDANAAGSGLLAEVGGVQSLGGLRRFS----------------- 108 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 + N+ V L + + F A + + + Sbjct: 109 -------SSASNVAVDNEAVRVL-VSKSVPGSLVANLAGLFGGANANVNLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T ++ A D+ + + MG Sbjct: 161 SATISAGTGLASLNSSQSALLNPILSGLLGTDVSLSAAAYNGIADAKLSVLDILGPDGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LIG T + + ++Q + ++ L S A+K + S + Sbjct: 221 LIGVDTSLGTIEQLLNTNVGLQQVLAASVNVLAKSGVASVEALKTQLLGVKSATLKLGDV 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|198430887|ref|XP_002124814.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 400 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 32/350 (9%), Positives = 83/350 (23%), Gaps = 41/350 (11%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 AS + +G +N + +F + + + Sbjct: 54 ASANGTAKLLIGAPNSRRNNFGSSQGVLYGIKFAVASGIYDIVAKDPEKNAPIVVTNLRK 113 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 ++ +N ++ S++ N + + + Sbjct: 114 DADAIGISVSNSKEAVTVCAPMRYRHCRSHNIV---SGSCFNSKDFGESWQSAKPT--VP 168 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + V+D S S+ ++ + A ++ D Sbjct: 169 FCSDQALDLLFVLDGSFSVGETNFGLVKNW---------------VVALAKSFDIEKQDN 213 Query: 245 LSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ + + Y+ + + SL T + A+ + Sbjct: 214 IGIIQYSHWYPGVPYSQQPYMKTEVPLGLYKNFTLFEKIAQNISLQGFTTYTAHALNKTV 273 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + ++ K +I +TDG ++ + + I Sbjct: 274 LDFMASERFM-------------HENVTKVMILITDGRADDAID---LYSSAEYVRSQGI 317 Query: 363 KIVTISINASPNGQRLLKTCVSSPE---YHYNVVNADSLIHVFQNISQLM 409 I I S +L + V SL + + + Sbjct: 318 ITFAIGIGNSVLRDQLQIVANGKLGEDTRVFEVTTFTSLNSILSLLRSSI 367 >gi|157273515|gb|ABV27414.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 327 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 25/211 (11%), Positives = 64/211 (30%), Gaps = 20/211 (9%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 C +++ K A L+FL+ I L + L+ + + + +RQ Sbjct: 90 DCSGSMVSRLPLAKRATLMFLERILRLPQDR----AALLACQQDLLLVQPLTNSLDALRQ 145 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + L P + + + + + ++ ++ ++D Sbjct: 146 ALATLDEQLPSPLGRVVPFEPKQATPPGTALYAALYAAIELLPEERDDAR-RRVVVAISD 204 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------NGQRLLKTCVSSPEY 388 G ++ ++ + A N + + + I N L + C S+ Sbjct: 205 GFDSEGLI--RVGEVIEHAWRNQVSLYALGIGQPELTATDTNQMVNRADLERLCNSTGGQ 262 Query: 389 HYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 + F + ++S +L Sbjct: 263 AFFPRLDREFFTAFAQVDAD---LRHSFVLA 290 >gi|118347184|ref|XP_001007069.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288836|gb|EAR86824.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 821 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 56/186 (30%), Gaps = 32/186 (17%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQ 278 D+ + + A ++ + L Y +I + + + + ++ Sbjct: 343 IDRSGSMCTIFQKARDTLIEFLQRLPDDS---YFNVISFGSGYQFLFEEAKKKNKQSMKS 399 Query: 279 YVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + T+ +++ +Q K + +Q I LT Sbjct: 400 ALEQISKFSADMGGTEIYQPLEKIFQ------------------CKNVNDLYQMQIFLLT 441 Query: 338 DGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 DG+ + + N ++ I + + + Q L + S + V NA Sbjct: 442 DGQV------SQPDMVVQLIRNNSHKARVHCIGLGSGVDKQLLRRCSESGRGANRQVDNA 495 Query: 396 DSLIHV 401 L V Sbjct: 496 SELKEV 501 >gi|149046636|gb|EDL99461.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Rattus norvegicus] Length = 920 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQ---- 370 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 371 -EIFAKYNKDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|196228665|ref|ZP_03127531.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] gi|196226946|gb|EDY21450.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] Length = 921 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 9/85 (10%), Positives = 29/85 (34%), Gaps = 3/85 (3%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + T + ++ +DG++N +S +++ + I T Sbjct: 314 EGEITNLATALKNSASGDTKEQQRGAVVLFSDGQHNEGES---PLEVAKILAARGLPIYT 370 Query: 367 ISINASPNGQRLLKTCVSSPEYHYN 391 + + + L + +P+ + Sbjct: 371 VGFGSDIRPRDLAVVKIDAPDAVFF 395 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 61/223 (27%), Gaps = 23/223 (10%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS--IDLLSHVKEDVYM 254 VD + + + + N + +F + I S + Sbjct: 30 VDRPCGTDLSNLWLDVVLVVDNSEEMGSQRLFDVAANIIDVFGANTRIGSNSSEPITTRV 89 Query: 255 GLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 GLI Y N S + + + A Q+L Sbjct: 90 GLITYNFNATLNANLSQFQSYDDLSNGVFHSLSNVTNSTDSFIGTGLAMAEQLLR----- 144 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 RQ ++K II + I D+ K + +KI+T+ Sbjct: 145 -------RQNFNTTRDHYKKVIIVYASAFQ--RNEDETPEWIADRLKGSGVKIITVGYGN 195 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 S + L S P +N +LI I ++ K+ Sbjct: 196 SHGLIKSLSNIAS-PGLSFNSSGDGNLI---NQIQTSLLQGKF 234 >gi|221135718|ref|XP_002171014.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 132 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 37/109 (33%), Gaps = 14/109 (12%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A++ A + + + K II LTDG + Sbjct: 5 TRIDRALRLAQKDMFTSANGGRVG-------------VSKLIILLTDGSQTPGGDAEDPE 51 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 +I D+ + + + I+ + I ++ N + L + Y DSL Sbjct: 52 RIADELRNDGVVILGVGIGSAVN-ETELSHITGGKKNAYTAATFDSLTD 99 >gi|315173854|gb|EFU17871.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1346] Length = 683 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 101 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 160 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 161 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 215 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 216 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 275 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 276 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 314 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 315 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 355 >gi|308473880|ref|XP_003099163.1| CRE-CLEC-65 protein [Caenorhabditis remanei] gi|308267636|gb|EFP11589.1| CRE-CLEC-65 protein [Caenorhabditis remanei] Length = 376 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 44/166 (26%), Gaps = 14/166 (8%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 I +G I Y K++ + D LK T Sbjct: 66 QIGTGYTDPRSTRVGFITYNYNATDV----ADFYKLQSWADLDSQIQRLKMTPL------ 115 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L+ + ++K +I T + K + K Sbjct: 116 ARTTLSKMDTALYAAINMINSTAGFRDNYKKMVIVFTSVHGSYQK--NPPKDVSKILKAR 173 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN--ADSLIHVFQN 404 I +VT++ +S + Q LK S + ++ + Sbjct: 174 GIPVVTVNTGSSSDTQSYLKNIASDNMAYAMADGNATQEILKAMTD 219 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 67/190 (35%), Gaps = 39/190 (20%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP---SWGTEKVRQYVTRDMDSLIL 289 A L+F+D + + +G++ + R+E+ + + ++ V + + Sbjct: 63 TATLIFMDLLGPKDY------LGVLAFDDRIEELVPLQQVADNKGTFKEAVEG--NLVPR 114 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN----NFK 345 TD A+++A++ L S + ++ ++FLTDGE N Sbjct: 115 GFTDYVGALEEAFEQLHSVETGD----------------ARQVVVFLTDGEPNPHLDARN 158 Query: 346 SNVNTIKICDKA-------KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + + + + ++ + + L + + + + L Sbjct: 159 DDEFMEGYLGELWDLTGEYAAAGVPVYPVAFSDEVGPEVLEQIAGHTGADFVLMPDPGDL 218 Query: 399 -IHVFQNISQ 407 + F+ +S+ Sbjct: 219 VVTFFELVSR 228 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 45/128 (35%), Gaps = 22/128 (17%) Query: 254 MGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + LI Y+ + + + + V +++ T + A+ QAY++LT Sbjct: 1 LALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIK--GTTSTHIALHQAYKLLTDTD 58 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + +K II TDG + + + K+ ++I I++ Sbjct: 59 NENGV-----------REGVKKMIIIFTDGHSQRS-----PQDMALRLKDKGVEIFAITL 102 Query: 370 NASPNGQR 377 +P Sbjct: 103 TPAPYADE 110 >gi|319937919|ref|ZP_08012319.1| hypothetical protein HMPREF9488_03155 [Coprobacillus sp. 29_1] gi|319806825|gb|EFW03464.1| hypothetical protein HMPREF9488_03155 [Coprobacillus sp. 29_1] Length = 170 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 41/128 (32%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K ++ N +I F+L + +++ Y + ++ + A L GA+ V+ Sbjct: 2 MKAFYRNQDGNVTIAFSLLTPFIIFYFLWVVSSWQAMYIQLQTKAVIDFATLGGATTGVA 61 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + D + D + + L +A E + Sbjct: 62 ERTSASDIQAACYIPVVGGDYGDNVKEYGADVATQLLEENAYNTLPESVAKQIVQQAKGF 121 Query: 131 HMANNRLD 138 ++ + ++ Sbjct: 122 WLSEDEIN 129 >gi|221066731|ref|ZP_03542836.1| membrane protein-like protein [Comamonas testosteroni KF-1] gi|220711754|gb|EED67122.1| membrane protein-like protein [Comamonas testosteroni KF-1] Length = 723 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 9/50 (18%), Positives = 21/50 (42%), Gaps = 1/50 (2%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + FAL ++G + + +Y K +++ + A L A + + Sbjct: 1 MQFALVAGVIFSMLGV-VDLGYMYYAKRNLQRIADLAALEAAQSINAEQD 49 >gi|78186670|ref|YP_374713.1| hypothetical protein Plut_0798 [Chlorobium luteolum DSM 273] gi|78166572|gb|ABB23670.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 156 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 42/133 (31%), Gaps = 9/133 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I S K + + FAL + FL L+ ++ Y K + A AG + Sbjct: 28 IRSRKGSVLVEFALILPVFLALLFGVVSFSAALYNKTVLTMATREGARAGVLFVP----- 82 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI--SMTHM 132 DR ++I + D + +S + + NI+ S Sbjct: 83 --DRTDAIIRSSATLAADRVCQNNLISFGAPVSAAVSTNSTPTLDNILTVSATVDFTGIF 140 Query: 133 ANNRLDSSNNTIF 145 L S+ T Sbjct: 141 IVPDLVISSQTTM 153 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 54/183 (29%), Gaps = 29/183 (15%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + V +GL Y+ E + + + + T+ AM+ A Sbjct: 910 LSAERQGGASVRVGLAQYSRIARLEAELTNNLTHLVHHTEQAA--FQNDGTNVLGAMEYA 967 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 + L S +K ++ +DG + K + + Sbjct: 968 IRSLEGRGDTSGG---------------RKKLVLFSDGRSQAITE-ALLEKRAREVADAG 1011 Query: 362 IKIVTISINASPNGQRL--------LKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 +++ I++ N L + + + V + SL + + V R+ Sbjct: 1012 VEVFVIAVGNQVNEANLRTLVSRGRQDDISFAQRHLFRVPDYASL---LRGVFHQTVSRR 1068 Query: 414 YSV 416 S+ Sbjct: 1069 VSL 1071 >gi|304315865|ref|YP_003851010.1| von Willebrand factor A [Thermoanaerobacterium thermosaccharolyticum DSM 571] gi|302777367|gb|ADL67926.1| von Willebrand factor type A [Thermoanaerobacterium thermosaccharolyticum DSM 571] Length = 230 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 4/79 (5%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEYH 389 + II +TDG++N N A + I + I I N + +K + Sbjct: 7 RQIIVVTDGKSNVG---GNPADAAYLANKKGITVSAIGIVDDGNLSHKEIKDIANWGGGV 63 Query: 390 YNVVNADSLIHVFQNISQL 408 Y+++ +D I I+Q Sbjct: 64 YDIIYSDEFIKSLSAITQK 82 >gi|149573051|ref|XP_001520334.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 581 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 34/95 (35%), Gaps = 7/95 (7%) Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA----SPNG 375 + K ++ +TDG+ ++ + A+ + I + S Sbjct: 5 NPISGAREDANKILVVITDGQKYG--DSLKYSEAIPLAEAKGVIRYAIGVGDAFDFSSTA 62 Query: 376 QRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLM 409 + L+ S ++ + V N +L ++ Q + + Sbjct: 63 EELITIASSPEQDHVFRVDNFGALDNIQQQLQDKI 97 >gi|332260454|ref|XP_003279304.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Nomascus leucogenys] Length = 1160 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + + + Q + D Sbjct: 435 SKLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA---EVNEDNLEQAQSWIRD 491 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 492 MKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 524 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 525 -DQPPEMVIDQVKVFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 572 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 50/176 (28%), Gaps = 35/176 (19%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 E I V+D S SM+ + ++K AAL A+ +++ Sbjct: 218 QEISPIDIVFVLDKSASMNE------------GTLEGGGQSKNAALIEAVNEISENLLS- 264 Query: 246 SHVKEDVYMGLIGYTTR---------VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + + Y + ++ ++ + T T Sbjct: 265 DPNMDIRIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTL 324 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 +K Y+ L +D + +K +I + DG + + T Sbjct: 325 GLKNGYETLYADNGGE-------------NRNPEKILIVVGDGTPTFSYAPIQTRS 367 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 32/333 (9%), Positives = 80/333 (24%), Gaps = 73/333 (21%) Query: 147 NMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + L + Q N ++ + + + P I + + Sbjct: 218 QEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNE-ISENLLSDPNMDIR-IGMV 275 Query: 200 SGSMHC-----------AMNSDPEDVNSAPICQDKKRTKMAA--------LKNALLLFLD 240 + + + D+N ++ + LKN Sbjct: 276 NFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGYETLYA 335 Query: 241 SIDLLSHVKEDVYM----GLIGYTTRVEKNIEPSWGTEKVRQ-YVTRDMDSLILKPTDST 295 + E + + G ++ + + G V D ++ Sbjct: 336 DNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSGGGAWSSWSVMEDKIAVDDGDLFRN 395 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-------------- 341 T+ + T + Q + +G++ Sbjct: 396 FEDFDDDDNTTNAGFTNPVTYASDFDRPVNGTNVQYRYGEVKEGDDKAFHWVGTGAASNG 455 Query: 342 NNFKSNVNTIKIC-----------DKAKENFIKIVTISINASPN----------GQRLLK 380 + + +K +EN I +I + + G+ +LK Sbjct: 456 TTGEPDTQEKSSAINTVAYHHWLKNKYQENPPSIFSIGLGIDGSVAGRQRLDAIGRNVLK 515 Query: 381 TCVSSPE-----YHYNVVNADSLIHVFQNISQL 408 + +Y+ N + +I ++IS Sbjct: 516 NIADLNDDGTTPRYYDANNKNDIITALEDISST 548 >gi|167589629|ref|ZP_02382017.1| TadE family protein [Burkholderia ubonensis Bu] Length = 142 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 39/129 (30%), Gaps = 3/129 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + S+ F L + L+++ +I V K + +A+ A AG V L+ Sbjct: 9 RHARGVISLEFVLMLPFLLMVLLGIIDVSLILCDKAVITNASREAARAGVVVRVPMLTTT 68 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN---SSRISMTHM 132 +++ + A ++ ++ S + Sbjct: 69 QVANVALNYTQNSLITGGAATTPTVNVTQANGTTSGSALTVTVTYTYTGLVLGSAFSALT 128 Query: 133 ANNRLDSSN 141 + +++ Sbjct: 129 GPVTVSATS 137 >gi|315150240|gb|EFT94256.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0012] Length = 711 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 68/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 129 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALNGLQQFGEELSDSLTDGHVRIGI 188 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 189 VAYNRLTYSTADFSTDMNDLEDFLRTTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ + +D + + + S Sbjct: 244 KKMLVHIGDGSANASFLPRENAQSYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 303 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 304 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 342 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 343 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 383 >gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella baltica OS223] gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS223] Length = 772 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 43/252 (17%), Positives = 80/252 (31%), Gaps = 44/252 (17%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 N +V +P + + LV+D SGSM Sbjct: 366 GMNEDNYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMA-------------------- 405 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + KNALL L + + + + S + RQ+V+R Sbjct: 406 GDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSQFS-ATSLPATSSNLSRARQFVSRL- 463 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T+ A+ A V ++ + +IF+TDG N Sbjct: 464 --QADGGTEMALALDAAL-------------PKSLGSVSPDAVQPLRQVIFMTDGSVGNE 508 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++ + I+ + E+ ++ T+ I ++PN + + + D + Sbjct: 509 QALFDLIRY--QIGES--RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVDEVD---AK 561 Query: 405 ISQLMVHRKYSV 416 IS L+ +Y V Sbjct: 562 ISALLSKIQYPV 573 >gi|268532310|ref|XP_002631283.1| C. briggsae CBR-CLEC-60 protein [Caenorhabditis briggsae] gi|187036877|emb|CAP23542.1| CBR-CLEC-60 protein [Caenorhabditis briggsae AF16] Length = 408 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 51/175 (29%), Gaps = 22/175 (12%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY---VTRDM 284 +AA ++ I + +GL+ Y Y T Sbjct: 62 IAASIASVFSNGTRIGTDPYEPRTTRLGLVTYNAVANTVANLDTYQSLDDVYDGIFTALS 121 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + + QA IL K +Q+ +I Sbjct: 122 QVSSSDESYIVHGLAQAEDILEDGKSN------------KNRTHYQRVVIVY--ASTYKG 167 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADS 397 ++ I + D+ K + IVT++ + +G L L+ + P YN N D Sbjct: 168 TGPLDPIPVADRLKTAGVTIVTVAYDQDGDGALLADLQLIATPP---YNFSNTDQ 219 >gi|313831092|gb|EFS68806.1| TadE-like protein [Propionibacterium acnes HL007PA1] gi|314932293|gb|EFS96124.1| TadE-like protein [Propionibacterium acnes HL067PA1] Length = 133 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 8/135 (5%), Positives = 29/135 (21%), Gaps = 12/135 (8%) Query: 14 GIASEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 E+ + S+ L V L++ G + A + + ++ Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISA----------TQAAQDAAVAAS 50 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + A + + + + ++ Sbjct: 51 RAGTDAAATPQLAGHDGAAVAVQAARQALSAAGVDGSVQEDGSTITVTTSQSRPTVFLSA 110 Query: 132 MANNRLDSSNNTIFY 146 + +++ Sbjct: 111 IGISQVTGHGQAHAQ 125 >gi|291301210|ref|YP_003512488.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290570430|gb|ADD43395.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 609 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 19/176 (10%), Positives = 51/176 (28%), Gaps = 10/176 (5%) Query: 226 TKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRD 283 ++M ++ +L LD ++V T +K +E + + ++ Sbjct: 432 SRMDIIRKSLERSLDLFGEQANVGLWRYPYDDPVAGTAYQKLVEIGEFDKSRQDDIESQL 491 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + +AY+ + + ++ + +D +N Sbjct: 492 SAVEPAADGGLNDTVVEAYKNVLDNYNKTTGAINLVVVISDGGSE--------SDASLSN 543 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + K ++ I+TI + L ++ +Y D + Sbjct: 544 EDVTEELKDLSAKDRDKEASIMTIGYGKDADKDHLDAIATATQGRYYPAKWNDEIN 599 >gi|153867800|ref|ZP_01998040.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] gi|152144880|gb|EDN71960.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] Length = 276 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 35/124 (28%), Gaps = 7/124 (5%) Query: 292 TDSTPAMKQ---AYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTDGENNNFKS 346 T + LT D F K ++F TDG + Sbjct: 74 TSLNDDINTFYGGLNSLTIDGGSDISMAFAPINQLLSQARPHAAKIVVFYTDGYVFHSGK 133 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + + ++I + + + + P N +++ F+ ++ Sbjct: 134 MDAIVNDAEALRNQGVQIFAV--SPPEDDASAMSLITGYPNRVLRPNNLPDIVNRFRYVA 191 Query: 407 QLMV 410 +V Sbjct: 192 DAVV 195 >gi|84996511|ref|XP_952977.1| thrombospondin-related protein [Theileria annulata strain Ankara] gi|71532874|emb|CAJ20069.1| thrombospondin-related protein, putative [Theileria annulata] Length = 606 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 58/179 (32%), Gaps = 31/179 (17%) Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-----KPTDSTPAM 298 ++ V++ ++ ++T + I K Q +D L T + A+ Sbjct: 241 SINISPNYVHLSMVTFSTSIRWLISFLDPASKDEQLALAVLDKLKNSKPVFGYTFTGQAL 300 Query: 299 K---QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 +A + + + K II +TDG + T + Sbjct: 301 NFISEAVYMFGARRNS------------------PKGIIIITDGSSTQTNV---TSQASA 339 Query: 356 KAKENFIKIVTISINASPNGQ-RLLKTCVSSPEY-HYNVVNADSLIHVFQNISQLMVHR 412 ++ + I+ + + + + R + C + E + + N D +I + + Sbjct: 340 LLRDAGVTILVVGVGKAKESECRGIVGCSTKGECPLFFMTNWDEIIRKVGELMAEVCET 398 >gi|307292638|ref|ZP_07572484.1| hypothetical protein SphchDRAFT_0110 [Sphingobium chlorophenolicum L-1] gi|306880704|gb|EFN11920.1| hypothetical protein SphchDRAFT_0110 [Sphingobium chlorophenolicum L-1] Length = 198 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 35/101 (34%), Gaps = 1/101 (0%) Query: 6 RFR-FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R R + A ++ FA+ +M L++ + + Y ++ A A Sbjct: 2 RMRLSLHHRLAADQRGASVPEFAMILMPLCLILFGGLEMGYQIYVRSVTLGALERASRLS 61 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 + V++ + D +I A I +K + + Sbjct: 62 TIQTVNSTAVEADIEATIKRIVPSATISTSKSSFYQYSNIN 102 >gi|308512481|ref|XP_003118423.1| hypothetical protein CRE_00420 [Caenorhabditis remanei] gi|308239069|gb|EFO83021.1| hypothetical protein CRE_00420 [Caenorhabditis remanei] Length = 2228 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 28/218 (12%), Positives = 63/218 (28%), Gaps = 10/218 (4%) Query: 196 VVDLSGSMHCAMNSDPEDV--NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V ++ + P D +T + + + Sbjct: 2007 VTQVTQGPVTSTTPVPHTALQFDIVFLIDGSQTAQDSFDWLTKFVQTFMVSFNVGLNGAR 2066 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +GLI ++ P+ + + + +LK + A Q+LT + + Sbjct: 2067 VGLIVVAPDLDDQPPPAAQFNSITSQASLISNLNLLKE--NYADFDHAGQVLTYNLQIVT 2124 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ G + I + TI + I+TI Sbjct: 2125 SNDYKSAGYRSNIDNHVLVYITTNTAFYTDPTPTAQTI-----IAQQQYGIITIGYGTGF 2179 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + L+T + + SL + + I QL+++ Sbjct: 2180 DNNK-LQTISGGAACSFTAPDFASLNNQIKTIQQLIIN 2216 >gi|301631653|ref|XP_002944913.1| PREDICTED: hypothetical protein LOC100490469 [Xenopus (Silurana) tropicalis] Length = 988 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 22/280 (7%), Positives = 58/280 (20%), Gaps = 32/280 (11%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 V + L + + + + + +LV D + Sbjct: 41 TVESYVAPNVILSLDDSGSM--NERDMIGDTQTRTQVLKDALHEVFNDSDLVPDKKIRLA 98 Query: 205 CAMNSDPEDVNS-------------APICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 +D DV+ A + + + +++ Sbjct: 99 WQSMNDCTDVDGKNWTPALGSAAASAASASYPNAMRYNNAPQKETFTNPRTEKTATLEKY 158 Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 T +G + + R + T + Sbjct: 159 WNPRYNPATWPHMVTFTIGFGKDSLPIIQFRPAGNTSTTTTPINIGNYWNADPTQPLWLQ 218 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + K + + DG ++ + T+ Sbjct: 219 GTVHDGTNGNNGRLVP-PSKTLPYGFDGSFADYAAGRAQW-------------FTV---T 261 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + ++ Y V + L F+ I + + Sbjct: 262 HQVTEDMWHAAINGRGRFYAVERGEDLKLAFREIIRTIGT 301 >gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 843 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 48/172 (27%), Gaps = 29/172 (16%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKV 276 ++ +A K + F+ S++ +I ++ + Sbjct: 350 TSGSQRGEPLAKSKQLMRRFIQSLNPDD------TFSIIDFSDTTTALSATPLTNTVTNQ 403 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 ++ + T+ ++ +F VK + I+ + Sbjct: 404 QKAIAYINQLEANGGTELLNGIQT-------------VMDFPSPPVKR-----LRSIVLI 445 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 TDG N + ++ K ++ + + +S N L + Sbjct: 446 TDG--YIGNENEVLSVVKNQLKSGN-RLYSFGVGSSVNRFLLNRLAEIGRGT 494 >gi|67968521|dbj|BAE00622.1| unnamed protein product [Macaca fascicularis] Length = 480 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 3/70 (4%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 II +TDGE + +T++ KA++ + T+ + + + S Sbjct: 18 GNKVPSMIIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTVGV--ADYKLDQITAIADS 74 Query: 386 PEYHYNVVNA 395 PE+ + V N Sbjct: 75 PEHVFAVENG 84 >gi|134298678|ref|YP_001112174.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] gi|134051378|gb|ABO49349.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] Length = 183 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 24/56 (42%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 K + +K I+F + +L + I + +++ ++A LAGAS Sbjct: 3 NKVLKGKKGFTFILFVPVFLIIMLFMARGIDWGMATVARGKLQTISDAGSLAGASA 58 >gi|283782261|ref|YP_003373016.1| TadE family protein [Pirellula staleyi DSM 6068] gi|283440714|gb|ADB19156.1| TadE family protein [Pirellula staleyi DSM 6068] Length = 145 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 39/115 (33%), Gaps = 3/115 (2%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R K+ + A ++ FAL++ LL +I +NS+E+A A Sbjct: 9 RSSSKRTTRARSAAVTVEFALTIPLLFLLFFGMIEFARIAMIENSVENAAYEGARAA--- 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 +V + + + S + + ++I + + + N Sbjct: 66 IVPGGTSTSAQTAAQSALSAAMISGATVSVTPSNITIATTSVTVDVSVPLNSNTW 120 >gi|167767549|ref|ZP_02439602.1| hypothetical protein CLOSS21_02068 [Clostridium sp. SS2/1] gi|167710841|gb|EDS21420.1| hypothetical protein CLOSS21_02068 [Clostridium sp. SS2/1] gi|291558491|emb|CBL37291.1| hypothetical protein CL2_01850 [butyrate-producing bacterium SSC/2] Length = 480 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 9/37 (24%), Positives = 17/37 (45%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK 50 I E+ + S A+ + FLLLI + + + + Sbjct: 1 MIRDEQGSMSAFLAMLFLVFLLLISVCVEGIYMYTAR 37 >gi|116626778|ref|YP_828934.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] gi|116229940|gb|ABJ88649.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] Length = 466 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 37/113 (32%), Gaps = 2/113 (1%) Query: 16 ASEKANFSIIFA-LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM-VSNLS 73 A+ + SI A + V ++GF + + + + + A AA +A AS + + + Sbjct: 31 AAREGAISIQLAVIMVPVLFGMMGFALDLGRLYLVRGELNHAAEAAAIAAASHLIGTTGA 90 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + + + + +++T N Sbjct: 91 LDAATTAAQQTLTLNKYNFGSLTPGEGSGNLTSTITDPAYFSTVAGATGNDPN 143 >gi|2342594|emb|CAA73785.1| matrilin-3 precursor [Homo sapiens] Length = 310 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 34/84 (40%), Gaps = 9/84 (10%) Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 G + PS K I +TDG + + ++ +A+ + I++ + ++ + Sbjct: 1 EAGAREPSSNIPKVAIIVTDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMAS-- 53 Query: 379 LKTCVSSP--EYHYNVVNADSLIH 400 LK S P E+ + V + Sbjct: 54 LKMMASEPLEEHVFYVETYGVIEK 77 >gi|302686254|ref|XP_003032807.1| hypothetical protein SCHCODRAFT_82034 [Schizophyllum commune H4-8] gi|300106501|gb|EFI97904.1| hypothetical protein SCHCODRAFT_82034 [Schizophyllum commune H4-8] Length = 1039 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 16/193 (8%), Positives = 51/193 (26%), Gaps = 12/193 (6%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + K ++N L + ++ + + + RV K S Sbjct: 529 PPPGSPASTAQLKCRVIRNTLEFIMGNLGQKDRISFVTFEVGV--GGRVRKTPFLSVVRA 586 Query: 275 KVRQYVTRDMDSLILK-------PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 ++ + ++ ++ T + + + + +D + + Sbjct: 587 TSKKRLENFVEGIVFGRIPGDDSGTINDEFLVRGAKDEKTDVVTAVNHGLDVVLQRKSKN 646 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 P ++ ++ + + A + I + S + L + Sbjct: 647 PVSGMLLISDAADSTRRAQMDLVLARAEAA---NVPIHSFGYGRSHDPASLWLMSNHTSG 703 Query: 388 YHYNVVNADSLIH 400 + V + L Sbjct: 704 TYTFVKDWYDLRD 716 >gi|72162665|ref|YP_290322.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916397|gb|AAZ56299.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 596 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 40/147 (27%), Gaps = 25/147 (17%) Query: 265 KNIEPSWGTEKVRQYVTRDMDS-----LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 Q+ T T + AYQ LT Sbjct: 448 PIRPLDTDVNGATQHETLTAALAQLAPKPDGDTGLYDTILAAYQELTETYAPDRVN---- 503 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK----ENFIKIVTISINASPNG 375 ++ LTDG N+++ S + + + + + I+ IS + Sbjct: 504 ------------TLLVLTDGNNDDYDSITLEELLKELGEAYQPDRPVSIIAISFGPDVDP 551 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVF 402 + L + + Y + + +F Sbjct: 552 EPLERIAEVTRGAAYTTEDPTEIGEIF 578 >gi|326670658|ref|XP_003199261.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1823 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 43/371 (11%), Positives = 97/371 (26%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 46 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 103 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 104 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 156 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 157 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 204 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N + Sbjct: 205 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTY 261 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 262 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 309 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 310 QILVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 362 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 363 SVSDFSELPNV 373 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 43/371 (11%), Positives = 97/371 (26%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 845 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 902 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 903 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 955 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 956 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 1003 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N + Sbjct: 1004 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTY 1060 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 1061 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 1108 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 1109 QILVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 1161 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 1162 SVSDFSELPNV 1172 Score = 39.1 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 322 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 381 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 382 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 429 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 430 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 485 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 486 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 531 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 532 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 583 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 1121 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 1180 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 1181 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 1228 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 1229 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 1284 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 1285 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 1330 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 1331 KELGVL--TLGIGSRGSDSRDLQRISYEANYALSVADFSELPNVQEQLLASVQT 1382 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 50/180 (27%), Gaps = 25/180 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 608 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 667 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ Y+ + I I T + A+K AY L + +R Sbjct: 668 GVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR 727 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +TDG ++ ++N +C + + + I I Sbjct: 728 QKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 768 >gi|159124147|gb|EDP49265.1| von Willebrand and RING finger domain protein [Aspergillus fumigatus A1163] Length = 1013 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 54/168 (32%), Gaps = 21/168 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L++AL + ++ MGL+ + + T K + + S Sbjct: 511 KITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSWGGWNKILSS 563 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + L +D + S P I+ ++D ++ +S Sbjct: 564 IRPVGQ----------KSLRADVVEGANVAMDLLMQRKSSNPIS-TILLISDSSTSDPES 612 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK + I + + + +++ + + V + Sbjct: 613 VDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 657 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 50/180 (27%), Gaps = 25/180 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 609 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 668 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ Y+ + I I T + A+K AY L + +R Sbjct: 669 GVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR 728 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +TDG ++ ++N +C + + + I I Sbjct: 729 QKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 769 >gi|118370680|ref|XP_001018540.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89300307|gb|EAR98295.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 930 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 51/182 (28%), Gaps = 27/182 (14%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 +++NA+ + I L +++ + S Sbjct: 340 QSIQNAIEALILFIKSLPLDSYFNIYSFGTEFSKLFDQSQKYSNENVELALNEIITYSAN 399 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ + + F Q + + I LTDG+ N Sbjct: 400 YGGTNIYQPL----------------SEIFNQPYVKG---YGRQIYILTDGQIENK---- 436 Query: 349 NTIKICDKAKENFI--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + + N I ++ I I + ++++ S H +V + + NI Sbjct: 437 --ENVMHLIQSNNISNRVHAIGIGLYVDKDLIIQSAKSGKGCHAHVTDQSLIQESIINIL 494 Query: 407 QL 408 Q Sbjct: 495 QN 496 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 50/180 (27%), Gaps = 25/180 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 616 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 675 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ Y+ + I I T + A+K AY L + +R Sbjct: 676 GVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR 735 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +TDG ++ ++N +C + + + I I Sbjct: 736 QKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 776 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 24/164 (14%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDM--DSLILKPTDSTP 296 ++ + M L+ Y ++ ++ ++ + + + R +S T Sbjct: 879 TLARRDDDPLNARMALLQYGSQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVH 938 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + +R +F +FLTDG N + Sbjct: 939 AINNVVRGARGGARRHAELSF----------------VFLTDGVTGNDS----LEESVHS 978 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + +++ + L K + + + DSL Sbjct: 979 MRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSLAQ 1022 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 50/180 (27%), Gaps = 25/180 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 611 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 670 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ Y+ + I I T + A+K AY L + +R Sbjct: 671 GVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR 730 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +TDG ++ ++N +C + + + I I Sbjct: 731 QKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 771 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 24/164 (14%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDM--DSLILKPTDSTP 296 ++ + M L+ Y ++ ++ ++ + + + R +S T Sbjct: 874 TLARRDDDPLNARMALLQYGSQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVH 933 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + +R +F +FLTDG N + + Sbjct: 934 AINNVVRGARGGARRHAELSF----------------VFLTDGVGN----DSLLEESVHS 973 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + L K + + + DSL Sbjct: 974 MRTRNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSLAQ 1017 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 50/180 (27%), Gaps = 25/180 (13%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 346 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 405 Query: 255 GLIGYT-TRVEKNIEPSWGTEKVRQYVTRD--MDSLILKPTDSTPAMKQAYQILTSDKKR 311 G++ Y+ + I I T + A+K AY L + +R Sbjct: 406 GVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR 465 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 F + +TDG ++ ++N +C + + + I I Sbjct: 466 QKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 506 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 24/164 (14%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDM--DSLILKPTDSTP 296 ++ + M L+ Y ++ ++ ++ + + + R +S T Sbjct: 609 TLARRDDDPLNARMALLQYGSQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVH 668 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + +R +F +FLTDG N + + Sbjct: 669 AINNVVRGARGGARRHAELSF----------------VFLTDGVGN----DSLLEESVHS 708 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + +++ + L K + + + DSL Sbjct: 709 MRTRNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSLAQ 752 >gi|70991487|ref|XP_750592.1| von Willebrand and RING finger domain protein [Aspergillus fumigatus Af293] gi|66848225|gb|EAL88554.1| von Willebrand and RING finger domain protein [Aspergillus fumigatus Af293] Length = 1037 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 54/168 (32%), Gaps = 21/168 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L++AL + ++ MGL+ + + T K + + S Sbjct: 535 KITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSWGGWNKILSS 587 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + L +D + S P I+ ++D ++ +S Sbjct: 588 IRPVGQ----------KSLRADVVEGANVAMDLLMQRKSSNPIS-TILLISDSSTSDPES 636 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK + I + + + +++ + + V + Sbjct: 637 VDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 681 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 36/277 (12%), Positives = 75/277 (27%), Gaps = 57/277 (20%) Query: 147 NMDVMTSYDYRLQFI---EHLLNQRYNQKIVSF--------IPALLRIEMGERPIFLIEL 195 D+ LQ+ L + + V +P + P I Sbjct: 262 QGSAEMDRDFVLQWSAASGSLPGAAFFTERVDDQYYGLLMLVPPASQRAAETVPR-EIVF 320 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 VVD SGSM +K A ++ L + Sbjct: 321 VVDTSGSMGGVS-----------------------IKQAKGSLTRALRHLGP---NDRFN 354 Query: 256 LIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +I + + + + ++ T+ PA++ A ++ Sbjct: 355 VIEFNSSHRALFQHAVPASHHNLQLASEYVRHLEASGGTEMMPALQLALKL--------- 405 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ P + +IF+TDG + I D ++ T+ I ++P Sbjct: 406 ----PGAQDELRPEPALRQVIFITDGA--VGNESALFEHIVDSL--GGSRLFTVGIGSAP 457 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 N + K + + + + + Sbjct: 458 NAWFMRKAAEYGRGTFTYIGDVAEVGEKMDALFLNLT 494 >gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis HAW-EB4] gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis HAW-EB4] Length = 850 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 17/150 (11%), Positives = 53/150 (35%), Gaps = 11/150 (7%) Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF-- 332 + + T+ + A+ A L +D+ + + + ++ Sbjct: 515 NLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAA 574 Query: 333 -----IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 ++F+TDG + +I ++ E+ ++ TI I ++PN + + Sbjct: 575 TPLRQVLFITDGA--VANESRLFEQIKNQLGES--RLFTIGIGSAPNAHFMQRAAEVGRG 630 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 + + D + ++ + + + + + Sbjct: 631 TYTYIGKLDEVNQKVVSLLEKIEKPQVTDV 660 >gi|299137493|ref|ZP_07030675.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600898|gb|EFI57054.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 451 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 65/214 (30%), Gaps = 47/214 (21%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 + ++NA F S+ + + + Y R+ + + + +RQ + Sbjct: 209 ASNSYAFIQDMQNASASFFHSLQPDDY------VAVETYDMRMHVLTDFTNNKDTIRQAL 262 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T+ A+ + L+ + +K+II +++G Sbjct: 263 QSLT-MPGFRETNEFDALYETLDRLS-------------------RVEGRKYIILISNGR 302 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN------ASPNG--------------QRLLK 380 + + ++ + + + I TIS + G ++ Sbjct: 303 DTMSRLTLDQM-MAKIKATPNVTIYTISTGGLARELSDARGQMGGSTRMNYLQADNQMRT 361 Query: 381 TCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKY 414 + Y + L +F I++ + + Sbjct: 362 FAQMTGGASYQPLFQGELPDIFSQINESIRTQYV 395 >gi|269955639|ref|YP_003325428.1| hypothetical protein Xcel_0836 [Xylanimonas cellulosilytica DSM 15894] gi|269304320|gb|ACZ29870.1| hypothetical protein Xcel_0836 [Xylanimonas cellulosilytica DSM 15894] Length = 145 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 8/140 (5%), Positives = 34/140 (24%), Gaps = 10/140 (7%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 E+ + S L+ + ++ + + + + A A A++ + Sbjct: 13 ERGSISGFACLAALVLMMCLALAVDLGG--------QVAAEQNARAVAAEAARVGGQQIS 64 Query: 78 RFESISNHAKRALIDDAKRFIKN--HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + A+ + + + + + + + + Sbjct: 65 AATVRGDDVTLAVGEARAAALTHLRDAGVAGDVQVTGGDTVTVTVNDTYPTTFLGVLGFD 124 Query: 136 RLDSSNNTIFYNMDVMTSYD 155 L + + + Sbjct: 125 TLPVNGTVAARAIRTIGDQA 144 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 11/58 (18%), Positives = 29/58 (50%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 L + K + K +F++ A+ + + +L + LI V ++ Y+K+ +++ + Sbjct: 3 LKKSWRNCKGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTA 60 >gi|50119742|ref|YP_048909.1| hypothetical protein ECA0797 [Pectobacterium atrosepticum SCRI1043] gi|49610268|emb|CAG73711.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 210 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 24/70 (34%), Gaps = 2/70 (2%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 RF + S +A+ ++ AL+ L + + ++ ME A +A Sbjct: 16 RFLSGLCRFWFSRRASIAVETALAFPIVLAIGSLCADLYTVGLERTRMEQ--RAGAIASI 73 Query: 66 SKMVSNLSRL 75 M L Sbjct: 74 LAMQQKLDEN 83 >gi|87310695|ref|ZP_01092823.1| BatB [Blastopirellula marina DSM 3645] gi|87286676|gb|EAQ78582.1| BatB [Blastopirellula marina DSM 3645] Length = 364 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 56/188 (29%), Gaps = 46/188 (24%) Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL 245 + + V+D+S SM + +D ++ K + L Sbjct: 86 TPQKGIEVMFVLDVSRSM---------------LAEDVTPNRLQRAKQQIKDMLK----- 125 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI-LKPTDSTPAMKQAYQI 304 V +GLI + +++ + E +Q + + + A+ A + Sbjct: 126 --VMAGDRVGLIAFAGETRQSVPLTSHYEDFKQTLDSVGPHTVRSGGSRLGDAIAAASKG 183 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA-KENFIK 363 K I+ TDGE+ K +++ E I+ Sbjct: 184 FIDK------------------TYDHKAIVVFTDGEDQESK----PVEVAKSLYAERGIR 221 Query: 364 IVTISINA 371 + T+ + Sbjct: 222 VFTVGLGD 229 >gi|83951472|ref|ZP_00960204.1| hypothetical protein ISM_12955 [Roseovarius nubinhibens ISM] gi|83836478|gb|EAP75775.1| hypothetical protein ISM_12955 [Roseovarius nubinhibens ISM] Length = 178 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 8/49 (16%), Positives = 19/49 (38%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + + N ++ F + FL L F + + + +E + A+ Sbjct: 11 RFWRGQAGNATVEFVIVFPVFLTLFLFSLELAIITLRHTMLERGLDIAV 59 >gi|209551753|ref|YP_002283670.1| hypothetical protein Rleg2_4182 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537509|gb|ACI57444.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 193 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 10/113 (8%), Positives = 31/113 (27%), Gaps = 8/113 (7%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 ++ +R ++ K +I FA+ ++L + A Sbjct: 4 LNTFTRLVLTVRRLAQDRKGAGAIEFAILFPVLIMLYIGAFEITIGLSVSKR-------A 56 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A + + +++ +S + + ++G + Sbjct: 57 TRAAGT-VADVVTQQQSVTKSALAQMPSVANSIFVPYNTTSLTLKITGITIDA 108 >gi|242048544|ref|XP_002462018.1| hypothetical protein SORBIDRAFT_02g012640 [Sorghum bicolor] gi|241925395|gb|EER98539.1| hypothetical protein SORBIDRAFT_02g012640 [Sorghum bicolor] Length = 333 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 14/52 (26%), Gaps = 1/52 (1%) Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQN 404 K + T A + L S+ E V + D L F Sbjct: 14 AADVKIGNAPVYTFGFGADYDPTVLNAVARNSTGETFSVVNDVDKLSMAFSQ 65 >gi|114798948|ref|YP_761697.1| Flp/Fap pilin component [Hyphomonas neptunium ATCC 15444] gi|114739122|gb|ABI77247.1| Flp/Fap pilin component [Hyphomonas neptunium ATCC 15444] Length = 205 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 17/171 (9%), Positives = 46/171 (26%), Gaps = 16/171 (9%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K + +E ++ FA+ +L+ I + + + A AS + Sbjct: 22 VKSLLRNEDGVSAVEFAVIAPLMVLIFFGCIELSFLMR--------ADRRVTATASSLGD 73 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY----NTEIQNIVNSSR 126 SRL ++ A + + + + ++ + R Sbjct: 74 LTSRLATVTDADMRELYNAATVMMQPYPASETRMRITSIEDNGNGQKRVKWSDGHEMTPR 133 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYD----YRLQFIEHLLNQRYNQKI 173 + + + +V Y + + + + Y + Sbjct: 134 AVNSLVNIPDGIVPSPGSVILTEVEYDYSSGFGFVIDTSTTIADSFYLRPR 184 >gi|167821376|ref|ZP_02453056.1| hypothetical protein Bpse9_40018 [Burkholderia pseudomallei 91] Length = 396 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 74/319 (23%), Gaps = 20/319 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I+ + + S I + S I +R N Sbjct: 61 ITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVN 120 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDL 199 I V + + IP + P+ + Sbjct: 121 WFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGD 180 Query: 200 -------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +G+ + +S + A + + D + Sbjct: 181 WLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTN 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DSTPAMKQA 301 + Y + + + D + + T + Sbjct: 241 AYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITG 300 Query: 302 YQILTSDKKRSFFTNFFRQ 320 S + R+ Sbjct: 301 INTGGSYNPSYYAAGADRR 319 >gi|76798101|ref|ZP_00780356.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|76586556|gb|EAO63059.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] Length = 420 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 40/132 (30%), Gaps = 20/132 (15%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A+ Y L + + + T GE N Sbjct: 3 NEGYAINSGYIYLY------WRDYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNG---- 52 Query: 353 ICDKAKENFIKIVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + I T+ I + + ++ +++ S E + NV + + + Sbjct: 53 ---NIRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKY 109 Query: 406 SQLMVHRKYSVI 417 + +V K+S++ Sbjct: 110 FKTIVEEKHSIV 121 >gi|329946213|ref|ZP_08293826.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 170 str. F0386] gi|328527811|gb|EGF54802.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 170 str. F0386] Length = 370 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 59/208 (28%), Gaps = 31/208 (14%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 +C D + + + L F D ++ + +GL+ + + + + + Sbjct: 115 NRDIVLCLDVSTSMVKIDSSVLTTFADILEDFDGER----VGLVAWNSAAQTIVPLTDDY 170 Query: 274 EKVRQYVTRDMDSLILKPTD--------STPAM-KQAYQILTSDKKRSFFTNFFRQGVKI 324 + +RQ + D L + P + A L Q Sbjct: 171 DLLRQQMDDLGDVLDIDPQNPTYKQQLRYLEAFSGTQNTSLDGSSLAGDGLASCAQAFDN 230 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVN--TIKICDKAKENFIKIVTISINASPNGQRLLKTC 382 L + II TD + + K D E I++ +I A + + Sbjct: 231 QGLERSRSIILATDNQVIDPKKEQIYTLPDAVDLLAERKIRLFSIY-GADDDQSSKYELD 289 Query: 383 VS---------------SPEYHYNVVNA 395 S Y+V ++ Sbjct: 290 QSPEDSREELKTVTEGKGKGRFYDVEDS 317 >gi|328949411|ref|YP_004366747.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328449735|gb|AEB15450.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 385 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 43/117 (36%), Gaps = 7/117 (5%) Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + R + + +K +I L+DG + N + + ++ + + + + Sbjct: 159 LSEGQPRLYDSIMNLIRKVEQKKGKRKVVIILSDGRDQNSRFSK--DQLIETLSNSGLPV 216 Query: 365 VTISINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILK 419 T+ + + + L + + +Y + + + + K S I+K Sbjct: 217 YTVGMKVLSNQSLSNLDEISQLTGGTYYYSTRFKDIPDNLKKVVDCI---KQSYIVK 270 >gi|108757686|ref|YP_629407.1| hypothetical protein MXAN_1147 [Myxococcus xanthus DK 1622] gi|108461566|gb|ABF86751.1| hypothetical protein MXAN_1147 [Myxococcus xanthus DK 1622] Length = 525 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 12/86 (13%), Positives = 33/86 (38%), Gaps = 5/86 (5%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-----AILAGASKMV 69 +++ ++ ALS + L++ + +K ++ ++A A+L S Sbjct: 19 RRAQRGQSIVLGALSFLVLALMVTLSFNLAHALRQKMGLQQHSDALAYSMAVLEARSLNY 78 Query: 70 SNLSRLGDRFESISNHAKRALIDDAK 95 +S ++ ++ A + A Sbjct: 79 YAVSNRAIAASYVAMNSIHAYMVAAS 104 >gi|268530440|ref|XP_002630346.1| C. briggsae CBR-CLEC-62 protein [Caenorhabditis briggsae] Length = 386 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQI 304 K V +GL+ Y + + + + A + Sbjct: 71 DKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAAESV 130 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L++ +K S Q +I+ +D ++ + T ++ K + + I Sbjct: 131 LSAGRKN-----------ATRSNYKQLVLIYASDYRDDGEEDPRPT---AERMKSSGVSI 176 Query: 365 VTISINASPNGQRLLKTCV-SSPEYHYNVVNAD---SLIHVF 402 T++ + + N + +SP +++ +AD + Sbjct: 177 ATVAFDQTGNEGVVKAIGEIASPGFNFTNEDADLVREIQGAM 218 >gi|328875636|gb|EGG24000.1| hypothetical protein DFA_06138 [Dictyostelium fasciculatum] Length = 832 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 54/182 (29%), Gaps = 34/182 (18%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVR 277 ++ + K AL + + S+ E+ ++ + + +K S + + Sbjct: 350 SGSMSGDRIESAKRALEIMMRSL------NENTMFNIVSFGSGFKKLFPSSTRYNDASLE 403 Query: 278 QYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T+ +K + + + + L Sbjct: 404 KASSLIASMDADMGGTELFEPIKDVLAM-------------------PALPEYPRQVFIL 444 Query: 337 TDGENNNFKS-NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 TDG + + C+ + I T+ I + + ++ + ++ V + Sbjct: 445 TDGAVSQREVLIKYVASECNTTR-----IFTLGIGNGVDKELVIGLSKACKGHYEMVSDT 499 Query: 396 DS 397 S Sbjct: 500 SS 501 >gi|314980651|gb|EFT24745.1| TadE-like protein [Propionibacterium acnes HL072PA2] Length = 133 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 8/135 (5%), Positives = 29/135 (21%), Gaps = 12/135 (8%) Query: 14 GIASEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 E+ + S+ L V L++ G + A + + ++ Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISA----------TQAAQDAAVAAS 50 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + A + + + + ++ Sbjct: 51 RAGTDAAATPQLAGHDGAAVAVQAARQALSAAGGDGSVQEDGSTITVTTSQSRPTVFLSA 110 Query: 132 MANNRLDSSNNTIFY 146 + +++ Sbjct: 111 IGISQVRGHGQAHAQ 125 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 65/209 (31%), Gaps = 22/209 (10%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN--ALLLFLDSIDL 244 + VVD S M + + TK N ++ F I L Sbjct: 34 TNLWLDVIAVVDNSRGMTVNGLNYVSIFS--LTMPSSLFTKFQIASNIASVFGFGTRIGL 91 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTE---KVRQYVTRDMDSLILKPTDSTPAMKQA 301 + +GL+ Y + + + + + +++ + + + A Sbjct: 92 NASEPRTTRLGLVTYNSVATQMADLNQYQSLHDAFNRIFDDLSNTVDTTESYLSTGLTLA 151 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 ++ S +QK +I N +SN I D+ K + Sbjct: 152 EKMFNDQSVNS------------TRAHYQKVVIVYASKYQTNGESN--PESIADRLKLSG 197 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHY 390 +KI+T++ + + L S P + + Sbjct: 198 VKIITVAYGNAYGLMKSLSIIAS-PGFAF 225 >gi|260177188|gb|ACX33912.1| hypothetical CBS domain-containing protein [uncultured prokaryote AT3] Length = 433 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 48/144 (33%), Gaps = 6/144 (4%) Query: 25 IFALSVMSFLLL--IGFLI-YVLDWHYKKNSMESANNAAILAGASKMV--SNLSRLGDRF 79 + AL +++FL+ F + + ++ ++ ++ L+ A + S SR Sbjct: 1 MSALGILAFLIFTCGFFALAEMALASSRRAKLQQLADSGDLSAARALQIKSTPSRFIAAT 60 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 ++ A + + H++ + V + + + ++ +T A + Sbjct: 61 QTGLTAASLLAGIFGENALAIHVEHFIEQSLPVLNAAKSEIAITATVTVVTAFAIVFGEI 120 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEH 163 I + F+ Sbjct: 121 IPKRIALANPEKVA-SLVAPFMGF 143 >gi|148669822|gb|EDL01769.1| von Willebrand factor A domain containing 2 [Mus musculus] Length = 748 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 40/127 (31%), Gaps = 21/127 (16%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G + + + + RQ V + ++ K + + A Sbjct: 36 DALDISPGRVRVGALQFGSTPHLEFPL--DSFSTRQEVKESIKGIVFKGGRTETGL--AL 91 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L+ + + +I +TDG++ + + +E I Sbjct: 92 KRLSRGFPGGRNGS------------VPQILIIVTDGKSQGPVALP-----AKQLRERGI 134 Query: 363 KIVTISI 369 + + + Sbjct: 135 VVFAVGV 141 >gi|109733269|gb|AAI16637.1| Von Willebrand factor A domain containing 2 [Mus musculus] Length = 791 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 40/127 (31%), Gaps = 21/127 (16%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G + + + + RQ V + ++ K + + A Sbjct: 79 DALDISPGRVRVGALQFGSTPHLEFPL--DSFSTRQEVKESIKGIVFKGGRTETGL--AL 134 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L+ + + +I +TDG++ + + +E I Sbjct: 135 KRLSRGFPGGRNGS------------VPQILIIVTDGKSQGPVALP-----AKQLRERGI 177 Query: 363 KIVTISI 369 + + + Sbjct: 178 VVFAVGV 184 >gi|26325252|dbj|BAC26380.1| unnamed protein product [Mus musculus] Length = 721 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 40/127 (31%), Gaps = 21/127 (16%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G + + + + RQ V + ++ K + + A Sbjct: 79 DALDISPGRVRVGALQFGSTPHLEFPL--DSFSTRQEVKESIKGIVFKGGRTETGL--AL 134 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L+ + + +I +TDG++ + + +E I Sbjct: 135 KRLSRGFPGGRNGS------------VPQILIIVTDGKSQGPVALP-----AKQLRERGI 177 Query: 363 KIVTISI 369 + + + Sbjct: 178 VVFAVGV 184 >gi|42741661|ref|NP_766428.2| von Willebrand factor A domain-containing protein 2 precursor [Mus musculus] gi|81893914|sp|Q70UZ7|VWA2_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|27657433|emb|CAD60277.1| AMACO [Mus musculus] gi|74225702|dbj|BAE21683.1| unnamed protein product [Mus musculus] Length = 791 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 40/127 (31%), Gaps = 21/127 (16%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L V +G + + + + RQ V + ++ K + + A Sbjct: 79 DALDISPGRVRVGALQFGSTPHLEFPL--DSFSTRQEVKESIKGIVFKGGRTETGL--AL 134 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + L+ + + +I +TDG++ + + +E I Sbjct: 135 KRLSRGFPGGRNGS------------VPQILIIVTDGKSQGPVALP-----AKQLRERGI 177 Query: 363 KIVTISI 369 + + + Sbjct: 178 VVFAVGV 184 >gi|110636421|ref|YP_676629.1| TadE-like [Mesorhizobium sp. BNC1] gi|110287405|gb|ABG65464.1| TadE-like protein [Chelativorans sp. BNC1] Length = 189 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 21/51 (41%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 ++ +K +I FAL FLL++ +I + + +A + Sbjct: 17 TALLRRFGREKKGATAIEFALLAFPFLLVLFAIIESAVAFVAQELLTNAAD 67 >gi|1616950|gb|AAB16869.1| CD11b [Sus scrofa] Length = 920 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 60/202 (29%), Gaps = 27/202 (13%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 ++ + F+ ++ K L+ Y+ + + + Sbjct: 24 GSGSINRLDF--QRMKEFVSTVM-GQFQKSKTLFALMQYSEDFYTHFTFNDFKRNPSPKL 80 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 +L T + +++ + L K + K ++ +TDGE Sbjct: 81 LVRPIRQLLGRTHTATGIRKVVRELFHSKSGARENAL-------------KILVVITDGE 127 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPN---GQRLLKTCVSSP--EYHYNVVNA 395 + + +A + I + + N + L T S P ++ + V N Sbjct: 128 KFG--DPLGYEDVIPEADRKGVIRYVIGVGDAFNSWKSREELNTIASKPCGDHVFQVNNF 185 Query: 396 DSLIHVFQNISQLMVHRKYSVI 417 + I + + +++ Sbjct: 186 ----EAVKTIQNQLQEKTFAIE 203 >gi|326670660|ref|XP_003199262.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1404 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 43/371 (11%), Positives = 97/371 (26%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 651 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 708 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 709 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 761 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 762 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 809 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N + Sbjct: 810 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTY 866 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 867 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 914 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 915 QILVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 967 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 968 SVSDFSELPNV 978 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 54/185 (29%), Gaps = 24/185 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+K+ + ++ + + + ++ Y+ E N + T K + Sbjct: 17 DGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTYTTK-EE 72 Query: 279 YVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 V + A++ F + ++ L Sbjct: 73 IVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVPQILVLL 121 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 + G S + KE + T I + + R L+ P Y +V + Sbjct: 122 SGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSVSDFS 174 Query: 397 SLIHV 401 L +V Sbjct: 175 ELPNV 179 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 128 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 187 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 188 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 235 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 236 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 291 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 292 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 337 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 338 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 389 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 927 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 986 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 987 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 1034 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 1035 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 1090 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 1091 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 1136 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 1137 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 1188 >gi|301622626|ref|XP_002940637.1| PREDICTED: integrin alpha-L-like [Xenopus (Silurana) tropicalis] Length = 1031 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 69/183 (37%), Gaps = 22/183 (12%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 N + F+ ++ + V+ I ++T + + + V L+ T Sbjct: 179 NIVKEFMKNVIKSLENETSVHFAAIQFSTHPKTEFTFADFQKDRNPDVLLANYRLLKGFT 238 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ A++ + ++K S +K ++ L DGE+ + +T K Sbjct: 239 NTYKAIQYTLDRIFTEKYGS-------------RPSAKKVLVILADGEST----DDDTTK 281 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVV--NADSLIHVFQNISQL 408 +KA + + I + + + L+ VS + E+ + +A L +F + + Sbjct: 282 AIEKADKARVSRYIIGVGQNFKTED-LEAFVSWPAKEHTRTIEKFDAQQLTILFAELQRK 340 Query: 409 MVH 411 ++ Sbjct: 341 ILS 343 >gi|223940571|ref|ZP_03632417.1| von Willebrand factor type A [bacterium Ellin514] gi|223890763|gb|EEF57278.1| von Willebrand factor type A [bacterium Ellin514] Length = 965 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 35/107 (32%), Gaps = 6/107 (5%) Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVK-IPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + + SF S K II +DG+ N ++ + D Sbjct: 467 ALGRQIAGMNQGDLPSFQGVMTLAHEGLKKSHSNLKHIIVFSDGDPNAPSPSLMNDIVSD 526 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + I + T+ I + ++ Y V + + L +F Sbjct: 527 R-----ITVSTVLIAGHAGPETMIWIADHGKGRFYPVTSPNELPQIF 568 >gi|167648158|ref|YP_001685821.1| hypothetical protein Caul_4199 [Caulobacter sp. K31] gi|167350588|gb|ABZ73323.1| conserved hypothetical protein [Caulobacter sp. K31] Length = 188 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 13/174 (7%), Positives = 43/174 (24%), Gaps = 22/174 (12%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLL-IGFLIYVLDWHYKKNSMESANNAAILAG 64 R ++ + + ++ FA +L G + + + + Sbjct: 9 RPIKFWVQFWRDRRGASAVEFAFIAPVLVLFYCGMS-ELTEAMIAQRRLSH--------I 59 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 AS + ++R ++ + F +G + Sbjct: 60 ASSIGDVVARDTQLTDARRTDVFKVGSVLMAPFP-------TTGLRMCIVSITSNAAGTI 112 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 + + +N+ + N+ L ++ + + + Sbjct: 113 DTVDWSEPSNSPTNCPAKGAVINIPAS-----VLPAGGSVIMSKASYDYEPPVK 161 >gi|149913214|ref|ZP_01901748.1| hypothetical protein RAZWK3B_04460 [Roseobacter sp. AzwK-3b] gi|149813620|gb|EDM73446.1| hypothetical protein RAZWK3B_04460 [Roseobacter sp. AzwK-3b] Length = 136 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 29/86 (33%), Gaps = 1/86 (1%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM-ESANNA 59 M+ L R + S+ F L + FL+++ I V +++ + + Sbjct: 1 MNFLPRLARCMRAYRKKSDGTVSVEFVLWMPLFLVILALAIDVSLLFMSQSNYWSVSRDT 60 Query: 60 AILAGASKMVSNLSRLGDRFESISNH 85 A L M ++ + S Sbjct: 61 ARLVARHAMDGTTAKSYAEMRAGSFF 86 >gi|4154309|gb|AAD04919.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ F+ ++ + + ++ Y + ++ + EKV D+ Sbjct: 1705 SYFDEMKSFTKAFISRANIGPRL---TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLM 1761 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 P+ A+ A + +TS+ + + +I +TD ++ Sbjct: 1762 QQEGGPSQIGDALSFAVRYVTSEVHGARPGAS------------KAVVILVTDVSVDS-- 1807 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A+ N + + I I + +L Sbjct: 1808 ----VDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKAG 1845 >gi|1478046|gb|AAB05549.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ F+ ++ + + ++ Y + ++ + EKV D+ Sbjct: 1705 SYFDEMKSFTKAFISRANIGPRL---TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLM 1761 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 P+ A+ A + +TS+ + + +I +TD ++ Sbjct: 1762 QQEGGPSQIGDALSFAVRYVTSEVHGARPGAS------------KAVVILVTDVSVDS-- 1807 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A+ N + + I I + +L Sbjct: 1808 ----VDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKAG 1845 >gi|50950127|ref|NP_001002932.1| von Willebrand factor precursor [Canis lupus familiaris] gi|40786760|gb|AAB93766.2| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ F+ ++ + + ++ Y + ++ + EKV D+ Sbjct: 1705 SYFDEMKSFTKAFISRANIGPRL---TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLM 1761 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 P+ A+ A + +TS+ + + +I +TD ++ Sbjct: 1762 QQEGGPSQIGDALSFAVRYVTSEVHGARPGAS------------KAVVILVTDVSVDS-- 1807 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A+ N + + I I + +L Sbjct: 1808 ----VDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKAG 1845 >gi|12644030|sp|Q28295|VWF_CANFA RecName: Full=von Willebrand factor; Short=vWF; Flags: Precursor Length = 2813 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ F+ ++ + + ++ Y + ++ + EKV D+ Sbjct: 1705 SYFDEMKSFTKAFISRANIGPRL---TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLM 1761 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 P+ A+ A + +TS+ + + +I +TD ++ Sbjct: 1762 QQEGGPSQIGDALSFAVRYVTSEVHGARPGAS------------KAVVILVTDVSVDS-- 1807 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + A+ N + + I I + +L Sbjct: 1808 ----VDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKAG 1845 >gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 747 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 60/197 (30%), Gaps = 14/197 (7%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + K A+ L + + + + + + + + Sbjct: 346 TSGSMSGEAIVQAKKAMGYALAGLGARDSFNVIAFNSDVHALS----AQSLAATAKNIGR 401 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 T+ PA+ +A +F GV+ K ++F+TD Sbjct: 402 ANQFIRTLKADGGTEMGPALTRALDNGNHSTSHQDEEDFDSDGVR------LKQVLFMTD 455 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G N +S N I+ K ++ TI I A+PN + + + + Sbjct: 456 GAVANERSLFNLIED----KIGHSRLFTIGIGAAPNSHFMERAAEFGKGTFTYIGKLGEV 511 Query: 399 IHVFQNISQLMVHRKYS 415 +++ + H + + Sbjct: 512 QQKIESLLYKIEHPQVT 528 >gi|126730252|ref|ZP_01746063.1| hypothetical protein SSE37_10869 [Sagittula stellata E-37] gi|126708985|gb|EBA08040.1| hypothetical protein SSE37_10869 [Sagittula stellata E-37] Length = 177 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 23/53 (43%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + ++ A + + + AL + FLLLI + + + +E A + + Sbjct: 7 SFLRRFRAEDDGSMVVPIALWMPIFLLLIISSVELGTITVRSTVLERALDQTV 59 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 45/158 (28%), Gaps = 37/158 (23%) Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSD 308 +G+I + + + E V + D + + T A+ A Sbjct: 127 SGNRLGVILFAGEAYMQVPLTTDYEAVSLLLQSANPDMISSQGTSIGEALAIA------- 179 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + ++ +TDGE++ ++ + +KI TI Sbjct: 180 ------------QTNTSKSNGNRVVLVITDGEDHEARAEAQARQA----ARAGMKIFTIG 223 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 I S ++ + + ++IS Sbjct: 224 IG-------------SEEGGFVPFMDENGMADYKRDIS 248 >gi|313680435|ref|YP_004058174.1| von willebrand factor type a [Oceanithermus profundus DSM 14977] gi|313153150|gb|ADR37001.1| von Willebrand factor type A [Oceanithermus profundus DSM 14977] Length = 747 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 67/203 (33%), Gaps = 32/203 (15%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-T 281 +K+A +D+ +G++ + +R + P T + ++ T Sbjct: 345 GAPSKLARAVEGARKLVDAAGPED------TLGIVTFASRSRWLLAPKAMTYRAKREAET 398 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 R T A A + L ++I+ L+DG+ Sbjct: 399 RLDALEARGGTQLATAYAAAAEALEPL------------------DARTRWILVLSDGQL 440 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + T+ +A +K +T+++ A + L + ++ + +L V Sbjct: 441 ED--DPQRTLARARQAAARGVKTLTLALGADADRPFLARLAREGGGRFLDLADPAALPQV 498 Query: 402 FQNISQL-----MVHRKYSVILK 419 + + +V ++ + L+ Sbjct: 499 LALLGEEAFKPPVVEGRFRLQLR 521 >gi|296223008|ref|XP_002757441.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Callithrix jacchus] Length = 1289 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 49/169 (28%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V R + E + Q + D Sbjct: 521 SKLDLVKDKIIKFIQEQLKYKSRFNFVKFDGQAVAWREQLA---EVNEENLEQAQSWIRD 577 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A+K A+ + I LTDG Sbjct: 578 MKIGSSTNTLSALKTAFA-----------------------DKETQAIYLLTDGRP---- 610 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPNGQR--LLKTCVSSPEYHYN 391 + + D+ K I I TIS N + L + + + Sbjct: 611 -DQPPEMVIDQVKVFQEIPIYTISFNYNDEIANRFLKEIAALTGGEFHF 658 >gi|284991843|ref|YP_003410397.1| hypothetical protein Gobs_3425 [Geodermatophilus obscurus DSM 43160] gi|284065088|gb|ADB76026.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 298 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 9/199 (4%), Positives = 37/199 (18%), Gaps = 15/199 (7%) Query: 31 MSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL 90 + L + V + ++ ++ + + L+ D RA Sbjct: 8 VPILGFTAIAVDVGALYAERARLQ----------TAADAAALAVARDCARGAC-GDMRAT 56 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 D + + ++ + + + + + Sbjct: 57 AQDLVDANTGNATAAPPVLASDPVRVTVTGSTPTEHWFAPVIGHESTQVTATATVAWGAP 116 Query: 151 MTSYD---YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAM 207 + E + + + + + G Sbjct: 117 DAGTAMLPLIFSWCEFTAQTGGGLPSTT-TQRTIELPKKSDTGCTGPSNMFVPGGFGWLT 175 Query: 208 NSDPEDVNSAPICQDKKRT 226 + + + Sbjct: 176 TDGGNTCRATSRVGGRFTS 194 >gi|156393752|ref|XP_001636491.1| predicted protein [Nematostella vectensis] gi|156223595|gb|EDO44428.1| predicted protein [Nematostella vectensis] Length = 419 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 56/173 (32%), Gaps = 23/173 (13%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQ---YVTRDMDSLILKPTDSTPA 297 +++++ + + +I + + V + + S EK + T+ A Sbjct: 72 LNVIAVGFDATRVEIITFGSDVNRRVPFISEAHEKDTKCTFNEKFANVVHEWGMTNMRGA 131 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 ++AY++ N +I +TDG N N + + + Sbjct: 132 FEKAYEVCKGTWSGKKRLNIKT------------TVILITDGHWNWPWQNPDPVPKAQQL 179 Query: 358 KENFIKIVTISINASPNGQRLLKTCVS--SPEYH-YNVVNADSLIHVFQNISQ 407 ++I+ + + L + + + + + N D F ++ Sbjct: 180 IREGVEILAFGVGYGISLSNLQTITANQRAGHTYAFQISNFDE----FNKLAT 228 >gi|72679950|gb|AAI00088.1| LOC498793 protein [Rattus norvegicus] Length = 568 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 38/127 (29%), Gaps = 18/127 (14%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-- 344 T+ A+ +A IL + + + II ++DG+ Sbjct: 3 QPSGGTNINEALLRAIFILNEASN-----------MGLLNPDSVSLIILVSDGDPTVGEL 51 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV----VNADSLIH 400 K + + ++ I + ++ I + L + + + + L Sbjct: 52 KLSKIQKNVKQNIQD-NISLFSLGIGFDVDYDFLKRLSNENRGIAQRIYGNRDTSSQLKK 110 Query: 401 VFQNISQ 407 + +S Sbjct: 111 FYNQVST 117 >gi|47229797|emb|CAG06993.1| unnamed protein product [Tetraodon nigroviridis] Length = 1160 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 54/173 (31%), Gaps = 29/173 (16%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE-----PSWGTEKVRQYVTRDMDS 286 K +++ LD++ +V + + + + + + + M Sbjct: 182 KTSVMEMLDTLSDDDYVN------VARFNEKADAVVPCFRTLVQANVRNKKIFKEAVMHM 235 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A++ L ++ P K I+ TDG + + Sbjct: 236 QAKGTTDYKSGFTFAFEQLLNESSA-------------PRANCNKMIMMFTDGGEDRAQE 282 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 +K +++ T S+ L+ S+ Y++ + + ++ Sbjct: 283 IFEKYNWPNKT----VRVFTFSVGQHNYDVTPLQWIACSNKGYYFEIPSIGAI 331 >gi|326670656|ref|XP_003199260.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1401 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 43/371 (11%), Positives = 97/371 (26%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 651 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 708 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 709 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 761 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 762 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 809 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N + Sbjct: 810 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTY 866 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 867 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 914 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 915 QILVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 967 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 968 SVSDFSELPNV 978 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 51/183 (27%), Gaps = 20/183 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+K+ + ++ + + + ++ Y+ E N + T K + Sbjct: 17 DGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTYTTK-EE 72 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V + F + ++ L+ Sbjct: 73 IVDAVRGLRHKGGRPLYTG---------EALQFVRDNVFTASSGSRRLEGVPQILVLLSG 123 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G S + KE + T I + + R L+ P Y +V + L Sbjct: 124 GR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSVSDFSEL 176 Query: 399 IHV 401 +V Sbjct: 177 PNV 179 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 37/328 (11%), Positives = 102/328 (31%), Gaps = 31/328 (9%) Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 +A +F+++++ + SG + +I +++ R + +D++ +++ Sbjct: 88 LYTGEALQFVRDNVFTASSGSRRLEGVPQILVLLSGGRSF------DSVDAAASSLKELG 141 Query: 149 DVMTSYDYRLQFIEHL-LNQRYNQKIVSFIPALLRIEMGERPIFLIEL----VVDLSGSM 203 + R L +S + E+ + +++ V S ++ Sbjct: 142 VLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSSTV 201 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 ++ +DV D R+ A+++ + ++ + + ++ Y+ Sbjct: 202 TAEYSTPRKDVVFLLDGSDGTRSSFPAMRDFVQRVVEKFN---IEANRDRVSVVQYSRDA 258 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E + + K + R + + F Sbjct: 259 EVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG---------AALQYVRDNVFTASSGS 308 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV 383 + +I L+ G S + KE + T+ I + + R L+ Sbjct: 309 RRLEGVPQILILLSGGR-----SFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRIS 361 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLMVH 411 Y +V + L +V + + + Sbjct: 362 YEANYALSVADFSELPNVQEQLLASVQT 389 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 927 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 986 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 987 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 1034 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 1035 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVKGLRHKGGRPLYTG 1090 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 1091 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 1136 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 1137 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 1188 >gi|321466601|gb|EFX77596.1| hypothetical protein DAPPUDRAFT_30003 [Daphnia pulex] Length = 3316 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 31/85 (36%), Gaps = 12/85 (14%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 K + +TDG +N + +E ++I T I + L S Sbjct: 5 ARSEAVKAVFLITDGYSNGGD----PRPAAKRLREQGVRIFTFGIRNGN--VKELYDMAS 58 Query: 385 SP--EYHYNVVNADSLIHVFQNISQ 407 P E+ Y V + + F+ +++ Sbjct: 59 EPAQEHSYIVDSFEE----FEALAR 79 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 48/190 (25%), Gaps = 23/190 (12%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVRQYVTR 282 KM K AL L + H + + ++ K +R Sbjct: 381 GNKMKQTKEALFTILKDLRPQDHFNIIGFSKRVKVWQQNQMVKVSP-----NNIRDAKKF 435 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ + Q+L + + +IFLTDG Sbjct: 436 IYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS-----------LMIFLTDGRAT 484 Query: 343 NFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN----ADS 397 + + + A + + +I + L K + + + A Sbjct: 485 IGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQ 544 Query: 398 LIHVFQNISQ 407 L + I Sbjct: 545 LKGFYDEIGT 554 >gi|225874357|ref|YP_002755816.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] gi|225791485|gb|ACO31575.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] Length = 333 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 49/137 (35%), Gaps = 3/137 (2%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 +A FL+S+ + ++ + V+ +PS K +Q +++ + + Sbjct: 64 HASKQFLNSLLSAHASGSAPRVFVVQFNRDVDLLEDPSASASKAQQALSQVGVAQFHGDS 123 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 ++ + + + + + + + +K I+ L+DG + K Sbjct: 124 NADSTANKG-RHAKASGAALYDAIYLATHNVLNTTAGRKVIVVLSDGIDQGSK--TTLNG 180 Query: 353 ICDKAKENFIKIVTISI 369 + A+ + + I Sbjct: 181 AVEAAQRAGVAVYAIYF 197 >gi|167924336|ref|ZP_02511427.1| hypothetical protein BpseBC_37618 [Burkholderia pseudomallei BCC215] Length = 396 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 74/319 (23%), Gaps = 20/319 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I+ + + S I + S I +R N Sbjct: 61 ITAGHLNHALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVN 120 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDL 199 I V + + IP + P+ + Sbjct: 121 WFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGD 180 Query: 200 -------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +G+ + +S + A + + D + Sbjct: 181 WLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTN 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DSTPAMKQA 301 + Y + + + D + + T + Sbjct: 241 AYNTRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITG 300 Query: 302 YQILTSDKKRSFFTNFFRQ 320 S + R+ Sbjct: 301 INTGGSYNPSYYAAGADRR 319 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 48/190 (25%), Gaps = 23/190 (12%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVRQYVTR 282 KM K AL L + H + + ++ K +R Sbjct: 349 GNKMKQTKEALFTILKDLRPQDHFNIIGFSKRVKVWQQNQMVKVSP-----NNIRDAKKF 403 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ + Q+L + + +IFLTDG Sbjct: 404 IYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS-----------LMIFLTDGRAT 452 Query: 343 NFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN----ADS 397 + + + A + + +I + L K + + + A Sbjct: 453 IGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQ 512 Query: 398 LIHVFQNISQ 407 L + I Sbjct: 513 LKGFYDEIGT 522 >gi|170720024|ref|YP_001747712.1| hypothetical protein PputW619_0838 [Pseudomonas putida W619] gi|169758027|gb|ACA71343.1| conserved hypothetical protein [Pseudomonas putida W619] Length = 651 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 12/132 (9%), Positives = 36/132 (27%), Gaps = 13/132 (9%) Query: 15 IASEKANFSIIF----ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMV 69 A ++ ++ AL+++ L++ + + ++ ++ + A L A V Sbjct: 6 AARQRGAIGLMAVTTLALALLFMLVV----VDSGRLYLEQRKLQRIADMAALEAAGQSAV 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + A+ + A T + + I + Sbjct: 62 CTGNGPQATAIASVAAARNGHTPGS----PLVASCGYLQTGANSLRTFTSDNARNEAIRV 117 Query: 130 THMANNRLDSSN 141 + Sbjct: 118 DVSNTVTTSFAA 129 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 48/190 (25%), Gaps = 23/190 (12%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVRQYVTR 282 KM K AL L + H + + ++ K +R Sbjct: 330 GNKMKQTKEALFTILKDLRPQDHFNIIGFSKRVKVWQQNQMVKVSP-----NNIRDAKKF 384 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T+ + Q+L + + +IFLTDG Sbjct: 385 IYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKSVS-----------LMIFLTDGRAT 433 Query: 343 NFK-SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN----ADS 397 + + + A + + +I + L K + + + A Sbjct: 434 IGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDAASQ 493 Query: 398 LIHVFQNISQ 407 L + I Sbjct: 494 LKGFYDEIGT 503 >gi|116249975|ref|YP_765813.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115254623|emb|CAK05697.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 193 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 10/112 (8%), Positives = 30/112 (26%), Gaps = 8/112 (7%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ K +I FA+ ++L + + A Sbjct: 5 NPFTRLVLTMRRLARDRKGAGAIEFAILFPVLVMLYIGAFEITVGLSVSKRVTRA----- 59 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A + +++ +S + + ++G + Sbjct: 60 ---AGTVADLVTQQQSVTKSALAQMPSVATAIFVPYNSTSLTLKITGITIDA 108 >gi|254418896|ref|ZP_05032620.1| hypothetical protein BBAL3_1206 [Brevundimonas sp. BAL3] gi|196185073|gb|EDX80049.1| hypothetical protein BBAL3_1206 [Brevundimonas sp. BAL3] Length = 186 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 36/121 (29%), Gaps = 8/121 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R ++ E ++ FAL LL ++ + + A AS Sbjct: 3 RSLIRRLAGDESGVSAVEFALIAPVMLLFYAGMVDLCQGYMALKRTSHA--------ASA 54 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + +S+ ++ N F +++ +S + V N N S Sbjct: 55 VADLVSQSRTITKADINSIFEVGPAIMAPFASTSMEQRISSVTRVSANKYTLNWSRSWTP 114 Query: 128 S 128 Sbjct: 115 D 115 >gi|17537921|ref|NP_496259.1| C-type LECtin family member (clec-60) [Caenorhabditis elegans] gi|3881710|emb|CAA88985.1| C. elegans protein ZK666.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 406 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 59/200 (29%), Gaps = 27/200 (13%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V+++ + T +AA +++ I +GL+ Y + + Sbjct: 48 VDNSIGMTNGGLTSIAANIASVVSSGTRIGTNPSEPRTTRLGLVTYNKAAAIQADLNQYQ 107 Query: 274 EKVRQYVT---RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 Y + + + +A +L + K+ +Q Sbjct: 108 SLDDVYDNVFRALSSVSTSEESYLANGLARAEDVLEAGKQG------------YNRTHYQ 155 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEY 388 + +I + + + ++ K + + ++T++ + +G L L S P Sbjct: 156 RVVIVYASAYKGSGALDPVP--VAERLKTSGVTVITVAYDQDGDGALLADLAKIASPP-- 211 Query: 389 HYNVVNAD------SLIHVF 402 + D + Sbjct: 212 YNFTNTEDNGQVIGEIQDAL 231 >gi|126173305|ref|YP_001049454.1| type IV pilin biogenesis protein [Shewanella baltica OS155] gi|125996510|gb|ABN60585.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS155] Length = 1168 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/290 (10%), Positives = 69/290 (23%), Gaps = 66/290 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V +++ ++ A K +++ K A+ + S + Sbjct: 208 AAANNTDFGVGQPVTLYTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDF 267 Query: 248 V--KEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQI 304 + G + T+ R +T + L T M +AY+ Sbjct: 268 GLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRY 327 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQK-------------------FIIFLTDGENN--- 342 + + + P+ ++I++TDG Sbjct: 328 FAGKGVKYGHGDTDYGSYVGNNPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDK 387 Query: 343 -----------------NFKSNVNTIKICDKAKE----------------------NFIK 363 N+ S + ++ Sbjct: 388 NANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQVQNVR 447 Query: 364 IVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI A L +T ++ N+ L + + +++ Sbjct: 448 TYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|319937918|ref|ZP_08012318.1| hypothetical protein HMPREF9488_03154 [Coprobacillus sp. 29_1] gi|319806824|gb|EFW03463.1| hypothetical protein HMPREF9488_03154 [Coprobacillus sp. 29_1] Length = 134 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 31/116 (26%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I E ++ AL L LI F I V ++ K + N AI A++ + Sbjct: 3 FIRDESGETTVEMALLFPVILGLIMFAIGVNMFYEGKIATSVGANEAIRYAATQTSYENA 62 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + + + + I+ Sbjct: 63 KREANERLKAIYTQHNIKLTKFSLTHIDVNRDGKYSIGDKLVLNIETKKGIWSEYT 118 >gi|115687249|ref|XP_792282.2| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 2422 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 20/189 (10%), Positives = 60/189 (31%), Gaps = 34/189 (17%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-------QYV 280 + N L + + + + +I ++++ + + + + Sbjct: 214 IENFFNELKFVKKLLADFTVAPDATRVAIITFSSKHRVELNVNQLDNSIAGKHHHKCALL 273 Query: 281 TRD--MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 D + + T + A + A ++L + S + + LTD Sbjct: 274 NEDLPKITYVGGGTYTKGAFELAKKVLQGARANS-----------------TQAVFLLTD 316 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G +N + + K++ +++ + I + L E+ Y + + Sbjct: 317 GLSNGPN----PVPVAVSLKDDGVEVFSFGIRDGYIPELLQMASEKKDEHCYILDSFAE- 371 Query: 399 IHVFQNISQ 407 F+ +++ Sbjct: 372 ---FEALAR 377 >gi|149028000|gb|EDL83451.1| complement factor B, isoform CRA_d [Rattus norvegicus] Length = 543 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 69/220 (31%), Gaps = 39/220 (17%) Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 GSM+ M D D A K L ++ + GL+ Y Sbjct: 46 GSMNIYMVLDGSDSIGASN--------FTGAKRCLANLIEKVASYGVKPRY---GLVTYA 94 Query: 261 TRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 T + + S + V + + + D + T++ A++ Y +++ Sbjct: 95 TVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYSMMS-------- 146 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT----IKICDKAKENF------IKI 364 II +TDG +N V + D ++ + + Sbjct: 147 WPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDV 204 Query: 365 VTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + P L + ++ ++ + V + + L +VF Sbjct: 205 YVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVF 244 >gi|84997465|ref|XP_953454.1| hypothetical protein [Theileria annulata strain Ankara] gi|65304450|emb|CAI76829.1| hypothetical protein, conserved [Theileria annulata] Length = 391 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 46/165 (27%), Gaps = 3/165 (1%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 V D G + K+ ++ S T T A+ + + Sbjct: 124 VYNDWTDSCFGGDNSNSVDTNLKLMWNKIIDFINE-GLSYPDSDTQLTSAISMGFLCINY 182 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 +F + +K IIF N + + I A I I T Sbjct: 183 SVYINFSFINLNRIKCTNEGFGRKIIIFDISNSENYKSQYIGLMNIAYSALSQNITINTF 242 Query: 368 SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 ++ + L + + + V + F +I+Q + Sbjct: 243 ALGQP--SRILEQLSAITNAKYLLVSKLLNFDLNFNHINQYLTQL 285 >gi|32472885|ref|NP_865879.1| hypothetical protein RB4059 [Rhodopirellula baltica SH 1] gi|32444122|emb|CAD73564.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 156 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 9/104 (8%), Positives = 31/104 (29%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + A+ + FA+ LL I + + + +N + A A + Sbjct: 22 RRAATRDGATLVEFAIVCNVLLLTIFMCMELARMNMARNLAQDAAYYAARTAIVPGATAD 81 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 + + + + + + + +++ ++ Sbjct: 82 EAIAEAETIMESLFASGYDVECTPINDDTEEVTVTVSLSLDDVA 125 >gi|332872319|ref|XP_003319171.1| PREDICTED: collagen alpha-2(VI) chain isoform 2 [Pan troglodytes] Length = 828 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|332872317|ref|XP_003319170.1| PREDICTED: collagen alpha-2(VI) chain isoform 1 [Pan troglodytes] Length = 918 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|332872315|ref|XP_531504.3| PREDICTED: collagen alpha-2(VI) chain isoform 3 [Pan troglodytes] Length = 1019 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|332256729|ref|XP_003277468.1| PREDICTED: collagen alpha-2(VI) chain [Nomascus leucogenys] Length = 1124 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 736 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 795 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 796 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 853 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 854 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 894 Query: 370 NA 371 Sbjct: 895 GD 896 >gi|297287371|ref|XP_001099130.2| PREDICTED: collagen alpha-2(VI) chain-like isoform 3 [Macaca mulatta] Length = 1029 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 611 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 670 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 671 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 728 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 729 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 769 Query: 370 NA 371 Sbjct: 770 GD 771 >gi|296232327|ref|XP_002807820.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Callithrix jacchus] Length = 1018 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|119629723|gb|EAX09318.1| collagen, type VI, alpha 2, isoform CRA_c [Homo sapiens] Length = 1019 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|119629721|gb|EAX09316.1| collagen, type VI, alpha 2, isoform CRA_a [Homo sapiens] Length = 828 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|119629722|gb|EAX09317.1| collagen, type VI, alpha 2, isoform CRA_b [Homo sapiens] Length = 918 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|115527066|ref|NP_478054.2| collagen alpha-2(VI) chain isoform 2C2a precursor [Homo sapiens] Length = 918 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|115527070|ref|NP_478055.2| collagen alpha-2(VI) chain isoform 2C2a' precursor [Homo sapiens] Length = 828 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|115527062|ref|NP_001840.3| collagen alpha-2(VI) chain isoform 2C2 precursor [Homo sapiens] gi|125987812|sp|P12110|CO6A2_HUMAN RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor Length = 1019 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|78776856|ref|YP_393171.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497396|gb|ABB43936.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 595 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 68/219 (31%), Gaps = 56/219 (25%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + + +D ++ + K L FL +G++ + S+ Sbjct: 93 DISDSMLAEDVYPKRLESAKRKALAFLKEAKDE-------RVGVVAFAKDSYLVSPLSFD 145 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + + T T + S F + KI +K Sbjct: 146 KHSVSFLLEQLDT------TSITE-------------QGSDFLSVIGAVEKIQKDEKKKV 186 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQR----------- 377 ++ L+DG + + +I AK++ I I + I A + Sbjct: 187 LLILSDG----GDKSDFSEEIA-LAKKSGITIFILGIATKQGAPIKREDGTFIKYNDEII 241 Query: 378 ---LLKTCVS----SPEYHYNVVNADSLIHVFQNISQLM 409 L + S + + + + SL + + I + + Sbjct: 242 ISKLNENIASLATKTGGVY--IESTTSLEDIKRVI-KEI 277 >gi|41350923|gb|AAH65509.1| Collagen, type VI, alpha 2 [Homo sapiens] gi|190690005|gb|ACE86777.1| collagen, type VI, alpha 2 protein [synthetic construct] gi|190691377|gb|ACE87463.1| collagen, type VI, alpha 2 protein [synthetic construct] Length = 1019 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|12803331|gb|AAH02484.1| COL6A2 protein [Homo sapiens] gi|30582665|gb|AAP35559.1| collagen, type VI, alpha 2 [Homo sapiens] Length = 425 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 7 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 66 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 67 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 124 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 125 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 165 Query: 370 NA 371 Sbjct: 166 GD 167 >gi|30584073|gb|AAP36285.1| Homo sapiens collagen, type VI, alpha 2 [synthetic construct] Length = 426 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 7 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 66 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 67 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 124 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 125 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 165 Query: 370 NA 371 Sbjct: 166 GD 167 >gi|179711|gb|AAA35620.1| alpha-2 collagen type VI-a' [Homo sapiens] Length = 429 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 11 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 70 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 71 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 128 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 129 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 169 Query: 370 NA 371 Sbjct: 170 GD 171 >gi|179710|gb|AAA35619.1| alpha-2 collagen type VI-a [Homo sapiens] Length = 328 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 11 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 70 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 71 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 128 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 129 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 169 Query: 370 NA 371 Sbjct: 170 GD 171 >gi|13603394|gb|AAA52056.2| type VI collagen alpha 2 chain precursor [Homo sapiens] Length = 1019 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|105706|pir||C35243 collagen alpha 2(VI) chain precursor, short splice form - human (fragment) gi|179709|gb|AAA35618.1| alpha-2 collagen type VI [Homo sapiens] Length = 238 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 11 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 70 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 71 GVVQYSHEGTFEAIQLDDEHIDSLSSFKEAVKNLE--WIAGGTWTPSALKFAYDRLIKES 128 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 129 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGI 169 Query: 370 NA 371 Sbjct: 170 GD 171 >gi|307719356|ref|YP_003874888.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] gi|306533081|gb|ADN02615.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] Length = 331 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/223 (11%), Positives = 65/223 (29%), Gaps = 56/223 (25%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D++ + + +D +++ K L+ + I G++ + + E + + Sbjct: 94 FDISRSMLVRDVPPSRLEVAKEIALMLVSRI-------SGARWGVVAFKGKGELLLPLTP 146 Query: 272 GTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + L TD + +A + Q Sbjct: 147 DLLGLEDAIGLLTPVLLRSPGTDVASGLSRALEAF------------------PQQSNRQ 188 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP----------------- 373 + +I L+DGE + + A+ + + T+ I Sbjct: 189 RLVILLSDGEALTGEIGPVL----ELARNLGVAVHTVGIGTESGGPVPLEGEDVLKKPSG 244 Query: 374 -------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 + L + + ++V +A+ FQ++ + Sbjct: 245 EPVISRLDASLLKRIAEITGGRFFSVRDAEG--QTFQHVVSTI 285 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 67/198 (33%), Gaps = 13/198 (6%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPE-DVNSAPICQDKKRTKMAALKNALLLFL 239 +I + + +D+S S N++ + ++++ + K+A +K A+ L Sbjct: 1225 GQISVSIEDDQPVAKNIDISVSPETKSNTNVQLILDTSGSMSNSSNGKLAIMKAAVSKML 1284 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 D + V+ + LI + +R + V + TD A+K Sbjct: 1285 DQYHDMGDVR----VQLIDFNSRSTRLEFNGRAWMTVSEAKYLVNRLTAGGGTDYDDAVK 1340 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A Q D+ P DG ++ S+ K + Sbjct: 1341 KARQSWDHDEHLQLDNANNVSYFISDGKPQ--------DGHDDATISDNEETKWANHLIS 1392 Query: 360 NFIKIVTISINASPNGQR 377 N I +I IN+S + Sbjct: 1393 NGITSQSIGINSSGSLND 1410 >gi|331006793|ref|ZP_08330059.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC1989] gi|330419411|gb|EGG93811.1| TPR domain protein in aerotolerance operon [gamma proteobacterium IMCC1989] Length = 668 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 21/212 (9%), Positives = 65/212 (30%), Gaps = 28/212 (13%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D++ + D K +++ + + L L+ Y + Sbjct: 127 DLSPSMRATDNKPSRIVRARLKIKDLLSQRKDG-------LTALVVYAGEAHTVTPLTDD 179 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS----- 326 T+ + ++ L + ++ AM A ++ + G+ + Sbjct: 180 TKTIANLLSTLTPGLLPIPGSNIEMAMTMATTLVADSGIQHASYVVLTDGIDPKAINSIE 239 Query: 327 ----LPFQKFIIFLTDG------ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 + + + G ++ + ++ + ++ I I A+ N Sbjct: 240 RTLEKQAHQSLFIMGIGTPEGAPIPSDGSDSNSSQNKSEFLRDGNNNI----IQAARNDA 295 Query: 377 RLLKTCVSSPEYHYNVV-NADSLIHVFQNISQ 407 L + + ++ + + D + + Q+I + Sbjct: 296 VLQEVASAVNGHYLPLQASDDDIDFILQSIER 327 >gi|330834711|ref|YP_004409439.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566850|gb|AEB94955.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 379 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 59/192 (30%), Gaps = 41/192 (21%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K + S+ ++ Y L+ + +V E + ++ + Sbjct: 54 KLQMAKEGANELVSSL------PQESYFSLLAFDEKVSIL------KEHSQSHLIDIDEI 101 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN--- 343 T A+++A ++ + I LTDG + Sbjct: 102 KAGSGTSLYKALEEASKLAE-------------------RYRQPSYFILLTDGVPTDRGC 142 Query: 344 --FKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 S ++ C + + +++++ I N + L + Y++ + Sbjct: 143 THGLSRKFDLERCLPVYQGLSLPHNVQVISFGIGQDYNEKILSLISEKGNGFFYHIKDPK 202 Query: 397 SLIHVFQNISQL 408 ++ +++ Sbjct: 203 EIVEKMPKLAKS 214 >gi|115665362|ref|XP_001180845.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] gi|115941435|ref|XP_001179810.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 763 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 60/173 (34%), Gaps = 35/173 (20%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDSTPAMKQAYQ 303 S V + Y+ ++ + + + T + R+ + + + T ++ A Q Sbjct: 195 QSIVPNNSYVAIVEFNYGAIVDSNMTELTSAISRKDLASLLPTYADGATCIGCGIQTAIQ 254 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + S ++I L+DG+ N+ +T+ D +++ + Sbjct: 255 VAQYNGMDSRGV----------------YLILLSDGQENSGTLIADTL---DDIEDSGVI 295 Query: 364 IVTISI-NASPNGQRLLKT-------CVSSPEYHYNVVNADSLIHVFQNISQL 408 + +I+ A + L + C +A +I F++I Sbjct: 296 VHSIAFYEADTQLEDLAQMTGGISATCADGG-------SAQCVISAFESIIAQ 341 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 10/64 (15%), Positives = 20/64 (31%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 D+S N I L + +R+N+ I+ + I + ++E Sbjct: 149 EDTDTSPNFIVVQPSGSLRIVLVLDTSGSMDGERFNKMIIGAKNFIQSIVPNNSYVAIVE 208 Query: 195 LVVD 198 Sbjct: 209 FNYG 212 >gi|271962857|ref|YP_003337053.1| hypothetical protein Sros_1312 [Streptosporangium roseum DSM 43021] gi|270506032|gb|ACZ84310.1| hypothetical protein Sros_1312 [Streptosporangium roseum DSM 43021] Length = 156 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 15/143 (10%), Positives = 38/143 (26%), Gaps = 4/143 (2%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 SR R + A E+ + S+ + ++ LL G L+ + A A Sbjct: 9 TSRVR---RGSAARERGSMSVFTVIFSVAVFLLAGLLVDGGAAINSRLRAADIAEQAARA 65 Query: 64 GASKMVSNLSRLGDRFESISNHAK-RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A ++ R + + + + ++ + ++ V Sbjct: 66 AADQIDVEHLRSTGQARLMEEGTICAKAEEVVSAHATDDVQLAECTAGGGQAEVTVRVSV 125 Query: 123 NSSRISMTHMANNRLDSSNNTIF 145 + + + Sbjct: 126 SWEAFFLAAFGFPGSEMEAEATA 148 >gi|327539547|gb|EGF26157.1| TadE-like protein [Rhodopirellula baltica WH47] Length = 154 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 39/128 (30%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + FA+ + FLLL+ + + + +A GA + + R Sbjct: 25 GATATEFAIVLPMFLLLVFACCDFARVIHFRQLVANAARVGATHGALNRFTAATESDWRS 84 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + ++ + + + I S+ E + + + + + Sbjct: 85 DVVNVMREELAHLTSTDPSDSVIDVHFRDLSSGVRVVETEVTLPFRTVVQWPVLPTEIQV 144 Query: 140 SNNTIFYN 147 ++ F+ Sbjct: 145 HHHAEFHQ 152 >gi|89069886|ref|ZP_01157220.1| hypothetical protein OG2516_06277 [Oceanicola granulosus HTCC2516] gi|89044562|gb|EAR50681.1| hypothetical protein OG2516_06277 [Oceanicola granulosus HTCC2516] Length = 178 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 10/62 (16%), Positives = 22/62 (35%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 + + R ++ E N ++ F L FL L+ + +E + ++ Sbjct: 1 MTRQARSALRRLWRDEDGNPTVEFVLVFPLFLTLMISAFESGILMTRHMMLERGLDISVR 60 Query: 63 AG 64 A Sbjct: 61 AI 62 >gi|154489100|ref|ZP_02029949.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] gi|154083237|gb|EDN82282.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] Length = 560 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 61/192 (31%), Gaps = 24/192 (12%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S + K +A L+ AL + +DV + + ++ + Sbjct: 389 DYSGSMSGKGKSGAVAGLQAALDTDQARASHIEPGDDDVNVFIPFNSSAKVAQV---AQG 445 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 ++ + + + D A++ A + L + I Sbjct: 446 KQTATLLAASENQVANGNADIYNALEVALKNL-----------------PSDRDDYTVAI 488 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 LTDG+++ K + + K + I +I + Q+L S ++ Sbjct: 489 ALLTDGQSDTAKLDEFKQQYASDGK--GVPIFSIMFG-DADSQQLDDLAKLSNGKVFDGR 545 Query: 394 NADSLIHVFQNI 405 N + L +F+ + Sbjct: 546 NGN-LSGIFREV 556 >gi|269125512|ref|YP_003298882.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310470|gb|ACY96844.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 814 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 63/199 (31%), Gaps = 29/199 (14%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 T+M A L + S + + V+ + ++ + E+V R + Sbjct: 632 TRMQATARVAQGGLSLLPNDSELGQWVFSTKLDGDKDYKEIVPMGPLGERVGSVTRRQLL 691 Query: 286 SL--------ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T + AY+ ++ K F I+ T Sbjct: 692 LSSLSRIEPKPTGDTGLYDTILAAYRYMS----------------KTYKPEFGNSILLFT 735 Query: 338 DGENNNFKSNVN--TIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 DG+N + T++ + + I+++ + + L + + Y Sbjct: 736 DGKNEDDDGPTLRQTLRELESMIDPTRPIQVIMLGFGPGVDVNELKQIAKVTRGDVYVTQ 795 Query: 394 NADSLIHVF-QNISQLMVH 411 N + + +F Q +S+ M + Sbjct: 796 NPNEIQKIFLQALSKRMAN 814 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 19/137 (13%), Positives = 43/137 (31%), Gaps = 32/137 (23%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---------VRQYVTRDMDSLILKPT 292 + V + ++ ++++ + + + + + + T Sbjct: 112 LSDFPVVPTATRVAIVTFSSKNNVVPRVDYISSRRAHQHKCALLSREIPAIT--YRGGGT 169 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + A +QA QIL ++ S K I +TDG +N Sbjct: 170 YTKGAFQQAAQILRHSRENS-----------------TKVIFLITDGYSNGGD----PRP 208 Query: 353 ICDKAKENFIKIVTISI 369 I ++ ++I T I Sbjct: 209 IAASLRDFGVEIFTFGI 225 >gi|300781256|ref|ZP_07091110.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] gi|300532963|gb|EFK54024.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] Length = 528 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 54/188 (28%), Gaps = 30/188 (15%) Query: 227 KMAALKNALLLFLD----SIDLLSHVKEDVYMGLIGYTTRVEKNI---EPSWGTEKVRQY 279 ++A+L+ + +D + +++ + ++T + + +Y Sbjct: 358 RIASLQQIMTSLIDGSAATATGDVALRDGELVTFQSFSTAPHEPLLGEFLRDDRITKAKY 417 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 D + T + A + + S I+ L+DG Sbjct: 418 QGYVNDLVADGQTAIYDTLFDALRSSDPNAGIS-------------------SIVLLSDG 458 Query: 340 ENNNFKSNVNTIKICDKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 E + K + I + I N + + ++ +N D Sbjct: 459 EVTHGMDYYAFEKQYQGLSPEQRSIPVFVILYG-EANASEMNNLAELTGGAVFDALNGD- 516 Query: 398 LIHVFQNI 405 L F+ I Sbjct: 517 LDAAFKEI 524 >gi|168213186|ref|ZP_02638811.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] gi|170715195|gb|EDT27377.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] Length = 1349 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 99/365 (27%), Gaps = 35/365 (9%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +L + N + + K D+ K F I + + Sbjct: 76 VLVLDTSQTVNEKQGDIEKAANDFVNKFLDSDNYKNFKIGIISYNEKAEVVQELTMDKNK 135 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 I +S S ++ +N ++ +RL + N++ + + Sbjct: 136 IKSSIADSYKDAIKDKRSYNNWLNGNHLGTNVGDAFRLAISMLGKDSNPNKEKIVIFMSN 195 Query: 181 LRIEMGERP-IFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 + E + + + + + ++ ++LKN + Sbjct: 196 GKPNAYTGKFYKGHESTLTENQINSLKDIACKRLYDITSLNGNENELGKSSLKN--EKII 253 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + ID L Y + + T+ + + K T+ Sbjct: 254 NKIDNY-KEYFYNPESLDEYRKQDFQKRGLRTNTDNDKSKFNNLI-LNEYKITN------ 305 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 + N + V P+ K I G + ++K+ + K+ Sbjct: 306 ------ADSYINKIYGNPYCINVCPPNNQGGKNAICYYMGSEIGY---AYSLKMAKEMKD 356 Query: 360 NFIKIVTISIN----ASPNGQRLLKTC-----------VSSPEYHYNVVNADSLIHVFQN 404 I I + S + +K V+S Y+ SL VF N Sbjct: 357 LGISTYPIYFDTKSKDSKQRENTMKLISKFAGNNENDGVTSVNTVYSNDITASLNDVFTN 416 Query: 405 ISQLM 409 + + + Sbjct: 417 LDKNI 421 >gi|78189842|ref|YP_380180.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172041|gb|ABB29137.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 334 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 48/151 (31%), Gaps = 16/151 (10%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 + N L+ I +S + GL+ + + + + + Sbjct: 98 SRSMQATDVAPNRLMRAKQEIAAISQNVQGGRRGLLIFAASPLLHCPLTTDRDGFATLLN 157 Query: 282 -RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + + + T PA A I + + S + I+ L+DGE Sbjct: 158 MAAPELIEEQGTRLQPAFALASTIFDVANESNA-----------ASTRGVQVIVLLSDGE 206 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ + N + + + +++ I + + Sbjct: 207 DH----DSNVQRAAQQLAKQSVQLFVIGVGS 233 >gi|160931316|ref|ZP_02078716.1| hypothetical protein CLOLEP_00153 [Clostridium leptum DSM 753] gi|156869700|gb|EDO63072.1| hypothetical protein CLOLEP_00153 [Clostridium leptum DSM 753] gi|314974315|gb|EFT18411.1| TadE-like protein [Propionibacterium acnes HL053PA1] gi|314977802|gb|EFT21896.1| TadE-like protein [Propionibacterium acnes HL045PA1] gi|315081091|gb|EFT53067.1| TadE-like protein [Propionibacterium acnes HL078PA1] gi|315090034|gb|EFT62010.1| TadE-like protein [Propionibacterium acnes HL072PA1] gi|315096617|gb|EFT68593.1| TadE-like protein [Propionibacterium acnes HL038PA1] gi|327325745|gb|EGE67540.1| putative membrane protein [Propionibacterium acnes HL096PA2] gi|327330167|gb|EGE71917.1| putative membrane protein [Propionibacterium acnes HL096PA3] gi|327333253|gb|EGE74974.1| putative membrane protein [Propionibacterium acnes HL097PA1] gi|327443402|gb|EGE90056.1| TadE-like protein [Propionibacterium acnes HL043PA1] gi|327447494|gb|EGE94148.1| TadE-like protein [Propionibacterium acnes HL043PA2] gi|327456785|gb|EGF03440.1| TadE-like protein [Propionibacterium acnes HL087PA3] gi|328761106|gb|EGF74653.1| putative membrane protein [Propionibacterium acnes HL099PA1] Length = 133 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 8/135 (5%), Positives = 29/135 (21%), Gaps = 12/135 (8%) Query: 14 GIASEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 E+ + S+ L V L++ G + A + + ++ Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQISA----------TQAAQDAAVAAS 50 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + A + + + + ++ Sbjct: 51 RAGTDAAATPQLAGHDGAAVAVQAARQALSAAGVDGSVQEDGSTITVTTSQSRPTVFLSA 110 Query: 132 MANNRLDSSNNTIFY 146 + +++ Sbjct: 111 IGISQVRGHGQAHAQ 125 >gi|257415126|ref|ZP_05592120.1| predicted protein [Enterococcus faecalis AR01/DG] gi|257156954|gb|EEU86914.1| predicted protein [Enterococcus faecalis ARO1/DG] Length = 711 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 68/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 129 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 188 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 189 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 243 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ + +D + + + S Sbjct: 244 KKMLVHIGDGSANASFLPRENAQSYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 303 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 304 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 342 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 343 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 383 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 13/150 (8%), Positives = 46/150 (30%), Gaps = 15/150 (10%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK 290 +++ + + H+ + + G + + ++ + + Sbjct: 850 VEDVAQQLMLARSSSDHMNARIALLQYGSERDQDVVFPLTHNLTEISDALAQIKYLDS-- 907 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 ++ A+ A + G ++ + +F+TDG + Sbjct: 908 SSNIGSAIIHAINNI---------VLSPGNGQRLARRNAELSFVFITDGITGSKN----L 954 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLK 380 + + K+ + +++ + + LLK Sbjct: 955 EEAINSMKKQDVMPTVVALGSDVDMDVLLK 984 >gi|22219356|pdb|1M1U|A Chain A, An Isoleucine-Based Allosteric Switch Controls Affinity And Shape Shifting In Integrin Cd11b A-Domain Length = 195 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 62/200 (31%), Gaps = 25/200 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 P++ + D + + + F+ ++ K L+ Y+ + Sbjct: 7 GSPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEEFRIHFT 65 Query: 269 PS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 R V L T + +++ + L + + F Sbjct: 66 FKEFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF--------- 114 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCV 383 K ++ +TDGE + + +A + I + S ++ L T Sbjct: 115 ----KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIA 168 Query: 384 SSP--EYHYNVVNADSLIHV 401 S P ++ + V N ++L + Sbjct: 169 SKPPRDHVFQVNNFEALKTI 188 >gi|221121786|ref|XP_002165500.1| PREDICTED: similar to procollagen, type XIV, alpha 1 [Hydra magnipapillata] Length = 3126 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 53/158 (33%), Gaps = 27/158 (17%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 +E+ ++GL+ Y T + + + + + + A+ A + L Sbjct: 788 DQENSHLGLVTYATDAQIMLNFHHFNDPDTLTEARDAVRVK-PHTGKYTGQALSLAKEGL 846 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +I +TDG S+ + + ++ +KI+ Sbjct: 847 FDKGH---------------RSDALDVLILMTDG-----PSSDDVTEPSRALRDMGVKII 886 Query: 366 TISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV 401 + I + ++ L S P+ + + + D+L + Sbjct: 887 AVGIGNQID-RKQLNDIASDPDDEHVFTA-DYDNLGTI 922 >gi|197118197|ref|YP_002138624.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087557|gb|ACH38828.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 318 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 55/160 (34%), Gaps = 30/160 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + +D ++++AA K A+ + + +GL+ + + Sbjct: 88 DTSKSMLTRDLGQSRLAAAKEAVRQAMAGLK-------GERVGLVVFAGSAFLVCPLTTD 140 Query: 273 TEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 Q + ++L L T A+K+A + L + K Sbjct: 141 YALFDQVLKEAGEETLPLPGTSLAAALKEARRALQGEGDE------------------PK 182 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ L+DGE++ + + +K+ ++ Sbjct: 183 VVVLLSDGEDHEGEYVAAARALNAA----GVKLYAVAAGT 218 >gi|115378470|ref|ZP_01465629.1| CglB [Stigmatella aurantiaca DW4/3-1] gi|115364532|gb|EAU63608.1| CglB [Stigmatella aurantiaca DW4/3-1] Length = 422 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/332 (9%), Positives = 68/332 (20%), Gaps = 88/332 (26%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + L + + + R S Sbjct: 61 TAQALKPNLMVLLDTSGSM-----------------TLPVNTRDPNCYRANNTPSPDDDY 103 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + +++ T+ + L+ A+ FL ++ + Y G + + Sbjct: 104 CGQTPSTACDTSKC-----PTRWSELQGAMSRFLSESGGVARMGLTTYPGPAVGSNSLRC 158 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM-KQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + + S F Q + Sbjct: 159 EASTVVNKNIPQSDADEALLGAANDIQSVILGIPNAGTNAPSGGTPTSLSLQFVGQQPDV 218 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNT---IKIC--------------------------- 354 + F++ LTDG N N N + C Sbjct: 219 QATDRDNFVLLLTDGLPNCNPDNQNAGTNVDACQCTLANIGNNTGCQGDYVRRGCLDKDA 278 Query: 355 -----DKAKENFIKIVTISINAS-------------PNGQRLLKTCVSSPE--------- 387 + K I+ + + A ++C P Sbjct: 279 SVVAVEDLKRKGIRTIVVGFGAETATGNGPATLNAMATAGDFARSCRDDPNACGAGDTCD 338 Query: 388 --------YHYNVVNADSLIHVFQNISQLMVH 411 Y N + L + I L+ Sbjct: 339 AQTKLCGRRFYQAANQEELAEALREIIDLVGT 370 >gi|47228040|emb|CAF97669.1| unnamed protein product [Tetraodon nigroviridis] Length = 621 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 56/191 (29%), Gaps = 29/191 (15%) Query: 235 LLLFLDS------IDLLSHVKEDVYMGLIGYTTRVE--KNIEPSWGTEKVRQYVTRDMDS 286 F+D+ + S + D + L+ Y + E + + + S Sbjct: 16 AKEFIDNVARRLNLASSSSDERDARIALLQYGSATEQRVEFPLTHNFTVISDSLDNL--S 73 Query: 287 LILKPTDSTPAMKQAYQIL--------------TSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + A+ A + ++ T + G ++ + Sbjct: 74 YMDSSSALGSAIIYAINNIVTPKVQRDRQRQGNNTENVSLLPTLCPQVGERLARRHAELA 133 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 +F+TDG + + + + + + I++ + + L K + + Sbjct: 134 FVFITDGITSREQLDEGVTAM---RRGGGVPT-VIAVGKDTDEEVLRKVALGDATAIFRA 189 Query: 393 VNADSLIH-VF 402 N L F Sbjct: 190 DNYHMLNKSAF 200 >gi|309364363|emb|CAP25017.2| CBR-CLEC-62 protein [Caenorhabditis briggsae AF16] Length = 393 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 52/162 (32%), Gaps = 21/162 (12%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQI 304 K V +GL+ Y + + + + A + Sbjct: 78 DKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAAESV 137 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L++ +K S Q +I+ +D ++ + T ++ K + + I Sbjct: 138 LSAGRKN-----------ATRSNYKQLVLIYASDYRDDGEEDPRPT---AERMKSSGVSI 183 Query: 365 VTISINASPNGQRLLKTCV-SSPEYHYNVVNAD---SLIHVF 402 T++ + + N + +SP +++ +AD + Sbjct: 184 ATVAFDQTGNEGVVKAIGEIASPGFNFTNEDADLVREIQGAM 225 >gi|304315800|ref|YP_003850945.1| hypothetical protein Tthe_0286 [Thermoanaerobacterium thermosaccharolyticum DSM 571] gi|302777302|gb|ADL67861.1| conserved hypothetical protein [Thermoanaerobacterium thermosaccharolyticum DSM 571] Length = 185 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 42/158 (26%), Gaps = 7/158 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALS-VMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 ++L K K SI + ++ F+ L FLI + KNS + A + Sbjct: 4 LNLFMGLFHQLVKNTRDRKGISSIFLVMLFLIPFITLSVFLIESRYLYTIKNSADDAVVS 63 Query: 60 AILAGASKMVSNLSR------LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + LA + D ++ + + + S Sbjct: 64 SALAALKSADPVDAAYGEYVLDKDAAKNTFYEYLKKNMKLDDNLNPMSGSVAASSVHVDE 123 Query: 114 YNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + + T + N + + + Sbjct: 124 FIVYDPGDYPTYCPKGTLIRNTSIHVVISFKARRPGLT 161 >gi|218887818|ref|YP_002437139.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758772|gb|ACL09671.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 162 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 11/72 (15%), Positives = 25/72 (34%), Gaps = 1/72 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSN 71 + E S+ A + L ++ L+ + ++++E A A + Sbjct: 10 RLWRGEGGVGSLEVAFMLPVLLAMLFGLVEFGYNLFARSTVEKAAQVGARFAITGEGFDT 69 Query: 72 LSRLGDRFESIS 83 +RL E+ Sbjct: 70 GNRLALIKEAAR 81 >gi|332018134|gb|EGI58743.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Acromyrmex echinatior] Length = 2218 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 35/126 (27%), Gaps = 29/126 (23%) Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + T + A+ +A IL K + +TDG +N Sbjct: 171 NITYSGGGTYTRGALLEALAILEKS-----------------REAANKVVFLITDGFSNG 213 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHV 401 K + T I + L S PE + Y + + Sbjct: 214 GD----PRPAAHLLKNTGAIVFTFGIRTGNVEE--LHDIASHPEYTHSYLLDSFAE---- 263 Query: 402 FQNISQ 407 F+ +++ Sbjct: 264 FEALAR 269 >gi|38569085|gb|AAR24260.1| thrombospondin-related antigen protein [Plasmodium sp. ex Aplonis atrifusca] Length = 540 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 64/181 (35%), Gaps = 29/181 (16%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL---- 289 A+ L D + L+ + +++ L +T + + I+ R++ +++L Sbjct: 13 AVPLVYDIVKNLNVSNDGIHLYLSVFTHYLREYIKLGSSLSTNREFALNIIENLKNKYYL 72 Query: 290 -KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T+ T A+ ++L + + K + I+ TDG ++ Sbjct: 73 HGSTNLTIALS---RVLQDNFIK-----------KKGREDAVQLILIFTDGAPDDK---E 115 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP-----EYHYNVV--NADSLIHV 401 ++ K K+ K I + N + + SSP + + + +I Sbjct: 116 TAMQEVVKLKKMNAKFSVIGVGMGINREFNKRLVDSSPYEEKCDLYSEASWVDVKDIIAP 175 Query: 402 F 402 F Sbjct: 176 F 176 >gi|328885837|emb|CCA59076.1| hypothetical protein SVEN_5790 [Streptomyces venezuelae ATCC 10712] Length = 865 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 58/232 (25%), Gaps = 20/232 (8%) Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 V P G + + + + ++ Sbjct: 22 VLSGPFAATTASGAPGGAGAPALPRTPAAPADPGDPGEGPDPVDFAVVVDQSASLSDKDL 81 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTR-------VEKNIEPSWGTEKVR-------QY 279 A + + E +IG+ + V + + R Q Sbjct: 82 ARETEAAGLLSQGEISERSRAAVIGFGSSEKPGQSPVREVCPLTVADAAGRERLSDCVQE 141 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK------IPSLPFQKFI 333 + R + + TD A++QA LT+D + K L + Sbjct: 142 LGRRDAARMGPGTDFPAAIRQAVSRLTADGTAGGTGAAGKPAPKVVFLLTDGKLDVKDSP 201 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 + TD E+ + +A+ ++I + + + L Sbjct: 202 EYGTDPESRQSNGEKRLTEELARARAAGVQIWPLGFGSEIDRAALTAMAEGG 253 >gi|240255542|ref|NP_476505.3| collagen alpha-3(VI) chain isoform 2 precursor [Homo sapiens] gi|193787261|dbj|BAG52467.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 822 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 878 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 879 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 928 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 929 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 978 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 979 EYVFSVSTFRELPSLEQKLLTPITTL 1004 >gi|145503451|ref|XP_001437701.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404855|emb|CAK70304.1| unnamed protein product [Paramecium tetraurelia] Length = 588 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 51/182 (28%), Gaps = 25/182 (13%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 ++++ K AL+LFL S+ + G Y+ E + + + Q Sbjct: 349 SGSMSGSRISKAKEALILFLKSL-PQDSEFNIISFGSNFYSLWNESKM---YSQNSLEQA 404 Query: 280 VTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T +K+ S + LTD Sbjct: 405 INHVQSMDANLGGTRIIVPLKEMVYNKYYG----------------ASNKTTLNVFLLTD 448 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 GE+ I + K +I T+ I + + + H V + + + Sbjct: 449 GEDFA----DPIIDLVQKNNRAQTRIYTLGIGEGCSQYLIRRVAEVGNGKHQIVSDKEDI 504 Query: 399 IH 400 Sbjct: 505 SE 506 >gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 698 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 47/329 (14%), Positives = 103/329 (31%), Gaps = 71/329 (21%) Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVN-SSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 ++ K + S F+ + ++ N L+ +N F + T Sbjct: 146 KYQKGSVTNDYSDKPETFHFATTIKTQKEIQDVKVSVQGNKNLNDPHNATFVTNEAPTKD 205 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI--------FLIELVVDLSGSMHCA 206 ++ Q ++ + + I + P VVD SGSM Sbjct: 206 AIIIEA------QIKDEDKNVAVSSDGYIAVTTYPFFEGSIESNSEFYFVVDCSGSMA-- 257 Query: 207 MNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 ++ + LF+ S+ ++ + ++ + Sbjct: 258 ------------------GKRIENAVKCMRLFIQSL------PVGCRFAILKFGSQFQTV 293 Query: 267 IEPSWGTEKVRQYVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 +EP T++ + +D++ TD ++ L + + Sbjct: 294 LEPCDYTDENVARAMKLLDTIKADMGGTDILSPLQH-VSELKAKEGFV------------ 340 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNGQRLLKTC 382 K I FLTDGE +N IC A++N +I +I + + + + Sbjct: 341 ------KQIFFLTDGEVHNPDM------ICATAQKNRSGNRIFSIGLGSGADPGLIKGMA 388 Query: 383 VSSPEYHYNVVNADSLIH-VFQNISQLMV 410 S + + + D++ V + +S + Sbjct: 389 RKSGGNYSIIGDDDNMNEKVIEMLSSAIS 417 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 78/232 (33%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D A A ++ I+ ++ G Sbjct: 228 VLDPSGSMNIYLVLDGSGSIGASDFTG-----------AKKCLVNLIEKVASYGVKPRYG 276 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 277 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 333 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 334 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 386 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 387 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 438 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 78/232 (33%), Gaps = 40/232 (17%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 V+D SGSM+ + D A A ++ I+ ++ G Sbjct: 237 VLDPSGSMNIYLVLDGSGSIGASDFTG-----------AKKCLVNLIEKVASYGVKPRYG 285 Query: 256 LIGYTTRVEKNIEPS----WGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDK 309 L+ Y T + ++ S + V + + D + T++ A++ Y +++ Sbjct: 286 LVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMS--- 342 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KE 359 + + +G II +TDG +N + I +E Sbjct: 343 ---WPDDVPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPRE 395 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVF-QNISQL 408 +++ + + N + + + V + ++L VF Q I + Sbjct: 396 DYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQMIDES 447 >gi|260813598|ref|XP_002601504.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] gi|229286801|gb|EEN57516.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] Length = 400 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 70/243 (28%), Gaps = 22/243 (9%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 N + + LL + I ++ + H + + D+ Sbjct: 180 SQNYYAIRKVKILLGNAQSTVSAYAIYMLCHDTYPYHSLLCGNSLDIIYLVDGSGSVG-- 237 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + L ++ +G+I Y++++ + + D Sbjct: 238 ANNFEKVKLFIKKAVSGFVIGPAATQVGVIQYSSKIRQEFSMNSFQTVSGLLGAIDAMEY 297 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ A + S K ++ +TDG +++ + Sbjct: 298 MQGGTLTGRAIRYASKYGFSVFDG-------------ARRGVPKVLVVVTDGVSSDEVAI 344 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 A+ I + I + S L+ S+ E V N + L V + Sbjct: 345 PALE-----AQRQGIFVYAIGV--SNYDAEQLQKIASTNESSAMVDNFNLLDSVRNTLLT 397 Query: 408 LMV 410 + Sbjct: 398 SVC 400 >gi|167836796|ref|ZP_02463679.1| TadE family protein [Burkholderia thailandensis MSMB43] Length = 155 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 43/154 (27%), Gaps = 6/154 (3%) Query: 7 FRFYFKKGI--ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILA 63 + ++ +I FA+ F L++ ++ + S+ +A A A Sbjct: 1 MMRALGRFRSPRRQRGATAIEFAILFPLFFLILYGIVTYGMIFAAQQSLTLAATEGARAA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ + + + A L A ++ S T Sbjct: 61 LNYQVAQTQAAALGLRAAAACTAANNLTGWLSGATCTTSSNYTCSYDSTMFCIQVTVTYP 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + + + L ++ + T+ Sbjct: 121 YAANPLVPAVALFDAVLPTTLTSRATVQINPTNI 154 >gi|134295602|ref|YP_001119337.1| hypothetical protein Bcep1808_1494 [Burkholderia vietnamiensis G4] gi|134138759|gb|ABO54502.1| membrane protein-like protein [Burkholderia vietnamiensis G4] Length = 706 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 13/136 (9%), Positives = 45/136 (33%), Gaps = 3/136 (2%) Query: 16 ASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ + +I+ A+ ++ ++L G I + ++ ++ M++A + A L+ K ++ + Sbjct: 20 RRQRGSIAIMAAIWLSVAVIVLSG--IDIGRFYSERRHMQAAADLAALSSVMKASADSTC 77 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + A A + + S+ + + + Sbjct: 78 TAAKTAAQQVANPTANAVPANATLSVTCGVWTAPQSSSGSALFTPTAGDPTAGQCGGLQA 137 Query: 135 NRLDSSNNTIFYNMDV 150 + + Sbjct: 138 GTANGQSANAVCVTVS 153 >gi|147905660|ref|NP_001090738.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Xenopus (Silurana) tropicalis] gi|120537294|gb|AAI29013.1| LOC100036724 protein [Xenopus (Silurana) tropicalis] Length = 1076 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 51/171 (29%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + +K+++ V Sbjct: 272 LIRTSVSEMLETLSDDDFVNVAAFNNNAHDVSCFNHLVQANVRNKKKLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + K Sbjct: 329 KGTTDYKTGFKFAFDQLLNHNVSRANC--------------NKIIMLFTDGGEDKAKETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + ++Y + + ++ Sbjct: 375 EAYN-----KDKTVRVFTFSVGQHNYDKGPIQWMACQNKGFYYEIPSIGAI 420 >gi|47224880|emb|CAG06450.1| unnamed protein product [Tetraodon nigroviridis] Length = 777 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 30/276 (10%), Positives = 73/276 (26%), Gaps = 39/276 (14%) Query: 160 FIEHL----LNQRYNQKIVSFIPALLR-IEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 + + Y+ + + + S + Sbjct: 214 WSGSMPRCLGRNTYDSPSDVAAAFTGSLAGLMDVLSPEEKKTEKTSFGRTFNVAKGSRMN 273 Query: 215 NSAPICQDKKRTK--MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-- 270 + TK +NA + + +D + ++ + + V+ ++ Sbjct: 274 VYILLDTSGSITKQDFEISRNATIALIKKLDSYEVK---LKFHVLSFASGVKIIVDIRHA 330 Query: 271 ----WGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + + + D +S K T+ A+ ++L+ K S F Sbjct: 331 EISGSADDVIWNLMNFDYNSHGRKTGTNLHDALVSVNEVLSIFKANSVEQKFNE------ 384 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF-----------IKIVTISINASPN 374 Q II TDG +N + + + + I Sbjct: 385 ---TQNIIIIATDGFSNTGANPQIALAKIRSLLGYHNTSQDQTDETLLDVYVFGIGEQVK 441 Query: 375 GQRLLKTCVS--SPEYHYNVVNADSLIHVFQNISQL 408 +L ++ + + + ++L VF +I Sbjct: 442 KDQLNVLASQKRGEKHVFILKDFETLGEVFNSIISD 477 >gi|37499120|gb|AAQ91617.1| unknown [Antonospora locustae] Length = 824 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/108 (12%), Positives = 35/108 (32%), Gaps = 19/108 (17%) Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T +++A + + + S ++ II LTDG ++N + Sbjct: 373 PRGGTCIVAGLQRAVDLKPAAEDGSI----------------RRNIILLTDGGDSNLREI 416 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 + + + I I + +++ + H + +A Sbjct: 417 T---SLVQREAAKGTRFFAIGIGNGVSYDTVMEVARAGRGTHDFIRDA 461 >gi|327538643|gb|EGF25298.1| TadE family protein [Rhodopirellula baltica WH47] Length = 134 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 25/63 (39%), Gaps = 1/63 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVSNLS 73 I + + + F + + L++ + Y + S++ +A A LA + S + Sbjct: 7 IRARRGAALMEFVMCLPVLLVITLGTLETCRMIYLRQSLKLAAYECARLAIVPGVDSQMV 66 Query: 74 RLG 76 + Sbjct: 67 QDQ 69 >gi|307109997|gb|EFN58234.1| hypothetical protein CHLNCDRAFT_50640 [Chlorella variabilis] Length = 1183 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 20/198 (10%), Positives = 50/198 (25%), Gaps = 30/198 (15%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + K A++ F + + V E + L+ + T + G ++ + Sbjct: 143 NSGSMAGSPLDTAKRAVIRFAQAAPAAAGVMEACW--LVEFNTSTRAHSLLGLGGAQIEE 200 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V T A+ + + F++F +D Sbjct: 201 AVGTI---PAGGSTCFHRALSTLADLAAPSAPLGATAAAAAAAAVA---AARHFVVFFSD 254 Query: 339 GENNNFKSNVNTIKICDKAKENFIK-----------------IVTISINASPNGQRL--L 379 G ++ + + + + T+ + L L Sbjct: 255 GCDSGYGG---VEAALESLQARLGGGSGGGSGGGGGTGAEWCVHTLGFGRDHDAAFLSSL 311 Query: 380 KTCVSSPEYHYNVVNADS 397 S+P + + Sbjct: 312 TLAGSAPGSFQYIQSMAE 329 >gi|119569860|gb|EAW49475.1| von Willebrand factor A domain containing 2 [Homo sapiens] Length = 576 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 27 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 86 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + + KE + Sbjct: 87 HRGLPGGR---------------NASVPQILIIVTDGKSQGDVALPS-----KQLKERGV 126 Query: 363 KIVTISI 369 + + + Sbjct: 127 TVFAVGV 133 >gi|38348304|ref|NP_940898.1| von Willebrand factor A domain-containing protein 2 [Homo sapiens] gi|34527908|dbj|BAC85505.1| unnamed protein product [Homo sapiens] Length = 725 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 79 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + + KE + Sbjct: 139 HRGLPGGR---------------NASVPQILIIVTDGKSQGDVALPS-----KQLKERGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|74722595|sp|Q5GFL6|VWA2_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; AltName: Full=Colon cancer secreted protein 2; Short=CCSP-2; Flags: Precursor gi|50429312|gb|AAT77225.1| colon cancer secreted protein-2 [Homo sapiens] gi|50429314|gb|AAT77226.1| colon cancer secreted protein-2 [Homo sapiens] Length = 755 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 79 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + + KE + Sbjct: 139 HRGLPGGR---------------NASVPQILIIVTDGKSQGDVALPS-----KQLKERGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|27657363|emb|CAD60276.1| AMACO [Homo sapiens] Length = 755 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 37/127 (29%), Gaps = 20/127 (15%) Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L E V +G +++ + + T++ A+K Sbjct: 79 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + + +I +TDG++ + + + KE + Sbjct: 139 HRGLPGGR---------------NASVPQILIIVTDGKSQGDVALPS-----KQLKERGV 178 Query: 363 KIVTISI 369 + + + Sbjct: 179 TVFAVGV 185 >gi|311028997|ref|ZP_07707087.1| hypothetical protein Bm3-1_00293 [Bacillus sp. m3-13] gi|311032266|ref|ZP_07710356.1| hypothetical protein Bm3-1_17297 [Bacillus sp. m3-13] Length = 245 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 7/82 (8%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT--CVSSP 386 K I+ +TDG +N+ I + AKE I + I + + + + + S Sbjct: 7 KQILLITDGCSNSGDD---PIAMAALAKEQGITVNVIGVMDEDTIDERGMQEIEGIAMSG 63 Query: 387 EYHYNVVNADSLIHVFQNISQL 408 +V A +L Q +++ Sbjct: 64 GGVSQIVYAKNLSQTVQMVTRK 85 >gi|167829718|ref|ZP_02461189.1| hypothetical protein Bpseu9_38925 [Burkholderia pseudomallei 9] Length = 396 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 74/319 (23%), Gaps = 20/319 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 SI+ AL + + +G + + + ++ ++++ +A LA A + ++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAG 60 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 I+ + + S I + S I +R N Sbjct: 61 ITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVN 120 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL--RIEMGERPIFLIELVVDL 199 I V + + IP + P+ + Sbjct: 121 WFIQTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGD 180 Query: 200 -------------SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 +G+ + +S + A + + D + Sbjct: 181 WLSAKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTN 240 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPS---WGTEKVRQYVTRDMDSLILKPT--DSTPAMKQA 301 + Y + + + D + + T + Sbjct: 241 AYNTRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITG 300 Query: 302 YQILTSDKKRSFFTNFFRQ 320 S + R+ Sbjct: 301 INTGGSYNPSYYAAGADRR 319 >gi|325673442|ref|ZP_08153133.1| TadE family protein [Rhodococcus equi ATCC 33707] gi|325555463|gb|EGD25134.1| TadE family protein [Rhodococcus equi ATCC 33707] Length = 133 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 39/131 (29%), Gaps = 5/131 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + S+ ++ FAL V + L+ ++ + ++A +AA GA M Sbjct: 1 MKRLTSDSGVAAVEFALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIK 56 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS-AVFYNTEIQNIVNSSRISMT 130 R I DA+ I + S S + ++ Sbjct: 57 KDPAAARAAVKGAGVFSPAITDAEICISTSGSQGCSATSCPSGSTVTLTVSYPLEYMTGL 116 Query: 131 HMANNRLDSSN 141 L + Sbjct: 117 FPGKPTLTGTG 127 >gi|221113899|ref|XP_002160633.1| PREDICTED: similar to Collagen alpha-5(VI) chain [Hydra magnipapillata] Length = 9981 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 52/162 (32%), Gaps = 19/162 (11%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + Y+G++ + +I+ V Y + LK T + + A Sbjct: 9051 LKSYKISTDATYVGVVANGDQPIVSIKL----NAVSSYESIVGYLKNLKYTGESRKLSYA 9106 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 +QI+ + + F I + D K+ Sbjct: 9107 FQIVRTSLFSEENGGRESIPKTLIVFSNSGFSIDMND-----------LSDEAQALKDMG 9155 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 +KIV I++ + +LK V + + + +L ++ Q Sbjct: 9156 VKIVFIALGEDARSE-MLKQVV---DVFFFAEDLPNLKYILQ 9193 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 20/60 (33%), Gaps = 2/60 (3%) Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 +KIV + +N + N VSS V L F I + S ++ Sbjct: 8526 NRGVKIVVVGMNFNDNNIEKWNNAVSSNGKIVAVTK--ELPLFFGAIESSISQLLSSPLI 8583 >gi|330938298|gb|EGH41949.1| hypothetical protein PSYPI_05803 [Pseudomonas syringae pv. pisi str. 1704B] Length = 514 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 35/107 (32%), Gaps = 1/107 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + +L+ A + Sbjct: 72 ATAPDFANQSATRNGFTPNTDGRTLVTRCGTLTVDVAGPRVFVADST 118 >gi|118618215|ref|YP_906547.1| hypothetical protein MUL_2774 [Mycobacterium ulcerans Agy99] gi|118570325|gb|ABL05076.1| conserved membrane protein [Mycobacterium ulcerans Agy99] Length = 981 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 22/177 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMD 285 KM A + A +D++ + I Y V + +++ R + V Sbjct: 313 KMTAARRAASRIVDALTSDDRFAVLTFDDGIEY--PVGLPAGLTEASDRHRYRAVEHLAR 370 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ +++A +L ++ +I ++DG+ Sbjct: 371 VEARGDTEMLAPLRRALALLGREQVADTDDA---------------VLILISDGQ--VGN 413 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ +++ TI ++ + N L + V N D L Sbjct: 414 EDQLLQELSGDLGR--VRLHTIGVDEAVNAGFLRRLAGVGGGRCVLVDNEDRLDEAL 468 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 53/153 (34%), Gaps = 25/153 (16%) Query: 232 KNALLLFLDSIDL---LSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDS 286 K+ +L F D I K M +I +++ V + G E ++ V Sbjct: 76 KDFVLNFTDQISHLKLAKPWKTKTKMAIIQFSSSVRIEQSFNEWTGVENFKRIVNSMT-- 133 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 I + T + A+ A I + K K I +TDG ++ Sbjct: 134 YIGQGTYTYYAIMNATNIFKAHKSAGNV----------------KVAILMTDGIDHPKSP 177 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 + + D A+ I ++I ++ + +L Sbjct: 178 D--ARQASDFARAAGINFISIGLSTQKANKTIL 208 >gi|15921062|ref|NP_376731.1| hypothetical protein ST0828 [Sulfolobus tokodaii str. 7] gi|15621846|dbj|BAB65840.1| 360aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 360 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 18/44 (40%) Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 ++I++I I N L + +Y++ + L VF Sbjct: 156 NVQIISIGIGRDYNEVILKRMADKGSGVYYHIEDPTQLPSVFAQ 199 >gi|222147188|ref|YP_002548145.1| hypothetical protein Avi_0226 [Agrobacterium vitis S4] gi|221734178|gb|ACM35141.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 207 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 17/159 (10%), Positives = 46/159 (28%), Gaps = 14/159 (8%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 L R+R ++ S + +I FA+ + + L+I ++ + ++ +A + Sbjct: 13 GLWRRWRLVARRLRRSRDGSAAIEFAILAIPYFLIIFAILETFVAFIAEQTVNAAVD--- 69 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE---- 117 + + ++ N I L+ +A Sbjct: 70 -------TLGRQLRTGNITYNQARSTDKTATEFRQLFCNEISFLLTCDAAEVATPNRLWL 122 Query: 118 IQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + T +A ++ + +S + Sbjct: 123 DVRTYTAFSAMPTTIATTSGSLDTSSFAFTPGGASSINM 161 >gi|1399245|gb|AAB03226.1| integrin alpha-M [Rattus norvegicus] Length = 205 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 20/166 (12%) Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 K L+ Y+ + + + T + +++ + L Sbjct: 10 KSKTLFSLMQYSDEFRTHFTFNXFKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELFQK 69 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + K ++ +TDGE +N + +A+E I I Sbjct: 70 INGARDNAA-------------KILVVITDGEKFG--DPLNYEDVIPEAEEAGIIRYVIG 114 Query: 369 INAS---PNGQRLLKTCVSSP--EYHYNVVNADSLIHVFQNISQLM 409 + + P +R L T S P ++ + V N ++L + + + + Sbjct: 115 VXNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIRNQLQEKI 160 >gi|332267269|ref|XP_003282606.1| PREDICTED: von Willebrand factor A domain-containing protein 1 isoform 1 [Nomascus leucogenys] Length = 444 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 16/150 (10%), Positives = 45/150 (30%), Gaps = 24/150 (16%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ +R + + + T + A+ A + L ++ Sbjct: 75 SLVHVGSRPYTEFPFGQHSSGKAAQDAVRASAQRMGDTHTGLALVYAKEQLFAEASG--- 131 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + + +S Sbjct: 132 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTVFIVSTGRGNF 176 Query: 375 GQRLLKTCVSSPE----YHYNVVNADSLIH 400 + L S+P + +V + ++ Sbjct: 177 LE--LSAAASAPAEKHLHFVDVDDLHIIVQ 204 >gi|330953032|gb|EGH53292.1| hypothetical protein PSYCIT7_16979 [Pseudomonas syringae Cit 7] Length = 677 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 35/107 (32%), Gaps = 1/107 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + +L+ A + Sbjct: 72 ATAPDFANQSATRNGFTPNTDGRTLVTRCGTLTVDVAGPRVFVADST 118 >gi|260800521|ref|XP_002595177.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|260800523|ref|XP_002595178.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] gi|229280421|gb|EEN51189.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|229280422|gb|EEN51190.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] Length = 153 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 15/149 (10%), Positives = 42/149 (28%), Gaps = 19/149 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ L + V + ++ Y + + D + T Sbjct: 20 MKTFIRQLFAYLDIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDGIEYMGGYTL 79 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ A + +K K + +TDG + + + Sbjct: 80 TGKAIDFATNLHFDLRKG-------------ARADVTKIAVVITDGRSYDDVNRP----- 121 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTC 382 + ++ I + + + + + +L Sbjct: 122 ARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|219847012|ref|YP_002461445.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541271|gb|ACL23009.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 847 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 7/50 (14%), Positives = 18/50 (36%), Gaps = 2/50 (4%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNS 52 L R R Y + + + AL ++ + ++ + V ++ Sbjct: 2 LDRRIRTY--RLSRRTRGQSIPLLALMIVVLIGMVALSVDVGRTFSEERR 49 >gi|17541878|ref|NP_501918.1| hypothetical protein R10H10.3 [Caenorhabditis elegans] gi|3879152|emb|CAA94611.1| C. elegans protein R10H10.3, confirmed by transcript evidence [Caenorhabditis elegans] Length = 364 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 25/249 (10%), Positives = 69/249 (27%), Gaps = 16/249 (6%) Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 Y + + + + ++ + + + + Q Sbjct: 122 AGTYYTTTRNAVVTFESNPTIQKSGFS-MQYQSVNTVAPCNRDIFLIVNGLANVGSQANY 180 Query: 225 RTKMA-ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + ++ + +D + + + + T++V + Sbjct: 181 QKEINFIANQLTPTWNVGLDKVRVMLNLQTDIDYAIVWSADDVPSNANVTQEVLNMLEYV 240 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 D D + A+ L K+ +K +I NN Sbjct: 241 PDVTTDNNMDLECLFRYAFDGLDDSKEFD------------ERYGIEKVVIIFVAANANN 288 Query: 344 FKSNVNTIKICDKAK-ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + + K + E K++ + + + + RL K +S Y + + D+L + Sbjct: 289 DQDYNESFEFAHKIRTEQDAKVIVVGMGSGLDQTRLSKLAYAS-GYAFFSTSYDNLSSLI 347 Query: 403 QNISQLMVH 411 I+ + Sbjct: 348 PKINNAICS 356 >gi|167899815|ref|ZP_02487216.1| hypothetical protein Bpse7_39190 [Burkholderia pseudomallei 7894] Length = 396 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 5/35 (14%), Positives = 18/35 (51%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 SI+ AL + + +G + + + ++ ++++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 35 >gi|82913384|ref|XP_728622.1| micronemal protein WARP [Plasmodium yoelii yoelii str. 17XNL] gi|23485067|gb|EAA20187.1| micronemal protein WARP [Plasmodium yoelii yoelii] Length = 303 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 53/188 (28%), Gaps = 21/188 (11%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + D+ + K + + K+ +M +I + R + Sbjct: 91 NYCDNYYDITLIVENSSFIQKDYWM-KGTIPFLESMVRNARVSKDKAHMAIILFAGRQDL 149 Query: 266 NIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + EK+ + D+ T A++ A++ + + Sbjct: 150 IVPFTDELSQDKEKLIDKIRTLNDAATDSNTLYVYALEYAFEKVIFGEGTRSDA------ 203 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL-LK 380 K + G + + + + K+N IK++ + I L Sbjct: 204 --------PKVAVLFYYG-FDYGSNTSLIPDVVEDYKQNNIKLIIVGIALGNKQNAFILA 254 Query: 381 TCVSSPEY 388 C S + Sbjct: 255 DCKSDGDC 262 >gi|317056370|ref|YP_004104837.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448639|gb|ADU22203.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 554 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 26/252 (10%), Positives = 62/252 (24%), Gaps = 22/252 (8%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 V M E + V ++ + I + + Sbjct: 98 STFSTDVDTASFTNIRRMIENNQNIDPNAVRTEEFINYFKYNYEYPDGDDKIAITTELSD 157 Query: 228 MAALKNA----LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-------------S 270 +A + L ID+ V++ + + E + Sbjct: 158 CPWNADAKLMQIGLQAKDIDVQDIDSNIVFLIDVSGSMADENKLPLVVQAFAMLAENLGE 217 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ---GVKIPSL 327 + Y RD L + + + LT+ + K Sbjct: 218 NDRISIVTYAGRDTIELEGESGANYETIASTLAGLTAGGSTAGAAGINTAYELAEKYFIK 277 Query: 328 PFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +I TDG+ N S+ + ++ ++ + + + L+ + Sbjct: 278 GGNNRVILATDGDLNVGLSSEEELKALIEEKRDKGVFLSVLGFGMGNYKDNRLEALADNG 337 Query: 387 -EYHYNVVNADS 397 + + + D Sbjct: 338 NGNYAYIDSVDE 349 >gi|300776965|ref|ZP_07086823.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502475|gb|EFK33615.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 634 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 14/148 (9%), Positives = 42/148 (28%), Gaps = 19/148 (12%) Query: 252 VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR 311 +G++ Y + + EK + + + T ++ AY++ + + Sbjct: 311 DKVGIVVYAGNAGMVLPSTSAGEK-EKIIKALDNLQAGGSTAGGAGIELAYKLAKENFIK 369 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISIN 370 ++ TDG+ N S + + + ++ I + + Sbjct: 370 GGNN----------------RVVLATDGDFNVGASSTSDIETLITEKRKTGIFLTCLGYG 413 Query: 371 ASPNGQRLLKTCVSSP-EYHYNVVNADS 397 ++ + + N Sbjct: 414 MGNYKDNTMEVLADKGNGNYAYIDNMQE 441 >gi|182412149|ref|YP_001817215.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177839363|gb|ACB73615.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 859 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 69/229 (30%), Gaps = 33/229 (14%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 L+ + + + + A V ++ + +++ AL + + Sbjct: 495 NLVRIAMKVPAAGRGAGQPLNLTVLLDTSGSMERTDRATSVRAALGVLASLLTPDD---- 550 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + LIG+ + E S ++ RQ V + T+ A+ A Sbjct: 551 --RVTLIGFARQPRLLAE-SLAGDQARQLVDLASTTPFTGGTNLEAALSLA--------- 598 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI-CDKAKENFIKIVTISI 369 + + Q I+ +TDG N ++ + + ++ I + Sbjct: 599 -------GELARRHHNAAAQNRIVLITDGAANLGNADPAQLATRIETLRQQGIAFDACGV 651 Query: 370 NASPNGQRLLKTCV-SSPEYHY---NVVNAD-----SLIHVFQNISQLM 409 +L+ +Y NAD L F+ ++ + Sbjct: 652 GTDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRPAAENI 700 >gi|120603022|ref|YP_967422.1| outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] gi|120563251|gb|ABM28995.1| putative outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] Length = 3038 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/284 (7%), Positives = 69/284 (24%), Gaps = 11/284 (3%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + + + + + + ++ + N ++ + Sbjct: 2145 QTIDLGAFSTSGHTQTVVDDATGKLVITGYNSATGEVSYEYTLKDNTLEGDNVQRTFEVK 2204 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR--YNQK 172 T+ + I + + + F ++ T+ L + + Sbjct: 2205 VTDTSGDEAKANIVINVTDDAPTAHDDFATFPQVNTTTNLVIVLDTSGSMAWDSGVDGKS 2264 Query: 173 IVSFIPALLRI------EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + +MG I +++ D S +D + + D K Sbjct: 2265 RMELAQEAIAKLMHAYDDMGHVNIKIVDFYSDADASKPWFEGNDAVAKATQYLTTDNKFV 2324 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + L + + L ++ I + + + +V Sbjct: 2325 PGGGTDYDDATAATASALQTGMPNADRTVLYFFSDGAPDPISEALNATEEKAWVDALKG- 2383 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + D A+ Q+ T+ + + + Sbjct: 2384 --VDNLDIAYAIGIGSQVSTTHLLPVGWERNPTDPDNPDAHSTK 2425 >gi|289675735|ref|ZP_06496625.1| hypothetical protein PsyrpsF_20856 [Pseudomonas syringae pv. syringae FF5] Length = 328 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 11/130 (8%), Positives = 41/130 (31%), Gaps = 3/130 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKR--ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + D + + ++ + + + ++ + Sbjct: 72 ASAPDFANQSATRNGFVPNTDGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVAAHPV 131 Query: 133 ANNRLDSSNN 142 + Sbjct: 132 PRSIAAGIGA 141 >gi|46579425|ref|YP_010233.1| hemolysin-type calcium-binding repeat-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46448839|gb|AAS95492.1| hemolysin-type calcium-binding repeat protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233245|gb|ADP86099.1| outer membrane adhesin like proteiin [Desulfovibrio vulgaris RCH1] Length = 3038 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/284 (7%), Positives = 69/284 (24%), Gaps = 11/284 (3%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + + + + + + ++ + N ++ + Sbjct: 2145 QTIDLGAFSTSGHTQTVVDDATGKLVITGYNSATGEVSYEYTLKDNTLEGDNVQRTFEVK 2204 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR--YNQK 172 T+ + I + + + F ++ T+ L + + Sbjct: 2205 VTDTSGDEAKANIVINVTDDAPTAHDDFATFPQVNTTTNLVIVLDTSGSMAWDSGVDGKS 2264 Query: 173 IVSFIPALLRI------EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + + +MG I +++ D S +D + + D K Sbjct: 2265 RMELAQEAIAKLMHAYDDMGHVNIKIVDFYSDADASKPWFEGNDAVAKATQYLTTDNKFV 2324 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + L + + L ++ I + + + +V Sbjct: 2325 PGGGTDYDDATAATASALQTGMPNADRTVLYFFSDGAPDPISEALNATEEKAWVDALKG- 2383 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + D A+ Q+ T+ + + + Sbjct: 2384 --VDNLDIAYAIGIGSQVSTTHLLPVGWERNPTDPDNPDAHSTK 2425 >gi|52545928|emb|CAH56139.1| hypothetical protein [Homo sapiens] Length = 1222 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|314988185|gb|EFT32276.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] Length = 322 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLNGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAKSWTADSADKLREVYQQVHSSVG 287 >gi|217424378|ref|ZP_03455877.1| TadE family protein [Burkholderia pseudomallei 576] gi|217392843|gb|EEC32866.1| TadE family protein [Burkholderia pseudomallei 576] Length = 594 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 428 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 487 Query: 79 FE 80 + Sbjct: 488 AQ 489 >gi|260799304|ref|XP_002594637.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] gi|229279872|gb|EEN50648.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] Length = 181 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 58/186 (31%), Gaps = 28/186 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 + K L ++ + +G+I Y++ V + + K D Sbjct: 15 PNEFDKSKAFLRNVINQFQ---IGPDATKVGVIQYSSTVREEFSLNRYLTKQDVMRAIDR 71 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + A+ Q K I +TDG + Sbjct: 72 IPFLGGFTRTGEAITFMKQHSQFRGN------------------VPKIAIVITDGRAQDD 113 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 S + + A + + IS+ + + LK S+ Y V + D+L + + Sbjct: 114 VSYPSQM-----AHGAGVIMYAISVGSVNMNE--LKLIASAERYITTVSDFDALSTLTLS 166 Query: 405 ISQLMV 410 +++ + Sbjct: 167 LTEQIC 172 >gi|76818230|ref|YP_336470.1| hypothetical protein BURPS1710b_A1313 [Burkholderia pseudomallei 1710b] gi|76582703|gb|ABA52177.1| putative membrane protein [Burkholderia pseudomallei 1710b] Length = 722 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 556 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 615 Query: 79 FE 80 + Sbjct: 616 AQ 617 >gi|224171915|ref|XP_002198669.1| PREDICTED: similar to collagen, type XX, alpha 1, partial [Taeniopygia guttata] Length = 152 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 39/131 (29%), Gaps = 20/131 (15%) Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L Y++ E S + + + T + A+ Sbjct: 1 LSQYSSDPRTEWELSTYSTREQVLEAVRNLRYKGGNTFTGLALTH-------------VL 47 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + L +K +I LTDG++ + + K I+I I + + Sbjct: 48 EQNLKPDAGARLEAEKLVILLTDGKSQDDANL-----AAQTLKNLGIEIFAIGVKNADEA 102 Query: 376 QRLLKTCVSSP 386 + L+ S P Sbjct: 103 E--LRQVASEP 111 >gi|116619318|ref|YP_821474.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222480|gb|ABJ81189.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 296 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 16/153 (10%), Positives = 43/153 (28%), Gaps = 32/153 (20%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 T R + + T A+ A + L Sbjct: 116 LLDYTSNCGTMQNRLVMAKPHGMTALLDAIPLAVEHLRKAAH------------------ 157 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV--TISINASPNG----------Q 376 + I+ ++DG N + + +A+E + T+ + A + + Sbjct: 158 PRHAILIISDGGENASRV--RLHDVRRQAREANAPVYAATLGLEAEFDQGPYLDARRGPE 215 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + + +++ + + I++ + Sbjct: 216 LLREIAQITGGRAFSIQESRRIEEAAAGIAREL 248 >gi|302804190|ref|XP_002983847.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] gi|300148199|gb|EFJ14859.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] Length = 557 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 49/191 (25%), Gaps = 24/191 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG--LIGYTTRVEKNIE 268 S P +T+ N + L+++ V G + + V E Sbjct: 202 TSTSMSNPTGVLSSQTRFNVANNIIKKLLNTLTNGDQVAVSTIGGEKIGAPVSVVLGVQE 261 Query: 269 PSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S + S+ ++ ++ A + Sbjct: 262 TSLDLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSN----------------- 304 Query: 328 PFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 II TDG+ + T + + + + I + + SS Sbjct: 305 --LNVIILFTDGQFVTPGNFNFTQLSPVLAQLNARGVVVFVYRIGSFTTNDATFQQMQSS 362 Query: 386 PEYHYNVVNAD 396 Y V+N D Sbjct: 363 LNMSYEVINDD 373 >gi|302559122|ref|ZP_07311464.1| membrane protein [Streptomyces griseoflavus Tu4000] gi|302476740|gb|EFL39833.1| membrane protein [Streptomyces griseoflavus Tu4000] Length = 154 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 12/138 (8%), Positives = 38/138 (27%), Gaps = 3/138 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++G ++ + + + + FL L F+I K+ A A + Sbjct: 13 RRGRLDDRGSGAGAVIIFALVFLSLAAFVIDGGMSISKRERAADIAEQAARYAAQDIDRE 72 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG---YSAVFYNTEIQNIVNSSRIS 128 + + N+ A + ++ +A +++ + + + Sbjct: 73 ALYDNEGGPAPINYENCDARVKAFAREMDMSGPDIAATHCVAATVDAVQVEVQLTYAPVF 132 Query: 129 MTHMANNRLDSSNNTIFY 146 + + Sbjct: 133 TGMFYGGNVVVRGQAVAE 150 >gi|194389238|dbj|BAG65607.1| unnamed protein product [Homo sapiens] Length = 1237 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 13/169 (7%), Positives = 44/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGVQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|240255540|ref|NP_476506.3| collagen alpha-3(VI) chain isoform 3 precursor [Homo sapiens] Length = 1237 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 DV + ++ + +D + + + +I ++ + Sbjct: 1023 GGKRDVVFLIDGSQSAGPEFQYVRTLIERLVDYL---DVGFDTTRVAVIQFSDDPKVEFL 1079 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + + K + A++ + + S Sbjct: 1080 LNAHSSKDEVQ-NAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEG--------- 1129 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 +F++ ++ G KS+ + K+ + TI+ + + + L+K SP Sbjct: 1130 --VPQFLVLISSG-----KSDDEVDDPAVELKQFGVAPFTIA--RNADQEELVKIS-LSP 1179 Query: 387 EYHYNVVNADSLIHVFQNISQLMVHR 412 EY ++V L + Q + + Sbjct: 1180 EYVFSVSTFRELPSLEQKLLTPITTL 1205 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 45/169 (26%), Gaps = 20/169 (11%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 L ++ ++ L + + +G++ ++ Q + + Sbjct: 56 LDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSL-DTYSTKAQVLGAVKALGFAGGELA 114 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + +F R G + ++ ++ G S+ Sbjct: 115 NIGLALDF---------VVENHFTRAGGSRVEEGVPQVLVLISAG-----PSSDEIRYGV 160 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 161 VALKQ--ASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|11414924|dbj|BAB18554.1| voltage dependent calcium channel alpha2a/delta subunit [Rana catesbeiana] Length = 1102 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + + +++ V Sbjct: 272 LIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + Sbjct: 329 KGTTDYKQGFKFAFDQLRNTNVSRANC--------------NKIIMLFTDGGEDKATETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KLYN-----KNKTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 52/151 (34%), Gaps = 21/151 (13%) Query: 253 YMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSL--ILKPTD-STPAMKQAYQILTS 307 +G + Y + N + + + Q V +++ + T + A ++L Sbjct: 69 RVGFVTYNSLATINADLNKFKSWGDLSQGVNDSYNNMNLSSENTSFIGTGLITAGELLQ- 127 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 QG I + + K II N + + + + K I I+T+ Sbjct: 128 -----------VQGSAIGRVYYPKVIIVYASAFNGTGLLD--PLSVANTLKSAGITIITV 174 Query: 368 SINASPNG--QRLLKTCVSSPEYHYNVVNAD 396 +++ NG Q+ L + S + D Sbjct: 175 AVDTDNNGVIQKQLASIASPGSAFSLDPDDD 205 >gi|149005573|ref|ZP_01829312.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|168482703|ref|ZP_02707655.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|307126680|ref|YP_003878711.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|147762513|gb|EDK69473.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|154432930|gb|ABS82113.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432938|gb|ABS82120.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432946|gb|ABS82127.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432954|gb|ABS82134.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432970|gb|ABS82148.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|172043667|gb|EDT51713.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|306483742|gb|ADM90611.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|332076908|gb|EGI87370.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17545] gi|332203636|gb|EGJ17703.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA47368] Length = 883 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 50/215 (23%), Gaps = 26/215 (12%) Query: 136 RLDSSNNTIFY--NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 L S T L + N R+N + + ++ Sbjct: 200 ELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSNP 259 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + V L + V + K L ++ L D + + Sbjct: 260 DNRVALVTYGSTIFDGSEATVEKGVADANGK-----ILNDSALWTFDRTTFTAKTYNYSF 314 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L + + + +D T + A+ A ILT Sbjct: 315 LNLT---SDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILT------- 364 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K +K I +TDG Sbjct: 365 ---------KQARPNSKKVIFHITDGVPTMSYPIN 390 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 37/361 (10%), Positives = 98/361 (27%), Gaps = 30/361 (8%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 +V ++ D ++ + K + + T + Sbjct: 247 ATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTFDRTTFT 306 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + + TI + + + + + QK + +L Sbjct: 307 AKTYNYSFLNLTSDPTDIQTIKDRIPSDAE-ELNKDKLMYQFGATFTQKALMTADDILTK 365 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +I + D +M +N + Q A N+ + L+ Sbjct: 366 QARPNSKKVIFHITDGVPTMSYPINFKYTGTTQSYRTQLNNFK--AKTPNSSGILLEDFV 423 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 S E + G + ++ + V Q ++ + Y+ Sbjct: 424 TWSADGEHKIVRGDGESYQMFTKKPVT-DQYGVHQILSITSMEQ------RAKLVSAGYR 476 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +D + + + T N N ++ Sbjct: 477 FYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGN-------------MAQDGYD 523 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + T+ + + + R +++ SSP+ + NV + ++ +V+ K S+ Sbjct: 524 VFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSI 583 Query: 417 I 417 Sbjct: 584 E 584 >gi|108995558|ref|XP_001094724.1| PREDICTED: von Willebrand factor A domain-containing protein 1 isoform 2 [Macaca mulatta] Length = 438 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/180 (10%), Positives = 51/180 (28%), Gaps = 35/180 (19%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ TR + + T + A+ A + L ++ Sbjct: 74 SLVHVGTRPYTEFPFGQHSSGEAAQDAVRASVQRMGDTHTGLALAYAKEQLFAEAAG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + + +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTVFIVSTGRGNF 175 Query: 375 GQRLLKTCVSSPE----YHYNVVNADSLIHVFQ-NI----------SQLMVHRKYSVILK 419 + L S+P + +V + ++ + +I + + + + Sbjct: 176 LE--LSAAASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQLRATEITSSGFRLAWP 233 >gi|260904335|ref|ZP_05912657.1| von Willebrand factor type A domain-containing protein [Brevibacterium linens BL2] Length = 324 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 54/183 (29%), Gaps = 18/183 (9%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 A +D D +G+ + + + E V+ Y+ + T Sbjct: 113 EAYQELVDRFD-------GERIGMTVFNSTAVSAFPLTDDYEMVQNYLEEAEEGFR---T 162 Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFR----QGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + ++ + S + I+F TD Sbjct: 163 WGSEGTDYSWSTSPPNIGGSSLIGDGLVSCVDNFDRQDEDRSRSIVFATDNM-LAGDPLF 221 Query: 349 NTIKICDKAKENFIKIVTI---SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 + D A E+ +++ ++ S+ +P + L + +++ +A ++ + I Sbjct: 222 DLDDATDIAVESDVRVYSLSPPSVLTNPQTKELKSVSDRTGGKQFDMGSASTIDRIVSEI 281 Query: 406 SQL 408 Sbjct: 282 QSQ 284 >gi|239613654|gb|EEQ90641.1| von Willebrand RING finger domain-containing protein [Ajellomyces dermatitidis ER-3] gi|327352589|gb|EGE81446.1| von Willebrand RING finger domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 1025 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/241 (9%), Positives = 66/241 (27%), Gaps = 27/241 (11%) Query: 158 LQFIEHLLNQRYNQKIVSFIPAL----LRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + + K + + + + + + + + D Sbjct: 438 VDTSGTEEDDYRTHKTIKRVSSTTSSYGASKSNNTALTDYTNSIRDTLPPSFHVPLDLVI 497 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V P+ + K+ L++AL + ++ MGL+ + + T Sbjct: 498 V--IPVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMT 548 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 K T + L +D + + P I Sbjct: 549 TKAWS----------GWGTILNSIRPVGQKSLRADVVEGANVAMDLLMQRKSNNPLS-TI 597 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++D + +S + + AK + I + + + +++ + + V Sbjct: 598 LLISDSSISEPESVDFVVSRAEAAK---VGIHSFGLGLTHKPDTMIELSTRTKASYTYVK 654 Query: 394 N 394 + Sbjct: 655 D 655 >gi|11414930|dbj|BAB18557.1| voltage dependent calcium channel alpha2d/delta subunit [Rana catesbeiana] Length = 1078 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + + +++ V Sbjct: 272 LIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + Sbjct: 329 KGTTDYKQGFKFAFDQLRNTNVSRANC--------------NKIIMLFTDGGEDKATETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KLYN-----KNKTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|11414928|dbj|BAB18556.1| voltage dependent calcium channel alpha2c/delta subunit [Rana catesbeiana] Length = 1085 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + + +++ V Sbjct: 272 LIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + Sbjct: 329 KGTTDYKQGFKFAFDQLRNTNVSRANC--------------NKIIMLFTDGGEDKATETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KLYN-----KNKTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|11414926|dbj|BAB18555.1| voltage dependent calcium channel alpha2b/delta subunit [Rana catesbeiana] Length = 1090 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + + +++ V Sbjct: 272 LIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + Sbjct: 329 KGTTDYKQGFKFAFDQLRNTNVSRANC--------------NKIIMLFTDGGEDKATETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KLYN-----KNKTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 694 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 26/267 (9%), Positives = 75/267 (28%), Gaps = 45/267 (16%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI--- 219 + K + + + + + P+ ++ + + ++ +N N Sbjct: 180 SIEGNVMTLKTIPTTDFNVDVFLNKNPVS-VKEITGNTQVVNFKINPLNYLSNRKTDVKS 238 Query: 220 --------CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 ++ A+ LF + L + + + + Sbjct: 239 IVFLLDCSGSMTIDNRIENAIKAMDLF---LHSLEPGVKFEIVRFGSTFNSLFDFKLTEY 295 Query: 272 GTEKVRQYVTRDMDSLIL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + + + + T+ +KQ Y L+ Sbjct: 296 NDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNELSP----------------------- 332 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + LTDG +N + + D +++ KI ++ + A + + + Sbjct: 333 DVLFVLTDGAVDNSQ------AVLDFVRDSSTKIFSLGLGAGADMNLVRNLASFTGGVSE 386 Query: 391 NVVNADSLIHVFQNISQLMVHRKYSVI 417 +V++A L + + + S + Sbjct: 387 HVLDASQLRDSIIRLLEDSTNPTLSNV 413 >gi|330818825|ref|YP_004351042.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] gi|327374367|gb|AEA65719.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] Length = 660 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 50/153 (32%), Gaps = 26/153 (16%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 DV +I Y + G E+V + + + T + AM A L Sbjct: 521 DVPFEVISYAAGYQIIKSFDDGWEEVSRRSHAVAS--VGRSTATGMAMTVALSNL----- 573 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I ++ ++ LTDG + V T AKE +++VTI I Sbjct: 574 -------------ILRDEDRRMMVLLTDGAAGD---PVMTAASYQAAKEAGVEVVTIFIG 617 Query: 371 ASPNGQRLLK---TCVSSPEYHYNVVNADSLIH 400 L + ++ NV + D L Sbjct: 618 RDIQAIALTRSILNATGFGQHFSNVNSPDELAK 650 >gi|11414932|dbj|BAB18558.1| voltage dependent calcium channel alpha2e/delta subunit [Rana catesbeiana] Length = 1083 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 48/171 (28%), Gaps = 24/171 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR-VEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ ++ L+++ V + + + +++ V Sbjct: 272 LIRTSVTEMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVRNKKVLKEAVNNIT---A 328 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD K A+ L + K I+ TDG + Sbjct: 329 KGTTDYKQGFKFAFDQLRNTNVSRANC--------------NKIIMLFTDGGEDKATETF 374 Query: 349 NTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KLYN-----KNKTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|157817857|ref|NP_001102478.1| procollagen, type VI, alpha 3 [Rattus norvegicus] gi|149037629|gb|EDL92060.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_a [Rattus norvegicus] Length = 2207 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 45/169 (26%), Gaps = 22/169 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDS 294 ++ ++ LS + V +G++ Y+ K S + Sbjct: 263 DFLVNVLERLSVGNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANI 322 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +F R G + ++ ++ G S+ Sbjct: 323 GQALDF-----------VVENHFTRTGGSRVEEGVPQVLVLISAG-----PSSDEIRDAV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDLQE 413 >gi|149037631|gb|EDL92062.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_c [Rattus norvegicus] Length = 2862 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 45/169 (26%), Gaps = 22/169 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDS 294 ++ ++ LS + V +G++ Y+ K S + Sbjct: 263 DFLVNVLERLSVGNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANI 322 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +F R G + ++ ++ G S+ Sbjct: 323 GQALDF-----------VVENHFTRTGGSRVEEGVPQVLVLISAG-----PSSDEIRDAV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDLQE 413 >gi|149037630|gb|EDL92061.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_b [Rattus norvegicus] Length = 2867 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 45/169 (26%), Gaps = 22/169 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDS 294 ++ ++ LS + V +G++ Y+ K S + Sbjct: 263 DFLVNVLERLSVGNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANI 322 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +F R G + ++ ++ G S+ Sbjct: 323 GQALDF-----------VVENHFTRTGGSRVEEGVPQVLVLISAG-----PSSDEIRDAV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDLQE 413 >gi|126314401|ref|XP_001377042.1| PREDICTED: similar to Procollagen, type VI, alpha 2, partial [Monodelphis domestica] Length = 762 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 56/182 (30%), Gaps = 27/182 (14%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED---VYM 254 + V + T + KN ++ ++ + ++ + Sbjct: 342 TCGCCDCERHCGALDIVFVIDSSESIGYTNFSLEKNFVINVVNRLGSITKDPNSDTGTRI 401 Query: 255 GLIGYT-TRVEKNIEPSWG----TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ G + ++ V + I T + A+ Y L Sbjct: 402 GVVQYSHDGTFEAIKLDDGRIGSLAQFKEEVKKLE--WIAGGTWTPSALNYTYNELIKGS 459 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 + + F + +TDG ++ + + +CD K + + I I Sbjct: 460 R---------------RKKTRVFAVVITDGRHDPRDNEQSLKALCD--KVENVVVTAIGI 502 Query: 370 NA 371 Sbjct: 503 GD 504 >gi|56477525|ref|YP_159114.1| Flp pilus assembly protein TadG [Aromatoleum aromaticum EbN1] gi|56313568|emb|CAI08213.1| hypothetical protein, similarity to Flp pilus assembly protein TadG [Aromatoleum aromaticum EbN1] Length = 148 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 39/127 (30%), Gaps = 4/127 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + ++ FA ++ ++L + Y N++ A A +K + Sbjct: 1 MRHMRGVVAVEFAFLLIPLVMLGFGITEFGRAIYSYNTLAKAVRDAARHLTAKTPGDPVE 60 Query: 75 LGDRFESISNHAKR----ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + AL D K + LS T ++N+ + ++ Sbjct: 61 HAIAKCMAVHGNPDCNGPALAPDLKTSMVQTCDTILSCPGVENTVTTGTGMINTVTVRIS 120 Query: 131 HMANNRL 137 N + Sbjct: 121 GYPYNSV 127 >gi|116623633|ref|YP_825789.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] gi|116226795|gb|ABJ85504.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] Length = 185 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 42/136 (30%), Gaps = 4/136 (2%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 I F L +M L I +I + YK+ +++ A+ G V N + Sbjct: 14 GVELIEFTLVMMPLLGFIFLVIDLGWTIYKRATLQ----FAVREGCRYAVINQLQPLKDS 69 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + ++ ++ + + S + +S ++ + Sbjct: 70 NGNNYGMIDSVKYVVQQRAMGFLGSTPSDPGYATIQVRFYDPNSSLTTAVAMPPDCTSTI 129 Query: 140 SNNTIFYNMDVMTSYD 155 N Y ++V + Sbjct: 130 VPNKGGYLVEVSVEHY 145 >gi|262200403|ref|YP_003271611.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262083750|gb|ACY19718.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 423 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 46/163 (28%), Gaps = 22/163 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +A + AL + +D G++ + + + + +K R V + Sbjct: 56 LAGAQRALAGVIGQLDPRD------VFGVVTFDDDAQVVLPAAPLADKAR-AVDAVGSIV 108 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 TD + + Q L + ++ ++DG N + Sbjct: 109 PGGCTDLSSGYLRGLQELRRATASAGIRGG--------------TVLVISDGHVNRGIRD 154 Query: 348 -VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 I KA + I T+ + L S H Sbjct: 155 LDEFASITAKAAADGIITSTLGYGRGYDETLLSAIARSGNGNH 197 >gi|254428069|ref|ZP_05041776.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196194238|gb|EDX89197.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 657 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 15/144 (10%), Positives = 39/144 (27%), Gaps = 23/144 (15%) Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + +Q T PA+ Sbjct: 350 ETPVTVSDNSRQQAQDFVDGLQASGGTHMLPALSATLSQ-------------------PA 390 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 S + + +IF+TDG + + + E ++ T+ I ++PN + + Sbjct: 391 SDGYLRQVIFITDGA--VGNESGIFRALHQQLGE--ARLFTVGIGSAPNSHFMTRAAQFG 446 Query: 386 PEYHYNVVNADSLIHVFQNISQLM 409 + + + + + + + Sbjct: 447 RGSFTYINDQNQVQQGMDTLFRRL 470 >gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 40/323 (12%), Positives = 76/323 (23%), Gaps = 55/323 (17%) Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 K L ++ I + Q ++ ++S+ N + Sbjct: 135 NGYFFKFPLSTKYQKGIMTNEYSDKPDSFHFSLKVNTQKELSDFKVSVDGTKNVIDSHNA 194 Query: 142 NTIFYNMDVMTSYDYRLQFIE---HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD 198 + + + +S P+ VVD Sbjct: 195 TFETNEAPKKDAIFIETPIKDEDKSIAVSSDGYIAISTNPSFSGK---IESNSEFYFVVD 251 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 SGSM A + I + + +K F + + E Sbjct: 252 CSGSMSGARIINAVKCMRLFIQSLPLGCRFSIIKFGT-SFETVLQPCDYSDE-------- 302 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFF 318 +K + + TD ++ Sbjct: 303 -------------NVDKALNLLKSVNAKM--GGTDILSPLQH------------------ 329 Query: 319 RQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIVTISINASPNGQR 377 P F K I LTDGE NN I K +I +I + + + Sbjct: 330 -IAGLKPQPGFVKQIFLLTDGEVNNPD-----ITCATALKNRNENRIFSIGLGSGADPGL 383 Query: 378 LLKTCVSSPEYHYNVVNADSLIH 400 + S + + + D++ Sbjct: 384 IKGLAKKSGGNYIMIADEDNMNE 406 >gi|167908132|ref|ZP_02495337.1| hypothetical protein BpseN_38271 [Burkholderia pseudomallei NCTC 13177] Length = 396 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 5/35 (14%), Positives = 18/35 (51%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 SI+ AL + + +G + + + ++ ++++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 35 >gi|167725250|ref|ZP_02408486.1| hypothetical protein BpseD_39891 [Burkholderia pseudomallei DM98] Length = 396 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 5/35 (14%), Positives = 18/35 (51%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 SI+ AL + + +G + + + ++ ++++ Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNS 35 >gi|156408321|ref|XP_001641805.1| predicted protein [Nematostella vectensis] gi|156228945|gb|EDO49742.1| predicted protein [Nematostella vectensis] Length = 981 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 48/402 (11%), Positives = 108/402 (26%), Gaps = 49/402 (12%) Query: 46 WHYKKNSMESANN----AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNH 101 + + AA L + V + I R + I Sbjct: 36 ASQVSAQLHATFESNLHAAKLQKLYQNVEFSNAPSTNPHDILQAIARGIEHKLNETINAL 95 Query: 102 IKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFI 161 + + + NT I I+ + + N ++ +Q Sbjct: 96 LNAKTAVTNTGASNTRIIECCAMPAIARANYSYRFRAMLNTSVACKTRNPKD-SGIIQGE 154 Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-SGSMHCAMNSDPEDVNSAPIC 220 L+N N + S + + + +GS V + Sbjct: 155 NSLVNTFRNNLVDSSVISWQYFGTSTGNYLQFPASGKVCNGSSSFDPRFQSWYVEAVTRM 214 Query: 221 QDKKR------------TKMAALKNALLLFLDSIDLLSHV-----KEDVYMGLIGYTTRV 263 + +MA + A + LD++ V + + V Sbjct: 215 RTNIVVVIDRSSSMSTAGRMALARQAAVTVLDTLGPNDKVGVVAFSHFIIKPPGCFGGNV 274 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLI-------------LKPTDSTPAMKQAYQILTSDKK 310 + + +++ +V T PA++ A+++L D Sbjct: 275 AEALP--KNINRIKAWVEALTPRGKVSLQKTNLRYVSFPGATKYVPALEAAFEMLGGDFN 332 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN-----FIKIV 365 + +K + I+FLTDG+ + +V+ + + I + Sbjct: 333 IKILHHPLIALIKRS---AENMILFLTDGDPFDRNPDVSIFEAIRIGQRKLAFPARINVY 389 Query: 366 TISINASPNGQRLLKTCVS-SPEYHYNVVNADS--LIHVFQN 404 + + + + LK S + + + D+ L + Sbjct: 390 GLGESLNIDNLNRLKQIASLNNGTFTQINDQDASSLSTIMGK 431 >gi|169832403|ref|YP_001693966.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] gi|154432882|gb|ABS82071.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432906|gb|ABS82092.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|168994905|gb|ACA35517.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] Length = 883 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 50/215 (23%), Gaps = 26/215 (12%) Query: 136 RLDSSNNTIFY--NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 L S T L + N R+N + + ++ Sbjct: 200 ELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSNP 259 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 + V L + V + K L ++ L D + + Sbjct: 260 DNRVALVTYGSTIFDGSEATVEKGVADANGK-----ILNDSALWTFDRTTFTAKTYNYSF 314 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 + L + + + +D T + A+ A ILT Sbjct: 315 LNLT---SDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILT------- 364 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 K +K I +TDG Sbjct: 365 ---------KQARPNSKKVIFHITDGVPTMSYPIN 390 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 37/361 (10%), Positives = 98/361 (27%), Gaps = 30/361 (8%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 +V ++ D ++ + K + + T + Sbjct: 247 ATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTFDRTTFT 306 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + + TI + + + + + QK + +L Sbjct: 307 AKTYNYSFLNLTSDPTDIQTIKDRIPSDAE-ELNKDKLMYQFGATFTQKALMTADDILTK 365 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID 243 + +I + D +M +N + Q A N+ + L+ Sbjct: 366 QARPNSKKVIFHITDGVPTMSYPINFKYTGTTQSYRTQLNNFK--AKTPNSSGILLEDFV 423 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 S E + G + ++ + V Q ++ + Y+ Sbjct: 424 TWSADGEHKIVRGDGESYQMFTKKPVT-DQYGVHQILSITSMEQ------RAKLVSAGYR 476 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +D + + + T N N ++ Sbjct: 477 FYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGN-------------MAQDGYD 523 Query: 364 IVTISINASPN-------GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 + T+ + + + R +++ SSP+ + NV + ++ +V+ K S+ Sbjct: 524 VFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSI 583 Query: 417 I 417 Sbjct: 584 E 584 >gi|91788415|ref|YP_549367.1| TadE-like protein [Polaromonas sp. JS666] gi|91697640|gb|ABE44469.1| TadE-like protein [Polaromonas sp. JS666] Length = 179 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 14/169 (8%), Positives = 46/169 (27%), Gaps = 8/169 (4%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + ++ FA+ + F ++ ++ Y N+M+ A + Sbjct: 11 SAHRSPFKRQGGVAAVEFAIISLLFFTILFAILEFGRMLYVYNTMQEVTRRAAREAVVRW 70 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS-------AVFYNTEIQNI 121 + S + + +++ +S + + + Sbjct: 71 TNQESAVKSLALFGGTSLPAGAEVTSGNISISYLTKSGAAVNPPPLSPGDNISACGDADR 130 Query: 122 VNSSRISMTHMANNRLDSSN-NTIFYNMDVMTSYDYRLQFIEHLLNQRY 169 S I ++ + S ++F +++ E + Sbjct: 131 APISCIYSVRVSIENVTYSPMVSLFSFLNIGLPTSVVTMHAESMGFNGN 179 >gi|83719116|ref|YP_443061.1| hypothetical protein BTH_I2544 [Burkholderia thailandensis E264] gi|167582065|ref|ZP_02374939.1| hypothetical protein BthaT_28237 [Burkholderia thailandensis TXDOH] gi|167620226|ref|ZP_02388857.1| hypothetical protein BthaB_28225 [Burkholderia thailandensis Bt4] gi|257139291|ref|ZP_05587553.1| hypothetical protein BthaA_08814 [Burkholderia thailandensis E264] gi|83652941|gb|ABC37004.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 155 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 43/154 (27%), Gaps = 6/154 (3%) Query: 7 FRFYFKKGI--ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILA 63 + ++ +I FA+ F L++ +I + S+ +A A A Sbjct: 1 MTHAPGRFRSPRRQRGATAIEFAILFPMFFLILYGIITYGMIFAAQQSLTLAATEGARAA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ S + + A L A ++ S T Sbjct: 61 LNYQVAQTQSAALGLRAAAACTAANNLTGWLSGATCSTSSNYTCSYDSTMYCIQVTVTYP 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + + + L ++ + T+ Sbjct: 121 YAANPLVPAVALFDAVLPNTLTSQATVQINPTNI 154 >gi|167522505|ref|XP_001745590.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775939|gb|EDQ89561.1| predicted protein [Monosiga brevicollis MX1] Length = 1927 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 37/328 (11%), Positives = 89/328 (27%), Gaps = 50/328 (15%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 + + + + + T +VN++ + + + Sbjct: 1021 ANATTTPLANATTTTPLVDTTTTPLANATTTTTTPLVNATTTTPLVNATTTPLVNATSTT 1080 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 + + T +D T + N + + + I +V Sbjct: 1081 PNVNATTTTPLVDTTTPNANVTTTTP-MNNGTTTPFDTTMASTSTTLILPCAVTADIVIV 1139 Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 +D SG+M + A K L L + L D+ +GL Sbjct: 1140 IDASGAMSINDFT--------------------AAKTTALAILRRLALAQP---DISVGL 1176 Query: 257 IGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 I ++ + + + ++ + + T+ A+ A +L + + Sbjct: 1177 IFFSQQAQVALPLLDINDETEFQLLLLVLQAGQYQGQATNLGSALSSAADLLETSNNGAR 1236 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 I +DG S+ + + I+I+T++ + Sbjct: 1237 Q------------------FILFSDG-----SSDDQGTLVAQNIRATGIQILTVANVMNA 1273 Query: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHV 401 N LL ++ V + + Sbjct: 1274 NVYNLLLIAGAADNIFLTVQSDATQADA 1301 >gi|52078560|ref|YP_077351.1| YabS [Bacillus licheniformis ATCC 14580] gi|52783922|ref|YP_089751.1| YabS [Bacillus licheniformis ATCC 14580] gi|52001771|gb|AAU21713.1| YabS [Bacillus licheniformis ATCC 14580] gi|52346424|gb|AAU39058.1| YabS [Bacillus licheniformis ATCC 14580] Length = 245 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 30/86 (34%), Gaps = 7/86 (8%) Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT--C 382 I+ LTDG +N + + AKE I + I I + + + + Sbjct: 3 KGHLNQILLLTDGCSNRGED---PQAMAAFAKEQGITVNVIGIMDEHEMDQEAMKEVEGI 59 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL 408 + + VV L Q +++ Sbjct: 60 ALAGGGVHQVVYTSQLSQTVQMVTKK 85 >gi|319654932|ref|ZP_08009006.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] gi|317393357|gb|EFV74121.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] Length = 245 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 7/86 (8%) Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT--C 382 K I+ +TDG +N + + + AKE I + I + + Q + + Sbjct: 3 TGTLKQILLITDGCSNQGED---PVAMAALAKEQGITVNVIGVMEQDVIDEQGMNEIEGI 59 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL 408 S VV + L Q +++ Sbjct: 60 AMSGGGVSQVVYSQQLSQTVQMVTRK 85 >gi|307108695|gb|EFN56934.1| hypothetical protein CHLNCDRAFT_143466 [Chlorella variabilis] Length = 785 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 70/231 (30%), Gaps = 29/231 (12%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI----CQDKKRTKM 228 + + + RP + L++D+SGSM CA + D + ++ +RTK+ Sbjct: 285 VFLAVGLDSGLTDFRRPRLNLCLLLDVSGSMDCAFDRHYYDAATGRQATLQGEEARRTKL 344 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K + L+ + I + + R D+ Sbjct: 345 DVAKEVIKGVLERLSPDD-----------------CVAISLFSDAAATPKRMGRWGDADA 387 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 +++ A L + F G + + +I LTD E N + Sbjct: 388 AGIQAGIDSLRTA--GLDEGMAQLRRWGGFVDGDPLATE---NRLIVLTDAEANCGDVSE 442 Query: 349 NTIKICDKAKENFIKIVT--ISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + KA ++ + + Q + +++V + Sbjct: 443 TGLLARLKAAAAD-RLYATIVGVGLDFQTQLVEGIMRVRGASYFSVHSPGE 492 >gi|172040750|ref|YP_001800464.1| hypothetical protein cur_1070 [Corynebacterium urealyticum DSM 7109] gi|171852054|emb|CAQ05030.1| hypothetical protein cu1070 [Corynebacterium urealyticum DSM 7109] Length = 413 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/196 (9%), Positives = 52/196 (26%), Gaps = 18/196 (9%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS---HVKEDVYMGLIGYTTRVEKNIEP 269 D + A + ++ ++ AL + + G T N++ Sbjct: 232 DTSDAMGVVEGDARRLDVVRGALHSAFQRVGDNEGAVSLWNYSSQLSPGARTPYRVNVDL 291 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 S + + + ++ A++ P Sbjct: 292 SA-RDGGAAAGAVLDQLNVGGGNHANVSISAAHKAAVDSAAAGS------------GKPA 338 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + ++ L + ++ A +++ + I L + ++ Sbjct: 339 GRMVVVLAGKNQDELSVEQLKAQLA--AANPQVRVDIVGIGGDVAADELARIAEATGGKF 396 Query: 390 YNVVNADSLIHVFQNI 405 Y +A ++ V + I Sbjct: 397 YPARDAKAVDGVLKGI 412 >gi|149037632|gb|EDL92063.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] gi|149037633|gb|EDL92064.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] Length = 2140 Score = 41.8 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 45/169 (26%), Gaps = 22/169 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDS 294 ++ ++ LS + V +G++ Y+ K S + Sbjct: 263 DFLVNVLERLSVGNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANI 322 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+ +F R G + ++ ++ G S+ Sbjct: 323 GQALDF-----------VVENHFTRTGGSRVEEGVPQVLVLISAG-----PSSDEIRDAV 366 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV---NADSLIH 400 K+ + + + A + L+ + + V + L Sbjct: 367 VALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDLQE 413 >gi|302037436|ref|YP_003797758.1| hypothetical protein NIDE2113 [Candidatus Nitrospira defluvii] gi|300605500|emb|CBK41833.1| conserved protein of unknown function [Candidatus Nitrospira defluvii] Length = 359 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 17/240 (7%), Positives = 57/240 (23%), Gaps = 32/240 (13%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA--------------- 59 ++ +I+ A+S+ + + L ++ K+ +++ +A Sbjct: 21 RHDQRGAVAIVVAVSLSALVSLAALVLDGGAALVSKHQLQNLADAAALAGARQLGKLYES 80 Query: 60 --AILA-------GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 A ++ V + + + D + N ++ Sbjct: 81 QPAAISPPQPLSLANRARVQAVVAEVAERNRDATKLAAVTLSDVRVGRWNAATRTVVSSQ 140 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY---DYRLQFIEHLLN- 166 ++ + + RL S + + L Sbjct: 141 NGLDAVLVRAEGSMPTFLAGVVGIRRLSVSATA-VSALSALAEVQPGALTLPVGLSSAWM 199 Query: 167 ---QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 +++ + P G + ++ ++ + + Q Sbjct: 200 ALGPIDGRRLQVYAPGGGDPCAGWTTFTDTPPTMGRLQTIVRSLTGNSYTSPATVAGQTS 259 >gi|158256546|dbj|BAF84246.1| unnamed protein product [Homo sapiens] Length = 445 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 17/180 (9%), Positives = 52/180 (28%), Gaps = 35/180 (19%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ +R + + + T + A+ A + L ++ Sbjct: 74 SLVHVGSRPYTEFPFGQHSSGEAAQDAVRASAQRMGDTHTGLALVYAKEQLFAEASG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + + +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTVFIVSTGRGNF 175 Query: 375 GQRLLKTCVSSPE----YHYNVVNADSLIHVFQ-NI----------SQLMVHRKYSVILK 419 + L S+P + +V + ++ + +I + + + + Sbjct: 176 LE--LSAAASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQLHATEITSSGFRLAWP 233 >gi|148694797|gb|EDL26744.1| complement factor B, isoform CRA_g [Mus musculus] Length = 541 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 29/260 (11%), Positives = 70/260 (26%), Gaps = 31/260 (11%) Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 +V + + P+ V S + K Sbjct: 8 LVLLVLGFSSGGVSATPVLEARPQVSCSLEGVEIKGGSMNIYLVLDGSDSIGSSNFTGAK 67 Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DS 286 L ++ + GL+ Y T + + S + V + + + D Sbjct: 68 RCLTNLIEKVASYGVRPRY---GLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDH 124 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T++ A++ Y +++ II +TDG +N + Sbjct: 125 KLKSGTNTKRALQAVYSMMS--------WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGN 174 Query: 347 NVNTIKICDKA----------KENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVN 394 V I+ +E+++ + + + + + + V + Sbjct: 175 PVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKD 234 Query: 395 ADSLIHVFQNISQLMVHRKY 414 + L +VF + Sbjct: 235 MEDLENVFYQMIDETKSLSL 254 >gi|119576604|gb|EAW56200.1| von Willebrand factor A domain containing 1, isoform CRA_c [Homo sapiens] gi|119576605|gb|EAW56201.1| von Willebrand factor A domain containing 1, isoform CRA_c [Homo sapiens] Length = 410 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 17/180 (9%), Positives = 52/180 (28%), Gaps = 35/180 (19%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ +R + + + T + A+ A + L ++ Sbjct: 39 SLVHVGSRPYTEFPFGQHSSGEAAQDAVRASAQRMGDTHTGLALVYAKEQLFAEASG--- 95 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + + +S Sbjct: 96 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTVFIVSTGRGNF 140 Query: 375 GQRLLKTCVSSPE----YHYNVVNADSLIHVFQ-NI----------SQLMVHRKYSVILK 419 + L S+P + +V + ++ + +I + + + + Sbjct: 141 LE--LSAAASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQLHATEITSSGFRLAWP 198 >gi|40068485|ref|NP_073745.2| von Willebrand factor A domain-containing protein 1 isoform 1 precursor [Homo sapiens] gi|74749155|sp|Q6PCB0|VWA1_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|37747866|gb|AAH59409.1| Von Willebrand factor A domain containing 1 [Homo sapiens] gi|56205506|emb|CAI22657.1| von Willebrand factor A domain containing 1 [Homo sapiens] gi|119576602|gb|EAW56198.1| von Willebrand factor A domain containing 1, isoform CRA_b [Homo sapiens] Length = 445 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 17/180 (9%), Positives = 52/180 (28%), Gaps = 35/180 (19%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ +R + + + T + A+ A + L ++ Sbjct: 74 SLVHVGSRPYTEFPFGQHSSGEAAQDAVRASAQRMGDTHTGLALVYAKEQLFAEASG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + + +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTVFIVSTGRGNF 175 Query: 375 GQRLLKTCVSSPE----YHYNVVNADSLIHVFQ-NI----------SQLMVHRKYSVILK 419 + L S+P + +V + ++ + +I + + + + Sbjct: 176 LE--LSAAASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQLHATEITSSGFRLAWP 233 >gi|332974518|gb|EGK11438.1| PilC protein [Kingella kingae ATCC 23330] Length = 1328 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 37/156 (23%), Gaps = 18/156 (11%) Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 ++ D + + Y I + T + + + Sbjct: 278 YTESPL-RNLTSDLDFLDSDSAFYT-PSKRTAYGLYGMYPIFNYQPILAPQTQYAGEMLS 335 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS--PNGQRLLKT 381 S P K + ++ N I TI + +G LLK Sbjct: 336 FFSEPINKIDLRTVGNDDTGKSWNDPAFPT------QNITTFTIGFGSGLTSSGLALLKG 389 Query: 382 CVSS----PEY----HYNVVNADSLIHVFQNISQLM 409 S +Y + L F +I + Sbjct: 390 GASDIKLKDGTVQKAYYPANDQAGLQKAFDSIFDQI 425 >gi|303248311|ref|ZP_07334573.1| TadE family protein [Desulfovibrio fructosovorans JJ] gi|302490336|gb|EFL50248.1| TadE family protein [Desulfovibrio fructosovorans JJ] Length = 165 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 36/127 (28%), Gaps = 4/127 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E ++ FAL + + ++ +I V + ++ +A A + N + Sbjct: 13 RRRESGATAVEFALVLPVLVFMLLGIIEVANIL----RIQFTLESAATTVAHDISQNPNI 68 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 K+ + ++ S A+ T +S + + Sbjct: 69 TNQSAAQNLFDGKQDSYAPLVQQGRDTSDPSDPPALAMSPTTRPTCNSSSCTPFLITITY 128 Query: 135 NRLDSSN 141 + Sbjct: 129 TYKAMTA 135 >gi|219848640|ref|YP_002463073.1| hypothetical protein Cagg_1736 [Chloroflexus aggregans DSM 9485] gi|219542899|gb|ACL24637.1| conserved hypothetical protein [Chloroflexus aggregans DSM 9485] Length = 170 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 7/61 (11%), Positives = 19/61 (31%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + AL + L + +I + + +E A A + ++ + + Sbjct: 6 HGQALPLVALMLPILTLFVLVVIELANLWLAVAKLEDALQQATRSAVQQIDYAVLARNGQ 65 Query: 79 F 79 Sbjct: 66 A 66 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 29/182 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSID---LLSHVKEDVYM 254 + V + T KN ++ ++ + + + Sbjct: 601 TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV 660 Query: 255 GLIGYT-TRVEKNIEPSW----GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I+ ++ V I T + A+K AY L + Sbjct: 661 GVVQYSHEGTFEAIQLDDERINSLSSFKEAVKNLE--WIAGGTWTPSALKFAYNQLIKES 718 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +R F + +TDG ++ ++N +C + + + I I Sbjct: 719 RRQKTRVFA---------------VVITDGRHDPRDDDLNLRALC----NHDVTVTAIGI 759 Query: 370 NA 371 Sbjct: 760 GD 761 >gi|149183229|ref|ZP_01861674.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] gi|148849062|gb|EDL63267.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] Length = 245 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 34/86 (39%), Gaps = 7/86 (8%) Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKT--C 382 + ++ +TDG +N + + + AKE I + I + N S + + + + Sbjct: 3 TGTLRQVLLITDGCSNQGED---PVAMAALAKEQGITVNVIGVMENDSIDERGMEEIEGI 59 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL 408 S +V A L Q +++ Sbjct: 60 AMSGGGVSQIVYAQQLSQTVQMVTRK 85 >gi|330447624|ref|ZP_08311272.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491815|dbj|GAA05769.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 2023 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 32/384 (8%), Positives = 92/384 (23%), Gaps = 61/384 (15%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 I + D+ I ++ + + I + D Sbjct: 1404 YIVDVPAGMFEPDSNDGFSGDITADITINDKYGNTASANDEKDFDTIGSVITGSTSGDDL 1463 Query: 141 NNTI--------FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 + T Y + + L + + N + + ++ Sbjct: 1464 HGTGYNDLIIGDVYIESKNVNVNLVLDISGSMAMREVNPENIHLDNGHTSGKLTLVDRDG 1523 Query: 193 IELVVDLSGSM-----------------------HCAMNSDPEDVNSAPICQDKKRTKMA 229 ++ D + + + +++ Sbjct: 1524 VKEHFDFNSIDELSGILGLKFGSPLGFSVYSPELNITFPDGTTQTINDVYKDIGVVSRID 1583 Query: 230 ALKNALLLFLDSIDLLSHVKE--DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K ++ +++ + ++ ++ LI + ++V+ E W K + Sbjct: 1584 SAKESINHIVENYGETLNHEQLSNLKFSLITFASQVDGVKEFHWDFGK--------NTLV 1635 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--- 344 + + + + T+ + + F+TDG + Sbjct: 1636 TSDGQTIADYLVGVEPLKGPLSQTDYDTSLVAALNGFTNTDTSNIVYFVTDGMDTIGTNG 1695 Query: 345 ---KSNVNTIKICDKAKENFIKIVTISINASPN----GQRLLKTCVSSPEYHYN------ 391 + + + IV ++I + N +L S + + Sbjct: 1696 QQFNKDWVVSQTGGNIDKYKPTIVPVAIGGTINNTPEAHDILNQIASLGQGYKPDNSGHS 1755 Query: 392 ----VVNADSLIHVFQNISQLMVH 411 N D L N ++ Sbjct: 1756 QVIISTNTDQLDDAINNSFDNIIS 1779 >gi|156404021|ref|XP_001640206.1| predicted protein [Nematostella vectensis] gi|156227339|gb|EDO48143.1| predicted protein [Nematostella vectensis] Length = 3546 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 50/169 (29%), Gaps = 25/169 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS-- 270 DV+ + + +++ D + + ++ Y + Sbjct: 2208 DVDVVYALSPSSNSTFKTMTDSIKKVGDKFGT-----DRFHYSIVVYGNSTKNVSSFDTI 2262 Query: 271 -WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 E + + V + + + ++ A+ A +IL + Sbjct: 2263 IPYEEALEEEVNKAIRQSLSGNSNLEEALTNAKKILNT---------------PNTRKSS 2307 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 +K ++ +TD N + K+ E I +V+ I P L Sbjct: 2308 RKAVVIITD--NTSGKTPTVLKDTAKPLHEQGIIVVSSGIGDDPKYDEL 2354 >gi|227326309|ref|ZP_03830333.1| hypothetical protein PcarcW_02949 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 210 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 21/52 (40%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 SRF ++ S +A+ ++ AL+ L + + ++ ME Sbjct: 14 TSRFLSVMRRFWFSHRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQ 65 >gi|154250681|ref|YP_001411505.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154631|gb|ABS61848.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 187 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 15/156 (9%), Positives = 39/156 (25%), Gaps = 4/156 (2%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM----ESANNAAILAGAS 66 K+ + + ++ FAL + I + + +A + A + Sbjct: 16 LKRFLRNCAGIAAVEFALIFPVMIAFYFGSIETTNMLTANRRVTSVAYTAADITAQATSI 75 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + L + + +S S Sbjct: 76 SNSDLADIFAASSAILAPFSTTPLKVRITSVVAYSSNIAKVAWSDGLNIAPRSTGSTVSL 135 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 S A + + + T Y + + + F + Sbjct: 136 PSGLTTAGSSVIMAEVTYSYVSPISEAITETITFTD 171 >gi|290975425|ref|XP_002670443.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084002|gb|EFC37699.1| vWFA domain-containing protein [Naegleria gruberi] Length = 348 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 46/162 (28%), Gaps = 5/162 (3%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDL--LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 ++ K+ + + ++ S ++ V K + EK Sbjct: 43 DNTSSMSGEIKIAKDTVQSIITTLHDHFQSDLRFSAVSYRDHTDDYVVKEYPFTKNVEKA 102 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + YV D A+ A +++ P + Sbjct: 103 KGYVNEM---FAKGGGDMPEALASALKVVNEIPFNKKGRKICIWIADAPPHGMGASGDYF 159 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 DG + ++ IK+ +E + +I S N Q+L Sbjct: 160 PDGCKDEQGEIIDWIKLASHLQEKNVVFYSIICGRSKNDQQL 201 >gi|311251244|ref|XP_003124541.1| PREDICTED: integrin alpha-M-like [Sus scrofa] Length = 448 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 34/90 (37%), Gaps = 7/90 (7%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG---QRLLKT 381 K ++ +TDGE + + +A + I + + N + L T Sbjct: 201 ARENALKILVVITDGEKFG--DPLGYEDVIPEADRKGVIRYVIGVGDAFNSWKSREELNT 258 Query: 382 CVS--SPEYHYNVVNADSLIHVFQNISQLM 409 S S ++ + V N ++L + + + + Sbjct: 259 IASKPSGDHVFQVTNFEALKTIQNQLQEKI 288 >gi|227823966|ref|YP_002827939.1| Flp pilus assembly protein TadG [Sinorhizobium fredii NGR234] gi|227342968|gb|ACP27186.1| Flp pilus assembly protein TadG [Sinorhizobium fredii NGR234] Length = 201 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 13/149 (8%), Positives = 40/149 (26%), Gaps = 8/149 (5%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 F+ + + +I FA+ + F +++ I + + +A + M Sbjct: 17 QLFRSLLGDRRGATAIEFAILALPFFIVVFASIETFVAFAGEQLLANATD--------TM 68 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 + + ++A D+ + E ++ + Sbjct: 69 ARKIRTGEITKDMSEAEFRQAFCDEIAILLTCSATEVDEPAKLYLDVRQVADPSQFPAAV 128 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 A++ D + + + Sbjct: 129 PRVDASDTSDLDTSGFQFAPGAGKEFTMV 157 >gi|260800519|ref|XP_002595176.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] gi|229280420|gb|EEN51188.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] Length = 153 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 15/149 (10%), Positives = 42/149 (28%), Gaps = 19/149 (12%) Query: 235 LLLFLDS-IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + F+ L + V + ++ Y + + D + T Sbjct: 20 MKTFIGQLFAYLDIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDGIEYMGGYTL 79 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A+ A + +K K + +TDG + + + Sbjct: 80 TGKAIDFATNLHFDLRKG-------------ARADVTKIAVVITDGRSYDDVNRP----- 121 Query: 354 CDKAKENFIKIVTISINASPNGQRLLKTC 382 + ++ I + + + + + +L Sbjct: 122 ARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|326670664|ref|XP_003199264.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 2265 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 41/371 (11%), Positives = 94/371 (25%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 1505 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 1562 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 1563 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFVIGSRSSDSQELQRISQ 1615 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 1616 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 1663 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N Sbjct: 1664 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYL-NT 1719 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + V + A++ F Sbjct: 1720 YTTNEEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 1768 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 1769 QILVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 1821 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 1822 SVSDFSELPNV 1832 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 42/369 (11%), Positives = 94/369 (25%), Gaps = 49/369 (13%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 690 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 747 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 748 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFVIGSRSSDSQELQRISQ 800 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 801 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 848 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ E N + Sbjct: 849 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPEANFYLNTY 905 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T K + V + F + Sbjct: 906 TTK-EEIVDAVRGLRHKGGRPLYTG---------EALQFVRDNVFTASSGSRRLEGVPQI 955 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 ++ L+ G S + KE + T I + + R L+ P Y +V Sbjct: 956 LVLLSGGR-----SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSV 1008 Query: 393 VNADSLIHV 401 + L +V Sbjct: 1009 SDFSELPNV 1017 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 52/183 (28%), Gaps = 20/183 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D R A+K+ + ++ + + ++ Y+ E N + T K + Sbjct: 56 DGSDGTRNGFPAMKDFVQKMVEKLAAAE---NRDRISVVQYSREPEANFYLNTYTTK-EE 111 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + + F + ++ L+ Sbjct: 112 ILDAVRGLRHKGGRPLYTG---------AALQYVKDNVFTASSGSRRLEGVPQILVLLSG 162 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G KS + KE + T I + + R L+ P Y +V + L Sbjct: 163 G-----KSFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSVSDFSEL 215 Query: 399 IHV 401 +V Sbjct: 216 PNV 218 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 35/334 (10%), Positives = 93/334 (27%), Gaps = 43/334 (12%) Query: 89 ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ---------NIVNSSRISMTHMANNRLDS 139 +A +F+++++ + SG + +I + V+++ S+ + Sbjct: 926 LYTGEALQFVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAASSLKELGVLTFGI 985 Query: 140 SNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + Y L + +++++ + P E Sbjct: 986 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPTSPTVTAEYST 1045 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 +DV D R+ A+++ + ++ + + ++ Sbjct: 1046 PR------------KDVVFLLDGSDGTRSSFPAMRDFVQRVVEKFN---IEANRDRVSVV 1090 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 Y+ E + + K + R + + F Sbjct: 1091 QYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG---------AALQYVRDNVF 1140 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 + +I L+ G S + KE + T+ I + + R Sbjct: 1141 TASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSLKELGVL--TLGIGSRGSDSR 1193 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 L+ Y +V + L +V + + + Sbjct: 1194 ELQRISYEANYALSVADFSELPNVQEQLLASVQT 1227 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + V+++ S+ + + + Y L + +++++ + Sbjct: 1781 DSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 1840 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 1841 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDFVQR 1888 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 1889 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 1944 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 1945 ---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 1990 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 1991 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 2042 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 29/297 (9%), Positives = 78/297 (26%), Gaps = 34/297 (11%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIV 174 + + V+++ S+ + + + Y L + ++++ Sbjct: 164 KSFDSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLL 223 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + E +DV D R+ A+++ Sbjct: 224 ASVQVTSIPVTPTSTTVTAEYSTPR------------KDVVFLLDGSDGTRSSFPAMRDF 271 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + ++ + + ++ Y+ E + + K + R Sbjct: 272 VQRVVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPL 327 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 + + F + +I L+ G S + Sbjct: 328 YTG---------AALQYVRDNVFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAA 373 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 374 SSLKELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 428 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 79/260 (30%), Gaps = 38/260 (14%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + N + + + + ++ + +++D+SGSM Sbjct: 43 SALMGNGDGSYSLTVSVSSTDMDTAQQQTESDVVVLMDVSGSM----------------- 85 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-- 278 T M KNA+ + + L+ + V M ++ +++ + E S G+E Sbjct: 86 ---TTTDMKVAKNAVNGLANQL--LNDENDTVRMSIVRFSSEAKTL-EFSNGSEWTHSPA 139 Query: 279 -YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 T+ A++ A ++ D R + G + T Sbjct: 140 LVAQALNTLTSRGNTNWDGALQNASALVQGDSARKSYVVLMSDGYPNTINSCYPAVANCT 199 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS--------PEYH 389 D + + I+ + +I + + + +K V P Sbjct: 200 DTSWSEPNAVPKAIEAANTM--PNTQIYAV--STRTSASESMKELVDGINAKAPKYPAQI 255 Query: 390 YNVVNADSLIHVFQNISQLM 409 + SL + F I+ + Sbjct: 256 MYGTDQQSLNNAFDTIADAI 275 >gi|78060311|ref|YP_366886.1| TadE-like protein [Burkholderia sp. 383] gi|77964861|gb|ABB06242.1| TadE-like protein [Burkholderia sp. 383] Length = 177 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 10/138 (7%), Positives = 40/138 (28%), Gaps = 2/138 (1%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FAL +M ++L + Y+ ++ A A + + ++ + + Sbjct: 12 RGAVAVEFALVLMPMIMLATGVAEFGRAIYQFETLTKATRDAARYLSVWLPTDSAYPVSQ 71 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + + + + ++ A T + + + + Sbjct: 72 AQCLVVYGSTTCGASGTELVPGLTTSMVTICDAQ--RTTGCSDGSDPSQFANLPTYDANN 129 Query: 139 SSNNTIFYNMDVMTSYDY 156 ++ + + Sbjct: 130 NAASGTATGAINVVEVKI 147 >gi|326670662|ref|XP_003199263.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1024 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 42/371 (11%), Positives = 96/371 (25%), Gaps = 53/371 (14%) Query: 41 IYVLD------WHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDA 94 + + + E A + + G G + + ++ A Sbjct: 46 VQYSRDPEAHFYLNTYSRKEDALD--TIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSR 103 Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI--FYNMDVMT 152 + + LSG + V++ S+ + + + + Sbjct: 104 RVEGVPQLLILLSG-------ARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQ 156 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE 212 Y L + Q++ + I + + + E Sbjct: 157 EPSYALSVSDFTDLPSVQQQLFTNINKVFVAGVPTTSTTIAEGRRQR------------R 204 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV D R A+K+ + ++ + + + ++ Y+ N + Sbjct: 205 DVVFLLDGSDGTRNGFPAMKDFVQRMVEKL---DVAENRDRISVVQYSREPGANFYLNTY 261 Query: 273 TEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K + V + A++ F Sbjct: 262 TTK-EEIVDAVRGLRHKGGRPLYTGEALQY-----------VRDNVFTASSGSRRLEGVP 309 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 + ++ L+ G S + KE + T I + + R L+ P Y Sbjct: 310 QILVLLSGGR-----SFDSVNAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYAL 362 Query: 391 NVVNADSLIHV 401 +V + L +V Sbjct: 363 SVSDFSELPNV 373 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 78/294 (26%), Gaps = 34/294 (11%) Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYN--MDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 + VN++ S+ + + + Y L + +++++ + Sbjct: 322 DSVNAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASV 381 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P E +DV D R+ A+++ + Sbjct: 382 QVTSIPVTPTSPTVTAEYSTPR------------KDVVFLLDGSDGTRSSFLAMRDFVQR 429 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + + ++ Y+ E + + K + R Sbjct: 430 VVEKFN---IEANRDRVSVVQYSRDAEVHFYLNS-YTKKEDILDRVTGLRHKGGRPLYTG 485 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + F + +I L+ G S + Sbjct: 486 ---------AALQYVRDNIFTASSGSRRLEGVPQILILLSGGR-----SFDSVDAAASSL 531 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 KE + T+ I + + R L+ Y +V + L +V + + + Sbjct: 532 KELGVL--TLGIGSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQT 583 >gi|312139253|ref|YP_004006589.1| tade-like protein [Rhodococcus equi 103S] gi|311888592|emb|CBH47904.1| putative TadE-like protein [Rhodococcus equi 103S] Length = 133 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 39/131 (29%), Gaps = 5/131 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 K + S+ ++ FAL V + L+ ++ + ++A +AA GA M Sbjct: 1 MKRLTSDSGVAAVEFALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIK 56 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS-AVFYNTEIQNIVNSSRISMT 130 R I DA+ I + S S + ++ Sbjct: 57 KDPAAARAAVKGAGVFSPAITDAEICISTSGTQGCSATSCPSGSTVTLTVSYPLEYMTGL 116 Query: 131 HMANNRLDSSN 141 L + Sbjct: 117 FPGKPTLTGTG 127 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 52/168 (30%), Gaps = 21/168 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 K +++ LD++ +V + + + ++ + + + Sbjct: 241 KTSVMEMLDTLSDDDYVTVASFHEKADPVSCFRQLVQ--ANVRNKKVIKEAVQEMVARGT 298 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 TD + A+ L + K I+ TDG + + Sbjct: 299 TDYKAGFEYAFSQLQNTSITRANC--------------NKMIMMFTDGGEDRVQDVFEKY 344 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSL 398 +K +++ T S+ L+ ++ Y++ + + ++ Sbjct: 345 NWPNKT----VRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|16127181|ref|NP_421745.1| hypothetical protein CC_2951 [Caulobacter crescentus CB15] gi|221235982|ref|YP_002518419.1| TadE-like pilus assembly protein [Caulobacter crescentus NA1000] gi|13424579|gb|AAK24913.1| hypothetical protein CC_2951 [Caulobacter crescentus CB15] gi|220965155|gb|ACL96511.1| TadE-related pilus assembly protein [Caulobacter crescentus NA1000] Length = 183 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 36/127 (28%), Gaps = 11/127 (8%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLL-IGFLIYVLDWHYKKNSMESANNAAIL 62 +SR R + ++ FAL +++ G V + + + Sbjct: 1 MSR-RRPLSSFWRDRRGVSAVEFALIAPVMIVMYCGLA-EVTQAMMAQRRLSN------- 51 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 AS++ +++ + F ++ ++ ++ + Sbjct: 52 -IASQIGDLVAQSNQTGPTKMADVFTIGGIIMAPFPTATLRMCVASVTSDATGRDTVAWS 110 Query: 123 NSSRISM 129 +S Sbjct: 111 RASGTMT 117 >gi|196228899|ref|ZP_03127765.1| exporter-like protein [Chthoniobacter flavus Ellin428] gi|196227180|gb|EDY21684.1| exporter-like protein [Chthoniobacter flavus Ellin428] Length = 796 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 10/119 (8%), Positives = 29/119 (24%), Gaps = 4/119 (3%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +A + + K+ ++S S + + D F + S F Sbjct: 133 ADATLKSSVEKLGESISPQEVALLSYDPYNVAGALGDLGSFAGTAQQSEFSSADGTFRVL 192 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT----SYDYRLQFIEHLLNQRYNQ 171 + + N N++ + + + + + + Sbjct: 193 YVVSAHNFPNYKAAIAWVNQIKQIAHATVTDSAIHLGFTGEPAFVADISGSMEWDMTSS 251 >gi|16331430|ref|NP_442158.1| hypothetical protein slr0226 [Synechocystis sp. PCC 6803] gi|1001600|dbj|BAA10228.1| slr0226 [Synechocystis sp. PCC 6803] Length = 482 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 44/405 (10%), Positives = 106/405 (26%), Gaps = 31/405 (7%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-------ANN 58 R + ++ +++ F++ A+ V L +IG +++ + A + Sbjct: 8 RLIIFSRR-RSADNGGFALPMAIMVGLVLTVIGI------VMMQRSMFQQNDSTSKAATD 60 Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS----GYSAVFY 114 A A + + ++ + + DA + S + Sbjct: 61 QAQNAAEAGITKIINLMNTPANRFISALPDCQNWDAGSNTCTDTGSTPSWHNLPNLGSWT 120 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 T + T + + + T+ + SY Y Sbjct: 121 ITTPGAACQPPATTTTDITATASNRAWKTLPQGEFRLVSYQYTPDTANGGAIGLGPGTGT 180 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + IE+ + ++ + A S +A + + Sbjct: 181 LVVEGAINRNAANEARSRIEVKIPITRLPNDASVSSLWVTETATGDEIVSGNQDVYSDII 240 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + S+D +V +G Y+ + N+ P + T + + T Sbjct: 241 VNDCDASVDPADFPNSEVAIGTYEAIYSNVPQPNLPPYPTCPASNTFATMSGSMTLPRST 300 Query: 293 DSTPAMKQAYQILTSDKKRSFF-----TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 D++ + N I ++F++ + + N N Sbjct: 301 DASTTITLNDGTNLPAYVYCIDKIALGGNDTINVRTINPTSGERFLVLMHVKGDINVSGN 360 Query: 348 VNT----IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + C K + I S + + S Sbjct: 361 THLNHSCSGSCADYKPTDLNIF--GYGQSAHSSTPPQILASGGNN 403 >gi|239995187|ref|ZP_04715711.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii ATCC 27126] Length = 1356 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 20/60 (33%), Gaps = 1/60 (1%) Query: 362 IKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 + TI A+ N L S Y N+ L+ F I + + + + G Sbjct: 637 VITHTIGFAANNNANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSVKDINATFVSPG 696 >gi|157831431|pdb|1IDO|A Chain A, I-Domain From Integrin Cr3, Mg2+ Bound Length = 189 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 62/198 (31%), Gaps = 25/198 (12%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 P++ + D + + + F+ ++ K L+ Y+ + Sbjct: 3 PQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQ-LKKSKTLFSLMQYSEEFRIHFTFK 61 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 R V L T + +++ + L + + F Sbjct: 62 EFQNNPNPRSLVKPITQLL--GRTHTATGIRKVVRELFNITNGARKNAF----------- 108 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSS 385 K ++ +TDGE + + +A + I + S ++ L T S Sbjct: 109 --KILVVITDGEKFG--DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASK 164 Query: 386 P--EYHYNVVNADSLIHV 401 P ++ + V N ++L + Sbjct: 165 PPRDHVFQVNNFEALKTI 182 >gi|138893737|ref|YP_001124190.1| hypothetical protein GTNG_0058 [Geobacillus thermodenitrificans NG80-2] gi|196250779|ref|ZP_03149466.1| conserved hypothetical protein [Geobacillus sp. G11MC16] gi|134265250|gb|ABO65445.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] gi|196209729|gb|EDY04501.1| conserved hypothetical protein [Geobacillus sp. G11MC16] Length = 246 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 35/87 (40%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKT 381 + I+ +TDG +N+ + + A+E I + I + NGQR ++ Sbjct: 2 RKGTLRQILLITDGCSNHGED---PAAMAALAREQGITVNVIGVLDQDTIDENGQREIEA 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ VV A L Q +++ Sbjct: 59 IAAAGGGMSQVVYARQLSQTVQMVTRQ 85 >gi|226493950|ref|NP_001146885.1| von Willebrand factor type A domain containing protein [Zea mays] gi|195604730|gb|ACG24195.1| von Willebrand factor type A domain containing protein [Zea mays] Length = 484 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 13/86 (15%), Positives = 30/86 (34%), Gaps = 4/86 (4%) Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE--NFIKIVTISINASPNGQRLL 379 FI+ ++DG + FK +I D + + T + + + + L Sbjct: 9 DDRADKKRAGFIVLISDGLDGQFKWGDESITPTDPIRGLLRKYPVHTFGLGKAHDPRALH 68 Query: 380 KTCVSSPEYHYNVVN--ADSLIHVFQ 403 S + ++ + + +I F Sbjct: 69 YIANISYGIYSSITDNLENKIIEAFA 94 >gi|156409371|ref|XP_001642143.1| predicted protein [Nematostella vectensis] gi|156229284|gb|EDO50080.1| predicted protein [Nematostella vectensis] Length = 332 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 16/130 (12%), Positives = 42/130 (32%), Gaps = 21/130 (16%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +++ S + + ++ Y+ R K + + K+R + T A+ Sbjct: 30 TLNAFSLRQRRTRVSVVLYSNRPFKVFGFNRYSSKLRVIRAINYMRYPRGGTKLRRALYY 89 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + L + ++R ++ ++ LTDG + Sbjct: 90 VKRYLFTRRQRGR----------------KQVLVVLTDGISRRG-----VKAPAISLHRA 128 Query: 361 FIKIVTISIN 370 +++ +I I Sbjct: 129 GVEVHSIGIG 138 >gi|152999662|ref|YP_001365343.1| type IV pilin biogenesis protein [Shewanella baltica OS185] gi|151364280|gb|ABS07280.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS185] Length = 1169 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 30/290 (10%), Positives = 70/290 (24%), Gaps = 66/290 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V +++ ++ A K +++ K A+ + S + Sbjct: 208 AAANNTDFGVGQPVTLYTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDF 267 Query: 248 V--KEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQI 304 + G + T+ R +T + L T M +AY+ Sbjct: 268 GLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRY 327 Query: 305 LTSDK---------------KRSFFTNFFRQGVKIPSLPFQ----KFIIFLTDGENN--- 342 + + + +G S ++I++TDG Sbjct: 328 FAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDK 387 Query: 343 -----------------NFKSNVNTIKICDKAKE----------------------NFIK 363 N+ S + ++ Sbjct: 388 NANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQVQNVR 447 Query: 364 IVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI A L +T ++ N+ L + + +++ Sbjct: 448 TYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|260793652|ref|XP_002591825.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] gi|229277036|gb|EEN47836.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] Length = 2660 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 41/118 (34%), Gaps = 19/118 (16%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T + A+K + + + +K + +TDG++N Sbjct: 2068 DAKGGTATRLALKFLRESVIPEAVAELDRPDES----------KKALFLITDGKSNTGGD 2117 Query: 347 NVNTIKICDKAKEN-FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + K +E ++I TI I+ + L + SSP+ + +L F+ Sbjct: 2118 PS---EEARKLREELGLEIYTIGISNDV-SKTELASVASSPKK----EHMFTLKDYFR 2167 >gi|260826343|ref|XP_002608125.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] gi|229293475|gb|EEN64135.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] Length = 1803 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 18/180 (10%), Positives = 42/180 (23%), Gaps = 21/180 (11%) Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 S + + + +K L F S + ++ + T + + Sbjct: 52 DRSGSVGRSNYDKIIDFVKAVLNHF-------SVSPTTTRVAVVSFGTSARVEFDLLRSS 104 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 L K +Y ++ + K + Sbjct: 105 SNDNNKCELLRTHLP----------KLSYTGGATNTVGALRLALALLKNPGVRSYSTKVV 154 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNV 392 +TDG N + + + + + I + + RL + V Sbjct: 155 FTITDGYWNRGGDPAY---VVRELQSRGVIMFAFGIGSWGISSYRLSALGNRDSGRYKYV 211 >gi|163747460|ref|ZP_02154812.1| hypothetical protein OIHEL45_00420 [Oceanibulbus indolifex HEL-45] gi|161379313|gb|EDQ03730.1| hypothetical protein OIHEL45_00420 [Oceanibulbus indolifex HEL-45] Length = 179 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 5/72 (6%), Positives = 26/72 (36%), Gaps = 1/72 (1%) Query: 3 LLSRFRFYFKKGIASEKANFSII-FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 ++ + +++ E + +I FA+ + + + + + ++ + + Sbjct: 1 MIQKTIKAWRRFRGDENGSVMLIEFAILSPLLFGCLIMSVEMSFYAMRHMFLDRGLDMTV 60 Query: 62 LAGASKMVSNLS 73 +++S Sbjct: 61 RYVRLNTNTDMS 72 >gi|319955584|ref|YP_004166851.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319424244|gb|ADV51353.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 363 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 61/188 (32%), Gaps = 19/188 (10%) Query: 229 AALKNALLLFLDSIDL-LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 LK A + F++++ + + + + + + E + + D + Sbjct: 142 QELKTASVSFVNNVMPEVESESNKMAIYWFDGEDELHLLNDLTSSKEDLIASIDAITDDI 201 Query: 288 IL-KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD A+ ++ + K + + I+ TDG + + Sbjct: 202 SNDPSTDLYGAVIKSTEKAEKLIKETKNNDIIGAA----------SIVIFTDGTDQASRY 251 Query: 347 NVNTIKICDKAKENFIKI--VTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 DK K + I TI + + + + +L + + N L F + Sbjct: 252 TE--KAALDKVKNASLNISYFTIGLGSEIDTE-VLASIGKTSSVF--AGNKAELETTFND 306 Query: 405 ISQLMVHR 412 IS L+ R Sbjct: 307 ISYLVSQR 314 >gi|319788877|ref|YP_004090192.1| Cna B domain protein [Ruminococcus albus 7] gi|315450744|gb|ADU24306.1| Cna B domain protein [Ruminococcus albus 7] Length = 2109 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 33/354 (9%), Positives = 84/354 (23%), Gaps = 33/354 (9%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+ + L + + N K ++ D + + + + N+ Sbjct: 390 ATALSDLLDKTTFVVSTTINGTKYYVVADLNSAGTGYYFSHYTNNIVNATKFKTLSKTNN 449 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMT---------SYDYRLQFIEHLLNQRYNQKIVS 175 + S V S + N ++ Sbjct: 450 KGTFTVCDLPTSMSSGRTYTVTEYSVPDNDRYEKLSKSVSLPSPTSDFTTNAGTGTVKLN 509 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA------ 229 ++G + + D ++ D + + Sbjct: 510 NNEEGYDAKVGTATLDKVVHNGD-GKALSTDNADDIKTLADIYKTTKFIIGYWDGSTMRY 568 Query: 230 ------ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + K+A LD ++ + G+ T+++ + ++ + +++ Sbjct: 569 LTNGYLSAKDAFSGNLDDLNNFNDTAYKSGDGMYYAPTKLDSSHNVTFDMSRTSTDISKA 628 Query: 284 ---MDSLILKPTDSTPAMKQAYQILTS-DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 +D+ Q Y L + F + + I Sbjct: 629 YVFDAGYNYSGSDTCDYFGQVYLNLLPLNSSGVSQEVVFIEVNGAKGYGYDDDIDVSKAV 688 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + S + I K + T+ G + V S +Y + Sbjct: 689 NLSTLGSVSDKRNISGVMKNDTASSYTV---TDSKGGKY----VISSGKYYPIS 735 >gi|183983524|ref|YP_001851815.1| hypothetical protein MMAR_3543 [Mycobacterium marinum M] gi|183176850|gb|ACC41960.1| conserved membrane protein [Mycobacterium marinum M] Length = 983 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 55/177 (31%), Gaps = 22/177 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMD 285 KM A + A +D++ + I Y V + +++ R + V Sbjct: 315 KMTAARRAASRIVDALTSDDRFAVLTFDDGIEY--PVGLPAGLTEASDRHRYRAVEHLAR 372 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T+ +++A +L ++ +I ++DG+ Sbjct: 373 VEARGDTEMLAPLRRALALLGREQVADTDDA---------------VLILISDGQ--VGN 415 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + ++ +++ TI ++ + N L + V N D L Sbjct: 416 EDQLLQELSGDLGR--VRLHTIGVDEAVNAGFLRRLAGVGGGRCVLVDNEDRLDEAL 470 >gi|220922540|ref|YP_002497842.1| hypothetical protein Mnod_2575 [Methylobacterium nodulans ORS 2060] gi|219947147|gb|ACL57539.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 506 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 18/152 (11%), Positives = 43/152 (28%), Gaps = 1/152 (0%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + A + ++ ++ L + + + + M++A +A LAGA+ Sbjct: 23 LLSQCRSLWADSRGIMLPYVSIMLVVIAGLSLLGLDGIRYMSLQTQMQAAADALALAGAA 82 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ +I N L + V ++ Sbjct: 83 ELNGKAGARARAQAAIDNLLSNGLSGMGVT-APVQVTVQFKRTLPVASQLPTGGTAATAD 141 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + + NTIF V ++ Sbjct: 142 SDAHFVIVDASPITLNTIFPASFVNSALANTF 173 >gi|238062108|ref|ZP_04606817.1| hypothetical protein MCAG_03074 [Micromonospora sp. ATCC 39149] gi|237883919|gb|EEP72747.1| hypothetical protein MCAG_03074 [Micromonospora sp. ATCC 39149] Length = 150 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 12/133 (9%), Positives = 31/133 (23%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + ++ S+ A +++ L +IG + ++ A G + Sbjct: 11 RRWRGDEGRVSLFLAAAMIGVLAIIGLAFDGAGQLRSRQLADNLAAEAARTGGQSIDRAK 70 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + G E A+ A+ + + Sbjct: 71 AIEGGSKEIDEAAARTAVTSYLAAAGATDHDVEFPVVGGEKLIRVRVTVTYDRYLLGFFG 130 Query: 133 ANNRLDSSNNTIF 145 + + S Sbjct: 131 FGDAVTVSGEATA 143 >gi|115390819|ref|XP_001212914.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114193838|gb|EAU35538.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 1013 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 73/242 (30%), Gaps = 26/242 (10%) Query: 156 YRLQFIEHLLNQRYNQKIV--SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L + R ++ S + I + S +++ + Sbjct: 438 FDLDNSGEEEDYRASKIRRQASLNSSYGAARSNNTAITDYTNAGGDTLS-TTSLHIPLDI 496 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 V P+ + K+ L++AL ++++ MGL+ + + T Sbjct: 497 VVVIPVSSSMQGLKITLLRDALKFLVNNLGPRD------RMGLVTFGSSGGGV-PLVGMT 549 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K +++ ++ L +D + S P Sbjct: 550 TKSW-----------GGWPKILNSIRPVGHKSLRADVVEGANVAMDLLMQRKSSNPLS-T 597 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 I+ ++D ++ S + + AK + I + + + +++ + + V Sbjct: 598 ILLISDSSTSDPDSVDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYLYV 654 Query: 393 VN 394 + Sbjct: 655 KD 656 >gi|311254856|ref|XP_003125982.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 825 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 20/230 (8%), Positives = 63/230 (27%), Gaps = 34/230 (14%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNS--DPEDVNSAPICQDKKRTKMAALKNALL 236 ++ V++ SG M D + + + ++ + A+ Sbjct: 174 IAKSKKIEATSSSSWCPVLEFSGVAAAYMYPEHDTKAPPLSRLIPSLNHNRLNRMNQAVK 233 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDSLILKPTDST 295 FL L ++ ++G++ + T + R+ + + + Sbjct: 234 YFL-----LQTIENGSWVGVVDFDTTAHIKSKLIQIKSNNERRKLLESLPTEASGGISIC 288 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 ++ A+Q++ + + D Sbjct: 289 SGIESAFQVIKEIYPQVDGSEIILVVAGEDKNIRNCM----------------------D 326 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS---LIHVF 402 + K++ I +I++ + + + + + H+ + L Sbjct: 327 RVKQSGAIIHSIALGPNADPA-VTEMSAVTGGMHFYTTDQSESRGLTDAL 375 >gi|170746809|ref|YP_001753069.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170653331|gb|ACB22386.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 204 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 14/136 (10%), Positives = 36/136 (26%), Gaps = 5/136 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI---LAGASK 67 + E+ ++ FAL + ++L + + A L+G Sbjct: 8 LSRFRRDERGIAAVEFALVLPLLIILYFGTAELTRVVDATRKL--TLFARTLGDLSGRMD 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + + R L + + N + + + + R Sbjct: 66 NALATQDGMTKIAGAATAILRPLDASGLQIVVNAMGVESINGTLKGFVCSSWPQNATKRP 125 Query: 128 SMTHMANNRLDSSNNT 143 + +N L ++ Sbjct: 126 ANQANGSNGLPATPAA 141 >gi|56418593|ref|YP_145911.1| hypothetical protein GK0058 [Geobacillus kaustophilus HTA426] gi|261417558|ref|YP_003251240.1| hypothetical protein GYMC61_0058 [Geobacillus sp. Y412MC61] gi|297528433|ref|YP_003669708.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|319765215|ref|YP_004130716.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] gi|56378435|dbj|BAD74343.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426] gi|261374015|gb|ACX76758.1| conserved hypothetical protein [Geobacillus sp. Y412MC61] gi|297251685|gb|ADI25131.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|317110081|gb|ADU92573.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] Length = 246 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 35/87 (40%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKT 381 + I+ +TDG +N+ + + A+E I + I I NG+R ++ Sbjct: 2 RKGTLRQILLITDGCSNHGED---PAAMAALAREQGITVNVIGILDQGAMDENGRREIEA 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 ++ VV A L Q +++ Sbjct: 59 IAAAGGGMSQVVYAKQLSQTVQMVTRQ 85 >gi|288956975|ref|YP_003447316.1| hypothetical protein AZL_001340 [Azospirillum sp. B510] gi|288909283|dbj|BAI70772.1| hypothetical protein AZL_001340 [Azospirillum sp. B510] Length = 196 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 12/77 (15%), Positives = 29/77 (37%), Gaps = 2/77 (2%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 K + ++ FA+ +L+ + + + +N +E A A AG + ++ + Sbjct: 24 RKGSVAVEFAIVAPMIILVFIAVFELGMLEFSRNCLELAARQASRAGVTGVL--PTGYKT 81 Query: 78 RFESISNHAKRALIDDA 94 R ++I Sbjct: 82 REDAIQALVTSLTAGYF 98 >gi|282854078|ref|ZP_06263415.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583531|gb|EFB88911.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|314981158|gb|EFT25252.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091981|gb|EFT63957.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 322 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 65/212 (30%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVMSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGNNTQGGSPLVAANRAAAAK---VSVYTIAFGTETGYVDLDGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAKSWTADSADKLQEVYQQVHSSVG 287 >gi|171693029|ref|XP_001911439.1| hypothetical protein [Podospora anserina S mat+] gi|170946463|emb|CAP73264.1| unnamed protein product [Podospora anserina S mat+] Length = 1378 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 32/320 (10%), Positives = 71/320 (22%), Gaps = 37/320 (11%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF- 145 D + + G + E I + + N L + + F Sbjct: 220 ASRYGDKPQDYNNAASTSIPEGLTIQIEVVEAGKIASIVSPTHKVTVENWLGTRAASSFA 279 Query: 146 ----YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER---------PIFL 192 + R ++ E + Sbjct: 280 DLVGEDTRSSVETALVKLETGSAFLDRDFVLDIATGGPNDEAESPQAWIEKHPTLPNQQA 339 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + +D ++ K+ +L+ A+ FL I Sbjct: 340 LMVTIPPGFTTRTSNPTDQTEILLLADLSGSMDDKLTSLRAAMQFFLKGI------PNGR 393 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 + + + + S + A ++ + Sbjct: 394 KFNVWCFGSSYKSWQPHSVDYGEA----------------SYQSASSWVDTNFHANMGGT 437 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 + V II LTDGE + I+ E I+ + I Sbjct: 438 ELLPAVQAIVTARDKRLPTDIIILTDGETWRLDETLEYIRKQRDLTEGGIRFFALGIG-P 496 Query: 373 PNGQRLLKTCVSSPEYHYNV 392 L++ + V Sbjct: 497 AVSHALVEGIAKVGGGYAEV 516 >gi|108758893|ref|YP_632824.1| pilus biogenesis protein [Myxococcus xanthus DK 1622] gi|108462773|gb|ABF87958.1| pilus biogenesis protein, TadE family [Myxococcus xanthus DK 1622] Length = 397 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 51/140 (36%), Gaps = 2/140 (1%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + + ++ FALSV ++++ F +Y+ + K ++ A A+ S +S+ Sbjct: 6 QMRSRSRGAATVEFALSVPLLVMILMFSMYLTELVRAKLKLQEAARYAVWEMTSYALSDF 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + G ++ + + A + +R+ +G F S + + Sbjct: 66 A-NGKHDDAFEDARREAHKEFVERYKDMD-SVEPNGTGGNFIARYTDVTATISNKEIALL 123 Query: 133 ANNRLDSSNNTIFYNMDVMT 152 + L + + + Sbjct: 124 ESGMLSRPSTSEGGGLAGAI 143 >gi|53719511|ref|YP_108497.1| hypothetical protein BPSL1897 [Burkholderia pseudomallei K96243] gi|76809138|ref|YP_333338.1| hypothetical protein BURPS1710b_1939 [Burkholderia pseudomallei 1710b] gi|121600603|ref|YP_993100.1| TadE family protein [Burkholderia mallei SAVP1] gi|124384304|ref|YP_001026123.1| hypothetical protein BMA10229_A0115 [Burkholderia mallei NCTC 10229] gi|126438724|ref|YP_001058805.1| TadE family protein [Burkholderia pseudomallei 668] gi|126450097|ref|YP_001080605.1| TadE family protein [Burkholderia mallei NCTC 10247] gi|126455238|ref|YP_001066056.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|134282206|ref|ZP_01768911.1| TadE family protein [Burkholderia pseudomallei 305] gi|167002484|ref|ZP_02268274.1| TadE family protein [Burkholderia mallei PRL-20] gi|167719789|ref|ZP_02403025.1| TadE family protein [Burkholderia pseudomallei DM98] gi|167738789|ref|ZP_02411563.1| TadE family protein [Burkholderia pseudomallei 14] gi|167816012|ref|ZP_02447692.1| TadE family protein [Burkholderia pseudomallei 91] gi|167824388|ref|ZP_02455859.1| TadE family protein [Burkholderia pseudomallei 9] gi|167845919|ref|ZP_02471427.1| TadE family protein [Burkholderia pseudomallei B7210] gi|167894495|ref|ZP_02481897.1| TadE family protein [Burkholderia pseudomallei 7894] gi|167902901|ref|ZP_02490106.1| TadE family protein [Burkholderia pseudomallei NCTC 13177] gi|167911138|ref|ZP_02498229.1| TadE family protein [Burkholderia pseudomallei 112] gi|167919161|ref|ZP_02506252.1| TadE family protein [Burkholderia pseudomallei BCC215] gi|217423591|ref|ZP_03455092.1| TadE family protein [Burkholderia pseudomallei 576] gi|226199670|ref|ZP_03795223.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237812065|ref|YP_002896516.1| hypothetical protein GBP346_A1807 [Burkholderia pseudomallei MSHR346] gi|238562619|ref|ZP_04610141.1| conserved hypothetical protein [Burkholderia mallei GB8 horse 4] gi|242315970|ref|ZP_04814986.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254178368|ref|ZP_04885023.1| TadE family protein [Burkholderia mallei ATCC 10399] gi|254179959|ref|ZP_04886558.1| TadE family protein [Burkholderia pseudomallei 1655] gi|254188631|ref|ZP_04895142.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|254198024|ref|ZP_04904446.1| TadE family protein [Burkholderia pseudomallei S13] gi|254199898|ref|ZP_04906264.1| TadE family protein [Burkholderia mallei FMH] gi|254206230|ref|ZP_04912582.1| TadE family protein [Burkholderia mallei JHU] gi|254259054|ref|ZP_04950108.1| TadE family protein [Burkholderia pseudomallei 1710a] gi|254297795|ref|ZP_04965248.1| TadE family protein [Burkholderia pseudomallei 406e] gi|254358353|ref|ZP_04974626.1| TadE family protein [Burkholderia mallei 2002721280] gi|52209925|emb|CAH35897.1| putative membrane protein [Burkholderia pseudomallei K96243] gi|76578591|gb|ABA48066.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|121229413|gb|ABM51931.1| TadE family protein [Burkholderia mallei SAVP1] gi|124292324|gb|ABN01593.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126218217|gb|ABN81723.1| TadE family protein [Burkholderia pseudomallei 668] gi|126228880|gb|ABN92420.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|126242967|gb|ABO06060.1| TadE family protein [Burkholderia mallei NCTC 10247] gi|134246244|gb|EBA46333.1| TadE family protein [Burkholderia pseudomallei 305] gi|147749494|gb|EDK56568.1| TadE family protein [Burkholderia mallei FMH] gi|147753673|gb|EDK60738.1| TadE family protein [Burkholderia mallei JHU] gi|148027480|gb|EDK85501.1| TadE family protein [Burkholderia mallei 2002721280] gi|157807798|gb|EDO84968.1| TadE family protein [Burkholderia pseudomallei 406e] gi|157936310|gb|EDO91980.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|160699407|gb|EDP89377.1| TadE family protein [Burkholderia mallei ATCC 10399] gi|169654765|gb|EDS87458.1| TadE family protein [Burkholderia pseudomallei S13] gi|184210499|gb|EDU07542.1| TadE family protein [Burkholderia pseudomallei 1655] gi|217393449|gb|EEC33470.1| TadE family protein [Burkholderia pseudomallei 576] gi|225928256|gb|EEH24290.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|237505940|gb|ACQ98258.1| conserved hypothetical protein [Burkholderia pseudomallei MSHR346] gi|238522205|gb|EEP85651.1| conserved hypothetical protein [Burkholderia mallei GB8 horse 4] gi|242139209|gb|EES25611.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|243061824|gb|EES44010.1| TadE family protein [Burkholderia mallei PRL-20] gi|254217743|gb|EET07127.1| TadE family protein [Burkholderia pseudomallei 1710a] Length = 155 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 42/154 (27%), Gaps = 6/154 (3%) Query: 7 FRFYFKKGI--ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILA 63 + ++ +I FA+ F L++ +I + S+ +A A A Sbjct: 1 MTRALGRLRSPRRQRGATAIEFAILFPMFFLILYGIITYGMIFAAQQSLTLAATEGARAA 60 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQN 120 ++ S + + A L A + S T Sbjct: 61 LNYQVAQTQSAALGLRAAAACTAANNLTGWLSGATCTTSTNYTCSYDSTMYCIQVTLTYP 120 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + + + L ++ + T+ Sbjct: 121 YAANPLVPAVALFDAVLPTTLTSRATVQINPTNI 154 >gi|13475442|ref|NP_107006.1| hypothetical protein mlr6511 [Mesorhizobium loti MAFF303099] gi|14026194|dbj|BAB52792.1| mlr6511 [Mesorhizobium loti MAFF303099] Length = 537 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 10/112 (8%), Positives = 31/112 (27%), Gaps = 6/112 (5%) Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 S A+ A L K + + + ++ +TDG + Sbjct: 77 SASAITDAADSLKFLGKTPLTAAVKQAAEALKYTEDKATVVLITDGLETCGGDPC---AL 133 Query: 354 CDKAKENFIKIV--TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVF 402 + K + + + + + + + ++ + + +L Sbjct: 134 GKELKASGVDFTADVVGFGLTADEGKQIACLAENTGGKYIQASDEKALQEAL 185 >gi|304390827|ref|ZP_07372779.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|304325710|gb|EFL92956.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 171 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 8/151 (5%), Positives = 38/151 (25%), Gaps = 13/151 (8%) Query: 8 RFYFKKGIA--SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + E ++ + + ++I + V + ++ ++ + A Sbjct: 14 RRFVARFERLGDEDGRIMLLGLAACVLLCVIILMSMAVSGVYLEQRRLQRLADQTASMAA 73 Query: 66 SKMVSNLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSG-------YSAVFY 114 + M + + + + L G ++ Sbjct: 74 ANMADTAYYQNGIVDGVPLEIEPYHASERAAEYLSGASISANSGLDGIDLVDVDVASTRV 133 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 ++ + + +++ + Sbjct: 134 QVTLRATGKIPLVLPLVSSLTQVELTATGAA 164 >gi|261820222|ref|YP_003258328.1| hypothetical protein Pecwa_0903 [Pectobacterium wasabiae WPP163] gi|261604235|gb|ACX86721.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 212 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 2/71 (2%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SRF + S +A+ ++ AL+ L + + ++ ME +A Sbjct: 14 ISRFLSRLRGFWFSRRASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQ--RVGAIA 71 Query: 64 GASKMVSNLSR 74 M L Sbjct: 72 SILAMQQALDE 82 >gi|170087774|ref|XP_001875110.1| predicted protein [Laccaria bicolor S238N-H82] gi|164650310|gb|EDR14551.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1136 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 6/187 (3%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + K+ +K L + S+ + + +G RV K + G Sbjct: 635 PPPSAVPSTAQLKIRVIKTTLDFLVASLAKKDRLS--LVTFEVGNAGRVRKTPFLNLGKA 692 Query: 275 KVRQYVTRDMDSLILKPT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + RQ + + +D L L+ + + + + +D + + P I Sbjct: 693 QSRQRLDKFIDELGLRLEENQDEFLVRGSKDEKTDVVTAVNHGLDVVLQRKARNPISGMI 752 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++ + + A + I + S + L + + V Sbjct: 753 LVSDASDSTRRAQMDLVLARAEAA---NVPIHSFGYGRSHDPASLWLMSNHTSGTYTFVK 809 Query: 394 NADSLIH 400 + L Sbjct: 810 DWYDLRD 816 >gi|326407931|gb|ADZ65001.1| traG-related protein [Lactococcus lactis subsp. lactis CV56] Length = 627 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/366 (8%), Positives = 68/366 (18%), Gaps = 48/366 (13%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 +II A+ V+ ++ + + KN + + S+ + D + Sbjct: 213 IAIIGAIVVVIMIISLFIGV--------KN------DQ----CSQPSYSDSNTTEDLKKV 254 Query: 82 ---ISNHAKRALIDDAKRF----IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + K + + + F T +M + Sbjct: 255 PKSVEDFVKSHKEAYILSWKAGGFLPSASIAQTMVENGFNFTNPSGTSLWKAHNMGGVKT 314 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 ++ + T+ + G Sbjct: 315 SKKEDFPVTLATYGNDAV------DITGTKAGTSVGDNTGGAYTYFKDYNAGIVGKAEFM 368 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L + + A + +L A + + Y Sbjct: 369 AHQSLYKGAINNPDGKVTLMAIANGGWATDPNYLVSLIGAYNSLGQKFKWIDKEAIEKYG 428 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK---PTDSTPAMKQAYQILTSDKKR 311 + K T +D + + Sbjct: 429 EAPYKANSPIPKLPTDKIPNKNESKNTAGVDVDEDGCVVNPN-PSGEVTGKNGAPTQNLP 487 Query: 312 SFFTNFFRQGVKIPSLPFQKFIIF--LTDGENNNFKSNVNTI-----------KICDKAK 358 + + F T G N I + AK Sbjct: 488 PEYIGKLTLPMPDDKNYAGNSYPFGQCTWGVYNRLAQIGMPIEWFSGDSGNGGNWGESAK 547 Query: 359 ENFIKI 364 K+ Sbjct: 548 AKGYKV 553 >gi|304412583|ref|ZP_07394188.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|307303599|ref|ZP_07583352.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] gi|304349059|gb|EFM13472.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|306912497|gb|EFN42920.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] Length = 1168 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/290 (10%), Positives = 70/290 (24%), Gaps = 66/290 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V +++ ++ A K +++ K A+ + S + Sbjct: 208 AAANNTDFGVGQPVTLYTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDF 267 Query: 248 V--KEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQI 304 + G + T+ R +T + L T M +AY+ Sbjct: 268 GLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRY 327 Query: 305 LTSDK---------------KRSFFTNFFRQGVKIPSLPFQ----KFIIFLTDGENN--- 342 + + + +G S ++I++TDG Sbjct: 328 FAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDK 387 Query: 343 -----------------NFKSNVNTIKICDKAKE----------------------NFIK 363 N+ S + ++ Sbjct: 388 NANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQMQNVR 447 Query: 364 IVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI A L +T ++ N+ L + + +++ Sbjct: 448 TYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 31/95 (32%), Gaps = 5/95 (5%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 + I+ + + L + I + + + +K ++ + A LA M S+ S ++ Sbjct: 1 MAAIWVMVAIVVLGV----IDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKN 56 Query: 82 ISNHAKRAL-IDDAKRFIKNHIKESLSGYSAVFYN 115 + A I + + + Sbjct: 57 NVTSSANINDKGYAFTLISATATANPTAGNDQIAV 91 >gi|326385751|ref|ZP_08207380.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] gi|326209730|gb|EGD60518.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] Length = 201 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 11/146 (7%), Positives = 38/146 (26%), Gaps = 3/146 (2%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + + EKA+ ++ FA++ + ++ ++ + ++E+ A Sbjct: 9 RPFAFLARLMRDEKASTAMEFAIAAPFLIGILMATFEIIILFLAQAALETTAEGAARYV- 67 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + +S ++ + +S + + Sbjct: 68 -LTGQAQTNFTGVKDSNGKVITT-PQQQFAAYVCTQMSSFMSCNNLYVDVNSGSDYTTVD 125 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVM 151 + + F Sbjct: 126 LSVPQFTFDATNNYKVTNTFNYNPGT 151 >gi|198423392|ref|XP_002124188.1| PREDICTED: similar to fibropellin Ia [Ciona intestinalis] Length = 1781 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 55/193 (28%), Gaps = 38/193 (19%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR--------- 282 + L + E+V +G++ Y+ S K + R Sbjct: 96 RRVLTWIKAVASGFNISNENVNIGVVQYSHWYRTLPMSSQIYLKTEIALNRCRTQACFQY 155 Query: 283 --DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D ++ T + A+ + K +I LTDG Sbjct: 156 LVDKIQIMGYTTYTGAAI----------------NKTILEDFSTTPARSNKILILLTDGI 199 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH----YNVVNAD 396 + + + A+ I I I + + L+ + + Y + + + Sbjct: 200 SKDDVNY-----ASAFARRQNITIFCIGVGS--YSLSQLQIIANGELNNNWRIYQLNSFN 252 Query: 397 SLIHVFQNISQLM 409 L QN+ + + Sbjct: 253 QLPTTVQNLQREI 265 >gi|217974384|ref|YP_002359135.1| type IV pilin biogenesis protein [Shewanella baltica OS223] gi|217499519|gb|ACK47712.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS223] Length = 1168 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 30/290 (10%), Positives = 70/290 (24%), Gaps = 66/290 (22%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + V +++ ++ A K +++ K A+ + S + Sbjct: 208 AAANNTDFGVGQPVTLYTDNYLRWYGLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDF 267 Query: 248 V--KEDVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQI 304 + G + T+ R +T + L T M +AY+ Sbjct: 268 GLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRY 327 Query: 305 LTSDK---------------KRSFFTNFFRQGVKIPSLPFQ----KFIIFLTDGENN--- 342 + + + +G S ++I++TDG Sbjct: 328 FAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDK 387 Query: 343 -----------------NFKSNVNTIKICDKAKE----------------------NFIK 363 N+ S + ++ Sbjct: 388 NANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQMQNVR 447 Query: 364 IVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI A L +T ++ N+ L + + +++ Sbjct: 448 TYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|115525744|ref|YP_782655.1| TadE family protein [Rhodopseudomonas palustris BisA53] gi|115519691|gb|ABJ07675.1| TadE family protein [Rhodopseudomonas palustris BisA53] Length = 170 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 38/137 (27%), Gaps = 2/137 (1%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ +++ + +I FA F L+ +I + +E+ A A V Sbjct: 1 MRRFCRNKQGSAAIEFAFVAPMFFALLFAIIETSAVFFAGQYLETGTQDAARALLIDDVQ 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + + A I ++ G N N + Sbjct: 61 SKGTNQ--ADFKQSICDKVSALFACDNIYVDVQSFPKGDPITIVNPIDDNGNFINNFKYE 118 Query: 131 HMANNRLDSSNNTIFYN 147 ++ FY Sbjct: 119 PPPPKSENTVVVRAFYQ 135 >gi|319791189|ref|YP_004152829.1| hypothetical protein Varpa_0496 [Variovorax paradoxus EPS] gi|315593652|gb|ADU34718.1| hypothetical protein Varpa_0496 [Variovorax paradoxus EPS] Length = 637 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 8/106 (7%), Positives = 28/106 (26%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + + L + + +K +++ +AA + L+ Sbjct: 19 RRRQSGQAIVYLVAMLPIIFLSVLVVYNTATAAREKMKLQNTADAATYSANVLTARTLNY 78 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L +++ + + + + S + E Sbjct: 79 LAYTNRAMAANEASVATLASVQTSMAMLITSAANIQEGLVVAEAMK 124 >gi|31615649|pdb|1N9Z|A Chain A, Integrin Alpha M I Domain Mutant Length = 192 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 59/182 (32%), Gaps = 28/182 (15%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 +M + ++ L L + + I +T + +N R V Sbjct: 23 FRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQNNP------NPRSLVKPIT 76 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 L T + +++ + L + + F K ++ +TDGE Sbjct: 77 QLL--GRTHTATGIRKVVRELFNITNGARKNAF-------------KILVVITDGEKFG- 120 Query: 345 KSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTCVSSP--EYHYNVVNADSLI 399 + + +A + I + S ++ L T S P ++ + V N ++L Sbjct: 121 -DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALK 179 Query: 400 HV 401 + Sbjct: 180 TI 181 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 51/156 (32%), Gaps = 21/156 (13%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--LILKPTDS-TP 296 I + + +G++ Y + + ++ K + + T Sbjct: 65 AQIGINPSNPKTTRIGMVTYNSNATVDAHLNYDWPKNNDVLNFYSTMSEISEDSTSYVAH 124 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 ++ A +L S+ S ++K II K++ + ++ Sbjct: 125 GLQAAQNLLYSESFGS------------NRSHYKKVIIVCASTFKGTGKNDPIP--VANR 170 Query: 357 AKENFIKIVTISINASPNG--QRLLKTCVSSPEYHY 390 K +KI+TI+ + + L K SSP + Sbjct: 171 LKGRGVKILTIAYDQGDEKVVEELAKI--SSPGLSF 204 >gi|149927035|ref|ZP_01915293.1| hypothetical protein LMED105_09277 [Limnobacter sp. MED105] gi|149824256|gb|EDM83476.1| hypothetical protein LMED105_09277 [Limnobacter sp. MED105] Length = 164 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 34/118 (28%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R F +++ + AL + L +I ++ ++++ SA + A Sbjct: 1 MTTATRAHSAFVNRAENQRGAQLVELALMLPVVLTVIFAIVGYSLLFMVQHTLSSAVSQA 60 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 + A + R +++ + G + Sbjct: 61 ARSVAVAGSTADPEAAARQLLLNSLPSAMYPAGFSFSTDQLAGAADCGNALGAGTNPA 118 >gi|47219917|emb|CAF97187.1| unnamed protein product [Tetraodon nigroviridis] Length = 1251 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 58/184 (31%), Gaps = 36/184 (19%) Query: 234 ALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILK 290 ++ FL+ + + +G++ Y V N+ S T + +V + Sbjct: 160 SITDFLERFIKTIDIGPKLSQVGIVSYGETVTHNVNLSQFDNTAALMDFVRQLPQ-QTGF 218 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T + + + +K ++ +TDGE+++F + Sbjct: 219 KTMTFMGINTTR-------------LEAFLPERGARQGVKKVMVIVTDGESHDFHALDEV 265 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 I CD+ + I+ I++ N Q + D + + I Q + Sbjct: 266 IAACDR---DGIERFGIAVLGDYNRQN---------------KSKDEIQKFIKEI-QSIS 306 Query: 411 HRKY 414 Sbjct: 307 SEPL 310 >gi|240169445|ref|ZP_04748104.1| hypothetical protein MkanA1_09039 [Mycobacterium kansasii ATCC 12478] Length = 889 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 62/228 (27%), Gaps = 34/228 (14%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D +S+ D K ++ A K + D + + G + + Sbjct: 42 DESSSLQDTDGKAARVQAAKYLVQTLGRYADRIQAKLDVAIAGFAESYVSEQDWTPLTGA 101 Query: 273 TEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD--------------KKRSFFTNF 317 T + V ++ TD A+ A Q L S +F Sbjct: 102 TAQHVGDALSTLASKNTGIDTDYWLALDGARQALASRGSGVGGADRCQAIAWFSDSKIDF 161 Query: 318 FRQGV--KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + + + + +S + D+ + I ++ + + + Sbjct: 162 TARPLTKPYAEGVPLNSANGVAETIRLATESICRPGGLADQLRSRGIVMLGVGLGDAARA 221 Query: 376 Q--RLLKTCVSS---------------PEYHYNVVNADSLIHVFQNIS 406 ++ + P Y V N D ++ F +++ Sbjct: 222 SQFDVMSAISTGRGLNGMPCGNITEPAPGDFYRVSNIDDMLFAFDSLN 269 >gi|330904402|gb|EGH34974.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. japonica str. M301072PT] Length = 64 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 12/57 (21%), Positives = 19/57 (33%), Gaps = 17/57 (29%) Query: 362 IKIVTISINASPNGQRL----------------LKTCVS-SPEYHYNVVNADSLIHV 401 +KI I I + P+ L LK S S ++ + D L + Sbjct: 1 VKIYPIGIGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 57 >gi|92113789|ref|YP_573717.1| TadE-like protein [Chromohalobacter salexigens DSM 3043] gi|91796879|gb|ABE59018.1| TadE-like protein [Chromohalobacter salexigens DSM 3043] Length = 153 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 28/78 (35%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + +I FA+ F L++ LI ++ +++ + Sbjct: 1 MRESTHLKRARRKRQSGVAAIEFAIVFPVFFLVLYALIGYAFVFLIQSGLQNLAGETVRQ 60 Query: 64 GASKMVSNLSRLGDRFES 81 A+ + + D+ E+ Sbjct: 61 VATISTTPVEDDNDQREA 78 >gi|302185170|ref|ZP_07261843.1| hypothetical protein Psyrps6_02464 [Pseudomonas syringae pv. syringae 642] Length = 677 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 10/74 (13%), Positives = 29/74 (39%), Gaps = 1/74 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGKCSGTA 71 Query: 75 LGDRFESISNHAKR 88 + + + Sbjct: 72 ATAPDFANQSATRN 85 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 41.4 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 64/239 (26%), Gaps = 51/239 (21%) Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF-LIELVVDLSGSMHCAMNSDPED 213 F + + + F P ++ GE I L++D+SGSM Sbjct: 238 YLLPIFNGFI-WNEEGKFPIMFAPKNPKVLDGENNYAHNISLLIDISGSME--------- 287 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + KN +L LD + + + + ++ + S Sbjct: 288 ------------KDFSVYKNNILKILDKLAEIP----NWQINIVVFNDES-TARSFSNQE 330 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + T +K+A + + + Sbjct: 331 NNIEDIKVYINNLKANGYTKLYGTIKEALESF------------------KGKIDESSTL 372 Query: 334 IFLTDGENNNFKSNVNTIKICDK----AKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 I TDG++ SNV + D K + T+ Q + + + Sbjct: 373 IVFTDGKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQ-YYNQEFFEQVATRGGF 430 >gi|254786434|ref|YP_003073863.1| hypothetical protein TERTU_2434 [Teredinibacter turnerae T7901] gi|237685047|gb|ACR12311.1| TPR domain protein [Teredinibacter turnerae T7901] Length = 655 Score = 41.4 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 56/198 (28%), Gaps = 29/198 (14%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 D++ + QD K +++ + L+ L LI Y+ + Sbjct: 101 WDLSPSMAAQDVKPSRLVRARLKLIDLLKQRKEG-------LTALIAYSGEAHVVTPLTD 153 Query: 272 GTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T+ + + + K +++ A+ A Q+L F G+ Q Sbjct: 154 DTDTIISLLYGLDPAIMPAKGSNAEMALTLANQLLKEGANGRGDIVFLTDGIA---PTAQ 210 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKA----------KENFIKIVTISINASPNGQRLLK 380 + D + ++ + I ++ + Q + Sbjct: 211 DAFAEIHDTAPHPISFWGIGTAQGAPIPLAGGGFAYDRDRNMVIASL------DEQAMSD 264 Query: 381 TCVSSPEYH--YNVVNAD 396 + ++ +AD Sbjct: 265 LAAKVGGLYVPFSATDAD 282 >gi|290976796|ref|XP_002671125.1| predicted protein [Naegleria gruberi] gi|284084691|gb|EFC38381.1| predicted protein [Naegleria gruberi] Length = 1058 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 50/183 (27%), Gaps = 26/183 (14%) Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + ++ K AL L +ID + + I Y T E + + Sbjct: 72 SGSMSGSAISEAKLALESLLTNID-----GHNERVLFIPYDTSAELI---DMSRMSLTEK 123 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + TD A ++ T FF G + + I + Sbjct: 124 LNQVQRVHAGGGTDFACVFD-AIRLFTGALDGQIAIVFFTDGQDGYNGNRETAIDMMKKR 182 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADS 397 +S + TI ++ + + L + S+ +A S Sbjct: 183 LTTESESF---------------EFHTIGFSSGHDARLLTDMTRLGSAQGTFQYAESASS 227 Query: 398 LIH 400 + Sbjct: 228 IST 230 >gi|91977979|ref|YP_570638.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684435|gb|ABE40737.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 208 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 10/132 (7%), Positives = 35/132 (26%), Gaps = 8/132 (6%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + FA V LL+ + + W A + + + + + Sbjct: 17 RLARDRSGLAATEFAFIVPLMLLMFFATVEISTWV--------AVDRKVTLVSRTLSDLV 68 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S+ + + A + + + ++S ++++ + Sbjct: 69 SQATSVTDKDLPNVFLASYGVLAPYPTDTAEATISEIYVNNAGVAKVQWSKLAKVAKSGN 128 Query: 133 ANNRLDSSNNTI 144 ++ Sbjct: 129 NKPVATIEVSSR 140 >gi|86355858|ref|YP_467750.1| hypothetical protein RHE_CH00199 [Rhizobium etli CFN 42] gi|86279960|gb|ABC89023.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 193 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 39/157 (24%), Gaps = 10/157 (6%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + +R ++ K +I FA+ ++L + A Sbjct: 5 NPFTRLVLTARRLARERKGAGAIEFAILFPVLVMLYIGAFEITIGLSVSKR-------AT 57 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS--AVFYNTEIQ 119 A S + +++ +S + + ++G S A + Sbjct: 58 RAAGS-IADLVTQQQSVTKSALAQMPSVATAMFVPYNSTSLTLKITGISIDAGANAKVLW 116 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + A ++ + V T Sbjct: 117 SWAQDGTTPYAKNATVSDVPADMKTANSFLVRTELSI 153 >gi|253582503|ref|ZP_04859725.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] gi|251835648|gb|EES64187.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] Length = 376 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 25/79 (31%), Gaps = 2/79 (2%) Query: 323 KIPSLPFQKFIIFLTDGEN-NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 P + I + D +++ + +T KAK I I TI P K Sbjct: 140 STPRENLSQIIFLVGDAPPHDDYNDSPDTSDTAKKAKSRGIIINTIQCGDMPKTDYYWKA 199 Query: 382 CVS-SPEYHYNVVNADSLI 399 ++++ + Sbjct: 200 IAQFGGGEYFHISGDGGVK 218 >gi|157375479|ref|YP_001474079.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] gi|157317853|gb|ABV36951.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] Length = 2812 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 33/311 (10%), Positives = 75/311 (24%), Gaps = 18/311 (5%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + +++ + LG+ + D + N ++ S + T Sbjct: 2004 AAVNGVITVTTALGELAVYSEDSTDHRAGDYEYKLTANSTDGDIASESFDYTLTNSLGSN 2063 Query: 123 NSSRISMTHMANNRL--DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPA 179 +S+ +++ + + D S N V T+ L + + Q + Sbjct: 2064 SSASLTVKISDDAPVVHDISQNLQANADVVTTNLTLVLDVSGSMGDPVGNGQTYLEVAID 2123 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV-----NSAPICQDKKRTKMAALKNA 234 L + E + ++ + + D + T A NA Sbjct: 2124 ALTALINEVDSTGNVNIQIVNFHSNTGSSGWLIDDVAGAISYLESLVTYGPTHYDAALNA 2183 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 ++ L + + I + + Q + T Sbjct: 2184 VMNS----GSLPDGADQSLLYFISDGSPSPGQ-----EVDPALQSIWESYLVNSGYKTAF 2234 Query: 295 TPAMKQA-YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 + A L + I+ DG+ + Sbjct: 2235 GIGIGSAGLSDLLPVAYPEVDGDEVYAVKLDDPEDLASTILNYFDGDTIKGSYTTGVVFG 2294 Query: 354 CDKAKENFIKI 364 D I + Sbjct: 2295 ADGGHIESIAV 2305 >gi|120599894|ref|YP_964468.1| type IV pilin biogenesis protein [Shewanella sp. W3-18-1] gi|120559987|gb|ABM25914.1| type IV pilin biogenesis protein, putative [Shewanella sp. W3-18-1] Length = 1165 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 33/124 (26%), Gaps = 12/124 (9%) Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 L ++ K + ++ L NN N + K Sbjct: 404 DTLINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNNDIVIGNKDSLGVDNK 463 Query: 359 ENFIKIVTISINASPN-GQRLLKTCVSSPEY----------HYNVVNADSLIHVFQNISQ 407 I++ TI + + LL+ +Y N L+ ++ + Sbjct: 464 -QNIRVFTIGFSDGADEAAALLEETAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALK 522 Query: 408 LMVH 411 ++ Sbjct: 523 SILT 526 >gi|193213114|ref|YP_001999067.1| TadE family protein [Chlorobaculum parvum NCIB 8327] gi|193086591|gb|ACF11867.1| TadE family protein [Chlorobaculum parvum NCIB 8327] Length = 160 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 41/136 (30%), Gaps = 7/136 (5%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 ++FR G+ S+K N I FA + FL+L+ ++ Y K + A AG Sbjct: 9 AKFRTPVNAGVQSQKGNALIEFAFVLPVFLVLLFGMVTFSLGIYDKTVLTMATREGARAG 68 Query: 65 ASKMVSNLSRLGDRF-----ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A N GD N L + F + Sbjct: 69 AVYDAGNYDSDGDLDVTSVQTKARNATLAVLANKLISFGSKTPTVDNPSITGDILTVT-- 126 Query: 120 NIVNSSRISMTHMANN 135 +N + I + Sbjct: 127 AHLNYTGIFFLTGGQD 142 >gi|187927770|ref|YP_001898257.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Ralstonia pickettii 12J] gi|187724660|gb|ACD25825.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Ralstonia pickettii 12J] Length = 1122 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 23/225 (10%), Positives = 55/225 (24%), Gaps = 16/225 (7%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA----LLLFLDSIDLLSHVKEDVYMG 255 G+ + + A++ +D + ++ Sbjct: 227 PGNPPYSRMIWLSRAWGYYANISGSGKMLEAVQADSPSHYQNLIDYLAPETNSNSAASGH 286 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 S + R Y S T T + ++ + +L +D + T Sbjct: 287 SKLEIKNGAVATPLSGSLQTARSY---FAGSQTGYSTPVTQSCQKNFVLLATDGNPTGKT 343 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----FIKIVTISINA 371 + + + T N + T K I+ + + + Sbjct: 344 DGSMWALSDQQSTYNSSTGQWTFSNAANDVFSQITALRSTSVKGYTSPFDIQTYVVGMGS 403 Query: 372 ----SPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVH 411 + L + Y + +L FQ+I+ + Sbjct: 404 TVSNPASVAALNQMAKLGGTGNAYLSTDPATLQQQFQSIAADITT 448 >gi|332809815|ref|XP_001161858.2| PREDICTED: integrin alpha-10 isoform 1 [Pan troglodytes] Length = 1036 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 73/243 (30%), Gaps = 41/243 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-----------KNALLLFLDSI 242 + H +N D N + D T M + + + FL + Sbjct: 1 MNITHAYSQGHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRL 60 Query: 243 -DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMK 299 L E + +GL+ Y K V + + T + A+ Sbjct: 61 VGKLFIDPEQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIM 119 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A S + ++ +TDGE+++ + +K C+ + Sbjct: 120 VACTEGFSQSHG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR- 165 Query: 360 NFIKIVTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + I++ + R ++T S P+ + +NV + +L + + + Sbjct: 166 --VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRI 223 Query: 410 VHR 412 Sbjct: 224 FGL 226 >gi|194385334|dbj|BAG65044.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 73/243 (30%), Gaps = 41/243 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-----------KNALLLFLDSI 242 + H +N D N + D T M + + + FL + Sbjct: 1 MNITHAYSQGHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRL 60 Query: 243 -DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMK 299 L E + +GL+ Y K V + + T + A+ Sbjct: 61 VGKLFIDPEQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIM 119 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A S + ++ +TDGE+++ + +K C+ + Sbjct: 120 VACTEGFSQSHG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR- 165 Query: 360 NFIKIVTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + I++ + R ++T S P+ + +NV + +L + + + Sbjct: 166 --VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRI 223 Query: 410 VHR 412 Sbjct: 224 FGL 226 >gi|198420538|ref|XP_002128804.1| PREDICTED: similar to hedgling [Ciona intestinalis] Length = 420 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 16/104 (15%), Positives = 33/104 (31%), Gaps = 14/104 (13%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T A+ AY + G + ++ II +TDG N V+T Sbjct: 285 TCLVDALIHAY----------VEMSATINGGRQSQNNVEQDIILITDGCANCHHYGVSTE 334 Query: 352 KICDKAKE----NFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 ++ + + I I + ++ L+ ++ Sbjct: 335 QVLQNLADFFVQQGMHIYVIGVGLQHACRQKLRILAQGGRCYHF 378 >gi|156358483|ref|XP_001624548.1| predicted protein [Nematostella vectensis] gi|156211335|gb|EDO32448.1| predicted protein [Nematostella vectensis] Length = 186 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 52/157 (33%), Gaps = 22/157 (14%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 K + + ++ ++G I Y+T + + + E+ D + Sbjct: 19 TFKKCINFVKRVLRAFDVSEKGTHVGAIIYSTDTKLAFDFNTYKERGDIEAAFDRVKFLG 78 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 + T + +K A L K ++ +TDG +++ + Sbjct: 79 ETTFTGKGLKMALSELY----------------KTARKDVSNLLVVITDGRSHDDVVKPS 122 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + + ++IVT+ + S + + LK P Sbjct: 123 -----EMLRNAGVRIVTVGLGNSFDINQ-LKAMAGKP 153 >gi|50355937|ref|NP_571316.1| complement component bfb [Danio rerio] gi|49904460|gb|AAH76051.1| Complement component bfb [Danio rerio] Length = 456 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/199 (11%), Positives = 54/199 (27%), Gaps = 13/199 (6%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQ 278 + LK A + ++ +S+ + ++ + T V + ++ + Sbjct: 259 AVDVSDSINDLKKAKQIIKTLLEKISYYEVSPSYEILMFATDVYQIVKMRDFKTEEVAGS 318 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 D L S R K L + +I TD Sbjct: 319 LSKVFEDLDKFDFDKKLDQKSSNIAKLYQTILDSMSNEQIR--NKEDFLQTKHVVIVFTD 376 Query: 339 GENNNFKSNVNTIKICDKA-------KENFIKIVTISINASPNGQRLLKTCVSSPE--YH 389 G+ N + + + +EN + + + + L + Sbjct: 377 GQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFGVGKDVRTEDLNGLVSEKENERHF 436 Query: 390 YNVVNADSLIHVFQNISQL 408 + + + D + F ++ Sbjct: 437 FKLQDLDEVQKTFDSLMND 455 >gi|325108017|ref|YP_004269085.1| TadE family protein [Planctomyces brasiliensis DSM 5305] gi|324968285|gb|ADY59063.1| TadE family protein [Planctomyces brasiliensis DSM 5305] Length = 162 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 11/84 (13%), Positives = 30/84 (35%), Gaps = 2/84 (2%) Query: 4 LSRFRFYFKKGIA--SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + R F+ + + + + FA+ V F L + ++ + + ++ A A Sbjct: 1 MRRAFFHNRSNTSSEKRRGAALVEFAIVVPVFGLFLAAMVEFGHVYMVQTTLRGAAKKAA 60 Query: 62 LAGASKMVSNLSRLGDRFESISNH 85 G V++ + + + Sbjct: 61 RLGIGDGVTSADVSAEATRIVQSA 84 >gi|29826729|ref|NP_821363.1| hypothetical protein SAV_189 [Streptomyces avermitilis MA-4680] gi|29603825|dbj|BAC67898.1| putative membrane protein [Streptomyces avermitilis MA-4680] Length = 142 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 36/132 (27%), Gaps = 6/132 (4%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA--ILAGASKMVS 70 + ++ + SI A+ L+ +I V W+Y + + A AA L A S Sbjct: 15 RRWGDDRGDTSIQMAIIFPFVLIATVAVIQVSMWYYAR---QIALTAAREGLTAARAYES 71 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + + + ++ + S + I + Sbjct: 72 GPADGAAQAREVLGRVAGDSLRGYSVSASSNGQRVRVQVSGTAMSM-IPGVPGLQVTQSA 130 Query: 131 HMANNRLDSSNN 142 R Sbjct: 131 SGPVERWTVPGR 142 >gi|116192639|ref|XP_001222132.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51] gi|88181950|gb|EAQ89418.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51] Length = 952 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 28/369 (7%), Positives = 81/369 (21%), Gaps = 53/369 (14%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 K + + S ++ + D + + + S + F + + Sbjct: 199 KNIQSPSHPISVTLGNTSAGAASGADMSLQKASATLALSTTELGDDFVLRVVATNTGNPI 258 Query: 127 ISMTHMANNRLDSSNNTI----FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + + F + + + L Sbjct: 259 AVLEQHPTLDNHRALMATLVPKFNLPSTKPEIVFLCDRSGSMEGPNVRHLKTALHLFLKS 318 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + +G ++ + GS H + + Q ++ + + F Sbjct: 319 LPVG------VKFNICSFGSRHEFLFGNKS--------QTYDQSSLEKASRYVDGFSADF 364 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + Y + + W + + + + + K T + Sbjct: 365 GGTDIYHPMEDVINRRYADMDLEVFLLTDGEIWDQSALFELLNKKVAESNGKIRVFTLGI 424 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD----GENNNFKSNVNTIKIC 354 + + F + V K + L N+ + Sbjct: 425 GSGASHALVEGVAAAGNGFSQS-VGDNEKMNTKVVRMLKASLMPHINDYTLEVKYANDVA 483 Query: 355 DKAKENFIKI-------VTIS----------------INASPNGQRLLKTC---VSSPEY 388 D + + +I +TI + + + + S+ Sbjct: 484 DTSADEDFEIVEKVMDGMTIDVREVEKDEAPKKPTSLFDEAADPDAEMLDSSLDASAGGK 543 Query: 389 HYNVVNADS 397 + +V Sbjct: 544 YSHVPPVSE 552 >gi|313221050|emb|CBY31881.1| unnamed protein product [Oikopleura dioica] Length = 1282 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 49/177 (27%), Gaps = 18/177 (10%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + ++ + L+ + + Y + + K Sbjct: 500 QVMKSWMRRLISGLNIAPGRTQVSVYLYNNIFRTIFNLNEHQNAYDMITAINKMVYSGKG 559 Query: 292 TDSTPAMKQAY-QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A ++L + + ++ LTDG+ ++ N Sbjct: 560 TRIARALQSAMSKVLIPESGLRPNSEI--------------YLYLLTDGKESDVADVNNM 605 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQNIS 406 A + I + + I+ S L + + + N L + I+ Sbjct: 606 ANDIKDAFSDRITLTAVGISRSVENAELYAIASAPKKDNVFLLENYRDLDTI-TKIA 661 >gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS278] gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand factor type A (VWA) domain; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 755 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 31/244 (12%), Positives = 66/244 (27%), Gaps = 50/244 (20%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMA 229 + + P + + V+D SGSM T + Sbjct: 336 DYLLAFVTPPAVATATQRPQPRDVIFVIDNSGSM--------------------GGTSIR 375 Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSL 287 K +LL L + +I + + S E V Sbjct: 376 QAKASLLYALGRLQPNDRFN------VIRFDDTMTVLFPSSVPADAEHVGNATRFVSSLD 429 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ PAM+ A + ++FLTDG N + Sbjct: 430 ARGGTEMVPAMRAALT------------------DDGSDSDRMRQVVFLTDGAIGN---D 468 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + + +I + I ++PN + + ++ + + + +++ Sbjct: 469 QQLFETITAMRGRS-RIFMVGIGSAPNTYLMSRAAELGRGAFTHIGSVEQVEERMRDLFA 527 Query: 408 LMVH 411 + + Sbjct: 528 KLEN 531 >gi|72009654|ref|XP_784730.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 580 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 47/124 (37%), Gaps = 23/124 (18%) Query: 254 MGLIGYTTRVEKNIEPSWGTEK-VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + + + + T+ +RQ + + T ++ ++L S + Sbjct: 10 LGIVVFWSESQIRADLTDITDTALRQTLVNALPPSPTGSTCIGCGIESGLKVLGSYAQG- 68 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +I+ L+DG NN + D K + + I TI+I+ S Sbjct: 69 ------------------GYILLLSDGVENNK---PYIRYMYDDIKNSGVIIDTITISNS 107 Query: 373 PNGQ 376 + Q Sbjct: 108 ADQQ 111 >gi|149370556|ref|ZP_01890245.1| hypothetical protein SCB49_13870 [unidentified eubacterium SCB49] gi|149356107|gb|EDM44664.1| hypothetical protein SCB49_13870 [unidentified eubacterium SCB49] Length = 388 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 37/126 (29%), Gaps = 5/126 (3%) Query: 269 PSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 +G + + + T++ ++ A +L + + G Sbjct: 243 LVFGNDAWTIKIKDLPYLNVGPYHTNTVAGLQLAMDMLRRKRNTNKQIFMITDGKPSCLR 302 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT--CVSS 385 ++G + + + +A++ I I T I P Q+ + + Sbjct: 303 LPDGRYYKNSNGLDEYIVNKCYL--MARQARKLHIPITTFMIANDPYLQQFVDAFTAANQ 360 Query: 386 PEYHYN 391 + Y Sbjct: 361 GKAFYT 366 >gi|254293208|ref|YP_003059231.1| TadE family protein [Hirschia baltica ATCC 49814] gi|254041739|gb|ACT58534.1| TadE family protein [Hirschia baltica ATCC 49814] Length = 187 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 6/38 (15%), Positives = 16/38 (42%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVL 44 + K+ +++ ++ FAL + +L I + Sbjct: 2 LINFIKQFWKNDEGVAALEFALCLPLLTVLFFGTIEIS 39 >gi|73997822|ref|XP_534930.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-4 subunit isoform 1 [Canis familiaris] Length = 1121 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 44/345 (12%), Positives = 100/345 (28%), Gaps = 31/345 (8%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + V NL + + + + + + N E+ N S+ Sbjct: 122 EAVKNLVEAAEEADLNHEFNESLVFNYYNSVLINEKDENGDYVELGAEFVLESNAHFSNL 181 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + T +++ +L ++ ++ L + QR + + Sbjct: 182 MVNTSISSVQLPTNVYNKDPDILNGVYMSEALNPVFVENFQRDPTLTWQYFGSSTGFF-- 239 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI----------CQDKKRTKMAALKNALL 236 R I+ D +G + + + +A K +M K+ + Sbjct: 240 -RIYPGIKWTPDENGVIAFDCRNRGWYIQAATSPKDIVIVVDTSGSMKGLRMTIAKHTIS 298 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 LD++ V Y I Y K + + + ++ P Sbjct: 299 TILDTLGENDFVNIIAYSDYIHYVEPCFKGTLVQADRDNREHFKQLVDELMVKGVGIVNP 358 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +A+QIL ++ + I+ +TDG ++K Sbjct: 359 ALTEAFQILKQFQE------------ARQGSLCNQAIMLITDGAVEDYKPVFEKYNWP-- 404 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I++ T I +K + + +Y ++ L Sbjct: 405 --DRKIRVFTYLIGREVTFADRMKWIACNNKGYYTQIST--LADA 445 >gi|209545604|ref|YP_002277833.1| TadE family protein [Gluconacetobacter diazotrophicus PAl 5] gi|209533281|gb|ACI53218.1| TadE family protein [Gluconacetobacter diazotrophicus PAl 5] Length = 163 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 38/128 (29%), Gaps = 3/128 (2%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAILA--GASKMVS 70 + ++ F L + L+ ++ V + +++ +AA A G S + Sbjct: 6 FRRDRRGVTAVEFGLLTPVIIGLLLMVLEVGWQMATEIALQHGVHDAARFAMTGQSTVPG 65 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 R ++I A + + + G + + + + + Sbjct: 66 LDGSPTCRAQAIVWLATAEAPGILSPSNLSVMASANGGTAVGSSQSGFGGNATQTIVYVF 125 Query: 131 HMANNRLD 138 + L Sbjct: 126 TYSQPFLT 133 >gi|227818615|ref|YP_002822586.1| CpaE2 pilus assembly protein [Sinorhizobium fredii NGR234] gi|36958871|gb|AAQ87296.1| CpaE [Sinorhizobium fredii NGR234] gi|227337614|gb|ACP21833.1| CpaE2 pilus assembly protein [Sinorhizobium fredii NGR234] Length = 580 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 36/127 (28%), Gaps = 3/127 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSF-LLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + SE ++ FAL L L+ V +++ +++ A A Sbjct: 426 LRNKAQDLKKSEAGASAVEFALFAPVLALGLVAMA-DVALALHERMTIDHVLRAGAQAAM 484 Query: 66 SKMVSNL-SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + ++ + S + KR+ + +A T Sbjct: 485 ADPGATQVDKVLQSTLAQSAKPANVTLAPVKRYCACPENADVKPEAAPQCGTTPCADAVQ 544 Query: 125 SRISMTH 131 + Sbjct: 545 QLVFYRL 551 >gi|118367268|ref|XP_001016849.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89298616|gb|EAR96604.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1049 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 31/247 (12%), Positives = 81/247 (32%), Gaps = 12/247 (4%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + + +PA + P F ++ ++ S+ A+N+ + + Sbjct: 236 IFGSSKIFNEDSVKSAILPASHSAMVSFIPNFNEDITQEIDDSIRAAINNGDDIFSDE-F 294 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 Q + + L ++ F+ + S + + P+ V + Sbjct: 295 QQKLNQELIDHLNSSRSEFI-FLLDRSGSMSGQPIRRACEALTLFLKSLPNDSYFNVISF 353 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKK----RSFFTNFFRQGVKIPSLPFQKFIIF 335 + ++ ++++A +++ + + V+ + K I Sbjct: 354 GSSFDKLFPSSTKYTSESLEKAILLISKYQADLGGTEIYNPLNNVFVQNKIQGYNKQIFL 413 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFI-KIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 LTDGE ++ + V IK K N ++ +I + + + ++ ++ V Sbjct: 414 LTDGEVDSPQQVVRLIK-----KNNKYNRVHSIGFGSGADQYLIKESAIAGKGISKLVDQ 468 Query: 395 ADSLIHV 401 L V Sbjct: 469 KCDLSEV 475 >gi|319940447|ref|ZP_08014792.1| hypothetical protein HMPREF9464_00011 [Sutterella wadsworthensis 3_1_45B] gi|319806073|gb|EFW02822.1| hypothetical protein HMPREF9464_00011 [Sutterella wadsworthensis 3_1_45B] Length = 184 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 8/75 (10%), Positives = 25/75 (33%), Gaps = 8/75 (10%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYK----KNSMESAN- 57 SR + + + ++ A + L ++ + ++++ SA Sbjct: 6 FRSRIFTAGRAFLRDGRGVSAVETAFAFPVILAAFFIIVEFANMALTIQVGESAVSSALL 65 Query: 58 ---NAAILAGASKMV 69 +A L +++ Sbjct: 66 RFRDAGELGASAEND 80 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 45/358 (12%), Positives = 99/358 (27%), Gaps = 53/358 (14%) Query: 55 SANNAAILAGASKMVSNLS---------------RLGDRFESISNHAKRALIDDAKRFIK 99 S +++ A + S + + A A ++ + Sbjct: 315 STLDSSTAAVSGSGASGSVANTLTMTGGEELSGLQTDPVKQEQLKQAILASLNLPPGYTT 374 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 ++I+ ++ + +Q + T A + Y + Sbjct: 375 DNIQINVVSVTQEGGRRRLQAPQTKVVVEYTLTATPEVQVEVLAKAEASV----YLFIRV 430 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + + + + PI ++ D S ++ ++ N Sbjct: 431 QQPAIPWWSGPTATICISDSEGACNVNTPPITPPVVIQDCSANVCFLLDGSKSLTNV--- 487 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 D +A+ ++ + D + + IG+ T E S+ + V Sbjct: 488 --DGWNDVVASARSIMYSLNDPAAVFDVFWFSNDVEKIGHATGAEVAANNSFVSMVVNTT 545 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T A+ +L + + I+ +TDG Sbjct: 546 PDAH-------GTWMAQAITTCQGVLLEE-----------------DTQASRTIVLITDG 581 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 + + + T + D AK IK+V + LK S P++ + N DS Sbjct: 582 KPTDPQP---TFEAADAAKARGIKMVVVGAGEIDYA--TLKALASGPQFVFANTNLDS 634 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 14/114 (12%), Positives = 36/114 (31%), Gaps = 6/114 (5%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 + K+ + + S + K + + + ++ Sbjct: 101 SDSSKIVRQIDYIGSS-VGKNKCTFLGELSLIRYEAGWTDTKGALEEADRVLQHARSGAN 159 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 + ++ LTDG++ + I + + I+IV I + + N L + Sbjct: 160 RLVVLLTDGQSTEGD----PVGIATRIRNKGIRIVAIGVG-NVNMDELTSIATA 208 >gi|254293209|ref|YP_003059232.1| TadE family protein [Hirschia baltica ATCC 49814] gi|254041740|gb|ACT58535.1| TadE family protein [Hirschia baltica ATCC 49814] Length = 186 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 44/168 (26%), Gaps = 7/168 (4%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R +++ ++ FAL F LL+ L+ + + +++ A Sbjct: 15 RNRLNDFAKNKEGIVAVEFALIAAPFFLLVFGLLEIALIFFMTTTLDYGVTQAS------ 68 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + ++ + + + + + + + Sbjct: 69 -RQIRTGELQSKAAVKSDFATLVCGNLFDLLDCSNRLHIDVQRYEDFVASNAGDTLPLNE 127 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 T N + + + + V + + L + K + Sbjct: 128 DGTLQNNFQYNPGGPSEIVLVQVYYEWSLITPIMSSALKNMNDGKRLL 175 >gi|297460574|ref|XP_597723.5| PREDICTED: collagen, type XX, alpha 1 [Bos taurus] Length = 1364 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 53/188 (28%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L ++ V +GL Y+ + + + K Sbjct: 188 SHFQQVKDFLASVIE---PFEIGPGKVQVGLTQYSGAPQTEWDLNAFGTKEEVLNAVHNL 244 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + K +I +TDG++ + Sbjct: 245 HYRGGNTFTGLALTH-------------VLEQNLKPRAGLRPEAAKLVILVTDGKSQDD- 290 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K + I + + + L+ S P +NV + L + Sbjct: 291 ----AHTAGHVLKGLGVDIFAVGV--KNADETELRLLASQPLDITVHNVQDFPQLSTLSG 344 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 345 LLSRLICQ 352 >gi|297481786|ref|XP_002692297.1| PREDICTED: Collagen, type XX, alpha 1-like [Bos taurus] gi|296481265|gb|DAA23380.1| Collagen, type XX, alpha 1-like [Bos taurus] Length = 1342 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 53/188 (28%), Gaps = 25/188 (13%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +K+ L ++ V +GL Y+ + + + K Sbjct: 188 SHFQQVKDFLASVIE---PFEIGPGKVQVGLTQYSGAPQTEWDLNAFGTKEEVLNAVHNL 244 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 T + A+ + K +I +TDG++ + Sbjct: 245 HYRGGNTFTGLALTH-------------VLEQNLKPRAGLRPEAAKLVILVTDGKSQDD- 290 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQ 403 K + I + + + L+ S P +NV + L + Sbjct: 291 ----AHTAGHVLKGLGVDIFAVGV--KNADETELRLLASQPLDITVHNVQDFPQLSTLSG 344 Query: 404 NISQLMVH 411 +S+L+ Sbjct: 345 LLSRLICQ 352 >gi|260827134|ref|XP_002608520.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] gi|229293871|gb|EEN64530.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] Length = 1236 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 16/132 (12%), Positives = 38/132 (28%), Gaps = 23/132 (17%) Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 K++ MG++ + + + + + + + + T+ + + A + Sbjct: 667 DFKDEDQMGIVTFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQTNLSDGLLSAISMF 726 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN------VNTIKICDKAKE 359 F II TDG+ N + K+ + Sbjct: 727 KGSSGSDFHNG----------------IILFTDGQANQGIIDAAELVQEYNSKMAGLGEG 770 Query: 360 NFIKIVTISINA 371 + I T +I Sbjct: 771 VCLPISTFTIGD 782 >gi|24374985|ref|NP_719028.1| hypothetical protein SO_3479 [Shewanella oneidensis MR-1] gi|24349714|gb|AAN56472.1|AE015784_5 hypothetical protein SO_3479 [Shewanella oneidensis MR-1] Length = 596 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 34/355 (9%), Positives = 90/355 (25%), Gaps = 29/355 (8%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + LG + + A ++ + + S ++ + Sbjct: 206 ATGEANGLPVDSLGSAAKPVRYTAVNIHSSESAKASAKAKAQKTGFGSGKVVTVYTEDYL 265 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 N + + RL+ + + I + D+ L + + I + ++ Sbjct: 266 NWHHGTKKKVNKTRLELAQDAIESVILTTPGVDFGLAVFN-MNGPTDFARDGGRIISGIK 324 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + I L++ V +S + + + + Q++ + N + Sbjct: 325 TMSAQAKIDLLKRVNLISYAQNTPLCETLYEAYRYFGGQEQYFGDDDSDYNYYDRWGYYQ 384 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 K Y+ D + A Sbjct: 385 GTYEASKNPFTDPKALTPGTNNYQSPFKPCQN--EAYIVYITDGEPT--------LDSAA 434 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + GV + + ++ L+ + A + + Sbjct: 435 DGIIER---------LTNGVGKHTSNPKSYLSALSGWMRTEDVNPNL-------AGKQTV 478 Query: 363 KIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + TI + L +T + +++ +A L Q ++ + S Sbjct: 479 RTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASQLRSSLQTALNNILEKNAS 533 >gi|307327617|ref|ZP_07606802.1| conserved hypothetical protein [Streptomyces violaceusniger Tu 4113] gi|306886729|gb|EFN17730.1| conserved hypothetical protein [Streptomyces violaceusniger Tu 4113] Length = 128 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 14/127 (11%), Positives = 35/127 (27%), Gaps = 8/127 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRF 79 S+ FAL+ ++ +++G L+ N A LA A + Sbjct: 1 MSLFFALATVAIFMVMGLLVDGSGALNAGNR------ATSLAQEAARSAGQQIDPAQAIE 54 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + A A+ ++ + + + + + + + Sbjct: 55 GTAITVDPDAAAAAAEDYLAAADVQGDVEITDDGQTLTVTVHDTYQTSFASLVGKSTISV 114 Query: 140 SNNTIFY 146 S + Sbjct: 115 SGTATAH 121 >gi|294011438|ref|YP_003544898.1| tight adherence protein TadE [Sphingobium japonicum UT26S] gi|292674768|dbj|BAI96286.1| tight adherence protein TadE [Sphingobium japonicum UT26S] Length = 201 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 13/114 (11%), Positives = 31/114 (27%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K E ++ F ++ +FL L+ V Y + ++ A + Sbjct: 3 IKHLHRDESGVAAVEFGITASAFLALLLGGFDVGHTLYMQTVLQGTVQKAGRDLTLSSGA 62 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 ++ + + +R + I + A E + Sbjct: 63 EAAQQLAIDGRVRDAVRRLNASLSDEDIIITREYYADFTKAQAAQPEDADQDGI 116 >gi|262527579|sp|P54289|CA2D1_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor Length = 1103 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + N Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFN 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|170743967|ref|YP_001772622.1| TadE family protein [Methylobacterium sp. 4-46] gi|168198241|gb|ACA20188.1| TadE family protein [Methylobacterium sp. 4-46] Length = 134 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 10/131 (7%), Positives = 30/131 (22%), Gaps = 5/131 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ + E+ + FA+ + L+ I Y A A ++ + Sbjct: 1 MRRFGSDERGATAAEFAVVAPILIALVMGSIEFGSIMYT----LGTTEFATNDAARQLAT 56 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 N I+ + + + + ++ Sbjct: 57 NRITASQVAGIIALRLPS-WAQASAAVTISQSSTDPNKNQYTVTTNVPLSSATPTQFFSL 115 Query: 131 HMANNRLDSSN 141 + + Sbjct: 116 IYGTKSFNCTA 126 >gi|241662330|ref|YP_002980690.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Ralstonia pickettii 12D] gi|240864357|gb|ACS62018.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Ralstonia pickettii 12D] Length = 1122 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 23/225 (10%), Positives = 55/225 (24%), Gaps = 16/225 (7%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA----LLLFLDSIDLLSHVKEDVYMG 255 G+ + + A++ +D + ++ Sbjct: 227 PGNPPYSRMIWLSRAWGYYANISGAGKMLEAVQADSPSHYQNLIDYLAPETNSNSAASGH 286 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 S + R Y S T T + ++ + +L +D + T Sbjct: 287 SKLEIKNGAVATPLSGSLQTARSY---FAGSQTGYSTPVTQSCQKNFVLLATDGNPTGKT 343 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN----FIKIVTISINA 371 + + + T N + T K I+ + + + Sbjct: 344 DGSMWALSDQQSTYNSSTGQWTFSNAANDVFSQITALRSTSVKGYTSPFDIQTYVVGMGS 403 Query: 372 ----SPNGQRLLKTCVSSP-EYHYNVVNADSLIHVFQNISQLMVH 411 + L + Y + +L FQ+I+ + Sbjct: 404 TVSNPASVAALNQMAKLGGTGNAYLSTDPATLQQQFQSIAADITT 448 >gi|54112390|ref|NP_000713.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Homo sapiens] gi|109658756|gb|AAI17469.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|109659118|gb|AAI17471.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|119597396|gb|EAW76990.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] Length = 1091 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + N Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFN 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|6090615|gb|AAF03259.1| dihydropyridine receptor alpha 2 subunit [Homo sapiens] Length = 1110 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + N Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFN 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|85859128|ref|YP_461330.1| hypothetical protein SYN_01503 [Syntrophus aciditrophicus SB] gi|85722219|gb|ABC77162.1| hypothetical membrane protein [Syntrophus aciditrophicus SB] Length = 173 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 44/112 (39%), Gaps = 3/112 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI-LAGA 65 + K+ ++ + ++ FAL + F LL+ +I + Y +++++ A LA Sbjct: 12 MKKISKRILSDSRGVTAVEFALIMPFFFLLLFAIIDFGWYFYSQHTIQFATREGTRLALV 71 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 + + E + K + A ++ S+ +A + + E Sbjct: 72 GVQLKDKDGNEMSRE--DSIIKTIQDNAALAVDPAALQISIYPVAAGYSDPE 121 >gi|15599750|ref|NP_253244.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] gi|9950799|gb|AAG07942.1|AE004869_6 type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] Length = 1161 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 36/384 (9%), Positives = 88/384 (22%), Gaps = 44/384 (11%) Query: 39 FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFI 98 I + K+++++ +AA + + + +R + + ++ Sbjct: 139 VTIEYGRGYDKESTIK--ADAA------YYYDFTGSSSCNRTNQACYTRRYVSTEQRQNF 190 Query: 99 KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 N + A + ++ N + + + S R Sbjct: 191 ANWYSFYRTRALATQTAANLAFYSLPENARVSWQLLNDSNCNQMGSGSSSGNCFSNYLR- 249 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F +N + + ++G + Sbjct: 250 DFTGQHRVNFFNWLEKLSVNGGTPLRQAMTRAGEFLKKTGVNGPYAYR--PGTQTAPEYS 307 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 L N + + D + + GT Sbjct: 308 CRGSYHILMTDGLWNNDSANVGNADSTARN--------LPDGKSYSSQTPYRDGTFDTLA 359 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + TD+ P + + ++ + + P+ L Sbjct: 360 -----DQAFHYWATDARPDIDDNIKPYIPYPDQANPSAEYWNPRNDPATWQHMVTYTLGL 414 Query: 339 GENNNFKSN---------VNTIKICDKA---KENFIKIVTISINASPNGQRLLKTCVSSP 386 G + S + + + N S N L V+S Sbjct: 415 GLTTSLTSPRWEGSTFSGGYNDIVAGNLSWPRASN--------NDSNNVYDLWHAAVNSR 466 Query: 387 EYHYNVVNADSLIHVFQNISQLMV 410 ++ + D L+ FQ+I + Sbjct: 467 GEFFSADSPDQLVAAFQDILNRIS 490 >gi|21222916|ref|NP_628695.1| hypothetical protein SCO4532 [Streptomyces coelicolor A3(2)] gi|256785967|ref|ZP_05524398.1| hypothetical protein SlivT_15867 [Streptomyces lividans TK24] gi|289769859|ref|ZP_06529237.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|10178323|emb|CAC08348.1| putative membrane protein [Streptomyces coelicolor A3(2)] gi|289700058|gb|EFD67487.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 153 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 12/138 (8%), Positives = 36/138 (26%), Gaps = 3/138 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ ++ + + + + FL L F+I K+ A A + Sbjct: 12 RRTRLDDRGSGAGAVIIFALVFLSLSAFVIDGGMSISKRERAADIAEQAARYAAQDIDRE 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG---YSAVFYNTEIQNIVNSSRIS 128 + N+ A + ++ +A +++ + S + Sbjct: 72 ALYDDVGGPAPINYENCNARVKAFAAEMDMTGADIAATHCVTADAAQVQVEVQLTYSPVF 131 Query: 129 MTHMANNRLDSSNNTIFY 146 + + Sbjct: 132 TGMFYGGDVVVHGEAVAE 149 >gi|148263861|ref|YP_001230567.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Geobacter uraniireducens Rf4] gi|146397361|gb|ABQ25994.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Geobacter uraniireducens Rf4] Length = 1482 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 35/304 (11%), Positives = 73/304 (24%), Gaps = 24/304 (7%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A+ + + N + I + +K + A T N Sbjct: 219 ATAGANGVISPAGATNVAKNADQTFNIIPNAGYNVLDVKIDGTSVGARTSYTFTNVTANH 278 Query: 125 SRISMTHMANNRLDSS---NNTIFYNMDVMTSYDYRLQFIEHLLN--------QRYNQKI 173 + + + + + I + + + Sbjct: 279 TINVTFALQTTSMGNYCQQPSFIGSITQLKPNVLILSDNSGSMGWLAYESETYNNAKSYY 338 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE----DVNSAPICQDKKRTKMA 229 F P + G ++D + M + DV + K + A Sbjct: 339 GYFDPQKMYKNSGTVFTIDTTKILDKNDIMSGNYLNWQMMRRIDVVRKALIGGKVVNRTA 398 Query: 230 ALK--NALLLFLDSIDLLSHVKED------VYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 A K +A F V GL+ + + E + VT Sbjct: 399 ATKYLSANGGFTVEYGPTEPTGIIQALDGKVRFGLMFFRSNYEGGYIAAPIGSTAADLVT 458 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTS-DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T ++ +A + ++ I + F++ +TDGE Sbjct: 459 QIEAKNPSGNTPLAESIYEATKYFGHVASAYDAAVDYSTWADPILYPCQKNFMLVVTDGE 518 Query: 341 NNNF 344 Sbjct: 519 PTAD 522 >gi|239934063|ref|ZP_04691016.1| hypothetical protein SghaA1_38047 [Streptomyces ghanaensis ATCC 14672] gi|291442502|ref|ZP_06581892.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291345397|gb|EFE72353.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 152 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 9/119 (7%), Positives = 27/119 (22%), Gaps = 4/119 (3%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 + + +IG + + A AG + + G+ A A Sbjct: 33 ALIGIIGVAVDGGGKMRAVERADHIAGEAARAGGQAIDPAKAISGEAVIVAPQDAVAAAQ 92 Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV 150 + S ++ + + + + + + + Sbjct: 93 AYLRSVGA----TGTVSVSGDGKTLTVRVADSYHTKFLPVVGIGSMPVTGHGKATLLHG 147 >gi|153834067|ref|ZP_01986734.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148869622|gb|EDL68612.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 149 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 31/142 (21%), Gaps = 5/142 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV----- 69 ++ I F L++ L+L +I V + + A Sbjct: 8 AKAQSGLAIIEFILALPVLLMLSVLVIDVCRAFIQYTEVNKALQNGARYAVVDTYGTLSF 67 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 +++ + + A +I+ T S Sbjct: 68 DSIADETSIKNVVVYGSPTASTTPIIDYIEVGDITITQPTETNKEVTLSATYNYVPIFST 127 Query: 130 THMANNRLDSSNNTIFYNMDVM 151 +N L S Sbjct: 128 LPFSNQSLQFSIGATTTMRTGP 149 >gi|271962919|ref|YP_003337115.1| hypothetical protein Sros_1377 [Streptosporangium roseum DSM 43021] gi|270506094|gb|ACZ84372.1| hypothetical protein Sros_1377 [Streptosporangium roseum DSM 43021] Length = 584 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 12/111 (10%), Positives = 40/111 (36%), Gaps = 8/111 (7%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NF 361 S FR+ ++ +T G ++ + ++ ++ ++ + Sbjct: 469 SSSLYDSVLAGFREVTDSYDETMNNTLLVITAGRDDGKGLSSG--ELAERLRDEWDPEHP 526 Query: 362 IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNISQLMVH 411 ++IV ++ + L + + + + +I VF +++ + H Sbjct: 527 VQIVVLAFGDDLDRAALGQVASITNGSLHIAQEPNEIIEVFLSALARRLCH 577 >gi|257866903|ref|ZP_05646556.1| predicted protein [Enterococcus casseliflavus EC30] gi|257872580|ref|ZP_05652233.1| predicted protein [Enterococcus casseliflavus EC10] gi|257800861|gb|EEV29889.1| predicted protein [Enterococcus casseliflavus EC30] gi|257806744|gb|EEV35566.1| predicted protein [Enterococcus casseliflavus EC10] Length = 779 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 26/348 (7%), Positives = 71/348 (20%), Gaps = 34/348 (9%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + D + + + Sbjct: 30 SETTTYDFDYQTDATGQYLTNGTAQGDTYNYKYGQAP---GVAFDEENYVNYADEAYLKK 86 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLR 182 D + + + + ++ ++ Sbjct: 87 SVKEAAGQVLFDVTLAVKGNQTSTPVDLVMVIDYSSSMTGEKLTNALLGLQQFGNELADS 146 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + G I ++ ++ + + + + + T M A L Sbjct: 147 LANGSIRIGIVAYNREIYTTNGFSTDIAYLEQFLRETAETHSGTFMQKGLLAGQQLLLEE 206 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 K +++G R Y+ D ++ Sbjct: 207 SRPEAEKLFIHIG----------------DDSANRSYLPADDATVYQNTGAIRDYNGYHT 250 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + S S +TD T+ K++ Sbjct: 251 ESYVESFQTSATAYHTTSKNPSDSQAIPVSSTVVTD----------ETLGTIVALKKSGF 300 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLM 409 +++ S G+ + + S+P + N L I+ + Sbjct: 301 TYYSVATAPSARGEYIGRNLASAPSNYLTTDENLSGLGSALTEIANHI 348 >gi|319425465|gb|ADV53539.1| type IV pili-associated adhesin, PilY [Shewanella putrefaciens 200] Length = 1165 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 21/203 (10%), Positives = 49/203 (24%), Gaps = 24/203 (11%) Query: 228 MAALKNALLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-------RQY 279 L A F + + K I T + G+ Y Sbjct: 329 CETLYEAYQYFSGGQVTFGNKDKNGTGNNKIDGYTPNNPPSILTSGSYTTPFKKCPDTAY 388 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + D + + + +F G K + ++ + Sbjct: 389 IIYITDGAPTLDKSADTLINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNN 448 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEY---------- 388 + + I + I++ TI + + LL+ Sbjct: 449 DIVIGNKDSLGID-----NKQNIRVFTIGFSDGADEAAALLEETAFRGGNPRGSNNISKG 503 Query: 389 HYNVVNADSLIHVFQNISQLMVH 411 +Y N L+ ++ + ++ Sbjct: 504 YYVAKNGLDLVSALEDALKSILT 526 >gi|315105441|gb|EFT77417.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] Length = 322 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 65/212 (30%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMPPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLNGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L +Q + + Sbjct: 256 STVADRTHAKSWTADSADKLQEGYQQVHSSVG 287 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 41.0 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 77/227 (33%), Gaps = 34/227 (14%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 D SGSM+ + D D A K L+ ++ + + Sbjct: 766 DPSGSMNIYLVLDGSDSIGAGN--------FTGAKKCLVNLIEKVASYGVKPRYALVTYA 817 Query: 258 GYTT-RVEKNIEPSWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 Y V+ + + S + V + ++ D + T++ A++ Y +++ + Sbjct: 818 TYPRIWVKVSDQESSNADWVTKKLSEINYEDHKLKSGTNTKRALQAVYSMMS------WP 871 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KENFIKI 364 + +G II +TDG +N + I +E+++ + Sbjct: 872 EDIPPEGWNRTRH----VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDV 927 Query: 365 VTISIN--ASPNGQRLLKTCVSSPEYHYNVVNADSLIHV-FQNISQL 408 + L + + ++ + V + ++L V FQ I + Sbjct: 928 YVFGVGPLVDQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDES 974 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 30/341 (8%), Positives = 77/341 (22%), Gaps = 27/341 (7%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 D + + S S + + + + + Sbjct: 77 ASNGWDKNSSWSGDSSDTSKSYLKFGTDTSNPDYQIRKYAKETSTPGLYDVYLNAKGNEV 136 Query: 147 NMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPALLRIEMGERPIFLI-----ELVVDLS 200 + + + V ++ + I VV S Sbjct: 137 KNIKPIDIVLVVDMSGSMNSSVNGGNDRVGAARQGVKNFLKTINDAGIGKYVNVGVVGFS 196 Query: 201 GSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT 260 + + + + A Q T++ L F ++ M Sbjct: 197 SPGYISSSGTLSENIDASDNQAHI-TRINNL--LANDFKGGTFTQLGIRTGQSMLAGDSN 253 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + I + G VT + TD A + R + Sbjct: 254 DHKKMMILLTDGVPTFSYKVTGAT---TINGTDYGTAFGTNRD-----EPRFTSQLWKAS 305 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV---------TISINA 371 G + + D ++ + I++ + + Sbjct: 306 GNSRTPSSYTVSGKTIKDTWPATLGESLIAKSQSTELHALGIQLSKDVGYTNNNSYTYLT 365 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + +K S P + + +A + +N ++ ++ + Sbjct: 366 DSQVRDRMKLLAS-PGLYQDANSASDVETYLKNQAKDVLSQ 405 >gi|149915863|ref|ZP_01904387.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] gi|149810186|gb|EDM70032.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] Length = 235 Score = 41.0 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 11/123 (8%), Positives = 30/123 (24%), Gaps = 16/123 (13%) Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A L + + ++ +TDG Sbjct: 97 IIAAVDALRPGGLTPLAASVQAAAEALNYRTTPGIVVLVTDGNETCGGRPCALGTALAD- 155 Query: 358 KENFIKIVTISI--NASP-----------NGQRLLKTCVS--SPEYHYNVVNADSLIHVF 402 + + + I P +G ++ C+S + + + + L Sbjct: 156 QAEGLTVHVIGFRVQYDPFAWDNPEAQTYDGGAVVAKCLSDRTGGLYVDTQTVEELTEAL 215 Query: 403 QNI 405 + + Sbjct: 216 EAV 218 >gi|87312294|ref|ZP_01094392.1| hypothetical protein DSM3645_06434 [Blastopirellula marina DSM 3645] gi|87284998|gb|EAQ76934.1| hypothetical protein DSM3645_06434 [Blastopirellula marina DSM 3645] Length = 135 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 14/121 (11%), Positives = 37/121 (30%), Gaps = 8/121 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + FA+ F LL+ +I + A GA + V + + Sbjct: 3 RSKRTGAAVVEFAIVAPLFFLLVFGMIEYGRMVMV----QQVITNASREGARRAVLDGAT 58 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + ++ ++A + I+ S + + ++ ++ M + Sbjct: 59 TSEVVAAVEEFLEQASVSGGNLEIRVSPDPPGSASNGD----PVGVTISVPFSDVSWMPS 114 Query: 135 N 135 Sbjct: 115 P 115 >gi|311254427|ref|XP_003125838.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Sus scrofa] Length = 1177 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPEALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 + ++ S P+ + +NV + +L + + + Sbjct: 320 QEIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|218680613|ref|ZP_03528510.1| TadE family protein [Rhizobium etli CIAT 894] Length = 176 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 8/58 (13%), Positives = 24/58 (41%), Gaps = 4/58 (6%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + ++ + K +I FA+ + +++ +I V + + A +A++ Sbjct: 3 RKKIFAPLRRLVGDHKGAAAIEFAILALPLFIILFGIIEVSLMFFVNS----ALDASV 56 >gi|332237899|ref|XP_003268144.1| PREDICTED: integrin alpha-10 isoform 2 [Nomascus leucogenys] Length = 1036 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 73/243 (30%), Gaps = 41/243 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-----------KNALLLFLDSI 242 + H +N D N + D T M + + + FL + Sbjct: 1 MNITHAYSQGHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRL 60 Query: 243 -DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMK 299 L E + +GL+ Y K V + + T + A+ Sbjct: 61 VGKLFIDPEQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIM 119 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A S + ++ +TDGE+++ + +K C+ + Sbjct: 120 VACTEGFSQSHG-------------GRPEAARLLVVVTDGESHDGEQLPAALKACEAGR- 165 Query: 360 NFIKIVTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + I++ + R ++T S P+ + +NV + +L + + + Sbjct: 166 --VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRI 223 Query: 410 VHR 412 Sbjct: 224 FGL 226 >gi|78062899|ref|YP_372807.1| TadE-like protein [Burkholderia sp. 383] gi|77970784|gb|ABB12163.1| TadE-like protein [Burkholderia sp. 383] Length = 142 Score = 41.0 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 14/130 (10%), Positives = 37/130 (28%), Gaps = 3/130 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA---SKMVSN 71 + S+ F L + L+++ +I K + +A+ A AG M++ Sbjct: 8 ARHARGVVSLEFVLMLPFLLMVLIGIIDTSLILCDKAVITNASREAARAGVMLRVPMLTT 67 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 ++ N + +G + T + + Sbjct: 68 TQIASIAQNAMQNSLITGGTATTPAVTVTQANGTTAGTALTVTVTYTYSGLVLGSAFSAL 127 Query: 132 MANNRLDSSN 141 + +++ Sbjct: 128 TGPVTVSATS 137 >gi|308171956|ref|YP_003918661.1| hypothetical protein BAMF_0065 [Bacillus amyloliquefaciens DSM 7] gi|307604820|emb|CBI41191.1| conserved hypothetical protein [Bacillus amyloliquefaciens DSM 7] gi|328551766|gb|AEB22258.1| hypothetical protein BAMTA208_00330 [Bacillus amyloliquefaciens TA208] gi|328910026|gb|AEB61622.1| hypothetical protein LL3_00067 [Bacillus amyloliquefaciens LL3] Length = 245 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT-- 381 + ++ +TDG +N+ + I + AKE I + I I + + + + Sbjct: 2 NKGHLNQVLLITDGCSNHGED---PIAMAAFAKEQGITVNVIGIMEENAIDQEAMKEVEG 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + VV A L Q +++ Sbjct: 59 IAMAGGGVHQVVYASQLSQTVQMVTKK 85 >gi|154684584|ref|YP_001419745.1| YabS [Bacillus amyloliquefaciens FZB42] gi|63146677|emb|CAG28929.1| YabS protein [Bacillus amyloliquefaciens FZB42] gi|154350435|gb|ABS72514.1| YabS [Bacillus amyloliquefaciens FZB42] Length = 245 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT-- 381 + ++ +TDG +N+ + I + AKE I + I I + + + + Sbjct: 2 NKGHLNQVLLITDGCSNHGED---PIAMAAFAKEQGITVNVIGIMEENAIDQEAMKEVEG 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + VV A L Q +++ Sbjct: 59 IAMAGGGVHQVVYASQLSQTVQMVTKK 85 >gi|113969431|ref|YP_733224.1| type IV pilin biogenesis protein [Shewanella sp. MR-4] gi|113884115|gb|ABI38167.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-4] Length = 1223 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 39/355 (10%), Positives = 98/355 (27%), Gaps = 29/355 (8%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + LG + + D + E + + ++ + Sbjct: 206 AAGEAQGLPVDSLGTASKPVRYTEASTSSSDNVKSAAKAKAEKTAFGTGKVVTVYTEDYL 265 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 N T + RL + + I + D+ L + Q+ I + ++ Sbjct: 266 NWLHGPKTRVNKTRLALAKDAIESVILTTPGVDFGLAIFN-MNGPYDGQRDGGRIISGIK 324 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I L++ V D++ + + + + Q + + + + + Sbjct: 325 PMSAAAKIDLLKGVNDINYAQNTPLCETLYEAYRYFGGQSQYFGDDDSDYDYFDRWGRYL 384 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +T + + ++ I TD P + A Sbjct: 385 GTYQASRN-------SFTDPEALVVGTNNYNSPFKRCQNEAYIIYI---TDGEPTLDNAA 434 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L +GV + ++ L+ N + + A + + Sbjct: 435 DGL---------VEQLTKGVGKHTSRPVSYLSALSGWMRNEDVNP-------NSAGKQTV 478 Query: 363 KIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + TI + L +T + +++ +A L Q ++ + S Sbjct: 479 RTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEKNAS 533 >gi|170748500|ref|YP_001754760.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170655022|gb|ACB24077.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 195 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 36/111 (32%), Gaps = 5/111 (4%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + I + +I F++ LL++ I + + +++A A + ++S Sbjct: 12 RLIGDREGVSAIEFSVIAPILLLILMGSIELPRAYMIGKRLDNAA-----ATMADLISRG 66 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 S + + A D ++ I + S S N Sbjct: 67 SYADLKPVFAATGAISNPYDVSRASIVLTAAGTYSDGSVATTKVCSSAESN 117 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 28/245 (11%), Positives = 66/245 (26%), Gaps = 45/245 (18%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + P + V+D SGSM Sbjct: 333 DGKTYLLAFVTPPTAPDTAAPPAKREVVFVIDNSGSMSGPS------------------- 373 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 + + +L L + ++ + +V T + + + + + Sbjct: 374 -IEQARQSLALAISKLNPDD--RFNVIRFDDTMTDYFKGLVAATPD--NREKAIAYVRGL 428 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ PA++ A + + + ++FLTDG Sbjct: 429 TADGGTEMLPALQAALR-----------------NQGPVATGALRQVVFLTDGAI--GNE 469 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +I A + ++ T+ I ++PN + K + + D + + Sbjct: 470 RQLFQEI--TANRSDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELF 527 Query: 407 QLMVH 411 + + Sbjct: 528 AKLQN 532 >gi|221112732|ref|XP_002163909.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 1516 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 36/98 (36%), Gaps = 5/98 (5%) Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPF-QKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 ++ SF + G KI + KF + ++D +N+ + + K + + Sbjct: 1049 AYQNESPESFNSGIMHTGNKIVTPKLGNKFGLTMSDKPHND---SNKLASLALKMRNKGV 1105 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 IV I I + + ++ L S + L+ Sbjct: 1106 IIVAIGIGPNVDQKK-LTDIAGSSDKVLLAEKMQDLLE 1142 >gi|163738633|ref|ZP_02146047.1| TadE-like protein [Phaeobacter gallaeciensis BS107] gi|163742981|ref|ZP_02150364.1| hypothetical protein RG210_01907 [Phaeobacter gallaeciensis 2.10] gi|161383664|gb|EDQ08050.1| hypothetical protein RG210_01907 [Phaeobacter gallaeciensis 2.10] gi|161387961|gb|EDQ12316.1| TadE-like protein [Phaeobacter gallaeciensis BS107] Length = 186 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 23/44 (52%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 E N ++ FA+ + +FL L+ + + +++ +E A + + Sbjct: 24 EDGNATVEFAIVIPAFLFLLMNTVELGMITIQQSMLERALDQTV 67 >gi|148537043|dbj|BAF63430.1| Ca(2+)-activated chloride channel splicing variant [Rattus norvegicus] Length = 514 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 25/136 (18%) Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSLILKPTDST 295 + L V+++ +GL+ + + + + +T ++ + T Sbjct: 329 NQAAELYLTQIVEKESMVGLVTFDSTAQIQNYLIKITNTGDYKKITGNLPQQAVGGTSIC 388 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC- 354 ++ +Q +TS + + + I+ LTDGE++ I C Sbjct: 389 RGLEAGFQAITSSDQSTSGSE----------------IVLLTDGEDD-------LISSCF 425 Query: 355 DKAKENFIKIVTISIN 370 + K + I TI++ Sbjct: 426 EVVKHSGAVIHTIALG 441 >gi|327541352|gb|EGF27893.1| TadE family protein [Rhodopirellula baltica WH47] Length = 128 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 14/118 (11%), Positives = 29/118 (24%), Gaps = 2/118 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ FA + L LI + ++S A A + Sbjct: 2 RHGAAAVEFAFIAPLMIFLTFGLIELGRLSMLRDSAIHATREAARVAIKPSATTSEISSR 61 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 E +S + + + Y +N + ++M H Sbjct: 62 VEEELSLMGISGGSSTI--NFTSDGSTGVELVTVNVYIPIAENSWLPNTLAMGHTNIE 117 >gi|187479012|ref|YP_787036.1| hypothetical protein BAV2525A [Bordetella avium 197N] gi|115423598|emb|CAJ50136.1| putative exported protein (partial) [Bordetella avium 197N] Length = 116 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 9/86 (10%), Positives = 31/86 (36%), Gaps = 1/86 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 ++ + + AL ++ ++L+G ++Y K ++ + L + + + Sbjct: 3 QRGSILVPAALIILIGMVLLGGA-QFGYFYYVKRELQKTADLVALTAVQVIDGSAASCAF 61 Query: 78 RFESISNHAKRALIDDAKRFIKNHIK 103 + + K+ L ++ Sbjct: 62 ADLTAKANVKQNLSAYHTALNTPSVQ 87 >gi|83647468|ref|YP_435903.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635511|gb|ABC31478.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 659 Score = 40.6 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 14/151 (9%), Positives = 42/151 (27%), Gaps = 31/151 (20%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 ++ + + LD+ L+ ++ + + +R + Sbjct: 113 TPNRLTRARQKVYDLLDARKEGQ-------TALVAFSGSGHVVAPLTEDSNTLRAMLPAL 165 Query: 284 MD-SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + +++ ++ A ++ I+ +TDG Sbjct: 166 DPFIMPEMGSNAAAGIESALNVIKQAGA------------------VNARILLITDGVE- 206 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASP 373 + +V I D + + I + + A Sbjct: 207 --EVDVQPIN--DLLSKAGVSISVLGVGADD 233 >gi|17533685|ref|NP_496744.1| C-type LECtin family member (clec-64) [Caenorhabditis elegans] gi|3876682|emb|CAB03056.1| C. elegans protein F35C5.7, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 385 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 22/215 (10%), Positives = 57/215 (26%), Gaps = 21/215 (9%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL--DSIDLL 245 + D + + + N F I Sbjct: 10 TAFATTQASTDRNCGSNWNNLWLDIVIVVDNSKGMTNEGITEVAANIATTFASGPRIGTD 69 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAY 302 + ++ Y + + + + + + + + A Sbjct: 70 YSDPRSTRLSILTYNSEATVVADLNQFQSADDVYQTLFSFLNEVSDSDDSFLAKGLGMAE 129 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 +L + + N+ R V S + DGE+ + I + + + + Sbjct: 130 SVLYNGRMNGVRENYNRLVVVYASAYRE-------DGED--------PLNIAKRLRSSGV 174 Query: 363 KIVTISINASPNGQRLLKTC-VSSPEYHYNVVNAD 396 I T++ + +G L K ++SP + + + Sbjct: 175 AIATVAFDPDGDGLLLSKLSKIASPNMSFKSRDPN 209 >gi|158337841|ref|YP_001519017.1| hypothetical protein AM1_4727 [Acaryochloris marina MBIC11017] gi|158308082|gb|ABW29699.1| conserved hypothetical protein [Acaryochloris marina MBIC11017] Length = 971 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 32/375 (8%), Positives = 79/375 (21%), Gaps = 50/375 (13%) Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 L A S D +N + + S+ S N+ Sbjct: 414 LGTARDGDSGPIPSADALGDDNNGSDDEDGVTLAPLNASDTSYSIQASSLSVTNSTGGPA 473 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI-------- 173 + + D + N Sbjct: 474 TLHAWVDFDGNGTFDSDEYTSQPVQNGISSPDGALTWSGAGVSGMSGGTTTYARFRITTD 533 Query: 174 --VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 ++ GE + + ++ + D+ ++ Sbjct: 534 GGINANTPGGFARDGEVEDHALAITTPVNPDIDNNFCQVSSDMMFILDKSGSVS--LSER 591 Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDMDSL 287 + + ++ L +G++ + + I + + + ++ Sbjct: 592 RLQRDAVMAMLNYLVDNNITSRVGIVRFDSTSATVIGYTDVTAANLPTFESALNTNYVNI 651 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ +QA + S + F DG N+ S Sbjct: 652 GGGATNWEAGFQQAISL-------------------GVSPGSPDVVFFFADGNINSGGSP 692 Query: 348 VNTIKICDKAKENFIKIVTISINA---------SPNGQRLLKTCVSSPEY---HYNVVNA 395 + K+ I I I + + + V + Sbjct: 693 N---DEALQFKQAGAHIYGIGIQSLDIDDFLDITDGSNTTQFDAALDNANSADYVEVNSY 749 Query: 396 DSLIHVFQNISQLMV 410 D L ++ + + Sbjct: 750 DDLADDMTSLLRSLC 764 >gi|332206577|ref|XP_003252372.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 [Nomascus leucogenys] Length = 1107 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 275 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 332 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 333 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 378 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 379 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 423 >gi|297288988|ref|XP_002803438.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Macaca mulatta] Length = 997 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|296209823|ref|XP_002807089.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Callithrix jacchus] Length = 1094 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|226941831|ref|YP_002796905.1| VCBS [Laribacter hongkongensis HLHK9] gi|226716758|gb|ACO75896.1| VCBS [Laribacter hongkongensis HLHK9] Length = 1087 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 31/320 (9%), Positives = 75/320 (23%), Gaps = 58/320 (18%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + G + + +A +D + E+ + ++ Q Sbjct: 174 ASVMVVALTPPDTGSKGDWSYRVELKAPMDHSAPVAPVKGDENTRAFDVSIKVSDGQTTT 233 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 + +++T + + ++ + T+ L + + Sbjct: 234 TAP-LNITVEDDMPVVANTSQTVSLPSQDTNLLLTLDVSGSMNDP--------------- 277 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + K +++A K A+ LD Sbjct: 278 --------------------------------SGVKDANGKDLSRLALAKQAISQLLDQY 305 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 D L VK + L+ ++ + + + T+ A+ A Sbjct: 306 DALGDVK----VQLVKFSEGGSVQSNNWMTVAEAKAALAGITKGD--GGTNYDEALDLAR 359 Query: 303 QILTS----DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 Q D ++ F + + + D + + Sbjct: 360 QAFAKPGQLDGAKNVSYFFSDGDPTLSNSGQKNNSGATVDPDKGDGIDATEQKSWEAFLT 419 Query: 359 ENFIKIVTISINASPNGQRL 378 N I I + L Sbjct: 420 NNNILSHAIGLGQDVKSTYL 439 >gi|260789946|ref|XP_002590005.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] gi|229275192|gb|EEN46016.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] Length = 159 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 51/171 (29%), Gaps = 24/171 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + K + ++ L D +G++ ++ R +N + R Sbjct: 10 SSESFRTYGFEEEKTFVQTVVNYFTLGE---NDTRVGVVTFSDRDRQNTRVTLNEHYTRV 66 Query: 279 YVTRDMDSLIL--KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + ++ + T + + +F + + F+I + Sbjct: 67 ELLTEIRDIPYDRGHTYTGLGLDH-----------VRNNSFLEVNGRRNNTL--DFLIVI 113 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSP 386 TD E+ + ++ I + + + + Q L+T P Sbjct: 114 TDDESEDDIVRP-----AQLIRQMGITVFVVGVGQESDISQPTLETIAGDP 159 >gi|329744564|ref|NP_001192916.1| voltage-dependent calcium channel subunit alpha-2/delta-1 [Bos taurus] Length = 1091 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|114614242|ref|XP_001160235.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 1 [Pan troglodytes] Length = 1110 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|114614244|ref|XP_001160279.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Pan troglodytes] gi|114614246|ref|XP_519175.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 3 [Pan troglodytes] Length = 1091 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 59/165 (35%), Gaps = 25/165 (15%) Query: 228 MAALKNALLLFLDSIDLLSHV---KEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTR 282 K+ + D + L+ V K D+ + + +++ V+ + S + +Q V Sbjct: 64 FDKQKDFVNSLSDKVFQLTPVGFLKYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 I + T S A+ A +L + ++ K + +TDG ++ Sbjct: 124 MN--FIGQGTFSYYAISNATMLLKREGRKDGV----------------KVALLMTDGIDH 165 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE 387 + I + A+ I +TI ++ N +L P Sbjct: 166 PKNPD--VQSISEDARTAGILFITIGLSTVVNEAKLRLISGDPPS 208 >gi|197102272|ref|NP_001124862.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Pongo abelii] gi|55726163|emb|CAH89855.1| hypothetical protein [Pongo abelii] Length = 1079 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 49/170 (28%), Gaps = 22/170 (12%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 ++ ++ L+++ V + + + ++ + + Sbjct: 271 LIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQ--ANVRNKKVLKDAVNNITAK 328 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 TD A++ L + K I+ TDG + Sbjct: 329 GITDYKKGFSFAFEQLLNYNVSRANC--------------NKIIMLFTDGGEERAQEIFT 374 Query: 350 TIKICDKAKENFIKIVTISINA-SPNGQRLLKTCVSSPEYHYNVVNADSL 398 K+ +++ T S+ + + + + Y+Y + + ++ Sbjct: 375 KYN-----KDKKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|327285304|ref|XP_003227374.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Anolis carolinensis] Length = 3587 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 48/158 (30%), Gaps = 38/158 (24%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR------VEKNIEPSWGTEKVRQYVTR 282 A ++ L + V + ++ ++++ V+ PS G + +Q + Sbjct: 91 ANFRSELRFVTKLLSDFPVVPTATRVAIVTFSSKNHVLSRVDYISPPSPGRTQPQQQRQQ 150 Query: 283 D-----------MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 T + A +QA QIL + K Sbjct: 151 QHKCALLGREIPGIGYRGGGTYTKGAFQQAAQILLHSRAN-----------------ATK 193 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 I +TDG +N I +E ++I T I Sbjct: 194 VIFLITDGYSNGGD----PRPIAASLREFGVEIFTFGI 227 >gi|301614661|ref|XP_002936793.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 344 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 46/120 (38%), Gaps = 24/120 (20%) Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 + + T + A+++A Q +G +K I LTDG+ + + Sbjct: 2 MYMGEGTYTGTAIRKATQ----------------EGFFGARTGVRKVAIVLTDGQ-TDKR 44 Query: 346 SNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKT---CVSSPE--YHYNVVNADSL 398 +V +A+ I++ I I + P L+ S P+ + Y + + ++L Sbjct: 45 ESVKLDIAVREAQAANIEMYAIGIVNASDPTQVDFLRELNLIASDPDSEHMYLIDDFNTL 104 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 53/192 (27%), Gaps = 34/192 (17%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT--RVEKNIEPSWGTEKVRQYVT 281 R + + + + + + LI +++ + + + VT Sbjct: 248 YRKYIEMAEELVNSLV-------IGRRFSRIALITFSSVGKSRTQFNLDRYFDG-KDIVT 299 Query: 282 RDMDSLILKPT-DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T ++ + +Q P +K ++ TDG Sbjct: 300 AIRRLESSGGTTAIGEGIRLGTEQ------------KDKQHGGRPVEIAKKIMLVFTDGW 347 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISIN----ASPNGQRLL----KTCVSSPEYHYNV 392 +N ++ AK + TI N L +T V ++ Y+ Sbjct: 348 SNKGPD---VEEMTRNAKGAGFTLYTIVYEGNGRVDANSPGLNLYTIETMVDDHKHVYSE 404 Query: 393 VNADSLIHVFQN 404 N LI + Sbjct: 405 RNFTQLIQELRQ 416 >gi|320106711|ref|YP_004182301.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319925232|gb|ADV82307.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 384 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 57/207 (27%), Gaps = 47/207 (22%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + ++N +F ++ + + ++ Y R + + + + Sbjct: 149 LQDMRNTSDVFFRTLKPEDY------IAIVTYDLRTHILTDFTQDKRVTAEALNSLTIPG 202 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T+ A+ + LT + +K+II + G + K Sbjct: 203 FSD-TNMFDALSETLDRLT-------------------RIDGRKYIILVGTGRDTFSKL- 241 Query: 348 VNTIKICDKAKENFIKIVTISIN------ASPNG--------------QRLLKTCVSSPE 387 + + + I I A G ++ + Sbjct: 242 TLDKILAKVKQAQNVTIFAIGTGQLVRELADSRGRMGGIARMDYLQADNQMKTFAQMTGG 301 Query: 388 YHYNVVNADSLIHVFQNISQLMVHRKY 414 Y + +L VF I+Q + ++ Sbjct: 302 QAYFPLFQGALPDVFSQINQSIRNQYV 328 >gi|206559892|ref|YP_002230656.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] gi|198035933|emb|CAR51825.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] Length = 164 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 11/66 (16%), Positives = 22/66 (33%), Gaps = 7/66 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 E+ +I FAL + F L++ +I + ++ LA + Sbjct: 7 VSGMRRRERGATAIEFALMLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARA 59 Query: 71 NLSRLG 76 L+ Sbjct: 60 ALNYQQ 65 >gi|167566517|ref|ZP_02359433.1| hypothetical protein BoklE_28426 [Burkholderia oklahomensis EO147] gi|167573598|ref|ZP_02366472.1| hypothetical protein BoklC_27443 [Burkholderia oklahomensis C6786] Length = 598 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 46/157 (29%), Gaps = 19/157 (12%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME---------- 54 SR R + ++ +F+I+ A+++ + +G + + + Y++ +++ Sbjct: 20 SRQRRARRSPAR-QRGSFAIVAAIAIGVVIAALG-AVDLGNLFYQRRALQGIADMAALAA 77 Query: 55 -SANN------AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 + AA A+ S + ++ + F S + Sbjct: 78 AQTMDDGCVKPAATAQSAALGNGFDSSAAGQSMTVVCGRWDVKDNTGPSFFAGSAAGSAA 137 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 G A ++ + + Sbjct: 138 GSDAQLNAVQVTVTRTVPYYFLGSQRTIAATGTAQAT 174 >gi|149175889|ref|ZP_01854507.1| hypothetical protein PM8797T_24761 [Planctomyces maris DSM 8797] gi|148845336|gb|EDL59681.1| hypothetical protein PM8797T_24761 [Planctomyces maris DSM 8797] Length = 157 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 43/127 (33%), Gaps = 9/127 (7%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 + +K + + ++ AL V +++ + V + + N+A+ Sbjct: 6 KRKIQKSYSERRGVAAVESALVVPLLVMIAMGTMDVSQYVNV---AQLVNDASYEGARRA 62 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY------SAVFYNTEIQNI 121 +N+ + +S+ N+ + A I + + ++ Y + + Sbjct: 63 SQNNVKNQSEVEQSVLNYFTSQFPNQASGEIGSALTVNVRDYLEANIPNGNLETVPSGSA 122 Query: 122 VNSSRIS 128 V+ I Sbjct: 123 VSVQVIF 129 >gi|387437|gb|AAA39549.1| MHC factor B [Mus musculus] Length = 477 Score = 40.6 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 61/195 (31%), Gaps = 28/195 (14%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW----GTEKVRQYVTRDM--DSLILKP 291 + I+ ++ GL+ Y T + + S + V + + + D + Sbjct: 6 LTNLIEKVASYGVRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSG 65 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ A++ Y +++ II +TDG +N + V I Sbjct: 66 TNTKRALQAVYSMMS--------WAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVI 115 Query: 352 KICDKA----------KENFIKIVTISINASPNGQRLLKTCVSSPE--YHYNVVNADSLI 399 + +E+++ + + + + + + V + + L Sbjct: 116 QDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLE 175 Query: 400 HVFQNISQLMVHRKY 414 +VF + Sbjct: 176 NVFYQMIDETKSLSL 190 >gi|114046659|ref|YP_737209.1| type IV pilin biogenesis protein [Shewanella sp. MR-7] gi|113888101|gb|ABI42152.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-7] Length = 1223 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 41/355 (11%), Positives = 95/355 (26%), Gaps = 29/355 (8%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 A + LG + + D + E + + ++ + Sbjct: 206 AAGEAQGLPVDSLGTASKPVRYTEASTSSSDNVKSAAKAKAEKTAFGTGKVVTVYTEDYL 265 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 N T + RL + + I + D+ L + Q+ I + ++ Sbjct: 266 NWLHGPKTRVNKTRLALAKDAIESVILTTPGVDFGLAIFN-MNGPYDGQRDGGRIISGIK 324 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 I L++ V D++ + + + + Q + + + + + Sbjct: 325 PMSAAAKIDLLKGVNDINYAQNTPLCETLYEAYRYFGGQSQYFGDDDSDYDYFDRWGRYL 384 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 + +T + + ++ I TD P + A Sbjct: 385 GTYQASRN-------SFTDPEALVVGTNNYNSPFKRCQNEAYIIYI---TDGEPTLDNAA 434 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 L + K S P N + N + + Sbjct: 435 DGLVEQLTKGV--------GKHTSRPVSYLSALSGWMRNEDVNPNSVGKQT--------V 478 Query: 363 KIVTISINASP-NGQRLLKT-CVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + TI + N + LLK + +++ +A L Q ++ + S Sbjct: 479 RTFTIGFSEGAANAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEKNAS 533 >gi|330969427|gb|EGH69493.1| hypothetical protein PSYAR_02909 [Pseudomonas syringae pv. aceris str. M302273PT] Length = 678 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 35/107 (32%), Gaps = 1/107 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + +L+ A + Sbjct: 72 ASAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADST 118 >gi|225181999|ref|ZP_03735431.1| flp pilus assembly protein TadD [Dethiobacter alkaliphilus AHT 1] gi|225167284|gb|EEG76103.1| flp pilus assembly protein TadD [Dethiobacter alkaliphilus AHT 1] Length = 319 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 8/149 (5%), Positives = 31/149 (20%), Gaps = 13/149 (8%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K + +++ + ++ A+ + L + V + + + + + Sbjct: 7 LKNILNNQRGSVLVLVAVGMTVLLGAAALVADVGVNYVTQARLS----------VAADAA 56 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 L+ E A+ K + + ++ Sbjct: 57 ALAGGTLFDEGRDAVMLAAVETAEKNGVPAE--QVFVEVDDNARGVTVRTQAPVQLFFGR 114 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 ++ Sbjct: 115 IFGMEGGAMEQRARAARTR-PIAFYGVFP 142 >gi|163745747|ref|ZP_02153107.1| TadE-like protein [Oceanibulbus indolifex HEL-45] gi|161382565|gb|EDQ06974.1| TadE-like protein [Oceanibulbus indolifex HEL-45] Length = 176 Score = 40.6 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 11/74 (14%), Positives = 25/74 (33%), Gaps = 1/74 (1%) Query: 7 FRFYFK-KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 + + K + SE +I F + + L + W ++ ++ A N A+ Sbjct: 1 MKGFLKHRFRRSEAGGVTIEFVILLPLVLYFFFLALETGLWSAREITLRRATNLAVRDVR 60 Query: 66 SKMVSNLSRLGDRF 79 + S + Sbjct: 61 LSTGTTPSYDDMKA 74 >gi|156350136|ref|XP_001622157.1| hypothetical protein NEMVEDRAFT_v1g142259 [Nematostella vectensis] gi|156208605|gb|EDO30057.1| predicted protein [Nematostella vectensis] Length = 169 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 15/159 (9%), Positives = 47/159 (29%), Gaps = 14/159 (8%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++AL + I+ +V++ + + + + R + S Sbjct: 18 DHFQDALSFVKNVINSFDVTPGNVHVAMAVCSDKPHIVFDF-------RAFNDHVSISRA 70 Query: 289 LKPTDSTPA-MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 L + + R + K + +T G + + + Sbjct: 71 LGSVKYPQGKLMAGGNLYFLLTGRCLRAIKSSIFINSGRHGVPKMLYVMTGGNSRDRVAL 130 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + +++ +++ ++ + + LK +SP Sbjct: 131 PS-----RSLRDDGVQVYSLGLG-NSYKAYELKEMATSP 163 >gi|47229020|emb|CAG09535.1| unnamed protein product [Tetraodon nigroviridis] Length = 680 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 34/138 (24%), Gaps = 18/138 (13%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ MG++ Y I + + + + Sbjct: 314 DFLAEVAQAINVGPTGPMMGVVQYGDDPVTEISLKSVSSSREARSAVEKITQKGGLSHVG 373 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ + SD + + L DG + + Sbjct: 374 RALSYINKQYFSDANGNRGAA-------------PNVAVVLVDGWPTD-----KVEEASR 415 Query: 356 KAKENFIKIVTISINASP 373 A+E+ I I ++I Sbjct: 416 LARESGINIFFVTIEGPD 433 >gi|116625492|ref|YP_827648.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] gi|116228654|gb|ABJ87363.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] Length = 754 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 13/72 (18%), Positives = 25/72 (34%), Gaps = 3/72 (4%) Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 LT+ + + + + S I+ L+DG +N ++ TI + Sbjct: 150 ADQLTAAAPATRIGDTLERVLAESSSLPLGAIVLLSDGADNAGGIDLATIAA---IRRQR 206 Query: 362 IKIVTISINASP 373 I I T+ Sbjct: 207 IPIHTVGFGKEH 218 >gi|291398101|ref|XP_002715428.1| PREDICTED: integrin, alpha 10 [Oryctolagus cuniculus] Length = 1169 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPTALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 + ++ S P+ + +NV + +L + + + Sbjct: 320 KEIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|118463538|ref|YP_880101.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] gi|118164825|gb|ABK65722.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] Length = 554 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 19/239 (7%), Positives = 63/239 (26%), Gaps = 2/239 (0%) Query: 36 LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAK 95 +G + +++ SA +A+ + V + R S Sbjct: 127 AVGSAVDSAVGSAVGSAVGSAVGSAVDSAVYSAVGSAVDSAVRSAVRSAVYSAVYSAVGS 186 Query: 96 RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYD 155 S + + + ++ ++ ++ +DS+ ++ + Y Sbjct: 187 AVGSAVGSAVDSAVGSAVDSAVRSAVDSAVGSAVGSAVDSAVDSAVDSAVGSAVYSAVYS 246 Query: 156 YRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVN 215 ++ + + S + + + + ++ VD + D + Sbjct: 247 AVGSAVDSAVASAVGSAVDSAVDSAVGSAVYSAVRSAVDSAVDSAVDSAVRSAVDSAVYS 306 Query: 216 SAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + +A+ +AL + + +E ++ Sbjct: 307 AVGSAVYSAV--YSAVYSALGSAPIKPFWHQLFGGRQWCWWPAFIAYFRDVVELQLDSD 363 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 19/203 (9%), Positives = 63/203 (31%), Gaps = 14/203 (6%) Query: 40 LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + ++++ SA ++A+ + V + + ++ + A+ + Sbjct: 107 AVD----SAVRSAVGSAVDSAVYSAVGSAV-DSAVGSAVGSAVGSAVGSAVDSAVYSAVG 161 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ 159 + + ++ + + + V S+ S A + S V ++ Sbjct: 162 SAVDSAVRSAVRSAVYSAVYSAVGSAVGSAVGSAVDSAVGSAVDSAVRSAVDSAV----- 216 Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + + + S + + + + + VD + + D ++ Sbjct: 217 --GSAVGSAVDSAVDSAVDSAVGSAVYSAVYSAVGSAVDSAVASAVGSAVDSAVDSAVGS 274 Query: 220 CQDKKRTKMAALKNALLLFLDSI 242 +A+ +A+ +DS Sbjct: 275 AVYSAVR--SAVDSAVDSAVDSA 295 >gi|90425191|ref|YP_533561.1| TadE-like [Rhodopseudomonas palustris BisB18] gi|90107205|gb|ABD89242.1| TadE-like [Rhodopseudomonas palustris BisB18] Length = 176 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 22/53 (41%), Gaps = 1/53 (1%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAA 60 + + + ++ FAL F+ L+ +I V + +E+ +AA Sbjct: 12 RALRGFRRHRRGSAAVEFALVAPIFIALLFAIIEVALVFFAGQLLETGTQDAA 64 >gi|301606953|ref|XP_002933089.1| PREDICTED: integrin alpha-2-like [Xenopus (Silurana) tropicalis] Length = 1081 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 68/191 (35%), Gaps = 26/191 (13%) Query: 230 ALKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYVTRDMDSL 287 A+ FL+ L + LI Y + + ++ T++ +D + Sbjct: 198 YPWYAVRNFLEKFVQGLDIGPSSTQVSLIQYGNDPQVMYKMDTYRTKEEMAKAITGIDQM 257 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T++ A+ + S + K ++ +TDGE+++ Sbjct: 258 GGTQTNTFKAIDFTRKFAFSKEYG-------------GRPNANKVMVVITDGESHDNALL 304 Query: 348 VNTIKICDKAKENFIK--IVTISI--NASPNGQRLLKTCVS-----SPEYHYNVVNADSL 398 I C+ K+N I+ I + + + + L+K S ++ +NV + +L Sbjct: 305 SKVIGDCE--KDNIIRFGIAVLGYYNRYNIDSKNLIKEIESIASVPKEKFFFNVSDEVAL 362 Query: 399 IHVFQNISQLM 409 + + + + Sbjct: 363 LEKAGTLGERI 373 >gi|241113141|ref|YP_002972976.1| hypothetical protein Rleg_4786 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861349|gb|ACS59015.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 194 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 43/185 (23%), Gaps = 7/185 (3%) Query: 7 FRFYFKK---GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 FRF + + A + FAL + ++L+ + + + + + Sbjct: 10 FRFARSRTRHLVRDRSAASGVEFALVLPILVMLLFGTVDLGHALTVSRKI----DEIASS 65 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + S + + A L + + S + Sbjct: 66 TGDMISQQGSWTKSDVAKLLSGASFILQPYETTGLTITVAVDDIAKSGSATVNWSAALNT 125 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 S+ S A + + Y N R Sbjct: 126 SALNSGAASAIEVPSEIQDDGVQVVLTRVQYTLTTPVSAFFSNFTGQNGYSFDRHYFNRP 185 Query: 184 EMGER 188 +G++ Sbjct: 186 RVGDK 190 >gi|57958|emb|CAA79153.1| collagen alpha 2 chain type VI [Mus musculus] Length = 371 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 33/101 (32%), Gaps = 21/101 (20%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 ++ V I T + A+K AY L + +R F Sbjct: 34 NSLSSFKEAVKNLE--WIAGGTWTPSALKFAYNQLIKESRRQKTRVFA------------ 79 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 + +TDG ++ ++N +C + + + I I Sbjct: 80 ---VVITDGRHDPRDDDLNLRALC----DRDVTVTAIGIGD 113 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 24/164 (14%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDM--DSLILKPTDSTP 296 ++ + M L+ Y ++ ++ ++ + + + R +S T Sbjct: 216 TLARRDDDPLNARMALLQYGSQNQQQVAFPLTYNVTTIHEALERATYLNSFSHVGTGIVH 275 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ + +R +F +FLTDG N + Sbjct: 276 AINNVVRGARGGARRHAELSF----------------VFLTDGVTGNDS----LEESVHS 315 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 ++ + +++ + L K + + + DSL Sbjct: 316 MRKQNVVPTVVAVGGDVDMDVLTKISLGDRAAIFREKDFDSLAQ 359 >gi|305680383|ref|ZP_07403191.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659914|gb|EFM49413.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 880 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 43/248 (17%), Positives = 77/248 (31%), Gaps = 53/248 (21%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ----DKKRTKMAALKNA-LLLFL 239 + + + V+D S S+ D N+ + + K A NA + + L Sbjct: 53 IDNKKKADLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKL 112 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + DVY G + P ++ + R+ D T AM Sbjct: 113 SGFGQGYRSQPDVYGGWVNVRDHAGDLTPP------IQGFDQRNNDVFTDYGTALNGAMA 166 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFKSNVNTIKIC- 354 P K I+F TDG + + V IC Sbjct: 167 --------------------DLASRPDPESCKAILFFTDGKLTVQGDQKADIVAQKAICS 206 Query: 355 -----DKAKENFIKIVTISI--NASPNGQRLLK------TCVSS---PEYHYNVV-NADS 397 K ++ I++ T+ + + + +++L+ C +N NA S Sbjct: 207 ADGQVKKLRDANIQLFTVGLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAAS 266 Query: 398 LIHVFQNI 405 L F++I Sbjct: 267 LFSAFRSI 274 >gi|319428191|gb|ADV56265.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 599 Score = 40.6 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 25/237 (10%), Positives = 65/237 (27%), Gaps = 36/237 (15%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + R + + L++ + + + + Sbjct: 391 RSSGRRLINNAGLRLMQNDMRVFKHESQVDMPNTAVTLLLDQSNSMCGKAYQTSVESTYA 450 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP-TDSTP 296 ++++ + V ++G+ E I E + +T+ S T Sbjct: 451 LVEALSKI----NLVKTSVLGFGNSTESVIALKGFEETPAKCLTKLASSSADGYCTPLAT 506 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + A L +K ++ +TDG+ + + + Sbjct: 507 GLWAALNQL------------------YTRTEDRKVVLVVTDGQPHG---FQYCKNLIAE 545 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH----VFQNISQLM 409 + + +++ I I N L + + D L VF+ I++ + Sbjct: 546 MQASNVEVYGIGIGNDLNLPTLQSLF---GKQF--AIKVDQLSDLGNEVFK-IAEGI 596 >gi|320094785|ref|ZP_08026530.1| hypothetical protein HMPREF9005_1142 [Actinomyces sp. oral taxon 178 str. F0338] gi|319978282|gb|EFW09880.1| hypothetical protein HMPREF9005_1142 [Actinomyces sp. oral taxon 178 str. F0338] Length = 340 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 38/170 (22%), Gaps = 24/170 (14%) Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD--------STPAMKQAYQILTSDKK 310 + R + + + + S ++ S Sbjct: 135 WDARSITKFPLTDDYTLADDVLKEMSEIMENGFQGKSASGVYVSNELLEYLEDTFDSQGG 194 Query: 311 RSFFTNFFRQ----GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S G + I+ TD E + D AK + + Sbjct: 195 SSSLAGDGLASCVLGFDHYDQERSRMILLATDNEILGDQIYT-LKDAIDFAKRQNVVVTA 253 Query: 367 ISINASPNGQRLLKT--------CVSSPEYHYNVVNADSLIHVFQNISQL 408 + +G L + + Y+ N S+ V I + Sbjct: 254 L---YPSDGSTFLSSEGRELETLVKETGGEFYDASNPASVQGVIAAIEEQ 300 >gi|319442698|ref|ZP_07991854.1| hypothetical protein CvarD4_13132 [Corynebacterium variabile DSM 44702] Length = 916 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 22/234 (9%), Positives = 66/234 (28%), Gaps = 28/234 (11%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + I L++D S S+ P D ++ ++ A ++ + + Sbjct: 65 IAATKTADILLILDQSASLKGFDGKPPTDPDNI---------RVDATRDLVKQLGTHAED 115 Query: 245 LSHVKEDVYMGL---IGYTTRVEKN-IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 L G +T + + + + + V T+ A+ Sbjct: 116 LGADINVKLAGFGEGYHNSTGDYGDWVSVGANSGDLDKAVDGFGRRNTDMYTNYEDALAG 175 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A + ++ + + + + N + + + Sbjct: 176 ASREFAGAPTTDGKSSDCQAVLFFSDGKVTHPEKSNEQAAADVCRPNSPLVS----LRNS 231 Query: 361 FIKIVTISI---NASPNGQRLLKTCVSSP-------EYHYNV-VNADSLIHVFQ 403 ++ T+ + + + + + LL P +++ + L+ F+ Sbjct: 232 GVRFFTVGLIPEDETDSPRELLTEMAEGPCGGGEANGAYFDAGTDPAGLLSAFR 285 >gi|225022540|ref|ZP_03711732.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] gi|224944663|gb|EEG25872.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] Length = 880 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 45/248 (18%), Positives = 78/248 (31%), Gaps = 53/248 (21%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ----DKKRTKMAALKNA-LLLFL 239 + + + V+D S S+ D N+ + + K A NA + + L Sbjct: 53 IDNKKKADLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKL 112 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + DVY G + P G ++ V TD A+ Sbjct: 113 SGFGQGYRSQPDVYGGWVNVRDHAGDLTPPIQGFDQRNNDV----------FTDYGTALN 162 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG----ENNNFKSNVNTIKIC- 354 A L P K I+F TDG + + V IC Sbjct: 163 GAMADL----------------ASRPDPESCKAILFFTDGKLTVQGDQKADIVAQKAICS 206 Query: 355 -----DKAKENFIKIVTISI--NASPNGQRLLK------TCVSS---PEYHYNVV-NADS 397 K ++ I++ T+ + + + +++L+ C +N NA S Sbjct: 207 ADGQVKKLRDANIQLFTVGLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAAS 266 Query: 398 LIHVFQNI 405 L F++I Sbjct: 267 LFSAFRSI 274 >gi|92116018|ref|YP_575747.1| TadE-like [Nitrobacter hamburgensis X14] gi|91798912|gb|ABE61287.1| TadE-like protein [Nitrobacter hamburgensis X14] Length = 189 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 5/65 (7%) Query: 1 MHLLSRFR----FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 M RFR + +++ + ++ FAL F L+ +I + S+E+ Sbjct: 1 MSPFPRFRTVAARVAGRFRGNQRGSAAVEFALVAPLFFALLFAIIETGMVFFASQSLETT 60 Query: 57 N-NAA 60 ++A Sbjct: 61 LQDSA 65 >gi|212635869|ref|YP_002312394.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] Length = 710 Score = 40.6 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 14/152 (9%), Positives = 47/152 (30%), Gaps = 24/152 (15%) Query: 251 DVYMGLIGYTTRVEK--NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 ++ + + V + + + + + T+ + A+ +A Sbjct: 383 QDSFNILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQ---- 438 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + + + ++F+TDG +I ++ +++ ++ TI Sbjct: 439 --------------QDANRERLRQVLFITDGA--VGNEPQLFTQIRNQLQQS--RLFTIG 480 Query: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I +PN + + + + + Sbjct: 481 IGDAPNAHFMQRAAELGRGTYTYIGKQSEVKS 512 >gi|330895596|gb|EGH27904.1| hypothetical protein PSYJA_02284 [Pseudomonas syringae pv. japonica str. M301072PT] Length = 677 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 35/107 (32%), Gaps = 1/107 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + +L+ A + Sbjct: 72 ASAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADST 118 >gi|269795767|ref|YP_003315222.1| hypothetical protein Sked_24760 [Sanguibacter keddieii DSM 10542] gi|269097952|gb|ACZ22388.1| conserved repeat protein [Sanguibacter keddieii DSM 10542] Length = 1516 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 21/215 (9%), Positives = 57/215 (26%), Gaps = 36/215 (16%) Query: 218 PICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 +AA+++A F+DS+ + L + Sbjct: 342 VDLSGSVGRYIAAMRSAASGFVDSLTGTPSS-----VALFTFADNAPAATG--------- 387 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 ++ + T +K T+ ++ ++ + LT Sbjct: 388 ----ANLGLTPVSTTAGADTVKNRIATYTAGFDTNWDRGLYQVAASAT---PFDVAVVLT 440 Query: 338 DGENNNFKSNVNTIKI------------CDKAKENFIKIVTISINASPNGQ-RLLKTCV- 383 DG + ++ + + K +++ + + G L+ Sbjct: 441 DGNPTVYAAHEGPGDLTRFREVENGIFSANAVKAEGTRVIAVGVGDGIGGAPDNLRAISG 500 Query: 384 -SSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 S +Y + + ++ + SV+ Sbjct: 501 PSGGSDYYQTDDYAEAGEALRALALGACNGSVSVV 535 >gi|66047599|ref|YP_237440.1| hypothetical protein Psyr_4372 [Pseudomonas syringae pv. syringae B728a] gi|63258306|gb|AAY39402.1| conserved hypothetical protein [Pseudomonas syringae pv. syringae B728a] Length = 678 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 35/107 (32%), Gaps = 1/107 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNLSR 74 + ++ ++ AL++ LL +I + +K S++ + A L A + + + Sbjct: 12 SRQRGAIGLMAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTA 71 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + +L+ A + Sbjct: 72 ASAPDFASQSATRNGFTPNTDGRTLVTRCGTLAVDVAGPRVFVADST 118 >gi|164687487|ref|ZP_02211515.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] gi|164603261|gb|EDQ96726.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] Length = 273 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 64/203 (31%), Gaps = 24/203 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKV 276 K+ L + L + L D+ + ++ +++ E S + Sbjct: 42 TSGSMSGKKIGTLNTTMEELLPELRGLGGATTDIKLAVMTFSSGCEWITKEPMSVDDYQY 101 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + TD A + L+ ++ + PSL + I L Sbjct: 102 WTRLK------AEGLTDLGEAFTELSNKLS-----------RKEFLNAPSLSYAPVIFLL 144 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG + + + +K+ + + + + LLK +PE +D Sbjct: 145 TDGYATDDALEGLKTLQHNNWYKYGLKV-ALGLGEKFDEE-LLKKFTGNPELVVTAKTSD 202 Query: 397 SLIHVFQNIS---QLMVHRKYSV 416 L + + I+ + R ++ Sbjct: 203 QLSKLVKTIAVTSSQIGSRSMTL 225 >gi|75676720|ref|YP_319141.1| TadE-like protein [Nitrobacter winogradskyi Nb-255] gi|74421590|gb|ABA05789.1| TadE-like protein [Nitrobacter winogradskyi Nb-255] Length = 146 Score = 40.6 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 41/138 (29%), Gaps = 8/138 (5%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANNAAILAGASK 67 + +A ++ FA+ + FL+L+ G +++ + A AA Sbjct: 10 SSLPRFARCARAASAVEFAMLLPLFLVLVAGIVVFGAYLTMVHGVQQLAAEAA-----RS 64 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 V+ LS + + A + + + SG + + + S I Sbjct: 65 SVAGLSETERTSLAENYVTTNAGSYPLLQPGHLTMSAATSG--GGVFVVTVNYDASDSII 122 Query: 128 SMTHMANNRLDSSNNTIF 145 + + Sbjct: 123 FTLPFVPAPPSTIVRSAA 140 >gi|327467286|gb|EGF12786.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK330] Length = 470 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 32/339 (9%), Positives = 88/339 (25%), Gaps = 20/339 (5%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH-MANNRLDSSNNTI 144 K R I ++ + F + +NS ++ + + Sbjct: 131 FKTDGAYQDNRLINYNLTGKYPDSNNKFSIDTAMSTLNSKQVFSKVGKGKKGIALAYRND 190 Query: 145 FYNMDVMTSYDYRLQFIEHL--LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 V + + + + N +V + E + ++ + ++S + Sbjct: 191 PIEGQVNVAISFVFDSSGSMERDMKGRNTNVVKERRISILREKAIEMVKELKEIGNVSVN 250 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + K+ L + + L + Sbjct: 251 LSGFSHYGFYVQKDFSQLDKGTE---QIEKSINSLPTRGVTNPGDGLRYGMVSLQQQHVQ 307 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 ++ + + G Y+ + YQ + F+ G Sbjct: 308 LKYVVLLTDGIPN--AYIVDPSANYPGNKVWIRGGEIVDYQKPIYNLSTKVDN--FQVGY 363 Query: 323 KIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN---FIK-IVTISIN----ASPN 374 + ++F D ++++ TI + I+ + I + Sbjct: 364 LDGAKYSEQFSRLAYDQDSSDSAFRAKTIGYAGAVSKKFGKDIRRVNLIGFSGVEQEVAY 423 Query: 375 GQRLLKTCVSS--PEYHYNVVNADSLIHVFQNISQLMVH 411 G+ L + + + +A+ L F +I + + Sbjct: 424 GEALTEAIREGVVDAQYVSATDAEELKKTFSDIKKQIQQ 462 >gi|330805799|ref|XP_003290865.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] gi|325078990|gb|EGC32613.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] Length = 942 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 50/194 (25%), Gaps = 25/194 (12%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP--SWGTE 274 + + ++K+ + + + + MGL Y R + E Sbjct: 627 CFDTTGSMASVIESVKSKVNQTVTRLMQTIPNIKIGIMGLGDYCDRENVITTLDLTENVE 686 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 K+ ++T+ T ++ ++ S K + Sbjct: 687 KLTTFITKI------PHTSGGD------------VPEAYEYALYKAKELSWSKHTSKAFV 728 Query: 335 FLTDGENNNFKSNVNTIKI---CDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 + D + I CD + IKI I A + + Sbjct: 729 MIGDSNPHEPSFTNLHINWFEECDNLFDMGIKIY--GIQAIKECCFYQEIAERTGGLCIK 786 Query: 392 VVNADSLIHVFQNI 405 D + +F I Sbjct: 787 FNKFDLITEMFLAI 800 >gi|53719502|ref|YP_108488.1| hypothetical protein BPSL1888 [Burkholderia pseudomallei K96243] gi|52209916|emb|CAH35888.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 568 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 35/128 (27%), Gaps = 7/128 (5%) Query: 29 SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLGDRFESISNHA 86 +++ L + + + + + ++ + A LAGA +M + SN Sbjct: 1 MLVAIAALG--AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARSNGF 58 Query: 87 KRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 A + + + Y V +++ S + Sbjct: 59 DPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLGPSRTVSATS 118 Query: 144 IFYNMDVM 151 +V Sbjct: 119 TAKATNVD 126 >gi|325267978|ref|ZP_08134626.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] gi|324980581|gb|EGC16245.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] Length = 1106 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 20/201 (9%), Positives = 50/201 (24%), Gaps = 11/201 (5%) Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 + + L + + ++ + D+ + Sbjct: 39 QSGTAAIKPNILLFLDTSGSMTTGGVRYNGGTASRIVVAKAVAR----------DVITAT 88 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 N N + R + L+ + D + + G + Sbjct: 89 RENNNWGLMTFNGSMRDATHSRDIANGTGILVYDALNPTASEQRIWSDQWYSIGGKLKQP 148 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 ++ + G Q ++ T A +A + + + + Q Sbjct: 149 VLDVSSTTGAANYNQLISTINGLPANTNTPIPSAYYEAIRYFRGMPEGTTANSPNTQYTS 208 Query: 324 IPSL-PFQKFIIFLTDGENNN 343 + +IIF++DGE Sbjct: 209 PIKYRCQKNYIIFVSDGEPTG 229 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 12/109 (11%), Positives = 31/109 (28%), Gaps = 8/109 (7%) Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK--ICDKAKEN 360 Q + + S+ + + + + + + G + KS + + + Sbjct: 254 QAINNGANGSWDSAYSTARIAYLAHNWD----MMDSGRDAEGKSFNDATGSPVGQDFSKQ 309 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 I T+ N Q L + ++ D L + + Sbjct: 310 TITTYTVGFQ--ANVQLLSEMARQGGGQYFIANEGDQLKAALTSALNAI 356 >gi|167562917|ref|ZP_02355833.1| TadE family protein [Burkholderia oklahomensis EO147] gi|167570108|ref|ZP_02362982.1| TadE family protein [Burkholderia oklahomensis C6786] Length = 155 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 42/146 (28%), Gaps = 4/146 (2%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVSN 71 + ++ +I FA+ F +++ ++ + S+ +A A A ++ Sbjct: 9 RPRRFQRGATAIEFAILFPVFFMVLYGIVTYGMIFAAQQSLTLAATEGARAALNYQVAQT 68 Query: 72 LSRLGDRFESISNHAKRALID---DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S + + A L A + S T + + Sbjct: 69 QSAALGLRAAAACTAASNLTGWLTGATCSTSQNFTCSYDSTMYCIQVTVTYPYAANPLVP 128 Query: 129 MTHMANNRLDSSNNTIFYNMDVMTSY 154 + N L ++ + T+ Sbjct: 129 TVALFNAVLPTTLTSQATVQINPTNI 154 >gi|294791468|ref|ZP_06756625.1| putative von Willebrand factor type A domain protein [Scardovia inopinata F0304] gi|294457939|gb|EFG26293.1| putative von Willebrand factor type A domain protein [Scardovia inopinata F0304] Length = 372 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 65/239 (27%), Gaps = 39/239 (16%) Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + +C D + +A + + +L+ ID L + +GL + + Sbjct: 104 NTSSGSRDIVLCLDVSGSALAYDRQIIAAYLEIIDQL----QGERIGLSIFDSTSRTVFP 159 Query: 269 PSWGTEKVRQYVT-----------------------------RDMDSLILKPTDSTPAMK 299 + + ++ + ++ + + Sbjct: 160 LTDDYDVIQSQLKYGFTILQKVSSGAYNKISKKEYAQIQDWLEGTRNITNSSSLIGDGLV 219 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 L + ++ + IIF TD + + T + K Sbjct: 220 SCALQLPQFSINANDSSDKKGSAARARNARSASIIFATDNVQSGKGTYSLTEAMSVVTK- 278 Query: 360 NFIKIVTISIN-----ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 + I + + I +S + ++ S ++ N L + Q I + K Sbjct: 279 SGISVDGLYIGPQSRSSSTSAIQMKNLITHSGGVFVDMNNTADLNSLVQAIEKTSSGLK 337 >gi|121535594|ref|ZP_01667401.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] gi|121305834|gb|EAX46769.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] Length = 586 Score = 40.6 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 32/134 (23%), Gaps = 27/134 (20%) Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 + + + + + T +K + L + ++ Sbjct: 453 NQARVQVPLTRDFAQAESSLAHIESF---GSTPLALGLKVGIEYLKESRAKNP------- 502 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK----ICDKAKENFIKIVTISINASPNGQ 376 +I +TDG + + K + I + P+ Sbjct: 503 -----------LVILITDGVPTVGDITGDPLADALTAAASIKSHGYGFTCIGL--KPHRD 549 Query: 377 RLLKTCVSSPEYHY 390 L + ++ Y Sbjct: 550 YLTQVAQAAGGNIY 563 >gi|300908945|ref|ZP_07126408.1| hypothetical extracellular protein [Lactobacillus reuteri SD2112] gi|300894352|gb|EFK87710.1| hypothetical extracellular protein [Lactobacillus reuteri SD2112] Length = 1020 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 52/189 (27%), Gaps = 10/189 (5%) Query: 40 LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + N A A A + + ++ N+ + + A + + Sbjct: 187 AVD--QAQQTVNQASQAVEQAKQAVVDATNRAAEQKTEEAQNTINNLENQGKNQASQDLN 244 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY--R 157 N I + T+ QN VN++ + + N D N D Sbjct: 245 NQI------NNVSGQVTDAQNDVNNTNAKIGDLQNQVKDLQNQETNTQADPNVKTVPNGV 298 Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + + LL + + + + G+M A + DV+S+ Sbjct: 299 IDYSKSLLENVARNNLWAPADMNANTNGNVVEQNIYVSPDNDKGNMSGANRNHDADVSSS 358 Query: 218 PICQDKKRT 226 Sbjct: 359 AYDPYYTTD 367 >gi|289618520|emb|CBI55244.1| unnamed protein product [Sordaria macrospora] Length = 1141 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 36/113 (31%), Gaps = 12/113 (10%) Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + G K + I+ ++D ++ S + + AK I I + + Sbjct: 685 GANVAMDLLMGRKYNNPIAT--IMLISDASTSDADSVDFVVSRAEAAK---ITIHSFGLG 739 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIH-------VFQNISQLMVHRKYSV 416 + +++ + + V + L Q++S V K + Sbjct: 740 TTHKPDTMIELSTRTKASYTYVKDWMMLRECLAGCLGAMQSLSHQNVKLKLKL 792 >gi|110679845|ref|YP_682852.1| hypothetical protein RD1_2616 [Roseobacter denitrificans OCh 114] gi|109455961|gb|ABG32166.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 207 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 12/137 (8%), Positives = 34/137 (24%), Gaps = 15/137 (10%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLL--IGFLI-YVLDWHYKKNSMESANNAAI 61 + ++ + N +I + + + F I + + Sbjct: 6 TAITARLRRFRRDQHGNVAIEAVIILPIMIWAYLAMFTIFDTYRQYTAQQK--------- 56 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A+ +S+L + + ++ E Q I Sbjct: 57 ---AAYTISDLISRQATPLDAGFLDGTHDLFETLTRAVGQTGIRITVARFDQTLAEYQVI 113 Query: 122 VNSSRISMTHMANNRLD 138 + +R M + + + Sbjct: 114 WSRTRGGMVALGSTDIA 130 >gi|53723508|ref|YP_102950.1| hypothetical protein BMA1283 [Burkholderia mallei ATCC 23344] gi|52426931|gb|AAU47524.1| conserved hypothetical protein [Burkholderia mallei ATCC 23344] Length = 568 Score = 40.6 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 35/128 (27%), Gaps = 7/128 (5%) Query: 29 SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS--NLSRLGDRFESISNHA 86 +++ L + + + + + ++ + A LAGA +M + SN Sbjct: 1 MLVAIAALG--AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARSNGF 58 Query: 87 KRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 A + + + Y V +++ S + Sbjct: 59 DPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSVPYFFLGPSRTVSATS 118 Query: 144 IFYNMDVM 151 +V Sbjct: 119 TAKATNVD 126 >gi|114764814|ref|ZP_01443996.1| hypothetical protein 1100011001322_R2601_10479 [Pelagibaca bermudensis HTCC2601] gi|114542700|gb|EAU45723.1| hypothetical protein R2601_10479 [Roseovarius sp. HTCC2601] Length = 249 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 7/60 (11%), Positives = 16/60 (26%), Gaps = 4/60 (6%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFL---IYVLDWHYKKNSMESAN 57 M++ + + +K + +I + +L G V A Sbjct: 1 MNMTT-IKTLLRKFRRDNEGYVTIEVMFMLPVLFVLFGAAWVYFDVFRQQSVNQKANYAI 59 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 72/226 (31%), Gaps = 22/226 (9%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED-- 251 C + ++ + + + L + + + + Sbjct: 18 VYNPQSYTDRPCGDDLGNLWLDVIAVVDNSQGMTTDGLSSVAANIATVFSSGTRIGTNAT 77 Query: 252 ----VYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 +GL+ Y + + N + + Y + A+ A + Sbjct: 78 EPRTTRVGLVTYNSVAKVNADLNTFQSINDVY---------NGVFNYLSAVTDATDSYLA 128 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 ++ F Q ++K +I + + + +K+ D+ K + + IVT+ Sbjct: 129 TGLQAANALFASQSFNSTRNHYKKVVIVYASEYKSYGELD--PVKVADEMKGSGVYIVTV 186 Query: 368 SINASPNGQRLLKTCV--SSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + NGQ LLK ++P Y + N D +V I ++ Sbjct: 187 AYDQGGNGQ-LLKDLAGIATPGYSF--SNTDDSDNVIGEIQGALLQ 229 >gi|310822678|ref|YP_003955036.1| pilus biogenesis protein [Stigmatella aurantiaca DW4/3-1] gi|309395750|gb|ADO73209.1| Pilus biogenesis protein, TadE family [Stigmatella aurantiaca DW4/3-1] Length = 393 Score = 40.6 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 39/122 (31%), Gaps = 10/122 (8%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNL 72 ++++ + ++ FAL V + ++ F +Y+ + K ++ LA A +M S Sbjct: 3 ASAQRGSATVEFALIVPVLVAILMFSMYLTELVRAKLKLQE------LARYAAWEMTSYT 56 Query: 73 SRLGDRFESISNHAKRA--LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + +D+A K+ F S + Sbjct: 57 LSDFAKAKHDEAFTLAQREAMDEALERYKDLDSVEPDAVPGNFIARYSDVKGTLSNEEIG 116 Query: 131 HM 132 Sbjct: 117 FF 118 >gi|302754792|ref|XP_002960820.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] gi|300171759|gb|EFJ38359.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] Length = 563 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 49/192 (25%), Gaps = 24/192 (12%) Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG--LIGYTTRVEKNI 267 D S P +T+ N + L++ V G + + V Sbjct: 207 DTSASMSNPTGVLSSQTRFNVANNIIKRLLNTFTNGDQVAVSTIGGEKIGAPVSVVLGVQ 266 Query: 268 EPSWGTEKVRQYVTRD-MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 E S + S+ ++ ++ A + Sbjct: 267 ETSLDLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSN---------------- 310 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 II TDG+ + T + + + + I + + S Sbjct: 311 ---LNVIILFTDGQFVTPGNFNFTQLSPVLAQLNARGVVVFVYRIGSFTTNDATFQHMQS 367 Query: 385 SPEYHYNVVNAD 396 S Y V+N D Sbjct: 368 SLNMSYEVINDD 379 >gi|229822437|ref|YP_002883963.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229568350|gb|ACQ82201.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 343 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 47/195 (24%), Gaps = 20/195 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-------KVRQYVTRDMD 285 +D+ L E + L + + + R + D+D Sbjct: 110 EYDTQIIDTYLQLLDSFEGERIALSIWNSTSRTVFPLTDDYTLVREELETARAALDFDVD 169 Query: 286 SLILKPTDSTP-----AMKQAYQILTSDKKRSFFTNFFRQGV--KIPSLPFQKFIIFLTD 338 SL D A + + G+ + II TD Sbjct: 170 SLDDWSYDQEALDRLLAFIAGTEGGDGESSSLVGDGLASCGLLFDESETDRSRSIILATD 229 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK-----TCVSSPEYHYNVV 393 E D E I I I A + + ++ + Sbjct: 230 NEVFGEPVYTLPE-AGDFVTERGITIFGIYAGAVTSTSAAQEKEYQDVVAANGGLFFTSD 288 Query: 394 NADSLIHVFQNISQL 408 + ++ + +IS Sbjct: 289 DPAAVDGIIDSISNQ 303 >gi|149030577|gb|EDL85614.1| integrin, alpha 10 (predicted) [Rattus norvegicus] Length = 746 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 55/176 (31%), Gaps = 27/176 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 E + +GL+ Y K S T + A+ A Sbjct: 198 DPEQIQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGF 257 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S+ + + ++ +TDGE+++ + +K C+ + + Sbjct: 258 SESRG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYG 301 Query: 367 ISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ + R ++ S P+ + +NV + +L + + + Sbjct: 302 IAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|164565428|ref|NP_001101169.2| integrin, alpha 10 [Rattus norvegicus] Length = 1167 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 55/176 (31%), Gaps = 27/176 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 E + +GL+ Y K S T + A+ A Sbjct: 198 DPEQIQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGF 257 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S+ + + ++ +TDGE+++ + +K C+ + + Sbjct: 258 SESRG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYG 301 Query: 367 ISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ + R ++ S P+ + +NV + +L + + + Sbjct: 302 IAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|302381516|ref|YP_003817339.1| hypothetical protein Bresu_0401 [Brevundimonas subvibrioides ATCC 15264] gi|302192144|gb|ADK99715.1| Protein of unknown function DUF2134, membrane [Brevundimonas subvibrioides ATCC 15264] Length = 543 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 7/133 (5%), Positives = 28/133 (21%), Gaps = 1/133 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + +++ A + L I V +++ + A ++ A+ + Sbjct: 7 FRQDTRGGIAVMAAAAGSLICALAAISIDVASVALSGRTLQGVADLAAMSAAANLDVADQ 66 Query: 74 RLGDRFESI-SNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + + S + + + + Sbjct: 67 AAQATVSANLRAATVKTELGAYVADPDIAPSGRFSVQADSANAARVTVTATAPLFFGKIV 126 Query: 133 ANNRLDSSNNTIF 145 + Sbjct: 127 IGKDEVVQVRSAV 139 >gi|146304149|ref|YP_001191465.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702399|gb|ABP95541.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 363 Score = 40.6 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 16/130 (12%), Positives = 39/130 (30%), Gaps = 29/130 (22%) Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + T A+++A+ + K + +I LTDG Sbjct: 77 KELESLKVGSGTAMYKALQEAFNL----------------ARKYGEPTY---VILLTDGV 117 Query: 341 NNN-----FKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 ++ S + C + ++I++ I + + L + + Y Sbjct: 118 PSDMGCMPGLSRKFDLNRCLPVYQGLSVPENVQIISFGIGDDYSEEILTEVSEKGRGFFY 177 Query: 391 NVVNADSLIH 400 +V + + Sbjct: 178 HVTDPAQIPE 187 >gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas haloplanktis TAC125] gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 664 Score = 40.2 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 68/245 (27%), Gaps = 47/245 (19%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 + +P +R VVD SGSMH Sbjct: 298 NGERYGLAMLMPPADNFIATQRLARETVFVVDTSGSMH--------------------GQ 337 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 M KNAL L +D + ++ + + +R+ Sbjct: 338 SMEQAKNALFYALSLLDSNDSFNIIGFDNVVTLMS----DKPLVASGFNLRRAERFIYGL 393 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T+ A+ +L + F + +IFLTDG Sbjct: 394 QADGGTEIQGALD---AVLDGSQFDGFV----------------RQVIFLTDGS--VSNE 432 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 + I K ++ ++ T+ I ++PN + + + + + Q + Sbjct: 433 DALFKSIQAKLGDS--RLFTVGIGSAPNSFFMRRAADVGKGSFTFIGSTSEVQPKMQQLF 490 Query: 407 QLMVH 411 + H Sbjct: 491 DKLAH 495 >gi|149188838|ref|ZP_01867128.1| hypothetical protein VSAK1_05795 [Vibrio shilonii AK1] gi|148837258|gb|EDL54205.1| hypothetical protein VSAK1_05795 [Vibrio shilonii AK1] Length = 174 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 16/120 (13%), Positives = 40/120 (33%), Gaps = 10/120 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA------SKMV 69 +++ F I L V+ + + F + + + + A A A ++ Sbjct: 6 KNQRGTFIIELTLVVIVLIGIFLFTTDISHKLLVRAQL----DRASFALANILKERARFY 61 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 ++ L D + ++D A+ I I+ + + Y + + S+ Sbjct: 62 NDRLVLSDGDRDDMHALASRMLDTAEADIAIKIEALHNTATVDSYTSVQYGAMGCETTSI 121 >gi|91794067|ref|YP_563718.1| type IV pilin biogenesis protein, putative [Shewanella denitrificans OS217] gi|91716069|gb|ABE55995.1| type IV pilin biogenesis protein, putative [Shewanella denitrificans OS217] Length = 1190 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 19/274 (6%), Positives = 52/274 (18%), Gaps = 59/274 (21%) Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + + + + +D + ++++ K + Sbjct: 197 SASSTQAQIDAAIIKAKQTGFGTGKALTLYTDNYLRWYHGDQANVPKSRLDIAKEVIKNT 256 Query: 239 LDSIDLLSHVKEDVYMGLIG--YTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDST 295 + + + + TE + + T Sbjct: 257 IVTTPSVDFGLTVFNINAFSEGDADGGRIVSGIKTMTEANKLALLNTVAGLDPETNTPLC 316 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK---------------------FII 334 + + Q + G + +++ Sbjct: 317 ETLYEVQQYFGGKTLTFGNKDKKPSGYSYTPNTPPRDTSIEQSGKYISPFKECQNLAYVV 376 Query: 335 FLTDGE-----------------------NNNFKSNVNTIKICDKAKENFI--------- 362 ++TDG N N + + N + Sbjct: 377 YITDGVPTLDQNANNLVKALPNADQTAYKNTNPNFSSYLPNLAQWMNNNDVNANLVGEQK 436 Query: 363 -KIVTISINASPNGQR--LLKTCVSSPEYHYNVV 393 K TI + L+KT ++ Sbjct: 437 VKTFTIGFSDGAADAAPLLIKTAELGGGKYFAAK 470 >gi|220672885|emb|CAX13868.1| novel protein similar to vertebrate vitrin (VIT) [Danio rerio] Length = 761 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 22/205 (10%), Positives = 55/205 (26%), Gaps = 37/205 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 R+ + D + M M+ + + K+ Sbjct: 350 GFTSRVMDTTARAPEVPSQGDPNCKVDMAFLMDGSWSIG----------KRRFKIQKD-- 397 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 ++ L+ MG+I Y ++ ++ +++ + P+ Sbjct: 398 -FLVEVSQALNVGVAGAMMGIIQYGDDPVTEFSLKQFSNSKDLKPAISKIV--QKGGPSH 454 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + SD + + L DG + + Sbjct: 455 VGKALSYINKQFFSDANGNRGGA-------------PNVAVVLVDGWPTD-----RVEEA 496 Query: 354 CDKAKENFIKIVTISINASPNGQRL 378 A+E+ I I ++I + ++ Sbjct: 497 SRLARESGINIFFVTIEGPDDNEKQ 521 >gi|316973947|gb|EFV57489.1| putative midasin [Trichinella spiralis] Length = 4042 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 46/160 (28%), Gaps = 14/160 (8%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 D + + ++ +K L L ++ + E ++G+ + T + Sbjct: 3829 KRDYQVVLAIDNSQSMELNNVKTMALQSLAMLNEAFYTLEIGHVGVCKFGTEPILLQPLT 3888 Query: 271 --WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 + + + + + TD ++ A + T S + S Sbjct: 3889 KTFDVHYGAKIIEQLNCNEQR--TDIFKLLQCAASLFTPQPASS---------DRTDSGM 3937 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI-KIVTI 367 K +I L+DG N I I I Sbjct: 3938 PTKLMIILSDGRNVYSVGEQALKDAVRSVFSAGIFVIYII 3977 >gi|269104788|ref|ZP_06157484.1| TPR domain protein in aerotolerance operon [Photobacterium damselae subsp. damselae CIP 102761] gi|268161428|gb|EEZ39925.1| TPR domain protein in aerotolerance operon [Photobacterium damselae subsp. damselae CIP 102761] Length = 691 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 13/126 (10%), Positives = 37/126 (29%), Gaps = 24/126 (19%) Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQIL 305 ++ GLI Y + + + + + + +++ ++QA +L Sbjct: 120 PGWKEGSTGLITYAGDGYTISPLTDDSSTLANLIPSLSPKIMPIPGSNAASGIRQAIDLL 179 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 + II +TDG + + + K ++ Sbjct: 180 KQAGDTNGD------------------IILITDGMT-----EKESQESLAELKNTHYRVS 216 Query: 366 TISINA 371 +++ Sbjct: 217 ILAVGT 222 >gi|329663456|ref|NP_001192519.1| integrin alpha-10 [Bos taurus] gi|297472754|ref|XP_002686129.1| PREDICTED: integrin, alpha 10-like [Bos taurus] gi|296489504|gb|DAA31617.1| integrin, alpha 10-like [Bos taurus] Length = 1167 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + ++ C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPTALQACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++ S P+ + +NV + +L + + + Sbjct: 320 REIRAIASDPDEKFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|109732283|gb|AAI15771.1| Integrin, alpha 10 [Mus musculus] Length = 1166 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 54/176 (30%), Gaps = 27/176 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 E + +GL+ Y K S T + A+ A Sbjct: 198 DPEQIQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGF 257 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + + ++ +TDGE+++ + +K C+ + + Sbjct: 258 SQSRG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYG 301 Query: 367 ISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ + R ++ S P+ + +NV + +L + + + Sbjct: 302 IAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|124486692|ref|NP_001074522.1| integrin alpha-10 [Mus musculus] Length = 1167 Score = 40.2 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 54/176 (30%), Gaps = 27/176 (15%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILT 306 E + +GL+ Y K S T + A+ A Sbjct: 198 DPEQIQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGF 257 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 S + + ++ +TDGE+++ + +K C+ + + Sbjct: 258 SQSRG-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYG 301 Query: 367 ISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ + R ++ S P+ + +NV + +L + + + Sbjct: 302 IAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|311744035|ref|ZP_07717841.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311313165|gb|EFQ83076.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 139 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 5/47 (10%), Positives = 19/47 (40%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM 53 R ++ E+ + +I L V ++ + ++ + ++ + Sbjct: 1 MRAPVRRRRGDERGSMAIEVVLLVPILVMFLLLVVAAGRYVTVRSDI 47 >gi|167045536|gb|ABZ10188.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_APKG10H12] Length = 356 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 49/152 (32%), Gaps = 28/152 (18%) Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 KMAA + AL FL + D + L + E + + + Sbjct: 110 GQKMAAAQRALDRFL-----FDLLGPDDEIFLYRFDYTPELLQDWTVDR---IRLSRAIR 161 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D T A+ ++ F+K ++ ++DG +NN Sbjct: 162 DIRPRGNTALYDAVAESV------------------PRVAGGQHFKKALLIISDGNDNNS 203 Query: 345 KSNVNTIKICDKAKENFIKIVTISINASPNGQ 376 +++ ++ + +E+ I I I+ Sbjct: 204 ETD--VRELRELIRESEALIYAIGIDGPSTPA 233 >gi|254454548|ref|ZP_05067985.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198268954|gb|EDY93224.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 237 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 26/113 (23%), Gaps = 14/113 (12%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 S+ +T I + + D T +K + Sbjct: 8 GSNDSPINPDVTWCFTGDQNAIIPHTTSESDLH---NAINDLHAYGNTAIDMGVKWGVAL 64 Query: 305 LTSDKKRSF-----------FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 L K + + + K ++ +TDGEN Sbjct: 65 LDPSTKSLIASLAGASGSGVPSVAAGRPEQFTQSDVLKVLVLMTDGENTQQWD 117 >gi|115749084|ref|XP_001197592.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115959785|ref|XP_001193076.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 966 Score = 40.2 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 75/258 (29%), Gaps = 68/258 (26%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 L + + I LV+D SGSM Sbjct: 291 NSPSLPEDTDTSPNFIVVQPSGSLR--------IVLVLDTSGSMD--------------- 327 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQ 278 + + F+ SI V + Y+ ++ + + + T + R+ Sbjct: 328 -----GERFDKMIRGAKNFIQSI-----VPNNSYVAIVEFNYESIVDSYMTELTSVISRK 377 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + +L T + A Q+ + S ++I L+D Sbjct: 378 DLASLLPTLADGATCIGCGIVTAIQVAQYNDMDSRGV----------------YLILLSD 421 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKT-------CVSSPEYHY 390 GE N+ +T+ D + + + + +I+ A + L + C Sbjct: 422 GEENHGTPIADTM---DDIEGSGVIVHSIAFYEADTQLEDLAQMTGGISATCADGG---- 474 Query: 391 NVVNADSLIHVFQNISQL 408 +A +I F +I Sbjct: 475 ---SAQCVISAFVSIIAQ 489 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 9/63 (14%), Positives = 19/63 (30%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 D+S N I L + +R+++ I + I + ++E Sbjct: 297 EDTDTSPNFIVVQPSGSLRIVLVLDTSGSMDGERFDKMIRGAKNFIQSIVPNNSYVAIVE 356 Query: 195 LVV 197 Sbjct: 357 FNY 359 >gi|256959601|ref|ZP_05563772.1| predicted protein [Enterococcus faecalis Merz96] gi|256950097|gb|EEU66729.1| predicted protein [Enterococcus faecalis Merz96] Length = 370 Score = 40.2 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 90 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 149 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 150 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 204 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 205 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 264 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 265 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 303 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 304 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 344 >gi|74195997|dbj|BAE30555.1| unnamed protein product [Mus musculus] Length = 436 Score = 40.2 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 49/159 (30%), Gaps = 24/159 (15%) Query: 270 SWGTEKVRQYVTRDM--DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 S + V + + + D + T++ A++ Y +++ Sbjct: 1 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMS--------WAGDAPPEGWNRT 52 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKA----------KENFIKIVTISINASPNGQR 377 II +TDG +N + V I+ +E+++ + + + Sbjct: 53 RH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVN 110 Query: 378 LLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHRKY 414 + + + V + + L +VF + Sbjct: 111 INALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 149 >gi|157375505|ref|YP_001474105.1| hypothetical protein Ssed_2368 [Shewanella sediminis HAW-EB3] gi|157317879|gb|ABV36977.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 152 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 16/130 (12%), Positives = 40/130 (30%), Gaps = 3/130 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ +++ FA+ FL+L+ +I V Y N + A+ A V++ Sbjct: 1 MRFQRGVYAVEFAIVAGVFLMLLFAIIEVGRLMYTYNVLHEASRRAARIAVVCQVTDTDI 60 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + N +++ + + I N + Sbjct: 61 KN---MGLFNGIDLIPNLTPANLTISYLDNEGDEATGMAIVLVRAEIANYQHEFLVPGLY 117 Query: 135 NRLDSSNNTI 144 ++S + Sbjct: 118 RVINSPTFST 127 >gi|149758640|ref|XP_001499491.1| PREDICTED: similar to integrin, alpha 10 isoform 1 [Equus caballus] Length = 1167 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPTALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++ S P+ + +NV + +L + + + Sbjct: 320 REIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|149758642|ref|XP_001499504.1| PREDICTED: similar to integrin, alpha 10 isoform 2 [Equus caballus] Length = 1177 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPTALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++ S P+ + +NV + +L + + + Sbjct: 320 REIRAIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|73970088|ref|XP_531792.2| PREDICTED: hypothetical protein XP_531792 [Canis familiaris] Length = 1465 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 50/169 (29%), Gaps = 34/169 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +K+ +K+ ++ F+ V + + + +R + D Sbjct: 722 SKLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAVAWQEKLV---EINEDNLRGAQSWIRD 778 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 I T++ A++ A+ + I LTDG Sbjct: 779 IQIGSSTNTLHALQIAFA-----------------------DKETQVIYLLTDGRP---- 811 Query: 346 SNVNTIKICDKAKE-NFIKIVTISINASPN-GQRLLKTCVS-SPEYHYN 391 + + ++ K I I TIS N + LK S + + Sbjct: 812 -DQPPEMVIEQVKVFQKIPIYTISFNYNDEIANGFLKELASLTGGEFHF 859 >gi|116622830|ref|YP_824986.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225992|gb|ABJ84701.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 329 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 20/215 (9%), Positives = 58/215 (26%), Gaps = 36/215 (16%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 M +V + ++A+ F+ +D + + ++ Sbjct: 101 GMERPDAFVGTNVFVLFDTSNFMYKGFVYAEDAIADFVRGLDRADS------VAVYTFSR 154 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + + + S+ T + + Sbjct: 155 NLSRAASLTHDHGV---AIAGLRKSVAGDDT-------------------ALYNALLLAL 192 Query: 322 VKIPSLPFQKFIIFLTDGENNNF-KSNVNTIKICDKAKENFIKIVTISIN----ASPNGQ 376 +P +K +I ++G +N + + + + + I I IS + + Sbjct: 193 RDAAKVPGRKVVIVFSNGPDNASMVAPDDVRAVAE---DEGIPIYVISTSEVNKDPISSG 249 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + Y + F+NI + + + Sbjct: 250 VFKRIATRTGGKAYWAKTWQKQVEAFENIREDLGN 284 >gi|315634657|ref|ZP_07889941.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] gi|315476605|gb|EFU67353.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] Length = 498 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 33/274 (12%), Positives = 70/274 (25%), Gaps = 63/274 (22%) Query: 143 TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGS 202 T +L V Sbjct: 271 TKTITPAQRPQMVIVFDDSLSMLASLNESSAVINA------------------------- 305 Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 N++ + ++ ++ K A +D ID ++ +GL+ Sbjct: 306 FWDRWNNNEIISEEEKVHMFREPRRLTTAKKAATDIIDKIDP------NIDIGLVSLRRC 359 Query: 263 VEKNIEPSWGTEKVRQYVTRDMDSLILK-----PTDSTPAMKQAYQILTSDKKRSFFTNF 317 +G K + + + T ++QA ++ K+ +F Sbjct: 360 PGARNHGMFGPSKRKALKAQIRGLFPKEGEENIGTALHSGLRQAAAMVDGVKRDAF---- 415 Query: 318 FRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASPNG 375 I+ L+DGE + K N N I + + +KI + I + Sbjct: 416 ---------------ILVLSDGEESCDKGNTNVCAIAQELAKQKPKLKINVVDIGGAKAA 460 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 L + + + + I+Q + Sbjct: 461 NCL---ATETKGKVFTANSKAQVT---NMINQAI 488 >gi|92113590|ref|YP_573518.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] gi|91796680|gb|ABE58819.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] Length = 596 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 12/93 (12%), Positives = 33/93 (35%), Gaps = 12/93 (12%) Query: 330 QKFIIFLTDGENN------------NFKSNVNTIKICDKAKENFIKIVTISINASPNGQR 377 ++ +I LTDG + + + + + TI+++ + + Sbjct: 122 KRHVILLTDGMVDLPGSGEVKRKRDAASRETLIASLAPELATQDVVVHTIALSRNVDRDL 181 Query: 378 LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 L + S+ + L+ F ++ + +V Sbjct: 182 LERVSQSTDGLAAVAETPEELLRAFLDVLERIV 214 >gi|68160860|gb|AAY86876.1| unknown extracellular protein lr1422 [Lactobacillus reuteri] Length = 1011 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 52/189 (27%), Gaps = 10/189 (5%) Query: 40 LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + N A A A + + ++ N+ + + A + + Sbjct: 178 AVD--QAQQTVNQASQAVEQAKQAVVDATNRAAEQKTEEAQNTINNLENQGKNQASQDLN 235 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY--R 157 N I + T+ QN VN++ + + N D N D Sbjct: 236 NQI------NNVSGQVTDAQNDVNNTNAKIGDLQNQVKDLQNQETNTQADPNVKTVPNGV 289 Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + + LL + + + + G+M A + DV+S+ Sbjct: 290 IDYSKSLLENVARNNLWAPADMNANTNGNVVEQNIYVSPDNDKGNMSGANRNHDADVSSS 349 Query: 218 PICQDKKRT 226 Sbjct: 350 AYDPYYTTD 358 >gi|313227043|emb|CBY22190.1| unnamed protein product [Oikopleura dioica] Length = 1109 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 48/177 (27%), Gaps = 18/177 (10%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + ++ + L+ + + Y + + K Sbjct: 552 QVMKSWMRRLISGLNIAPGRTQVSVYLYNNIFRTIFNLNEHQNAYDMITAINKMVYSGKG 611 Query: 292 TDSTPAMKQAY-QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 T A++ A ++L + ++ LTDG+ ++ N Sbjct: 612 TRIARALQSAMSKVLIPQSGLRPNSEI--------------YLYLLTDGKESDVADVNNM 657 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTC-VSSPEYHYNVVNADSLIHVFQNIS 406 A + I + + I+ S L + + + N L + I+ Sbjct: 658 ANDIKDAFGDRITLTAVGISRSVENAELYAIASAPKKDNVFLLENYRDLDTI-TKIA 713 >gi|255598999|ref|XP_002537129.1| conserved hypothetical protein [Ricinus communis] gi|223517386|gb|EEF25254.1| conserved hypothetical protein [Ricinus communis] Length = 397 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 71/237 (29%), Gaps = 35/237 (14%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 V+D S SM ++A I K AA + + F+D + D Sbjct: 159 VFVIDRSVSMDHPFAGQATGGHAAEI-------KSAAARRLITAFID-------SRPDDM 204 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS-DKKRS 312 MG++G+T ++ + E + + L T+ + A + + S Sbjct: 205 MGVVGFTNSALYGMKITTNREGIHAAIQAAT-GPALNQTNIGAGITNAVTLFENIASTGS 263 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 G S + I G+ N V + I I T Sbjct: 264 RAIVLLSDGAGKLSPRVKYQIRKQLVGKKLNLYWIVLREP-------DDISIFT---GNV 313 Query: 373 PNGQRLLKTCVSS--------PEYHYNVVNADSLIHVFQNI-SQLMVHRKYSVILKG 420 + L + + NA +L + I ++ +Y VI+ G Sbjct: 314 YDDDHLPEAIALDRFFQSLNVKYKAFEADNATALESALREIDAKEKNVIQYGVIIPG 370 >gi|115767164|ref|XP_001193368.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] gi|115976266|ref|XP_001179968.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] Length = 797 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 48/124 (38%), Gaps = 23/124 (18%) Query: 254 MGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 +G++ + + + + + T RQ + + + T + ++L S + Sbjct: 300 LGIVVFDSTSQISGNLTDITETASRQRLVNALPPSPVGGTCIGCGILSGIEVLGSYAQG- 358 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 +II L+DGE + ++T D+ K + + I TI+I+ S Sbjct: 359 ------------------GYIILLSDGEETDAPFIMDTY---DEIKNSGVIIDTITISDS 397 Query: 373 PNGQ 376 + Q Sbjct: 398 ADQQ 401 >gi|71984286|ref|NP_498819.2| C-type LECtin family member (clec-160) [Caenorhabditis elegans] gi|47117847|sp|P34393|CL160_CAEEL RecName: Full=C-type lectin domain-containing protein 160; Flags: Precursor gi|28894818|gb|AAK84522.2|L11247_8 C-type lectin protein 160, confirmed by transcript evidence [Caenorhabditis elegans] Length = 639 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 52/158 (32%), Gaps = 25/158 (15%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS--- 286 +K + + + L ++++ V +GLI Y+ + E +PS T + D Sbjct: 307 MVKAEINTLVGQMSLDPNIQKHVQVGLIKYSDKAEVVFKPSDYTNEDEFTEDLWSDPRLE 366 Query: 287 ---LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + ++QA ++ +K I+ N+ Sbjct: 367 DVDEKSDEVNLHLGLQQAAKM-----------------TASMRNGVRKVIVVYAASYNDE 409 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + +I +E I+T++ + ++K Sbjct: 410 GNDD--ARQIAANIRETGYAIITVAFVEPESSNLVMKI 445 >gi|303318459|ref|XP_003069229.1| hypothetical protein CPC735_024200 [Coccidioides posadasii C735 delta SOWgp] gi|240108915|gb|EER27084.1| hypothetical protein CPC735_024200 [Coccidioides posadasii C735 delta SOWgp] Length = 849 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 58/228 (25%), Gaps = 8/228 (3%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + ++ ES+++ A ++ D+ IK ++ SG +I +I + Sbjct: 543 QQADTPTQSKSPQESVADTADPSVKHDSTSTIKYEASKNTSGLG-----IQIVDIADELL 597 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV--SFIPALLRIE 184 S + T+ L + + S P+ + Sbjct: 598 SSNATTPQTVDSCPSTTVAGTPCTPDWDVSGLPRSSQMDVSPKKTVEIEQSSNPSSPPVA 657 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E+ + + S + A D T + + Sbjct: 658 HHEKHLSTSLDSSNTISSTPQKFQPGKTITDPACPSPDPSTTSFDPPRLGTPSSITDRQT 717 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + V G + VE + + + R K T Sbjct: 718 FNSVYSGEP-GSECFHNSVEDVPSLTSSASTMTGTLPRLSAGFYAKGT 764 >gi|167562908|ref|ZP_02355824.1| hypothetical protein BoklE_10135 [Burkholderia oklahomensis EO147] Length = 602 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 56/179 (31%), Gaps = 8/179 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS- 70 ++ IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LA A KM Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADMAALAAAQKMDDQ 75 Query: 71 -NLSRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSR 126 + SN A + + + Y V +++ Sbjct: 76 CTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTASTPLNAVQVTATQ 135 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 S + + L N + + + A L + + Sbjct: 136 SVPYFFLGPSRTVSATSTAK--ATNIDQFTVGTTLASLQGGLVNNVLNALLGANLSLSV 192 >gi|148692793|gb|EDL24740.1| calcium channel, voltage-dependent, alpha2/delta subunit 3 [Mus musculus] Length = 1091 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|6753236|ref|NP_033915.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Mus musculus] gi|81872883|sp|Q9Z1L5|CA2D3_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|4186073|emb|CAA09423.1| calcium channel alpha-2-delta-C subunit [Mus musculus] gi|147898121|gb|AAI40360.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] gi|151555329|gb|AAI48701.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] Length = 1091 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|94498566|ref|ZP_01305121.1| hypothetical protein SKA58_08334 [Sphingomonas sp. SKA58] gi|94422009|gb|EAT07055.1| hypothetical protein SKA58_08334 [Sphingomonas sp. SKA58] Length = 199 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 10/124 (8%), Positives = 33/124 (26%), Gaps = 6/124 (4%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 ++ R + + + F +L I + + Y A Sbjct: 1 MIGRIASKLMRLRRDARGVTLLEFGFVAPPLMLTIMAIGDLGYQAYWHAVARGVLEKAAR 60 Query: 63 AGA------SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 A + ++ + +++ + + + F + E ++ +A Sbjct: 61 AASVGTLNGAQFDAYITQQMSAITARNASPPHIVKKSYYNFSRVGKPEKITSDTAPLGVY 120 Query: 117 EIQN 120 + Sbjct: 121 NSGD 124 >gi|320016751|gb|ADW00323.1| Flp pilus assembly membrane protein TadE [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 156 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 35/106 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + + + ++ F L + F+ ++ + Y +++ A + A + Sbjct: 1 MRDTILRLLPANRGSIAVEFTLIFILFIFMLLLVTETSRLFYTSANLDFALSEAAKTAKN 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + N+ FE N L + + S A Sbjct: 61 RDAENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVAD 106 >gi|281200486|gb|EFA74705.1| hypothetical protein PPL_11534 [Polysphondylium pallidum PN500] Length = 936 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 12/143 (8%), Positives = 33/143 (23%), Gaps = 10/143 (6%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYM-----GLIGYTTRVEKNIEPSWGTEK--VRQYVT 281 K A +D ++ + V + + + + + Sbjct: 73 DDTKGAADFLVDRWLKITPEQVYVNWKGGVVDGALFQRGIYNTEPLTTFLTESINITALK 132 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + ++ T+ M + + P +K DG Sbjct: 133 QSDRGFMIGATNLDTGM---FDQFYKTDSDIVKAVAASASIPGLFPPTEKNGYDYADGGV 189 Query: 342 NNFKSNVNTIKICDKAKENFIKI 364 + ++C + + I Sbjct: 190 TYLTPITDAARLCYETGAINVTI 212 >gi|240145313|ref|ZP_04743914.1| conserved hypothetical protein [Roseburia intestinalis L1-82] gi|257202610|gb|EEV00895.1| conserved hypothetical protein [Roseburia intestinalis L1-82] Length = 182 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 6/69 (8%), Positives = 30/69 (43%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + FK+ + ++ + + + + ++ L +I + + ++ A +A+++ Sbjct: 1 MRKCYQSFKRLLTGQEGSMAPVAVAATIAMLFIILGVSEYMRLVITAAGIKDAMESAVVS 60 Query: 64 GASKMVSNL 72 + + + Sbjct: 61 VVNDNYNEV 69 >gi|170746810|ref|YP_001753070.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170653332|gb|ACB22387.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 207 Score = 40.2 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 55/193 (28%), Gaps = 7/193 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-A 65 R + + ++ FAL + F L+G + +++ ++ A + A Sbjct: 14 IRELASRYAQARDGATAVEFALVALPFFALVGACLENGIVFWEQEILQQAVSDASRQIYT 73 Query: 66 SKMVSNLSRLGDRFESISNHAKRA-----LIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + + D +S F +++ S++ + + Sbjct: 74 GAFQTTNAGTTDTATLMSRFRTAICTQPNGTPRVTIFTCANVRVSVTKVADYDSANPVSP 133 Query: 121 IVNSSRISMTHMA-NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + ++ + + N+ + + + N++ V Sbjct: 134 VATNASGASDWNPNFAGYACAGNSAIVVVQAAVDIPVFFPLLGAGVPNLPNKRRVLQAAT 193 Query: 180 LLRIEMGERPIFL 192 + ++E P Sbjct: 194 VFKVEPYTAPSGC 206 >gi|323702227|ref|ZP_08113893.1| hypothetical protein DesniDRAFT_1105 [Desulfotomaculum nigrificans DSM 574] gi|323532717|gb|EGB22590.1| hypothetical protein DesniDRAFT_1105 [Desulfotomaculum nigrificans DSM 574] Length = 207 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 39/150 (26%), Gaps = 16/150 (10%) Query: 13 KGIASEKA--NFSIIFALSVMSFLLLIGF-LIYVL---DWHYKKNSMESANNAAILAGAS 66 + I ++ + ++ ++ +++ IY K M A A A AS Sbjct: 37 RLIKDQRGFISILVLMCGMLIFMVIVPYITGIYYGVKHKGMQTKAWMNEALEFA--AAAS 94 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 M +L + + AL D + SLS + NT+ Sbjct: 95 NMNGDLDISQLDPDMAKQYFSIALADI--------LDGSLSNETITPLNTDYYPGPIKIT 146 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 L + Sbjct: 147 DFEIVSPGETLPNGITARQPGFLATIEVPL 176 >gi|221127354|ref|XP_002168164.1| PREDICTED: similar to microneme 1, partial [Hydra magnipapillata] Length = 285 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 57/164 (34%), Gaps = 22/164 (13%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPS----WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 V +I Y+ I S W E + V+ T + A+K A + Sbjct: 2 GPTGVRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIE--YSHGKTRTDLALKVARK 59 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + ++ K +I LTDG++ KS + EN + Sbjct: 60 HVFCNECSL-------------QHNIPKLLIVLTDGQSTFPKSTQFEAHLIKV--ENDLT 104 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 I+++ ++ + + LK+ + ++ + + + L I + Sbjct: 105 IISVGVSDQVDIEE-LKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 40/137 (29%), Gaps = 18/137 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MD 285 + + D ++ + EDV +GLI + TR + S Sbjct: 27 TSNFRKVRQFIEDFVNSMPISPEDVRVGLITFATRSKVRWNLSDPKATNPSLAISAARSL 86 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S T + ++ A ++L + K ++ +TDG +N Sbjct: 87 SYSTGVTYTHYGLQDAKKLLYDTNAGARNN-------------VPKLVLVMTDGASN--- 130 Query: 346 SNVNTIKICDKAKENFI 362 T ++ Sbjct: 131 LPSQTRSSAAALRDAGA 147 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 40/137 (29%), Gaps = 18/137 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD--MD 285 + + D ++ + EDV +GLI + TR + S Sbjct: 63 TSNFRKVRQFIEDFVNSMPISPEDVRVGLITFATRSKVRWNLSDPKATNPSLAISAARSL 122 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 S T + ++ A ++L + K ++ +TDG +N Sbjct: 123 SYSTGVTYTHYGLQDAKKLLYDTNAGARNN-------------VPKLVLVMTDGASN--- 166 Query: 346 SNVNTIKICDKAKENFI 362 T ++ Sbjct: 167 LPSQTRSSAAALRDAGA 183 >gi|32476015|ref|NP_869009.1| inter-alpha-trypsin inhibitor domain-containing protein [Rhodopirellula baltica SH 1] gi|32446559|emb|CAD76394.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 764 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 52/177 (29%), Gaps = 30/177 (16%) Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 F D + + ++ + T + S ++ T+ PA Sbjct: 366 FADHVLDHLNPNDEFRVIAFSNRTTAFQPNAVSATDANIQSAKQFVRGLRASGGTNLLPA 425 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD---GENNNFKSNVNTIKIC 354 +K A G + +++I +TD G +++ + + Sbjct: 426 LKLAL------------------GGEADESARPRYMILMTDALVGNDHSILRYLRQPEF- 466 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD---SLIHVFQNISQL 408 ++ I+ A+PN + + + V N D + F ++ Sbjct: 467 -----QDARVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFHELTSQ 518 >gi|326331495|ref|ZP_08197785.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325950751|gb|EGD42801.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 140 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 12/135 (8%), Positives = 34/135 (25%), Gaps = 5/135 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + E+ + S+ + + L+G + + + ++ A A + + Sbjct: 5 SDERGSVSVWLVTATFAMTTLVGLAVDLGG--QVHAQ-QRTHDIAAQAARAGGQEVQAGP 61 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 A A + S+S + N + + Sbjct: 62 AVEGRYARIDAAAARQAAKAYLSAAGVGGSVSIRDGNRIVVRANDTYN--TKFLGIIGLT 119 Query: 136 RLDSSNNTIFYNMDV 150 +L + + Sbjct: 120 KLKVHGEATAHLVRS 134 >gi|297161950|gb|ADI11662.1| hypothetical protein SBI_08544 [Streptomyces bingchenggensis BCW-1] Length = 149 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 13/138 (9%), Positives = 34/138 (24%), Gaps = 8/138 (5%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASKMVSNLSRL 75 ++ + +++ L G +I K A + A A ++ RL Sbjct: 18 DDRGQVTSFVMGVMLALWLFAGIVIDGGLALAGKAR---ALDVAQEAARTGAQQLDIGRL 74 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + +K A A + + + + + + Sbjct: 75 RSTDDVRLLRSKAARAATAYVRATGDTGSA----TVQGDEVTVHVTHHQPAQILQLVGVR 130 Query: 136 RLDSSNNTIFYNMDVMTS 153 L + + + Sbjct: 131 TLTVTAHATVRAERANAN 148 >gi|332809817|ref|XP_003308326.1| PREDICTED: integrin alpha-10 [Pan troglodytes] Length = 1024 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 67/232 (28%), Gaps = 32/232 (13%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V L + + + ++ L + + E + Sbjct: 3 LPFVTHLFLPLVFLTGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLF---IDPEQI 59 Query: 253 YMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y K V + + T + A+ A S Sbjct: 60 QVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIMVACTEGFSQSH 118 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 + ++ +TDGE+++ + +K C+ + + I++ Sbjct: 119 G-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVL 162 Query: 370 -------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 + R ++T S P+ + +NV + +L + + + Sbjct: 163 GHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 214 >gi|187954851|gb|AAI40832.1| ITGA10 protein [Homo sapiens] gi|219519694|gb|AAI44638.1| ITGA10 protein [Homo sapiens] Length = 1024 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 67/232 (28%), Gaps = 32/232 (13%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + V L + + + ++ L + + E + Sbjct: 3 LPFVTHLFLPLVFLTGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLF---IDPEQI 59 Query: 253 YMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y K V + + T + A+ A S Sbjct: 60 QVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIMVACTEGFSQSH 118 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI- 369 + ++ +TDGE+++ + +K C+ + + I++ Sbjct: 119 G-------------GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVL 162 Query: 370 -------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 + R ++T S P+ + +NV + +L + + + Sbjct: 163 GHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 214 >gi|167570097|ref|ZP_02362971.1| hypothetical protein BoklC_09651 [Burkholderia oklahomensis C6786] Length = 602 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 56/179 (31%), Gaps = 8/179 (4%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS- 70 ++ IA E+ +F+++ A+ ++ + +G + + + + + ++ + A LA A KM Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADMAALAAAQKMDDQ 75 Query: 71 -NLSRLGDRFESISNHAKRALIDDAKRFIK---NHIKESLSGYSAVFYNTEIQNIVNSSR 126 + SN A + + + Y V +++ Sbjct: 76 CTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTASTPLNAVQVTATQ 135 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 S + + L N + + + A L + + Sbjct: 136 SVPYFFLGPSRTVSATSTAK--ATNIDQFTVGTTLASLQGGLVNNVLNALLGANLSLSV 192 >gi|324504675|gb|ADY42017.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 898 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 44/156 (28%), Gaps = 19/156 (12%) Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + +D L ++ V + I Y T + Sbjct: 21 VKWTVSLVDALPVDRDGVRIAAIQYAGFPLTEFALGTYLNADDIRQHLSQIKFQSGVTRT 80 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A+++A L ++ + K I+ TDG S + +K Sbjct: 81 GYALRKADSELFRQERGARSDAI-------------KIIVLFTDG-----LSIDDPLKPA 122 Query: 355 DKAKE-NFIKIVTISINASPNGQRLLKTCVSSPEYH 389 + ++ +KI +S+ + + + Sbjct: 123 HELRDIKRVKIYVVSVGSDGFEPEMNRIAGDKRNVF 158 >gi|269926138|ref|YP_003322761.1| TadE family protein [Thermobaculum terrenum ATCC BAA-798] gi|269789798|gb|ACZ41939.1| TadE family protein [Thermobaculum terrenum ATCC BAA-798] Length = 135 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 36/126 (28%), Gaps = 8/126 (6%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + FAL++ LLLI + + N++ +A A + Sbjct: 14 GQAIVEFALTIPIMLLLIMLTVDFGRAIWYYNAIANA--------AREGARYGIVKSHSD 65 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 I N + I S + + + N + + + +++ Sbjct: 66 AEIINTVLQKSTGVPLSNSNVTITRSGTSPNGSIKVSISYNFRPITPLVSNLIPGGQINL 125 Query: 140 SNNTIF 145 ++ Sbjct: 126 RASSEM 131 >gi|90425190|ref|YP_533560.1| TadE-like [Rhodopseudomonas palustris BisB18] gi|90107204|gb|ABD89241.1| TadE-like [Rhodopseudomonas palustris BisB18] Length = 214 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 12/148 (8%), Positives = 31/148 (20%), Gaps = 16/148 (10%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-------- 54 L F + ++ F + V L++ I + + Sbjct: 14 LFYGFERQLGRLRRDTSGLAAVEFVMIVPLMLVMFFGTIELSSGFAAHRKVSIVAQTISD 73 Query: 55 --------SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 +A + + + + + ID A Sbjct: 74 LTSRGKSAAATDVSNFLSIADAIMTPYPAVHSADQFQTTITEVYIDPATGIGHAQWSRGD 133 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + +A ++V + Sbjct: 134 AVRNAGSVVAVPADLVAKDSSNNVIAGQ 161 >gi|73542614|ref|YP_297134.1| filamentous haemagglutinin adhesin HecA 20-repeat-containing protein [Ralstonia eutropha JMP134] gi|72120027|gb|AAZ62290.1| Filamentous haemagglutinin, N-terminal:Adhesin HecA 20-residue repeat x2 [Ralstonia eutropha JMP134] Length = 2536 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 18/269 (6%), Positives = 57/269 (21%), Gaps = 6/269 (2%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L A+ + ++ D + S + + + + + Sbjct: 761 ALGSAATQDNVVAGTLDAAATNSVALIGNTQATNVTLAAPTLVNAGTTVATQTLSASGGT 820 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + N ++ ++ + D + V + L L+ + + Sbjct: 821 LGNQGTLAGNQVSVDMADVVSRGTLGGQTVNLTASNTLDNTGGLIVGVQTLNVTTGALIS 880 Query: 181 LRIEM------GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + +L + ++G N+ + + + + Sbjct: 881 NARGTLFAGDLTGQSPTTGDLTLTVTGGNSSFNNAAGQVLAGNNATVNLPNQAIDLSAAT 940 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDS 294 + + + +G + + G +L T Sbjct: 941 TGTISANNTVTLAAQSIDNIGTWTTPGAANTVLNGAQGISNTGTLQGAGNLTLTTGGTLD 1000 Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 LT + Sbjct: 1001 NQGNIVGGSSLTMSGSTVGNSGTIHADGD 1029 >gi|317124425|ref|YP_004098537.1| hypothetical protein Intca_1292 [Intrasporangium calvum DSM 43043] gi|315588513|gb|ADU47810.1| hypothetical protein Intca_1292 [Intrasporangium calvum DSM 43043] Length = 159 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 6/52 (11%), Positives = 18/52 (34%), Gaps = 2/52 (3%) Query: 4 LSRFRFYFKK--GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM 53 +SR + ++ E +++ + L+ I + V + + Sbjct: 1 MSRLFSSSSRTRMVSGEDGQIAVLILGLFVIVLVFILGAVDVTAAQLARMRL 52 >gi|117921567|ref|YP_870759.1| type IV pilin biogenesis protein [Shewanella sp. ANA-3] gi|117613899|gb|ABK49353.1| type IV pilin biogenesis protein, putative [Shewanella sp. ANA-3] Length = 1168 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 45/349 (12%), Positives = 104/349 (29%), Gaps = 38/349 (10%) Query: 92 DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR-LDSSNNTIFYNMDV 150 +D + N+ G+ A T + ++ ++ D+ + V Sbjct: 158 EDITKVDPNNSSSYNDGFPANGLTTGSGKKKKAYPYFISSYPSSTWGDALAAAKNTDFGV 217 Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD--LSGSMHCAMN 208 + +L ++ + R+++ ++ I I G N Sbjct: 218 GQPVTFYSDN--YLRWYWLSKAGKLPTMKVARLDIAKKAISNIISSTPTVDFGLAVFNYN 275 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM-------------- 254 E A +++LLL +D + ++ M Sbjct: 276 YPNEGNRDGGRIVSGITQMTDAARDSLLLTIDKLPAKTNTPLCETMYEAYRYFAGQGVKY 335 Query: 255 --GLIGYTTRVEKNIEPSWGTEKVRQYVT----RDMDSLILKPTDSTPAMKQAYQ----I 304 G Y + V EK Y + + ++ TD +P + ++ Sbjct: 336 GHGDTDYGSYVGNRPPYDTSVEKGGAYSSPFKVCTDIAYVIYVTDGSPTVDKSANNDVIS 395 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 LTS + + F + + PS ++ + + N + NT ++ ++ Sbjct: 396 LTSTGSKDGDYSSFSKNLDTPSY-LPALASYMFNNDLINKLDSSNTEQM------QNVRT 448 Query: 365 VTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 TI A L +T ++ N+ L + + +++ Sbjct: 449 YTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNILN 497 >gi|120556589|ref|YP_960940.1| vault protein inter-alpha-trypsin subunit [Marinobacter aquaeolei VT8] gi|120326438|gb|ABM20753.1| Vault protein inter-alpha-trypsin domain protein [Marinobacter aquaeolei VT8] Length = 712 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 47/174 (27%), Gaps = 22/174 (12%) Query: 232 KNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + A L +D L + + + + D Sbjct: 372 RQARSALLRGLDTLRPGDRFNVIQFNSQAHALYTQ-PVPANGHYLARARDYVQDLTADGG 430 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T+ A+ A + S + ++F+TDG + Sbjct: 431 TEMAGALSLAMGM-----------------DGSESSGHVQQMVFMTDGA--VGNESALFD 471 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNI 405 +I ++ T++I ++PN L + + V +A + + Sbjct: 472 QIRTGLGNR--RLFTVAIGSAPNMHFLREAARWGRGQYTAVHSAAEVDKALGKL 523 >gi|159044811|ref|YP_001533605.1| hypothetical protein Dshi_2268 [Dinoroseobacter shibae DFL 12] gi|157912571|gb|ABV94004.1| hypothetical protein Dshi_2268 [Dinoroseobacter shibae DFL 12] Length = 174 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 8/52 (15%), Positives = 16/52 (30%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 R + K E+ ++ F + + + K ME A + Sbjct: 2 IRGFRKFLCRDERGTATVEFVIVFPLIIAVFMSTFEAAMLTAKYTMMERALD 53 >gi|325067680|ref|ZP_08126353.1| hypothetical protein AoriK_07659 [Actinomyces oris K20] Length = 928 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 60/217 (27%), Gaps = 41/217 (18%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 L L F D + +K V Y + S +V + +T D L Sbjct: 79 QYLAQRLAAFAD--SSKATLKVRVAGFASDYHALGDWVTINSGSLAEVGKQITAAGDDLK 136 Query: 289 LKPTDSTPAMKQAYQILTSDKKRS----------------FFTNFFRQGVKIPSLPFQKF 332 TD A++ A Q L + + +Q Sbjct: 137 DYDTDYWTALEGARQDLAGHDRSECRAVAWFSDGAYDLDVRDSEQSQQDFGTTKPYAPGS 196 Query: 333 IIFLTDGENN----NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS--- 385 + G + + D+ + + I ++ I +++ + +K Sbjct: 197 ALTDESGVKTAESAGKQDICRPTGLADQLRSSHITLLGIGLSSGDSNFDFMKRITLGGGA 256 Query: 386 ----------------PEYHYNVVNADSLIHVFQNIS 406 P Y V + DSL+ F +IS Sbjct: 257 NAAKNSVESCGNVSDPPGGFYPVTDIDSLLMAFDSIS 293 >gi|332164672|ref|NP_001193680.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] gi|296474881|gb|DAA16996.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] Length = 1091 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K +M K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRMTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|291223817|ref|XP_002731904.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 978 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 30/250 (12%), Positives = 79/250 (31%), Gaps = 51/250 (20%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F + V+ +P + +R + LV+D+SGSM Sbjct: 297 DFTGKKNLPLPPEDNVNTVPYFRLV---QRRERRVVLVLDVSGSM-----EGKPLTQLLQ 348 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVR 277 +C + R + + Y+G++ + E ++ ++R Sbjct: 349 VCTNYIRNTID--------------------DGSYLGVVKFEGIAETLVDLMLIDGPQIR 388 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + + ++ + T + + ++L++ +II ++ Sbjct: 389 ETIIANLPTYAGGRTSIGGGVLKGIEVLSNFGNED---------------TTGGYIILVS 433 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPE-YHYNV--VN 394 DGE + + + A + I T++ + + ++L + + + Sbjct: 434 DGEETSEPIIDDIWGDIELA---GVVIDTVAFSDKAD-KKLESLATKTGGLSFFFSGNDD 489 Query: 395 ADSLIHVFQN 404 +L F + Sbjct: 490 TTALYDAFTS 499 >gi|152994355|ref|YP_001339190.1| hypothetical protein Mmwyl1_0314 [Marinomonas sp. MWYL1] gi|150835279|gb|ABR69255.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 186 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 18/45 (40%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 + E AN + FAL V + +I + + + + ME Sbjct: 4 RFFRDENANAVLEFALFVPVMIGIILVSADLYNINRMRGVMEQTA 48 >gi|46581744|ref|YP_012552.1| hypothetical protein DVU3343 [Desulfovibrio vulgaris str. Hildenborough] gi|120601102|ref|YP_965502.1| TadE family protein [Desulfovibrio vulgaris DP4] gi|46451167|gb|AAS97812.1| hypothetical protein DVU_3343 [Desulfovibrio vulgaris str. Hildenborough] gi|120561331|gb|ABM27075.1| TadE family protein [Desulfovibrio vulgaris DP4] gi|311235372|gb|ADP88226.1| TadE family protein [Desulfovibrio vulgaris RCH1] Length = 139 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 10/120 (8%), Positives = 34/120 (28%), Gaps = 3/120 (2%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 +++ L ++ LLL+ + + ++S+ A + S+ + Sbjct: 23 AALLIVLFLVPMLLLV---MDAGRFITARHSLTQAVRQGAIVLIHAQASSQADTQASAAV 79 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + N ++ + + T + + + + S+ Sbjct: 80 KDALEVSGYRPGEVTVVINRPDQTRATITVTLDTTRFAVFGVAFTVLPVIIRESATVSTG 139 >gi|325108016|ref|YP_004269084.1| TadE family protein [Planctomyces brasiliensis DSM 5305] gi|324968284|gb|ADY59062.1| TadE family protein [Planctomyces brasiliensis DSM 5305] Length = 166 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 27/99 (27%), Gaps = 4/99 (4%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAILAGA---SKMVSNL 72 S + + FA FLLLI I + + + SA LAG + N Sbjct: 16 SRRGAAMVEFAFVAPVFLLLIIGTIEMGNALEASTQLSSALREGGRLAGMDWEGLISENE 75 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 + I N A + E Sbjct: 76 TPNQKVIRDIKNFLTAAGYPGDSVTVTITSAEGTDVGET 114 >gi|86138567|ref|ZP_01057140.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] gi|85824627|gb|EAQ44829.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] Length = 472 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 11/102 (10%), Positives = 31/102 (30%), Gaps = 7/102 (6%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 D+ A+ A ++ K + + + +I ++DGE Sbjct: 86 DTRDAIAAAVAKISPKGKTPISAAVRQAADALRHSEEKATVIPISDGEETCGLDPC---A 142 Query: 353 ICDKAKENFI--KIVTISINASPNGQRLLKTC--VSSPEYHY 390 + + + + + I + + R C ++ + Sbjct: 143 VGADLEASGVDFTLHAIGFGIADDTARAQLQCLAENTGGVYR 184 >gi|323138935|ref|ZP_08073996.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322395781|gb|EFX98321.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 204 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 37/110 (33%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 I + ++ F L + FLLLI ++ ++ ++ +++ A + V N S Sbjct: 23 LIHDSRGFAAVEFGLIALPFLLLIVAILEYSYGNFAQSRLDAVVQQASRQIMTGYVQNQS 82 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G ++ K + + + Y + + + Sbjct: 83 VGGKALDANQFRTKIMCPKLPAIMNCADLYVDVQAFDTPDYGSFVNATKS 132 >gi|242807550|ref|XP_002484979.1| von Willebrand and RING finger domain protein [Talaromyces stipitatus ATCC 10500] gi|218715604|gb|EED15026.1| von Willebrand and RING finger domain protein [Talaromyces stipitatus ATCC 10500] Length = 1044 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 23/231 (9%), Positives = 66/231 (28%), Gaps = 23/231 (9%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + N++ S + + + + G + D V P+ Sbjct: 475 YRTTKTNKRQSSATSSYGANKSNNTALTEYTNSMRDPGPASFHVPLDLVVV--IPVSSSM 532 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 + K+ L++AL + ++ MGL+ + + T+ + Sbjct: 533 QGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGVPLVGMTTKAWNGWGKIL 586 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + + + + + N I+ ++D ++ Sbjct: 587 NSLRPVGQKSLRADVVEGANVAMDLLMQRKSNNPIS------------TILLISDSSTSD 634 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 +S + + AK + I + + + L++ + + V + Sbjct: 635 PESVDFVVSRAEAAK---VGIHSFGLGLTHKPDTLIELSTRTKACYTYVKD 682 >gi|149064977|gb|EDM15053.1| rCG28280 [Rattus norvegicus] Length = 106 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 38/116 (32%), Gaps = 23/116 (19%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 + T + A++ A + K +K + +TDG+ + + Sbjct: 5 GEGTYTATALQAANDMF-----------------KEARPGVKKVALVITDGQ-TDSRDKR 46 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQ-----RLLKTCVSSPEYHYNVVNADSLI 399 + A ++ ++I I + + + + + E+ Y + +L Sbjct: 47 KLADVVKDANDSKVEIFVIGVVKKDDPNFEIFHKEMNLIATDSEHVYQFDDFFTLQ 102 >gi|311070712|ref|YP_003975635.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] gi|310871229|gb|ADP34704.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] Length = 245 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 32/84 (38%), Gaps = 7/84 (8%) Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN----ASPNGQRLLKTCVS 384 I+ +TDG +N+ + + + AKE I + I I + ++ Sbjct: 5 HLNQILLITDGCSNHGED---PLAMAAFAKEQGITVNVIGIMEENTVDHEAMKEVEGIAL 61 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 + + VV A L Q +++ Sbjct: 62 AGGGVHQVVYASQLSQTVQMVTKK 85 >gi|170693230|ref|ZP_02884390.1| type VI secretion protein IcmF [Burkholderia graminis C4D1M] gi|170141760|gb|EDT09928.1| type VI secretion protein IcmF [Burkholderia graminis C4D1M] Length = 1189 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 36/109 (33%), Gaps = 23/109 (21%) Query: 6 RFRFYFKKGIAS----EKA---------------NFSIIFALSVMSFLLLIGFLIYVLDW 46 R ++ ++ + E + L+V++ L+ GF Sbjct: 410 RHTYFVERLLKDVLFRESGLAGTNPRFERQKLFMQAAAYAGLAVVTLALVAGFATSYARN 469 Query: 47 HYKKNSMESAN----NAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 ++ A +AA LAGA + S +R R E IS A Sbjct: 470 RAFVAEVQGALKDLPDAATLAGAPDLKSYFARALARLEVISATQDAAGQ 518 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 81/291 (27%), Gaps = 60/291 (20%) Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + + I + D+ L++ + I + P Sbjct: 284 VRIDTTNDATREITLDGAAAADRDFVLEWSAVASDAPQVGLFREHIGKDDYVLAYVTPPA 343 Query: 192 L---------IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 L + V+D SGSM T + K +L L + Sbjct: 344 LASPKKVQREVIFVIDNSGSM--------------------GGTSIEQAKASLDYALSQL 383 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 +I + + K E + T+ PA+ Sbjct: 384 QPGDRFN------VIRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQGGTEMLPALHA 437 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 A L + + + I+FLTDGE +N + + A+ Sbjct: 438 A---LDDSNQGNGL----------------RQIVFLTDGEISNEQ--QLLDAVA--ARRG 474 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 +I + I ++PN + + ++ +A + + + + + Sbjct: 475 RSRIFMVGIGSAPNSYLMNRAAELGRGTFTHIGSAAEVDERMRALFDKLEN 525 >gi|115374131|ref|ZP_01461419.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] gi|115368907|gb|EAU67854.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] Length = 1551 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 59/233 (25%), Gaps = 12/233 (5%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + +++ G + A D L ALL Sbjct: 379 ISTFTQTSSANTLKMQPGCQGILANADAFDSSRTGFIATINSLTFNTATPLARALLNTGY 438 Query: 241 SIDLLSHVKEDVYMGLIGYTT---RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 V +DV+ T + + + + R D S ++ TD P Sbjct: 439 YFTSDQTVYKDVFGFGATNPTVGYAYPTDFKNEPLSSENRSVCWGDQASAVIILTDGEPN 498 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE----NNNFKSNVNTI-K 352 + + + L D N+ + + I Sbjct: 499 TDTLGSAVVQKIRSRNGGPVSCPPSAPCADTPNDANAMLDDVAKLLFTNDLQYSTPPIVG 558 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + + +++ T+ N LLK + Y +A L Q+ Sbjct: 559 ALNTSGQQSLRVYTVGFAIDSN---LLKNAAAVGGGRSYTAHDAAGLRQALQD 608 >gi|110801927|ref|YP_697812.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens SM101] gi|110682428|gb|ABG85798.1| von Willebrand factor type A domain protein [Clostridium perfringens SM101] Length = 622 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 24/296 (8%), Positives = 74/296 (25%), Gaps = 31/296 (10%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--EMGERPI 190 N + + + + + ++ S + ++ + I Sbjct: 68 GNYEITLTVKGKPKKVTKPVDILLIMDASNSM-YYNMDELKASMNSLVDKVIDNIPNSRI 126 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM----AALKNALLLFLDSIDLLS 246 ++ ++ + + D I Sbjct: 127 AVVAFGTEVEEVFSFNNKNKFTSKEEYKNAIKDSYYYITRKGNTNIEGTWRVADEIFKNE 186 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + + + NI + + Y + L K K+A++ Sbjct: 187 LNNNSNSKKDVIFFSDGYPNISVDYLYSIGYLSVYNYYNDYYLNEKNYYLNQIYKKAFEE 246 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + + + + + + + NN +N+ ++ + D Sbjct: 247 GKNYRNEHRDYEYEKIMKENGNTIGSWALYEYKKFYNNYPDTNIFSVALIDN-------- 298 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYNV----------VNADSLIHVFQNISQLMV 410 I + + LL +S ++ + N+ SL ++ I+ ++ Sbjct: 299 --IPYEDKNSAKELLSKMQNSG--YFTIDSRFGQDNNENNSKSLKEIYDKIANNII 350 >gi|149188855|ref|ZP_01867145.1| hypothetical protein VSAK1_05880 [Vibrio shilonii AK1] gi|148837275|gb|EDL54222.1| hypothetical protein VSAK1_05880 [Vibrio shilonii AK1] Length = 186 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 11/123 (8%), Positives = 27/123 (21%), Gaps = 16/123 (13%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 +K +F+I A +M+ + F + + + + + A A Sbjct: 12 HRQKGSFAIELAFVMMALCAIYLFSTDLSHQLLVRAKL----DRSSFALA---------- 57 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS--RISMTHMA 133 E + + S + + T + Sbjct: 58 NVIKERTRYFDADVAAGKNLAVTSSDLVNLTQVASRMLNTAPGNVALKIESLTNKTTVVG 117 Query: 134 NNR 136 Sbjct: 118 FTS 120 >gi|301756599|ref|XP_002914168.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Ailuropoda melanoleuca] Length = 1127 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 43/345 (12%), Positives = 101/345 (29%), Gaps = 31/345 (8%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + V NL + + + + D + N E+ + N ++ Sbjct: 122 EAVKNLVEAAEEADLNHEFNESLVFDYYNSVLINERDENGNYVELGAEFILESNTHFNNL 181 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + T ++N +L ++ ++ L + QR + + Sbjct: 182 MVNTSISNVQLPTNVYNKDPDILNGVYMSEALNPVFVENFQRDPTLTWQYFGSSTGFF-- 239 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI----------CQDKKRTKMAALKNALL 236 R I+ D +G + + + +A K +M K+ + Sbjct: 240 -RIYPGIKWTPDENGVIAFDCRNRGWYIQAATSPKDIVIVVDTSGSMKGLRMTIAKHTIS 298 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 LD++ V Y I Y K I + + + ++ Sbjct: 299 TILDTLGENDFVNIIAYSDYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQ 358 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +A+QIL ++ + I+ +TDG +++ Sbjct: 359 ALTEAFQILKQFQE------------ARQGSLCNQAIMLITDGAVEDYEPVFEKYNWP-- 404 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + +++ T I +K + + +Y ++ L Sbjct: 405 --DRKVRVFTYLIGREVTFADRMKWIACNNKGYYTQIST--LADA 445 >gi|281349604|gb|EFB25188.1| hypothetical protein PANDA_001994 [Ailuropoda melanoleuca] Length = 805 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 43/345 (12%), Positives = 101/345 (29%), Gaps = 31/345 (8%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + V NL + + + + D + N E+ + N ++ Sbjct: 122 EAVKNLVEAAEEADLNHEFNESLVFDYYNSVLINERDENGNYVELGAEFILESNTHFNNL 181 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + T ++N +L ++ ++ L + QR + + Sbjct: 182 MVNTSISNVQLPTNVYNKDPDILNGVYMSEALNPVFVENFQRDPTLTWQYFGSSTGFF-- 239 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI----------CQDKKRTKMAALKNALL 236 R I+ D +G + + + +A K +M K+ + Sbjct: 240 -RIYPGIKWTPDENGVIAFDCRNRGWYIQAATSPKDIVIVVDTSGSMKGLRMTIAKHTIS 298 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 LD++ V Y I Y K I + + + ++ Sbjct: 299 TILDTLGENDFVNIIAYSDYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQ 358 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 A+ +A+QIL ++ + I+ +TDG +++ Sbjct: 359 ALTEAFQILKQFQE------------ARQGSLCNQAIMLITDGAVEDYEPVFEKYNWP-- 404 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + +++ T I +K + + +Y ++ L Sbjct: 405 --DRKVRVFTYLIGREVTFADRMKWIACNNKGYYTQIST--LADA 445 >gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1] gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1] Length = 718 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 54/207 (26%), Gaps = 46/207 (22%) Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 VVD+SGSM+ M K L L ++ H ++ Sbjct: 292 FVVDVSGSMN--------------------GYPMEVSKKLLRNLLVNLPETDHYNILLFA 331 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 G + V + E +++ + + T A+K AY + Sbjct: 332 G----GSSVLAPEPLACTKENIQKGINFLTNERGGGGTRLLNALKTAYAL---------- 377 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + ++ +TDG I + + T I + N Sbjct: 378 --------PRMDKTSARSMVVITDG--YVSVERKAFEMIEQNLGQAN--VFTFGIGSGVN 425 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHV 401 L S + + V Sbjct: 426 RYLLEGMAKISNSETFIATEMNEANDV 452 >gi|127512111|ref|YP_001093308.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] gi|126637406|gb|ABO23049.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] Length = 1204 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 11/59 (18%), Positives = 21/59 (35%), Gaps = 2/59 (3%) Query: 359 ENFIKIVTISINASP-NGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHRKYS 415 + + TI + N + LLK S +++ +A L Q ++ S Sbjct: 458 DQHVITHTIGFSEGAKNAEDLLKKVASKGGGSYFDATDATKLQGSIQQAVNEVLANSAS 516 >gi|293384268|ref|ZP_06630156.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387001|ref|ZP_06631569.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312906299|ref|ZP_07765310.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312909645|ref|ZP_07768499.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078416|gb|EFE15780.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083543|gb|EFE20506.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310627674|gb|EFQ10957.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311290047|gb|EFQ68603.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 383 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 23/281 (8%), Positives = 69/281 (24%), Gaps = 31/281 (11%) Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLRIEMGERPIFL 192 + + + + + + + ++ N + + G I + Sbjct: 103 FNVTLDVKGNQTESPIDLVLVIDYSSSMKGEKLNNALKGLQQFGEELSDSLTDGHVRIGI 162 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + + + + + + T M L L + + Sbjct: 163 VAYNRLTYSTADFSTDMNDLEDFLRNTAEPHSGTFMQK-----GLLEGQRLLAEKSRPNA 217 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ ++ +D + + + S Sbjct: 218 KKMLVHIGDGSANASFLPRENAQIYPNNGEIIDYNGYHTSSYMEEFQTESNQYYTSNSAS 277 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINAS 372 N + N+ NT+ K++ +I+ N S Sbjct: 278 TDANAI---------------------QTNSTTVTDNTLGTIVSLKKSGDTYYSIAANPS 316 Query: 373 PNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLMVHR 412 G+ + + S P+ + + N L + ++ + + Sbjct: 317 LRGEYISRNIASDPKNYLIIDENLSGLGTALKELAGSINNT 357 >gi|317125804|ref|YP_004099916.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589892|gb|ADU49189.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 568 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 11/87 (12%), Positives = 31/87 (35%), Gaps = 4/87 (4%) Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK----ENFIKIVTISINA 371 +++ + ++ +TDG N++ ++ + K N I+I + + Sbjct: 472 ATYQRARATYTKGQLNSVVIVTDGLNDDDYGASLSVALSRVKKLVDPRNPIRITIVGFGS 531 Query: 372 SPNGQRLLKTCVSSPEYHYNVVNADSL 398 P+ + + + + N L Sbjct: 532 EPDAKAMTPFAQLTGGRYVNAAEPKDL 558 >gi|270158236|ref|ZP_06186893.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|289163508|ref|YP_003453646.1| unknwon proteins [Legionella longbeachae NSW150] gi|269990261|gb|EEZ96515.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|288856681|emb|CBJ10492.1| putative unknwon proteins [Legionella longbeachae NSW150] Length = 309 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 46/163 (28%), Gaps = 26/163 (15%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMK 299 + L K+ GL+ +T+ + + + + D + + + A+ Sbjct: 120 KLHDLFAHKDAGQFGLVIFTSEPFVVSPLTDDGQTISSLLASLTPDIMPVTGQNLHSALN 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A ++ K + ++ D + +I + K E Sbjct: 180 EASTLI-----------------KQAGYNQGQILVMTAD------SPSSESIALAKKLAE 216 Query: 360 NFIKIVTISINASPNGQRLLKT-CVSSPEY-HYNVVNADSLIH 400 + I + I A N L + ++ L Sbjct: 217 SGIYSSIMPIKADQNLNPLFQKFAQEGEGQLIKYTSDSSDLDQ 259 >gi|107099783|ref|ZP_01363701.1| hypothetical protein PaerPA_01000801 [Pseudomonas aeruginosa PACS2] gi|218893649|ref|YP_002442518.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa LESB58] gi|218773877|emb|CAW29691.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa LESB58] Length = 1163 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 25/261 (9%), Positives = 52/261 (19%), Gaps = 29/261 (11%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F +N + + ++G + Sbjct: 252 DFTGQHRVNFFNWLENLSVGGGTPLRQAMTRAGEFLKKTGVNGPYAYR--PGTQTSPEYS 309 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 L N + + D S + Sbjct: 310 CRGSYHILMTDGLWNNDSASVGNADSTSRS--------LPDGKSYSSQTPYRDAASNTLA 361 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + TD+ P + + ++ + + P+ L Sbjct: 362 -----DQAFHYWATDARPDIDDNIKPYIPYPDQANPSAEYWNPRNDPATWQHMVTYTLGL 416 Query: 339 GENNNFKSN---------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 G + S + N S N L V+S Sbjct: 417 GLTTSLTSPKWEGSTYSGGYDEIAAGRLSWPNASN-----NHSNNVYDLWHAAVNSRGEF 471 Query: 390 YNVVNADSLIHVFQNISQLMV 410 ++ + D L+ FQ+I + Sbjct: 472 FSADSPDQLVAAFQDILNRIS 492 >gi|15602709|ref|NP_245781.1| TadF [Pasteurella multocida subsp. multocida str. Pm70] gi|12721153|gb|AAK02928.1| TadF [Pasteurella multocida subsp. multocida str. Pm70] Length = 187 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 20/46 (43%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 FK ++K +I F M ++L FL ++ +++A Sbjct: 9 FKHFWKNKKGAVTIEFLFMSMFLIVLFAFLFDLVMLRSTLGKLDNA 54 >gi|158319480|ref|YP_001511987.1| hypothetical protein Clos_0429 [Alkaliphilus oremlandii OhILAs] gi|158139679|gb|ABW17991.1| conserved hypothetical protein [Alkaliphilus oremlandii OhILAs] Length = 244 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 32/80 (40%), Gaps = 7/80 (8%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKT--CVSSP 386 K +I +TDG++N S I +KA N I + TI I N L + + Sbjct: 8 KQMILVTDGQSNVGGS---PIIAAEKAYRNGIIVNTIGIVDGKESNEDALNEIVEIAKAG 64 Query: 387 EYHYNVVNADSLIHVFQNIS 406 Y + L Q+++ Sbjct: 65 GGTYEYSYINELFQTMQSLT 84 >gi|307327616|ref|ZP_07606801.1| TadE family protein [Streptomyces violaceusniger Tu 4113] gi|306886728|gb|EFN17729.1| TadE family protein [Streptomyces violaceusniger Tu 4113] Length = 151 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 16/135 (11%), Positives = 40/135 (29%), Gaps = 5/135 (3%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ ++++ A+ + ++G +I + A ++A A A Sbjct: 13 LRRDRGSYAVETAVLAPVLIGILGLMIAFGRI----ADADGAVDSAARAAARAASLERDA 68 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + E+ S D + + SGY+ S +++ + Sbjct: 69 GTAQAEAQSAATHSL-DGDGVTCQTSSVTVDTSGYALDVGEEASVTATISCTANLSDIGL 127 Query: 135 NRLDSSNNTIFYNMD 149 L S Sbjct: 128 PGLPGSKTLTASWTS 142 >gi|51597677|ref|YP_071868.1| hypothetical protein YPTB3379 [Yersinia pseudotuberculosis IP 32953] gi|186896816|ref|YP_001873928.1| TadE family protein [Yersinia pseudotuberculosis PB1/+] gi|51590959|emb|CAH22617.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] gi|186699842|gb|ACC90471.1| TadE family protein [Yersinia pseudotuberculosis PB1/+] Length = 157 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 35/106 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + + + ++ F L + F+ ++ + Y +++ A + A + Sbjct: 2 MRDTILRLLPANRGSIAVEFTLIFILFIFMLLLVAETSRLFYTSANLDFALSEAAKTAKN 61 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + N+ FE N L + + S A Sbjct: 62 RDDENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVAD 107 >gi|71028596|ref|XP_763941.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350895|gb|EAN31658.1| thrombospondin-related protein [Theileria parva] Length = 552 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 58/181 (32%), Gaps = 31/181 (17%) Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL-----KPTDSTP 296 + +S V++ ++ ++T + I + K R +D L T + Sbjct: 239 VKSISISPNYVHLSVVTFSTSIRWLISFLNPSGKDENLALRVIDELKNSKPVFGFTFTGQ 298 Query: 297 AMK---QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ +A + + K II +TDG + + Sbjct: 299 ALNFITEAVYQFGARQN------------------APKAIILITDGSSTQPNVTSQASAM 340 Query: 354 CDKAKENFIKIVTISINASPNGQ-RLLKTCVSSPEY-HYNVVNADSLIHVFQNISQLMVH 411 +E + I+ + + + + + R + C H+ + N + +I + + Sbjct: 341 ---LREAGVTILVVGVGMARDYECRAVAGCPVKGNCPHFFMTNWNEIIRKVGELMAEVCE 397 Query: 412 R 412 Sbjct: 398 T 398 >gi|159139286|gb|ABW89597.1| putative type IV pili-associated adhesin precursor [Pseudomonas aeruginosa] Length = 1163 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 25/261 (9%), Positives = 52/261 (19%), Gaps = 29/261 (11%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F +N + + ++G + Sbjct: 252 DFTGQHRVNFFNWLENLSVGGGTPLRQAMTRAGEFLKKTGVNGPYAYR--PGTQTSPEYS 309 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 L N + + D S + Sbjct: 310 CRGSYHILMTDGLWNNDSASVGNADSTSRS--------LPDGKSYSSQTPYRDAASNTLA 361 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + TD+ P + + ++ + + P+ L Sbjct: 362 -----DQAFHYWATDARPDIDDNIKPYIPYPDQANPSAEYWNPRNDPATWQHMVTYTLGL 416 Query: 339 GENNNFKSN---------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 G + S + N S N L V+S Sbjct: 417 GLTTSLTSPKWEGSTYSGGYDEIAAGRLSWPNASN-----NHSNNVYDLWHAAVNSRGEF 471 Query: 390 YNVVNADSLIHVFQNISQLMV 410 ++ + D L+ FQ+I + Sbjct: 472 FSADSPDQLVAAFQDILNRIS 492 >gi|111026118|ref|YP_708401.1| hypothetical protein RHA1_ro10050 [Rhodococcus jostii RHA1] gi|110824961|gb|ABH00243.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 150 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 13/144 (9%), Positives = 43/144 (29%), Gaps = 5/144 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + ++ + +I L+ + +LL+G+ + + A GA Sbjct: 2 RLTRWLSRARRDDRGDVTIELCLATVILILLLGW----MYAYGVNRQAHQKVEHAATEGA 57 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + A ++ + ++ SG+ + + Sbjct: 58 RAASLARTIATATPLAYQ-AAAGSMDGQGLKCATMNVNADTSGFRTRPGVPATVEVTVTC 116 Query: 126 RISMTHMANNRLDSSNNTIFYNMD 149 ++S + ++ + + Sbjct: 117 QVSFDALGWPGVNGARTVTATAIS 140 >gi|239934064|ref|ZP_04691017.1| hypothetical protein SghaA1_38052 [Streptomyces ghanaensis ATCC 14672] gi|291442503|ref|ZP_06581893.1| TadE family protein [Streptomyces ghanaensis ATCC 14672] gi|291345398|gb|EFE72354.1| TadE family protein [Streptomyces ghanaensis ATCC 14672] Length = 150 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 15/143 (10%), Positives = 45/143 (31%), Gaps = 2/143 (1%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 +SR + + E + +I A+ V ++ + I +++A A Sbjct: 1 MSRRARWAGRLRRDE-GSVAIEAAIIVPPLIMFVCLAIAGGRIVTSGAKIDAAAEDAARE 59 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + ++ + + + + + + + S+ I VN Sbjct: 60 ASIHRTAASAQAAAQAAAAESLNDQGITCASSSVSIDTGGLSVPVGQVGTVTVSISCTVN 119 Query: 124 SSRISM-THMANNRLDSSNNTIF 145 + + + L S+ ++ Sbjct: 120 LADLLLPGVPGARTLTSTATSVV 142 >gi|171317111|ref|ZP_02906314.1| TadE family protein [Burkholderia ambifaria MEX-5] gi|171097745|gb|EDT42572.1| TadE family protein [Burkholderia ambifaria MEX-5] Length = 148 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 11/128 (8%), Positives = 33/128 (25%), Gaps = 8/128 (6%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ ++ FA+ F ++ ++ ++ + LA + + L+ Sbjct: 10 CRRQRGATAVEFAIIFPVFFVIFYAILSFGMIFT----IQQSL---TLAASEGARAGLNY 62 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 ++N A + S S ++ + M Sbjct: 63 APSLAARVTNATTTAQNVVGW-LNIGTPQVSAPQCSYDTSKPPSLYCLSVTVSYMPESWV 121 Query: 135 NRLDSSNN 142 + Sbjct: 122 TTMPFLGT 129 >gi|332237901|ref|XP_003268145.1| PREDICTED: integrin alpha-10 isoform 3 [Nomascus leucogenys] Length = 1032 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 70/238 (29%), Gaps = 32/238 (13%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 R + LV L S+ + + ++ L + + Sbjct: 5 NRSGMELPLVTHLFLSLVFLTGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLF--- 61 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 E + +GL+ Y K V + + T + A+ A Sbjct: 62 IDPEQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIMVACTE 120 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 S + ++ +TDGE+++ + +K C+ + + Sbjct: 121 GFSQSHG-------------GRPEAARLLVVVTDGESHDGEQLPAALKACEAGR---VTR 164 Query: 365 VTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 I++ + R ++T S P+ + +NV + +L + + + Sbjct: 165 YGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 222 >gi|326672739|ref|XP_002664110.2| PREDICTED: vitrin [Danio rerio] Length = 868 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 22/205 (10%), Positives = 55/205 (26%), Gaps = 37/205 (18%) Query: 178 PALLRIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 R+ + D + M M+ + + K+ Sbjct: 457 GFTSRVMDTTARAPEVPSQGDPNCKVDMAFLMDGSWSIG----------KRRFKIQKD-- 504 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYVTRDMDSLILKPTD 293 ++ L+ MG+I Y ++ ++ +++ + P+ Sbjct: 505 -FLVEVSQALNVGVAGAMMGIIQYGDDPVTEFSLKQFSNSKDLKPAISKIV--QKGGPSH 561 Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 A+ + SD + + L DG + + Sbjct: 562 VGKALSYINKQFFSDANGNRGGA-------------PNVAVVLVDGWPTD-----RVEEA 603 Query: 354 CDKAKENFIKIVTISINASPNGQRL 378 A+E+ I I ++I + ++ Sbjct: 604 SRLARESGINIFFVTIEGPDDNEKQ 628 >gi|325275094|ref|ZP_08141074.1| hypothetical protein G1E_17455 [Pseudomonas sp. TJI-51] gi|324099771|gb|EGB97637.1| hypothetical protein G1E_17455 [Pseudomonas sp. TJI-51] Length = 651 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 44/145 (30%), Gaps = 7/145 (4%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGASKMVSNL 72 A ++ ++ +++ LL + +I + ++ ++ + A L A V Sbjct: 5 FAARQRGAIGLMAVVTLGLALLFMLLVIDSGRLYLEQRKLQRIADMAALEAAGQYAVCTG 64 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S + + + N + S + N+ ++ Sbjct: 65 SGPQATVVARAAATRNGHA------PGNPLAASCGYLQTGANSLRTFTTDNNRTEAIRVD 118 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYR 157 +N + +S Y++ T Sbjct: 119 VSNTVPTSVAAGVYSLAQGTGVPLT 143 >gi|226360795|ref|YP_002778573.1| hypothetical protein ROP_13810 [Rhodococcus opacus B4] gi|226239280|dbj|BAH49628.1| hypothetical protein [Rhodococcus opacus B4] Length = 891 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 52/207 (25%), Gaps = 35/207 (16%) Query: 233 NALLLFLDSIDLLSHV-KEDVYMGLIGYTTRVEKNIEPSWGTE----KVRQYVTRDMDSL 287 +A L+ ++ ++ + + G+ + + ++ V + Sbjct: 67 SAANYLLEQLNTFGGSAGVELNVAIAGFADDFTVHAPWTRLDNGSLPALQGEVEKFRTRT 126 Query: 288 ILKPTDSTPAMKQAYQILTS-----------DKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 TD A+ A + L Q + Sbjct: 127 DGIDTDYWNALDGARRTLAEHDSQSEANRCQAVAWFSDGKLDFTVRDAEKPYAQGISLRS 186 Query: 337 TDGENN----NFKSNVNTIKICDKAKENFIKIVTISINASPNGQ---RLLKTCVS----- 384 G +S I D+ + + I + + A L+++ + Sbjct: 187 DQGVQQVVAAARESICRPAGIADQLRSSGIVTFAVGLAAGTAQPSDFDLMRSIATGGDGA 246 Query: 385 -------SPEYHYNVVNADSLIHVFQN 404 SP Y N D L+ F Sbjct: 247 CGKTTSPSPGDFYLAQNIDDLLFAFDA 273 >gi|254773191|ref|ZP_05214707.1| hypothetical protein MaviaA2_00706 [Mycobacterium avium subsp. avium ATCC 25291] Length = 609 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 16/156 (10%), Positives = 44/156 (28%), Gaps = 11/156 (7%) Query: 40 LIYVLDW------HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + K ++ESA + A +A + ++ + L+ + E +S L + Sbjct: 201 AVEGMLARIGKEQQRTKAALESARDFAAVA-SHELRTPLTAMRTNLEVLSTLD---LPHE 256 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-MTHMANNRLDSSNNTIFYNMDVMT 152 ++ + + + S A E + + L + Sbjct: 257 QRQEVIGDVIRTQSRIEATLTALERLAQGELTTVDDFVPFDITELLDRAAHDALRVYPDV 316 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ ++V ++ G Sbjct: 317 EVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHGNA 352 >gi|163842175|ref|YP_001626580.1| von Willebrand factor type A domain-containing protein [Renibacterium salmoninarum ATCC 33209] gi|162955651|gb|ABY25166.1| von Willebrand factor type A domain, putative [Renibacterium salmoninarum ATCC 33209] Length = 267 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 31/248 (12%), Positives = 71/248 (28%), Gaps = 41/248 (16%) Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 +L + + PA E+ I L +D+SGSM Sbjct: 5 SVLALSVGALLAASRPAEQNANRPEQNNRDIMLCLDVSGSMVDTDA-------------- 50 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 +D L+ +GL+ + + + + ++ + Sbjct: 51 --------------KIVDVFANLAQEFHGERLGLVIFDSTAVQVFPLTEDYGYIKDELNV 96 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + ++ + D+ Y S K+ + + I+F TD Sbjct: 97 ALKAMTDQSDDTGF-FGGTYSGRGSSLIGDGLATCVNSFPKLGAEQRSRSIVFATDNVLL 155 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNG---------QRLLKTCVSSPEYHYNVV 393 T A +N I++ IN + + +++ S+ +Y + Sbjct: 156 GKPLFSLT-DAAGLATKNSIRVY--GINPNGSDVAGAKGAAAKQMRAAVESTGGNYYVLA 212 Query: 394 NADSLIHV 401 + + + Sbjct: 213 DDAAAKDI 220 >gi|227544809|ref|ZP_03974858.1| hypothetical extracellular protein [Lactobacillus reuteri CF48-3A] gi|227185210|gb|EEI65281.1| hypothetical extracellular protein [Lactobacillus reuteri CF48-3A] Length = 966 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 52/189 (27%), Gaps = 10/189 (5%) Query: 40 LIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIK 99 + N A A A + + ++ N+ + + A + + Sbjct: 187 AVD--QAQQTVNQASQAVEQAKQAVVDATNRAAEQKTEEAQNTINNLENQGKNQASQDLN 244 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY--R 157 N I + T+ QN VN++ + + N D N D Sbjct: 245 NQI------NNVSGQVTDAQNDVNNTNAKIGDLQNQVKDLQNQETNTQADPNVKTVPNGV 298 Query: 158 LQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSA 217 + + + LL + + + + G+M A + DV+S+ Sbjct: 299 IDYSKSLLENVARNNLWAPADMNANTNGNVVEQNIYVSPDNDKGNMSGANRNHDADVSSS 358 Query: 218 PICQDKKRT 226 Sbjct: 359 AYDPYYTTD 367 >gi|297279796|ref|XP_002801793.1| PREDICTED: integrin alpha-10-like isoform 3 [Macaca mulatta] Length = 1036 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 73/243 (30%), Gaps = 41/243 (16%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL-----------KNALLLFLDSI 242 + H +N D N + D T M + + + FL + Sbjct: 1 MNITHAYSQGHQKLNLDTVSYNILGVDSDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRL 60 Query: 243 -DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQYVTRDMDSLILKPTDSTPAMK 299 L E + +GL+ Y K V + + T + A+ Sbjct: 61 VGKLFIDPEQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRRE-GRETKTAQAIM 119 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 A S + ++ +TDGE+++ + +K C+ + Sbjct: 120 VACTEGFSQSHG-------------GRPEAARLLVVVTDGESHDGEELPAALKTCEAGR- 165 Query: 360 NFIKIVTISI--------NASPNGQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 + I++ + R ++T S P+ + +NV + +L + + + Sbjct: 166 --VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRI 223 Query: 410 VHR 412 Sbjct: 224 FGL 226 >gi|257876502|ref|ZP_05656155.1| predicted protein [Enterococcus casseliflavus EC20] gi|257810668|gb|EEV39488.1| predicted protein [Enterococcus casseliflavus EC20] Length = 799 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 26/348 (7%), Positives = 71/348 (20%), Gaps = 34/348 (9%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + D + + + Sbjct: 30 SETTTYDFDYQTDATGQYLTNGTAQGDTYNYKYGQAP---GVAFDEESYVNYADEAYLKK 86 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA----LLR 182 D + + + + ++ ++ Sbjct: 87 SVKEAADQGLFDVTLAVKGNQTSTPVDLVMVIDYSSSMTGEKLTNALLGLQQFGNELADS 146 Query: 183 IEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + G I ++ ++ + + + + + T M A L Sbjct: 147 LANGSIRIGIVAYNREIYTTNGFSTDIAYLEQFLRETAETHSGTFMQKGLLAGQQLLLEE 206 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAY 302 K +++G R Y+ D ++ Sbjct: 207 SRPEAEKLFIHIG----------------DDSANRSYLPADDATVYQNTGAIRDYNGYHT 250 Query: 303 QILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI 362 + + S S +TD T+ K++ Sbjct: 251 ESYVESFQTSATAYHTTSKNPSDSQAIPVSSTVVTD----------ETLGTIVALKKSGF 300 Query: 363 KIVTISINASPNGQRLLKTCVSSPEYHYNV-VNADSLIHVFQNISQLM 409 +++ S G+ + + S+P + N L I+ + Sbjct: 301 TYYSVATAPSARGEYIGRNLASAPSNYLTTDENLSGLGSALTEIANHI 348 >gi|149202861|ref|ZP_01879832.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] gi|149143407|gb|EDM31443.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] Length = 212 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 11/114 (9%), Positives = 27/114 (23%), Gaps = 15/114 (13%) Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + + ++ +TDG + A+ + + Sbjct: 80 LMPGGLTPIAASVAAAAEVLEYRTQPGIVVLVTDGNETCG-GTPCALGAALTAEARDLTV 138 Query: 365 VTISINA-------------SPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN 404 I +GQ + K ++ + D L+ + Sbjct: 139 HVIGFRVVHDPFSWNSPEAQGYDGQTVAKCLADATGGVFVSTETVDELVEALRE 192 >gi|153834035|ref|ZP_01986702.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148869590|gb|EDL68580.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 423 Score = 39.8 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 6/41 (14%), Positives = 18/41 (43%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES 55 S+K ++ ++ ++ L + F I + K +++ Sbjct: 10 RRSQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQN 50 >gi|68072835|ref|XP_678331.1| von willebrand factor a-domain-related protein, [Plasmodium berghei strain ANKA] gi|15077688|gb|AAK83296.1| micronemal protein WARP [Plasmodium berghei] gi|29838451|gb|AAO92597.1| von Willerbrand factor A domain-related ookinete/early oocyst protein [Plasmodium berghei] gi|56498766|emb|CAH99142.1| von willebrand factor a-domain-related protein, putative [Plasmodium berghei] Length = 303 Score = 39.8 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 20/188 (10%), Positives = 53/188 (28%), Gaps = 21/188 (11%) Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + D+ + K + K+ +M +I + Sbjct: 91 NYCDNYYDITLVVENSSFVQKDYWM-KGTIPFLESMARNARVSKDKAHMSIILFAGTQNL 149 Query: 266 NIEPSWGTE----KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + K+ + + D+ T A++ A++ + + Sbjct: 150 IVPFTDEISQDKEKLIEKIRTLDDAGTDSNTLYVYALEYAFEKVIFGEGTRSDA------ 203 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR-LLK 380 K + G + ++ + + K++ IK++ + I + LL Sbjct: 204 --------PKIAVLFYYGFDYGANKSLIP-DVVEDYKQSNIKLIIVGIGLTLRENALLLA 254 Query: 381 TCVSSPEY 388 C S + Sbjct: 255 DCKSEGDC 262 >gi|307591433|ref|YP_003900232.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986287|gb|ADN18166.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 491 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 13/174 (7%), Positives = 41/174 (23%), Gaps = 3/174 (1%) Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 + + + ++ + + R ++ I ++ + + Sbjct: 50 TPQEIVLLIDCSGSMDGNKLSEVKTAATSFVQRQDLITNRIAVMGFGSGVQLGTPLTSDV 109 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLF---LDSIDLLSHVKEDVYMGLIGYTTRVEKN 266 + A + AL A S++ E+ ++ L + Sbjct: 110 NVLQTAIANLYDGGGTMMDQALTAATDQLHNASASLESAIPSGENQHILLFTDGVAADPY 169 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 G + + T+ + ++ +F F Sbjct: 170 NTLVAGQTAQNAQINIVAVATGDADTNFLSQLTGDPNLVFYANTGNFDAAFQAA 223 >gi|239832734|ref|ZP_04681063.1| TadE family protein [Ochrobactrum intermedium LMG 3301] gi|239825001|gb|EEQ96569.1| TadE family protein [Ochrobactrum intermedium LMG 3301] Length = 180 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 46/153 (30%), Gaps = 17/153 (11%) Query: 4 LSRFRFYFKKGIA---------SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME 54 RF F + ++ ++ FA+ FLL++ +I + N+++ Sbjct: 27 RKRFSFVTDRFRRIPLLLFFNRNKSGTAAVEFAILAPVFLLILMGMIAFGLYLGVANAVQ 86 Query: 55 S-ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 A +A ++ + + S K A K + + Sbjct: 87 QLAADA-----TRTALAGIDPPERLALATSYIRKNAAKYSLIDPEKMQVNVDNAQSDPNQ 141 Query: 114 YNTEIQNIVNSSRI--SMTHMANNRLDSSNNTI 144 + IQ + I MT + + + Sbjct: 142 FTVTIQYDAENLPIWNLMTGLPLPDTIITRAST 174 >gi|99081993|ref|YP_614147.1| hypothetical protein TM1040_2153 [Ruegeria sp. TM1040] gi|99038273|gb|ABF64885.1| hypothetical protein TM1040_2153 [Ruegeria sp. TM1040] Length = 201 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 11/109 (10%), Positives = 27/109 (24%), Gaps = 15/109 (13%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFL---IYVLDWHYKKNSMESANNA 59 + + F+ E+ N ++ AL + L + + Sbjct: 1 MFRKIAHKFRAFRRDEEGNIAVEAALYLPLLLFVFAATYTLFDLFRQETVNTK------- 53 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 A+ VS+L + + + + +S Sbjct: 54 -----AAYTVSDLISRETTALNDEYINSIYTLGKLMARAGSDMSMRISV 97 >gi|227114883|ref|ZP_03828539.1| hypothetical protein PcarbP_18060 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 182 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 11/119 (9%), Positives = 31/119 (26%), Gaps = 6/119 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA------GAS 66 + S + + A V L+ + L + + +A + A+ A + Sbjct: 21 RHWRSTRGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAFRLDNLASD 80 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +RL R + +++ + + ++ Sbjct: 81 SAEQMGTRLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNGNQDNPNGEETTT 139 >gi|313675093|ref|YP_004053089.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941791|gb|ADR20981.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 461 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 60/207 (28%), Gaps = 39/207 (18%) Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT--------TRV 263 D + + + + ++ A K L +DS+ V V + L Y Sbjct: 38 FDASGSMLAKWGNELRIDAAKRVLTDLVDSL----RVNNKVELALRPYGHLTPAKERNCQ 93 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + +E + + + R T +++Q+ + Sbjct: 94 DTKLEIPFAPNNNDRIIDRLKYIYPRGTTPIAYSLEQSAKDF------------------ 135 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI--VTISINASPNGQRLLKT 381 ++ II +TDG + + + + + ++ I + I + + Sbjct: 136 PKDNNYRNIIIIITDGIESC---DGDPCAVSLELQKKDIFLRPFVIGLGMEEKFAAEFEC 192 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 ++N + + V I Sbjct: 193 M----GEYFNAKDISAFQAVLNGILDQ 215 >gi|90406743|ref|ZP_01214936.1| hypothetical protein PCNPT3_01885 [Psychromonas sp. CNPT3] gi|90312196|gb|EAS40288.1| hypothetical protein PCNPT3_01885 [Psychromonas sp. CNPT3] Length = 156 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 11/142 (7%), Positives = 42/142 (29%), Gaps = 6/142 (4%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R KK + + ++ F + + ++ ++ + + + ++ A + A Sbjct: 2 RLLQRPKKLRSQQSGAATLEFVFTFIPLFVMCLLMLEICRYMITSSVLDVALSNATRQVI 61 Query: 66 SKMVSNLSRLGDRFESIS------NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 V++ + + N + K +K+ + Sbjct: 62 VTPVNSDVSAQLKQLIDAKKWPLINAKNITINAQYFTDFKALVKQQSDSVHHGQIYAKYI 121 Query: 120 NIVNSSRISMTHMANNRLDSSN 141 + + + ++ + S+ Sbjct: 122 LSYSYTPLLLSFSGFSFSKLSS 143 >gi|119960552|ref|YP_948617.1| TadE-like family protein [Arthrobacter aurescens TC1] gi|119947411|gb|ABM06322.1| putative TadE-like family protein [Arthrobacter aurescens TC1] Length = 149 Score = 39.8 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 15/133 (11%), Positives = 33/133 (24%), Gaps = 6/133 (4%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 S+ ++ FAL + FL+L+ + + S+ A + + + + Sbjct: 19 AKSQSGAVAVEFALVLPIFLVLVLGIFEFGRAFNIQISLSEAA------REAARYAAIHQ 72 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + A I + SG S I + Sbjct: 73 SDSTYSVGDAQAAGVAAAPTVDLAPGDITVTSSGTSPCNVEVNISYSTPWMTGFPGLVPG 132 Query: 135 NRLDSSNNTIFYN 147 + + Sbjct: 133 MPAELDISGTGVM 145 >gi|260805092|ref|XP_002597421.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] gi|229282686|gb|EEN53433.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] Length = 705 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 54/175 (30%), Gaps = 23/175 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 ++ A +D++ ++ + +G I Y + I T + Sbjct: 519 LSIFNQAKQFIVDTLQCIADRGVQIGVGYIVYDCVPKTIITLGTYTSDDPAVSGIIHYEM 578 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 T + + Y LTS + + LTDG+ ++ Sbjct: 579 TEGGT-TRTPLAIRYMRLTSKF----------------RDGAARAAVILTDGQTEGDAAD 621 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 + + I++ ++I + +G L+ S ++ + +L + Sbjct: 622 DASDAR-----DAGIEMYAVAIGSFVDGSA-LQAIAGSGANVFDSSDPCALANRM 670 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 53/197 (26%), Gaps = 31/197 (15%) Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLL---SHVKEDVYM 254 + V + T KN ++ ++ + + + + Sbjct: 595 TCGCCDCEKRCGALDIVFVIDSSESVGLTNFTLEKNFVINTINRLGSIAKDPSSETGTRV 654 Query: 255 GLIGYT-TRVEKNIEPSWGT----EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 G++ Y+ + I + + V + I T + A+K AY L Sbjct: 655 GVVQYSHNGTFQAIRLNDSKIDSMSAFKDAVKKLE--WIAGGTWTPSALKFAYDNLI--- 709 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 ++ +TDG + + +C + I + I I Sbjct: 710 ------------RDSRRSKANVTVVVITDGRYDPRDDDKLLNYLCT---DTSIDVNAIGI 754 Query: 370 N---ASPNGQRLLKTCV 383 P LK+ Sbjct: 755 GDMFDQPEENESLKSIA 771 >gi|114564047|ref|YP_751561.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] gi|114335340|gb|ABI72722.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] Length = 1204 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 27/349 (7%), Positives = 70/349 (20%), Gaps = 25/349 (7%) Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + Q V Sbjct: 148 DTTNASGVGAGYPVDDLSDAYKNDGNALNNAKLTDFGLGEPITLFTQQYVQWYYNKPAAT 207 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRL-----QFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 +R+ + + + D L N+ + + + Sbjct: 208 NTDRITVAKRVLNEVIVNTPGVDLGLMLFNYNASGS-NRSSENKDGGRVVSKINTGNINN 266 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + ++ N+ P + + + + Sbjct: 267 KKAL----ASIVKNTIAIKSNNTPLCETLYEAALYLHGKPLEYGHDTYDSVAATPLNKTW 322 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL---KPTDSTPAMKQAYQI 304 + G I +T V + G + + V + I S + Sbjct: 323 KWVKKWNGWIEESTPVNPAQ-YNPGYDATAESVDGYISPFIGKKCGSNASIIYITDGAPT 381 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK------AK 358 +D + + + + G +N + Sbjct: 382 SDTDAQGKVKSLLGLASTTTDADVPGSESL----GNDNYSLLPPLAHWLATNDVNPNSEG 437 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF-QNIS 406 + + TI +A L+ T +Y+ ++ L I+ Sbjct: 438 DQVVNTYTIGFSADAPVDILISTAALGGGKYYSALDPAELQASLISAIA 486 >gi|313216071|emb|CBY37449.1| unnamed protein product [Oikopleura dioica] Length = 107 Score = 39.8 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 36/97 (37%), Gaps = 17/97 (17%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 +K ++ LTDG++ + AK I+ I + + LK + Sbjct: 12 RPYVKKHMVLLTDGQSQDDVG-----APARAAKNFNIRTFAIGVGD--AIEDELKLVATP 64 Query: 386 P--EYHYNVVNADS---LIH--VFQ---NISQLMVHR 412 P + Y+V + D L F+ ++ + ++ Sbjct: 65 PFSDTLYHVEDYDGIRHLQDTLAFKFCEDLGKSLLTL 101 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 39.8 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 28/348 (8%), Positives = 81/348 (23%), Gaps = 38/348 (10%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 + S ++ +DA + S + V N + N + + Sbjct: 412 TVSANGSPMTQQGGGNDAYAETGPFTVGTTSHVTIVNSNVKPSNNKSIVKTGGGDGDQYT 471 Query: 137 LDSSNNT----IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFL 192 L + + + + + N + ++ Sbjct: 472 LYLTASGDSTSSTVTTTTPADIVLVMDKSGSMKGELDNNAKEAANALAKKLL--TDKNST 529 Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 + + ++ + F + +++ E Sbjct: 530 LPSEQQVQMAVVTFSTKATIE----------------------QNFTTDVLKINNAVEGD 567 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA----YQILTSD 308 G + +++ S G V++++ D T S + Y Sbjct: 568 PDGGTNWEAALKQANILS-GRSNVKKHIIFLSDGDPTFRTSSYGGSCYSHWGPYYTPQPK 626 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN---TIKICDKAKENF-IKI 364 + + + + DG + + S+ +AK + Sbjct: 627 YTTKEACTAAKYDWLDENPDDKYDGVPGYDGVHGSGSSDGYGYNYQAALAEAKGRGDAAL 686 Query: 365 VTISINASPNG-QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + ++ N ++L F I + Sbjct: 687 YVVKTSTEAKKMADFAEQAGAVDGKEFDGTNPENLTKAFNQIYSSITS 734 >gi|32477944|ref|NP_870938.1| hypothetical protein RB13235 [Rhodopirellula baltica SH 1] gi|32448501|emb|CAD78016.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 140 Score = 39.8 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 26/63 (41%), Gaps = 1/63 (1%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVSNLS 73 I +++ + F + + L++ + Y + S++ +A A LA + S + Sbjct: 13 IRAKRGAALMEFVMCLPVLLVITLGTLETCRMIYLRQSLKLAAYECARLAIVPGVDSQMV 72 Query: 74 RLG 76 + Sbjct: 73 QDQ 75 >gi|260837260|ref|XP_002613623.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] gi|229299009|gb|EEN69632.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] Length = 655 Score = 39.8 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 13/172 (7%), Positives = 47/172 (27%), Gaps = 15/172 (8%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + ++ +K+ LD + +V + + + ++ + T Sbjct: 308 TSGSVEGRSLSLMKHTTWFLLDRLTEDDYVATGYFNAYAQAVSCLSSFVQAT--THNKEV 365 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + + ++ A++I + K I+ +T+ Sbjct: 366 IHKSLDNLEAADQANYYAGLEYAFKIF--------NNFEMEDRFENQGAECNKVIVLVTE 417 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 ++ + I++ I + + +L+ ++ Sbjct: 418 NAELYPEAVFQKYNP-----DRNIRVFVIVVGEPIHDWSVLQKMACDNRGYF 464 >gi|150260284|ref|ZP_01917012.1| putative pseudopilin [Yersinia pestis CA88-4125] gi|162418845|ref|YP_001604882.1| hypothetical protein YpAngola_A0264 [Yersinia pestis Angola] gi|165925141|ref|ZP_02220973.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165939868|ref|ZP_02228407.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. IP275] gi|166008852|ref|ZP_02229750.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211965|ref|ZP_02238000.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167399469|ref|ZP_02304993.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167418764|ref|ZP_02310517.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425180|ref|ZP_02316933.1| TadE-like family protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167466382|ref|ZP_02331086.1| membrane protein [Yersinia pestis FV-1] gi|218927877|ref|YP_002345752.1| hypothetical protein YPO0686 [Yersinia pestis CO92] gi|229837368|ref|ZP_04457531.1| Flp pilus assembly membrane protein TadE [Yersinia pestis Pestoides A] gi|229840580|ref|ZP_04460739.1| Flp pilus assembly membrane protein TadE [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842870|ref|ZP_04463022.1| Flp pilus assembly membrane protein TadE [Yersinia pestis biovar Orientalis str. India 195] gi|229900905|ref|ZP_04516029.1| Flp pilus assembly membrane protein TadE [Yersinia pestis Nepal516] gi|270487720|ref|ZP_06204794.1| TadE-like protein [Yersinia pestis KIM D27] gi|294502752|ref|YP_003566814.1| hypothetical protein YPZ3_0642 [Yersinia pestis Z176003] gi|115346488|emb|CAL19362.1| putative membrane protein [Yersinia pestis CO92] gi|149289692|gb|EDM39769.1| putative pseudopilin [Yersinia pestis CA88-4125] gi|162351660|gb|ABX85608.1| TadE-like family protein [Yersinia pestis Angola] gi|165912179|gb|EDR30817.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. IP275] gi|165923341|gb|EDR40492.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992191|gb|EDR44492.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206711|gb|EDR51191.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166962758|gb|EDR58779.1| TadE-like family protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051973|gb|EDR63381.1| TadE-like family protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167055943|gb|EDR65724.1| TadE-like family protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|229682244|gb|EEO78336.1| Flp pilus assembly membrane protein TadE [Yersinia pestis Nepal516] gi|229690137|gb|EEO82194.1| Flp pilus assembly membrane protein TadE [Yersinia pestis biovar Orientalis str. India 195] gi|229696946|gb|EEO86993.1| Flp pilus assembly membrane protein TadE [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705491|gb|EEO91501.1| Flp pilus assembly membrane protein TadE [Yersinia pestis Pestoides A] gi|262360784|gb|ACY57505.1| hypothetical protein YPD4_0596 [Yersinia pestis D106004] gi|262364729|gb|ACY61286.1| hypothetical protein YPD8_0596 [Yersinia pestis D182038] gi|270336224|gb|EFA47001.1| TadE-like protein [Yersinia pestis KIM D27] gi|294353211|gb|ADE63552.1| hypothetical protein YPZ3_0642 [Yersinia pestis Z176003] Length = 156 Score = 39.8 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 35/106 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + + + ++ F L + F+ ++ + Y +++ A + A + Sbjct: 1 MRDTILRLLPANRGSIAVEFTLIFILFIFMLLLVAETSRLFYTSANLDFALSEAAKTAKN 60 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + N+ FE N L + + S A Sbjct: 61 RDAENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVAD 106 >gi|332216482|ref|XP_003257380.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Nomascus leucogenys] Length = 1398 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 592 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 631 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 632 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 691 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 692 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 735 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 736 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 772 >gi|297671074|ref|XP_002813673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Pongo abelii] Length = 987 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 239 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 278 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 279 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 338 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 339 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 382 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 383 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 419 >gi|296225455|ref|XP_002758485.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 1 [Callithrix jacchus] gi|296225457|ref|XP_002758486.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 2 [Callithrix jacchus] Length = 1091 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|291393868|ref|XP_002713441.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Oryctolagus cuniculus] Length = 1352 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 516 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 555 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 556 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 615 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 616 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 659 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 660 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 696 >gi|290243156|ref|YP_003494826.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] gi|288945661|gb|ADC73359.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] Length = 615 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 38/113 (33%), Gaps = 18/113 (15%) Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 ++ +K ++ +TDG N+ +S ++ A++ +++ Sbjct: 516 STGATPMSNAILGVLPSMFARSESRKVMLVITDGAPNDSES---AMEAIRMARDVNVEMY 572 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVIL 418 I I P + + V N + V +++ + + +L Sbjct: 573 AIGIETD-------------PSHLFGVENTTVIQSV-GELAENIFGL-LTPVL 610 >gi|194389314|dbj|BAG61618.1| unnamed protein product [Homo sapiens] Length = 525 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 155 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 194 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 195 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 254 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 255 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 298 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 299 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 335 >gi|119585707|gb|EAW65303.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Homo sapiens] Length = 992 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 155 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 194 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 195 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 254 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 255 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 298 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 299 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 335 >gi|109039062|ref|XP_001082066.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Macaca mulatta] Length = 691 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|73985413|ref|XP_533789.2| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Canis familiaris] Length = 1128 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 286 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 325 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 326 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 385 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 386 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 429 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 430 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 466 >gi|54112397|ref|NP_060868.2| voltage-dependent calcium channel subunit alpha-2/delta-3 [Homo sapiens] gi|74723683|sp|Q8IZS8|CA2D3_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|22770596|gb|AAN06673.1| voltage-gated calcium channel alpha(2)delta-3 subunit [Homo sapiens] gi|187950675|gb|AAI37506.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] gi|187953583|gb|AAI37503.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] Length = 1091 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 289 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|7105926|emb|CAB75962.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 997 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 155 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 194 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 195 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 254 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 255 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 298 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 299 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 335 >gi|7024361|emb|CAB75878.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 519 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 155 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 194 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 195 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 254 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI + ++I Sbjct: 255 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 298 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 299 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 335 >gi|297663822|ref|XP_002810364.1| PREDICTED: integrin alpha-10-like [Pongo abelii] Length = 1177 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|254475237|ref|ZP_05088623.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214029480|gb|EEB70315.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 199 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 11/68 (16%), Positives = 23/68 (33%), Gaps = 8/68 (11%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L+ R F++ + S+ FA + L + I + + +A L Sbjct: 1 MLTSLRTLFRRYRRDTDGSVSVEFAFYMPLLLG-VFAAI-YTYFDAFRQ------ESANL 52 Query: 63 AGASKMVS 70 A + Sbjct: 53 KAAYTISD 60 >gi|194385372|dbj|BAG65063.1| unnamed protein product [Homo sapiens] Length = 1095 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 229 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 285 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 286 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 323 >gi|156358479|ref|XP_001624546.1| predicted protein [Nematostella vectensis] gi|156211333|gb|EDO32446.1| predicted protein [Nematostella vectensis] Length = 688 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 55/187 (29%), Gaps = 23/187 (12%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 MH ++ S E + A LK L + +G++ + + Sbjct: 5 HMHVSIVSGGEVELGFMLDTSTSLGGEANLKITLDFIKAVYGSFTISSSAYRVGVVIFGS 64 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + + S + LI T + + L S + S Sbjct: 65 SAKVAFDFSKFSSSAEIESGLSEIKLIGGATAAGQGLTTCNTALFSKARSS--------- 115 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 +K ++ L G++++ + + K + I I + + + + + L Sbjct: 116 -------AKKMLLVLIAGKSSDD------VGVASSMKTSGISIFVLGMGKAID-KTQLNM 161 Query: 382 CVSSPEY 388 S Y Sbjct: 162 MASQESY 168 >gi|119591830|gb|EAW71424.1| integrin, alpha 10, isoform CRA_a [Homo sapiens] Length = 1177 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|114558438|ref|XP_514418.2| PREDICTED: integrin alpha-10 isoform 2 [Pan troglodytes] Length = 1167 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|7385003|gb|AAF61638.1|AF172723_1 integrin alpha 10 subunit [Homo sapiens] Length = 517 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 102 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 158 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 159 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 196 >gi|3420888|gb|AAC31952.1| integrin subunit alpha 10 precursor [Homo sapiens] Length = 1167 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|38569398|ref|NP_003628.2| integrin alpha-10 precursor [Homo sapiens] gi|115502407|sp|O75578|ITA10_HUMAN RecName: Full=Integrin alpha-10; Flags: Precursor gi|6650628|gb|AAF21944.1|AF112345_1 integrin alpha 10 subunit [Homo sapiens] gi|119591831|gb|EAW71425.1| integrin, alpha 10, isoform CRA_b [Homo sapiens] gi|182887771|gb|AAI60008.1| Integrin, alpha 10 [synthetic construct] gi|189055316|dbj|BAG36921.1| unnamed protein product [Homo sapiens] Length = 1167 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|261409007|ref|YP_003245248.1| hypothetical protein GYMC10_5231 [Paenibacillus sp. Y412MC10] gi|261285470|gb|ACX67441.1| hypothetical protein GYMC10_5231 [Paenibacillus sp. Y412MC10] Length = 743 Score = 39.8 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 31/353 (8%), Positives = 79/353 (22%), Gaps = 28/353 (7%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 S + S+ +++ + + + I K E AA+ + S L + Sbjct: 18 RSSDGSVSVFLIIALAAVFMFVAIFIDFSRIAAMKVQSERLTRAAVRSVMSSYDPQLQKE 77 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + + + G + + + +S + Sbjct: 78 YGLYAHGGTSGDLIMGNVLNDSLNP-------GDRSDAFRLMAMELDSSGLEFQRPLG-- 128 Query: 136 RLDSSNNTIFYNMDVMTSYDYRL-------QFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 D N I +M D+ L + + + V + L ++ Sbjct: 129 TYDIFNRQIIEDMKYKAPIDFTLELVGKFKPLSQSM---KEASNTVDVLKKLRKLYDKRE 185 Query: 189 PIFLIELVVDLSGSMHCAMNSD-PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 L H S D PI + + D D Sbjct: 186 EALDEMLAKQRKAGEHAIRLSKLVMDPPGQPINDTSLGGSIQTAAEGASQYEDYKDKQQE 245 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + ++ + + + + Q L Sbjct: 246 DADRPP--------KEKQYTLLLMAYRSGMSSIGSQISREAERFQRENDKLLADAQQLWE 297 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + + + D + S+ + ++ ++ Sbjct: 298 EARSLNEEMKQVIKESENRSSNAGYDQVTNDVTPGSADSSGSEAEMIQSIRKQ 350 >gi|218963799|gb|ACL13562.1| CopS [Pseudomonas fluorescens] Length = 462 Score = 39.8 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 21/199 (10%), Positives = 50/199 (25%), Gaps = 18/199 (9%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI----LAGASKMVSNLSRLGD 77 S++F L+V + L + G + M + L ++++ L+ + Sbjct: 10 MSLMFMLAVTAVLTVAGLS--FNNLSRHHFKM---LDQQALNEKLHSTRRILTGLNSIDQ 64 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 + + L ++ ++ Sbjct: 65 FSDVKPELEALLGAHRDLTALIIDGDGKLLFADPGPIVVPEDFRTVTNSNVWEWRDQEQI 124 Query: 138 --DSSNNTIFYNMDVMTSYDYRLQFIEHL-------LNQRYNQKIVSFIPALLRIEMGER 188 + I D + +H+ I + + A L + Sbjct: 125 FRGVTAQAIVTGQDKPLTVMLIFDVTQHMAFFETLERWFWIGLVISALVSAALGWMVARS 184 Query: 189 PIFLIELVVDLSGSMHCAM 207 + I LV ++ SM Sbjct: 185 GLRPIRLVTQVAASMSAKS 203 >gi|319779653|ref|YP_004130566.1| hypothetical protein TEQUI_1510 [Taylorella equigenitalis MCE9] gi|317109677|gb|ADU92423.1| hypothetical protein TEQUI_1510 [Taylorella equigenitalis MCE9] Length = 424 Score = 39.8 bits (91), Expect = 0.91, Method: Composition-based stats. Identities = 10/77 (12%), Positives = 23/77 (29%), Gaps = 6/77 (7%) Query: 18 EKANFSII-FALSVMSFLLLIGFLIYV--LDWHYKKNSMESANNAAILAGASKMVSNLSR 74 K ++ AL + + +G K+ + ++A AGA L+ Sbjct: 9 RKGQVLVLGLALIAIVIISFLGM---YRNSQIISKRTKLTHVVDSAAYAGAIVQARALNM 65 Query: 75 LGDRFESISNHAKRALI 91 + + + Sbjct: 66 QAYINLAQTANQIALAN 82 >gi|296329567|ref|ZP_06872053.1| hypothetical protein BSU6633_00600 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305672764|ref|YP_003864435.1| hypothetical protein BSUW23_00335 [Bacillus subtilis subsp. spizizenii str. W23] gi|296153310|gb|EFG94173.1| hypothetical protein BSU6633_00600 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305411007|gb|ADM36125.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 245 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 34/87 (39%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKT 381 + I+ +TDG +N+ + + + AKE I + I I P + ++ Sbjct: 2 NNGHLNQILLITDGCSNHGED---PLAMAAFAKEQGITVNVIGIMEENQIDPEAMKEVEG 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + VV A L Q +++ Sbjct: 59 IALAGGGVHQVVYASQLSQTVQMVTKK 85 >gi|189350352|ref|YP_001945980.1| hypothetical protein BMULJ_01515 [Burkholderia multivorans ATCC 17616] gi|189334374|dbj|BAG43444.1| putative membrane protein [Burkholderia multivorans ATCC 17616] Length = 599 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 9/133 (6%), Positives = 32/133 (24%), Gaps = 10/133 (7%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALI 91 +++ I V + ++++ +++ + A +A M S+ + A Sbjct: 1 MIAIVVLGAIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSPAMMAAQSNALANG 60 Query: 92 DDAKRFI----------KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 D + + + I + + Sbjct: 61 FDYRASGQTLSIECGRWDTSATPYFNSTFTPLNAVSVSVTQQVPYIFLGRFFGKSGSTGA 120 Query: 142 NTIFYNMDVMTSY 154 ++ + Sbjct: 121 TVAAFSTAKAINI 133 >gi|119181220|ref|XP_001241850.1| hypothetical protein CIMG_05746 [Coccidioides immitis RS] Length = 849 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 59/228 (25%), Gaps = 8/228 (3%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + ++ ES+++ A ++ D+ IK ++ SG +I +I + Sbjct: 543 QQADTPTQSKSPQESVADTADPSVKHDSASTIKYEASKNTSGLG-----IQIVDIADELL 597 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV--SFIPALLRIE 184 S + T+ L + + S P+ + Sbjct: 598 SSNATTPQTVDSCPSTTVAGTPCTPDWDVSGLPRSSQMDVSPKKTVEIEQSSNPSSPPVA 657 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 E+ + + S ++A D T + + Sbjct: 658 HHEKHLSTSLDSSNTISSTPQKFQPGKTITDAACPSPDPSTTSFDPPRLGTPSSITDRQT 717 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPT 292 + V G + VE + + + R K T Sbjct: 718 FNSVYSGEP-GSECFHNSVEDVPSLTSSASTMTGTLPRLSAGFYAKGT 764 >gi|16077133|ref|NP_387946.1| hypothetical protein BSU00650 [Bacillus subtilis subsp. subtilis str. 168] gi|221307875|ref|ZP_03589722.1| hypothetical protein Bsubs1_00330 [Bacillus subtilis subsp. subtilis str. 168] gi|221312197|ref|ZP_03594002.1| hypothetical protein BsubsN3_00330 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221317130|ref|ZP_03598424.1| hypothetical protein BsubsJ_00330 [Bacillus subtilis subsp. subtilis str. JH642] gi|221321393|ref|ZP_03602687.1| hypothetical protein BsubsS_00330 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313735|ref|YP_004206022.1| hypothetical protein BSn5_11895 [Bacillus subtilis BSn5] gi|586890|sp|P37561|YABS_BACSU RecName: Full=Uncharacterized protein yabS gi|467454|dbj|BAA05300.1| unknown [Bacillus subtilis] gi|2632332|emb|CAB11841.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis str. 168] gi|291482437|dbj|BAI83512.1| hypothetical protein BSNT_00116 [Bacillus subtilis subsp. natto BEST195] gi|320020009|gb|ADV94995.1| hypothetical protein BSn5_11895 [Bacillus subtilis BSn5] Length = 245 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 34/87 (39%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI----NASPNGQRLLKT 381 + I+ +TDG +N+ + + + AKE I + I I P + ++ Sbjct: 2 NNGHLNQILLITDGCSNHGED---PLAMAAFAKEQGITVNVIGIMEENQIDPEAMKEVEG 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 + + VV A L Q +++ Sbjct: 59 IALAGGGVHQVVYASQLSQTVQMVTKK 85 >gi|289425109|ref|ZP_06426886.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154087|gb|EFD02775.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] Length = 322 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 66/212 (31%), Gaps = 39/212 (18%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + D T++AA K F++S+ + ++ + E + PS Sbjct: 101 DSSLSMKADDVSPTRLAAAKAKAKDFINSLPTG------FNVAVVSISEHPEIRMLPSTD 154 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 V + V T A+ ++ + + Sbjct: 155 RPTVLRAVDGIELQD---GTALGGAIDKSLE------------AVKMAPGGSKNP-APAA 198 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--------------ASPNGQRL 378 I+ L+DG+N S + AK + + TI+ +P+ + L Sbjct: 199 IVMLSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGYVDLNGQRERVAPDTKLL 255 Query: 379 LKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + + +AD L V+Q + + Sbjct: 256 STVADRTHAKSWTADSADKLQEVYQQVHSSVG 287 >gi|220927961|ref|YP_002504870.1| hypothetical protein Ccel_0508 [Clostridium cellulolyticum H10] gi|219998289|gb|ACL74890.1| hypothetical protein Ccel_0508 [Clostridium cellulolyticum H10] Length = 715 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 40/112 (35%), Gaps = 1/112 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 K ++ ++ + + L+IG ++ ++ AN AAI + + + L Sbjct: 4 HKTKGAITVFLSIVLTAIFLVIGTFTDGARLRLAQSHVQRANKAAISSVLANYNNELKDE 63 Query: 76 GDRFESISN-HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 F + + + ++ + + S Y N +++N + Sbjct: 64 YGLFGVYLDEDSLQETYEEYLSKNLGIVGKGKSMYDFSIDNIKLENPYSLEN 115 >gi|326927888|ref|XP_003210120.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Meleagris gallopavo] Length = 1069 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 54/217 (24%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 227 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 266 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 267 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNM 326 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + I+ +TDG + + TI + ++I Sbjct: 327 LNEFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 370 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 371 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 407 >gi|326431768|gb|EGD77338.1| hypothetical protein PTSG_12722 [Salpingoeca sp. ATCC 50818] Length = 829 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 22/214 (10%), Positives = 60/214 (28%), Gaps = 27/214 (12%) Query: 202 SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTT 261 C ++ + V + ++ F S+D+ + ++ ++T Sbjct: 571 PPPCLDHAAQDIVLLLDSSSSLGFSGWQDVQAFGANFARSLDIDD---NRGRLAVVAFST 627 Query: 262 RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQG 321 + ++ ++ + T + A+ + L ++ Sbjct: 628 ETRQLLDLDDVYNADLIATLIEIYPFLNGATATAAALNFVHDDLATNVAAGR-------- 679 Query: 322 VKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLL 379 I+ +TDG + + + ++V + + +LL Sbjct: 680 ----RPWLPLTIVTVTDGIPT--EPSPQVTAAVQALQAAGARLVAVGAGPAVPLSTLQLL 733 Query: 380 KTCVSSPEYHYNV---VNADSLIHVFQNISQLMV 410 P+ V + SL + IS + Sbjct: 734 -----GPDGVVQVPLVEDLASLSTAYDAISTAVC 762 >gi|307294185|ref|ZP_07574029.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] gi|306880336|gb|EFN11553.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] Length = 157 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 10/127 (7%), Positives = 29/127 (22%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ A ++ AL + +LL V K +++ A + +N Sbjct: 3 RRFFADPMGTSTVELALIMPILVLLACMAGDVAMAFKAKIALQRAAERTGQLATAGGYTN 62 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + A + + ++ + Sbjct: 63 DTSKTQAAYNNLAADAAAAAGVSTNNVTVTPTLLCDATVQTASPEVPCADGQQTKRYVAI 122 Query: 132 MANNRLD 138 + Sbjct: 123 TISGSYT 129 >gi|118096863|ref|XP_414338.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-3 subunit [Gallus gallus] Length = 1090 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 54/217 (24%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 248 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 287 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 288 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNM 347 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + I+ +TDG + + TI + ++I Sbjct: 348 LNEFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 391 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 392 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 428 >gi|304320376|ref|YP_003854019.1| hypothetical protein PB2503_04012 [Parvularcula bermudensis HTCC2503] gi|303299278|gb|ADM08877.1| hypothetical protein PB2503_04012 [Parvularcula bermudensis HTCC2503] Length = 143 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 43/128 (33%), Gaps = 19/128 (14%) Query: 6 RFRFYFKK---GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 R R + K+ +++ + ++ FAL F+LL+ + + + Sbjct: 2 RARRHSKRSFSFFGNQRGSAAVEFALVCPVFILLMTGVFSGGMVFHVR------------ 49 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + +R R ++ + D A+ ++ + ++ + + Sbjct: 50 ----ETNYLYAREAARGLALGYFNETEAKDYAEDQAEDALGIDVTVSVDPATVGDPTDQN 105 Query: 123 NSSRISMT 130 IS+T Sbjct: 106 VVVSISVT 113 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 17/222 (7%), Positives = 46/222 (20%), Gaps = 33/222 (14%) Query: 88 RALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYN 147 + ++I+ A + + + + L + Sbjct: 255 NGWDGNPTNRNNSYIEYGGEKEDADYAIRKFAKETATPGLF-------DLYLNVRGNTQK 307 Query: 148 MDVMTSYDYRLQFIEHLLNQR----------YNQKIVSFIPALLRIEMGERPIFLIELVV 197 + + + + ++ +I MG + Sbjct: 308 NITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVGYSIEGYSY 367 Query: 198 D----LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL----------KNALLLFLDSID 243 GS N S + + K ++L D + Sbjct: 368 SNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGHKKVIVLLTDGVP 427 Query: 244 LLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 S+ + V+ + + ++R D Sbjct: 428 TFSYKVQRVH--AQSSSNYYGTQFSNTQDRPGNTSLISRIYD 467 >gi|145596821|ref|YP_001161118.1| D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase [Salinispora tropica CNB-440] gi|145306158|gb|ABP56740.1| D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase [Salinispora tropica CNB-440] Length = 593 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 24/223 (10%), Positives = 50/223 (22%), Gaps = 15/223 (6%) Query: 23 SIIFALSVMSFLLLIGFLIYVLD------WHYKKNSMESANNA--AILAGASKMVSNLSR 74 + L+ + L L + V WH +SA A+LA A + Sbjct: 124 LLPAVLASVGLLALAAVGVTVARSGPVAQWHDGGADGQSAVEPPPAVLAVADVNAPEPAA 183 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 G R AL D + + ++ T ++ + A Sbjct: 184 AGVRAALDPLVNSAALGDRVNVSVTDVASGAILYDQGAADGTVPASVTKLVTAATVLAAR 243 Query: 135 NRLD-------SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + + + ++ + + L Sbjct: 244 DPGHRIPTRAVAGAAPGEVVLIGGGDPTLAVDGTGFYPGAARLDELAAQVKTALGGVAPT 303 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 R + L + E +A + Sbjct: 304 RVVVDSSLYSGPVHEPGWDDDILSEGYGAAITALMTDGARSDL 346 >gi|22127365|ref|NP_670788.1| hypothetical protein y3491 [Yersinia pestis KIM 10] gi|45442763|ref|NP_994302.1| hypothetical protein YP_3001 [Yersinia pestis biovar Microtus str. 91001] gi|108809097|ref|YP_653013.1| hypothetical protein YPA_3106 [Yersinia pestis Antiqua] gi|108810708|ref|YP_646475.1| hypothetical protein YPN_0543 [Yersinia pestis Nepal516] gi|145597780|ref|YP_001161856.1| hypothetical protein YPDSF_0470 [Yersinia pestis Pestoides F] gi|21960450|gb|AAM87039.1|AE013952_6 hypothetical [Yersinia pestis KIM 10] gi|45437629|gb|AAS63179.1| hypothetical protein YP_3001 [Yersinia pestis biovar Microtus str. 91001] gi|108774356|gb|ABG16875.1| membrane protein [Yersinia pestis Nepal516] gi|108781010|gb|ABG15068.1| putative membrane protein [Yersinia pestis Antiqua] gi|145209476|gb|ABP38883.1| membrane protein [Yersinia pestis Pestoides F] Length = 157 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 35/106 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + + + + + ++ F L + F+ ++ + Y +++ A + A + Sbjct: 2 MRDTILRLLPANRGSIAVEFTLIFILFIFMLLLVAETSRLFYTSANLDFALSEAAKTAKN 61 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAV 112 + N+ FE N L + + S A Sbjct: 62 RDAENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVAD 107 >gi|194334882|ref|YP_002016742.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312700|gb|ACF47095.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 336 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 49/159 (30%), Gaps = 28/159 (17%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 DV+++ QD + ++ K + +S L+ + + Sbjct: 96 DVSNSMRAQDVRPDRLTRAKR-------ELVEVSRRVGRGRRSLVVFAGSAALQCPLTAD 148 Query: 273 TEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + + L+ TD A++ A + L S K RS + Sbjct: 149 QAVFETMLDIASPELVELQGTDLGGALRLAGKTLDSGKGRSSL----------------Q 192 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 I+ +DGE++ ++ + I + Sbjct: 193 VIVMASDGEDHVGAGAAVAAELAV----RGTNVFVIGVG 227 >gi|255557524|ref|XP_002519792.1| protein binding protein, putative [Ricinus communis] gi|223541031|gb|EEF42588.1| protein binding protein, putative [Ricinus communis] Length = 436 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 56/213 (26%), Gaps = 41/213 (19%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM--GL 256 S P + MA L+ + L + + + + Sbjct: 2 CFCLDWHMTGSRPTPYVPGRVQLISINNNMAPLEESKLKVMLVFTGGDSTNDRPGLVPSI 61 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL-ILKPTDSTPAMKQAYQILTSDKKRSFFT 315 I + + + TE + + ++ L T+ T ++ ++L Sbjct: 62 ICRLSHSLRMLTLRQITENSQNDLEILINGLHADAATNITAGLQTGLRVLNDRSLSGGRV 121 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 I+ ++DGE N I + Sbjct: 122 VD---------------IMLMSDGEQNAGDD------------AAQIPV---------GN 145 Query: 376 QRLLKTCVSS--PEYHYNVVNADSLIHVFQNIS 406 +LK + +V N D+L F +++ Sbjct: 146 MPVLKAIADNSMGGTFSDVQNTDNLSKAFSDLT 178 >gi|85713498|ref|ZP_01044488.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] gi|85699402|gb|EAQ37269.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] Length = 189 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 20/198 (10%), Positives = 51/198 (25%), Gaps = 9/198 (4%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + FA+ V L+++ I V A N + A + S Sbjct: 1 MRRDSRGTAATEFAIIVPLMLVMLFGTIEVSSGV--------AVNRKVTLVARTLSDLTS 52 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + A + ++ ++S + + + + + Sbjct: 53 QSRGVNDADVTNFLAASYGIMWPYPSGPVQATISELYIDPATSVARVQWSKGKAPRGTGS 112 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 + S + V+ + L + ++S L RP Sbjct: 113 TVGIPSGLIARDSSGKVLPNQYLIFSE-VSYLYKPILGYVMSKAGITLSDATYTRPRKFS 171 Query: 194 ELVVDLSGSMHCAMNSDP 211 + S + Sbjct: 172 CVTYPTPPSGNFPPCPTN 189 >gi|218506872|ref|ZP_03504750.1| hypothetical protein RetlB5_04304 [Rhizobium etli Brasil 5] Length = 41 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 6/20 (30%), Positives = 10/20 (50%) Query: 14 GIASEKANFSIIFALSVMSF 33 I+ NF I+ AL ++ Sbjct: 8 FISDRSGNFGIMTALLMVPL 27 >gi|325677636|ref|ZP_08157288.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110604|gb|EGC04768.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 549 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 39/389 (10%), Positives = 104/389 (26%), Gaps = 60/389 (15%) Query: 22 FSIIFALSV-MSFLLLIGFLIYVLD-WHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 F++ A+ + ++ G N + G + + Sbjct: 9 FAMAAAIMMTVTMFASCGSASDSTYKAENDTNITNNDV-----KGGKSPENITADAETDG 63 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + + + A+ + ++ + S + +T ++ +S ++ M NR D Sbjct: 64 GEFNYYTEEAIPEINTEEYNHYAENSFQSVAEHPLSTFSTDVDTASFTNVRRMIENRQDI 123 Query: 140 SNNTIFYNMDVMT--SYDYRLQFIEHLLNQ-------RYNQKIVSFIPALLRIEMGERPI 190 + + + + + + ++ + + + + Sbjct: 124 FADAVRTEEFINYFKYDYPQPDNDDKIGITTELSDCPWNDESKLMLVGLQAKDIDVQDID 183 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 I ++D+SGSM K+ + A + +++ + Sbjct: 184 SNIVFLIDVSGSM-------------------GDANKLPLVAQAFAMLAENLGQNDRISI 224 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 Y G VE E K+ + + + AY Sbjct: 225 VTYAGR----DTVELEGESGANYSKIAETLAGLTAGGST---AGAAGINTAY-------- 269 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKENFIKIVTISI 369 K +I TDG+ N S+ + ++ ++ I + + Sbjct: 270 --------ELAQKYFIEGGNNRVILATDGDLNVGLSSQEELTALIEEKRDMGIFLSVLGF 321 Query: 370 NASPNGQRLLKTCVSSP-EYHYNVVNADS 397 L+ + + + + D Sbjct: 322 GMGNYKDSRLEALADNGNGNYAYIDSIDE 350 >gi|257470753|ref|ZP_05634843.1| von Willebrand factor type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] gi|317064958|ref|ZP_07929443.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313690634|gb|EFS27469.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 376 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-NIEPSWGTEKVRQYVTRDMDS 286 + K + ++ + +H V +GL+ Y R + + + E + + + MD Sbjct: 56 IQGAKTKIWSIVNEVM-QNHKDSKVKIGLVAYRDRGDVYVTKVTQLNENLDEIYSVLMDY 114 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN-NNFK 345 D +++A + S P + I + D +++ Sbjct: 115 KAQGGGDDPEDVRKALHESLEIIQWSA-----------PRENLSQIIFLVGDAPPHDDYN 163 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVV 393 + +T+ KAK I I TI P+ R K ++++ Sbjct: 164 DSPDTVVTAKKAKSKGIIINTIQCGNMPSTDRYWKAIAQFGGGEYFHIS 212 >gi|53804162|ref|YP_113994.1| hypothetical protein MCA1539 [Methylococcus capsulatus str. Bath] gi|53757923|gb|AAU92214.1| conserved domain protein [Methylococcus capsulatus str. Bath] Length = 597 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 50/156 (32%), Gaps = 18/156 (11%) Query: 151 MTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 L + IP++L G + I E + SG + + +D Sbjct: 373 APDVAIVLDASGSMR-----------IPSVLDGN-GAQLIARFERCMMDSGLLAPVVCAD 420 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 + I + T++ A + A+ + + DV +GL+ + + Sbjct: 421 LIGAYESIIQGGRGPTRLMAAQKAVNNVVGGL------PGDVDVGLVVLEDCPQASDYGM 474 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + Q + R + K T + QA + Sbjct: 475 YDGARRGQLLQRVNGLIPRKGTPLGNGIVQAANKVD 510 >gi|148657455|ref|YP_001277660.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569565|gb|ABQ91710.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 429 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 25/74 (33%), Gaps = 3/74 (4%) Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG--QRLLKTCVSSPEYHY 390 I+ ++DG+ V T A +++ I N +RL ++ Sbjct: 279 IVLVSDGDETCGGDPVATAAALHTA-NPRLRVSVIGFNIEQEEWRRRLEGIAAYGGGAYF 337 Query: 391 NVVNADSLIHVFQN 404 + NA L + Sbjct: 338 DAANAVQLADALEQ 351 >gi|149034246|gb|EDL89016.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_b [Rattus norvegicus] Length = 924 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 72 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 111 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 112 DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNV 171 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI E ++I Sbjct: 172 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPERKVRI 215 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 216 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 252 >gi|149034245|gb|EDL89015.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Rattus norvegicus] Length = 930 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 72 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 111 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 112 DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNV 171 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI E ++I Sbjct: 172 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPERKVRI 215 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 216 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 252 >gi|170724845|ref|YP_001758871.1| outer membrane adhesin-like protein [Shewanella woodyi ATCC 51908] gi|169810192|gb|ACA84776.1| outer membrane adhesin like proteiin [Shewanella woodyi ATCC 51908] Length = 3259 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 14/181 (7%), Positives = 39/181 (21%), Gaps = 14/181 (7%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 N + + + + +N S+ + + +T Sbjct: 2771 NGSANLSNGFSYTVNDGNGDTDTADVE---ILVFAHQTLNPGDDSLNITGDTQDLVIGDT 2827 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYN-------QKIVSFIPALLRIEMGERPIFLIELV 196 + + + + + N + + G L++ Sbjct: 2828 TGIIPGQNYNIAFIIDTSGSMGSTAVNTAEQQLLTVYSQLLSYAEQNNSGVINTLLVDFD 2887 Query: 197 VDLSGSMHCAMNSDPE----DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 D S + + A + A A F + + ++ Sbjct: 2888 TDASLLISINLTDPNALQLLSNALATMTSGGATNYYDAFSTAYDWFQNGVPTTNNGNNIT 2947 Query: 253 Y 253 Y Sbjct: 2948 Y 2948 >gi|28212256|ref|NP_783185.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Rattus norvegicus] gi|81871225|sp|Q8CFG5|CA2D3_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|27450708|gb|AAO14654.1|AF486278_1 calcium channel alpha-2 delta-3 subunit [Rattus norvegicus] Length = 1085 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 55/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 249 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 288 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 289 DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNV 348 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + TI E ++I Sbjct: 349 LSDFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPERKVRI 392 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 393 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 429 >gi|226943318|ref|YP_002798391.1| type IV pilus assembly protein, PilY1-like protein [Azotobacter vinelandii DJ] gi|226718245|gb|ACO77416.1| type IV pilus assembly protein, PilY1-like protein [Azotobacter vinelandii DJ] Length = 1036 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 4/55 (7%) Query: 359 ENFIKIVTISI---NASPNGQRLLKTCV-SSPEYHYNVVNADSLIHVFQNISQLM 409 + + I + AS + RLLK+ +++ NAD + I + Sbjct: 274 KQNVLTYAIDVYNRQASEDHSRLLKSMAHVGGGKYFSATNADEIETALSTIFSEI 328 >gi|77359907|ref|YP_339482.1| hypothetical protein PSHAa0961 [Pseudoalteromonas haloplanktis TAC125] gi|76874818|emb|CAI86039.1| conserved protein of unknown function; putative TPR domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 625 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 46/192 (23%), Gaps = 23/192 (11%) Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 + + D ++ + L +D L+ Y + Sbjct: 97 SYSMYSTDILPNRLMQARFKALDMIDLFKEGD-------TALVAYAGSAYTISPLTNDAT 149 Query: 275 KVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 + + + + K ++ + A ++L G+ Sbjct: 150 TLENLIPSLSPEIMPDKGSNVLAGLDIAKELLGQAGYLDGDIILITDGIDQQEQSEVSTF 209 Query: 334 IFLTDGENNNFKSNVNTIKICDK-----AKEN--FIKIVTISINASPNGQRLLKTCVSSP 386 T N + K++ I + TI N +L +S Sbjct: 210 TKNTQYRLNIYGVGTAQGAPIKLPEGGFLKDHYGQIVVPTI------NTSQLKSLATNSG 263 Query: 387 EYH--YNVVNAD 396 Y +AD Sbjct: 264 GKFASYQPSSAD 275 >gi|307292637|ref|ZP_07572483.1| TadE family protein [Sphingobium chlorophenolicum L-1] gi|306880703|gb|EFN11919.1| TadE family protein [Sphingobium chlorophenolicum L-1] Length = 209 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 13/155 (8%), Positives = 35/155 (22%), Gaps = 11/155 (7%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG------ 64 + + ++ FALS+ L L + + + Y + + A L Sbjct: 6 LHRLWPNRSGVAAVEFALSLPILLGLTMYSMEAANMAYTSQKLG---DIATLTADSVSRI 62 Query: 65 --ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + LG + R ++ + S + + + Sbjct: 63 RLSISNGDLTDALGGMKILGDSIDLRNRGRIIVSSVQPVLDSSGNVTNQKVRWQRCTGAL 122 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + + + + Sbjct: 123 IKDSPYVVNANLGTAGIGATGRKIAAAKDSELIFV 157 >gi|297565533|ref|YP_003684505.1| putative cytoplasmic protein [Meiothermus silvanus DSM 9946] gi|296849982|gb|ADH62997.1| putative cytoplasmic protein [Meiothermus silvanus DSM 9946] Length = 407 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 23/76 (30%), Gaps = 2/76 (2%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ +K A ++L G I G + + T+ Sbjct: 287 TNTAEGIKLARKMLKKMGGEMKQIIMITDGKPSALTLPSGQIYKNAWGLDPVILA--ETL 344 Query: 352 KICDKAKENFIKIVTI 367 K A++ I I T Sbjct: 345 KEATLARKEGIPIHTF 360 >gi|291240911|ref|XP_002740358.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1126 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 41/163 (25%), Gaps = 29/163 (17%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 ++ + L D + + R + V + Sbjct: 889 ELVRELVSLIWDQFQRENIKFNIIRFSGNVEKWRHQIVDPLEENCH---DAVRWVSTFVA 945 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 T + A+ +A+ + + LTDG+ ++ S + Sbjct: 946 SGNTCTLEALYEAF-----------------------NDRNIDGVYLLTDGKPDSSTS-L 981 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYH 389 +I +KI TIS N L + S Sbjct: 982 VLKEIARLNTTRGVKIHTISFNCQDESANIFLRQLSAMSRGRF 1024 >gi|291223811|ref|XP_002731901.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 682 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 47/146 (32%), Gaps = 28/146 (19%) Query: 162 EHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQ 221 + + + ++ P + + + I LV+D+SGSM+ Sbjct: 294 DFIDGNNPPRYVLDTTPDFIIV---KTSPLRIVLVLDVSGSMN----------------- 333 Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP-SWGTEKVRQYV 280 ++ L + F+ V + ++G++ + + + R + Sbjct: 334 --SNNRIGILNSLATKFIKYT-----VPDGHFVGIVEFESHATIVSNLIELTSNTTRDNL 386 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILT 306 + + S T ++ Q LT Sbjct: 387 SSLVPSYTRGGTCIGCGLQSGIQELT 412 >gi|163751748|ref|ZP_02158966.1| hypothetical protein KT99_12264 [Shewanella benthica KT99] gi|161328400|gb|EDP99559.1| hypothetical protein KT99_12264 [Shewanella benthica KT99] Length = 152 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 15/143 (10%), Positives = 44/143 (30%), Gaps = 4/143 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ +++ FA+ F +L+ +I V Y N + A+ A + + Sbjct: 1 MKYQRGVYAVEFAIVAGIFFMLMFAIIEVGRLMYTYNVLHEASRRAARIAVVCRIDD--- 57 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 D + + + + + L + ++ + + + Sbjct: 58 -TDIKTMALFNGANLIPNLTTANLSISYLDELGNAATGIDIDLVRADIANYQHQFLVPGL 116 Query: 135 NRLDSSNNTIFYNMDVMTSYDYR 157 +R+ +S Y + Sbjct: 117 SRVINSPTFTAYLPRESLGVYHV 139 >gi|85374480|ref|YP_458542.1| hypothetical protein ELI_08265 [Erythrobacter litoralis HTCC2594] gi|84787563|gb|ABC63745.1| hypothetical protein ELI_08265 [Erythrobacter litoralis HTCC2594] Length = 233 Score = 39.4 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 11/108 (10%), Positives = 29/108 (26%), Gaps = 2/108 (1%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 H R + + + + FA LL+ I + + + A Sbjct: 6 HSFLRRIHALCRIVRDTRGVALVEFAFISPIILLMGVVGIEMANQAVVNMRISQAAMH-- 63 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 +A + + + L + + + + + E+ Sbjct: 64 IADNASRIGDRDSLVAQKIYEGDINDLFIGVGIQAGNGIDLFENGRIV 111 >gi|218671458|ref|ZP_03521128.1| hypothetical protein RetlG_07263 [Rhizobium etli GR56] Length = 94 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 7/43 (16%), Positives = 16/43 (37%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVL 44 + +R ++ I K +I FA+ ++L + Sbjct: 5 NPFTRLALTARRLIRERKGAGAIEFAILFPVLVMLYIGAFEIT 47 >gi|283769328|ref|ZP_06342227.1| TadE-like protein [Bulleidia extructa W1219] gi|283103985|gb|EFC05369.1| TadE-like protein [Bulleidia extructa W1219] Length = 189 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 7/51 (13%), Positives = 17/51 (33%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 + + E I FAL + + ++ + V+ K + + Sbjct: 16 LLKKITRFLKREDGQSFIEFALVLPILITVLSVVFDVVRIVDAKMVLNNVA 66 >gi|313107199|ref|ZP_07793398.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa 39016] gi|310879900|gb|EFQ38494.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa 39016] Length = 1163 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 25/261 (9%), Positives = 52/261 (19%), Gaps = 29/261 (11%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F +N + + ++G + Sbjct: 252 DFTGQHRVNFFNWLENLSVGGGTPLRQAMTRAGEFLKKTGVNGPYAYR--PGTQASPEYS 309 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 L N + + D S + Sbjct: 310 CRGSYHILMTDGLWNNDSASVGNADSTSRS--------LPDGKSYSSQTPYRDAASNTLA 361 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + TD+ P + + ++ + + P+ L Sbjct: 362 -----DQAFHYWATDARPDIDDNIKPYIPYPDQANPSAEYWNPRNDPATWQHMVTYTLGL 416 Query: 339 GENNNFKSN---------VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYH 389 G + S + N S N L V+S Sbjct: 417 GLTTSLTSPKWEGSTYSGGYDEIAAGRLNWPNASN-----NHSNNVYDLWHAAVNSRGEF 471 Query: 390 YNVVNADSLIHVFQNISQLMV 410 ++ + D L+ FQ+I + Sbjct: 472 FSADSPDQLVAAFQDILNRIS 492 >gi|301778383|ref|XP_002924606.1| PREDICTED: integrin alpha-10-like [Ailuropoda melanoleuca] Length = 1175 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|281353330|gb|EFB28914.1| hypothetical protein PANDA_013982 [Ailuropoda melanoleuca] Length = 1128 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 246 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFL 302 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 303 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 340 >gi|269128333|ref|YP_003301703.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268313291|gb|ACY99665.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 155 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 14/136 (10%), Positives = 34/136 (25%), Gaps = 3/136 (2%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + + S+ L F+L + L V + ++ ++ A A+ A + Sbjct: 6 RRAICRAGRDDGGGMSLELVLMTPIFVLFLMLLAAVGRYVDAQSQIDGAARDAVRAASVA 65 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + +R + + G T + + Sbjct: 66 RSAEQARRLAHRVGTDTLRGTHWCRGGPAIATDTRQWGPGGR---VAVTVTCTVDLGDLM 122 Query: 128 SMTHMANNRLDSSNNT 143 + A + Sbjct: 123 FLGLPATRTFTGNAVA 138 >gi|260810520|ref|XP_002600012.1| hypothetical protein BRAFLDRAFT_74129 [Branchiostoma floridae] gi|229285296|gb|EEN56024.1| hypothetical protein BRAFLDRAFT_74129 [Branchiostoma floridae] Length = 879 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 20/221 (9%), Positives = 61/221 (27%), Gaps = 15/221 (6%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 + + + ++ S + +V L D Sbjct: 208 KIVNGLPWTCRMNHSSGIYEETCHFNVYHFINEATGSYMCYQYLAEVTDFCHSDPDGDPL 267 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGT-EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + + + +W K R + + + T + M Q Sbjct: 268 SYHNREAPNVHNAMC---QHQSTWDVMNKHRDFASGANPPRAVDSTQPSFIM---LQESE 321 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + + + ++ P I+ +T G+ + + + + + + T Sbjct: 322 PRRVLVLDMSATMEALEAVGHPQGGRILLVTAGQEDG---DPRIRDVMPDLRRKKVIVDT 378 Query: 367 ISINASPNGQRLLKTCV--SSPEYHYN-VVNADSLIHVFQN 404 I+ + LL++ + + ++ + ++ +L F Sbjct: 379 IAYGRDAD--PLLQSLAQETGGDSFFSGIHDSAALDSAFTA 417 >gi|308509926|ref|XP_003117146.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] gi|308242060|gb|EFO86012.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] Length = 713 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 14/158 (8%), Positives = 38/158 (24%), Gaps = 18/158 (11%) Query: 228 MAALKNALLLFLDS---IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE--KVRQYVTR 282 M + + + I + ++ Y + + Q + + Sbjct: 49 MDGVYDTIQAMFGQSVRIGAGHPDPRSTRVAIVTYNEVAKVEAGFDEFKSLTALNQELEK 108 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + + ++K I+ T +N Sbjct: 109 LNATQKSDS----------FDAFMDLGLSAANNLITAANRANDRKQYKKLIVLFT---SN 155 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLK 380 N + +++ I I T+ A +G + Sbjct: 156 YSFKNQRPDVLALSIRQSGIDISTVYTGAGTSGNTAFQ 193 >gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli Brasil 5] Length = 784 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 52/194 (26%), Gaps = 27/194 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + +L L + + + Y + + Sbjct: 362 NSGSMSGQSIEQARQSLALAISRLSKDDRFNVIRFDDTMTDYFNGLVAASP-----DNRE 416 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T T+ PA++ A + + + ++FLT Sbjct: 417 KAITYVRGLSADGGTEMLPALEDALRNQGPVASGAL-----------------RQVVFLT 459 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG +I A ++ T+ I ++PN + K + + D Sbjct: 460 DGAI--GNEQQLFQEI--SANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQ 515 Query: 398 LIHVFQNISQLMVH 411 + + + + Sbjct: 516 VASRMGELFAKLQN 529 >gi|325677343|ref|ZP_08157008.1| hypothetical protein HMPREF0724_14791 [Rhodococcus equi ATCC 33707] gi|325551806|gb|EGD21503.1| hypothetical protein HMPREF0724_14791 [Rhodococcus equi ATCC 33707] Length = 734 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 44/182 (24%), Gaps = 5/182 (2%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--NIEPSWGTEKVRQYV--T 281 T + LK+A +D+ + V Sbjct: 139 TNLPILKSATDQLVDAFVGTPSRMAVFGFARYSPNQNATTNEIYPNHPELQSVSTQAGAD 198 Query: 282 RDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 S QA + + T G + DG N Sbjct: 199 EFKSQYADWGLGSGTNWDQALYRVATSGVHYDATIVLTDGNPTRWGLQGPPVTPSGDGSN 258 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 +F I + K+N ++V+ I +G L S + Y+ N Sbjct: 259 THFADVEEAIFSANLLKKNETRVVSFGIGRGVSGVSGLNLAAISGQTAYDGTNVVD-ADY 317 Query: 402 FQ 403 FQ Sbjct: 318 FQ 319 >gi|117919536|ref|YP_868728.1| undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] gi|117611868|gb|ABK47322.1| Undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] Length = 1223 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 9/63 (14%), Positives = 23/63 (36%), Gaps = 2/63 (3%) Query: 355 DKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 + A + ++ TI + L +T + +++ +A L Q ++ + Sbjct: 471 NSAGKQTVRTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEK 530 Query: 413 KYS 415 S Sbjct: 531 NAS 533 >gi|163849427|ref|YP_001637471.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527431|ref|YP_002571902.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670716|gb|ABY37082.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451310|gb|ACM55576.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 851 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 55/208 (26%), Gaps = 38/208 (18%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 Y L + Q ++ + D S Sbjct: 405 RPPVQYVVILDATGSMSMNFNGQGKLN-----GQTRQCITGPPGWPFPDDPSCDGSPNFA 459 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL-----LSHVKEDVYMGLIGYTTRV 263 P + ++ K A+ L + ++ + M ++ + V Sbjct: 460 WTPREE-----------RRIYVAKEAVRLLIKQTNMPGNPGYDPTRPIDQMAIVWFNQDV 508 Query: 264 EK----NIEPSWGTEKVRQYV-----TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 + S + + V + L T++ A + Q+L Sbjct: 509 PESRIMLGGFSSDPAALDRAVLDAGKVNNDPYLSQGGTNTAGAFYRVGQLL-----GRAP 563 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + G + +++ II +TDG N Sbjct: 564 TGTTQLGREWT---YRRAIILVTDGLAN 588 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 4/45 (8%), Positives = 17/45 (37%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNS 52 R + + + + + AL ++ + ++ + V ++ Sbjct: 4 RTHTYRPLKRVRGQSIPLLALMIVVLIGMVALSVDVGRTFSEERR 48 >gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing protein [Rhizobium etli CNPAF512] Length = 794 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 52/194 (26%), Gaps = 27/194 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + +L L + + + Y + + Sbjct: 362 NSGSMSGQSIEQARQSLALAISRLSKDDRFNVIRFDDTMTDYFNGLVAASP-----DNRE 416 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T T+ PA++ A + + + ++FLT Sbjct: 417 KAITYVRGLTADGGTEMLPALEDALRNQGPVASGAL-----------------RQVVFLT 459 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG +I A ++ T+ I ++PN + K + + D Sbjct: 460 DGAI--GNEQQLFQEI--SANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQ 515 Query: 398 LIHVFQNISQLMVH 411 + + + + Sbjct: 516 VASRMGELFAKLQN 529 >gi|227326308|ref|ZP_03830332.1| hypothetical protein PcarcW_02944 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 190 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/119 (9%), Positives = 31/119 (26%), Gaps = 6/119 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA------GAS 66 + S + + A V L+ + L + + +A + A+ A + Sbjct: 29 RHWRSTRGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAFRLDNLASD 88 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +RL R + +++ + + ++ Sbjct: 89 SAEQMGARLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNGSQDNPNGEETTT 147 >gi|170571356|ref|XP_001891697.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158603658|gb|EDP39502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 319 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 37/122 (30%), Gaps = 17/122 (13%) Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWG-TEKVRQYVTRDMDSLILKPT-DSTPAMKQAYQIL 305 + + LI +++ + + + + VT T ++ + Sbjct: 204 GRRFSRIALITFSSVGKTRTQFNLDRYFNGKDIVTAIRRLESSGGTTAVGEGIRLGIEQ- 262 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 +Q P+ +K ++ TDG +N ++ AK + Sbjct: 263 -----------KDKQHGGRPNEIAKKAMLVFTDGWSNKGPD---VEEMSRNAKGAGFTLY 308 Query: 366 TI 367 T+ Sbjct: 309 TV 310 >gi|124006669|ref|ZP_01691501.1| domain of unknown function protein [Microscilla marina ATCC 23134] gi|123987824|gb|EAY27515.1| domain of unknown function protein [Microscilla marina ATCC 23134] Length = 3238 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 59/266 (22%), Gaps = 28/266 (10%) Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + Y L + + + + M Sbjct: 2205 EPPIEVIYVLDMSGSMKW---EYPKTDDAGKTISRFRAAQDALIYANSALAQQGMSSRSA 2261 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 + +A + L + + G + E Sbjct: 2262 LIVFNDTTAQVMSGFTNN-FQQLNSIVENLGA--------PNGGTPMSKGMLSAKELLKT 2312 Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLP 328 S + V +T + + L+ A I + + + + Sbjct: 2313 RSADKKPVVVLITDGVPTYDLRNFPYDHLKVSAVDIFDPVAQVFRIKEVVAKMGGLNT-- 2370 Query: 329 FQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FIKIVTISINASP------NGQRLLK 380 + N + + D+ K I + I+ +L+ Sbjct: 2371 -----WISSSYGLYNGQVLSEVMGAIDEIKAETPDALIYGLGIDQDAGNNEASFNDDILE 2425 Query: 381 TCV-SSPEYHYNVVNADSLIHVFQNI 405 + ++ V NA+S+I NI Sbjct: 2426 YGAHKTGATYHTVDNAESMIKAMANI 2451 >gi|221067366|ref|ZP_03543471.1| TadE family protein [Comamonas testosteroni KF-1] gi|220712389|gb|EED67757.1| TadE family protein [Comamonas testosteroni KF-1] Length = 149 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 7/34 (20%), Positives = 17/34 (50%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYK 49 SE+ + FAL+++ FL+ + ++ + Sbjct: 8 RSERGATIVEFALALLVFLMFLFGIVDFSRMLFT 41 >gi|16127180|ref|NP_421744.1| hypothetical protein CC_2950 [Caulobacter crescentus CB15] gi|221235981|ref|YP_002518418.1| TadG-like pilus assembly protein [Caulobacter crescentus NA1000] gi|13424578|gb|AAK24912.1| hypothetical protein CC_2950 [Caulobacter crescentus CB15] gi|220965154|gb|ACL96510.1| TadG-related pilus assembly protein [Caulobacter crescentus NA1000] Length = 183 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/124 (8%), Positives = 38/124 (30%), Gaps = 4/124 (3%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 R + +++ ++ FAL + FL+L+ +I + ++E+ A++ Sbjct: 11 RLRLARSAGRFARADEGATAVEFALVAIPFLMLLFAIIELGLVFLVSITLEN----AVID 66 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + + A + + ++ + ++ Sbjct: 67 AGRTIRTGEVQTTGGNANSFKTAVCNRMSWLGSKCSSALRLDVRTFTDYATGQASATNTT 126 Query: 124 SSRI 127 Sbjct: 127 VPTT 130 >gi|259416592|ref|ZP_05740512.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348031|gb|EEW59808.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 203 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 10/115 (8%), Positives = 27/115 (23%), Gaps = 15/115 (13%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFL---IYVLDWHYKKNSMESANNA 59 + + ++ E N ++ AL + L + + + Sbjct: 1 MFRKITHKLQEFRRKEDGNIALEAALYLPLLLGVFAATYTLFDLFRQETVNSK------- 53 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A+ VS+L + + + + +S Sbjct: 54 -----AAYTVSDLISRETAALNDDYIDSIYTLGKLMARAGSDMSMRVSVIRWDAD 103 >gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] Length = 794 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 52/194 (26%), Gaps = 27/194 (13%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVK-EDVYMGLIGYTTRVEKNIEPSWGTEKVR 277 + + +L L + + + Y + + Sbjct: 362 NSGSMSGQSIEQARQSLALAISRLSKDDRFNVIRFDDTMTDYFNGLVAASP-----DNRE 416 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T T+ PA++ A + + + ++FLT Sbjct: 417 KAITYVRGLSADGGTEMLPALEDALRNQGPVASGAL-----------------RQVVFLT 459 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG +I A ++ T+ I ++PN + K + + D Sbjct: 460 DGAI--GNEQQLFQEI--SANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQ 515 Query: 398 LIHVFQNISQLMVH 411 + + + + Sbjct: 516 VASRMGELFAKLQN 529 >gi|145611772|ref|XP_369110.2| hypothetical protein MGG_00134 [Magnaporthe oryzae 70-15] gi|145019021|gb|EDK03300.1| hypothetical protein MGG_00134 [Magnaporthe oryzae 70-15] Length = 923 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 51/168 (30%), Gaps = 21/168 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +++AL + + MGL+ + + T K ++S Sbjct: 397 KINLVRDALRFMVHQLGDRD------RMGLVTFGSGGGGV-PIVGMTTKSWHGWGGVLNS 449 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + S A + + K + I+ ++D ++ S Sbjct: 450 IKPVGQKSHRA---------DVVEGANVAMDLLMQRKYNNPIAT--IMLISDASTSDADS 498 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK I I + + + +++ + + V + Sbjct: 499 VDFVVSRAEAAK---ISIHSFGLGMTHKPDTMIELSTRTKASYTYVKD 543 >gi|85859137|ref|YP_461339.1| flp pilus assembly protein family protein [Syntrophus aciditrophicus SB] gi|85722228|gb|ABC77171.1| flp pilus assembly protein family [Syntrophus aciditrophicus SB] Length = 146 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 15/140 (10%), Positives = 35/140 (25%), Gaps = 5/140 (3%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + K + FA+ + L+L+ +I Y K + +A+ AG + Sbjct: 7 MRKLREHKGAAVVEFAIVLPILLVLVFGIIEFGILIYNKQVLTNASREGARAGIVYIDGT 66 Query: 72 LSRLGDRFESISN--HAKRALIDDAKRFIKNHIKESLSGY--SAVFYNTEIQNIVNSSRI 127 + +D + I + + V+ Sbjct: 67 SRVPAGNASNSDTIKGIVNDYANDYLITFGSSIPLNTDVEFPEGQDSGDPLIVTVSYGYS 126 Query: 128 SMTHMANNRLDSSNNTIFYN 147 + T+ Sbjct: 127 FI-LFPVTDYPIKARTVMKY 145 >gi|332237897|ref|XP_003268143.1| PREDICTED: integrin alpha-10 isoform 1 [Nomascus leucogenys] Length = 1175 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 271 GRPEAARLLVVVTDGESHDGEQLPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFL 327 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 328 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 365 >gi|291296803|ref|YP_003508201.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471762|gb|ADD29181.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 406 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 23/76 (30%), Gaps = 2/76 (2%) Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ +K A ++L G I G + + T+ Sbjct: 286 TNTAEGLKLARKMLRKMSGEMKQIIMITDGKPSALTLPSGQIYKNAWGLDPVILA--ETL 343 Query: 352 KICDKAKENFIKIVTI 367 K A++ I I T Sbjct: 344 KEATLARKEGIPIHTF 359 >gi|149202126|ref|ZP_01879099.1| hypothetical protein RTM1035_12403 [Roseovarius sp. TM1035] gi|149144224|gb|EDM32255.1| hypothetical protein RTM1035_12403 [Roseovarius sp. TM1035] Length = 195 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 8/193 (4%), Positives = 37/193 (19%), Gaps = 18/193 (9%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFL---IYVLDWHYKKNSMESANNAAILAGA 65 ++ E ++ + + + F + + + Sbjct: 8 KLLRRFWKQETGTAAMETVVMFPFLFMGLTFSYEYFDMFRYQSVREKATYTV------AD 61 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +++ DD ++ + + E++ Sbjct: 62 MLSRETSEVNEAYIDNVKVLFDIMTNDDGNNQVRVTVVRYHVDATNNIDEFELRWSEVRG 121 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 + ++ + + + + M + V + Sbjct: 122 SGDLNPLSADDVR-NAHATLPQMLNGQEIILV--------ETSSDYDPVFSTGFTDGTNI 172 Query: 186 GERPIFLIELVVD 198 R + Sbjct: 173 KTRMFVPLRFASQ 185 >gi|160900629|ref|YP_001566211.1| FG-GAP repeat-containing protein [Delftia acidovorans SPH-1] gi|160366213|gb|ABX37826.1| FG-GAP repeat protein [Delftia acidovorans SPH-1] Length = 1182 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 22/290 (7%), Positives = 48/290 (16%), Gaps = 43/290 (14%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV-------DLSGSMHCAMNSDPE 212 + + +FI + + G + LS + A + Sbjct: 145 NTNSMR-PLQSTHRANFISFISNLNPGGGTPSHLMFSQADSYMRRGLSKNSPWASDPGNT 203 Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN---IEP 269 ++ N + + + Y + Sbjct: 204 GAPYLACRRNYHIMMTDGRWNGTATGVPDSTRRDNATNLTLSDNMVYGGSGAADQKKSAL 263 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 T S + + + S Sbjct: 264 YRDTSTGTTLADWAFYSWATPMQTTGL-----TGTMQPTADYRKAPSVETFTNDSGSSVT 318 Query: 330 QKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT-----ISINASP----------- 373 TI A + +T + Sbjct: 319 LDRYWNPRYNPATWPHMVTYTIGFSKMAFDWSYPSITRPTQMVPFGYDGSFPDLAKGSTT 378 Query: 374 -----------NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 N L ++ Y V + L F+ I + + Sbjct: 379 WPNLTASTESRNALDLWHAALNGRGRFYAVEKGEDLEKAFREIFGQINTQ 428 >gi|224066048|ref|XP_002192868.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha2/delta subunit 3 [Taeniopygia guttata] Length = 1090 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 54/217 (24%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 248 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 287 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 288 DDFFNIIAYNEELHYVEPCLNGTLVQADRANKEHFREHLDKLFAKGIGMLDIALNEAFNM 347 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L + I+ +TDG + + TI + ++I Sbjct: 348 LNEFNHTG------------QGSICSQAIMLITDGAVDTYD----TIFAKYNWPDRKVRI 391 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 392 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 428 >gi|254430946|ref|ZP_05044649.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] gi|197625399|gb|EDY37958.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] Length = 2003 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 31/335 (9%), Positives = 73/335 (21%), Gaps = 38/335 (11%) Query: 87 KRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFY 146 D + + + I +++ + Sbjct: 803 IYLAQDHSAVDNAVTASLASGANDRATLQVNADGTYTFTLIDNFLLSDPGDTTEQTESIS 862 Query: 147 NMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER-------PIFLIELVVDL 199 ++ + N + + + + + LV+D+ Sbjct: 863 SLVGGINI-LVEDGDGDAANGTTGIALSLLVKDDIPTAVPITESGESFPTGTNLFLVIDV 921 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSM T+M N+ L +D + L +K +V + + Sbjct: 922 SGSMA-------------NASGVDGMTRMQLQINSALELIDQYEALGPLKVNV----VTF 964 Query: 260 TTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN-- 316 T +W + V+ ++ + T+ A+ Sbjct: 965 ATDASAPFSTTWQDADAVKTFI---QTLVPTSRTNYDAALNLTINTFNGGTASDIDGATN 1021 Query: 317 --FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 +F + + + N I I + N Sbjct: 1022 VLYFFSDGVPNENDISGTGPGNGSLGGGDGIDSSEETTWENFLNTNNIASFAIGLGNGVN 1081 Query: 375 GQRLLKTC-----VSSPEYHYNVVNADSLIHVFQN 404 L + V + L ++ Sbjct: 1082 ASNLNPIAYNGFGSGTEANAIVVTDLSQLDATLRS 1116 >gi|53723210|ref|YP_112195.1| hypothetical protein BPSS2193 [Burkholderia pseudomallei K96243] gi|52213624|emb|CAH39678.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 180 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 14 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 73 Query: 79 FE 80 + Sbjct: 74 AQ 75 >gi|56697084|ref|YP_167447.1| von Willebrand factor type A domain-containing protein [Ruegeria pomeroyi DSS-3] gi|56678821|gb|AAV95487.1| von Willebrand factor type A domain protein [Ruegeria pomeroyi DSS-3] Length = 565 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 29/105 (27%), Gaps = 3/105 (2%) Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ A L K + + + +I ++DG ++ Sbjct: 93 AAIASAVAGLKPKGKTPMLEAVRQAAEALRYTEEKATVILVSDGVETC-DADPCATAAAL 151 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVS--SPEYHYNVVNADSL 398 +A + + C++ + + NAD L Sbjct: 152 EAAGVDFTAHVVGFDIDDPQALAQMQCLADQTGGTFRSAANADEL 196 >gi|171913105|ref|ZP_02928575.1| hypothetical protein VspiD_18035 [Verrucomicrobium spinosum DSM 4136] Length = 455 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 25/241 (10%), Positives = 57/241 (23%), Gaps = 43/241 (17%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED 213 + L F + + + L + M + +I + P Sbjct: 235 VVFVLDFSVSMKKNQVELVVNEMGKTLKMLPMRSQ-YQVILFAGGARFADRNWKVEQPST 293 Query: 214 VNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGT 273 + + K++ + F + + + L + Sbjct: 294 FERIMVHRGKQKYRF---------FSPTKSHTDYAFDGSDNELPKDDW-------LTSNA 337 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 V + + TD A K A + P K + Sbjct: 338 VNVGRTMEELDKKETWGGTDWRWAFKTALNM----------------------SPPPKVV 375 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 F+TDG + + ++ + T ++ S + + + V Sbjct: 376 YFMTDGLGGSNVELILDY----NKRKAKATLNTFLMHTSAGAPLMKEIADKTGGRFTIVK 431 Query: 394 N 394 N Sbjct: 432 N 432 >gi|294139943|ref|YP_003555921.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326412|dbj|BAJ01143.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1227 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/312 (10%), Positives = 78/312 (25%), Gaps = 64/312 (20%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + I + + +R + + ++ + +T+ Sbjct: 230 TPSSLINYTVASASSNSKTKRNATDKAALTGFGTGRVITVYTEDYLRWEHGVKNKVTKTR 289 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMD 285 + K+A+ + + + M G + K + + + Sbjct: 290 LELAKDAIKNVILTTPGVDFGLSIFNMNGPGDNQRDGGRIISGIKNMSSKGKIDLLTSVQ 349 Query: 286 SLIL-KPTDSTPAMKQAYQILT--------------------------SDKKRSFFTNFF 318 S+ + T + +AY+ + + S Sbjct: 350 SITYAQNTPLCETLYEAYRYFSGQSVLFGDDDSDYISYDRWGRYRGIYKSNRNSPLDPDI 409 Query: 319 RQGVKIPSL----PFQKFIIFLTDGENNNFKSNVNTIKICDKA----------------- 357 S + +I+++TDGE + + I+ Sbjct: 410 LDAGVYSSPLKKCQDKAYIVYITDGEPTVDSAANHAIQSLTGGVDKHTAYPVSYLSAMSS 469 Query: 358 ---------KENFIK---IVTISI--NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 KE ++ TI A+ L KT +++ NA L Q Sbjct: 470 WMNNHDVNPKEPGVQHVSTFTIGFSEGAASAASLLKKTAAVGGGSYFDATNAKLLQGSLQ 529 Query: 404 NISQLMVHRKYS 415 ++ + S Sbjct: 530 QAVNKILEKNAS 541 >gi|218461773|ref|ZP_03501864.1| hypothetical protein RetlK5_20828 [Rhizobium etli Kim 5] Length = 159 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 49/151 (32%), Gaps = 14/151 (9%) Query: 4 LSRFRFYFKK-----GIASEKANFSIIFALSVMSFLLLIGFLI-YVLDWHYKKNSMESAN 57 + R + ++ E+ AL + F+ L I +++ +++ Sbjct: 1 MMALRNFIRRRLCLGFWRREEGAVLAE-ALLAIPFVTLFAAGILEFGSIFWQRMQIDAGL 59 Query: 58 NAAILAGASKMVSNLSR----LGDRFESISNHAKRALIDDAKRFI---KNHIKESLSGYS 110 A + ++ + ++I+ + + DAK + K+ +++ Sbjct: 60 RDAGRYLSRCRPASGTYVPTCNQATAKTIAFYGTQTPAADAKPRVPDWKDAADITITAPD 119 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 A T + S + + + S+ Sbjct: 120 ADGNITLSTAHLYKSSPVFGFLGIDAITISS 150 >gi|126442655|ref|YP_001064079.1| TadE family protein [Burkholderia pseudomallei 668] gi|242313326|ref|ZP_04812343.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254262367|ref|ZP_04953232.1| TadE family protein [Burkholderia pseudomallei 1710a] gi|126222146|gb|ABN85651.1| TadE family protein [Burkholderia pseudomallei 668] gi|242136565|gb|EES22968.1| TadE family protein [Burkholderia pseudomallei 1106b] gi|254213369|gb|EET02754.1| TadE family protein [Burkholderia pseudomallei 1710a] Length = 180 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 14 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 73 Query: 79 FE 80 + Sbjct: 74 AQ 75 >gi|2935363|gb|AAC05096.1| complement component BfB [Danio rerio] Length = 359 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 52/194 (26%), Gaps = 13/194 (6%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK--VRQ 278 + LK A + ++ +S+ + ++ + T V + ++ + Sbjct: 164 AVDVSDSINDLKKAKQIIKTLLEKISYYEVSPSYEILMFATDVYQIVKMRDFKTEEVAGS 223 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 D L S R K L + +I TD Sbjct: 224 LSKVFEDLDKFDFDKKLDQKGSNIAKLYQTILDSMSNEQIR--NKEDFLQTKHVVIVFTD 281 Query: 339 GENNNFKSNVNTIKICDKA-------KENFIKIVTISINASPNGQRLLKTCVSSPE--YH 389 G+ N + + + +EN + + + + L + Sbjct: 282 GQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFGVGKDVRTEDLNGLVSEKENERHF 341 Query: 390 YNVVNADSLIHVFQ 403 + + + D + F Sbjct: 342 FKLQDLDEVQKTFD 355 >gi|149173297|ref|ZP_01851928.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] gi|148848103|gb|EDL62435.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] Length = 1020 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 17/112 (15%), Positives = 35/112 (31%), Gaps = 10/112 (8%) Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 D T I S K +I ++DG+ I K + + + Sbjct: 536 DMPSFANTMQLGLNGLIKSDASTKHMIIISDGDPQ----PPTPQLIGQFLKNK-VSVSMV 590 Query: 368 SINA--SPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVI 417 +I + ++ + +Y + + L +F + K S+I Sbjct: 591 AIFPHGGEDISKMRDIAGVTGGRYYFPSDPNQLPSIF---IKESKTLKRSMI 639 >gi|5670335|gb|AAD38177.2|AF152598_6 TadF [Aggregatibacter actinomycetemcomitans] gi|26000720|gb|AAN75216.1| TadF [Aggregatibacter actinomycetemcomitans] gi|307548795|dbj|BAJ19117.1| TadF [Aggregatibacter actinomycetemcomitans] gi|307548810|dbj|BAJ19131.1| TadF [Aggregatibacter actinomycetemcomitans] Length = 200 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 7/45 (15%), Positives = 21/45 (46%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 KK +++ S+ F ++ + + FL ++ + +++A Sbjct: 18 KKFFHNKRGAVSLEFLFMLILLVFIFAFLTDLVIVRTTQGKLDNA 62 >gi|237510209|ref|ZP_04522924.1| TadE family protein [Burkholderia pseudomallei MSHR346] gi|235002414|gb|EEP51838.1| TadE family protein [Burkholderia pseudomallei MSHR346] Length = 189 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 23 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 82 Query: 79 FE 80 + Sbjct: 83 AQ 84 >gi|294140778|ref|YP_003556756.1| hypothetical protein SVI_2007 [Shewanella violacea DSS12] gi|293327247|dbj|BAJ01978.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 152 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 42/130 (32%), Gaps = 3/130 (2%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ +++ FA+ FLLL+ +I V Y N + A+ A + + Sbjct: 1 MRYQRGVYAVEFAIVGSIFLLLLFAIIEVGRLMYTYNVLHEASRRAARIAVVCQIDDADV 60 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + +N + + + +G +I N + + Sbjct: 61 KTMALFNGANLIPNLTTANLSISYIDELGNDATGI---NIVLVRADIANYQHQFLVPGLS 117 Query: 135 NRLDSSNNTI 144 ++S + Sbjct: 118 RVINSPTFST 127 >gi|254440736|ref|ZP_05054229.1| hypothetical protein OA307_151 [Octadecabacter antarcticus 307] gi|198250814|gb|EDY75129.1| hypothetical protein OA307_151 [Octadecabacter antarcticus 307] Length = 186 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 12/84 (14%), Positives = 25/84 (29%), Gaps = 1/84 (1%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + + +E N ++ F L + F++L + + ME + A+ Sbjct: 5 IKLLARFLRLFRRNEDGNPTVEFTLVFLPFIILPVSGFELGLLMTRHVMMERGIDMAVRQ 64 Query: 64 GASKMVSNLSRLGDRFESISNHAK 87 I N A Sbjct: 65 V-RLNTGTPITEQQFKTMICNAAA 87 >gi|310822272|ref|YP_003954630.1| cglb protein [Stigmatella aurantiaca DW4/3-1] gi|309395344|gb|ADO72803.1| CglB protein [Stigmatella aurantiaca DW4/3-1] Length = 430 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 32/332 (9%), Positives = 68/332 (20%), Gaps = 88/332 (26%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + L + + + R S Sbjct: 47 TAQALKPNLMVLLDTSGSM-----------------TLPVNTRDPNCYRANNTPSPDDDY 89 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 + +++ T+ + L+ A+ FL ++ + Y G + + Sbjct: 90 CGQTPSTACDTSKC-----PTRWSELQGAMSRFLSESGGVARMGLTTYPGPAVGSNSLRC 144 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM-KQAYQILTSDKKRSFFTNFFRQGVKI 324 + + + + S F Q + Sbjct: 145 EASTVVNKNIPQSDADEALLGAANDIQSVILGIPNAGTNAPSGGTPTSLSLQFVGQQPDV 204 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNT---IKIC--------------------------- 354 + F++ LTDG N N N + C Sbjct: 205 QATDRDNFVLLLTDGLPNCNPDNQNAGTNVDACQCTLANIGNNTGCQGDYVRRGCLDKDA 264 Query: 355 -----DKAKENFIKIVTISINAS-------------PNGQRLLKTCVSSPE--------- 387 + K I+ + + A ++C P Sbjct: 265 SVVAVEDLKRKGIRTIVVGFGAETATGNGPATLNAMATAGDFARSCRDDPNACGAGDTCD 324 Query: 388 --------YHYNVVNADSLIHVFQNISQLMVH 411 Y N + L + I L+ Sbjct: 325 AQTKLCGRRFYQAANQEELAEALREIIDLVGT 356 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 10/36 (27%), Positives = 16/36 (44%) Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 A GQ LL+ C S +++ + L F+ I Sbjct: 2 APEGGQALLQYCASDASHYFQAEKMEDLFAAFKAIG 37 >gi|254426967|ref|ZP_05040674.1| TadE-like protein [Alcanivorax sp. DG881] gi|196193136|gb|EDX88095.1| TadE-like protein [Alcanivorax sp. DG881] Length = 150 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 7/77 (9%), Positives = 20/77 (25%), Gaps = 2/77 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + ++ F + + ++ G + + + M+ A + A+ S L Sbjct: 5 RKQAGAVALEFLMLFPFVVAMLYGAAV-YGLTFFAQYRMQDAVDNAVSTALYIDRSALQG 63 Query: 75 LGDRFESISNHAKRALI 91 Sbjct: 64 AALGASVTQRANSALAG 80 >gi|167589171|ref|ZP_02381559.1| membrane protein [Burkholderia ubonensis Bu] Length = 636 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 9/158 (5%), Positives = 41/158 (25%), Gaps = 10/158 (6%) Query: 15 IASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + ++ S++ + +++ + L I + + +++ ++ + + LA + + S Sbjct: 1 MRGQRGAVSVLATVWVLVALVTLG--AIDIGNLFFQRRDLQRIADLSALAAVQSL--DPS 56 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + + + +A + Q + Sbjct: 57 DTSCGAANTAALNNARENETSNTVTTGAPARGQDQVAATCGRWDPQVYAGQPAYFAPAAS 116 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 ++ + +L Sbjct: 117 GMTQLNAAQVTVTRTMRYSFLGVV-----SMLGAGPGT 149 >gi|126306639|ref|XP_001365694.1| PREDICTED: similar to Von Willebrand factor A domain containing 1 [Monodelphis domestica] Length = 593 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 50/169 (29%), Gaps = 22/169 (13%) Query: 233 NALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + + F+ + L V ++ + ++ Sbjct: 31 SRVKEFVGQLVQPLPLGPGSVQTSMVHVGSEPTVEFPFDRHHSGAAAQEAIQAAKQLMGD 90 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T++ A+ A + L + + +K ++++TDG+ S+ + Sbjct: 91 TNTGLALALAKKQLFTTEAG-------------ARSGVRKVLVWVTDGD-----SSDDVQ 132 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSS-PEYHYNVVNADSLI 399 K+ + + +S L S PE H V+ D L Sbjct: 133 APMQVLKDLGVTVFIVSTGRGNFLD--LSAAASQPPEKHLRFVDVDDLQ 179 >gi|51245321|ref|YP_065205.1| fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] gi|50876358|emb|CAG36198.1| related to fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] Length = 1640 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 22/266 (8%), Positives = 61/266 (22%), Gaps = 18/266 (6%) Query: 154 YDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL-----SGSMHCAMN 208 R+ + P IE+ + L+ + Sbjct: 631 MRLRIPTNPFVKKSSDTNFRTIDTPIDSDIEILVDAVEHYPLIWGTTPLAENFYEVIRYF 690 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE 268 ++ + K + +D + + + + + Sbjct: 691 QQKGPYYNSSVNSAGDVASFLTEKAWDPYYFKELDAMIRCANSSVIIFTDGGSYTDSYVP 750 Query: 269 PSWGTEKVRQYVTRDMDSLILKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 P E ++ + T+ + + + + + N + Sbjct: 751 PFNDEEFNESSASQAASAYSDGSNYTNDYDGNIVTGEGINTRNTDNSYKNNLDDLALWAN 810 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKT-CV 383 + LT+ +N+ + E ++ T+ S N LL+ Sbjct: 811 --LNRKGQALTEALSNSRDLRDDLKG------EQYLTTYTVGFGISANSAEERLLQDTAA 862 Query: 384 SSPEYHYNVVNADSLIHVFQNISQLM 409 + + L V + + Sbjct: 863 HGQGTYSLAEDGQQLKDVLKGTINSI 888 >gi|89068016|ref|ZP_01155433.1| CpaB family protein [Oceanicola granulosus HTCC2516] gi|89046255|gb|EAR52312.1| CpaB family protein [Oceanicola granulosus HTCC2516] Length = 281 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 23/76 (30%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNH 85 F L ++ + L GF +Y+ + + + A A+ A A+ V + Sbjct: 5 FGLVLIVGVGLAGFAVYMAQGYLQDTQAQLARERAVAAQATPTVEVYAVTRQIGYGEQLT 64 Query: 86 AKRALIDDAKRFIKNH 101 + Sbjct: 65 PDDVTLVKYAEPFLPD 80 >gi|327193256|gb|EGE60162.1| hypothetical protein RHECNPAF_1700075 [Rhizobium etli CNPAF512] Length = 251 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 41/172 (23%), Gaps = 9/172 (5%) Query: 4 LSRFRFYFKKGI---ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNA 59 LS FRF + + A + FAL + ++L+ + + ++ A+ Sbjct: 64 LSLFRFARSRLLDLARDRLAASGVEFALVLPILVMLLFGTVDLGHALTVSRKIDEIASTT 123 Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 L G ++ + N I ++ Sbjct: 124 GDLIGQQSSWTSSDVTKLLSGASFILQPYDTSGLKITLTVNDISKN-----GNATVNWSA 178 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 S+ S A + + + Y N Sbjct: 179 AFNTSALNSGAASAIDIPSQIQDAGVQVVLTRVQYTLTTPVSSFFSNFTGQS 230 >gi|311744036|ref|ZP_07717842.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311313166|gb|EFQ83077.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 143 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 13/140 (9%), Positives = 34/140 (24%), Gaps = 1/140 (0%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + E+ +I + LL G ++ + ++ A GA + Sbjct: 1 MRRPREERGAATIFVLGLSVVLLLCAGLVVDGGLAINARMAVADDAEQAARIGADSIDVA 60 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + I ++ A ++ ++ + + Sbjct: 61 -ALRAGSGIVIDAGLATQRAGAFLANRGYGAGQATVVVDAANGAVTVELRDTTTTLILGL 119 Query: 132 MANNRLDSSNNTIFYNMDVM 151 + N+ D S Sbjct: 120 VGINQYDVSAGATSTPNTGP 139 >gi|182437627|ref|YP_001825346.1| hypothetical protein SGR_3834 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466143|dbj|BAG20663.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 758 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 29/110 (26%), Gaps = 29/110 (26%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A++Q L+S + + LTDGE + S Sbjct: 139 GTGTDFPSAIRQGVHELSSG----------------TDPSVPRVLFLLTDGEMDVTGSPQ 182 Query: 349 N-------------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 A ++I + P+ ++L + Sbjct: 183 YGDPAHREAEGKRQLELELKNAAAKNVQIWPLGFGPDPDKEQLDRIAAGG 232 >gi|213965183|ref|ZP_03393381.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213952297|gb|EEB63681.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 922 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 17/142 (11%), Positives = 35/142 (24%), Gaps = 19/142 (13%) Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 V + + D + TD + A + ++ FF G Sbjct: 130 NGTNVDRDIEAFKDRTDGEYTDYAKGLDGAAEAVSDGTSDCKAILFFSDG---------- 179 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI---NASPNGQRLLKTCVSS--- 385 F G + + + I T+ + ++ + L + Sbjct: 180 FPTREAGGAEAVMQDVCRPEGAVGRLRAADTLIFTVGLAPEKSATDTPDLRRISEGDCSN 239 Query: 386 ---PEYHYNVVNADSLIHVFQN 404 H N L+ F+ Sbjct: 240 SAPSGQHLFANNPGELLAAFRQ 261 >gi|115525743|ref|YP_782654.1| TadE-like protein [Rhodopseudomonas palustris BisA53] gi|115519690|gb|ABJ07674.1| TadE-like protein [Rhodopseudomonas palustris BisA53] Length = 213 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 13/116 (11%), Positives = 32/116 (27%), Gaps = 8/116 (6%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + ++ FAL V L +I +I + A + + + Sbjct: 19 RLRRDTRGLGAVEFALIVPLMLAMIFGVIQISSGI--------AIDRKVSMVTQTTSDLV 70 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 SR + E + + +K ++ N + + + Sbjct: 71 SRYKEVAEVDLDGIITIANAILTPYDSTPLKAKITQVYINPANGNACVQWSKATSN 126 >gi|315656396|ref|ZP_07909285.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315492955|gb|EFU82557.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 171 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 7/150 (4%), Positives = 38/150 (25%), Gaps = 13/150 (8%) Query: 9 FYFKKGIA--SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + E ++ + + ++I + + + ++ ++ + A+ Sbjct: 15 RFAARFERLGDEDGRIMLLGLAACVLLCVIILMSMALSGVYLEQRRLQRLADQTASMAAA 74 Query: 67 KMVSNLSRLGDRFESI----SNHAKRALIDDAKRFIKNHIKESLSG-------YSAVFYN 115 M + + + + SL G ++ Sbjct: 75 NMADTAYYQNGIVDGVPLEIEPYHASERAAEYLSGASISANSSLDGIDLVDVDVASTRVQ 134 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 ++ + + +++ + Sbjct: 135 VTLRATGKIPLVLPLVSSLTQVELTATGAA 164 >gi|163745748|ref|ZP_02153108.1| hypothetical protein OIHEL45_09155 [Oceanibulbus indolifex HEL-45] gi|161382566|gb|EDQ06975.1| hypothetical protein OIHEL45_09155 [Oceanibulbus indolifex HEL-45] Length = 187 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 29/67 (43%), Gaps = 4/67 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA-AILAGASKMV 69 FK+ + +++ + +I F + + + I ++ D + A +A A++A + Sbjct: 6 FKRFLKADEGSQTIAFVVLLPLLVWSIMAMLTFTDAFRIRA---MATDATAVIADSLSRE 62 Query: 70 SNLSRLG 76 + L Sbjct: 63 TTPIDLN 69 >gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] Length = 740 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 33/355 (9%), Positives = 95/355 (26%), Gaps = 64/355 (18%) Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L+ AS + + + + ++ + + S+ ++ ++ Sbjct: 221 TLSRASLLAGSADLIAPTADPNMVARAGGGLNPVSITVNLDPGFAPEAISSPYHAVSVRG 280 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ-----KIVS 175 ++ + + + +L + +++ Sbjct: 281 SGSTRT------------VTLADGAVPANRDFELRWSASGDAPMLGLFKQRHGELEYVMA 328 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 I +GE P + V+D SGSM + + + + ++ Sbjct: 329 TITPPALERVGEAPPREMIFVIDNSGSMAGESMPAARRSLLYALETLRPQDRFNVIR--- 385 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 +++ + + + T + + T+ Sbjct: 386 ---------FDDTMTELFASAVQASD------------SNIAAAKTFTHNLMANGGTEML 424 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 PA++ A + + +IFLTDG +I Sbjct: 425 PALRAALR-------------------DRAPDERVRQVIFLTDGAL--SNEADMMEEINR 463 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 K++ ++ + I ++PN + + + +V + Q + + Sbjct: 464 NRKDS--RVFMVGIGSAPNTYLMRRMAEAGRGTFTHVGMGEEAEDQMQRLLDRLS 516 >gi|163846487|ref|YP_001634531.1| hypothetical protein Caur_0910 [Chloroflexus aurantiacus J-10-fl] gi|222524270|ref|YP_002568741.1| hypothetical protein Chy400_0993 [Chloroflexus sp. Y-400-fl] gi|163667776|gb|ABY34142.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl] gi|222448149|gb|ACM52415.1| conserved hypothetical protein [Chloroflexus sp. Y-400-fl] Length = 170 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 17/56 (30%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 I AL + + L I +I V + +E A A + + Sbjct: 1 MWHRHHGQALPILALMLPALTLFILVVIEVANLWLDVALLEDALQQATRSAVQHLD 56 >gi|134288259|ref|YP_001110422.1| von Willebrand factor, type A [Burkholderia vietnamiensis G4] gi|134132909|gb|ABO59619.1| von Willebrand factor, type A [Burkholderia vietnamiensis G4] Length = 623 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 66/252 (26%), Gaps = 32/252 (12%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM-HCAMNSDPEDVNSAPI 219 L R + + + E + L+VD S SM C ++ A Sbjct: 402 SGKLRPNRVYRVALGDTDVFRKKARTEELDTCMYLLVDESSSMSACFDRERAPQLHKAQA 461 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 + + + + G+ Y T V + + Q Sbjct: 462 NPNDPHQHFDVSRAVAAGRVAVAAGEVLDGAQIPFGVASYNTAVREWQDFDGNWSNTLQR 521 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 T++ A+ A + + +K + +TDG Sbjct: 522 YEAA----ATGSTNTHLAVVWALRKF------------------VDRNEQRKVLAVVTDG 559 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + + +A+ +++ I I AS + + + N L Sbjct: 560 DPGDATV---LEAALKEAERFGVEVRFILIGASEEVRYKGLSAA-----YGVATNVRELA 611 Query: 400 HV-FQNISQLMV 410 F ++ + Sbjct: 612 KAVFGSLEAAIC 623 >gi|118594950|ref|ZP_01552297.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] gi|118440728|gb|EAV47355.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] Length = 318 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 33/264 (12%), Positives = 73/264 (27%), Gaps = 55/264 (20%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGER-PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + + ++ P + E + ++D S SM D ++ + Sbjct: 52 MSLIIFLLVVILANPWSKSTSITEIGKGAQLVFLIDRSVSMAKPFIGDDDNKSEIKSL-- 109 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 A + L F+D + +G++G++ + + + Sbjct: 110 -------AARRILKDFIDQ-------RPSDMIGIVGFSNSALYASKITKNRSYTYAAIDA 155 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 S I + T+ + + + + ++ L+DG Sbjct: 156 ATGSSINQ-TNIGSGITSGLFMFSEIETTGSQA-----------------LVLLSDGA-- 195 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPN-----GQRLLK-------------TCVS 384 S +I + E I + I I + G L+ + Sbjct: 196 GKISKRVKERIAEMLNEKKINLYWIIIKEPNDVSLFSGNTYLEGREPTVIKLDNFFKSLK 255 Query: 385 SPEYHYNVVNADSLIHVFQNISQL 408 + Y N D+L +I Q Sbjct: 256 TEYKAYEAENPDALSSAIADIDQK 279 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 63/200 (31%), Gaps = 22/200 (11%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 +R + ++A F + +++ LI + E + + + + + Sbjct: 82 SGSQRNLIDIERSAASQFFREV-----LRKKDLAFLIMFGEETELLQDYTGSPRLLTEGL 136 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 S + P + + + K+ +K I+ +TDG Sbjct: 137 NHLEVS--SGVSGIHPGPV--PTMGGPRGTVLYDAVYLAANEKLKGEVGRKVIVVITDGV 192 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPN-----------GQRLLKTCVSSPEYH 389 + + + N + + A+++ I +I + L K + Sbjct: 193 DQGSRMSRN--QAIEAAQKSDCVIYSIDYSDPRAYGPFNMVGGGGEGELRKMSDETGGRV 250 Query: 390 YNVVNADSLIHVFQNISQLM 409 Y V +L VF+ + + M Sbjct: 251 YKVDRRHTLDQVFKELQEEM 270 >gi|83312849|ref|YP_423113.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947690|dbj|BAE52554.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 175 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 12/170 (7%), Positives = 49/170 (28%), Gaps = 10/170 (5%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 + + ++ +++ ++ L+++ + ++ + A A ++ + + Sbjct: 12 LKRDRRGSVTLEYSILLLPALMILFGTFEMGMVIFENS----VVEGATRDAARRLRTGQA 67 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + N A + S + F + + + ++ Sbjct: 68 QTSA------NPASTFQQTFCASLFSLYKCSSFTFDVRSFSDFTTIALPAPQFDAQGNVT 121 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 N + + V+ + + I L+ + A+ + Sbjct: 122 NAQFTPGGAGTITTVRVIYRHVFSTPLIGSLMGAGTGNTLGITATAVFKT 171 >gi|2326547|emb|CAA04501.1| Matrilin-3 [Homo sapiens] Length = 303 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 9/77 (11%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 S K I +TDG + + ++ +A+ + I++ + ++ + LK S Sbjct: 1 SSNIPKVAIIVTDGRPQDQVN-----EVAARAQASGIELYAVGVDRADMAS--LKMMASE 53 Query: 386 P--EYHYNVVNADSLIH 400 P E+ + V + Sbjct: 54 PLEEHVFYVETYGVIEK 70 >gi|310825520|ref|YP_003957878.1| hypothetical protein STAUR_8297 [Stigmatella aurantiaca DW4/3-1] gi|309398592|gb|ADO76051.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1414 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 59/233 (25%), Gaps = 12/233 (5%) Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 + +++ G + A D L ALL Sbjct: 242 ISTFTQTSSANTLKMQPGCQGILANADAFDSSRTGFIATINSLTFNTATPLARALLNTGY 301 Query: 241 SIDLLSHVKEDVYMGLIGYTT---RVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 V +DV+ T + + + + R D S ++ TD P Sbjct: 302 YFTSDQTVYKDVFGFGATNPTVGYAYPTDFKNEPLSSENRSVCWGDQASAVIILTDGEPN 361 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE----NNNFKSNVNTI-K 352 + + + L D N+ + + I Sbjct: 362 TDTLGSAVVQKIRSRNGGPVSCPPSAPCADTPNDANAMLDDVAKLLFTNDLQYSTPPIVG 421 Query: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQN 404 + + + +++ T+ N LLK + Y +A L Q+ Sbjct: 422 ALNTSGQQSLRVYTVGFAIDSN---LLKNAAAVGGGRSYTAHDAAGLRQALQD 471 >gi|264678232|ref|YP_003278139.1| TadE-like protein [Comamonas testosteroni CNB-2] gi|262208745|gb|ACY32843.1| TadE-like protein [Comamonas testosteroni CNB-2] Length = 147 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 42/130 (32%), Gaps = 8/130 (6%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV--SNLSRL 75 ++ + I FAL ++ FL+ + ++ Y A N A AGA V +L++ Sbjct: 6 QRGSTLIEFALGLLIFLMFLLGVVDFSRLLYTWT----AANEATRAGARYAVVCDDLNQK 61 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + N + D K K + + S + + A Sbjct: 62 TSVLQYMKNRLPQITAVDVK--WKPDSCTTATCQSVTVSIPQDGLKFQWIAPIVGSAAQT 119 Query: 136 RLDSSNNTIF 145 + + Sbjct: 120 VIGMPGFSTT 129 >gi|170751927|ref|YP_001758187.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] gi|170658449|gb|ACB27504.1| TadE family protein [Methylobacterium radiotolerans JCM 2831] Length = 215 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 36/128 (28%), Gaps = 9/128 (7%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI-------- 61 ++ ++ FAL + FL L G ++ ++ + + + A A+ Sbjct: 18 LLRRLARDSGGAAAVEFALVALPFLALCGAILQIMFQMWATQNFDRALQNAVRTIFTGQF 77 Query: 62 -LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 L A + + + + +K + S S SA Sbjct: 78 QLDTAGQTNPATVLATLKTRMCGPSSSPVPTVFNCQNVKIDVSTSSSFASATAAKPIDTA 137 Query: 121 IVNSSRIS 128 S Sbjct: 138 TGTWSTSF 145 >gi|104781654|ref|YP_608152.1| RND family multidrug efflux transporter [Pseudomonas entomophila L48] gi|95110641|emb|CAK15352.1| putative multidrug efflux transporter, RND family [Pseudomonas entomophila L48] Length = 786 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 6/63 (9%), Positives = 19/63 (30%), Gaps = 7/63 (11%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 + ++ +L +G + + Y + + LA +V+ + + Sbjct: 667 LTVATLPVMVLAVGVGVDYAFYIYNRLQLH-------LAAGRDIVAAFQQALREVGVATV 719 Query: 85 HAK 87 Sbjct: 720 FTA 722 >gi|326778278|ref|ZP_08237543.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326658611|gb|EGE43457.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 758 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 29/110 (26%), Gaps = 29/110 (26%) Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD A++Q L+S + + LTDGE + S Sbjct: 139 GTGTDFPSAIRQGVHELSSG----------------TDPSVPRVLFLLTDGEMDVTGSPQ 182 Query: 349 N-------------TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 A ++I + P+ ++L + Sbjct: 183 YGDPAHREAEGKRQLELELKNAAAKNVQIWPLGFGPDPDKEQLDRIAAGG 232 >gi|117618784|ref|YP_857965.1| hypothetical protein AHA_3491 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117560191|gb|ABK37139.1| flagellin hook IN motif family [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 5047 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/275 (10%), Positives = 62/275 (22%), Gaps = 20/275 (7%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI-------PALLRIEMGERPIFLIELVV 197 + + + + + N + G IFL++ Sbjct: 4493 VVLPGQNYNIAFMVDSSGSIGSSAMNTIKSQLAQVFSSLKASAGTDGAGTVNIFLVDFDT 4552 Query: 198 DLSGSMHCAMNSDPE--------DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVK 249 + S+ + D K F S + Sbjct: 4553 LANKSVSVNLKDSNALSQLQAILDSMDGSTNSGGGTNYEDVFKTTANWFNSSTVQGNTNA 4612 Query: 250 EDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 +++ + E S + R D+ + T T + T Sbjct: 4613 KNLTYFITDGQPTFYLENEGSDPMIRDRSGTVNDVYLSNVIGTSYTYGQTYSVNGRTVVN 4672 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLT-DGENNNFKSNVNTIKICD--KAKENFIKIVT 366 ++ + ++ T G N+ S + + + Sbjct: 4673 GNGVVYDYNGNQIGYMRPDGAGRYVYATLAGTGNSTSSTTIANSVAGFGLLTGMGVTVEA 4732 Query: 367 ISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 I + A+ + L S VNA L + Sbjct: 4733 IGLGANISYNDLKSY--DSDGVIMTGVNASDLANA 4765 >gi|467168|gb|AAA17326.1| L308_C1_175 [Mycobacterium leprae] Length = 141 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 8/36 (22%), Positives = 16/36 (44%) Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 Q L K C + ++ + DSL V+ + + + Sbjct: 30 NQTLQKICEITDGQVFHAYDLDSLKQVYSTLQRQIG 65 >gi|33595651|ref|NP_883294.1| putative hemolysin [Bordetella parapertussis 12822] gi|33565730|emb|CAE36274.1| putative hemolysin [Bordetella parapertussis] Length = 2215 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 37/322 (11%), Positives = 79/322 (24%), Gaps = 66/322 (20%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + A D + L+ + + N + Sbjct: 1548 NANGSYSYKANASGQGALPGDGQDVFTYTVRDADGDLTSTTLTINIDQFVAGSNDNNTIT 1607 Query: 130 THMANNRL--DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 N+ + D + L + + ++ + Sbjct: 1608 GGAGNDVMLGDQGGIKQNVTAGTSYNIALVLDLSDSMNDKWGS----------------- 1650 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 K T++ K+AL L++ + Sbjct: 1651 ---------------------------------GSNKPTRLQTAKDALKALLENQLAVHD 1677 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +V + ++ K E V + V M T A + Q Sbjct: 1678 GEINVSLITFNGSSSALKKSITGLTPENVDEMVDILMGLKASSATPYGAAFDRTTQWFE- 1736 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + V P++ FLTDGE + S + + A + + I Sbjct: 1737 ----------GQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISD--VHGI 1784 Query: 368 SINASPNGQRLLKTCVSSPEYH 389 I + + L K ++ Y+ Sbjct: 1785 GIGSGVSTSTLNKY-DNTGGYY 1805 >gi|296158792|ref|ZP_06841621.1| TadE family protein [Burkholderia sp. Ch1-1] gi|295890997|gb|EFG70786.1| TadE family protein [Burkholderia sp. Ch1-1] Length = 165 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 13/128 (10%), Positives = 40/128 (31%), Gaps = 2/128 (1%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R R + ++ +++ ++ FAL F +++ +I + ++ A A Sbjct: 13 RQRIHARR-AGAQRGATAVEFALVFPLFFMILYAIITFSLILVAQQNLTMAAEEGARAAL 71 Query: 66 SKMVSNLSRLGDRFESISN-HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + + A + + + + + G + I N Sbjct: 72 NWQSNTSLQTALTNRGNAACAAAKLVTAKLVQSAQCTPSSTTCGPGGTMQCVNVLLIYNY 131 Query: 125 SRISMTHM 132 + + Sbjct: 132 QANPIVPI 139 >gi|254454002|ref|ZP_05067439.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198268408|gb|EDY92678.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 186 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 23/58 (39%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + + + +E + ++ FAL + F++L + + ME + A+ Sbjct: 5 IKTLARFLRLFRRNEDGSPTVEFALIFLPFIILPVSGFELGLLMTRHVMMERGIDMAV 62 >gi|254450087|ref|ZP_05063524.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264493|gb|EDY88763.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 139 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 23/58 (39%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + + + +E + ++ FAL + F++L + + ME + A+ Sbjct: 5 IKTLARFLRLFRRNEDGSPTVEFALIFLPFIILPVSGFELGLLMTRHVMMERGIDMAV 62 >gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] Length = 676 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 42/332 (12%), Positives = 88/332 (26%), Gaps = 51/332 (15%) Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 S I + L ++ ++N ++QN N Sbjct: 218 TQNSTQTDNVPSSQFSLNIDIDVGLELVDINSKYHNVDVQNTAFGQ---YKIALNETNTV 274 Query: 140 SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDL 199 + + + + F + + +P +R + VVD Sbjct: 275 NRDFLLEFKPLQKEQAQAAFFTQQF-ENGDRYGLAMLMPPGDHFTQTQRLPREMVFVVDT 333 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 SGSMH M K AL L +D + ++ Sbjct: 334 SGSMH--------------------GQSMEQAKKALFYALSLLDSDDSFNIIGFDNIVTP 373 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 + + +R+ T+ A+ +L + F Sbjct: 374 MS----DKPLIASDFNLRRAERFIYSLEADGGTEIQGALN---AVLDGSEFDGFV----- 421 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 + ++FLTDG + I K ++ ++ T+ I ++PN + Sbjct: 422 -----------RQVVFLTDGS--VSNEDALFKNIQSKLGDS--RLFTVGIGSAPNSFFMR 466 Query: 380 KTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + + Q + + H Sbjct: 467 RAADIGKGSFTFIGSTSEVQPKMQQLFDKLAH 498 >gi|310825620|ref|YP_003957978.1| hypothetical protein STAUR_8397 [Stigmatella aurantiaca DW4/3-1] gi|309398692|gb|ADO76151.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 257 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 23/231 (9%), Positives = 59/231 (25%), Gaps = 13/231 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 FR + A E ++ A+ + F+ L+ ++ + H + + A A+ AGA Sbjct: 3 FRMTKRNPGARESGQAAVEAAIILPLFVFLMLGILQIGLMHQARLMTKYAAYRAVRAGAI 62 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + A N + + ++ + Sbjct: 63 HNAKTDEMERAALAVMLPLVSEAKSGGEVIKPINSASDFSTKWTTGGVSKNEMPDAKMKY 122 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 + + + D+ D + + + + + Sbjct: 123 VEVVTCGPLKEDAGGGEEVDFDDPKIA--------GQAGWRESQRTKLRI-----QATFN 169 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 R V S +M+ + + + ++ K+A Sbjct: 170 YRMPIPFANWVIHSAAMNKEVPWVLRMGKKSTGTTGFEHSRYEKYKSAASD 220 >gi|330470624|ref|YP_004408367.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] gi|328813595|gb|AEB47767.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] Length = 555 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 16/186 (8%), Positives = 49/186 (26%), Gaps = 22/186 (11%) Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 T++ A A + S + + + ++ + + + + Sbjct: 370 GGGTRLDATVRAAEEGAALLLDNSELGVWAFATNLDGERDHQEILPVAPLGAQRDRLAEA 429 Query: 283 DMDS--LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 T + AY+ + + ++ +TDG Sbjct: 430 LAAVRVEPQGGTGLYDSTLAAYR----------------DARRNWTPGRINLVLVMTDGR 473 Query: 341 NNNFKSNVNTIKICDKAK----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 N + S + + + + I+ I + + + + L + + Sbjct: 474 NEDDDSIGRAALLAELKRLQEPRRPLPIIFIGLGGAIDPEELEAIAKVTGGRVFVTAQPS 533 Query: 397 SLIHVF 402 + +F Sbjct: 534 GMRRIF 539 >gi|326330151|ref|ZP_08196462.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325951964|gb|EGD43993.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 165 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 48/130 (36%), Gaps = 2/130 (1%) Query: 10 YFKKGIASE--KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 ++ + + + +I A+ V +F+L +G +I+ +++SA A + Sbjct: 20 AMRRPRRDQNERGSAAIEAAIGVPAFMLFVGLIIFGGRTATTHEAVQSAAADAARTASIA 79 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 S ++ + + A + + + + + + ++ S + Sbjct: 80 RTSTEAQSEGARAARESLANQQIQCRSVTIEVSTDGFATPVGEPAAVEVTVACRLDLSDL 139 Query: 128 SMTHMANNRL 137 S+ + +R+ Sbjct: 140 SVPGVPGSRV 149 >gi|116622792|ref|YP_824948.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225954|gb|ABJ84663.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 360 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 14/108 (12%), Positives = 37/108 (34%), Gaps = 28/108 (25%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA------------------- 371 K ++ L+DG + K + + A+ + ++ Sbjct: 218 KALVILSDGVDTGSKVS--LSSAVESAQRADTLVYSVLFEDREAYGVPGFGGMGRGRGGR 275 Query: 372 ------SPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVHR 412 NG+++L+ + + + V + L ++Q I + + H+ Sbjct: 276 RPMPTAPANGKKVLERISTETGGRFFEVSKKEPLEKIYQEIEEDLRHQ 323 >gi|270308522|ref|YP_003330580.1| hypothetical protein DhcVS_1138 [Dehalococcoides sp. VS] gi|270154414|gb|ACZ62252.1| hypothetical protein DhcVS_1138 [Dehalococcoides sp. VS] Length = 538 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 54/171 (31%), Gaps = 14/171 (8%) Query: 11 FKKGIASEKAN---FSIIF----ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 KK I EK ++IF A+ + L+L+G + + + +++A + Sbjct: 4 IKKLIRGEKGASLAVALIFLAIGAIMIPPLLMLVGSGLKQGAAIENRTDVLYSSDAGV-- 61 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + +S N + + + + S + +++ V Sbjct: 62 -EWVINILKTGSEGVTDSHGNIGLPNPQSPTRTYNMSDLNGSAVMVTLTYHDLGSYYDVV 120 Query: 124 SSRISMTHMANNRLD----SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 S+ S ++F N D +L +++ N Sbjct: 121 STASLNGRSITTNATLRYQMSGGSVFDNAITSLDGDVKLTGSSSVISDPRN 171 >gi|26331510|dbj|BAC29485.1| unnamed protein product [Mus musculus] Length = 323 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 44/143 (30%), Gaps = 20/143 (13%) Query: 230 ALKNALLLF-LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 A+ F + + L + + LI Y E V ++ Sbjct: 183 YPWEAVKNFLVKFVTGLDIGPKKTQVALIQYANEPRIIFNL-NDFETKEDMVQATSETRQ 241 Query: 289 LKP--TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T++ A++ A S K ++ +TDGE+++ Sbjct: 242 HGGDLTNTFRAIEFARDYAYSQTSG-------------GRPGATKVMVVVTDGESHDGSK 288 Query: 347 NVNTIKICDKAKENFIKIVTISI 369 I+ C+ ++ I I++ Sbjct: 289 LKTVIQQCN---DDEILRFGIAV 308 >gi|77458974|ref|YP_348480.1| transporter, putative [Pseudomonas fluorescens Pf0-1] gi|77382977|gb|ABA74490.1| Putative transport protein [Pseudomonas fluorescens Pf0-1] Length = 804 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 4/33 (12%), Positives = 13/33 (39%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 + ++ +L +G + + Y + + A Sbjct: 684 LTIATLPVMVLAVGIGVDYAFYIYNRLQLHQAL 716 >gi|197295156|ref|YP_002153697.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] gi|195944635|emb|CAR57239.1| putative flp type pilus assembly protein [Burkholderia cenocepacia J2315] Length = 178 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 8/62 (12%), Positives = 25/62 (40%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FAL +M ++L + Y+ ++ A A + + ++ + + Sbjct: 12 RGAVAVEFALVLMPMIMLATGVAEFGRAIYQYETLTKATRDAARYLSVWLPTDSAYPVAQ 71 Query: 79 FE 80 + Sbjct: 72 AQ 73 >gi|194221204|ref|XP_001915782.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Equus caballus] Length = 1055 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 56/217 (25%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 213 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 252 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ I Sbjct: 253 DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNI 312 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + + I+ +TDG + + TI + ++I Sbjct: 313 LSDFNHTGQGSICIQA------------IMLITDGAVDTYD----TIFAKYNWPDRKVRI 356 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 357 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 393 >gi|23821225|emb|CAD52984.1| hypothetical protein [Rhodococcus fascians] Length = 571 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 32/356 (8%), Positives = 90/356 (25%), Gaps = 24/356 (6%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALID----DAKRFIKNHIKESLSGYSAVFYN 115 A+ A A+ + + + E+ S A+ + D + + Sbjct: 210 AVAAIAAGVRGDPTTALTVDETSSTQLVTAMSELAVTDTGAAATTASTTYDALAALENAA 269 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + + + + V L + + + Sbjct: 270 GPDAAVHAVPVTEQQLASTDSSTLTAVRPQGATPVADHPAVVL------SGDETSTRAAA 323 Query: 176 FIPALLRIEMGERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 +R G + + VD + + D A + + L + Sbjct: 324 AFVDFVRQPDGTQTLLDAGFSVDEPQDAGIVAPPSGPVADALLAVLRNPVLPRRATVLLD 383 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVE-KNIEPSWGTEKVRQYVTRDMD------S 286 + + ++ V + + + +E + + +D + Sbjct: 384 VSESMRTTEGGATRLQNTVRALSEQFRRVPDATELGLWSFSEDLNNSLPFRVDVPTGPMT 443 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ---KFIIFLTDGENNN 343 + + T A+ + LT + + + + ++ +TDG +++ Sbjct: 444 VPVGTTPRRSALDATAEALTPATGSFTYASVLVAYLDAVAGYVPGRVNSVVLITDGPDDS 503 Query: 344 FKSNVNTIKICDKAKE--NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 S + A + + + + I + V +DS Sbjct: 504 PLSADELLTELTSASDPARPVAVNVVRIGDGSPAPVFTDIAARTGGTVDTVPTSDS 559 >gi|83770915|dbj|BAE61048.1| unnamed protein product [Aspergillus oryzae] Length = 1100 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/244 (11%), Positives = 69/244 (28%), Gaps = 33/244 (13%) Query: 158 LQFIEHLLNQRYNQ------KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 L N + S A + + S S+H ++ Sbjct: 526 LDNSGAEEEDYRNSQIKRQASLNSSYGAARSNNTAITDYTSVAVESGPSPSLHIPLD--- 582 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 V P+ + K+ L++AL + ++ MGL+ + + Sbjct: 583 -IVVVIPVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVG 634 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K +++ + L +D + S P Sbjct: 635 MTTKSW-----------GGWGKILNSIRPVGQKSLRADVVEGANVAMDLLMQRKLSNPIS 683 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 I+ ++D ++ S + + AK + I + + + +++ + + Sbjct: 684 -TILLISDSSTSDPDSVDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYL 739 Query: 391 NVVN 394 V + Sbjct: 740 YVKD 743 >gi|88858062|ref|ZP_01132704.1| hypothetical protein PTD2_11769 [Pseudoalteromonas tunicata D2] gi|88819679|gb|EAR29492.1| hypothetical protein PTD2_11769 [Pseudoalteromonas tunicata D2] Length = 637 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 11/109 (10%), Positives = 29/109 (26%), Gaps = 8/109 (7%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 D K +++ + L ++ L+ Y + + Sbjct: 102 YSTDIKPNRLSQARFKALDMIELFKEGE-------TALVAYAADAYVISPLTSDASTLSN 154 Query: 279 YVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 + D + K ++ + A ++L+ S G+ Sbjct: 155 LIPSLSPDIMPTKGSNVMAGLTTANELLSQAGYLSGDIILVTDGIDSED 203 >gi|307294426|ref|ZP_07574270.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] gi|306880577|gb|EFN11794.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] Length = 556 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/306 (9%), Positives = 69/306 (22%), Gaps = 47/306 (15%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSV---MSFLLLIGFLIYVLDWHYKKNSMES----- 55 + R R K + S + ALS+ ++ G + +++ Sbjct: 1 MPRIR---KGLLQSTGGAVAPTVALSLFGLIAV---GGIAFDYARMASLDSELQNAADQA 54 Query: 56 -------------ANNAAILAGASKMVSN---LSRLGDRFESISNHAKRALIDDAKRFIK 99 A + A A ++ + ++ + +I+ + Sbjct: 55 ALAAATQLDGKTGACSRAANAASALIRNDARFANDGNASGLAITVANEATCDRTGFIKFY 114 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA----NNRLDSSNNTIFYNMDVMTSYD 155 + S +G A ++ VNS + S + + Sbjct: 115 KNKDRSDTGTLADADVNFVEVTVNSRTARFALTPVVAMFSSGPLSAKAYAGLGEAICKVP 174 Query: 156 YRL-----QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVD--LSGSMHCAMN 208 + + I + L + +G +GS N Sbjct: 175 PVMICNPDEPIGNTDVDYDFAVANRIGKGLKLVTVGNGNSAWAPGNFGYLDTGSSTSNPN 234 Query: 209 ------SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 V + +T+ A D+ + G Sbjct: 235 VELREALGWISVPGDCSSLEGVKTRTGAGTTVTQAINTRFDIYERANNNAQSGTGNNGNG 294 Query: 263 VEKNIE 268 + Sbjct: 295 NNASCP 300 >gi|283787708|ref|YP_003367573.1| tight adherence protein TadF [Citrobacter rodentium ICC168] gi|282951162|emb|CBG90853.1| putative tight adherence protein TadF [Citrobacter rodentium ICC168] Length = 203 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 18/183 (9%), Positives = 51/183 (27%), Gaps = 10/183 (5%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN--------NAAIL--AGA 65 +E + SI F+ ++ F+L++ + K+ +E + L A Sbjct: 21 KNEDGSVSIEFSFVMLIFILIMYLVTDFGMAIVKQGRLERTSHTLASLVRERRALYQANE 80 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + L + +++ A L DA +S S ++ + Sbjct: 81 TLTQEEVDELLEIGKNLQKEADINLTIDALYLTPTQEGQSASVTASTLTFNAGNGKCQPN 140 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM 185 ++ + +L + + + + + + + Sbjct: 141 NVAANDVQWQKLSPYAASTERWVPIYRVRVCIPDTSSLFMTLMNGRSQKLPSVFATDVAI 200 Query: 186 GER 188 Sbjct: 201 SRN 203 >gi|307265045|ref|ZP_07546606.1| conserved hypothetical protein [Thermoanaerobacter wiegelii Rt8.B1] gi|306920030|gb|EFN50243.1| conserved hypothetical protein [Thermoanaerobacter wiegelii Rt8.B1] Length = 136 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 13/137 (9%), Positives = 34/137 (24%), Gaps = 6/137 (4%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLI-YVLDWHYKKNSMESANNAAILAGASK 67 K + ++ + ++F L +M F +L + + + + + + L A Sbjct: 2 RALKDFLKDKRGDAVLLFMLFLMIFSILFMYAVYSISRGVGAREELVKVCDEIALNIAVS 61 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + +S + + S N I + S Sbjct: 62 -----AVNMQYAQSGDLAVDTNKAYSLAVNTFKDLGIPVKDVSVTVKNRYIYVTASVSGE 116 Query: 128 SMTHMANNRLDSSNNTI 144 + + Sbjct: 117 MYGISGDITVTGMAKAR 133 >gi|315497472|ref|YP_004086276.1| tade family protein [Asticcacaulis excentricus CB 48] gi|315415484|gb|ADU12125.1| TadE family protein [Asticcacaulis excentricus CB 48] Length = 184 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 34/143 (23%), Gaps = 10/143 (6%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLI-GFLIYVLDWHYKKNSMESANNAAI 61 +L R R G+ + ++ FAL +++ G + A Sbjct: 1 MLRRLRQTLHIGLRARNGTAAVEFALIAPILIVIYWGLA-DLSLGIMANRK------TAH 53 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 LA + S + + F + T + Sbjct: 54 LAATMGDLVAQSESLTQANVSDIFE--IGTSILEPFPAGTSLQMRISSVTRNKTTGVIAA 111 Query: 122 VNSSRISMTHMANNRLDSSNNTI 144 + S +D+ T Sbjct: 112 DWTPAPSKNWKGTTTVDTKGLTT 134 >gi|226310166|ref|YP_002770060.1| hypothetical protein BBR47_05790 [Brevibacillus brevis NBRC 100599] gi|226093114|dbj|BAH41556.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 222 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 9/86 (10%), Positives = 25/86 (29%), Gaps = 5/86 (5%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFL-IYVLDWHYKKNSMESANNAAIL 62 + R + + +E+ N +I +L+ K ++ A + Sbjct: 1 MVRILRH----LQNERGNMTIFTLTIYWFMFMLVFVALFNFSTVFVDKEQASNSAQQASI 56 Query: 63 AGASKMVSNLSRLGDRFESISNHAKR 88 A + + +++ K Sbjct: 57 AAVKDIYDEMIVAIMAYDASITRLKN 82 >gi|197121078|ref|YP_002133029.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. K] gi|196170927|gb|ACG71900.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. K] Length = 1722 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 38/170 (22%), Gaps = 17/170 (10%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAM-----KQAYQILTSD 308 I Y + + + G +T D D L T A + Sbjct: 478 YSICYLCQTSSVVVLTDGEPSPGSGLTPNDGDRLPSGGTTLADTTSGKYAGAASTGIKDI 537 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI---V 365 G K ++ N + + N + C K ++ Sbjct: 538 TSTDCPQCANFSGTAAWKDSLAKVAWYMH---NMDLRDNAESTWDCK--KNGGKQVLDTY 592 Query: 366 TISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 T+ L + N +L I + + R Sbjct: 593 TVGFATSQLPDAKTILHAAAEAGGGKFVPAENPQTLAEGLTYILEKINER 642 >gi|39996168|ref|NP_952119.1| hypothetical protein GSU1066 [Geobacter sulfurreducens PCA] gi|39982933|gb|AAR34392.1| hypothetical protein GSU1066 [Geobacter sulfurreducens PCA] gi|298505182|gb|ADI83905.1| type IV pilus tip-associated adhesin [Geobacter sulfurreducens KN400] Length = 1014 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 68/248 (27%), Gaps = 33/248 (13%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 D + ++ ++ ++T++ + + + ++ + + + Sbjct: 126 WDGNYLNWLFYHATSSQRSTVSTDATLQKTRIQTARGVISNLVKTVSGVRFGLAKLNVDG 185 Query: 257 IGYTTRVEKNIEP---SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 R + + + G T T A+ + +Q S Sbjct: 186 YDRFDRKQTDGGSIVRNCGDLTSANVDTSVSGISAETWTPLGEALSEVWQYF--KGGTSL 243 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNF--KSNVNTIKICDKAKENF---------- 361 + I S + F I +TDGE + CD A + Sbjct: 244 YNTGVSYTSPITSSCQKSFTIVVTDGEPTYDGCYRGDFSSYGCDNAADADSHLADVAAHM 303 Query: 362 --------------IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + TI + + + L T + +Y + L QN Sbjct: 304 NGSDATSAYGGTQSVTTYTIGM--TIDSSLLRTTAENGGGSYYTTTSGMDLATALQNAVN 361 Query: 408 LMVHRKYS 415 ++ R+ S Sbjct: 362 EILGRQSS 369 >gi|317137676|ref|XP_001727887.2| von Willebrand and RING finger domain protein [Aspergillus oryzae RIB40] Length = 1013 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/244 (11%), Positives = 69/244 (28%), Gaps = 33/244 (13%) Query: 158 LQFIEHLLNQRYNQ------KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 L N + S A + + S S+H ++ Sbjct: 439 LDNSGAEEEDYRNSQIKRQASLNSSYGAARSNNTAITDYTSVAVESGPSPSLHIPLD--- 495 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 V P+ + K+ L++AL + ++ MGL+ + + Sbjct: 496 -IVVVIPVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVG 547 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K +++ + L +D + S P Sbjct: 548 MTTKSW-----------GGWGKILNSIRPVGQKSLRADVVEGANVAMDLLMQRKLSNPIS 596 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 I+ ++D ++ S + + AK + I + + + +++ + + Sbjct: 597 -TILLISDSSTSDPDSVDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYL 652 Query: 391 NVVN 394 V + Sbjct: 653 YVKD 656 >gi|238489955|ref|XP_002376215.1| von Willebrand and RING finger domain protein [Aspergillus flavus NRRL3357] gi|220698603|gb|EED54943.1| von Willebrand and RING finger domain protein [Aspergillus flavus NRRL3357] Length = 1013 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/244 (11%), Positives = 69/244 (28%), Gaps = 33/244 (13%) Query: 158 LQFIEHLLNQRYNQ------KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 L N + S A + + S S+H ++ Sbjct: 439 LDNSGAEEEDYRNSQIKRQASLNSSYGAARSNNTAITDYTSVAVESGPSPSLHIPLD--- 495 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 V P+ + K+ L++AL + ++ MGL+ + + Sbjct: 496 -IVVVIPVSSSMQGLKITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVG 547 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMK-QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 T K +++ + L +D + S P Sbjct: 548 MTTKSW-----------GGWGKILNSIRPVGQKSLRADVVEGANVAMDLLMQRKLSNPIS 596 Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 I+ ++D ++ S + + AK + I + + + +++ + + Sbjct: 597 -TILLISDSSTSDPDSVDFVVSRAEAAK---VSIHSFGLGLTHKPDTMIELSTRTKGSYL 652 Query: 391 NVVN 394 V + Sbjct: 653 YVKD 656 >gi|150377239|ref|YP_001313834.1| TadE family protein [Sinorhizobium medicae WSM419] gi|150031786|gb|ABR63901.1| TadE family protein [Sinorhizobium medicae WSM419] Length = 141 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 37/101 (36%), Gaps = 1/101 (0%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASKMV 69 + ++ ++ FAL + LLL+ +I Y +N + A + A ++ Sbjct: 8 IRSLRRNQSGATAVEFALVCLPLLLLVIGVIEFGRAFYVRNELSHAADVAARRVLIGQIA 67 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYS 110 N S + + S + D KE++ G + Sbjct: 68 HNASDSEAQAKLDSAVRESFRSGDPTLLTIAVSKETVDGIA 108 >gi|52843051|ref|YP_096850.1| TPR repeat-containing protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630162|gb|AAU28903.1| TPR (repeat) domain protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 310 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 51/163 (31%), Gaps = 26/163 (15%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMK 299 + L K+ GLI +T + + + ++ D + + + A+ Sbjct: 120 KLHDLFTRKDIGQFGLIVFTGEPFVVSPLTDDGQTISSLLSALTPDIMPVAGYNLDSALD 179 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 +A Q++ K + ++ + + I+ K E Sbjct: 180 EASQLI-----------------KQAGYDHGEILVL------TAETPSSSAIEKAKKLAE 216 Query: 360 NFIKIVTISINASPNGQRLLKTCVSSPEYHYNV--VNADSLIH 400 + I + I A N L + ++ + + V ++ L Sbjct: 217 SGIFSSIMPIRADENLNPLFQRFANAGDGYLLVYSPDSSDLEK 259 >gi|327266004|ref|XP_003217797.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Anolis carolinensis] Length = 1098 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 24/217 (11%), Positives = 54/217 (24%), Gaps = 37/217 (17%) Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDL 244 + ++VD+SGSM K ++ K + LD++ Sbjct: 256 QAATSPKDVVILVDVSGSM--------------------KGLRLTIAKQTVSSILDTLGD 295 Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 Y + Y + A+ +A+ + Sbjct: 296 DDFFNIIAYNEELHYVEPCLNGTLVQADRANKEHFREHLDKLFAKGIGMLDIALVEAFNM 355 Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 L+ + I+ +TDG + + + + ++I Sbjct: 356 LSDFNHTG------------QGSICSQAIMLVTDGAVDTYDAVFEKYNWP----DRKVRI 399 Query: 365 VTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLIH 400 T I LK + + + + + Sbjct: 400 FTYLIGREAAFADNLKWMACANKGFFTQISTLADVQE 436 >gi|326925142|ref|XP_003208779.1| PREDICTED: epithelial chloride channel protein-like, partial [Meleagris gallopavo] Length = 395 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 10/81 (12%), Positives = 32/81 (39%), Gaps = 6/81 (7%) Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE-KVRQYVTR 282 ++ L+ A +FL + ++ +G++ + + + T RQ + + Sbjct: 320 SYNRITNLRTAAEVFL-----IQIIEVGSRVGIVTFESSAYERSPLVQITSVATRQRLVQ 374 Query: 283 DMDSLILKPTDSTPAMKQAYQ 303 ++ + T+ +++ Q Sbjct: 375 NLPTTAGGGTNICSGIEKGLQ 395 >gi|127512039|ref|YP_001093236.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] gi|126637334|gb|ABO22977.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] Length = 1182 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 20/257 (7%), Positives = 58/257 (22%), Gaps = 33/257 (12%) Query: 166 NQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 +IV I + + + + + + + + P+ Sbjct: 297 GDADGGRIVYDIKEMTSTNQNSLISLINGMPAQTNTPLCETLYEAYQYFSGGPVTFGNDD 356 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 +S + T I + G + + Sbjct: 357 KNAQGSYKIDGYIPNSPPSILTGGNYTTPFKKCPDTAY--IIYITDGAPTIDSNADSAII 414 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 SL ++ + + S + + + +G +N Sbjct: 415 SLASSAENTAN--YSGFDFVNSWGDTETSYLPALAAYMYNNDMVK--GVKDANGIDN--- 467 Query: 346 SNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEY----------HYNVVN 394 + +++ TI + + +LL+ +Y N Sbjct: 468 -------------KQNVRLFTIGFSEGADVAAKLLEEAAFRGGNPRDDSGVSKGYYVAKN 514 Query: 395 ADSLIHVFQNISQLMVH 411 L+ + + ++ Sbjct: 515 GLDLVAAMDDALKSILS 531 >gi|254438536|ref|ZP_05052030.1| type I secretion target GGXGXDXXX repeat protein domain protein [Octadecabacter antarcticus 307] gi|198253982|gb|EDY78296.1| type I secretion target GGXGXDXXX repeat protein domain protein [Octadecabacter antarcticus 307] Length = 951 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 29/352 (8%), Positives = 80/352 (22%), Gaps = 14/352 (3%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +A A +S + G ++ + I I ++G + Sbjct: 2 ADAMAAASTGDTISLAAGYGPEVVIVTLDNITITGPASATGIDLTIGAGVTGLTLDGEAP 61 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 + I + +N + + + L + + +SF Sbjct: 62 INVTDSGTDDIITGNSGDNIITVTGGADVVDGGAGDD-WLVLDYSGSAVTVSATTTSISF 120 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + + + I ++ +D ++ Sbjct: 121 AGSPGTVTIAPNSIEHFLVLTGGGTDTITLTTADGDNYFDTGEGASTVTVGSGDNVFVGG 180 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 +D++ + I + G+ + +D++ L ++ Sbjct: 181 SGIDTLTFGNGDNS------INLGYGTNGANTLTGGSGNNTILGSDGIDTVTLGGGNNRI 234 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + L+ + G I ++ + G + + + Sbjct: 235 EVGDGANTLSIGGTFNGNNVIIG-GSGIDTITVGGGNNCIDAGWGTAGANTITSSATGTG 293 Query: 357 AKE----NFIKIVTISINAS--PNGQRLLKTCVSSPEYHYNVVNADSLIHVF 402 I +T+ S G + + N I Sbjct: 294 TNSILGGEGIDTITLGNGDSYIAAGGGDNTITTGTGDSFITADNGIDTITAM 345 >gi|170732845|ref|YP_001764792.1| TadE family protein [Burkholderia cenocepacia MC0-3] gi|169816087|gb|ACA90670.1| TadE family protein [Burkholderia cenocepacia MC0-3] Length = 164 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 9/66 (13%), Positives = 22/66 (33%), Gaps = 7/66 (10%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ +I FA+ + F L++ +I + ++ LA + Sbjct: 7 VSGMRRRQRGATAIEFAMVLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARA 59 Query: 71 NLSRLG 76 L+ Sbjct: 60 ALNYQQ 65 >gi|319648589|ref|ZP_08002803.1| YabS protein [Bacillus sp. BT1B_CT2] gi|317389356|gb|EFV70169.1| YabS protein [Bacillus sp. BT1B_CT2] Length = 245 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 30/86 (34%), Gaps = 7/86 (8%) Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKT--C 382 I+ LTDG +N + + AKE I + I I + + + + Sbjct: 3 KGHLNQILLLTDGCSNRGED---PQAMAAFAKEQGITVNVIGIMDEHEMDQEAMKEVEGI 59 Query: 383 VSSPEYHYNVVNADSLIHVFQNISQL 408 + + +V L Q +++ Sbjct: 60 ALAGGGVHQLVYTSQLSQTVQMVTKK 85 >gi|226941651|ref|YP_002796725.1| outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] gi|226716578|gb|ACO75716.1| putative outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] Length = 2392 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 28/305 (9%), Positives = 69/305 (22%), Gaps = 36/305 (11%) Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 L+ D S+ + + ++ T + N + + Sbjct: 1554 LANDSDPDGKHSDLRIEDGSIRLADGMDGYYSITVDYDKGTVTVTPLDKDGNGTVDYVKD 1613 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIF 191 + + + V + I G+ Sbjct: 1614 LQLDYTVKDADGATDQATVT--VSPVDTVTGTSHGDT------LLGDSSGEIITGDVSGS 1665 Query: 192 LIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED 251 ++ ++ ++S + ++ KNA+ LD++ Sbjct: 1666 GGISTPGMNYNLCIILDSSGSMESYMSTGSYSSSARIDVAKNAIQTLLDNMADFDGTINL 1725 Query: 252 VYMGLIGYT------------TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + + + T ++ + T+ A Sbjct: 1726 SIVDFDSGSKMALQGVTLRDLCVQDAQGNWHLDTSAGSAFMQAMGNITATGGTNYESAFM 1785 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE 359 QA Q + P+ + F+TDG+ N KI Sbjct: 1786 QAEQWFSQQ----------------PTDGYTNHTYFVTDGKPTARYENSFADKISHTVSF 1829 Query: 360 NFIKI 364 + + + Sbjct: 1830 DSVTV 1834 >gi|254445696|ref|ZP_05059172.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198260004|gb|EDY84312.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 923 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 25/339 (7%), Positives = 79/339 (23%), Gaps = 39/339 (11%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A + + + + + S S + + +N Sbjct: 442 AEGLDNPNLVAASTANATQTAPNNDEPLPRTQNPPPTTQLSEYPESNTATDPQSTFSLNV 501 Query: 125 SRISMTH----MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S +S +A N + Y + + + P Sbjct: 502 SDVSYRLTEAYLAQNVRPPAGTLRTEEFVNAFDYG---DPTPPVARKIGFTWERAHWPFA 558 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ + + + + + + ++ + + Sbjct: 559 HDRDV-------LRFSLQTAAHGRASSQPLHLTLAIDTSGSMSRPDRVDIVNSLATALQS 611 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 ++ + ++ + + ++ T TD A++ Sbjct: 612 NLTEKD------RLSIVSFDRQPRLVLD-GQSVTAETNLATLATQLNPQGGTDLESALQL 664 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN-TIKICDKAKE 359 +YQ + + +I +TDG N +N + + Sbjct: 665 SYQTAQRHFQENAIN----------------RVILITDGAANLGNTNAEQLRTTVTENRI 708 Query: 360 NFIKIVTISINASPNGQRLLKT-CVSSPEYHYNVVNADS 397 I + I + L++ + + + + + Sbjct: 709 RGIALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSPED 747 >gi|171912852|ref|ZP_02928322.1| hypothetical protein VspiD_16770 [Verrucomicrobium spinosum DSM 4136] Length = 779 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 24/67 (35%), Gaps = 3/67 (4%) Query: 305 LTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKI 364 + + + P ++ LTDG++N S ++ KE+ I + Sbjct: 181 FEFKPEATATNLDQPLRDALGPNPAGTALVILTDGQHNGPGS---PEEMAAGLKESGIPV 237 Query: 365 VTISINA 371 T+ + Sbjct: 238 FTVGYGS 244 >gi|296134299|ref|YP_003641546.1| TadE family protein [Thermincola sp. JR] gi|296032877|gb|ADG83645.1| TadE family protein [Thermincola potens JR] Length = 133 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 36/135 (26%), Gaps = 9/135 (6%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 F + E + AL + +L+I ++ A GA V Sbjct: 5 FHQFRQRENGQALVEMALVLPVLILIIFGIVEFGRIMNT----YLIVTNAAREGARHGVV 60 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + R I+N I I + S ++ ++ + Sbjct: 61 GGTDADIRDAVINNAYTLDPAK-----ITVTISPAGSRTRGAPLTVQVSYSLDIIAPLIG 115 Query: 131 HMANNRLDSSNNTIF 145 + N + +T Sbjct: 116 VITGNPYIVTASTTM 130 >gi|291571720|dbj|BAI93992.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 449 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 39/132 (29%), Gaps = 14/132 (10%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 L T S M A ++++ + K + Sbjct: 289 TVADTIAKEVTQKLADGTTSSGTDMGTALKLVSQELKIPPMNERAL----------PPVL 338 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVV 393 + ++DG+ + + + + ++I I+I + L K + Sbjct: 339 VLISDGQPTDDFQSGLDELMKQPWGKKAVRI-AIAIGKDADEAVLQKFIGHTELKPLQAN 397 Query: 394 NADSLIHVFQNI 405 N ++L F+ I Sbjct: 398 NPEAL---FKYI 406 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 51/145 (35%), Gaps = 21/145 (14%) Query: 265 KNIEPSWGTEKVRQYVTRDMDS-LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 E S+ Q + + D + T++ A+ A I ++ Sbjct: 104 TEAEISFRYANRSQAIEKLNDLPYMACKTNTQLALNLAEMIFFNNTLGPLRPG------- 156 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTC- 382 ++ I+ TDG++N + + K+ ++I +++ G + + Sbjct: 157 ------RRRILIFTDGQSNVKEQMTLYRAF--RLKKRGVEIYVVAVGKYLYG--MHEIIG 206 Query: 383 -VSSPEYH-YNVVNADSLIHVFQNI 405 +S +H Y V + + + Q I Sbjct: 207 LATSSSHHLYRVRSMKDFVKIVQLI 231 >gi|83717072|ref|YP_440459.1| hypothetical protein BTH_II2271 [Burkholderia thailandensis E264] gi|83650897|gb|ABC34961.1| putative exported protein [Burkholderia thailandensis E264] Length = 180 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 8/129 (6%), Positives = 40/129 (31%), Gaps = 5/129 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL+ + Y+ ++ A A + + ++ + + Sbjct: 14 RGAVAVEFAIVMIPLVLLVTGVAEFGRAIYQYEALTKATRDAARYLSMYLPTDPAYPIAQ 73 Query: 79 FESISNHAKRALIDDAKRFIKN-----HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + + + + ++ N + + Sbjct: 74 AQCLAVYGSTTCGSTGSELAPGLATSMVVVCDAAHSTNCSDASDPAQFANVPTYDTDNGS 133 Query: 134 NNRLDSSNN 142 + + + Sbjct: 134 PDPASLAGS 142 >gi|134281853|ref|ZP_01768560.1| TadE family protein [Burkholderia pseudomallei 305] gi|134246915|gb|EBA47002.1| TadE family protein [Burkholderia pseudomallei 305] Length = 168 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 FE 80 + Sbjct: 62 AQ 63 >gi|58337998|ref|YP_194583.1| hypothetical protein LBA1740 [Lactobacillus acidophilus NCFM] gi|227902825|ref|ZP_04020630.1| conserved hypothetical protein [Lactobacillus acidophilus ATCC 4796] gi|58255315|gb|AAV43552.1| putative membrane protein [Lactobacillus acidophilus NCFM] gi|227869488|gb|EEJ76909.1| conserved hypothetical protein [Lactobacillus acidophilus ATCC 4796] Length = 1376 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 36/391 (9%), Positives = 91/391 (23%), Gaps = 35/391 (8%) Query: 15 IASEKANFSI--IFALSVMSFLLLIGFLI-------YVLDWHYKKNSMESANNAAILAGA 65 + ++KA+FSI + + + + I + + + +M+ + + + A Sbjct: 14 MNNQKAHFSIRKLTIGAASVLIGITFMGINGQTVNADEMTGNTQPEAMQVSTDKSTDATT 73 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + S + + D+ ++ + + + + S Sbjct: 74 TQQNDSDSSAQKNAAQTVEVPQNSKQDNTGETSQSTTDKKDTQVQKTESSETKADTAQSQ 133 Query: 126 RISMT--------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFI 177 N + S + L ++ + + Sbjct: 134 VPPAANQTQKSADQNVENTQSPVASNTTQPKTQNDSKYNVADWGGSLNDETHEYTLN--- 190 Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 + ++ I D + K A + Sbjct: 191 ----KYNGSDKENIYIPNTEDFIKAGKITDIDKVFITKDLIQNITKNG--------ATSI 238 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 +D K K S + M T A Sbjct: 239 VIDDQGSEDKNKVYAKGDWSNAFAGSTKLK--SVDLSHLDTSQVTSMAGAFNGDTGLNDA 296 Query: 298 MKQAYQILT-SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + I ++ F+ V + F + ++ +K Sbjct: 297 NLSGWNIQNMTNLSSLFYGANNLYNVDMSGWNFINNPNTNSMFSYADYNLKSVNLKNAKN 356 Query: 357 AKENFIKIVTISINASPNGQRLLKTCVSSPE 387 ++ ++I +I S L SP Sbjct: 357 VTDDILRIYARAIKNSNATTADLSDINLSPN 387 >gi|113476689|ref|YP_722750.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167737|gb|ABG52277.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 420 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 43/183 (23%), Gaps = 22/183 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVR 277 Q K K+ K A L + + + + L GY+TRV + + G Sbjct: 53 TSQSMKGEKLQRAKEACLAVVSHLRDPDY------LSLAGYSTRVTPLLESLAGGGAAAG 106 Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 D T A+ + L +K G T Sbjct: 107 FAEGAIADLQARGVTRIDLALDWIEESLLPEKSPPLVGVLITDGHA-------------T 153 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQR--LLKTCVSSPEYHYNVVNA 395 + I K I + + + + N L Sbjct: 154 NAGGTPLDDMKPFIVKARNMKSCGIILCAVGLGDAANFNTSFLTDLSDQGGGAFIYADTP 213 Query: 396 DSL 398 D L Sbjct: 214 DKL 216 >gi|328906790|gb|EGG26561.1| hypothetical protein PA08_1610 [Propionibacterium sp. P08] Length = 131 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 8/132 (6%), Positives = 27/132 (20%), Gaps = 12/132 (9%) Query: 17 SEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E+ + S+ L V L++ G + A + + + + Sbjct: 2 DERGGGSASVWMLLMVPVILVVAGLVFDGSRQISA----------TQAAQDAAVAATRAG 51 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 A + + + ++ + Sbjct: 52 TDAAATPQLAGRDGAAAAVQAARQALSAAGVSGSVQVGGVTITVTTSQSRPTVFLSAIGI 111 Query: 135 NRLDSSNNTIFY 146 +++ Sbjct: 112 SQVTGHGKAHAQ 123 >gi|238027564|ref|YP_002911795.1| TadE-like protein [Burkholderia glumae BGR1] gi|237876758|gb|ACR29091.1| TadE-like protein [Burkholderia glumae BGR1] Length = 156 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 37/117 (31%), Gaps = 1/117 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVSNL 72 S++ + S+ FAL + +F +++ +I + ++ +A A A ++ Sbjct: 9 LRRSQRGSMSVEFALILPAFFMVLYAVITYGLIFAAQQNLTLAATEGARAALNYQISPTP 68 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + A L Y + Y ++ S + Sbjct: 69 QAALTARAAAACTAATHLTGWLTGVSCASTPAGTCSYDSSMYCIKVTLTYPYSSSPL 125 >gi|256394484|ref|YP_003116048.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] gi|256360710|gb|ACU74207.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] Length = 583 Score = 39.1 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 32/131 (24%), Gaps = 10/131 (7%) Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV-----KIPSLPFQKFII 334 + + T M AY L K + Sbjct: 446 IAELGAAEPGGGTHGQR-MIAAYGALADKAGSRNGLYDVLLAAYQNVQKGWDPTRTNTVA 504 Query: 335 FLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHY 390 TDG++++ S + I I++ +++ + L K + Sbjct: 505 VFTDGKDDDLNSMTSDQLIAKLQAAVDPARPIRVFVVALGTDVDLTLLNKITAVTGGAAV 564 Query: 391 NVVNADSLIHV 401 + + + Sbjct: 565 HFSDMGQMTAA 575 >gi|167744178|ref|ZP_02416952.1| hypothetical protein Bpse14_39263 [Burkholderia pseudomallei 14] gi|167821377|ref|ZP_02453057.1| hypothetical protein Bpse9_40023 [Burkholderia pseudomallei 91] gi|167829719|ref|ZP_02461190.1| hypothetical protein Bpseu9_38930 [Burkholderia pseudomallei 9] gi|167851186|ref|ZP_02476694.1| hypothetical protein BpseB_38461 [Burkholderia pseudomallei B7210] gi|226194150|ref|ZP_03789750.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] gi|225933843|gb|EEH29830.1| TadE family protein [Burkholderia pseudomallei Pakistan 9] Length = 168 Score = 39.1 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 FE 80 + Sbjct: 62 AQ 63 >gi|254473781|ref|ZP_05087176.1| TadE-like protein [Pseudovibrio sp. JE062] gi|211957167|gb|EEA92372.1| TadE-like protein [Pseudovibrio sp. JE062] Length = 191 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 12/110 (10%), Positives = 34/110 (30%), Gaps = 5/110 (4%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 L + + K + + ++ FA+ L ++ ++ + E + A+ Sbjct: 15 LIKSKILIGKFAKNARGVTAVEFAIIAPFLLGMVMAILELGLSFLV----EVVLDNAVAE 70 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + + + ++ + K L + A K +S Sbjct: 71 ASRHIRTGQVFYDAEYD-LGKFKKHVLDNGAGLLNAVDEKIFISVQHRDN 119 >gi|160897507|ref|YP_001563089.1| TadE family protein [Delftia acidovorans SPH-1] gi|160363091|gb|ABX34704.1| TadE family protein [Delftia acidovorans SPH-1] Length = 180 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 10/122 (8%), Positives = 31/122 (25%), Gaps = 1/122 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ ++ FAL ++L+ + + S+ A ++ Sbjct: 32 LRRRQRGVAAVEFALIAGVMVMLLMGMFVYWRVLQAQQSLARATGDGARMVQHLAFGGMA 91 Query: 74 RLGDRFESISNHAKRALIDDAK-RFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 + + +A D + I + Q ++ + + Sbjct: 92 GYNPAKPAERANILQATSDVVNQSLAASGIPAGTAPTQIQIDWGTSQATLSVVYMLPPVL 151 Query: 133 AN 134 Sbjct: 152 GQ 153 >gi|330819181|ref|YP_004348043.1| TadE family protein [Burkholderia gladioli BSR3] gi|327371176|gb|AEA62531.1| TadE family protein [Burkholderia gladioli BSR3] Length = 257 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 16/131 (12%), Positives = 37/131 (28%), Gaps = 7/131 (5%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + + F + L L+ + ++A L A + N + L Sbjct: 12 QGGQSLVEFVIVAPVLLFFCFGLLQYALLFQARA----TLDSATLEAAREGAVNHAELDA 67 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRL 137 A+ A +L+ + + I+N +R + R Sbjct: 68 ---MQRGLARGLSPLYAYEANAAGASVALARANQAVRDRARIAIINPTRAQLDDFGQTRY 124 Query: 138 DSSNNTIFYNM 148 D+ + + + Sbjct: 125 DAQSRSTVREI 135 >gi|319782123|ref|YP_004141599.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168011|gb|ADV11549.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 554 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 9/114 (7%), Positives = 31/114 (27%), Gaps = 6/114 (5%) Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 S A+ A L K + + + ++ +TDG + Sbjct: 93 SGSAISAAADSLKFLGKTPLTAAVKQAAEALRYTEDKATVVLITDGLETCGGDPC---AL 149 Query: 354 CDKAKENFIKIV--TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVFQN 404 + + + + + + + + + ++ + + +L Sbjct: 150 GKELEASGVDFTADVVGFGLTADEGKQIACLADNTGGKYIQASDEKALQEALAE 203 >gi|126457913|ref|YP_001076992.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|167725251|ref|ZP_02408487.1| hypothetical protein BpseD_39896 [Burkholderia pseudomallei DM98] gi|167899816|ref|ZP_02487217.1| hypothetical protein Bpse7_39195 [Burkholderia pseudomallei 7894] gi|167908133|ref|ZP_02495338.1| hypothetical protein BpseN_38276 [Burkholderia pseudomallei NCTC 13177] gi|167916480|ref|ZP_02503571.1| hypothetical protein Bpse112_38767 [Burkholderia pseudomallei 112] gi|167924337|ref|ZP_02511428.1| hypothetical protein BpseBC_37623 [Burkholderia pseudomallei BCC215] gi|254182580|ref|ZP_04889174.1| TadE family protein [Burkholderia pseudomallei 1655] gi|254187132|ref|ZP_04893647.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|254192458|ref|ZP_04898897.1| TadE family protein [Burkholderia pseudomallei S13] gi|254296486|ref|ZP_04963942.1| TadE family protein [Burkholderia pseudomallei 406e] gi|126231681|gb|ABN95094.1| TadE family protein [Burkholderia pseudomallei 1106a] gi|157806328|gb|EDO83498.1| TadE family protein [Burkholderia pseudomallei 406e] gi|157934815|gb|EDO90485.1| TadE family protein [Burkholderia pseudomallei Pasteur 52237] gi|169649216|gb|EDS81909.1| TadE family protein [Burkholderia pseudomallei S13] gi|184213115|gb|EDU10158.1| TadE family protein [Burkholderia pseudomallei 1655] Length = 168 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 26/62 (41%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDAARYLSTYLPTDPAYPLAQ 61 Query: 79 FE 80 + Sbjct: 62 AQ 63 >gi|309358766|emb|CAP33788.2| hypothetical protein CBG_15420 [Caenorhabditis briggsae AF16] Length = 876 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 50/165 (30%), Gaps = 31/165 (18%) Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + + L ++ V +G++ Y+ R + + + + T + Sbjct: 54 NWTIKIVRDLPVHEDAVRVGIVQYSERNDIITHL------------ETLKFMPGEDTRTG 101 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A+ +A + L + II TDG S Sbjct: 102 VALSKADDEIFDYDGG-------------ARLKATRLIIVFTDG-----LSMDKPTLAAK 143 Query: 356 KAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + +KI TIS+N+ +L V + + + + + Sbjct: 144 ALRRKGVKIYTISVNSIGFVPEMLGI-VGDADNVFGPTDENRIEE 187 >gi|269954957|ref|YP_003324746.1| conserved repeat domain-containing protein [Xylanimonas cellulosilytica DSM 15894] gi|269303638|gb|ACZ29188.1| conserved repeat domain protein [Xylanimonas cellulosilytica DSM 15894] Length = 2553 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 37/376 (9%), Positives = 75/376 (19%), Gaps = 61/376 (16%) Query: 60 AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 A LA SK V + +++ I + +S + Sbjct: 54 AALAAPSKEVDPATHHEFPGVEDTDY-------YISSRNPGRIWTDKTVFSTDVAVPKDM 106 Query: 120 NIVNSSRISMTHMANNRLDS------SNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKI 173 + + + + L S + + + + N + Sbjct: 107 QDEENPIPDLQ-VGEDELAVALSAVGSTRHVSREVPTPVDVSLIIDNSTSMRQCVGNTNM 165 Query: 174 VSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN 233 + A + +M A+N + P + Sbjct: 166 CTGASADDN-----------PYTKSRAAAMVEALNVAIGIIAENPANRIAVTQFSENSGE 214 Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTD 293 + L + G Y T + + + ++ T+ Sbjct: 215 FVDLGTPKNAPRPSAQATFIGGPYVYLTHSGSTMTLTMTDDTRTGNARTRTLTVGGGSTN 274 Query: 294 STPAMKQAYQILTSD------KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + +L +R F G S + G N+ S Sbjct: 275 IHRGVATGMGLLAQANSVTGENQRIPNVILFTDGEPTYSASVTNWWAPGASGANHGPNSP 334 Query: 348 VNTIKICDKAKEN----------------------------FIKIVTISINAS--PNGQR 377 T + K + TI + S R Sbjct: 335 TATQYYGNGFKAALTASLLKNKIEAVYNDEAFNLAHARTPVEASVYTIGLGVSGLSEQGR 394 Query: 378 LLKTCVSSPEYHYNVV 393 L P Sbjct: 395 QLAYATLDPGKTNPAT 410 >gi|115927463|ref|XP_793559.2| PREDICTED: similar to putative cell wall protein FLO11p [Strongylocentrotus purpuratus] gi|115956322|ref|XP_001194238.1| PREDICTED: similar to putative cell wall protein FLO11p [Strongylocentrotus purpuratus] Length = 1496 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 23/314 (7%), Positives = 70/314 (22%), Gaps = 12/314 (3%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 +A ++A + ++ + + S + + S Sbjct: 36 NATDSAPVNATDSTPADATDSTPVNATDSTPMNATDSTPVNATDSTPVNATDSTPMNATD 95 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 +T + ++ + DSS + V + + + + V Sbjct: 96 STPVNATDSAPVNATDSTPATATDSSPVNATDSSPVNATDSTPVNATDSTPMNATDSTPV 155 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNA 234 + + + + ++ + +N+ +A T + + Sbjct: 156 NA-TDSSPVNATDSTPANVTDSSPVNATDSSPVNATDSTPANATDSSPVNATDSSPVNAT 214 Query: 235 LLLFLDSIDLLSHVKE----------DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD- 283 L++ D G + + Sbjct: 215 DSTPLNATDSTPVNGTESIPVNGTDATPVNGTDPMPVNATDSTPLNATDSTPLNATDATP 274 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++ T+ T + A T+ + ++ + D N Sbjct: 275 VNGTDSTTTNGTYSTPSAGNTTTTGTLPPITNLTTTLAPTTINFIAADLMLSIFDNNNTP 334 Query: 344 FKSNVNTIKICDKA 357 + + D Sbjct: 335 LIDSNLLVVEGDNI 348 >gi|15921064|ref|NP_376733.1| hypothetical protein ST0830 [Sulfolobus tokodaii str. 7] gi|15621848|dbj|BAB65842.1| 381aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 381 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 54/174 (31%), Gaps = 31/174 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ K L L+ I + + I +++ V IE + + V + ++ Sbjct: 57 KIETAKQGALQLLNKIPPGN------KITFITFSSTVNTLIEFADTSGSVGETISSVT-- 108 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 T A+ A QI +II LTDG + + Sbjct: 109 -AQGNTVLYTALSTAIQIAKK-------------------HGIPGYIILLTDGNPTDLTN 148 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 K+ + K+++ I N Q L + ++ + + + Sbjct: 149 TDAYEKLQFP---DGFKVISFGIGDDYNEQLLKVLADKTGGILNHIQDPMEIAN 199 >gi|325291589|ref|YP_004277453.1| hypothetical protein AGROH133_03083 [Agrobacterium sp. H13-3] gi|325059442|gb|ADY63133.1| hypothetical protein AGROH133_03083 [Agrobacterium sp. H13-3] Length = 197 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 25/88 (28%), Gaps = 4/88 (4%) Query: 2 HLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 + RF + + ++ FA+ L L + + SA +A Sbjct: 11 NGRLRFIALVARFARDRRGVGAVEFAIVFPILLALYLTSFELTIGYNTYKRASSA--SAT 68 Query: 62 L--AGASKMVSNLSRLGDRFESISNHAK 87 + + + + L + + Sbjct: 69 INDLISKTNSVDKAYLTSMQDVTAAVFA 96 >gi|218778089|ref|YP_002429407.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218759473|gb|ACL01939.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 377 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 32/97 (32%), Gaps = 5/97 (5%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE---NFIK 363 + + +I ++DG N + + K N + Sbjct: 125 ASGDSPMDLALSGAADDLARTNGSTAVILVSDGYKNAMDAAAAVAAA-KELKSRYMNKVC 183 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 I TI I + P G+ +L V + ++V NA + Sbjct: 184 IYTIQIGSDPTGKAILDKIVKAGGCGFSV-NAADIAD 219 >gi|87308939|ref|ZP_01091077.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] gi|87288282|gb|EAQ80178.1| hypothetical protein DSM3645_19318 [Blastopirellula marina DSM 3645] Length = 346 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 34/319 (10%), Positives = 72/319 (22%), Gaps = 55/319 (17%) Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 N D I + + SL +T S + + + N Sbjct: 77 ASVTCNFQSATSGDGVDVLIGDMPEVSLDDSGGETLDTVAAASEVSETVEVDELLNEVAP 136 Query: 139 SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG----ERPIFLIE 194 S + T+ L + + + + + Sbjct: 137 PSAEPTDIGELIDTA---ALTPSGAMGGGIGSLSTIGGGGSGVGAGASFMGLRAAGRRFC 193 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 ++ D SGSM K+ +K +L + S+ + + Sbjct: 194 IIADCSGSM--------------------SGVKLDYVKEEILETVSSL----PREAQFQV 229 Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 T+ PA + A + + Sbjct: 230 IFFQSQAVPFPQKGWRHPKRDFNALSEWLKTVGPAGGTNPLPAFEIALK-FSPR------ 282 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 + F+TDG ++ + + E +K+ IS Sbjct: 283 ---------------PDAVFFMTDGLFDDNVVGEVKRQ--NDLSEPKVKVHAISFMDRSA 325 Query: 375 GQRLLKTCVSSPEYHYNVV 393 + + S + +V Sbjct: 326 EPLMRQIAGESGGEYRHVQ 344 >gi|238923486|ref|YP_002937002.1| hypothetical protein EUBREC_1106 [Eubacterium rectale ATCC 33656] gi|238875161|gb|ACR74868.1| Hypothetical protein EUBREC_1106 [Eubacterium rectale ATCC 33656] Length = 209 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 20/190 (10%), Positives = 56/190 (29%), Gaps = 10/190 (5%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ + E + ++ + L I F+++V + + +K +E +A+L GA+K Sbjct: 1 MKRLL-DESGALIVEASIVFPTMFLAIFFMLFVGNAYTQKCKVEDIVVSAVLEGAAKCAD 59 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + + ++ + + + S++ Sbjct: 60 PMLDDIESGSIPDFNSVSLKPYRYLTGSLRDTEGDIESEINKNVGKIGTGFFTGMKPSIS 119 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + V Y+++ + + Q + I ++ + Sbjct: 120 IDRLKF-----ESKVIYSKVNVDVSYKIK----MPIRLLGQDEFTNISYATHTDVPVSDV 170 Query: 191 FLIELVVDLS 200 VD + Sbjct: 171 PEFIRTVDFA 180 >gi|83859352|ref|ZP_00952873.1| hypothetical protein OA2633_13145 [Oceanicaulis alexandrii HTCC2633] gi|83852799|gb|EAP90652.1| hypothetical protein OA2633_13145 [Oceanicaulis alexandrii HTCC2633] Length = 178 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 24/61 (39%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +R R + + + ++ FAL F LLI + V + +E+ N A Sbjct: 1 MSASARIRDRITRFHRAREGAAAVEFALIAAPFFLLIFATLEVALFFLGSTIIENGVNEA 60 Query: 61 I 61 Sbjct: 61 A 61 >gi|253996765|ref|YP_003048829.1| TadE family protein [Methylotenera mobilis JLW8] gi|253983444|gb|ACT48302.1| TadE family protein [Methylotenera mobilis JLW8] Length = 173 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 11/101 (10%), Positives = 33/101 (32%), Gaps = 3/101 (2%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 +L + Y K+ ++ ++ FA+ + ++ ++ Y N+++ A Sbjct: 2 MLKMMKSYSKR--QRQQGAAAVEFAIIALLLFTILFGILEFGRLFYVYNTVQEVTRHAAR 59 Query: 63 -AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 A + ++ + + A + I Sbjct: 60 EAVVRWVDNSNTSPAKILALFGGASVPAGAEITAANIDIQY 100 >gi|307133504|dbj|BAJ19016.1| TadF [Aggregatibacter actinomycetemcomitans] Length = 192 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 6/45 (13%), Positives = 21/45 (46%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 KK +++ ++ F ++ + + FL ++ + +++A Sbjct: 10 KKFFHNKRGAVTLEFLFMLILLVFIFAFLTDLVIVRTTQGKLDNA 54 >gi|186475468|ref|YP_001856938.1| 3-oxoacyl-(acyl carrier protein) synthase II [Burkholderia phymatum STM815] gi|184191927|gb|ACC69892.1| Beta-ketoacyl synthase [Burkholderia phymatum STM815] Length = 408 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 22/211 (10%), Positives = 52/211 (24%), Gaps = 18/211 (8%) Query: 182 RIEMGERPIFLIELVVDLSGSM-----HCAMNSDPEDVNSAPICQDKKRTKMAALKNALL 236 + E + D S S +S + A + + N+ + Sbjct: 77 AVRASELALADAGFAGDESISDGRMGVAYGSSSGSVEPIRAFGTMLESGSMTDVTSNSYV 136 Query: 237 LFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTP 296 + ++ G I T+ + + G + + L Sbjct: 137 QMMPHTTAVNVSLFWDLKGRIVPTSSACASGSQAIGYAYENIAMGKQTLMLAGG------ 190 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK 356 + L+ F T + + DG + ++ + Sbjct: 191 -----AEELSGPAVAVFDTLYATSTRNDEPHLTPRPFDAKRDGLVVGEGAATLVLEEYEH 245 Query: 357 AKENFIKIVT--ISINASPNGQRLLKTCVSS 385 AK I + + +G + + S+ Sbjct: 246 AKARGATIHAEIVGFGCNSDGAHMTQPTAST 276 >gi|146302073|ref|YP_001196664.1| von Willebrand factor type A (vWA)domain-containing protein [Flavobacterium johnsoniae UW101] gi|146156491|gb|ABQ07345.1| Uncharacterized protein with a von Willebrand factor type A (vWA) domain-like protein [Flavobacterium johnsoniae UW101] Length = 372 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 30/107 (28%), Gaps = 3/107 (2%) Query: 269 PSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSL 327 +G + + + T++ ++ A IL + + G Sbjct: 227 LVFGNDAWTIPIKDLPYLQVGPYHTNTVAGLQLAMDILRRKRNTNKQIFMITDGKPSCVR 286 Query: 328 PFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 ++G + T +A++ I I T I P Sbjct: 287 ERDGSYYMNSNGLDEYIVDKCYTQ--AQQARKLHIPITTFMIANDPY 331 >gi|78221957|ref|YP_383704.1| type IV pilus assembly protein PilY1 [Geobacter metallireducens GS-15] gi|78193212|gb|ABB30979.1| type IV pilus assembly protein PilY1 [Geobacter metallireducens GS-15] Length = 991 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 34/353 (9%), Positives = 79/353 (22%), Gaps = 82/353 (23%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL----------------LNQRYNQKIV 174 A + S +F V + L + + R + Sbjct: 24 LAAMSFAAISQYPLFLTGSVQPNVMILLDNSGSMNTIMEHRDYSPGTVYSGSFRGGEIYF 83 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPED--------------------- 213 + I VD + +N Sbjct: 84 NRIDYSKSGNPYYLVSRDTGHTVDGNNQNQYTVNGRSITLPFPYVDTRWNGNYLNWLFFH 143 Query: 214 -----VNSAPICQDKKRTKMAALKNALLLFLDS----------IDLLSHVKEDVYMGLIG 258 S + T++ ++ + + + ++ + G + Sbjct: 144 ATAAQYGSLATDASIRVTRIQTARSVISDVVKNVSGVRFGLFKLNNSQGGSKVKDCGALT 203 Query: 259 YTT--------RVEKNIEPSWGTEKVRQYVTRDMDSLILK--PTDSTPAMKQAYQILTSD 308 TT E ++ QY T T ++ Y I+ +D Sbjct: 204 PTTVDAAVDGINAETWTPLGETLSEIWQYFRGGDSLYNSGTYTTPITNGCQKNYTIIVTD 263 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE--------- 359 + ++ + + N + + + + Sbjct: 264 GEPTYDNCYQGPFASYGCPKDP--------YDKENENAPSHLADVAKDMHDGNASPFSGR 315 Query: 360 -NFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + TI + + L +T V+ ++ + SL QN +V Sbjct: 316 VQNVSTYTIGLTL--DSALLDQTAVNGGGKYFTTTSGISLATAVQNALADIVS 366 >gi|299744810|ref|XP_001831286.2| von Willebrand RING finger domain-containing protein [Coprinopsis cinerea okayama7#130] gi|298406298|gb|EAU90449.2| von Willebrand RING finger domain-containing protein [Coprinopsis cinerea okayama7#130] Length = 1139 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 28/280 (10%), Positives = 70/280 (25%), Gaps = 16/280 (5%) Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 + + S ++ L+ R Q +S + R Sbjct: 546 LSGSTQTSVSTVDSLLQNSARSTMSSSTSHNGSLMPPRTLQHKLSTLGEDDREMEPYDVP 605 Query: 191 FLIELVVDLSGSMHCAMNSD----------PEDVNSAPICQDKKRTKMAALKNALLLFLD 240 LV + S + + K+ +K L + Sbjct: 606 S--PLVAPHNSSGPSNSLPSLPHPPMDLILVVSLPPPGSVPSTAQLKIRVIKTTLDFTVA 663 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 S+ + + +G +V K S G + + + + +D + +S + Sbjct: 664 SMGPKDRIS--LVTFEVGPGGKVRKTPFLSVGRPQSKARLEKFIDETGSRLDESQD--EF 719 Query: 301 AYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN 360 + +K V + + + ++ + +A+ Sbjct: 720 LVRGSRDEKTDVVTAVNHGLDVVLQRKARNPISGLILVSDASDSTRRAQMDLVLARAEAA 779 Query: 361 FIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + I + S + L + + V + L Sbjct: 780 NVPIHSFGYGRSHDPASLWLMSNHTSGTYTFVKDWYDLRS 819 >gi|311112130|ref|YP_003983352.1| hypothetical protein HMPREF0733_10460 [Rothia dentocariosa ATCC 17931] gi|310943624|gb|ADP39918.1| conserved hypothetical protein [Rothia dentocariosa ATCC 17931] Length = 140 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 13/141 (9%), Positives = 48/141 (34%), Gaps = 3/141 (2%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M L+ R ++ E + FA+++ + + ++ + ++ +E ++ A Sbjct: 1 MRLIHRMIRAWRLRARDE-GAVTAEFAVALPAVIAVLALCLSAAAVGMAQSRLEESSRVA 59 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A A NL+ + I + + ++ + + + Sbjct: 60 ARAFAR--GDNLAAVQHEVSRIDPSINLQVQTVEETSGARQVRVQVERAAPGVIGSVTGW 117 Query: 121 IVNSSRISMTHMANNRLDSSN 141 + ++ + +++ ++ Sbjct: 118 KLQATATARVEGSSDNATTAG 138 >gi|297490310|ref|XP_002698147.1| PREDICTED: integrin alpha X-like [Bos taurus] gi|296473225|gb|DAA15340.1| integrin alpha X-like [Bos taurus] Length = 889 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 11/59 (18%), Positives = 27/59 (45%), Gaps = 5/59 (8%) Query: 356 KAKENFIKIVTISINASPNGQRLLK---TCVSSPE--YHYNVVNADSLIHVFQNISQLM 409 +A+ I I + ++ ++ LK S+P + + V N D+L + + + + + Sbjct: 120 RAEAAGIIRYAIGVGSAFRYKKSLKELIDIASTPSEEHVFQVENFDALRDIQKQLKEKI 178 >gi|227551457|ref|ZP_03981506.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895933|ref|ZP_05675586.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293377027|ref|ZP_06623237.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227179401|gb|EEI60373.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257832498|gb|EEV58919.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292644243|gb|EFF62343.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 477 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/161 (8%), Positives = 41/161 (25%), Gaps = 9/161 (5%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L G++ S + D ++I+ +G A + + + + Sbjct: 214 NTNVLDHQGNKNGSNQWDGINSWDGDPNDRTHSYIEYGGTGNQADYAIRKFAKETTTPGL 273 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ-RYNQKIVSFIPALLRIE-M 185 + + + + + + R + + + + Sbjct: 274 F-------DVYLNARGNVQKDITPLDLVLVVDWSGSMNDNDRIGEVKIGVDRFVDTLADS 326 Query: 186 GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 G + V S + + + + Q K T Sbjct: 327 GITDKINMGYVGYSSEGHNYSNGTVQMGSFDSVKNQVKSIT 367 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 34/333 (10%), Positives = 94/333 (28%), Gaps = 26/333 (7%) Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA--- 133 +S S L + + + + +G + + + H+ Sbjct: 3075 SVTDSNSGLTSNGLAITLTKVGSDIVGSTTAGEVFRISVASNGTVTLTQSAELDHLPEDV 3134 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLI 193 +N D++ ++ ++++ + + + I + + +I Sbjct: 3135 DNSNDNNLISLANGKVLLSATVTVVDGDNDTATGTVSADLGGNIRFEDDVPTAKDNSVVI 3194 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 ++ +++ + + ++ K+AL +D + + Sbjct: 3195 TEAGLPPFNLVMVIDTSGSMLWQIGTSTNGSPNRLELAKDALNHMID---SYVALGVPLV 3251 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 +I + + I + + + ++ T+ + A LT+D Sbjct: 3252 FTVIDFASGA-VLIPQTSDPDVAKASISGLPT--DGGGTNYNAPLVLAQNQLTADLANPA 3308 Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 ++ + FL+DG N + N +++ + +N S Sbjct: 3309 LAG------------YETKVYFLSDGAPNEGNVPAGWTSFVN---SNNVEVYAVGLNVSG 3353 Query: 374 NGQ--RLLKTCVSSPEYHYNVVNADSLIHVFQN 404 N L + V N L Q Sbjct: 3354 NATAIAQLGLVEDHGDAVTLVNNIYDLDATLQA 3386 >gi|323498382|ref|ZP_08103382.1| putative RTX toxin [Vibrio sinaloensis DSM 21326] gi|323316624|gb|EGA69635.1| putative RTX toxin [Vibrio sinaloensis DSM 21326] Length = 1914 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/335 (7%), Positives = 62/335 (18%), Gaps = 35/335 (10%) Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + D + + + + Sbjct: 1338 DSQGQANVEIKVPYDYKGDLDFPVSVTATSIESANSDSATTTETVDVTARDYTLQSGSHG 1397 Query: 134 NNRLDSSNN---------TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 N+ + + + I + + L + + K + Sbjct: 1398 NDTITAGADNDLIVGDVQGIEIIPGQDYNIAFVLDTSGSMGGNVSDAKDEILDVFDELVA 1457 Query: 185 M-------GERPIFLIELVVDLSG--SMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 G I L E S S++ A + + A + L Sbjct: 1458 AVNQGAHPGTVNIMLSEFASGASQVISVNLADANVRDTFEDALDDMKDSGQGSTNYEAGL 1517 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV---------TRDMDS 286 +D + I + + + D Sbjct: 1518 QSAVDWFGSQPNPSAQNITYFITDGNPNASTQLNNLDKSNFSKVILDVDSSGQLVTLADI 1577 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + ++L ++ RQ + DG + + Sbjct: 1578 VESSNYSVGQTVNYKGEVLIDQW-GRVYSPLTRQMIGDIDANGSSLRYDDYDGNSTQAQH 1636 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKT 381 + + I + N L + Sbjct: 1637 MYQVLAALSS-------VEAIGLGNGVNSTTLKQY 1664 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 22/208 (10%), Positives = 52/208 (25%), Gaps = 20/208 (9%) Query: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 S S D + + + ++ + + +G Sbjct: 32 CGGSISDIWLDVVVVVDNSQRVNKRSFVSS----TRDTINNIFREASI-----PRTRVGF 82 Query: 257 IGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN 316 + Y ++ N K + Y +S M + + Sbjct: 83 VTYNSQATTN----ADLNKFKSYGDLQQ-----GVYNSYNDMNLSPEKTPYIGTGLIAAG 133 Query: 317 FFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG- 375 Q + + N ++ + + + K I I+TI+++ NG Sbjct: 134 ELLQIQGSADGHVNHPKVIIAYATALNGTGLLDPLSVANTLKSAGITIITIAVDTDDNGV 193 Query: 376 -QRLLKTCVSSPEYHYNVVNADSLIHVF 402 ++ L S + + V Sbjct: 194 IEKQLAPLASPGAAFGSNYKIGDIQSVM 221 >gi|167584961|ref|ZP_02377349.1| hypothetical protein BuboB_06471 [Burkholderia ubonensis Bu] Length = 167 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 7/138 (5%), Positives = 38/138 (27%), Gaps = 2/138 (1%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA ++ +LL+ + Y+ ++ + A ++ + S+ + Sbjct: 2 RGVAAVEFAFVLIPLVLLVTGVAEFGRAIYQYEALTKSTRDAARYLSTYLSSDPAYPLAA 61 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 + ++ + + ++ + + + Sbjct: 62 AQCLAVYGSTTCGATGTEIAPG--LSTTQVVVCDASHSSGCGDASDPPQFANVATYDTNN 119 Query: 139 SSNNTIFYNMDVMTSYDY 156 +++ + Sbjct: 120 GASSGTPSGSINLVEVKI 137 >gi|310830511|ref|YP_003965612.1| hypothetical protein PPSC2_p0151 [Paenibacillus polymyxa SC2] gi|309249978|gb|ADO59544.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 187 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ + + K +FSI+F + ++ ++++ + +L + ++S + + ++ V Sbjct: 4 LKQLLKNNKGSFSILFVILILIGVMVVTAFVDILIESWSMQEVQSVMDTSGVSALRASVD 63 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + I + + E S +T+ V+ + I + Sbjct: 64 E---------------TKLRIGEFDVEKSVAVSEYKKLVSMRLKDTKKIKSVHFNPIEIE 108 Query: 131 HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 T V+ + I +P + R Sbjct: 109 SFTEKWGT--GETKKARPQVLLDSSIVIVVSGS--------TIFDLVPGTAEVFYNSRKG 158 Query: 191 FLIELVV 197 E+ Sbjct: 159 SDFEVTY 165 >gi|294789671|ref|ZP_06754904.1| pilus-associated protein [Simonsiella muelleri ATCC 29453] gi|294482383|gb|EFG30077.1| pilus-associated protein [Simonsiella muelleri ATCC 29453] Length = 948 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 11/60 (18%), Positives = 19/60 (31%), Gaps = 3/60 (5%) Query: 360 NFIKIVTISINASPNGQ---RLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSV 416 I TI A + + L K ++ N N L F +I + + + Sbjct: 126 QKIGTFTIGFGAGLSERGLTSLSKMATANKNVALNASNQKELDAAFTSIIKQITSENIQI 185 >gi|156101461|ref|XP_001616424.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148805298|gb|EDL46697.1| hypothetical protein, conserved [Plasmodium vivax] Length = 1837 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 29/318 (9%), Positives = 75/318 (23%), Gaps = 19/318 (5%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA LA ++ G ++ A + +S + +F Sbjct: 1284 AASLAANQPANQPANQPGGGASNLFGAGGSASSGNFFNSGSGSGALGMSTNTGLFNQ--- 1340 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS--------------YDYRLQFIEHL 164 + N++ ++++ + +N + + +S ++ Sbjct: 1341 GSSANTNSPLFGGLSSSSANMANAQFAKSGQMGSSNLFNSGPSAAAGMGISSLSNNSGNM 1400 Query: 165 LNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK 224 NQ + + F L SM + + + Sbjct: 1401 FGSLNNQSNTNMFASNNNSGTPFGKNFSTGSGSQLKDSMFPSAQPSSRNQMMSSNFFSAN 1460 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN-IEPSWGTEKVRQYVTRD 283 + M+ N+ F ++ + I + Q Sbjct: 1461 KDMMSTATNSGSFFSNTTGNTPTGVNSGASAFGSKQNFGTPDMISGGMNSSHASQSGNLF 1520 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTN-FFRQGVKIPSLPFQKFIIFLTDGENN 342 + S + L+S+ + + G NN Sbjct: 1521 NRGGMYDGQMSAVPGATTFNNLSSNSGSRMDNSLSANKYGNYFQNNGPSKSNIFAAGANN 1580 Query: 343 NFKSNVNTIKICDKAKEN 360 + ++ + + +N Sbjct: 1581 LSSGSNTSLFQANSSNKN 1598 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 46/176 (26%), Gaps = 21/176 (11%) Query: 234 ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEK---VRQYVTRDMDSLILK 290 + + I + + + +GL+ Y + Sbjct: 79 TVFGYDTRIGVRDYEPKTTRVGLVSYNADAKILAGLDTYQSYDDLANGVFDSLNSVSATD 138 Query: 291 PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNT 350 + + A ++ K + ++K +I + +N Sbjct: 139 ESYLAKGLSAAEKVFEEGKSTA------------NRTQYKKVVIVY--ASSYKGTGELNP 184 Query: 351 IKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN----ADSLIHVF 402 + + D+ K +KI+T++ + + + L + + + Sbjct: 185 VPVADRMKTAGVKIITVAFSQNNDDGLLKDLSEIASLDFDFANTDKGVVEEIQGAM 240 >gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42] gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 780 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 56/193 (29%), Gaps = 25/193 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + +L L + ++ + +V T + + + + Sbjct: 348 NSGSMSGQSIEQARQSLALAISRLNPND--RFNVIRFDDTMTDYFKGLVAATPD--NREK 403 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 V T+ PA++ A + + + ++FLTD Sbjct: 404 AVAYVRSLTADGGTEMLPALEDALRNQGPVASGAL-----------------RQVVFLTD 446 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G N + I A ++ T+ I ++PN + K + + D + Sbjct: 447 GAIGNEQQLFQEIT----ANRGDARVFTVGIGSAPNTYFMTKAAEVGRGTFTQIGSTDQV 502 Query: 399 IHVFQNISQLMVH 411 + + + Sbjct: 503 ASRMSELFAKLQN 515 >gi|313835037|gb|EFS72751.1| TadE-like protein [Propionibacterium acnes HL037PA2] gi|314929711|gb|EFS93542.1| TadE-like protein [Propionibacterium acnes HL044PA1] gi|314972065|gb|EFT16162.1| TadE-like protein [Propionibacterium acnes HL037PA3] Length = 131 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 8/132 (6%), Positives = 27/132 (20%), Gaps = 12/132 (9%) Query: 17 SEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 E+ + S+ L V L++ G + A + + + + Sbjct: 2 DERGGGSASVWMLLMVPVILVVAGLVFDGSRQISA----------TQAAQDAAVAATRAG 51 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 A + + + ++ + Sbjct: 52 TDAAATPQLAGRDGAAAAVQAARQALSAAGVSGSVQVGGVTITVTTSQSRPTLFLSAIGI 111 Query: 135 NRLDSSNNTIFY 146 +++ Sbjct: 112 SQVTGHGKAHAQ 123 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 53/188 (28%), Gaps = 42/188 (22%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + V+D+SGSM K K A LD + + Sbjct: 305 PNTRKNVIFVIDVSGSM--------------------YGQKTRQTKRAFTTILDDVRPID 344 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 + ++ + S +++V + T+ ++ +A +IL Sbjct: 345 RINIILFSSYAHVWREDQMVEATSDNIAAAKRHVNGLS---VGGGTNIYDSLMKAVEILL 401 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + II LTDG+ N + V + + + + + Sbjct: 402 E----------------HDTGDAMPLIIMLTDGQVGNAAAIVRDVTSVIGGR---LSLFS 442 Query: 367 ISINASPN 374 I + Sbjct: 443 IGFGNGVD 450 >gi|253687075|ref|YP_003016265.1| hypothetical protein PC1_0674 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753653|gb|ACT11729.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 186 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 11/119 (9%), Positives = 30/119 (25%), Gaps = 6/119 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA------GAS 66 + S + A V L+ + L + + +A + A+ A + Sbjct: 25 RHWRSTHGVIATEVAFLVPVVLVGVMMLFELARIGLVIAAGSAALDKAVQAFRLDNLASD 84 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 +RL R + +++ + + ++ Sbjct: 85 SAEQMGTRLKARMVEAGYGYLKEDDLTVSVLHFDNLSQLGGLTNGNSSQDNPNGEETTT 143 >gi|209886525|ref|YP_002290382.1| Flp pilus assembly protein TadG [Oligotropha carboxidovorans OM5] gi|209874721|gb|ACI94517.1| Flp pilus assembly protein TadG [Oligotropha carboxidovorans OM5] Length = 197 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 40/149 (26%), Gaps = 9/149 (6%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLD--------WHYKKNSMESANNAAILAG 64 + + + ++ FA+ + L+L I V + + + A Sbjct: 16 RLMRDTRGVAAVEFAVILPVILMLFLGTIGVSTGVAVYRKVIILTRT-LSDLISQAQKLE 74 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 AS + + + + +A I + + G S ++V Sbjct: 75 ASDIPNAFNISSAVMAPYPSAPVQAKISQVYIEPTTLVAKVKWGASLNATARGCNDVVTE 134 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTS 153 + L S + Y S Sbjct: 135 LVPDGIRIGGTYLIMSEVSYDYTPVAGVS 163 >gi|86748913|ref|YP_485409.1| hypothetical protein RPB_1790 [Rhodopseudomonas palustris HaA2] gi|86571941|gb|ABD06498.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 206 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 16/136 (11%), Positives = 39/136 (28%), Gaps = 9/136 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R + + FA V LL+ + + A + + + Sbjct: 12 RRCSMRLAKDRSGLAATEFAFIVPLMLLMFFATVELSAGI--------AVDRKVTLVSRT 63 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS-GYSAVFYNTEIQNIVNSSR 126 + +S+ +S + A + + ++S Y ++Q ++ Sbjct: 64 LSDLVSQATTVTDSDLKNVFAASYGVLAPYPTSTADATISEIYVNDAGVAKVQWSKAATV 123 Query: 127 ISMTHMANNRLDSSNN 142 A L +S+ Sbjct: 124 AQSGSTATATLATSSR 139 >gi|220918111|ref|YP_002493415.1| hypothetical protein A2cp1_3014 [Anaeromyxobacter dehalogenans 2CP-1] gi|219955965|gb|ACL66349.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 408 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 7/38 (18%), Positives = 17/38 (44%) Query: 32 SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMV 69 + + V + +++A++A LAGA ++ Sbjct: 1 MLCAFLALALNVGHLFSVRGELQNASDAGALAGAIELD 38 >gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 791 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 21/195 (10%), Positives = 56/195 (28%), Gaps = 29/195 (14%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE--PSWGTEKV 276 + + +L L + ++ +I + + + + + Sbjct: 362 NSGSMSGPSIEQARQSLALAISKLNPDDRFN------VIRFDDTMTDYFKGLVTATPDNR 415 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T+ PA++ A + + + ++FL Sbjct: 416 EKAIGYVRGLTADGGTEMLPALQAALRNQGPVASGAL-----------------RQVVFL 458 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG N + I A ++ T+ I ++PN + K + + D Sbjct: 459 TDGAIGNEQQLFQEIT----ANRGDARVFTVGIGSAPNTYFMTKAAEMGRGTFTAIGSTD 514 Query: 397 SLIHVFQNISQLMVH 411 + + + + Sbjct: 515 QVASRMGELFAKLQN 529 >gi|332366663|gb|EGJ44406.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1059] Length = 450 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 96/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 A +S + + + + A D KN + + Sbjct: 53 AVSSPIEAAGTEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETST 112 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 113 VDVTKTEVAPAKVDKAPEPITRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 167 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 168 PPTVTKAGSADIVFVVDRSGSMGSTIDIVRTNINEFVRNITKEGI---TARFGLATFSDE 224 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 225 VYGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 264 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 265 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 324 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + ++ ++ N +L I + + +Y Sbjct: 325 IERTVATV------KAIEGIYKSFATEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 376 Query: 415 SVILK 419 I+K Sbjct: 377 YKIIK 381 >gi|156047970|ref|XP_001589952.1| hypothetical protein SS1G_08716 [Sclerotinia sclerotiorum 1980] gi|154693113|gb|EDN92851.1| hypothetical protein SS1G_08716 [Sclerotinia sclerotiorum 1980 UF-70] Length = 1171 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 54/173 (31%), Gaps = 31/173 (17%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K++ +++AL + ++ MGL+ + + T K + + S Sbjct: 648 KISLVRDALKFMVSNLGDRD------RMGLVTFGSSGGAA-PLVGMTSKTWNGWQQILAS 700 Query: 287 LILKPTDST-----PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + S A +L + + I+ ++D Sbjct: 701 IRPVGQKSHRVDVVEGANVAMDLLMQRRSNNPIA----------------TILLISDSST 744 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 ++ +S + + AK I + + + + +++ + + V + Sbjct: 745 SDAESVDFVVSRAEAAK---IAVHSFGLGMTHKPDTMIELSTRTKASYTYVKD 794 >gi|297279798|ref|XP_002801794.1| PREDICTED: integrin alpha-10-like isoform 4 [Macaca mulatta] gi|297279800|ref|XP_002801795.1| PREDICTED: integrin alpha-10-like isoform 5 [Macaca mulatta] Length = 1024 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 120 GRPEAARLLVVVTDGESHDGEELPAALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFL 176 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 177 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 214 >gi|297279792|ref|XP_002801791.1| PREDICTED: integrin alpha-10-like isoform 1 [Macaca mulatta] gi|297279794|ref|XP_002801792.1| PREDICTED: integrin alpha-10-like isoform 2 [Macaca mulatta] Length = 1167 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++T S P+ + +NV + +L + + + Sbjct: 320 REIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|83589208|ref|YP_429217.1| hypothetical protein Moth_0340 [Moorella thermoacetica ATCC 39073] gi|83572122|gb|ABC18674.1| hypothetical protein Moth_0340 [Moorella thermoacetica ATCC 39073] Length = 208 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 4/45 (8%), Positives = 13/45 (28%), Gaps = 3/45 (6%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESA 56 ++ + + + F + + L + K + A Sbjct: 62 RRFLRDARGFVLLEFLIVFLIMAFLTFGGVDYWVILSKH---QYA 103 >gi|226307664|ref|YP_002767624.1| hypothetical protein RER_41770 [Rhodococcus erythropolis PR4] gi|226186781|dbj|BAH34885.1| conserved hypothetical protein [Rhodococcus erythropolis PR4] Length = 569 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 35/362 (9%), Positives = 80/362 (22%), Gaps = 41/362 (11%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A L ++ S + + + + + + Sbjct: 222 RAALVPHAQAESVRTEDQPVGQKMLTELVSSGKVGVSTEQQVVEFTKAESKISATVPATG 281 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY-RLQFIEHLLNQRYNQKIVSFI 177 I++ + + ++ + + + L + + Sbjct: 282 TMIMDYPLVVSASAPDRHYAAAKKSDAIAELLHSESAQEVLTRSGFRTSGGGAVEGGVGD 341 Query: 178 PALLRIEMGERPIFLIE------------LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 L + + + +D+S SM + Sbjct: 342 VTALTVSEQSVADQTLGAWALMAKPIRTLVAIDVSASMDYPAAGGKTRMQLTTAAALAGN 401 Query: 226 TKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 ++ L F ++ +E V + + + R Sbjct: 402 QVFPDSVAAGLWAFSQGMNGTQDFQELVPIRRYDTIVDGVTQRKLMADQANRLTELKR-- 459 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 T A++ + +I LTDG N + Sbjct: 460 -----GATGLYDTTLAAFRKVQESY----------------DPRAVNSVIILTDGANEDP 498 Query: 345 KSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S + A+E + IVTI I + L + Y + + Sbjct: 499 DSISREQLLDVLAREQDPARPVIIVTIGITDDADAAALADISRVTGGSTYIARDPSEISE 558 Query: 401 VF 402 VF Sbjct: 559 VF 560 >gi|146292169|ref|YP_001182593.1| type IV pilin biogenesis protein [Shewanella putrefaciens CN-32] gi|145563859|gb|ABP74794.1| type IV pilin biogenesis protein, putative [Shewanella putrefaciens CN-32] Length = 1165 Score = 38.7 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 46/196 (23%), Gaps = 24/196 (12%) Query: 228 MAALKNALLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-------RQY 279 L A F + + K I T + G+ Y Sbjct: 329 CETLYEAYQYFSGGQVTFGNKDKNGTGNNKIDGYTPNNPPSILTSGSYTTPFKKCPDTAY 388 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + D + + + +F G K + ++ + Sbjct: 389 IIYITDGAPTLDKSADTLINGLVANAKNQAANYAAFSFTNAGNKTETSYMPALAAYMYNN 448 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEY---------- 388 + + I + I++ TI + + LL+ Sbjct: 449 DIVIGNKDSLGID-----NKQNIRVFTIGFSDGADEAAALLEETAFRGGNPRGSNNISKG 503 Query: 389 HYNVVNADSLIHVFQN 404 +Y N L+ ++ Sbjct: 504 YYVAKNGLDLVSALED 519 >gi|304310105|ref|YP_003809703.1| Putative type IV fimbrial biogenesis protein PilY1 [gamma proteobacterium HdN1] gi|301795838|emb|CBL44037.1| Putative type IV fimbrial biogenesis protein PilY1 [gamma proteobacterium HdN1] Length = 1368 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 22/51 (43%) Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRKYSVILKG 420 ++ N L ++S ++ + D+++ F+ I + +R S G Sbjct: 624 DSDNNVYDLWHAAINSRGEFFSAESPDAIVQAFREIVSRISNRTTSAGAPG 674 >gi|169624023|ref|XP_001805418.1| hypothetical protein SNOG_15261 [Phaeosphaeria nodorum SN15] gi|111056366|gb|EAT77486.1| hypothetical protein SNOG_15261 [Phaeosphaeria nodorum SN15] Length = 296 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 28/182 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + A D ++ A ++A+ L + + M + + E + Sbjct: 11 DNSEASRNGDYVPSRWEAQQDAVNLIFSAKTGANPESSVGLMSMG--GSTPEILTTLTTD 68 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 KV + + I + + A L + + ++ Sbjct: 69 IGKV---LDGLHRTKIKGNSHFVTGINVAALALKHRQ----------------NKSQKQR 109 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLL---KTCVSSPE 387 II + + N IK+ K K+ I I ++ + ++L + S Sbjct: 110 IIIFN--CSPIEEDEKNLIKLAKKMKKTGINIDIVAFGELSDDTTKKLQAFSENVSSGEG 167 Query: 388 YH 389 + Sbjct: 168 SY 169 >gi|257069050|ref|YP_003155305.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] gi|256559868|gb|ACU85715.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] Length = 340 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 48/195 (24%), Gaps = 20/195 (10%) Query: 233 NALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ-------YVTRDMD 285 L + L E + L + + + + V++ + D Sbjct: 108 EYDTEILSTFAELVDTFEGERVALSIFNSTSRTVFPLTNDYDLVQRELEAGAEAIDFDEF 167 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV--------KIPSLPFQKFIIFLT 337 L D + + Y + + IIF T Sbjct: 168 GYRLGNRDYSEDEVRQYVEFVDGTRGIADEASVIPDGLASCAQEFDQAEEDRSRSIIFAT 227 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINA---SPNGQRLLKTC-VSSPEYHYNVV 393 D E N + ++ I + T A P LKT + Sbjct: 228 DNEVNGDPLFT-LQEAAERVAARDIDLYTFYPGAYECGPQCYEELKTATEEQGGELHESS 286 Query: 394 NADSLIHVFQNISQL 408 + +++ + I + Sbjct: 287 DPEAIPSIINQIQKS 301 >gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 794 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 55/193 (28%), Gaps = 25/193 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + K +L L + + + +V T + + + + Sbjct: 362 NSGSMSGPSIEQAKQSLALAISRLTPND--RFNVIRFDDTMTDYFKGLVAATPD--NREK 417 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ PA++ A + + + ++FLTD Sbjct: 418 AIAYVRGLPADGGTEMLPALEDALR-----------------NQGPVATGALRQVVFLTD 460 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G N + I A ++ T+ I ++PN + K + + D + Sbjct: 461 GAIGNEQQLFQEIT----ANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTQIGSTDQV 516 Query: 399 IHVFQNISQLMVH 411 + + + Sbjct: 517 ASRMGELFAKLQN 529 >gi|17570773|ref|NP_508098.1| hypothetical protein ZK1193.2 [Caenorhabditis elegans] gi|1118062|gb|AAA83291.1| Hypothetical protein ZK1193.2 [Caenorhabditis elegans] Length = 1250 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 49/190 (25%), Gaps = 8/190 (4%) Query: 222 DKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT 281 D + + A + +++I + D L+G + + Y Sbjct: 1062 DSSGSISDEMYYATVGAVNTIGNAISIGHDHSRILLGTYDAISHFSGDLNTLDTFEAYQN 1121 Query: 282 RDMDSLILKPTDS-TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 + D L T ++ + ++ F V+ + + Sbjct: 1122 KLADLFSLGYTGINGNNIQSVMDYI---VLQNNSAPFRPAPVRKFLMLLSSQGWDKGNVS 1178 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + + ++ I + S N +L S V + + L Sbjct: 1179 EGKENGFSDPAPAARNLHKLGLETFAIGLGTSANMTQLNAIAKCST----QVNDQNELTS 1234 Query: 401 VFQNISQLMV 410 I L+ Sbjct: 1235 TISQIISLLC 1244 >gi|38233798|ref|NP_939565.1| hypothetical protein DIP1208 [Corynebacterium diphtheriae NCTC 13129] gi|38200059|emb|CAE49735.1| Putative secreted protein [Corynebacterium diphtheriae] Length = 470 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 31/353 (8%), Positives = 79/353 (22%), Gaps = 17/353 (4%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 AA + + + + A + + A + + + Sbjct: 125 AATDQVGIARREDATEDAPVTYPVKSDAVSSALVAAFTHNNDASAATKMLRDFSETTIDS 184 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 V S + + + N+ L + + + Sbjct: 185 A--VAQSAPKIVVSKSETPQGYSFEEIQNLSKPLR-AVALNPTDSVNEEVVRAGADFGTA 241 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 ++P L + V + + +S P + + + + + Sbjct: 242 IAANTPDAQKPQKLSQEVATAAEILASLTDSSPAVDSQSNQSPRDILMLLDTSEYINGVS 301 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKN------IEPSWGTEK-VRQYVTRDMDSLILKP 291 D I + R + + G + R + D + Sbjct: 302 TDGIRWFDAASHGLINLAQKQEERGKAIGLWNYSSPLTPGITQGWRSNLNFDTLDAARQV 361 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 D Q T + + D S Sbjct: 362 KDVLGNFGTGGQPQTLAATSAALNYAVEHAQHAGNTSVVIVFTGTND-------SFTGLA 414 Query: 352 KICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + KA+E+ + + I I + + + + + + +A L + Sbjct: 415 ESVRKAQESHVSLHAIQIGDGQDDTNIAQAAKDTNGSYVHATSAAELDRAIAS 467 >gi|326333218|ref|ZP_08199465.1| PE-PGRS family protein [Nocardioidaceae bacterium Broad-1] gi|325948862|gb|EGD40955.1| PE-PGRS family protein [Nocardioidaceae bacterium Broad-1] Length = 539 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 29/364 (7%), Positives = 88/364 (24%), Gaps = 22/364 (6%) Query: 52 SMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSA 111 M SA + + A + + A ++ + + Sbjct: 182 RMMSALGTVVASQAKPAREARLEAYGERIASGDEAAAPTDSGRLAELQTVMSGVTATSEQ 241 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMT-SYDYRLQFIEHLLNQRYN 170 + + + + L +++++ + S L Sbjct: 242 QWSTYAPMLKAAAPKEGTVVLDYPVLITADSSRRNTIAESITSLARVLD-SGAADTALTQ 300 Query: 171 QKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD------PEDVNSAPICQDKK 224 + +G V + + + + + A Sbjct: 301 AGFRTAGAGKPSPNVGAYTRVDAASVTSAAAAWNAKLRPTRTIAVVDVSGSMAWPAGSGA 360 Query: 225 RTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T++ + A +D + + +GL ++ + + Sbjct: 361 TTRLQLTQAAASEAVDLLPGGAA------VGLWAFSEKSAGM--LDEDHTVLVPTRQLAT 412 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T++ + A + + + ++ TDG N++ Sbjct: 413 KDQRAQLTEAIAGL-TARTGGGTALHDTALAAYQAAVAAY-DPLASNTVLLFTDGTNDDP 470 Query: 345 KSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 S + + ++++ I I A + L ++ Y + + Sbjct: 471 DSMDLDELVRQLEAASDPRRPVRVLGIGITADADLGALQAIADATGGAAYVAERPEDVGT 530 Query: 401 VFQN 404 V + Sbjct: 531 VLRE 534 >gi|256374467|ref|YP_003098127.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918770|gb|ACU34281.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 564 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 39/362 (10%), Positives = 99/362 (27%), Gaps = 46/362 (12%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A+ A+LA E L+D IK + +G +A + Sbjct: 240 ASEQAVLAANRAAGGLRVAASYPAEG------TMLLDYPVVRIKRASDQPGTGVAASGFE 293 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 +++ R ++ V + + Sbjct: 294 QALRSAKTRERFVDAGFRTPDGQAAAGLSAERDGVGGDAV-----------NAMPKPSPA 342 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + LL + V+D+SGSM M + + +A + Sbjct: 343 EVSELLGTWGAVSLDSRMLAVLDVSGSMTELMGNGQTRMAAASEAALTALGMLPDTSE-- 400 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 + ++ + + + + + + +++Q + T Sbjct: 401 ---IGLWAFSTNKRPPNDWVELVPLGPLGEVLGSAPRRTRLQQGAKGLAAL-VGGGTALN 456 Query: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 A++ + S ++ +TDG N+++ S + Sbjct: 457 DTTLAAFRRMQSTY----------------DPEKINSVVLITDGRNDDYASITTAQLLQA 500 Query: 356 KAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 E+ I ++ + + + + L + ++ Y + A + V + + Sbjct: 501 LESESDPARPIPLIMVGLGQEADMEALQEISSATGGKAYQALEAADIRSV---LLDAISQ 557 Query: 412 RK 413 R+ Sbjct: 558 RR 559 >gi|123445195|ref|XP_001311360.1| hypothetical protein [Trichomonas vaginalis G3] gi|121893166|gb|EAX98430.1| hypothetical protein TVAG_264790 [Trichomonas vaginalis G3] Length = 382 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 40/110 (36%), Gaps = 8/110 (7%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 + AM++ I K P F K I LTDGE + N+ K Sbjct: 10 NVHEAMEKISHIDADYKGTDILKTLNFVFGLKPQDGFVKQIFLLTDGE------DRNSDK 63 Query: 353 ICDKAK--ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 IC +A+ N +I +I + + + S + + + +++ Sbjct: 64 ICAEAQMNRNQTRIFSIGLGDGADPGLIKGVADKSGGSYTLITDEENMNE 113 >gi|107022592|ref|YP_620919.1| TadE-like [Burkholderia cenocepacia AU 1054] gi|116689541|ref|YP_835164.1| TadE family protein [Burkholderia cenocepacia HI2424] gi|105892781|gb|ABF75946.1| TadE-like protein [Burkholderia cenocepacia AU 1054] gi|116647630|gb|ABK08271.1| TadE family protein [Burkholderia cenocepacia HI2424] Length = 164 Score = 38.7 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 9/63 (14%), Positives = 22/63 (34%), Gaps = 7/63 (11%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ +I FA+ + F L++ +I + ++ LA + L+ Sbjct: 10 MRRRQRGATAIEFAMVLPVFFLILYAIITYGMIFAAQQNL-------TLAATEGARAALN 62 Query: 74 RLG 76 Sbjct: 63 YQQ 65 >gi|328545284|ref|YP_004305393.1| TadE-like protein [polymorphum gilvum SL003B-26A1] gi|326415026|gb|ADZ72089.1| TadE-like protein [Polymorphum gilvum SL003B-26A1] Length = 196 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 39/106 (36%), Gaps = 7/106 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R F + + +I FA+ FL+L+ +I + N + + A++ Sbjct: 24 RGLFGSFSRNRRGTTAIEFAIIAGPFLVLMFGIIEFGLAFFV-NRI---LDHAVMESTRL 79 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF 113 + + ++ + ++ K + F+ ++ + + + Sbjct: 80 IRTGQAQKANFDKA---AFKAEVCTHLTDFLCDNARFDVDVRTFST 122 >gi|53804181|ref|YP_113983.1| MxaC protein [Methylococcus capsulatus str. Bath] gi|53757942|gb|AAU92233.1| putative MxaC protein [Methylococcus capsulatus str. Bath] Length = 325 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 27/229 (11%), Positives = 55/229 (24%), Gaps = 44/229 (19%) Query: 193 IELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDV 252 I L++D S SM+ + + +K A + L F+ Sbjct: 84 IVLLMDRSSSMNENFSGR-------YLGGTAGESKNAVARRLLADFVRRRGD-------D 129 Query: 253 YMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQ--------- 303 ++ ++ + + E V + D T+ P + A Sbjct: 130 LFAMVAFSAAPRYVMPLTQDREAVLAAIDSVGDRG-HGITNIAPGLAMALDFFNGRPATG 188 Query: 304 ----ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK- 358 +L SD + + N+ + D A Sbjct: 189 ARIILLVSDGATRIEEESQDMIRQWFQDSQSRLYWIYLR-STNSAPLDRPPTGASDDASP 247 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 E + S+ Y N ++ +I + Sbjct: 248 EYLLHRYFQSLGVPYTA--------------YEATNPRAVEEAIADIGR 282 >gi|21741095|emb|CAD40135.1| OSJNBa0061C06.10 [Oryza sativa Japonica Group] Length = 1194 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 59/204 (28%), Gaps = 27/204 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMD--- 285 K+A + + + + ++ S + R Sbjct: 560 TKHATQTWREKLKNTVKFARNCIHISSVNSSLPTPTFPASQHANSTTARSAPAAADGISC 619 Query: 286 -SLILKPTDSTPAMKQAY-QILTSDKKRSFFTNFFRQGVKIPSLPFQ------KFIIFLT 337 + T++ ++++ LT + F + + Q II ++ Sbjct: 620 KLSKVLLTNAIQPVQKSISDYLTCSNHENLQHAIFLRKAGMLDNRPQEEKDRMGAIIVIS 679 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 D +N+ IC +A I T N N + + S + + + + Sbjct: 680 DNHDNS---------ICMEALSTNYNIHTFGFNGMHNVRAMYNIASRSNGMYDLLNDDRN 730 Query: 398 L-IHVFQNISQLMVHRKYSVILKG 420 L F + + S+I+ G Sbjct: 731 LITEAFISCMNKIT----SIIVLG 750 >gi|29828406|ref|NP_823040.1| hypothetical protein SAV_1864 [Streptomyces avermitilis MA-4680] gi|29605509|dbj|BAC69575.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 462 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/209 (8%), Positives = 49/209 (23%), Gaps = 25/209 (11%) Query: 170 NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKK--RTK 227 ++ +P + I L + + TK Sbjct: 1 MSHGLAAVPGEFKANEATTHFVEIALKAGRARPEDAPATEPLPVNFVFVVDTSGSMTGTK 60 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQYVTRDMDS 286 + +K+AL + +G+I + V + + ++ Sbjct: 61 LDTVKSALQTIYRELRPADC------LGIITFDHNVRTVLPAVAKQDLPPERFAEVVSAL 114 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 TD ++ ++ + +DG+ + + Sbjct: 115 TTQGGTDIDLGVQYGIDEISRHSVSG---------------RTVNCLYLFSDGDPTSGER 159 Query: 347 NVNTIKICDKAKENF-IKIVTISINASPN 374 + ++ AK + + + Sbjct: 160 DWIKVRANVAAKLRGDLTLSCFGFGSDAR 188 >gi|332534651|ref|ZP_08410483.1| TPR domain protein in aerotolerance operon [Pseudoalteromonas haloplanktis ANT/505] gi|332035931|gb|EGI72412.1| TPR domain protein in aerotolerance operon [Pseudoalteromonas haloplanktis ANT/505] Length = 643 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 33/144 (22%), Gaps = 14/144 (9%) Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM-DSLILKPTDSTPAMKQAYQILTSDKKRS 312 L+ Y + + + + + K ++ + A ++LT Sbjct: 129 TALVAYAGTAYVISPLTNDATTLSNLIPSLSPEIMPDKGSNVLAGLDMAKELLTQAGYID 188 Query: 313 FFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDK-----AKEN--FIKIV 365 G+ T N + K+ I + Sbjct: 189 GDIILVTDGIDQQEQSDVSSFTSNTQYRLNIYGVGTAQGAPIKLPEGGFLKDRYGQIVVP 248 Query: 366 TISINASPNGQRLLKTCVSSPEYH 389 TI N +L SS Sbjct: 249 TI------NASQLKNLASSSGGKF 266 >gi|222111710|ref|YP_002553974.1| type IV fimbrial biogenesis protein pily1 [Acidovorax ebreus TPSY] gi|221731154|gb|ACM33974.1| type 4 fimbrial biogenesis protein PilY1 [Acidovorax ebreus TPSY] Length = 1205 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 32/339 (9%), Positives = 75/339 (22%), Gaps = 34/339 (10%) Query: 95 KRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 + + L+ N R+ N + +N Sbjct: 131 QSMNNSTTGSGLTLSDLGTAAATNGTSHNMMRVLDATHRANFIAYANKYSASGNTPTHRM 190 Query: 155 DYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDV 214 R +++ + + +P + L + + + V Sbjct: 191 VQRAD--DYMRATPHQNGPWADVPGTKLADAKNDNK---PLGCRRNYHILLTDGGWNQYV 245 Query: 215 NSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTE 274 S RT L ++ + + + + Sbjct: 246 QSTTPLNYDNRTSPDLPNGKPYTATAQTRLYRDSEDFTTIADWAFQSWANPLRPATDLHG 305 Query: 275 KV-------RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKR--SFFTNFFRQGVKIP 325 KV + + + + K T + + +D +F F + Sbjct: 306 KVSPLPEYDNAPASETITNRVTKTTATLEKF-WNPKYDPADWAHMVTFTIGFSTDALPQK 364 Query: 326 SLPFQKFIIFLTDGE----NNNFKSNVNTIKICDKAKENFIKIVT-----------ISIN 370 + K +TD + + D A T + Sbjct: 365 NYSLNKDGTSVTDQGAITAPTTLLPYGDDGNLADYANGT----YTWRAYGGSSTGTTVTS 420 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLM 409 G + ++ Y V + L F+ I + + Sbjct: 421 TRDRGHDMWHAALNGRGRFYAVEKGEDLKEAFRQIVRTI 459 >gi|116052331|ref|YP_792642.1| hypothetical protein PA14_55820 [Pseudomonas aeruginosa UCBPP-PA14] gi|115587552|gb|ABJ13567.1| putative membrane protein [Pseudomonas aeruginosa UCBPP-PA14] Length = 556 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 36/306 (11%), Positives = 90/306 (29%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E SA + I + N++ + Sbjct: 66 SAQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVH--- 122 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + + F +++ + Sbjct: 123 ----------------------VEVTKSVPGSLVANLGGLFGGGNANVDLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T + A D+ + + MG Sbjct: 161 NATISAGTGLASVNSGQSALLNPILSGLLGTHIDLSAAAYNGIADAKLSVLDILGADGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LIG T + + ++Q + ++ L S A++ + S + Sbjct: 221 LIGVDTSLGTIEQLLNTNVGLQQVLAASVNVLAKNGVASVEALRTQLVGVKSATLKLGDL 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|311696337|gb|ADP99210.1| protein containing a von Willebrand factor type A (vWA) domain [marine bacterium HP15] Length = 704 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 70/223 (31%), Gaps = 36/223 (16%) Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL--DSIDLL 245 + LV SG+M + S + + R AL+ L D +++ Sbjct: 324 EDYLMAMLVPGESGAMALPRDLVFVIDTSGSMAGESIRQARDALQAGLGTLTPRDRFNVI 383 Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + + + + + + RQYV R T+ PA+ +A + Sbjct: 384 QFNSQTHSLFM-------QPEVATGNNLARARQYVDRL---RADGGTEMAPALSRALEG- 432 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 G + + +IF+TDG +I + ++ Sbjct: 433 ---------------GGETEDGARVRQVIFITDGA--VGNEAALFRQIRQQLGNQ--RLF 473 Query: 366 TISINASPNGQRLLKTCVSSPEYHYNVVNADS----LIHVFQN 404 T++I ++PN + + + + + L +F Sbjct: 474 TVAIGSAPNRHFMREAARWGRGTYTAIHSPSDVDGPLQALFSA 516 >gi|296390996|ref|ZP_06880471.1| hypothetical protein PaerPAb_22709 [Pseudomonas aeruginosa PAb1] Length = 556 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 36/306 (11%), Positives = 90/306 (29%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E SA + I + N++ + Sbjct: 66 SAQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVH--- 122 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + + F +++ + Sbjct: 123 ----------------------VEVTKSVPGSLVANLGGLFGGGNANVDLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T + A D+ + + MG Sbjct: 161 NATISAGTGLASVNSGQSALLNPILSGLLGTHIDLSAAAYNGIADAKLSVLDILGADGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LIG T + + ++Q + ++ L S A++ + S + Sbjct: 221 LIGVDTSLGTIEQLLNTNVGLQQVLAASVNVLAKNGVASVEALRTQLVGVKSATLKLGDL 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|167566938|ref|ZP_02359854.1| hypothetical protein BoklE_30541 [Burkholderia oklahomensis EO147] Length = 168 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 28/62 (45%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + + ++ FA+ ++ +LL+ + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGSVAVEFAIVMIPLVLLVTGVAEFGRAIYQYEALTKATRDATRYLSTYLSTDSAYPLAQ 61 Query: 79 FE 80 + Sbjct: 62 AQ 63 >gi|156386852|ref|XP_001634125.1| predicted protein [Nematostella vectensis] gi|156221204|gb|EDO42062.1| predicted protein [Nematostella vectensis] Length = 398 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 55/188 (29%), Gaps = 23/188 (12%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL- 287 L+ + L +D ++ + ++ Y + + + S Sbjct: 225 NNLERVIAYILKFLDRFDFAQDKTRVAVVEYDVVPHTKLTLLGSGLASISDIREIVASQI 284 Query: 288 --ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK 345 K T + A+K A + +++ K ++ +TDG + Sbjct: 285 RFSGKSTRTDLALKNAQNLFQREQRSG----------------VPKTLVIITDGRSWY-- 326 Query: 346 SNVNTIKICDKAKENFIKIVTISINASP--NGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 + K +K + +++ +RL C + + L Sbjct: 327 DDTLLNGAIAGIKALGVKTLAVAVGDDSMLELERLQSICETLDNVIKLGEYENLLASADT 386 Query: 404 NISQLMVH 411 +++L+ Sbjct: 387 TLNKLVQT 394 >gi|148692918|gb|EDL24865.1| RIKEN cDNA 1700112N15 [Mus musculus] Length = 492 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 4/93 (4%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVS 384 II LTDG K ++T++ KA+ + T+ + ++ L Sbjct: 18 GGRIVNSVIIALTDGLLL-LKPYLDTMEEAKKARRMGAIVYTVGVFM--YSKQQLVNIAG 74 Query: 385 SPEYHYNV-VNADSLIHVFQNISQLMVHRKYSV 416 P+ + V +L V ++ SV Sbjct: 75 DPDRCFGVDEGFSALEGVVDPLTSKSCTEILSV 107 >gi|328867084|gb|EGG15467.1| type A von Willebrand factor domain-containing protein [Dictyostelium fasciculatum] Length = 854 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 40/361 (11%), Positives = 90/361 (24%), Gaps = 39/361 (10%) Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + + ++I+N + L + I + + N V Sbjct: 110 AIKTKDDAFNTFSDAIANGYQSFLAEVDDSTFVISIGNIPPPSNPMSKELSSDNTVTIET 169 Query: 127 ISMTHMANNRLDSS---NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + D + F ++ + L + N + P Sbjct: 170 TIQSMLGTQTNDLHYYMHKCFFPTSQYSPNFSLTMNVQVELSSPIQN---ILVQPYKHNT 226 Query: 184 EM-GERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSI 242 + G + + + + ++ + TK AL D + Sbjct: 227 VIEGNKATITVTKTTSVLATFLLSITPSSNQQKPLYHMEYNPTTKSTALGITFYPQFDHL 286 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL---------KPTD 293 + + ++ LI + + EK ++ + M SL T Sbjct: 287 SIDDIQTKGEFIFLIDCSGSMAGFT-----IEKAKKTLQILMRSLPTNSYFNIYCFGGTY 341 Query: 294 ---------STPAMKQAYQILTSDKKRSFFTNFFRQGVKI-----PSLPFQKFIIFLTDG 339 A S + +K + I +TDG Sbjct: 342 KSLFQTSVLYDDDNLAAASEYISSIDANMGGTQLISPLKRIYDFETFPNHPRQIFIITDG 401 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 E N I ++ +I T+ + + + + Y+ V N ++ Sbjct: 402 EIPN---RDYVIDFVASKRDVS-RIFTVGLGSDVDKALVTGLSNVCGGYYELVTNNATME 457 Query: 400 H 400 Sbjct: 458 T 458 >gi|323705583|ref|ZP_08117157.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] gi|323535060|gb|EGB24837.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] Length = 229 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 33/79 (41%), Gaps = 4/79 (5%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN-GQRLLKTCVSSPEYH 389 K II +TDG++N N A I++ +I I + +K SS Sbjct: 6 KQIIVVTDGKSNVG---GNPADAAFLAYRKGIRVSSIGIVDDGKLSIKEIKDIASSGGGV 62 Query: 390 YNVVNADSLIHVFQNISQL 408 Y+++ +D I ++Q Sbjct: 63 YDIIYSDDFIRSLSMVTQK 81 >gi|319791199|ref|YP_004152839.1| tade family protein [Variovorax paradoxus EPS] gi|315593662|gb|ADU34728.1| TadE family protein [Variovorax paradoxus EPS] Length = 393 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 6/55 (10%), Positives = 21/55 (38%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 ++ +++ +L + ++ + + ++E A + A AG+ Sbjct: 9 RRQRGVSMTETMIALPVLILFVMCVVQFGLIYRARLTLEYAAHEAARAGSLNNGM 63 >gi|114706777|ref|ZP_01439677.1| TadE-like protein [Fulvimarina pelagi HTCC2506] gi|114537725|gb|EAU40849.1| TadE-like protein [Fulvimarina pelagi HTCC2506] Length = 178 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 18/69 (26%), Gaps = 18/69 (26%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIY-----------------VLDWHYKKN 51 FK+ +I F L F L +I V Sbjct: 16 SLFKRLRKDRGGVAAIEFVLCAFPFFALFLAIIETAILFTAGIVLESGVQGVARQILT-G 74 Query: 52 SMESANNAA 60 ++SA + A Sbjct: 75 QLQSAGDEA 83 >gi|308473944|ref|XP_003099195.1| CRE-CLEC-63 protein [Caenorhabditis remanei] gi|308267668|gb|EFP11621.1| CRE-CLEC-63 protein [Caenorhabditis remanei] Length = 414 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 15/158 (9%), Positives = 49/158 (31%), Gaps = 18/158 (11%) Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPS--WGTEKVRQYV-TRDMDSLILKPTDSTPA 297 I +GL+ Y + + + + + V + + Sbjct: 94 RIGNQYTDPRSTRVGLVTYNKVATEVADLNHIQSIDDLYSVVFSTLTSVSSEDDSYLATG 153 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + A ++ + + + +++ ++ + +++ + D+ Sbjct: 154 IGAAEKVFQNGRNGNV------------RSNYKRLVLVYASAYKGDGQNDPIPVS--DRL 199 Query: 358 KENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVN 394 K + + I TI+ + + L +SP Y + + Sbjct: 200 KSSGVVISTIAFDQDGDEALLAGLAQIASPNYAFTSED 237 >gi|159184182|ref|NP_353183.2| hypothetical protein Atu0148 [Agrobacterium tumefaciens str. C58] gi|159139514|gb|AAK85968.2| Atu0148-1 mutant of a conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 198 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 23/55 (41%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 L+R + +K S +I FA+ + + L++ +I + + +A Sbjct: 3 LARLKPLLEKFGFSRDGTAAIEFAILAIPYFLVVFAIIETFIALMAEQVVANATE 57 >gi|257455026|ref|ZP_05620271.1| putative pilus assembly protein tip-associated adhesin PilY1 [Enhydrobacter aerosaccus SK60] gi|257447598|gb|EEV22596.1| putative pilus assembly protein tip-associated adhesin PilY1 [Enhydrobacter aerosaccus SK60] Length = 1268 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 22/217 (10%), Positives = 51/217 (23%), Gaps = 27/217 (12%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 + L + ++ YN + A + + + Sbjct: 45 ALPTPGNPNIMMMLDISGSM-DRTYNSISGCYNRAATQTNTYT-------YTDPSNSANT 96 Query: 205 CAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV- 263 ++ ++ + ++ LKNALL L + + + + + + Sbjct: 97 FSVTANLWQCKANQAGTIILYDRIGTLKNALLKMLTDNKVNNDIGLGIGVFSYNGDGKTG 156 Query: 264 ---EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL-------------TS 307 + + + T + A +A + S Sbjct: 157 IIRYPIQKLT--MANRQLMANYVAGLTASGGTPTPSAFAEAGAYMLGTNTSGTGSGFTNS 214 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D + I LTDGE + Sbjct: 215 DNITKIPNRTRYLQGAQETTCAGNGIYLLTDGEPDTS 251 >gi|167842451|ref|ZP_02469135.1| hypothetical protein Bpse38_37655 [Burkholderia thailandensis MSMB43] Length = 168 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 9/129 (6%), Positives = 41/129 (31%), Gaps = 5/129 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGAVAVEFAIVMIPLVLLATGVAEFGRAIYQYEALTKATRDATRYLSTYLSTDPAYPVAQ 61 Query: 79 FESISNHAKRALIDDAKRFIKN-----HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + I S + +++ + + + Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVIVCDASHTTNCSDSSDPAQFASVPTYDTNNGS 121 Query: 134 NNRLDSSNN 142 + + + Sbjct: 122 PDPASLAGS 130 >gi|294496937|ref|YP_003560637.1| hypothetical protein BMQ_0087 [Bacillus megaterium QM B1551] gi|295702304|ref|YP_003595379.1| hypothetical protein BMD_0085 [Bacillus megaterium DSM 319] gi|39640|emb|CAA45619.1| unnamed protein product [Bacillus megaterium] gi|294346874|gb|ADE67203.1| conserved hypothetical protein [Bacillus megaterium QM B1551] gi|294799963|gb|ADF37029.1| conserved hypothetical protein [Bacillus megaterium DSM 319] Length = 246 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 31/82 (37%), Gaps = 7/82 (8%) Query: 331 KFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKT--CVSSP 386 K ++ +TDG +N + + + AKE I + I + + + Q + S Sbjct: 7 KQMLVITDGGSNTGED---PVAMAALAKEQGISVNVIGVMEEDTIDEQSTNEIEGIAMSG 63 Query: 387 EYHYNVVNADSLIHVFQNISQL 408 +V L Q +++ Sbjct: 64 GGVSQIVYVKQLSQTVQMVTRQ 85 >gi|18309499|ref|NP_561433.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens str. 13] gi|18144176|dbj|BAB80223.1| hypothetical protein [Clostridium perfringens str. 13] Length = 614 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 32/294 (10%), Positives = 75/294 (25%), Gaps = 35/294 (11%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL---------LNQRYNQKIVSFIPALLRI 183 N + + + + + K++ IP Sbjct: 68 GNYEITLTVKGKPKKVTKPVDILLIIDASNSMYYNMDELKASMNSLVDKVIDNIPDSRIA 127 Query: 184 EMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM-AALKNALLLFLDSI 242 + + + S + + T + ++ + A +F + + Sbjct: 128 VVAFGTYSEEVFSFNNKNNFTSKEEYKKAIKKSYNNIEGRGNTNIESSWRLADEIFKNEL 187 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNI----EPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + K+DV GY + G E QY K + Sbjct: 188 NNNFNSKKDVIFFSDGYPNESMNMLLESKYLEGGREYYEQYKKHLEQIYKTKIKKDSDYE 247 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 K L + +S + K + I + +N ++ + ++ K + Sbjct: 248 KILMDELNNGNYKSIDEWALYEYEKFYNNYPDTNIFSVALIDNIPYEDKNSAKELLSKMQ 307 Query: 359 ENFIKIVTI--SINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 + TI N N+ SL ++ I+ ++ Sbjct: 308 NSGY--FTIDSRFGQDNNE-----------------NNSKSLKEIYDKIANNII 342 >gi|167574007|ref|ZP_02366881.1| hypothetical protein BoklC_29495 [Burkholderia oklahomensis C6786] Length = 168 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 7/62 (11%), Positives = 28/62 (45%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + + ++ FA+ ++ +LL+ + Y+ ++ A A ++ + ++ + + Sbjct: 2 RGSVAVEFAIVMIPLVLLVTGVAEFGRAIYQYEALTKATRDATRYLSTYLSTDSAYPLAQ 61 Query: 79 FE 80 + Sbjct: 62 AQ 63 >gi|269962897|ref|ZP_06177237.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832451|gb|EEZ86570.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 3773 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 31/328 (9%), Positives = 80/328 (24%), Gaps = 11/328 (3%) Query: 30 VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRA 89 + + G I + + + A + + + S S +K Sbjct: 1757 IPVLSIAPGGWIDFVMESQIR---QDALDQIVATPKYNGSNFNSAKVTPEGSNLTVSKEI 1813 Query: 90 LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS-SNNTIFYNM 148 + D K ++ + F T + + + ++N R++ T Sbjct: 1814 VSIDGKSYVSGDTYKPEDEVIYKFTVTNTEPVWRDQTVIQDIVSNVRVEVIGGTTKSAFS 1873 Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMN 208 + S+ + + A +++ + ++G + Sbjct: 1874 ETNISHAFT-----STGTTGTQDTYIESYDATDDLDLLVDIAPEEVIEFTVTGKVRPDAL 1928 Query: 209 SDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK--N 266 D + Q + + L + + + G Sbjct: 1929 GDIDGNTGKAGNQSDTTDVIPPVVPVLNFEKSVLSTTADSGTCTFPSTTGTDCHYNPNGQ 1988 Query: 267 IEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPS 326 ++ V +T + T +T K A+Q + + + F Sbjct: 1989 VQYQVSVTNVGDSITNGATIVDELNTINTSDGKAAFQDYSVAIVSAPDPDRFSITGNYQG 2048 Query: 327 LPFQKFIIFLTDGENNNFKSNVNTIKIC 354 L G+ F+ + Sbjct: 2049 TVPLNAAFDLMPGDTVVFEIDGTVAADA 2076 >gi|320586920|gb|EFW99583.1| von willebrand and ring finger domain containing protein [Grosmannia clavigera kw1407] Length = 1087 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 49/168 (29%), Gaps = 21/168 (12%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ +++AL + ++ MGL+ + + T+ + Sbjct: 552 KINLVRDALRFMVHTLGERD------RMGLVTFGSGGGGVPVVGMTTKSWPGWSGVLSSI 605 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + + K + I+ ++D ++ S Sbjct: 606 KPVGQKSHRADVVEGANVAMD----------LLMQRKYNNPIAT--IMLISDASTSDADS 653 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK I I + + + +++ + + V + Sbjct: 654 VDFVVSRAEAAK---ITIHSFGLGMTHKPDTMIELSTRTKASYTYVKD 698 >gi|269215637|ref|ZP_06159491.1| conserved hypothetical protein [Slackia exigua ATCC 700122] gi|269131124|gb|EEZ62199.1| conserved hypothetical protein [Slackia exigua ATCC 700122] Length = 970 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 26/311 (8%), Positives = 82/311 (26%), Gaps = 10/311 (3%) Query: 44 LDWHYKK---NSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRAL-IDDAKRFIK 99 + + + + ++A A + G + +S+ + A + Sbjct: 489 AGAVQVRDGASQLADSLDSAATAAQDGIGQMQGVNGQVKDGLSSLSSSLGDAGGALTSLS 548 Query: 100 NHIKESLSGYSAVFYNTEIQNIVNSSRISM-THMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + + S + + +N ++ ++ + + + + + ++ Sbjct: 549 SDAQAQAAALSGAVSSDASEASLNDAQANISAVLGGDTSNLTPEQLSALQAASSNVASAK 608 Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 + + +S I ++ G E SG++ +N +++ Sbjct: 609 GSLPKTPDTTKLNDDLSQISSVAGQVSGATASIGSESSTPDSGTIMGGVNGVSGGLSALG 668 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK-----NIEPSWGT 273 + + A A ++ + + S G+ Sbjct: 669 SSLSSVDSGLPAASGAARQLSAGAGKVADGLGSASASIGSADDSSSVTLIGGVNQLSAGS 728 Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 ++ + D ++ + A L + N Q + Sbjct: 729 SALKDGLVALRDGGTTPSGKTSAGLTGALAALGDKDTGNTLINGAFQLMGGGDRLVANLG 788 Query: 334 IFLTDGENNNF 344 T+G N Sbjct: 789 DDATEGTVMNG 799 >gi|115622698|ref|XP_001202504.1| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] gi|115631776|ref|XP_785188.2| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] Length = 338 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 35/118 (29%), Gaps = 21/118 (17%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 S +++V R T+ ++ A +L Sbjct: 6 SDNIAAAKRHVNRI---RAGGGTNLYDGLRNAVDLLMEHGNGE----------------A 46 Query: 330 QKFIIFLTDGENNNFKSNVNTIKI--CDKAKENFIKIVTISINASPNGQRLLKTCVSS 385 II LTDG+ + + I + + + ++S + L K +S+ Sbjct: 47 MPLIIMLTDGQPTSGSVKSTSEIIQRITNLIDGRLSLFSVSFGNGVDFSFLEKLSLSN 104 >gi|78066129|ref|YP_368898.1| Flp pilus assembly protein TadG [Burkholderia sp. 383] gi|77966874|gb|ABB08254.1| Flp pilus assembly protein TadG [Burkholderia sp. 383] Length = 164 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 23/69 (33%), Gaps = 7/69 (10%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 R E+ +I FAL + F L++ +I + ++ LA Sbjct: 4 RRVAAGMRRRERGTTAIEFALMLPMFFLILYAIITYGMIFAAQQNL-------TLAATEG 56 Query: 68 MVSNLSRLG 76 + L+ Sbjct: 57 ARAALNYQQ 65 >gi|260841635|ref|XP_002614016.1| hypothetical protein BRAFLDRAFT_67400 [Branchiostoma floridae] gi|229299406|gb|EEN70025.1| hypothetical protein BRAFLDRAFT_67400 [Branchiostoma floridae] Length = 1108 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 34/178 (19%), Gaps = 3/178 (1%) Query: 178 PALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLL 237 P P + + + D + + ++ Sbjct: 347 PPENTTSTYGHPAMDTPYTTGRTTHIPGRLAGDTTYIPAQSASDTPYTNGHPSMDTPYTN 406 Query: 238 FLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA 297 ++ + G V T ++ T+ PA Sbjct: 407 SHPTVGTPYTNSHPAVDTPYTNGD-PTVGTPYTNGHPTVGTPYTNGHPTVGTPYTNGHPA 465 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICD 355 + Y S+ G +P K + + CD Sbjct: 466 VDTPYSNGDPSVDTSYTNGDPATG--TNDMPGPKRKWRADSRCGQRYPAYDGNPAACD 521 >gi|222080975|ref|YP_002540338.1| hypothetical protein Arad_7188 [Agrobacterium radiobacter K84] gi|221725654|gb|ACM28743.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 145 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 12/144 (8%), Positives = 42/144 (29%), Gaps = 1/144 (0%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL-AGA 65 +K S ++ FAL + +FLL++ ++ + +++ A G Sbjct: 1 MIDRLQKLRQSRSGASAVEFALVMPAFLLMLFGIVEFGRLFWTSHALHETAIATARCMGI 60 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ + I+ +A + + + + +++ + Sbjct: 61 PQLECEDGGAYSTSKVITFAETKAAGWFVNLDPASIALDHNASCYGLAGFSQVNLTYQFT 120 Query: 126 RISMTHMANNRLDSSNNTIFYNMD 149 + + + + Sbjct: 121 TALPGLLTSLAGGTGLVAQACYTN 144 >gi|196230799|ref|ZP_03129660.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] gi|196225140|gb|EDY19649.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] Length = 1545 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 35/126 (27%), Gaps = 18/126 (14%) Query: 274 EKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFI 333 +KV + R + T+ A+ AY+ + Sbjct: 1257 DKVHDVIARVNEITPEGGTNLEAALDLAYETAHHHFAVDS----------------TNRV 1300 Query: 334 IFLTDGENNNFKSN-VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYN 391 I TDG N N K + ++ I + I LL+ ++ + Sbjct: 1301 ILFTDGAANLGDVNPDALTKKVEAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGF 1360 Query: 392 VVNADS 397 + + Sbjct: 1361 INTPED 1366 >gi|261342506|ref|ZP_05970364.1| putative tail fiber protein [Enterobacter cancerogenus ATCC 35316] gi|288315147|gb|EFC54085.1| putative tail fiber protein [Enterobacter cancerogenus ATCC 35316] Length = 670 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/305 (9%), Positives = 79/305 (25%), Gaps = 3/305 (0%) Query: 51 NSMESANNAA--ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG 108 + SA ++A +LA K V L + + N + E L+ Sbjct: 177 TQLSSAIDSASEVLAATPKAVKAAYDLAKGKYAAQDATTGQKGIVQLSSATNSVSEVLAA 236 Query: 109 YSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + T + S+ T + + + ++ + Sbjct: 237 TPKAVKTAYDLANAKYTAVDATTAQKGLVQLSSATDSVSEVLAATPKAVKAANDNANGRV 296 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKM 228 + + V+ I + + IF + + + + P Q + Sbjct: 297 PSGRKVNGRALTGDINVTSQDIFNGQ-AAGIGNAEDLNAYTTPGLYYQPANAQAQTGKNY 355 Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 L + + T + + + D+ ++ Sbjct: 356 PEANAGSLEVYKHAGITQIYRIYNNSRSYIRTLYGGTWSAWTKQYDAANKPTAIDVSAIP 415 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 L + + + + S S+ + G +++ G+ +++ Sbjct: 416 LAGSTAVNGVVRNSAEFQSISANSYRMVYGNYGAFWRQDGSNLYLMLTNSGDQYGSYNSL 475 Query: 349 NTIKI 353 + + Sbjct: 476 RPLAV 480 >gi|126666680|ref|ZP_01737657.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] gi|126628725|gb|EAZ99345.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] Length = 1318 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 20/239 (8%), Positives = 57/239 (23%), Gaps = 29/239 (12%) Query: 203 MHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTR 262 + + + + + + + + F +I Y+ L ++ Sbjct: 199 DNYDCYQRKTNTSDSGAAGFSQYLFSSRFEASDSDFAQNILDFGTYLTWQYVSLSWFSNS 258 Query: 263 VEKNIEP-----SWGTEKVRQYVTRDMDSL----------ILKPTDSTPAMKQAYQILTS 307 + + + S T ++ A + Sbjct: 259 SPGQGMLHVAIDDVDAAHKTRLLNKLNTSQFSSSSDTPLRNAGLTPLAGTLESASKYFGG 318 Query: 308 DKKRSFFTNFFRQGVKIPSLP-FQKFIIFLTDGENNNFKSNVNTIKI----------CDK 356 + P+ Q +++ +TDG + KS T + Sbjct: 319 TLSNGEAASGVSTLAPTPNFCGTQDYVVLVTDGLPSVNKSGSKTTDVTSAVDAVADEAAA 378 Query: 357 AKENFIKIVTISINASPNGQR-LLKTCVSSPEY--HYNVVNADSLIHVFQNISQLMVHR 412 ++ + + LL + ++ +L +I + +R Sbjct: 379 LLSQGVRTYVVGFGLPTGVDKSLLNKIAIAGGTKATLFANDSTTLDAALGSIFLNIFNR 437 >gi|115374114|ref|ZP_01461402.1| type IV pilin biogenesis protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310825522|ref|YP_003957880.1| hypothetical protein STAUR_8299 [Stigmatella aurantiaca DW4/3-1] gi|115368890|gb|EAU67837.1| type IV pilin biogenesis protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309398594|gb|ADO76053.1| uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 167 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 16/45 (35%), Gaps = 4/45 (8%) Query: 362 IKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNI 405 + + T+ + N LL+ + Y V + L Q + Sbjct: 119 VNVYTVGLGIDSN---LLRNAANEGGGLSYRVNDVAGLKQAIQEV 160 >gi|111018687|ref|YP_701659.1| hypothetical protein RHA1_ro01688 [Rhodococcus jostii RHA1] gi|110818217|gb|ABG93501.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 904 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 54/207 (26%), Gaps = 35/207 (16%) Query: 233 NALLLFLDSIDLLSHV-KEDVYMGLIGYTTRVEKNIEPSWGTE----KVRQYVTRDMDSL 287 +A L+ ++ ++ + + G++ + + + ++ V R Sbjct: 67 SAANYLLEQLNTFGGSAGVELNVAIAGFSEKFIVHAPWTRLDNGSLPALQGEVERFRTRT 126 Query: 288 ILKPTDSTPAMKQAYQILTSDK-----------KRSFFTNFFRQGVKIPSLPFQKFIIFL 336 TD A+ A + L Q + Sbjct: 127 DGIDTDYWNALDGARRTLAERDGQSEANRCQAVAWFSDGKLDFTVRDAEKPYAQGVSLGS 186 Query: 337 TDGENN----NFKSNVNTIKICDKAKENFIKIVTISINASPNGQ---RLLKTCVS----- 384 G +S I D+ + + I + + A L+++ + Sbjct: 187 DQGVQQVVAAARESICRPAGIADQLRSSGIVTFAVGLAAGTAQPSDFDLMRSIATGGDGA 246 Query: 385 -------SPEYHYNVVNADSLIHVFQN 404 SP Y N D L+ F Sbjct: 247 CGKTTSPSPGDFYLAQNIDDLLFAFDA 273 >gi|268532316|ref|XP_002631286.1| Hypothetical protein CBG03099 [Caenorhabditis briggsae] gi|187036874|emb|CAP23539.1| hypothetical protein CBG_03099 [Caenorhabditis briggsae AF16] Length = 406 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 53/164 (32%), Gaps = 15/164 (9%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVY-----MGLIGYTTRVEKNIEPSWGTEKVRQYVTR 282 + ++N L L SI + VK +G++ Y + T Y Sbjct: 58 LWQVRNTLTQVLGSISTIGPVKYPADPRSTCVGVVTYDDNATTQSQLDASTSFSDLYNVI 117 Query: 283 DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 + + T++ L+ + + + I F+ D + Sbjct: 118 QSSMVAVDSTNN--------SYLSLGLLAAEKAFKDGRSRSYRYNYRKVVIAFVADYQGQ 169 Query: 343 NFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP 386 + I ++ K+N + I+T++ + P Q + S+ Sbjct: 170 GTALDAMP--IANRLKDNTVSIITVACTSDPVKQAAIAAIASTG 211 >gi|167579119|ref|ZP_02371993.1| hypothetical protein BthaT_13320 [Burkholderia thailandensis TXDOH] gi|167617234|ref|ZP_02385865.1| hypothetical protein BthaB_13088 [Burkholderia thailandensis Bt4] gi|257141106|ref|ZP_05589368.1| hypothetical protein BthaA_18164 [Burkholderia thailandensis E264] Length = 168 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 8/129 (6%), Positives = 40/129 (31%), Gaps = 5/129 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 + ++ FA+ ++ +LL+ + Y+ ++ A A + + ++ + + Sbjct: 2 RGAVAVEFAIVMIPLVLLVTGVAEFGRAIYQYEALTKATRDAARYLSMYLPTDPAYPIAQ 61 Query: 79 FESISNHAKRALIDDAKRFIKN-----HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + ++ + + + + ++ N + + Sbjct: 62 AQCLAVYGSTTCGSTGSELAPGLATSMVVVCDAAHSTNCSDASDPAQFANVPTYDTDNGS 121 Query: 134 NNRLDSSNN 142 + + + Sbjct: 122 PDPASLAGS 130 >gi|17536325|ref|NP_495348.1| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|3258584|gb|AAC24429.1| Hypothetical protein T19D12.4a [Caenorhabditis elegans] Length = 1028 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 49/161 (30%), Gaps = 17/161 (10%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + + + ++ +G+I +++ V + + + G D Sbjct: 404 IVPQYVDFAKKLVAQYKYGNDNTRVGIITFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTG 463 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ T A A + ++ + K + LTDG Sbjct: 464 GLTNVTKAQLTAKNLFDTESNANRN----------------KVLFILTDGVPTVDTYTDE 507 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 DK K + + ++ + + VS P+Y + Sbjct: 508 V-AAGDKLKSISVISFFVGYSSYSDEVKTELGKVSEPKYIF 547 >gi|50119741|ref|YP_048908.1| hypothetical protein ECA0796 [Pectobacterium atrosepticum SCRI1043] gi|49610267|emb|CAG73710.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 182 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 11/133 (8%), Positives = 34/133 (25%), Gaps = 4/133 (3%) Query: 10 YFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI----LAGA 65 + ++ S + + A V L+ + L + +A + A+ L Sbjct: 18 HHERNWRSTRGVIATEVAFLVPVVLVGVMMLFELARIGLVIAVGSAALDKAVQTFRLDNL 77 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + + DD + + + G +++ + Sbjct: 78 VTDSAEQMGTRLKARMVEAGYGYLKEDDLTVSVLHFDNLTQLGGLTNGNSSQDNPNGEET 137 Query: 126 RISMTHMANNRLD 138 ++ Sbjct: 138 TTLPVWSVTVQIK 150 >gi|225376650|ref|ZP_03753871.1| hypothetical protein ROSEINA2194_02292 [Roseburia inulinivorans DSM 16841] gi|225211533|gb|EEG93887.1| hypothetical protein ROSEINA2194_02292 [Roseburia inulinivorans DSM 16841] Length = 320 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 4/36 (11%), Positives = 17/36 (47%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDW 46 ++ + K + +++ L ++ + GF++ + Sbjct: 1 MQRLFKTVKGSITVLVTLILVPTIFFTGFMVDLARI 36 >gi|239825644|ref|YP_002948268.1| hypothetical protein GWCH70_0060 [Geobacillus sp. WCH70] gi|239805937|gb|ACS23002.1| conserved hypothetical protein [Geobacillus sp. WCH70] Length = 244 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 33/87 (37%), Gaps = 7/87 (8%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--NASPNGQRLLKT-- 381 + I+ +TDG +N+ + I + AKE I + I + + + L + Sbjct: 2 RKGTLRQILLITDGCSNHGED---PIAMASLAKEQGITVNVIGVLDQDTIDESGLREIEG 58 Query: 382 CVSSPEYHYNVVNADSLIHVFQNISQL 408 S VV A L Q +++ Sbjct: 59 IALSGGGISQVVYAKQLSQTVQMVTRK 85 >gi|115537688|ref|NP_872008.2| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|82654523|gb|ABB88212.1| Hypothetical protein T19D12.4b [Caenorhabditis elegans] Length = 1015 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 49/161 (30%), Gaps = 17/161 (10%) Query: 230 ALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 + + + + ++ +G+I +++ V + + + G D Sbjct: 391 IVPQYVDFAKKLVAQYKYGNDNTRVGIITFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTG 450 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T+ T A A + ++ + K + LTDG Sbjct: 451 GLTNVTKAQLTAKNLFDTESNANRN----------------KVLFILTDGVPTVDTYTDE 494 Query: 350 TIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHY 390 DK K + + ++ + + VS P+Y + Sbjct: 495 V-AAGDKLKSISVISFFVGYSSYSDEVKTELGKVSEPKYIF 534 >gi|17533683|ref|NP_496743.1| C-type LECtin family member (clec-63) [Caenorhabditis elegans] gi|3876684|emb|CAB03058.1| C. elegans protein F35C5.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 411 Score = 38.3 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 22/211 (10%), Positives = 59/211 (27%), Gaps = 23/211 (10%) Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 G + V+++ ++ T++AA + I +GL+ Sbjct: 50 CGGDLTNLWLDVVVVVDNSKGMTNEGITEIAANIVTVFGNGTRIGNQYSDPRSTRLGLVT 109 Query: 259 YTTRVEKNIEPSWGTEKV---RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 Y R + + + + + + A +L S + Sbjct: 110 YNGRSTIVADLNLLQSIDDLYQSVFSTLNQVSNSDDSFLAKGIGAAENVLQSGRTNGV-- 167 Query: 316 NFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 +++ ++ + + + D+ K + + + T++ + + Sbjct: 168 ----------RSNYKRLVVVYASAYKGEGELDPIP--VADRLKSSGVVVSTVAFDQDGDE 215 Query: 376 QRL--LKTCVSSPEYHYNVVN---ADSLIHV 401 L L S P Y + + L Sbjct: 216 ALLAGLTNIAS-PNYAFTSKDLNLVGELQGA 245 >gi|116754083|ref|YP_843201.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116665534|gb|ABK14561.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 789 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 62/188 (32%), Gaps = 29/188 (15%) Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTE 274 AP+ + T + K+ + L S D +G++ + T+ +P Sbjct: 371 APMRIGESTTYLDYEKSLAIELL-----QSPELRDARVGIVVFGTKPYVVSQPVPIRNRA 425 Query: 275 KVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII 334 + + + L T+ ++ A++I+ K + ++ Sbjct: 426 VIEESIKSLQTPLGRDETNLDEGLRLAWKIINESKAEAD-------------------LV 466 Query: 335 FLTDG--ENNNFKSNVNTIKICDKAKENFIKIVTISI-NASPNGQRLLKTCVSSPEYHYN 391 ++DG E + K + + D +E + I + + + + R + + Sbjct: 467 IISDGRIEPDKVKGDQVFLNSVDILREMNATVTLIQVQSYAGSAGRFEELAALTGATFRP 526 Query: 392 VVNADSLI 399 V SL Sbjct: 527 AVYPSSLT 534 >gi|94986631|ref|YP_594564.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] gi|94730880|emb|CAJ54243.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] Length = 530 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 45/136 (33%), Gaps = 29/136 (21%) Query: 278 QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT 337 + +T + T T A+ +L+ +K + +T Sbjct: 422 KRITNSFAAEAHGSTPMTEALWWVLGMLS------------------TRPEHRKIVFVVT 463 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 DG ++ ++ TI + K I+++ I I+A ++ S + + Sbjct: 464 DGYPDDPETAKETIAVA---KRMGIEVLGIGIDAPA----IISMIPGSEN----ITDIRE 512 Query: 398 LIHVFQNISQLMVHRK 413 L + Q ++ K Sbjct: 513 LAPAMFRLLQQIMTEK 528 >gi|118367749|ref|XP_001017084.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89298851|gb|EAR96839.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1085 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 49/161 (30%), Gaps = 26/161 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 QD+ + + A+ + + + Y+ I + + +I + Sbjct: 437 KYFHYYFIQDESGSFSNDHQYAIQGVAQLFNRIKP---NDYITYIKFDSSSHVDIPKTLK 493 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + T+ A + Q + S K + Sbjct: 494 SSLSQGDFISKIQKCRGGGTNFQSAFQTLLQQIQS---------------KYDQQEYP-V 537 Query: 333 IIFLTDGENNNFKSNVNTI--KICDKAKENFIKIVTISINA 371 +IF+TDG++N ++ + +C I TI + Sbjct: 538 VIFITDGQDNTDLDSIISQITSLCQ-----DIVFYTIGYGS 573 >gi|73981421|ref|XP_850355.1| PREDICTED: similar to integrin, alpha 10 precursor [Canis familiaris] Length = 1165 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI--------NASPNGQ 376 + ++ +TDGE+++ + +K C+ + + I++ + Sbjct: 263 GRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFL 319 Query: 377 RLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 R ++ + P+ + +NV + +L + + + Sbjct: 320 REIRMIANDPDERFFFNVTDEAALTDIVDALGDRIFGL 357 >gi|193208855|ref|NP_505147.3| C-type LECtin family member (clec-218) [Caenorhabditis elegans] gi|163644494|gb|AAY43994.2| C-type lectin protein 218 [Caenorhabditis elegans] Length = 389 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 58/178 (32%), Gaps = 21/178 (11%) Query: 233 NALLLFLD-SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 N++ F + K + ++ Y+ + S +T + +L Sbjct: 62 NSVFGFTQIRVGSNYPDKRGARVSVLTYSDSPTVHANLSDFKSTDE--LTSMIYALK-PS 118 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 T ++ + +++ + + +I + G+ ++ Sbjct: 119 TSYDSNLQSSLKLVKNMMNYKDINAPRN---------NTQTVIIIYAGDYVDYDEPTIA- 168 Query: 352 KICDKAKENFIKIVTIS-INASPNGQ----RLLKTCVSSPEYHYNVVN--ADSLIHVF 402 + D+ K + +KI+T++ I+ + + + LK S + ++ + Sbjct: 169 QFGDQLKADGVKIITVADISNTDHQHVSKLKWLKELASEGNGFNINDDYVSEEVQKAL 226 >gi|113866862|ref|YP_725351.1| RND superfamily exporter [Ralstonia eutropha H16] gi|113525638|emb|CAJ91983.1| predicted exporter of the RND superfamily [Ralstonia eutropha H16] Length = 792 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 6/44 (13%), Positives = 18/44 (40%), Gaps = 5/44 (11%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESA-----NNAAILA 63 + ++ +L +G + + Y + ++ A +A +A Sbjct: 672 LTVATLPVMVLAVGIGVDYAFYIYNRLQVQLANGLAFTDALAVA 715 >gi|86157055|ref|YP_463840.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Anaeromyxobacter dehalogenans 2CP-C] gi|85773566|gb|ABC80403.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter dehalogenans 2CP-C] Length = 1722 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 16/173 (9%), Positives = 37/173 (21%), Gaps = 15/173 (8%) Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVT-RDMDSLILKPTDSTPAMKQAYQILTSDK 309 + I Y + + + G +T D D L T + Sbjct: 474 NANQYSICYLCQTSSVVLLTDGEPSPGSGLTPNDGDRLPTGSTTLAD----TTSGKYAGA 529 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA----KENFIKI- 364 + + + + + N++ + K ++ Sbjct: 530 TSTGIQGIASTDCPQCANFSGENAWKDNLAKVAWYMHNMDLRDNAESTWDCKKNGGKQVL 589 Query: 365 --VTISI---NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 T+ L + N +L I + + R Sbjct: 590 DTYTVGFATRQLPDARTILHAAAEAGGGKFVPAENPQTLAEGLTYILEKINER 642 >gi|297722843|ref|NP_001173785.1| Os04g0198300 [Oryza sativa Japonica Group] gi|255675207|dbj|BAH92513.1| Os04g0198300 [Oryza sativa Japonica Group] Length = 1173 Score = 38.3 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 59/204 (28%), Gaps = 27/204 (13%) Query: 231 LKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMD--- 285 K+A + + + + ++ S + R Sbjct: 539 TKHATQTWREKLKNTVKFARNCIHISSVNSSLPTPTFPASQHANSTTARSAPAAADGISC 598 Query: 286 -SLILKPTDSTPAMKQAY-QILTSDKKRSFFTNFFRQGVKIPSLPFQ------KFIIFLT 337 + T++ ++++ LT + F + + Q II ++ Sbjct: 599 KLSKVLLTNAIQPVQKSISDYLTCSNHENLQHAIFLRKAGMLDNRPQEEKDRMGAIIVIS 658 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 D +N+ IC +A I T N N + + S + + + + Sbjct: 659 DNHDNS---------ICMEALSTNYNIHTFGFNGMHNVRAMYNIASRSNGMYDLLNDDRN 709 Query: 398 L-IHVFQNISQLMVHRKYSVILKG 420 L F + + S+I+ G Sbjct: 710 LITEAFISCMNKIT----SIIVLG 729 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 38.3 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 18/160 (11%), Positives = 56/160 (35%), Gaps = 27/160 (16%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + + + QD++ +++ +K + + + LI + R + Sbjct: 106 DASLSMLAQDERPSRLERVKQEVRRL-------RAMSPADRVALIAFAGRSYILTPLTGD 158 Query: 273 TEKVRQYVTRDMDSLILK-PTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + ++ ++ + + + A++Q ++L + + Sbjct: 159 DGALELFLDNLDPGVVGQAGSSLSRAIRQGSELLLASDGS-----------------ADR 201 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINA 371 ++ L+DGE+ + ++ +A I +VT+ Sbjct: 202 ALVLLSDGESFDSAEDI--ESAASEAGSKGISVVTVGFGT 239 >gi|260834079|ref|XP_002612039.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] gi|229297412|gb|EEN68048.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] Length = 794 Score = 38.3 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 68/199 (34%), Gaps = 25/199 (12%) Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 +GS C + I + + A +K L + +++ + + +I Y Sbjct: 182 TGSGACVEFTTQAVDLVFLIARPRSSVSNADVKTFLKSVVAALN---VSQTAARVAVIEY 238 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 T+ + + + T VT ++S+ + S A + LT Sbjct: 239 TSVMHSHFDL--PTHLTNAQVTSAINSIPAWGSTSYRKTGSAIKYLTDYLS--------- 287 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 K ++ +TD +N++ S AK + + ++ + S + L Sbjct: 288 -----WRDGIPKVLVVITDATSNDYVSGP-----AQSAKNAGLILSSVGVGTSISS-TEL 336 Query: 380 KTCVSSPEYHYNVVNADSL 398 T ++ Y Y V + + Sbjct: 337 NTIATNSSYRYTVSSYTDI 355 >gi|84502750|ref|ZP_01000869.1| hypothetical protein OB2597_00960 [Oceanicola batsensis HTCC2597] gi|84389145|gb|EAQ01943.1| hypothetical protein OB2597_00960 [Oceanicola batsensis HTCC2597] Length = 181 Score = 38.3 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 21/55 (38%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 R + + E FA+ + F + + + + +++ +E + A+ Sbjct: 8 LRNRIRHWLGDETGAVVADFAIMLPVFTMFMLSSVEMGLMTFRQTMLERGLDMAV 62 >gi|291230030|ref|XP_002734973.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 454 Score = 38.3 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 64/217 (29%), Gaps = 25/217 (11%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + V+D + SM +N E++++ +M+ ++ AL+ + D Sbjct: 20 DAAVLDLAFVIDCTSSMGSWINEAKENIHNIVDEIVA--KEMSDIRLALVEYRDHP---- 73 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 +T V + ++ + ++ + D A+ Sbjct: 74 ----------PQDSTFVTRVLDFTP---SLKDMQKQMDSMAAHGGGDGPEAVADGLHETF 120 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 + R T P G +KI + E I + + Sbjct: 121 NLNWRPLATKVCVLIADAPPHGLGDSGDGFPKGCPAGHD----PMKIARQMAEQNITLYS 176 Query: 367 ISINASPNG--QRLLKTCVSSPEYHYNVVNADSLIHV 401 + + G + + + ++ +A L V Sbjct: 177 VVCGSYAEGFKDFFMAIAHVTGGQYVSLKDAKLLSKV 213 >gi|302553405|ref|ZP_07305747.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] gi|302471023|gb|EFL34116.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] Length = 152 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 13/133 (9%), Positives = 34/133 (25%), Gaps = 3/133 (2%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 ++ + + + + FL L F+I K+ A A + + Sbjct: 16 DDRGSGAGAVIIFTLVFLSLSAFVIDGGLSISKRERAADIAEQAARYAAQDIDLDALYEN 75 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSG---YSAVFYNTEIQNIVNSSRISMTHMA 133 D + N + ++ +A E++ + S + Sbjct: 76 DGGPAPINFGNCNARVKTFAREMDMTGADIAATHCVAADAQQVEVEVQLTYSPVFTGMFY 135 Query: 134 NNRLDSSNNTIFY 146 + + Sbjct: 136 GGDMVVHGQAVAE 148 >gi|213692491|ref|YP_002323077.1| narrowly conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 15697] gi|213523952|gb|ACJ52699.1| narrowly conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 15697] gi|320458637|dbj|BAJ69258.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis ATCC 15697] Length = 312 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 15/244 (6%), Positives = 52/244 (21%), Gaps = 28/244 (11%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLI------------------------YVLDW 46 + G ++I A+ ++ +L+ + + Sbjct: 5 VRYGRRRSHGAGALIAAVILVVAVLIAAWTLLPRLFTPHETARTAEEAYCAAVSPDGARR 64 Query: 47 HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 + ++A A +A + + + + + D + S Sbjct: 65 TLTPDQAQNAALIANIAVTRGLPDHAATVAIATAMQESRLTNLDYGDLDSLGLFQQRPSQ 124 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLN 166 +A + + + + + Sbjct: 125 GWGTAEQVSDMTYATNIFYDHLLQVPDWETIPVEDAAQEVQRSGYPELYATWDAMARAWA 184 Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVD----LSGSMHCAMNSDPEDVNSAPICQD 222 + + + L + L+ + ++ S+ + +A Sbjct: 185 SGLTGERSADVTCALEPAISSDADGLVANIGGTLPNVNVSVAPPNGKTGTNNGAATNTAS 244 Query: 223 KKRT 226 T Sbjct: 245 TTLT 248 >gi|221056438|ref|XP_002259357.1| protein phosphatase [Plasmodium knowlesi strain H] gi|193809428|emb|CAQ40130.1| protein phosphatase, putative [Plasmodium knowlesi strain H] Length = 977 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 12/239 (5%), Positives = 42/239 (17%), Gaps = 10/239 (4%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 ++ ++ + + A+ + N + + V S Sbjct: 391 TTEPTTDPMNMDSGEGATEEAAENVPDKETPNAADNATENTSDKVVENVEPVGGDASVKS 450 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIE 184 A + + + TS + + S + Sbjct: 451 MEEGSAESAKAPVAENAESTATEATPSTSESMATEASPSTSTPAATEATPSTSASTATEA 510 Query: 185 MGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP--ICQDKKRTKMAALKNALLLFLDSI 242 + + + + ++A T A + Sbjct: 511 IPSTSASTATEATPSTSASTATEATPSTSASTATEATPSTSASTATEATPSTSASTATEA 570 Query: 243 DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + + + + T + + Sbjct: 571 TPSTSASTATEATPSTSASTATEATPSTSASTATEAT--------PSTSTSANEGAQTG 621 >gi|168216419|ref|ZP_02642044.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381279|gb|EDT78758.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 620 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 25/298 (8%), Positives = 68/298 (22%), Gaps = 37/298 (12%) Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI--EMGERPI 190 N + + + + + + S + ++ + I Sbjct: 68 GNYEITLTVKGKPKKVTKPVDILLIMDASNSM-WNDMDALKESMNSLVDKVIDNIPNSRI 126 Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 +I ++ + K + + E Sbjct: 127 AVIAFGTEVEEVFSFNNKNKFTSKEEYKKAIKKSYNNIER-----RGNTNIEGTWRRADE 181 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVRQ--YVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 L + + I S G V + R + D + L Sbjct: 182 IFKNELNNNSNSKKDVIFFSDGYPNVSRDEMFARGYIARDRWNDDEYYKFRDYKDFLDGY 241 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 R N + + + + + K + I +++ Sbjct: 242 YYRINKKNLEYEKLMMSDKAYGNYKSIYEWALYEYGKFYNDYP---------DTNIFSVA 292 Query: 369 I------NASPNGQRLLKTCVSSPEYHYNV----------VNADSLIHVFQNISQLMV 410 + + + LL +S ++ + N+ SL ++ I+ ++ Sbjct: 293 LIDNIPYEDKNSAKELLSKMQNSG--YFTIDSRFGQDNNENNSKSLKEIYDKIANNII 348 >gi|118589698|ref|ZP_01547103.1| hypothetical protein SIAM614_04640 [Stappia aggregata IAM 12614] gi|118437784|gb|EAV44420.1| hypothetical protein SIAM614_04640 [Stappia aggregata IAM 12614] Length = 184 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 31/71 (43%), Gaps = 4/71 (5%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL 62 L SR ++ ++ FAL + F ++ +I V +H+ N M + A++ Sbjct: 8 LTSRQNRGRAAFGRNDSGATAVEFALIAIPFFTVVFGIIEVGLYHFV-NRM---FDNAVI 63 Query: 63 AGASKMVSNLS 73 + ++ + + Sbjct: 64 TASREIRTGQA 74 >gi|317483712|ref|ZP_07942659.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] gi|316925028|gb|EFV46167.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] Length = 1111 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 26/304 (8%), Positives = 67/304 (22%), Gaps = 14/304 (4%) Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 S S + + + K + I+ VN Sbjct: 464 TSHDASTDISYNETTKISIDWQKGTISVTNTGKNSESIQFGYGVEDRHGATDSADITVNV 523 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK-----IVSFIPA 179 + + + L T N+ + + + L + + K + + Sbjct: 524 TATTGAGSIGDDL-LQGATTTENVAMSYNISFVLDKSGSMGSSYSTAKEAVANYIEKLWD 582 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS----APICQDKKRTKMAALKNAL 235 ++ I +++ + + D K + N Sbjct: 583 DIQNTDAIINIQVVKFSSSVGWGDNNTFTLDKSTTYKELQAFLSAHVTNNDKASGNTNYE 642 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDST 295 L + + +E+ + + + + E N E+ R + Sbjct: 643 DALLKAESWFNSQEENGFANRLYFISDGEPNRPYGKPVERAEAVYDRIVGDSAHPVDVHA 702 Query: 296 PAM----KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTI 351 + +L G ++ ++D N +V Sbjct: 703 IGILGNGANDLDVLNKFDNTDGADQIRNAGELYDAIASSTVTKPVSDTIFANKGDDVVFG 762 Query: 352 KICD 355 Sbjct: 763 DTAQ 766 >gi|303238383|ref|ZP_07324918.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302594087|gb|EFL63800.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 1527 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 23/256 (8%), Positives = 63/256 (24%), Gaps = 28/256 (10%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 ++ + +S T M N + M + + I+S Sbjct: 510 KVLPLADSQSFDYTLMPGEEASGVLNVVIPEMPPKADVVFAFDLTGSMGG------ILST 563 Query: 177 IPALLRIEMGERPIFLIELVVDLSGSMHCA--MNSDPEDVNSAPICQDKKRTKMAALKNA 234 + M E +++ + M D C + + Sbjct: 564 AKTKAKTIMTELNKLGVDINYGVVSYMDYLYGSYGSGGDYPYKLNCSMTESM------AS 617 Query: 235 LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-------RQYVTRDMDSL 287 + ++S+ L + + + + + NI G +K+ + + + Sbjct: 618 VSDSINSLKLGNGNDGPESYTRVLFESYADTNIGWRSGAKKIFVNFGDNLPHDNNLNEGV 677 Query: 288 ILKPTDSTPAMKQAYQI-LTSDKKRSFFTNFFRQGVKI------PSLPFQKFIIFLTDGE 340 K + + + ++ T + + K G Sbjct: 678 PGKTGTYNTGIDVGRDVKINTEDDLDLQTVLAQMAANKIELIACQTNGAYKEYWSYWAGL 737 Query: 341 NNNFKSNVNTIKICDK 356 + N+ + + Sbjct: 738 TGGKLYDTNSANMAND 753 >gi|269958192|ref|YP_003327980.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] gi|269306873|gb|ACZ32422.1| TadE family protein [Xylanimonas cellulosilytica DSM 15894] Length = 133 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 13/115 (11%), Positives = 32/115 (27%), Gaps = 10/115 (8%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M +++ + E+ + S+ A+ + + L L+ + + + Sbjct: 1 MRSRRQYKSLTARLR--ERGSVSLEAAVVMPALLALLWLGMQGALMYQGRT--------T 50 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 LA A + + + A+ + S S A Sbjct: 51 ALAAAQEGARVAAGENGTASAGIAAAEDLADAATLGLRGTQVDGSRSATEASVTV 105 >gi|302517857|ref|ZP_07270199.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426752|gb|EFK98567.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 610 Score = 37.9 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 49/200 (24%), Gaps = 24/200 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-------GT 273 + +++M K +LL L + + + + + G Sbjct: 419 PGRGQSRMDVTKASLLQALAQFTSADDIGLWEFSTRLDGDRDYRELVPTDRLGARKGEGV 478 Query: 274 EKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + T AYQ + F P Sbjct: 479 TQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYAADKFNAVVLLTDGTNEDP--- 535 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 G + D A+ + +V I++ G + ++ + Sbjct: 536 -------GSLTRGALLTKLRDLADPAR--PLPLVAIAVGPEAAGDDVKAIGSATGGSGFK 586 Query: 392 VVNADSLIHVFQNISQLMVH 411 V + + V I+ +V Sbjct: 587 VDDPAQIHEV---INNAIVE 603 >gi|332361361|gb|EGJ39165.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1056] Length = 451 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 96/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 A +S + + + + A D KN + Sbjct: 54 AVSSPIEAAGTEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETAT 113 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 114 VDVTKTEVAPAKVDKAPEPITRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 168 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 169 PPTVTKAGSADIVFVVDRSGSMGGTIDIVRANINEFVRNITKEGI---TARFGLATFSDE 225 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 226 VYGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 265 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 266 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 325 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + + K + ++ N +L I + + +Y Sbjct: 326 IERTVATV-----KAIEGIYKNFA-TEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 377 Query: 415 SVILK 419 I+K Sbjct: 378 YKIIK 382 >gi|317133739|ref|YP_004089650.1| hypothetical protein Rumal_3301 [Ruminococcus albus 7] gi|315450201|gb|ADU23764.1| hypothetical protein Rumal_3301 [Ruminococcus albus 7] Length = 168 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 39/149 (26%), Gaps = 7/149 (4%) Query: 9 FYFKKGIASEKANFSII-FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASK 67 K+ N ++ F + + F+ L+ L+ + K + AA A Sbjct: 4 KIIKRFFR--SGNGELLGFVICMPLFVWLLMLLVSIAQLSITKEQITYLAYAAGRAAVVS 61 Query: 68 MVSNLSRLGDRFES----ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 +N + + + +D S ++ Sbjct: 62 ENANDALNNAQAVVDESSNLSGFSNVNVDILFNGEPITESNINSISWQKGEYIVVRLTYT 121 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMT 152 + + + + + N + VM Sbjct: 122 TDSLISGAIPFSDEEIDMNGTREMLMVMM 150 >gi|161524907|ref|YP_001579919.1| TadE family protein [Burkholderia multivorans ATCC 17616] gi|189350343|ref|YP_001945971.1| Flp pilus assembly protein [Burkholderia multivorans ATCC 17616] gi|160342336|gb|ABX15422.1| TadE family protein [Burkholderia multivorans ATCC 17616] gi|189334365|dbj|BAG43435.1| Flp pilus assembly protein [Burkholderia multivorans ATCC 17616] Length = 167 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 35/128 (27%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 ++ +I FA F L+ ++ + S+ A + A + + +R Sbjct: 13 WRRQRGVAAIEFAFVFPLFFLIFYGIVTFAMIFVIQQSLTFAASEGARAALNYASTPCNR 72 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 L + A + S + T ++S + M Sbjct: 73 LSVNAQKAVTQALAGAPWSTNVTFAAQVSVSAPTPKSTAGVTCDSTFASTSSSTFNVMVT 132 Query: 135 NRLDSSNN 142 + N Sbjct: 133 TTYSYAAN 140 >gi|268530438|ref|XP_002630345.1| C. briggsae CBR-CLEC-63 protein [Caenorhabditis briggsae] Length = 414 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 18/198 (9%), Positives = 48/198 (24%), Gaps = 21/198 (10%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVY 253 VD A V N +F + I Sbjct: 48 NVDRECGGDLANLWLDVVVVVDNSKGMTAEGITEVAANIATVFGNGTKIGTQYTDPRSTR 107 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y + + + + + + + A +L + Sbjct: 108 LGLVTYNGQSTVVADLNAIQSIDQLYSTVFSALSSVSTSDDSYLARGIGAAEMVLQQGRN 167 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +++ ++ + + + D+ K + + + T++ + Sbjct: 168 NGI------------RSNYKRLVVVYASAYKGEGELDPIP--VADRLKASGVVVSTVAFD 213 Query: 371 ASPNGQRL--LKTCVSSP 386 + L L S+ Sbjct: 214 QDGDEALLAGLTNIASTN 231 >gi|297171348|gb|ADI22352.1| hypothetical protein [uncultured nuHF2 cluster bacterium HF0500_02A10] Length = 266 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 42/125 (33%), Gaps = 25/125 (20%) Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + T+ PA++ A++ L+ + +I L+DG+ Sbjct: 4 IASAQTNIYPALEMAFEELSEIDAEV------------------RHVILLSDGQTY---- 41 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVF---Q 403 + + ++ I + ++++ + L Y +++A + VF Sbjct: 42 PDKYESLVTRMAKDDISVSSVAVGQESDRALLADIAEWGNGRSYFILDAARVPQVFIQET 101 Query: 404 NISQL 408 I+ Sbjct: 102 QIASQ 106 >gi|289177966|gb|ADC85212.1| Ammonium transporter [Bifidobacterium animalis subsp. lactis BB-12] Length = 558 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 43/192 (22%), Gaps = 11/192 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAGASKMVSN 71 + + N + I + + FL+ G ++ K + AA L+ + Sbjct: 127 RMSQLDSGNAAWILTSACLVFLMTPGVALFYGGMVRAKAVLNMLIMEAAALSVTMIIWGL 186 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 N D A F+ + G + Sbjct: 187 WGWSIAYA---GNSIGGVFGDPATGFLLRDTMVAQDGVFTSASLNSNNYPASVDVAFQAA 243 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH-------LLNQRYNQKIVSFIPALLRIE 184 A + I + T + +I ++ I L Sbjct: 244 FAMITVALICGAIAERVKYSTWMIFVALWITFDYAPMAHMVWNGGLLSADGAISQALGAA 303 Query: 185 MGERPIFLIELV 196 + + + Sbjct: 304 AHDFAGGTVVHI 315 >gi|296124352|ref|YP_003632130.1| TadE family protein [Planctomyces limnophilus DSM 3776] gi|296016692|gb|ADG69931.1| TadE family protein [Planctomyces limnophilus DSM 3776] Length = 145 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 6/41 (14%), Positives = 19/41 (46%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 S + S+ +L++ + + V + ++++E+A Sbjct: 19 SRRGALSVELSLTIPVLFAIFFGAVEVTRLNMLRHTIENAA 59 >gi|240255600|ref|NP_680085.3| von Willebrand factor A domain-containing protein 1 precursor [Mus musculus] gi|81901574|sp|Q8R2Z5|VWA1_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 1; AltName: Full=von Willebrand factor A domain-related protein; Flags: Precursor gi|20071274|gb|AAH26919.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|26346104|dbj|BAC36703.1| unnamed protein product [Mus musculus] gi|54611403|gb|AAH36166.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|123295149|emb|CAM18384.1| von Willebrand factor A domain containing 1 [Mus musculus] gi|148683083|gb|EDL15030.1| von Willebrand factor A domain containing 1 [Mus musculus] Length = 415 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 50/157 (31%), Gaps = 19/157 (12%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ ++ + + + T++ A+ A + L +++ Sbjct: 74 SLVHVGSQPHTEFTFDQYSSGQAIQDAIRVAPQRMGDTNTGLALAYAKEQLFAEEAG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + I +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTIFIVSTGRGNL 175 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L + ++ + V + D L + + + + Sbjct: 176 LELLAAASAPAEKHLHFV-DVDDLPIIARELRGSITD 211 >gi|189462676|ref|ZP_03011461.1| hypothetical protein BACCOP_03373 [Bacteroides coprocola DSM 17136] gi|189430837|gb|EDU99821.1| hypothetical protein BACCOP_03373 [Bacteroides coprocola DSM 17136] Length = 546 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 59/236 (25%), Gaps = 6/236 (2%) Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 +AA +A +S +++ L+ SI N + + F + V Sbjct: 190 LDAAAVAASSYILTGLTPKTKYSVSIYNGDVKRGT---RTFTTTESYPAGYDIVNVSDAD 246 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNN-TIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVS 175 + +I + + N + N + M F + Sbjct: 247 MLNDIFANPANYIQDNGGNVVLVFANGSTTDYMGESMELTIPADFKSVIFWGESGDTKPV 306 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 F+P L + I +DL + +V ++ + + Sbjct: 307 FMPKGLSMAGSHDLIRF--YNLDLQNTSSANDYIVNFNVEGTVGEILIDNCNISKTRGVV 364 Query: 236 LLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 + D L + + S T +++L Sbjct: 365 RVQSDGAKGSIGSINIDNCVLTDIGSYGVLQTKVSGFTLNSVSLSNSTVNTLSAGG 420 >gi|148253064|ref|YP_001237649.1| hypothetical protein BBta_1526 [Bradyrhizobium sp. BTAi1] gi|146405237|gb|ABQ33743.1| hypothetical protein BBta_1526 [Bradyrhizobium sp. BTAi1] Length = 385 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 39/381 (10%), Positives = 94/381 (24%), Gaps = 38/381 (9%) Query: 33 FLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL---GDRFESISNHAKRA 89 + + + + YK M+SA +AA A+ + + + + Sbjct: 1 MIGMAALAVDIGFAIYKHRQMQSAADAAAFGAATALATGHPADFRTEAYALAAAVGYVNG 60 Query: 90 LIDDAKRFI-KNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNM 148 + + + ++A ++ + I + + + + Sbjct: 61 VDGVVITVNHPPVSPPASAAHAADPTAVQVIITQPQTLILVGLFGSGIFKLGVSAVALQG 120 Query: 149 DVMTSYDYRLQFIEHLLNQRYNQKIVSF--IPALLRIEMGERPIFLIELVVDLSGSMHCA 206 + + L N V+ + A + +S + Sbjct: 121 NAGLYCVFALDPTASAAVYLNNNAAVNACGVGANSNSNSALTLNNNALITGPVSVVGKWS 180 Query: 207 MNSD------PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--VYMGLIG 258 + ++ P+ N+A + + S + + V++ + Sbjct: 181 LANNARLTGTPQKQNAAAMTDPYAGVPLQPPPGPCTNPPASASTVHILNPTGEVHLCRLN 240 Query: 259 YTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPA---------MKQAYQILTSDK 309 Y+ V N++P + + ++ T + TS + Sbjct: 241 YSNNVTLNLDPGAYYIDTQLSLQNNVTVNGTGVTLIINGNYAMSLGNNLTMNLTAPTSGE 300 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK-----SNVNTIKI--CDKAKENFI 362 R + F I +G N T+ C + I Sbjct: 301 YTGLVFASPRTATSTSTQRFSNNTILNLNGAVYFPNQILQFDNNATLNARTCGQLIARKI 360 Query: 363 KIVTISINASPNGQRLLKTCV 383 I N L C Sbjct: 361 AIY--------NNATLGSQCA 373 >gi|72091964|ref|XP_795239.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115929603|ref|XP_001180674.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 729 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 69/229 (30%), Gaps = 45/229 (19%) Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + + + + + + I LV+D S SM Sbjct: 142 TDFMNGNSPSLSEDTDTSPNFIV-VQPAGSLRIVLVLDTSFSMDGP-------------- 186 Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV-RQY 279 + + F+ SI V + Y+ ++ + + N + + R+ Sbjct: 187 ------RFNKMIRGAKNFIQSI-----VPNNSYVAIVEFNSVSRVNSHMIELSSVISRKN 235 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 + + ++ T + A ++ + S ++I L+DG Sbjct: 236 LASLLPTVAENNTCIGCGILTAIEVAQYNGLDSRGV----------------YLILLSDG 279 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY 388 + N ++ D + + + + +I++ + Q L + + Sbjct: 280 QENIYEDITLVADTLDDIESSGVTVHSIAL-YGADTQ-LEELAQMTGGR 326 >gi|88812999|ref|ZP_01128242.1| type 4 fimbrial biogenesis protein PilY1 [Nitrococcus mobilis Nb-231] gi|88789777|gb|EAR20901.1| type 4 fimbrial biogenesis protein PilY1 [Nitrococcus mobilis Nb-231] Length = 1179 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 47/222 (21%), Gaps = 9/222 (4%) Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 L L + + P N+ + + D Sbjct: 262 PLRRALQRAGVYFKTNQPYYNNAVDSGDGSASCRQNYQVLMTDGLWNGGLSNGPGNYDNS 321 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVT---RDMDSLILKPTDSTPAMKQAYQILTSDKK 310 + T V + D L ++Q + Sbjct: 322 EQTLPDGTSYNPQAPFQDDNSNVLADLAFKYWAEDLSTLANDVPPYMLQQGASASANYWN 381 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 + + LTD + + D +K I+ N Sbjct: 382 A--QNDPANWQHLVNFTVGLGLGASLTDPVWGGSTFTGDYPNLADGSKNWP----AITSN 435 Query: 371 ASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L ++S ++ + +L F+ I + R Sbjct: 436 TPSTVSDLWHAAINSRGQFFSADDPKALAAAFKTILNRVADR 477 >gi|297571772|ref|YP_003697546.1| hypothetical protein Arch_1222 [Arcanobacterium haemolyticum DSM 20595] gi|296932119|gb|ADH92927.1| hypothetical protein Arch_1222 [Arcanobacterium haemolyticum DSM 20595] Length = 138 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 39/142 (27%), Gaps = 8/142 (5%) Query: 14 GIASEKANFSIIFA---LSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 E+ N I+ + V++ LLIG H ++ + + ++A LA A ++ Sbjct: 1 MKTKEEGNLIILGLGIWVFVIALALLIGSA---AHLHNERKELLAHADSAALALAQRIDD 57 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 + A + +G + + + Sbjct: 58 ARYYATGNIDYAPPDLTEHA--HALAERAHATIHEPTGVKGENVVITLCQTTSIPMLPTF 115 Query: 131 HMANNRLDSSNNTIFYNMDVMT 152 + ++ + T Sbjct: 116 LNFASNVEMCATSFARLRITPT 137 >gi|239980164|ref|ZP_04702688.1| hypothetical protein SalbJ_12035 [Streptomyces albus J1074] gi|291452018|ref|ZP_06591408.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291354967|gb|EFE81869.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 153 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 17/135 (12%), Positives = 34/135 (25%), Gaps = 5/135 (3%) Query: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL--SRLG 76 ++I + + FL L F+I K+ A A + G Sbjct: 21 SGAGAVI--IFALVFLTLAAFVIDGGLSISKRERAADIAEQAARYAAQDIDREALYEDEG 78 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSG-YSAVFYNTEIQNIVNSSRISMTHMANN 135 + + A N + SG +A E++ + + Sbjct: 79 GPAPINYANCGARVRQFAAEMGMNGADIAASGCVAATVDQVEVEIQLTYDPVFTGMFYGG 138 Query: 136 RLDSSNNTIFYNMDV 150 + + N Sbjct: 139 AVTVKGRSTAENAVG 153 >gi|152984932|ref|YP_001350526.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] gi|150960090|gb|ABR82115.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] Length = 1155 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 20/258 (7%), Positives = 53/258 (20%), Gaps = 23/258 (8%) Query: 159 QFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 F +N + + ++G + Sbjct: 250 DFNGQHRVNFFNWLENLSVEGGTPLRRAMTRAGDFLRRTGVNGPYAYR--PGEQTAPEYS 307 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLS---HVKEDVYMGLIGYTTRVEKNIEPS---WG 272 L N+ + + D S + + + W Sbjct: 308 CRGSYHILMTDGLWNSDSANVGNADSNSRTLPDGQHYSSQAPYKDNASNTLADLAFHYWA 367 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 + + + A + + + K+ Sbjct: 368 NDARPDIEDNLNPFITYPDKANPSAEYWNPRNDPATWQHMVTYTLGLGLTTTL--ISPKW 425 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNV 392 G + ++ +++++ N L V+S ++ Sbjct: 426 EGSTYAGGYQDIQAGRLNWPSA-------------AVDSANNVYDLWHAAVNSRGEFFSA 472 Query: 393 VNADSLIHVFQNISQLMV 410 + D L+ FQ I + Sbjct: 473 DSPDQLVAAFQAIVNRIS 490 >gi|163747461|ref|ZP_02154813.1| hypothetical protein OIHEL45_00425 [Oceanibulbus indolifex HEL-45] gi|161379314|gb|EDQ03731.1| hypothetical protein OIHEL45_00425 [Oceanibulbus indolifex HEL-45] Length = 182 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 3/48 (6%), Positives = 12/48 (25%), Gaps = 3/48 (6%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFL---LLIGFLIYVLDWHY 48 + + S+ + +I + + L + + Sbjct: 1 MRVLSSLLTRFKRSDDGSIAIETVIMLPLMFWAYLAMYSTFDTFRMYN 48 >gi|309364362|emb|CAP25016.2| CBR-CLEC-63 protein [Caenorhabditis briggsae AF16] Length = 415 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 18/198 (9%), Positives = 48/198 (24%), Gaps = 21/198 (10%) Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD--SIDLLSHVKEDVY 253 VD A V N +F + I Sbjct: 48 NVDRECGGDLANLWLDVVVVVDNSKGMTAEGITEVAANIATVFGNGTKIGTQYTDPRSTR 107 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKK 310 +GL+ Y + + + + + + + A +L + Sbjct: 108 LGLVTYNGQSTVVADLNAIQSIDQLYSTVFSALSSVSTSDDSYLARGIGAAEMVLQQGRN 167 Query: 311 RSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN 370 +++ ++ + + + D+ K + + + T++ + Sbjct: 168 NGI------------RSNYKRLVVVYASAYKGEGELDPIP--VADRLKASGVVVSTVAFD 213 Query: 371 ASPNGQRL--LKTCVSSP 386 + L L S+ Sbjct: 214 QDGDEALLAGLTNIASTN 231 >gi|254000294|ref|YP_003052357.1| Pyrrolo-quinoline quinone [Methylovorus sp. SIP3-4] gi|253986973|gb|ACT51830.1| Pyrrolo-quinoline quinone [Methylovorus sp. SIP3-4] Length = 1229 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 17/230 (7%), Positives = 46/230 (20%), Gaps = 25/230 (10%) Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + S+ + V + + L + +G Sbjct: 28 TDIASAPISNNSGTVVKPNVMFVLDDSGSMDFNYLPD------------NVGNSNSAGTI 75 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH------- 247 S +D + D + ++ Sbjct: 76 YNCRRMSSGSTECAGGRDDGAAMTEGGDPPFYSFYHNATYYNPNITYTPPVNPCDPTSNL 135 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + + + + + + GT ++ + T + Sbjct: 136 PSQTTWTAVRTNGHTMNTSCGQTSGTVNIQTAFPERVYCNSNGTTCKRNGIDY------R 189 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 +F P + T +N ++ I+ C +A Sbjct: 190 SSGSYDDYDFPDNDRTNPYTSTTGTYRYPTTINSNPHYYSITPIEYCTRA 239 >gi|153008053|ref|YP_001369268.1| hypothetical protein Oant_0717 [Ochrobactrum anthropi ATCC 49188] gi|151559941|gb|ABS13439.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 182 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 11/71 (15%), Positives = 26/71 (36%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R +K + + ++ FAL LL+ + + D + + ++ A Sbjct: 1 MRTCLRKFLNDRRGLGAVEFALIAPVLLLIYLGSVDLADGVDTNKKVSRSASSLADLVAR 60 Query: 67 KMVSNLSRLGD 77 ++ + L D Sbjct: 61 QLSVTKNDLND 71 >gi|73749041|ref|YP_308280.1| hypothetical protein cbdb_A1307 [Dehalococcoides sp. CBDB1] gi|147669807|ref|YP_001214625.1| hypothetical protein DehaBAV1_1168 [Dehalococcoides sp. BAV1] gi|73660757|emb|CAI83364.1| hypothetical protein cbdbA1307 [Dehalococcoides sp. CBDB1] gi|146270755|gb|ABQ17747.1| hypothetical protein DehaBAV1_1168 [Dehalococcoides sp. BAV1] Length = 535 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 70/244 (28%), Gaps = 23/244 (9%) Query: 11 FKKGIASEKAN---FSIIF----ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 KK I EK ++IF A+ + L+LIG + + +++A + Sbjct: 4 IKKLIKGEKGASLAIALIFLAIGAIMIPPLLMLIGSGLKQGTTFEDRTGAIYSSDAGV-- 61 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + ++ N D + + + + S + +++ V Sbjct: 62 -EWVINILKTGGEGVTDAYGNIGLPNEQDSIRIYNLSDLNGSTVEVTLTYHDVGSYYNVI 120 Query: 124 SSRISMTHMANNRLD----SSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQK------- 172 S+ + ++F N D +L +++ N Sbjct: 121 STASLNGKSITTQATLKYQLGGGSVFDNAITSLDGDVKLTGSSSVISDPRNPPGAMVYSG 180 Query: 173 --IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAA 230 ++ + + + + ++G + + S + Sbjct: 181 GELILTGSSFIEGNVYADDGIDLGWSTPITGDAVTPADINRPGNISGTVTTGAAPQLPNV 240 Query: 231 LKNA 234 L +A Sbjct: 241 LTDA 244 >gi|157283965|ref|YP_001468233.1| TadE family protein [Kineococcus radiotolerans SRS30216] gi|151363107|gb|ABS06109.1| TadE family protein [Kineococcus radiotolerans SRS30216] Length = 131 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 12/122 (9%), Positives = 37/122 (30%), Gaps = 2/122 (1%) Query: 24 IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESIS 83 + L + L+G + ++ + A AGA ++ + +R + Sbjct: 1 MWVVLCLPMLFALLGLIFDGGAAIAGAQQADNVADQAARAGADQLDVSAARSEQALFGGA 60 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 + A+ + A ++ + + +A + + ++L + Sbjct: 61 DTARASRA--ACEYVSVALPGASCTVTANADTVTVAVAATVPAQFFGVIGIHQLSIDGSG 118 Query: 144 IF 145 Sbjct: 119 SA 120 >gi|157375506|ref|YP_001474106.1| hypothetical protein Ssed_2369 [Shewanella sediminis HAW-EB3] gi|157317880|gb|ABV36978.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 165 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 51/171 (29%), Gaps = 13/171 (7%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 + + +K +I F + + LL++ + Y+ + + ++MV + Sbjct: 1 MRMVK-QKGVAAIEFTIMLPVLLLMLFATAEIGRAIYQYSHL------------TRMVRD 47 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 SR A D + + + + + V + + Sbjct: 48 ASRYLSVTAIPDTSGSLAASFDDNCDLGDDCNLDCKNCISETKDLLVYGKVGGTVPLLYG 107 Query: 132 MANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLR 182 ++ + + S + + V Y++ F + + I + Sbjct: 108 LSTSDVIISGSPATSIVTVSVDYNWHPLFSDRISGFGIGDGIDLSFNFSVS 158 >gi|297560200|ref|YP_003679174.1| hypothetical protein Ndas_1229 [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296844648|gb|ADH66668.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 133 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 35/131 (26%), Gaps = 12/131 (9%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSR 74 ++ + A+ +FLL +G + +N AA LA A V L Sbjct: 5 DDRGQTTAFVAILASAFLLCLGLVTDGGGLMRARN------EAATLAQEAARAGVQQLDW 58 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 R + + A + N + + + T ++ S + Sbjct: 59 SSYREGTNEVALDPSAASAAAQSFLNAAGATGTVAVSGDTVTVTCSVDYSFT----LLPM 114 Query: 135 NRLDSSNNTIF 145 Sbjct: 115 GSTTVDATATA 125 >gi|319941895|ref|ZP_08016216.1| hypothetical protein HMPREF9464_01435 [Sutterella wadsworthensis 3_1_45B] gi|319804548|gb|EFW01418.1| hypothetical protein HMPREF9464_01435 [Sutterella wadsworthensis 3_1_45B] Length = 168 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 36/138 (26%), Gaps = 16/138 (11%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 FR F + S + FAL + + + A +A L+ A Sbjct: 7 FRSAFSAFLRSRLGAAGVEFALLLP--MAAFVLAV--------------ALESARLSIAY 50 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ G ++ A+ + A S F ++ + + Sbjct: 51 ALIERAVEEGIHEAKLNRGAEAETLVKAALEKWRFGVFDPSDLKLTFTSSSSMIDLLAGA 110 Query: 127 ISMTHMANNRLDSSNNTI 144 + + + Sbjct: 111 NPGAGTGGDSVHLKVDAR 128 >gi|212715270|ref|ZP_03323398.1| hypothetical protein BIFCAT_00162 [Bifidobacterium catenulatum DSM 16992] gi|212661951|gb|EEB22526.1| hypothetical protein BIFCAT_00162 [Bifidobacterium catenulatum DSM 16992] Length = 512 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 42/187 (22%), Gaps = 11/187 (5%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM-ESANNAAILAGASKMVSNLSRLG 76 + N + I + + FL+ G + K + AA L+ + Sbjct: 86 DSGNAAWILTSASLVFLMTPGVAFFYGGMVRAKAVLNMMIMEAAALSVTMVIWVLWGWSI 145 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 D A F+ S G V+ + A Sbjct: 146 AYA---GTSVGGVFGDPATGFLLKDSMVSDGGVFTSASLNSNNYPVSVDVAFQSAFAMIT 202 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEH-------LLNQRYNQKIVSFIPALLRIEMGERP 189 + I + T + +I ++ I + + Sbjct: 203 VALICGAIAERVKYSTWMIFVALWITFDYAPLAHMVWNGGLLSADGAISQAIGAAAHDFA 262 Query: 190 IFLIELV 196 + + Sbjct: 263 GGTVVHI 269 >gi|73539238|ref|YP_299605.1| transmembrane protein [Ralstonia eutropha JMP134] gi|72122575|gb|AAZ64761.1| probable transmembrane protein [Ralstonia eutropha JMP134] Length = 563 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 10/84 (11%), Positives = 27/84 (32%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A E+ S + + + + I V ++ +++ + + L+GA ++ S Sbjct: 10 ARERGAVSTVAVILTATVAFVALLSIDVGHVFMRQRQLQNMVDLSALSGAQQLKLATSAA 69 Query: 76 GDRFESISNHAKRALIDDAKRFIK 99 + + I Sbjct: 70 LQSAGVLGTVQTIGSQNGYPSGIA 93 >gi|319782170|ref|YP_004141646.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168058|gb|ADV11596.1| TadE family protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 179 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 12/121 (9%), Positives = 28/121 (23%), Gaps = 1/121 (0%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN-NAAI 61 +++ + + + + AL L+L + + + K ME+ + A Sbjct: 2 MIATLGHHLNRFQRDGRGTALVEMALIAPLMLMLSAGVFEFGNLIHDKLLMEAGLTDGAR 61 Query: 62 LAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 A + A+ + T N Sbjct: 62 FAARCNSKLYTDAGLAAIDCAGVATNIAVFGKTAVGVSGVPDVPRISGWRTADVTVTMNN 121 Query: 122 V 122 Sbjct: 122 T 122 >gi|296444402|ref|ZP_06886367.1| TadE family protein [Methylosinus trichosporium OB3b] gi|296258049|gb|EFH05111.1| TadE family protein [Methylosinus trichosporium OB3b] Length = 207 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 40/145 (27%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 S+ FA+ + L LIG + V + + ++ A A A + + N++ Sbjct: 27 MSVEFAIVSVPLLGLIGAIFEVGLVYMRGEQLQIATQNASRAMLTNSIGNMTYQNFINNY 86 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 + + + L S+ T + + + ++ Sbjct: 87 VCSWQTTGTVAPGTLDRSFDCSRLLVDVSSPSSWTGAATSNSFYTAPNALGSTITMPAAG 146 Query: 142 NTIFYNMDVMTSYDYRLQFIEHLLN 166 + + + L Sbjct: 147 SIAVVRIVYPMPMMTAILTGGVLTG 171 >gi|1246302|gb|AAA93502.1| pilY1 [Pseudomonas aeruginosa PAO1] Length = 1161 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 18/41 (43%) Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMV 410 N S N L V+S ++ + D L+ FQ+I + Sbjct: 490 NDSNNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 530 >gi|320168829|gb|EFW45728.1| minichromosome maintenance complex component 2 [Capsaspora owczarzaki ATCC 30864] Length = 917 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 44/148 (29%), Gaps = 5/148 (3%) Query: 49 KKNSMESANNAA-ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLS 107 + ++SA +AA LA A + + + AK+ + + ++ + + + Sbjct: 702 SRAQLQSALDAANALAPAVDEDKD-AIPQAMLKKYIIFAKQNVRPKLRDVDEDKLAKLYA 760 Query: 108 GYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 T I SM M+ DV + L+ ++ Sbjct: 761 DLRRESMTTGSIPITVRFVESMIRMSEAHARMHLREFVNEDDVNMAVRVALE---SFIST 817 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIEL 195 + + + + + + + Sbjct: 818 QKFSVMKTLSKSFSKYITYRKDNNELLF 845 >gi|323525744|ref|YP_004227897.1| TadE family protein [Burkholderia sp. CCGE1001] gi|323382746|gb|ADX54837.1| TadE family protein [Burkholderia sp. CCGE1001] Length = 171 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 14/148 (9%), Positives = 42/148 (28%), Gaps = 4/148 (2%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASKMV 69 F++ S++ +I FAL F + ++ + S+ A+ A + Sbjct: 23 FRRARRSQRGATAIEFALVFPLFFCVFYAIVTFSLIFVAQQSLTLASEEGARAALNYQKA 82 Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY---NTEIQNIVNSSR 126 ++++ + + + +++ + Y T + Sbjct: 83 NDVTAAINLRAAAACTTATSMVAQLISTAACQTVPAACSYQPAMTCLAVTLTYDYAAHPL 142 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSY 154 I + L + + + Sbjct: 143 IPNLPLLGLVLPDNLTSTATVQLNPVNI 170 >gi|149059384|gb|EDM10391.1| integrin alpha 1, isoform CRA_a [Rattus norvegicus] Length = 934 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 44/98 (44%), Gaps = 13/98 (13%) Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL------ 378 +K ++ +TDGE+++ I+ C+ + I+ +I+I N L Sbjct: 20 ARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFV 76 Query: 379 --LKTCVS--SPEYHYNVVNADSLIHVFQNISQLMVHR 412 +K+ S + ++ +NV + +L+ + + + + + Sbjct: 77 EEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFAL 114 >gi|226196263|ref|ZP_03791846.1| hemagluttinin family protein [Burkholderia pseudomallei Pakistan 9] gi|225931661|gb|EEH27665.1| hemagluttinin family protein [Burkholderia pseudomallei Pakistan 9] Length = 1148 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 28/319 (8%), Positives = 69/319 (21%), Gaps = 13/319 (4%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALID----DAKRFIKNHIKESLSGYS 110 + + A +A +S+ ++ + S +N + + + Sbjct: 469 QSTDRAAVAVSSETINGTTYNYAGVASPANGVVSIGGVGTERQLINVAAGQVSATSTDAI 528 Query: 111 AVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD----VMTSYDYRLQFIEHLLN 166 V + + + + YN SY + Sbjct: 529 NGSQLYATNQAVIAEDAKVNSLGGGVASALGGNAAYNATTGAITAPSYAVYGTTQNSVGG 588 Query: 167 QRYNQKIVSFIPALLRIEMGE-RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + ++ + + + + Sbjct: 589 AIDALQALAPLQYTSGPGVTTPNAPGSAPTNTVTLVGAGGPGANTTPVTLTNVAPGKLSA 648 Query: 226 TKMAALKNA-LLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDM 284 T A+ + L + L + +G + Y+ + V Sbjct: 649 TSTDAVNGSQLYATNQQVANLVSSVNNGGVGPVQYSDPSAPTTPNGGKPSQDLTLVGAAS 708 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + L A A + + K ++ + ++ DG +N Sbjct: 709 GPVALHNVAPGTASTDAVNVGQLGAVTTGLGGGAAIDPKTGAVTAPSYTVYNADGTTSNV 768 Query: 345 KSNVNTIKICDKAKENFIK 363 N D IK Sbjct: 769 ---GNVGAAIDAINSTGIK 784 >gi|254440642|ref|ZP_05054135.1| hypothetical protein OA307_57 [Octadecabacter antarcticus 307] gi|198250720|gb|EDY75035.1| hypothetical protein OA307_57 [Octadecabacter antarcticus 307] Length = 198 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 3/39 (7%), Positives = 11/39 (28%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLI 41 +L R + ++ E+ + + + Sbjct: 1 MLKRIKSTVRRFRREEEGTVVVEAIIMFPVLFATVLATF 39 >gi|261252527|ref|ZP_05945100.1| methyl-accepting chemotaxis protein [Vibrio orientalis CIP 102891] gi|260935918|gb|EEX91907.1| methyl-accepting chemotaxis protein [Vibrio orientalis CIP 102891] Length = 578 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 23/253 (9%), Positives = 70/253 (27%), Gaps = 8/253 (3%) Query: 45 DWHYKKNSMESANNAAILAGASKMVSNL-SRLGDRFESISNHAKRALIDDAKRFIKNHIK 103 + A ++A+ A + SN+ + E+ + I + Sbjct: 297 HMFVDVSRSSQAIDSAVHNIAEQSASNVKTLDQHTLETEQAITAIEELSATAASISDSAD 356 Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + + E ++ ++ + N++ + + + LQ I Sbjct: 357 NAAKLTESTNQFAEQSKQTVNNAVTSVNALVNQVSAMSEVTSTMSEDTKQISSVLQVIGE 416 Query: 164 LLNQRYNQKIVSFIPAL-------LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 + Q + + I A + + L D + ++ + ++ Sbjct: 417 IAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRALAARTQDSTSQINEMLAKLRSTTDN 476 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 D R + +DS++ +++ ++ T + + + Sbjct: 477 VVNEMDATRHSCEQTASNTNQVMDSLNQVTNSVVEMNDLNALIATSAMQQSQVTGEVSTN 536 Query: 277 RQYVTRDMDSLIL 289 + + L Sbjct: 537 MAAIQEIVRLLNN 549 >gi|303238434|ref|ZP_07324969.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302594138|gb|EFL63851.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 306 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 14/137 (10%), Positives = 33/137 (24%), Gaps = 12/137 (8%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 I S K +++ + +I + YKK A K + Sbjct: 7 IRSRKGTTLVLYVIMFSVIAACASIVIDIGVVAYKK------------ADTVKATDAAAL 54 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + ++ I+ + S + + Sbjct: 55 ASAQSLFYGDENPVVVATEYLEKNGVDPDDAQINILDDGKGISIKTKIEVSYLFAKLLGF 114 Query: 135 NRLDSSNNTIFYNMDVM 151 + + N M V Sbjct: 115 DSGEIKANATAKVMGVT 131 >gi|197335351|ref|YP_002156064.1| Na+-driven multidrug efflux pump [Vibrio fischeri MJ11] gi|197316841|gb|ACH66288.1| Na+-driven multidrug efflux pump [Vibrio fischeri MJ11] Length = 447 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 45/144 (31%), Gaps = 7/144 (4%) Query: 20 ANFS-IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL--- 75 S IIF L ++S ++ G I + + + + + A+ + + L Sbjct: 51 GVISQIIFFLMIISMMVSTGTGILIGQ-YNGSGRTQQSTDVAVASVVLGGILGLGLSLFA 109 Query: 76 --GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 I + + A ++ +L+ + ++T +++ S + ++ Sbjct: 110 FFAANPIVIMYGLESDVATFAYDYLIITGSFTLTVTLGIVFSTILRSNGYSRTPMIVNLL 169 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYR 157 L+ N Sbjct: 170 TGGLNIFGNYCALYQPFGLPVYGV 193 >gi|251778756|ref|ZP_04821676.1| surface protein PspC [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083071|gb|EES48961.1| putative cell wall binding repeat protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 604 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 20/294 (6%), Positives = 68/294 (23%), Gaps = 19/294 (6%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ ++ + DA + + S + + + Sbjct: 282 AQASDDIDGAKYAKTVANYAITNEKGVDATAELAD---MSFNIVDGKLVAYKTTESNGKT 338 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL----------LNQRYNQKIVS 175 +++ + + T N D + + + ++ + V Sbjct: 339 NVTVKTVKFSTERGYTYTEIENADDQDATAFDIDVDGNMWALDGGFVYKFDNADEWTKVY 398 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN-- 233 + + ++ + + + + + +K Sbjct: 399 KVDGSMDQMSVYNKDNMVVWNENDEVYSVIGSKDESGETEDSQGWVQAEDGSWTFVKEDG 458 Query: 234 --ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 A D+ M + P + D+ Sbjct: 459 TKATGWLNDNGTWFYLNSNGAMMTGWQNVNGTWYYLNPVSDGTRGAMKTGWINDNGTWYY 518 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDGENNN 343 T+++ AM+ +Q + + + G + + + + G N+ Sbjct: 519 TNASGAMQTGWQNVNGTWYFMQGSGAMQTGWLNDNGTWYYLQASGAMKTGWLND 572 >gi|225351301|ref|ZP_03742324.1| hypothetical protein BIFPSEUDO_02893 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225158757|gb|EEG71999.1| hypothetical protein BIFPSEUDO_02893 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 476 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 42/187 (22%), Gaps = 11/187 (5%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM-ESANNAAILAGASKMVSNLSRLG 76 + N + I + + FL+ G + K + AA L+ + Sbjct: 50 DSGNAAWILTSASLVFLMTPGVAFFYGGMVRAKAVLNMMIMEAAALSVTMVIWVLWGWSI 109 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 D A F+ S G V+ + A Sbjct: 110 AYA---GTSVGGVFGDPATGFLLKDSMVSDGGVFTSASLNSNNYPVSVDVAFQSAFAMIT 166 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEH-------LLNQRYNQKIVSFIPALLRIEMGERP 189 + I + T + +I ++ I + + Sbjct: 167 VALICGAIAERVKYSTWMIFVALWITFDYAPLAHMVWNGGLLSADGAISQAIGAAAHDFA 226 Query: 190 IFLIELV 196 + + Sbjct: 227 GGTVVHI 233 >gi|260820654|ref|XP_002605649.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] gi|229290984|gb|EEN61659.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] Length = 168 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 46/161 (28%), Gaps = 24/161 (14%) Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPA 297 D +D +G++ + V+ +R + R T+ A Sbjct: 30 DMVDGFDIGSAQTRVGVVQFAWMVQAEFNLGDYLDGTDLRNAIARIRYMD-GPGTEIGKA 88 Query: 298 MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKA 357 + + L S G + + + +I +TDG ++ + Sbjct: 89 LVFTKRRLFS----------ELYGARPETQDVPRIVILITDGRSSPESQLHTAGVVVYAV 138 Query: 358 KENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 + L+T S Y+V + DSL Sbjct: 139 GVGEAV-----------DEAELETAASDSSKVYHVTDFDSL 168 >gi|115680671|ref|XP_001197987.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115719718|ref|XP_001196973.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 528 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 45/128 (35%), Gaps = 25/128 (19%) Query: 251 DVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 D +G++ + + + E + T ++ V S T +K ++L S Sbjct: 142 DSKLGIVVFESASREIAELTDITDTASRQRLVNALNTS-PKGTTCIGCGIKSGLKVLGSY 200 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 + +I+ L+DG N + + + D + + I TI+ Sbjct: 201 AQG-------------------GYILLLSDGVEN---EDPSITDMYDDINNSGVIIDTIT 238 Query: 369 INASPNGQ 376 I+ + Q Sbjct: 239 ISNEADQQ 246 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 43/155 (27%), Gaps = 21/155 (13%) Query: 251 DVYMGLIGYTTRVEKNIEPSW-GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 + +I ++ V+ + + TD + A Q L Sbjct: 396 QDKIAVIAFSDSVKAEWYATGGDLSSMSTLDQNIQKLQAGGGTDIYTPVMTALQQLAGAD 455 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 +I +TDG++N ++ N I I +I Sbjct: 456 VSQCNPA----------------VILMTDGQSNTGRTFTNVQSTYKSI-GKDIPIFSIEF 498 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 A + +L + S ++ L+ F+ Sbjct: 499 GA-ADPTQLKQFGTLSKAALFD--GRKDLVAAFKQ 530 >gi|162451432|ref|YP_001613799.1| hypothetical protein sce3160 [Sorangium cellulosum 'So ce 56'] gi|161162014|emb|CAN93319.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 404 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 63/205 (30%), Gaps = 40/205 (19%) Query: 191 FLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKE 250 + +VVD SGSM+ + DP LK A+L +D + + Sbjct: 108 PTVMIVVDRSGSMYGSGFWDP-------------------LKTAVLSVVDRLQ------D 142 Query: 251 DVYMGLIGYTTRVEKNIEPSWGTEKVR-QYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 V G + +T + G + + + + T ++ + ++ Sbjct: 143 RVRFGFLTFTGTANQQCPLLAGADGIALNHHAAIAAAYDEASTVPPGKLETPTAMTFNET 202 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK------SNVNTIKICDKAKENFIK 363 F G K+I+F+TDGE + + + + A E I Sbjct: 203 VVPELLAFPEPG--------PKYILFVTDGEPDRCDDVRAECARDDVVGAVQDAYEQGIG 254 Query: 364 IVTISINASPNGQRLLKTCVSSPEY 388 + + Q L + Sbjct: 255 TFVFGLGSGALAQHLQDVANAGAGQ 279 >gi|323449422|gb|EGB05310.1| hypothetical protein AURANDRAFT_72267 [Aureococcus anophagefferens] Length = 1009 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 20/290 (6%), Positives = 50/290 (17%), Gaps = 22/290 (7%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKR----ALIDDAKRFIKNHIKESLSGYSA 111 + + LA + + ++ DD + + SA Sbjct: 351 SADGTALAVGASGNDGAGSYAGHARVFAWNSDDDTWIQRGDDIDGEAADDNSGASVSLSA 410 Query: 112 VFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 + N S A +S + + + + Sbjct: 411 DGTTLAVGASGNDGAGSSAGHARVFAWNSVDETWIQRGDDIDGEAAYDYSGS-------S 463 Query: 172 KIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAAL 231 +S L + + + + + I + Sbjct: 464 VSLSADGTTLAVGAYTNGDDDLGSIAGHARVFAWDPVDETWVQRGVDIDGEFVSLSADGT 523 Query: 232 KNALLLFLDS-----------IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 A+ + S + G S + Sbjct: 524 TLAVGAPYNGGAGYEAGHARVFAWDSVDETWKQRGDDIDAEAAGDWSGWSVSLSADGATL 583 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 +D+ A A+ + + Sbjct: 584 AVGAIYNDGGGSDAGHARVFAWDSVDETWVQRGDDIDGEAAGDDSGYSVS 633 >gi|308050055|ref|YP_003913621.1| TadE family protein [Ferrimonas balearica DSM 9799] gi|307632245|gb|ADN76547.1| TadE family protein [Ferrimonas balearica DSM 9799] Length = 167 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 11/123 (8%), Positives = 43/123 (34%), Gaps = 3/123 (2%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI-LAGASKMVSNL 72 ++ + + FA+ +++ + + Y N++ + A LA Sbjct: 1 MKRHQQGAYVVEFAIVATVVFVMLFACLEIARLMYSYNALTEVSRRAARLAAVCAPSPGT 60 Query: 73 SRLGDR--FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 D ++ + + A + +++ + + +++ T+++ V + + + Sbjct: 61 MEPTDAMKALALFDGRQMVANLSADNLVVDYLTNTGALAASIPDTTQVRATVANYQHELI 120 Query: 131 HMA 133 Sbjct: 121 IPG 123 >gi|188589415|ref|YP_001922351.1| surface protein PspC [Clostridium botulinum E3 str. Alaska E43] gi|188499696|gb|ACD52832.1| surface protein PspC [Clostridium botulinum E3 str. Alaska E43] Length = 604 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 20/294 (6%), Positives = 68/294 (23%), Gaps = 19/294 (6%) Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 ++ ++ + DA + + S + + + Sbjct: 282 AQASDDIDGAKYAKTVANYAITNEKGVDATAELAD---MSFNIVDGKLVAYKTTESNGKT 338 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHL----------LNQRYNQKIVS 175 +++ + + T N D + + + ++ + V Sbjct: 339 NVTVKTVKFSTERGYTYTEIENADDQDATAFDIDVDGNMWALDGGFVYKFDNADEWTKVY 398 Query: 176 FIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKN-- 233 + + ++ + + + + + +K Sbjct: 399 KVDGSMDQMSVYNKDNMVVWNENDEVYSVIGSKDESGETEDSQGWVQAEDGSWTFVKEDG 458 Query: 234 --ALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKP 291 A D+ M + P + D+ Sbjct: 459 TKATGWLNDNGTWFYLNSNGAMMTGWQNVNGTWYYLNPVSDGTRGAMKTGWINDNGTWYY 518 Query: 292 TDSTPAMKQAYQILTSDKKRSFFTNFFRQG--VKIPSLPFQKFIIFLTDGENNN 343 T+++ AM+ +Q + + + G + + + + G N+ Sbjct: 519 TNASGAMQTGWQNVNGTWYFMQGSGAMQTGWLNDNGTWYYLQASGAMKTGWLND 572 >gi|114568965|ref|YP_755645.1| TadE family protein [Maricaulis maris MCS10] gi|114339427|gb|ABI64707.1| TadE family protein [Maricaulis maris MCS10] Length = 185 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 43/128 (33%), Gaps = 6/128 (4%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + + ++ FA+ F LL+ +I + + +E+ A+L A K+ + Sbjct: 19 RFVRARSGATAVEFAMIGAPFFLLLFAMIEIAAVFFTGTVLEN----AVLESARKIRTGE 74 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 ++ G ++ + ++++ + + + I + Sbjct: 75 AQTGGMSQAG--FRQEVCELIEAVANCDNLEIDVQVFEDFDTVDQSSPIQEDGSMDTGDF 132 Query: 133 ANNRLDSS 140 D+ Sbjct: 133 GWEPGDAG 140 >gi|256823631|ref|YP_003147594.1| Vault protein inter-alpha-trypsin domain-containing protein [Kangiella koreensis DSM 16069] gi|256797170|gb|ACV27826.1| Vault protein inter-alpha-trypsin domain protein [Kangiella koreensis DSM 16069] Length = 689 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 36/126 (28%), Gaps = 24/126 (19%) Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 T+ A+ A P + ++FLTDG Sbjct: 398 ATLEADGGTEIAKAINLALD--------------------KPDSSLLRQVVFLTDGSI-- 435 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQ 403 I ++ N ++ TI I A+PN + K + A + + Sbjct: 436 GNERQIFQMIENQLGNN--RLFTIGIGAAPNSYFMSKAANYGRGTFTYIGKASEVQTKLE 493 Query: 404 NISQLM 409 + + + Sbjct: 494 QLFKKL 499 >gi|260824043|ref|XP_002606977.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] gi|229292323|gb|EEN62987.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] Length = 219 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 41/154 (26%), Gaps = 25/154 (16%) Query: 231 LKNALLLFLDSI-DLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLIL 289 L+ A+ FL + + + ++ + V + V++ V Sbjct: 9 LRKAVRQFLAGVEQTATQTGLKENVAVVEFGGGVRIVQHLTNDYRCVQRAVDNL---RPG 65 Query: 290 KPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVN 349 T + +A + L + R +I +TDG+ +N Sbjct: 66 GTTPMFEGLMEALKELCQNGGVLVLPGGIRM---------TPRVILMTDGKPDNQ---DK 113 Query: 350 TIKICDKAKENF---------IKIVTISINASPN 374 I + I + + + Sbjct: 114 VILAAAAFSRKGYQEVGLPYPVPIACVGCGSGVD 147 >gi|118462291|ref|YP_879442.1| sensor histidine kinase [Mycobacterium avium 104] gi|118163578|gb|ABK64475.1| sensor-type histidine kinase PrrB [Mycobacterium avium 104] Length = 442 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 16/156 (10%), Positives = 44/156 (28%), Gaps = 11/156 (7%) Query: 40 LIYVLDW------HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + K ++ESA + A +A + ++ + L+ + E +S L + Sbjct: 201 AVEGMLARIGKEQQRTKAALESARDFAAVA-SHELRTPLTAMRTNLEVLSTLD---LPHE 256 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-MTHMANNRLDSSNNTIFYNMDVMT 152 ++ + + + S A E + + L + Sbjct: 257 QRQEVIGDVIRTQSRIEATLTALERLAQGELTTVDDFVPFDITELLDRAAHDALRVYPDV 316 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ ++V ++ G Sbjct: 317 EVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHGNA 352 >gi|41406251|ref|NP_959087.1| hypothetical protein MAP0153 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41394599|gb|AAS02470.1| hypothetical protein MAP_0153 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 442 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 16/156 (10%), Positives = 44/156 (28%), Gaps = 11/156 (7%) Query: 40 LIYVLDW------HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + K ++ESA + A +A + ++ + L+ + E +S L + Sbjct: 201 AVEGMLARIGKEQQRTKAALESARDFAAVA-SHELRTPLTAMRTNLEVLSTLD---LPHE 256 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-MTHMANNRLDSSNNTIFYNMDVMT 152 ++ + + + S A E + + L + Sbjct: 257 QRQEVIGDVIRTQSRIEATLTALERLAQGELTTVDDFVPFDITELLDRAAHDALRVYPDV 316 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ ++V ++ G Sbjct: 317 EVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHGNA 352 >gi|23010405|ref|ZP_00051105.1| COG1109: Phosphomannomutase [Magnetospirillum magnetotacticum MS-1] Length = 134 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 23/124 (18%), Gaps = 23/124 (18%) Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVT 366 D S F E + + C K I Sbjct: 3 PDPAASRPPVLFAWKDGKTYTGNPSQFRGYRYAEWDGSQPAQIDPNKCTDLKNAGATISV 62 Query: 367 ISI----------------------NASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN 404 + I SP L+ C S P + Y A + Sbjct: 63 LYIPYKLVKNYTNTSYIVWENDRVNGFSPTLAEPLRKCAS-PGFFYTANTAADITASLGA 121 Query: 405 ISQL 408 + Sbjct: 122 MFDQ 125 >gi|217978823|ref|YP_002362970.1| hypothetical protein Msil_2686 [Methylocella silvestris BL2] gi|217504199|gb|ACK51608.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 174 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 26/70 (37%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 SE ++ FAL+ L+L+ + + Y+ + +A A ++ A S+ Sbjct: 33 RRSEAGTAALEFALATPLLLILVAGVTEIGFAIYQGMQVSAAVEAGMMYAAKNGWSSSGI 92 Query: 75 LGDRFESISN 84 + Sbjct: 93 ASAVVSASGA 102 >gi|163731885|ref|ZP_02139332.1| hypothetical protein RLO149_21314 [Roseobacter litoralis Och 149] gi|161395339|gb|EDQ19661.1| hypothetical protein RLO149_21314 [Roseobacter litoralis Och 149] Length = 200 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 10/90 (11%), Positives = 23/90 (25%), Gaps = 14/90 (15%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLL--IGFLI-YVLDWHYKKNSMESA----------- 56 ++ ++ N +I + + F I + + Sbjct: 5 LRRFCDDQQGNVAIEAVIILPIMFWAYLAMFTIFDTYRQYTSQQKAAYTISDLISRQATP 64 Query: 57 NNAAILAGASKMVSNLSRLGDRFESISNHA 86 +A L G + L+R + A Sbjct: 65 LDAGFLDGTHNLFETLTRAVGQTGMRITVA 94 >gi|253687076|ref|YP_003016266.1| hypothetical protein PC1_0675 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753654|gb|ACT11730.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 216 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 12/68 (17%), Positives = 23/68 (33%), Gaps = 2/68 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 F ++ S A+ ++ AL+ L + + ++ ME A +A Sbjct: 23 FLNGLRRFWFSRCASTAVETALAFPIVLAIGSLCADIYTVGLERTRMEQ--RAGAIASIL 80 Query: 67 KMVSNLSR 74 M L Sbjct: 81 AMQQKLDE 88 >gi|33600172|ref|NP_887732.1| putative hemolysin [Bordetella bronchiseptica RB50] gi|33567770|emb|CAE31684.1| putative hemolysin [Bordetella bronchiseptica RB50] Length = 3346 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 37/322 (11%), Positives = 79/322 (24%), Gaps = 66/322 (20%) Query: 70 SNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISM 129 + + + A D + L+ + + N + Sbjct: 2643 NADGSYSYKANASGQGALPGDGQDVFTYTVRDADGDLTSTTLTINIDQFVAGSNDNNTIT 2702 Query: 130 THMANNRL--DSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 N+ + D + L + + ++ + Sbjct: 2703 GGAGNDVMLGDQGGIKQNVTAGTSYNIALVLDLSDSMNDKWGS----------------- 2745 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 K T++ K+AL L++ + Sbjct: 2746 ---------------------------------GSNKPTRLQTAKDALKALLENQLAVHD 2772 Query: 248 VKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 + +V + ++ K E V + V M T A + Q Sbjct: 2773 GEINVSLITFNGSSSALKKSITGLTPENVDEMVDILMGLKASSTTPYGAAFDRTTQWFE- 2831 Query: 308 DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + V P++ FLTDGE + S + + A + + I Sbjct: 2832 ----------GQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISD--VHGI 2879 Query: 368 SINASPNGQRLLKTCVSSPEYH 389 I + + L K ++ Y+ Sbjct: 2880 GIGSGVSTSTLNKY-DNTGGYY 2900 >gi|294146457|ref|YP_003559123.1| hypothetical protein SJA_C2-00280 [Sphingobium japonicum UT26S] gi|292676874|dbj|BAI98391.1| putative membrane protein [Sphingobium japonicum UT26S] Length = 598 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 15/42 (35%), Gaps = 1/42 (2%) Query: 7 FRFY-FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWH 47 FR + + A + +FA + GF + ++ Sbjct: 9 FRRAWLRPLLGDRSAAVAPMFAALGAVLIGAAGFALDGGLYY 50 >gi|317123666|ref|YP_004097778.1| type II secretion system F domain [Intrasporangium calvum DSM 43043] gi|315587754|gb|ADU47051.1| Type II secretion system F domain [Intrasporangium calvum DSM 43043] Length = 652 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 26/253 (10%), Positives = 76/253 (30%), Gaps = 32/253 (12%) Query: 160 FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPI 219 + + + V + + +++ + V + + ++ + Sbjct: 33 VMSDVKSSSGTVTGVLTVRSANPVQVDPGSVKASVDGVIVKSFVSEMTHTKRTAMLVIDT 92 Query: 220 CQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQY 279 MA ++ A +L EDV +G++ + +++P+ ++ Sbjct: 93 SGSMGTDGMATVRAATRAYLKEA------PEDVLIGVVTFANTAGVDLKPTVDRAAAQRV 146 Query: 280 VTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG 339 V T A++ A + + D + ++ L+DG Sbjct: 147 VNGLD---ARGDTSLYAAVRSAARAMPGDGD--------------------RSMVLLSDG 183 Query: 340 ENNNFKSNVNTIKICD-KAKENFIKIVTISINASPNGQ--RLLKTCVSSPEYHYNVVNAD 396 + + + + K +++ + N L +S NA Sbjct: 184 ADTVSDDRQGDLAEANRELKRRGVRVDVVRFNTDDPDAVVALRSFASASGGSVIPATNAS 243 Query: 397 SLIHVFQNISQLM 409 + F++ ++ + Sbjct: 244 DVGAAFKSAARAL 256 >gi|313837068|gb|EFS74782.1| conserved hypothetical protein [Propionibacterium acnes HL037PA2] gi|314927903|gb|EFS91734.1| conserved hypothetical protein [Propionibacterium acnes HL044PA1] gi|314971841|gb|EFT15939.1| conserved hypothetical protein [Propionibacterium acnes HL037PA3] Length = 125 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 30/123 (24%), Gaps = 8/123 (6%) Query: 21 NFSIIFALSVMSFLLLIGFLIYVL-DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + S++ + + L+ G + + + A AA + L Sbjct: 2 SMSVLITMLLPVLLVSAGIAVDGAERARAVRIAHTVAAEAARAGCEEGSAAQLVGQDGTG 61 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + A A R S +A + V + + + Sbjct: 62 AARVAAETSARGAKADRL-------SRLVITANGDAVSVATEVTRPTRLLALIGLTEVHG 114 Query: 140 SNN 142 + Sbjct: 115 RAS 117 >gi|257064791|ref|YP_003144463.1| RHS repeat protein [Slackia heliotrinireducens DSM 20476] gi|256792444|gb|ACV23114.1| RHS repeat protein [Slackia heliotrinireducens DSM 20476] Length = 516 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 45/193 (23%), Gaps = 11/193 (5%) Query: 41 IYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKN 100 + V + +AA+L A+ + +N A L + Sbjct: 141 VDVSG-----TKLAE-LDAAVLQAAAYLDVTGCENLAELNLSANSALETLRAMDTALTEV 194 Query: 101 HIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQF 160 + + + + + ++ I + + +++ Y L Sbjct: 195 DVSGAAGLSAIEVDDDVAVVGLEATPIHEQWVPVHVTKVVSSS---GGTSTYEYSLSLDE 251 Query: 161 IEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPIC 220 + V E I + D+ D I Sbjct: 252 AGVATGYAESTVGVDDTSEQAVYEFNYDESGNIIGIWDVEA--ESMDVEFVYDDAGNVIR 309 Query: 221 QDKKRTKMAALKN 233 + T+ + + Sbjct: 310 RTGVPTEYSYTYD 322 >gi|163731886|ref|ZP_02139333.1| hypothetical protein RLO149_21319 [Roseobacter litoralis Och 149] gi|161395340|gb|EDQ19662.1| hypothetical protein RLO149_21319 [Roseobacter litoralis Och 149] Length = 181 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 5/46 (10%), Positives = 17/46 (36%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI 61 E ++ F + V + I + + ++ ++ + A+ Sbjct: 14 RDEGGQIAVEFVIMVPLVFTIFMTAIELGIYSMRQMWLDRGLDIAV 59 >gi|116249089|ref|YP_764930.1| hypothetical protein pRL120423 [Rhizobium leguminosarum bv. viciae 3841] gi|115253739|emb|CAK12132.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 194 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 16/169 (9%), Positives = 41/169 (24%), Gaps = 21/169 (12%) Query: 7 FRFYFKKGIA----SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN----- 57 F + + A + FAL + ++L+ + + ++ Sbjct: 9 FFRFARSRARHLLCDRSAASGVEFALVLPILVMLLFGTVDLGHALTVSRKIDEIASSTGD 68 Query: 58 -----------NAA-ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKES 105 + A +L+GAS ++ G + ++ + Sbjct: 69 MISQQGSWTKSDVAKLLSGASFILQPYDTTGLTITLAVDDIAKSGSATVNWSAALNTSAL 128 Query: 106 LSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSY 154 SG ++ + S + S+ Sbjct: 129 TSGSASTIEVPSEIQDDGVQVVLTRVQYTLTTPVSAFFSNFTGQNGYSF 177 >gi|116694138|ref|YP_728349.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113528637|emb|CAJ94984.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 158 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 15/151 (9%), Positives = 43/151 (28%), Gaps = 7/151 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANN-- 58 M L +R + + +I FA+ L ++ ++Y + + A Sbjct: 1 MLLHARVNRGINR-RRHGAGSAAIEFAIVAPVLLAIVIGIVYYGVMLALQQVLTLAAEEG 59 Query: 59 -AAIL---AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A L AG S ++ + + ++ +++ S Sbjct: 60 ARAALRYPAGVSGSGLAATQGARVDAAAAMARGTLPTSIRNLLSTGNVAQAVPCASGSTD 119 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 + + + + + ++ + Sbjct: 120 VCVQVTLNLPTTNILPAVPMVPVPATLSGSA 150 >gi|59711873|ref|YP_204649.1| multidrug efflux system [Vibrio fischeri ES114] gi|59479974|gb|AAW85761.1| predicted multidrug efflux system [Vibrio fischeri ES114] Length = 447 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 45/144 (31%), Gaps = 7/144 (4%) Query: 20 ANFS-IIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL--- 75 S IIF L ++S ++ G I + + + + + A+ + + L Sbjct: 51 GVISQIIFFLMIISMMVSTGTGILIGQ-YNGSGRTQQSTDVAVASVVLGGILGLGLSLFA 109 Query: 76 --GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 I + + A ++ +L+ + ++T +++ S + ++ Sbjct: 110 FFAANPIVIMYGLESDVATFAYDYLIITGSFTLTVTLGIVFSTILRSNGYSRTPMIVNLL 169 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYR 157 L+ N Sbjct: 170 TGGLNIFGNYCALYQPFGLPVYGV 193 >gi|110637494|ref|YP_677701.1| hypothetical protein CHU_1084 [Cytophaga hutchinsonii ATCC 33406] gi|110280175|gb|ABG58361.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406] Length = 690 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 10/142 (7%), Positives = 35/142 (24%), Gaps = 20/142 (14%) Query: 269 PSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF---------TNFFR 319 + + + ++T ++ + + ++ Sbjct: 89 LTSDSTQRLSWLTTIDKIKQALN---NEGIETSVYSFSGKQENDPVTFTFPSSNLNALLL 145 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 + + S I L+DG N T + TI + + + + + Sbjct: 146 EPKQEESGRNIAASILLSDGIYNVGMDPSITPSTA--------PVFTIGVGDTSDQKDVR 197 Query: 380 KTCVSSPEYHYNVVNADSLIHV 401 + + + + + Sbjct: 198 LVSAIHNKLAFKGNDFPIVAEI 219 >gi|332185369|ref|ZP_08387117.1| hypothetical protein SUS17_561 [Sphingomonas sp. S17] gi|332014347|gb|EGI56404.1| hypothetical protein SUS17_561 [Sphingomonas sp. S17] Length = 178 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 38/142 (26%), Gaps = 19/142 (13%) Query: 15 IASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSR 74 + + + FAL + L+L + ++ +A K+ Sbjct: 8 LRDRRGVAMVEFALILPVMLVL----------YLGGAQLQDG-----IACNRKVTIATRA 52 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 GD ++ A D + + + A TE+ + + + Sbjct: 53 AGDLITQNTSGKISAKEVDDSLKVATQVLLPYAASEATVRVTEVATSNGRTS----VVWS 108 Query: 135 NRLDSSNNTIFYNMDVMTSYDY 156 L+ + + + Sbjct: 109 RGLNVAAYKRGTAIVIPPEMRM 130 >gi|86740436|ref|YP_480836.1| hypothetical protein Francci3_1731 [Frankia sp. CcI3] gi|86567298|gb|ABD11107.1| putative membrane protein [Frankia sp. CcI3] Length = 143 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 13/127 (10%), Positives = 34/127 (26%), Gaps = 6/127 (4%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + + +F++ G ++ K A AGA ++ + R G Sbjct: 14 GTVTAFVVVLFAAFVMFGGLIVDAGGALADKTEAMGIAQEAARAGAQQLDLTVLRQGGTV 73 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDS 139 + + A A + + V + + + + + L Sbjct: 74 RLLPDQAVAAAQSYLAQAGA------TGTVTIVDNTVTVTVTAVHDNLLLGVVGLDTLTV 127 Query: 140 SNNTIFY 146 + + Sbjct: 128 TATGSAH 134 >gi|269977483|ref|ZP_06184455.1| TadE family protein [Mobiluncus mulieris 28-1] gi|269934399|gb|EEZ90961.1| TadE family protein [Mobiluncus mulieris 28-1] Length = 150 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 41/138 (29%), Gaps = 4/138 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + E+ + + FAL V + L+ ++ + + K ++ +AA + V Sbjct: 15 RRILKCERGSEPVSFALVVPLVMFLVLGVLQLALSMWVKTTL---IDAASAGAHAAAVVG 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIK-ESLSGYSAVFYNTEIQNIVNSSRISMT 130 A D + + + ++S +Q V++ + Sbjct: 72 APPEVAEETVREVLASTVGRDYVQNVEVSRVNLTTVSNQFGSATVDALQVSVSAPLPVLG 131 Query: 131 HMANNRLDSSNNTIFYNM 148 + Sbjct: 132 FFGFGSFQVVGHAAVEVA 149 >gi|90415840|ref|ZP_01223773.1| putative outer membrane protein [marine gamma proteobacterium HTCC2207] gi|90332214|gb|EAS47411.1| putative outer membrane protein [marine gamma proteobacterium HTCC2207] Length = 1157 Score = 37.5 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 22/370 (5%), Positives = 57/370 (15%), Gaps = 24/370 (6%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 + A A + I + Sbjct: 663 DGLDGATGASGVAGSVTELTDALVEDYSVYLGNNPSATTDTAQYNVAIGMTALDAVTTGD 722 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQ--FIEHLLNQRYNQK 172 + + + + + + M L Sbjct: 723 SNVAVGHDALTSNTTGNSNTASGSFALKSNTTGMRNTALGANTLDDNTTGSYNTASGAYA 782 Query: 173 IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALK 232 + S G+ ++ + + + + S+ ++ Sbjct: 783 LQSNTTGYNNTASGDSALYYNTEGIGNTAFGNAVLVSNTTGSHNTASGSYASNRNTTGDY 842 Query: 233 NALLLFLDSIDLLSHVKE---------DVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 N ++ K G + G + Sbjct: 843 NTATGTYALYGNITGSKNTASGNRALYSNTTGNYNTANGTNALYSNTAGYDNTATGTDAL 902 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLT---DGE 340 S ++ ++ + T + + + Sbjct: 903 YSS-TTGYQNTASGVQSLMRNTTGWANTAVGDHALWSVTTSNFNTATGGSALFNSTGNQN 961 Query: 341 NNNFKSNVNTIKICDKAKENFIKIVT---------ISINASPNGQRLLKTCVSSPEYHYN 391 N + D + T I A L Sbjct: 962 TANGYFALYNNTTGDNNVAVGYALYTNTTGSQNTAIGYGADVASGDLTNATAIGNGAIVT 1021 Query: 392 VVNADSLIHV 401 N L + Sbjct: 1022 ASNTIQLGNA 1031 >gi|327459721|gb|EGF06061.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1057] Length = 450 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 97/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 A +S + + + + A D KN + + Sbjct: 53 AVSSPIEAAGTEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETST 112 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 113 VDVTKTEVAPAKVDKAPEPITRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 167 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 168 PPTVTKAGSADIVFVVDRSGSMGGTIDIVRANINEFVRNITKEGI---TARFGLATFSDE 224 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 225 VYGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 264 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 265 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 324 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + + K + ++ N +L I + + +Y Sbjct: 325 IERTVATV-----KAIEGIYKNFA-TEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 376 Query: 415 SVILK 419 I+K Sbjct: 377 YKIIK 381 >gi|296268212|ref|YP_003650844.1| hypothetical protein Tbis_0219 [Thermobispora bispora DSM 43833] gi|296090999|gb|ADG86951.1| hypothetical protein Tbis_0219 [Thermobispora bispora DSM 43833] Length = 145 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 13/111 (11%), Positives = 38/111 (34%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + ++ E+ + ++ + + ++ ++ +SA + + LAGA+ S Sbjct: 27 RRLSGERGSATVWMITVITVIWAVATAVVAAGMARVARHRAQSAADLSALAGAAWAFSRP 86 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + I++ L A + SL + ++ Sbjct: 87 ALACTHAKEIADANGVVLESCAISEGIAEVAVSLRFTLPIAGARSVRATAR 137 >gi|289207395|ref|YP_003459461.1| Neisseria PilC domain protein [Thioalkalivibrio sp. K90mix] gi|288943026|gb|ADC70725.1| Neisseria PilC domain protein [Thioalkalivibrio sp. K90mix] Length = 1174 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 17/162 (10%), Positives = 39/162 (24%), Gaps = 22/162 (13%) Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ 330 W V D + T M+ A + +IP Sbjct: 341 WSPGDGNVLVDPDWPLIAAGLTPLEGTMRTARDYFLGQTEHFGTEQGNLDTPEIPRSCDV 400 Query: 331 KFIIFLTDGEN----NNFKSNVNTIKICDKAKEN--------FIKIVTISIN-------- 370 +++TDG + + + +A + + + Sbjct: 401 NAAVWVTDGLPSVQADGTELGDDPATAIRQASDAIEDFHTTTDVDTYIVGFALPPGIGEL 460 Query: 371 ASPNGQRLLKTCVSSPEY--HYNVVNADSLIHVFQNISQLMV 410 + L S Y+ + D L+ + + ++ Sbjct: 461 DDAMPENPLDLLADSGGTGSAYDATSEDGLLDAMSRVFESVI 502 >gi|198424351|ref|XP_002122740.1| PREDICTED: similar to collagen, type XXVIII [Ciona intestinalis] Length = 870 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 62/183 (33%), Gaps = 24/183 (13%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +K +LL +D+ + + +++ ++ ++ R ++ S + Sbjct: 673 FDRVKQWILLTVDAFNPGE-RRNPLHVDVVQFSERSMIEVD-SDVASSAQISDQVQGIIQ 730 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + T + A++ + + K +I +TDG+ S Sbjct: 731 MRSGTKTYSALEFVNREVWPLL----------------RPGAFKILITMTDGDA----SE 770 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQ 407 + ++A + + + + + L SS +NV N +L + +I + Sbjct: 771 DRNVDAIEEANNKYNLMAAVGVGSKVISSELEDF--SSTGNVFNVDNFSALESIISDIVE 828 Query: 408 LMV 410 + Sbjct: 829 NIC 831 >gi|83859351|ref|ZP_00952872.1| hypothetical protein OA2633_13140 [Oceanicaulis alexandrii HTCC2633] gi|83852798|gb|EAP90651.1| hypothetical protein OA2633_13140 [Oceanicaulis alexandrii HTCC2633] Length = 178 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 8/111 (7%), Positives = 29/111 (26%), Gaps = 8/111 (7%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + + E+ ++ FAL + L + + + + + Sbjct: 1 MRQLIRKCLGFHRDERGVSAVEFALLAPFMIALYLGSVQLT--------LGLTADRKVSQ 52 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 A+ + +++ ++ A F + ++ Sbjct: 53 VANSVADLVTQDDFVTDADLLDIYAAADAILNPFAPAPLSLRITSVRMDAD 103 >gi|308270390|emb|CBX27002.1| hypothetical protein N47_A10310 [uncultured Desulfobacterium sp.] Length = 1056 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 24/290 (8%), Positives = 65/290 (22%), Gaps = 50/290 (17%) Query: 164 LLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDK 223 + ++ +P L+ + D + + + + Sbjct: 4 YDENGNSPITITSMPTDGWTGWITGKQVLVRFITDANPA-TVSTGWKIDSYQYQSKTDYN 62 Query: 224 KRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRD 283 +++ +A++ ++ MG N + +K + +T Sbjct: 63 FFSRLDVATSAIINVIE---STRGRVNWGIMGFGETNATPPMNASYNDDQQK-QSIITAL 118 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKK--------------RSFFTNFFRQGVKIPSL-- 327 ++ +M + S + + + G S Sbjct: 119 NGFSATGDSNIAESMNLVLKEFNSKAQQIHKDCNKNFVIVLSDGYADHDTLGSDWGSPVA 178 Query: 328 -----------------------PFQKFIIFLTDGENNNFKSNVNTIKICDKAKEN--FI 362 G + K T D+ ++ I Sbjct: 179 DYAALNAAEASQYHFTQDPFQDANPPADYYDDIAGFMYSHKYTDYTSIPADERADSADNI 238 Query: 363 KIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 + I +LK + L++ F ++ L+ Sbjct: 239 IVHNIGF---STESPMLKHASDLGGGIFLTAYSKSQLVNAFYSLGILIAE 285 >gi|212538105|ref|XP_002149208.1| von Willebrand and RING finger domain protein [Penicillium marneffei ATCC 18224] gi|210068950|gb|EEA23041.1| von Willebrand and RING finger domain protein [Penicillium marneffei ATCC 18224] Length = 1016 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 21/228 (9%), Positives = 60/228 (26%), Gaps = 21/228 (9%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 K S + + + + V P+ + Sbjct: 451 TTKTNKRQSSATSSYGANKSNNTALTEYTNSMRDPRPAPSFHVPLDLVVVIPVSSSMQGL 510 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L++AL + ++ MGL+ + + T+ + Sbjct: 511 KITLLRDALRFLVLNLGPRD------RMGLVTFGSSGGGVPLVGMTTKAWNGWGKILNSL 564 Query: 287 LILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKS 346 + + + + + N I+ ++D ++ +S Sbjct: 565 RPVGQKSLRADVVEGANVAMDLLMQRKSNNPIS------------TILLISDSSTSDPES 612 Query: 347 NVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + + AK + I + + + L++ + + V + Sbjct: 613 VDFVVSRAEAAK---VGIHSFGLGLTHKPDTLIELSTRTKACYTYVKD 657 >gi|190894970|ref|YP_001985263.1| hypothetical protein RHECIAT_PC0000636 [Rhizobium etli CIAT 652] gi|218513508|ref|ZP_03510348.1| hypothetical protein Retl8_07211 [Rhizobium etli 8C-3] gi|190700631|gb|ACE94713.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 194 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 41/171 (23%), Gaps = 7/171 (4%) Query: 4 LSRFRFYFKKGI---ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 LS FRF + + A + FAL + ++L+ + + + + Sbjct: 7 LSLFRFARSRLLDLARDRLAASGVEFALVLPILVMLLFGTVDLGHALTVSRKI----DEI 62 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 + S + + A L +K + + + Sbjct: 63 ASTTGDLIGQQSSWTSSDVTKLLSGASFILQPYDTSGLKITLTVNDISKNGNATVNWSAA 122 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ 171 S+ S A + + + Y N Sbjct: 123 FNTSALNSGAASAIDIPSQIQDAGVQVVLTRVQYTLTTPVSSFFSNFTGQN 173 >gi|167382730|ref|XP_001736239.1| hypothetical protein [Entamoeba dispar SAW760] gi|165901405|gb|EDR27488.1| hypothetical protein EDI_092790 [Entamoeba dispar SAW760] Length = 473 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 39/134 (29%), Gaps = 20/134 (14%) Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 T+ + Q + T A + + D T Sbjct: 183 TDSINQLMYSLCKISYGGGTRYRDAFNKMIDDIKQDIFDKTQTID--------------- 227 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKT-CVSSPEYHYN 391 I ++DG N ++ K D+ E I + TI +L + C + +Y Sbjct: 228 IFLVSDG--NEIQNRKLLRKSLDRLNEYKISVHTIDTGDGTWKDKLFEISCEQNG--YYI 283 Query: 392 VVNADSLIHVFQNI 405 N + +F + Sbjct: 284 NSNQQGIDTLFNAL 297 >gi|333028467|ref|ZP_08456531.1| putative von Willebrand factor [Streptomyces sp. Tu6071] gi|332748319|gb|EGJ78760.1| putative von Willebrand factor [Streptomyces sp. Tu6071] Length = 588 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 49/200 (24%), Gaps = 24/200 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-------GT 273 + +++M K +LL L + + + + + G Sbjct: 397 PGRGQSRMDVTKASLLQALAQFTSADDIGLWEFSTRLDGDRDYRELVPTDRLGARKGEGV 456 Query: 274 EKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + T AYQ + F P Sbjct: 457 TQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYAADKFNAVVLLTDGTNEDP--- 513 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 G + D A+ + +V I++ G + ++ + Sbjct: 514 -------GSLTRGALLTKLRDLADPAR--PLPLVAIAVGPEAAGDDVKAIGSATGGSGFK 564 Query: 392 VVNADSLIHVFQNISQLMVH 411 V + + V I+ +V Sbjct: 565 VDDPAQIHEV---INNAIVE 581 >gi|318059857|ref|ZP_07978580.1| hypothetical protein SSA3_18051 [Streptomyces sp. SA3_actG] gi|318076736|ref|ZP_07984068.1| hypothetical protein SSA3_08427 [Streptomyces sp. SA3_actF] Length = 588 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 22/200 (11%), Positives = 49/200 (24%), Gaps = 24/200 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW-------GT 273 + +++M K +LL L + + + + + G Sbjct: 397 PGRGQSRMDVTKASLLQALAQFTSADDIGLWEFSTRLDGDRDYRELVPTDRLGARKGEGV 456 Query: 274 EKVRQYVTRDMDSLI--LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + T AYQ + F P Sbjct: 457 TQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYAADKFNAVVLLTDGTNEDP--- 513 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 G + D A+ + +V I++ G + ++ + Sbjct: 514 -------GSLTRGALLTKLRDLADPAR--PLPLVAIAVGPEAAGDDVKAIGSATGGSGFK 564 Query: 392 VVNADSLIHVFQNISQLMVH 411 V + + V I+ +V Sbjct: 565 VDDPAQIHEV---INNAIVE 581 >gi|227112734|ref|ZP_03826390.1| putative hemagglutinin/hemolysin-related protein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 1159 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 40/362 (11%), Positives = 89/362 (24%), Gaps = 57/362 (15%) Query: 93 DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV-- 150 DA +F S + V +S+ + ++ + +++ DV Sbjct: 567 DAGKFNVVAQATSTEIANHDTATGYSAGGVEQYGMSIGTIGDDTMSGTHSHDVMIADVSG 626 Query: 151 -------MTSYDYRLQFIEHLLNQRYNQKI-------VSFIPALLRIEMGERPIFLIELV 196 + + + + + + S I + G +FL + Sbjct: 627 LQIIEGQNYNIAFMVDSSGSMSSTDIDNARTSLSNVFKSLINSAGGANSGTVNVFLADFD 686 Query: 197 VDL--SGSMHCAMNSDPE------DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 + + S++ + + K F + + Sbjct: 687 TQVGKTVSVNLNDTNALNKLTAVLNSMVGGSSAGGGTNYEDVFKTTANWFQSDVVKKNVG 746 Query: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 Y G T + N Q + D + + S ++ A + + Sbjct: 747 NNLTYFITDGEPTYYQTNEANEVRVSDWWQSLNIDNIAYKPGQSYSMD-VRGAVREVIDS 805 Query: 309 KKRSFFTNFFRQGVKIPSLPFQKF-----------------------------IIFLTDG 339 K + + K+ + D Sbjct: 806 KGNVYQYAWSDDSKKVVGTVIGQVHAQGDGTYEISSLGGVGNNSSYWTQDYRGNWHWVDN 865 Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 N SN N + + K+ I I I A NG L + + + ++ +L Sbjct: 866 SGNRDVSNTNAAQAFNLLKKQS-TIEAIGIGADLNGDTLKRY--DTDGVVQDHIDPANLN 922 Query: 400 HV 401 Sbjct: 923 KA 924 >gi|115658168|ref|XP_790852.2| PREDICTED: similar to Clca1 protein, partial [Strongylocentrotus purpuratus] gi|115935961|ref|XP_001184497.1| PREDICTED: similar to Clca1 protein, partial [Strongylocentrotus purpuratus] Length = 496 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 5/81 (6%), Positives = 22/81 (27%) Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSF 176 ++ +++ + + S L + R+++ I S Sbjct: 222 SAWRVMTERTTDFNAGIQPVSNTTPTFDVIQLSTIRSVVLVLDISGSMGGSRFDRMIQSS 281 Query: 177 IPALLRIEMGERPIFLIELVV 197 ++ + + ++ Sbjct: 282 TDYIMNVIPDGSKLGIVVFSS 302 >gi|256393183|ref|YP_003114747.1| hypothetical protein Caci_4038 [Catenulispora acidiphila DSM 44928] gi|256359409|gb|ACU72906.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 125 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 11/128 (8%), Positives = 35/128 (27%), Gaps = 4/128 (3%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFES 81 + + V++ ++ +G + + A AGA + R G Sbjct: 2 VTAFTVVIVIALMVFVGLAYDGGRALDGRVKALNEAQEAARAGAQALNLGALRAGGTAVL 61 Query: 82 ISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSN 141 A + AK ++ + + ++ + + + + + + Sbjct: 62 DP----EAAVGAAKAYLAGTGDAGTVDVAGTTVTVSVTHVQATKFLGLVGVGSITAHVTA 117 Query: 142 NTIFYNMD 149 + Sbjct: 118 SARAEQGT 125 >gi|238609519|ref|XP_002397504.1| hypothetical protein MPER_02057 [Moniliophthora perniciosa FA553] gi|215472087|gb|EEB98434.1| hypothetical protein MPER_02057 [Moniliophthora perniciosa FA553] Length = 183 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 7/71 (9%), Positives = 22/71 (30%), Gaps = 3/71 (4%) Query: 340 ENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLI 399 + ++ + +A+ + I + S + L + + V + L Sbjct: 21 DASDSTRRAQMDLVLARAEAANVPIHSFGYGRSHDPASLWLMSNHTSGTYTFVKDWYDLR 80 Query: 400 HVFQNISQLMV 410 +I+ + Sbjct: 81 D---SIAGCIG 88 >gi|87308894|ref|ZP_01091032.1| hypothetical protein DSM3645_19093 [Blastopirellula marina DSM 3645] gi|87288237|gb|EAQ80133.1| hypothetical protein DSM3645_19093 [Blastopirellula marina DSM 3645] Length = 208 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 32/124 (25%), Gaps = 7/124 (5%) Query: 22 FSIIFALSVMSFLLLIG--FLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 S+++ + +M L G F I + +E+A A LAG + L+ Sbjct: 19 VSVLWLIVIMPL-ALTGLIFTIETGRLWLARAEVENALEAGALAGVIEWNQLLAVNNQIS 77 Query: 80 ESISNHAKRALID----DAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + F + Y T + + + Sbjct: 78 TQTPRQMAAEMAGSNTVQGANFAFADPDLNYLPYEVGQVQTHLNGNKLAGGELIFGAITQ 137 Query: 136 RLDS 139 + Sbjct: 138 TQPT 141 >gi|291541799|emb|CBL14909.1| TadE-like protein [Ruminococcus bromii L2-63] Length = 221 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 46/149 (30%), Gaps = 3/149 (2%) Query: 17 SEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLG 76 SE I +++ FLIY+ + Y K +ES + GA+ + Sbjct: 13 SENGTVIIEATFVFPIMFIILFFLIYMGNAFYMKAQVESVVEQKAIQGAAYCADPILETM 72 Query: 77 DRFESISNHAKRALIDDAKRF---IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 + + F K S A+ T +I+ Sbjct: 73 KETGKFPSLSALETKPYRYIFGGMNDVESKISNEVEEALTGKTSTFFTNMYPKITTAKSD 132 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + ++ ++++V + + + F+ Sbjct: 133 IAKYNNYVVYSTFSVEVKYTIKFPISFLG 161 >gi|224283566|ref|ZP_03646888.1| Cof-like hydrolase [Bifidobacterium bifidum NCIMB 41171] Length = 280 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 41/166 (24%), Gaps = 7/166 (4%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR---- 282 + A+ + Y + S G+ + Sbjct: 79 RHDIAPEAIDAATAYFERHGIENYQWQGADGMYISCGYARHLESKGSAVWKNGQFARYWH 138 Query: 283 --DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D ++ T + F + R + +L + + Sbjct: 139 LIDDIAVPDGSTLGHVIHASKGSYFSGADPDVSFDDVSRDLGEWFTLVHGSYDDISPNNG 198 Query: 341 NNNFKSNVNTIKICDKAKENFIKIV-TISINASPNGQRLLKTCVSS 385 +CD AK+ I TI+I S N +LK +S Sbjct: 199 ELLINGVDKGTALCDVAKDLGYSIADTIAIGDSDNDTAMLKAAGTS 244 >gi|226325178|ref|ZP_03800696.1| hypothetical protein COPCOM_02970 [Coprococcus comes ATCC 27758] gi|225206526|gb|EEG88880.1| hypothetical protein COPCOM_02970 [Coprococcus comes ATCC 27758] Length = 162 Score = 37.5 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 58/172 (33%), Gaps = 16/172 (9%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 SE+ + + A+ + ++L LI V Y+ +M +A +A Sbjct: 5 HSERGSEVLEAAIVLPMLIVLAFCLIQVCLIAYEGTAMSAAIESAA--------MKSDFS 56 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 N +A+ D+ +K++++ ++ ++ + + T + Sbjct: 57 AAIASGNVN---QAVKDELASNTPGIVKDNITVTNSKVSYSDRSDEATVTDTGSTGI--- 110 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFI-EHLLNQRYNQKIVSFIPALLRIEMG 186 S ++ ++ Y + ++ I +P ++E+ Sbjct: 111 -TTISKSSTTALLESTVDYKIPTLISVGSFDGIKMSKTIKVEVPVSEKMEVS 161 >gi|332360731|gb|EGJ38540.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK49] Length = 478 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 41/341 (12%), Positives = 87/341 (25%), Gaps = 43/341 (12%) Query: 86 AKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIF 145 + D + V+ ++ + ++ Sbjct: 105 STAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETATVDVTKTEVAPSKFDKAPEPITRRT 164 Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 N T + K + + + I VVD SGSM Sbjct: 165 ENTLSGTD-----ENGNPYTQYERVDKTTTITYTSTPPTVTKVGSADIVFVVDRSGSMGG 219 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 ++ ++N K+ + L F D + + +D L + + Sbjct: 220 TIDIVRANINEFVRNITKEGI---TARFGLATFSDEVYGRNSGSKDEDTVLTRFGSSY-- 274 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP 325 + ++ + + A++ A T + Sbjct: 275 ---FTTDPAELEKAL---------------AAIRIASGGDTPETPTPALNQIISTYDWSK 316 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFI--KIVTISINASPNGQRLLKTCV 383 S +KF++ LTD E S K I + T+ + + K Sbjct: 317 SSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATV-----KAIEGIYKNFA 371 Query: 384 SSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKYSVILK 419 + ++ N +L I + + +Y I+K Sbjct: 372 -TEGRVLDIEN--NLADALTKGTTSWIVESVNEARYYKIIK 409 >gi|296448098|ref|ZP_06890000.1| TadE family protein [Methylosinus trichosporium OB3b] gi|296254412|gb|EFH01537.1| TadE family protein [Methylosinus trichosporium OB3b] Length = 186 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 16/146 (10%), Positives = 37/146 (25%), Gaps = 17/146 (11%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKK----NSMESANNAA------- 60 ++ ++ FAL +L++ +I Y + + +A N A Sbjct: 12 RRFARCVAGASALEFALIAPVVVLILTGMIDYGLAVYTRFSMNERLSAAANYAMLSAASV 71 Query: 61 ------ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 LA + + A+R ++ + Sbjct: 72 GSSSGATLAATLVAIIPSQFDVTVVVNNGPAAQRVGGVSTPSGTASNADLCYCPTLSGST 131 Query: 115 NTEIQNIVNSSRISMTHMANNRLDSS 140 +S S +A + + Sbjct: 132 VAWGSATTCASTCSSGVLAGKFVTIA 157 >gi|113953124|ref|YP_730426.1| structural toxin protein RtxA [Synechococcus sp. CC9311] gi|113880475|gb|ABI45433.1| structural toxin protein RtxA [Synechococcus sp. CC9311] Length = 2154 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 23/267 (8%), Positives = 55/267 (20%), Gaps = 51/267 (19%) Query: 97 FIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDY 156 + + S + Y + +I+ + ++ + + ++ + Sbjct: 987 ENISDVASSTTPYKVDYDLVSKSDIIPTGSGTIEGTNGEDILIGDVGGGSLVNQSINLSL 1046 Query: 157 RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNS 216 L ++ N S + + I Sbjct: 1047 VLDVSRSMILSNINFNNASVTRFSALQTATKDLLSEI----------------------- 1083 Query: 217 APICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKV 276 Q K+ +K + + N + Sbjct: 1084 ---AQSGATAKVQIVKYSTEG----------------------SDVGYYNFTSGDDQTVL 1118 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP---SLPFQKFI 333 Q D T+ + A ++ + N + S F Sbjct: 1119 NQAFQDIDDLQAGGGTNYEAGLVTALNWISGGITSNTPLNVNQTDKDKVIFISDGEPSFY 1178 Query: 334 IFLTDGENNNFKSNVNTIKICDKAKEN 360 D + N N + D + N Sbjct: 1179 YRGNDTQEVNGPGNTFSQAAIDHIQGN 1205 >gi|86159251|ref|YP_466036.1| TadE-like [Anaeromyxobacter dehalogenans 2CP-C] gi|85775762|gb|ABC82599.1| TadE-like protein [Anaeromyxobacter dehalogenans 2CP-C] Length = 136 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 37/124 (29%), Gaps = 4/124 (3%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 + G +E+ ++ FAL + L ++ + + ++ E A GA Sbjct: 1 MRSGRHTERGVAAVEFALVLPFLLAIVLGGLEWGFYFFR----EQVVVNAAREGARAGSV 56 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMT 130 E + A A D + + S S V T +V S + Sbjct: 57 AAVSPLAAAEGAARAALVAGSVDPATVGASVVASSRSTVDPVTGETIDSVVVTVSFPFVP 116 Query: 131 HMAN 134 Sbjct: 117 LTGF 120 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 14/101 (13%), Positives = 34/101 (33%), Gaps = 9/101 (8%) Query: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 +F S + + ++ G + ++ I + I + + Sbjct: 10 DHHFQEASGSRASQEVPQIAVVISSGPVEDHVHGP-----AKALRKAGILLYAIGVRDAV 64 Query: 374 NGQRLLKTCVSSPEYHY--NVVNADSLIHVFQNISQLMVHR 412 + L+ SSP+ ++ V N L ++ Q + Q + Sbjct: 65 WAE--LREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDT 103 >gi|86157056|ref|YP_463841.1| hypothetical protein Adeh_0628 [Anaeromyxobacter dehalogenans 2CP-C] gi|85773567|gb|ABC80404.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-C] Length = 167 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 8/161 (4%), Positives = 37/161 (22%), Gaps = 12/161 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHY-------KKNSMESANNAAI 61 + + I+ + +M ++ + + +++ + + AA Sbjct: 2 SLSRPLPRRASGSTLILTVILLMVLAVIGVAAVSLGSQERINASGKTQRDQLYACAAAAR 61 Query: 62 LAGASKMVSNLSRLGDRFESIS-----NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 L +++ + + + + + +T Sbjct: 62 LQIWAELARYGRGYLESGNVAGSVTLPDGTVLTAPSHYDSDPDVTVSSVVLKNTVTTAST 121 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + +S M + + ++ Sbjct: 122 PTATDLTNSFNFMQGLNTATGYTVVAKCKDRRGRPLEVEFV 162 >gi|311065042|ref|YP_003971768.1| hydrolase [Bifidobacterium bifidum PRL2010] gi|310867362|gb|ADP36731.1| Hydrolase (HAD superfamily) [Bifidobacterium bifidum PRL2010] Length = 280 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 41/166 (24%), Gaps = 7/166 (4%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR---- 282 + A+ + Y + S G+ + Sbjct: 79 RHDIAPEAIDAATAYFERHGIENYQWQGADGMYISCGYARHLESKGSAVWKNGQFARYWH 138 Query: 283 --DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D ++ T + F + R + +L + + Sbjct: 139 LIDDIAVPDGSTLGHVIHASKGSYFSGADPDVSFDDVSRDLGEWFTLVHGSYDDISPNNG 198 Query: 341 NNNFKSNVNTIKICDKAKENFIKIV-TISINASPNGQRLLKTCVSS 385 +CD AK+ I TI+I S N +LK +S Sbjct: 199 ELLINGVDKGTALCDVAKDLGYSIADTIAIGDSDNDTAMLKAAGTS 244 >gi|284989062|ref|YP_003407616.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284062307|gb|ADB73245.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 547 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 58/206 (28%), Gaps = 31/206 (15%) Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLD----------SIDLLSHVKEDVYMGLIGYT 260 D++++ T+ ++A L E + ++ T Sbjct: 352 VFDISTSMEAPAGDGTRATLARDAAKSTLTLVPGNFALGLWFFAAELDGERDWTEVVP-T 410 Query: 261 TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQ 320 ++E +E + + + + + D L T A + Sbjct: 411 RQLEAEVEGTVQRDLLDEELDTIPDRLSPGGTGLYDTTLDAVR----------------A 454 Query: 321 GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQ 376 ++ +TDG N + + E IK++ +++ + Sbjct: 455 ARSDFDPRAVNSVLVVTDGTNEDSGGVDLDELLATLRSEADPDRPIKVIGVALGPDADLG 514 Query: 377 RLLKTCVSSPEYHYNVVNADSLIHVF 402 L + + Y+ V+ L V Sbjct: 515 ALERIADVTGGAAYSAVDPTDLQTVL 540 >gi|154502427|ref|ZP_02039487.1| hypothetical protein RUMGNA_00240 [Ruminococcus gnavus ATCC 29149] gi|153797052|gb|EDN79472.1| hypothetical protein RUMGNA_00240 [Ruminococcus gnavus ATCC 29149] Length = 1499 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 26/292 (8%), Positives = 72/292 (24%), Gaps = 25/292 (8%) Query: 54 ESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY---- 109 + A +A LA + + + +S + AL A + + I +L G Sbjct: 941 QVALDAGGLAAGTSLTNGMSTGVATGSAGLQEQISALSQTATSTLSSTIDGNLPGVTASA 1000 Query: 110 ----------------SAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTS 153 S + T + I+ + ++ + + Sbjct: 1001 TASGAAITNGITSGIDSGMSGATASAANASVDTINAMADGISNGAATITQTISELTQTVT 1060 Query: 154 YDY---RLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSD 210 + E + ++ ++ M Sbjct: 1061 EALNQCWSDVSNSTTTAWSEIGTNMSTSLTQTSTLVETSLTTMQ--TNVQSIMESFTTGT 1118 Query: 211 PEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPS 270 E + ++++ ++ + + + S M + V + Sbjct: 1119 AEKMTQMGSSIQTALSEISVNIDSTMTSIQTGIATSTQGWSTAMSAAMLSLTVSIQSGMA 1178 Query: 271 WGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGV 322 T V + ++ T ++ AM+ L+ + + + Sbjct: 1179 QATMTVTVAMLSLRTAIQTGSTAASTAMQTGMTQLSLTVRTNMMQSAITTQA 1230 >gi|126727881|ref|ZP_01743709.1| hypothetical protein RB2150_00472 [Rhodobacterales bacterium HTCC2150] gi|126702822|gb|EBA01927.1| hypothetical protein RB2150_00472 [Rhodobacterales bacterium HTCC2150] Length = 176 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 6/56 (10%), Positives = 22/56 (39%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 + + + + E N ++ F +++ + L+ + ++ +E A + Sbjct: 3 IKNLKRFLTRTAKDESGNATLEFVMTMPLVITLMFSTVESGILLVQQMMLERALDV 58 >gi|13476309|ref|NP_107879.1| hypothetical protein mll7595 [Mesorhizobium loti MAFF303099] gi|14027070|dbj|BAB54024.1| mll7595 [Mesorhizobium loti MAFF303099] Length = 589 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 27/326 (8%), Positives = 81/326 (24%), Gaps = 12/326 (3%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + KANF+++ ALS L L I + ++ ++ + A + AS Sbjct: 9 ISRLARSMAGDRKANFAVMTALSAPVALALAAVAIDEASIYSERREAQAMVDLAAITAAS 68 Query: 67 KMVS-NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + N + + ++ +++ T + + Sbjct: 69 NINNVNTAVVTTLTDNGMPGVVVQASGQTIPPAVGKTVVTVTQGRYASTTTNVTQRFQAG 128 Query: 126 RISMTHMANNRLDSSNNTIFYNMDVMTSY------DYRLQFIEHLLNQRYNQKIVSFIPA 179 + ++ Q + ++ + Sbjct: 129 VTPYNAVRVTLAKVPARYFASSLIPTPVIGTQATASMTPQATFSVGSRLLSVNGGILNAL 188 Query: 180 LLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFL 239 L + G + +++ +S + + + + + A K + Sbjct: 189 LSGLLGGNISLSVMDYNGLISADVSVLSFI--SALATQLNITAGTYSDVLASKATVGQIA 246 Query: 240 DSIDLLSHVKEDVYMGLIGYTTRVEKNI--EPSWGTEKVRQYVTRDMDSLIL-KPTDSTP 296 ++ + + + L ++ + S + S Sbjct: 247 TAMANVPGLGNTAKVALQTIASKSTSTVKIPLSSLVDLGSVGSLGLGQQPSGLGVDASAM 306 Query: 297 AMKQAYQILTSDKKRSFFTNFFRQGV 322 M A +L + ++ Sbjct: 307 GMLTAAAVLANGTNQAAVDLGATIPG 332 >gi|198283210|ref|YP_002219531.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665578|ref|YP_002425441.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|198247731|gb|ACH83324.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218517791|gb|ACK78377.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 1137 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 24/303 (7%), Positives = 64/303 (21%), Gaps = 56/303 (18%) Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + +++ + + + + ++ + + + Sbjct: 247 SSDAPVINDVLYAIAAQWQPDATNAVGW-GPNPYGLNLQSYMTGNSGVNFSNYSNGLVSG 305 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNI-EPSWGTEKVRQYVTRDMDS 286 M + V S + + ++ Sbjct: 306 MTPTSAGYFPLSQQVWESQRGYAFNANTSYNKGNIVSPISATNSANATNIANAILPEVFR 365 Query: 287 LILKPTDSTPAMKQAYQILTS---DKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 + A + + P+ K++IF+TDG+ Sbjct: 366 TDKSINYAAEGPITASAGYSPTAGAFSTALSYYEGSLPNPPPATCGSKYVIFITDGQPTQ 425 Query: 344 F---------------------------------KSNVNTIKICDKAKENFIKIVTISIN 370 + V TI + IK + + Sbjct: 426 GMEHGYVYPPLGSASAQMFGVTSITASTWSSTNNNAVVETINEIQALAQKGIKTYVLGVG 485 Query: 371 ASPN----------------GQRLLKTCVSSPE--YHYNVVNADSLIHVFQNISQLMVHR 412 ++ N GQ +L + Y +A + +I ++ + Sbjct: 486 SAVNPNVPGASAADQAEALQGQAVLTAMAQAGGTSNFYAATSASDVQSAMNSIIANILGK 545 Query: 413 KYS 415 S Sbjct: 546 SVS 548 >gi|222084462|ref|YP_002542991.1| hypothetical protein Arad_0354 [Agrobacterium radiobacter K84] gi|221721910|gb|ACM25066.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 192 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 11/116 (9%), Positives = 30/116 (25%), Gaps = 8/116 (6%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 +R + E+ +I FA+ L+L + + A Sbjct: 8 ARIYTRLRHFRRDERGIGAIEFAILFPVLLMLYLGAFEITVGLSVEKRTSRA-------- 59 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A + L++ ++ + + ++G ++ Sbjct: 60 AGSIADILTQKTSTTKAELATMPSVAGAIFTPYATTGLTLKVTGIQIDAGSSATVA 115 >gi|15072454|gb|AAK38350.1| von Willebrand factor A-related protein [Mus musculus] Length = 415 Score = 37.5 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 50/157 (31%), Gaps = 19/157 (12%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ ++ + + + T++ A+ A + L +++ Sbjct: 74 SLVHVGSQPHTEFTFDQYSSGQAIRDAIRVAPQRMGDTNTGLALAYAKEQLFAEEAG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ + I +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGVTIFIVSTGRGNL 175 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L + ++ + V + D L + + + + Sbjct: 176 LELLAAASAPAEKHLHFV-DVDDLPIIARELRGSITD 211 >gi|270308521|ref|YP_003330579.1| hypothetical protein DhcVS_1137 [Dehalococcoides sp. VS] gi|270154413|gb|ACZ62251.1| hypothetical protein DhcVS_1137 [Dehalococcoides sp. VS] Length = 375 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 48/144 (33%), Gaps = 11/144 (7%) Query: 6 RFRFYFKKGIASEKA---NFSIIF----ALSVMSFLLLIGFL-IYVLDWHYKKNSMESAN 57 R + S++ ++IF A+ + L+L+G + + + + A Sbjct: 2 RNLITISRRFRSQQGQALVVALIFLAVGAIMIPPLLMLMGSALVQGTTFENRTTGL-YAA 60 Query: 58 NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTE 117 +A I + S L + R L + + +LS + Y Sbjct: 61 DAGIEQAIWYLDPENSPLIPGGLPTNTGETRTLPGISIDG--RTVSVTLSYLTEDTYQIM 118 Query: 118 IQNIVNSSRISMTHMANNRLDSSN 141 + + IS+T + + ++ Sbjct: 119 STSTSPTETISVTAVISETFNNYT 142 >gi|70734190|ref|YP_257830.1| hypothetical protein PFL_0688 [Pseudomonas fluorescens Pf-5] gi|68348489|gb|AAY96095.1| conserved hypothetical protein [Pseudomonas fluorescens Pf-5] Length = 181 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 12/139 (8%), Positives = 31/139 (22%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 +K +I FA + F + L+ S A A+ + + Sbjct: 43 RKQKGAVAIEFAAVFVIFFAVFYGLVSYSLPLLMMQSFHQATAEAVRRAVMVDPNTPNYA 102 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 ++ + F S + V + +++ + Sbjct: 103 TTVQNLANSVVQTQTQWIPPPFNFAATDYSAVYSAGVLTVSVNYPTSKLNQVLPRLVLPG 162 Query: 136 RLDSSNNTIFYNMDVMTSY 154 + + Sbjct: 163 IGPVPSLPTTLTASSSLQF 181 >gi|300697744|ref|YP_003748405.1| hypothetical protein RCFBP_mp20592 [Ralstonia solanacearum CFBP2957] gi|299074468|emb|CBJ54018.1| conserved protein of unknown function, putative TadE-like [Ralstonia solanacearum CFBP2957] Length = 159 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 15/136 (11%), Positives = 40/136 (29%), Gaps = 4/136 (2%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 SI FA+ V LL++ ++Y + + A A +N R+ + Sbjct: 24 GVASIEFAVVVPVLLLMMLGIVYYGVIFAMQQMLTLAAEEGARAALRYQSTNTQRVAAAY 83 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSA----VFYNTEIQNIVNSSRISMTHMANN 135 ++S+ + + + + + T N+ + + Sbjct: 84 SAVSSVLPSFISGRVQTNQSSTPLVTCQNVAGMQCLSVVLTMPLTTGNNPLLPAIPLLPV 143 Query: 136 RLDSSNNTIFYNMDVM 151 + + + + Sbjct: 144 PTTLTGSAVVQLVSGS 159 >gi|303238794|ref|ZP_07325326.1| S-layer domain protein [Acetivibrio cellulolyticus CD2] gi|302593673|gb|EFL63389.1| S-layer domain protein [Acetivibrio cellulolyticus CD2] Length = 1543 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 40/354 (11%), Positives = 101/354 (28%), Gaps = 49/354 (13%) Query: 62 LAGASKMVS-NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 +A ASKMV +L +IS + D +++ + + + ++ Sbjct: 457 IASASKMVEFDLDDPMPISGTISFDPDVSSEKDY--IVRSFVFDGSKKIAEGQDTFKVLP 514 Query: 121 IVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 + +S M + + N + + + + I+S A Sbjct: 515 LADSQTFDYNLMPGEEVSENLNVVIPELPPKADVIFSFDLTGSMGG------ILSTAKAR 568 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAM--NSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + M E +++ + M D C + T ++ Sbjct: 569 AKAIMTELNKLGVDINYGVVSYMDYPNGSYGYSGDYPYKLDCALTEST------ASVSDS 622 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 ++S+ L + + Y + + NI G ++ D + + M Sbjct: 623 INSLYLGNGADSPESYTRVLYESYADTNIGWRNG---AKKIFVNFGDDIPHDN-NINEGM 678 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + I + + + +++ K+ + Sbjct: 679 PELTGIYDTGRDLGRDGLMGTE-------------------------DDLDLQKVLSEMA 713 Query: 359 ENFIKIVTISINASPNGQRLLKTCVS-SPEYHYNVVNADSLIHVFQNISQLMVH 411 +N I++ I+ + K + Y+ +AD V + ++ + Sbjct: 714 DNKIEL--IACQTNGACGDYWKYWAGLTGGKLYDTNSADMANDVVKAVTDSLTS 765 >gi|239917284|ref|YP_002956842.1| hypothetical protein Mlut_07560 [Micrococcus luteus NCTC 2665] gi|281414238|ref|ZP_06245980.1| hypothetical protein MlutN2_03378 [Micrococcus luteus NCTC 2665] gi|239838491|gb|ACS30288.1| hypothetical protein Mlut_07560 [Micrococcus luteus NCTC 2665] Length = 152 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 14/150 (9%), Positives = 40/150 (26%), Gaps = 10/150 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSF--LLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + +++ L+L+ + + ++S ++A +AGA Sbjct: 3 RRLTARLGHD-DGQTTVLTVGLCAVLVALMLVMLAV--TTVQLQHRRLQSLADSAAIAGA 59 Query: 66 SK----MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + + ++ + A+ + S + Sbjct: 60 EELGFRLGEDPGVVLSDDDVAASASAHLAAVGAQEAVPGLGGMSARVAEDGTTVVVTLD- 118 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + +MT L S Sbjct: 119 ARADLFAMTGPFAGTLPLSVPLEATGSSRT 148 >gi|300313226|ref|YP_003777318.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] gi|300076011|gb|ADJ65410.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] Length = 755 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 7/47 (14%), Positives = 18/47 (38%), Gaps = 1/47 (2%) Query: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVS 70 + ++ +L +G + + Y + M +A A +A + Sbjct: 635 LTVATLPVMVLAVGIGVDYAFYIYSRLQMHLAAGEAIAVALEKSIQE 681 >gi|188580135|ref|YP_001923580.1| TadE family protein [Methylobacterium populi BJ001] gi|179343633|gb|ACB79045.1| TadE family protein [Methylobacterium populi BJ001] Length = 183 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 14/161 (8%), Positives = 52/161 (32%), Gaps = 18/161 (11%) Query: 12 KKGIASEKANFSIIFALS-----VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 I +E+ + ++ FAL + ++ L++ ++ A + A + Sbjct: 5 SNYIRNEEGSTAVEFALVGTAFILTLLFVMASSLVFYMN---------QALDNATARASR 55 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVF----YNTEIQNIV 122 ++++ + ++++ + F +++ +L Y + + + Sbjct: 56 QILTGGLQSQSTAATLTSFRENVCSYLPAAFSCSNLVINLYVVPKEVQPSGYYSFVSANL 115 Query: 123 NSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 + +S + + + V+ + I Sbjct: 116 DGVLVSNLAAGAGQFSLGGRGDYQYLQVIYPIMFLPPQISS 156 >gi|85374102|ref|YP_458164.1| hypothetical protein ELI_06375 [Erythrobacter litoralis HTCC2594] gi|84787185|gb|ABC63367.1| hypothetical protein ELI_06375 [Erythrobacter litoralis HTCC2594] Length = 150 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 5/38 (13%), Positives = 11/38 (28%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVL 44 R + + +I A +LL + + Sbjct: 2 IRDLLGTLRGDARGSMAIETAFVAPVLILLALGTVDLG 39 >gi|153003544|ref|YP_001377869.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. Fw109-5] gi|152027117|gb|ABS24885.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. Fw109-5] Length = 1666 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 36/134 (26%), Gaps = 10/134 (7%) Query: 290 KPTDSTP-AMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 TD+ A T F + + + + + D N+ + Sbjct: 491 GSTDTGIRASATGGGSGIPQSLCRECTRFPAEKDWLNNPIRVSWYLHNFDLRQNSENTED 550 Query: 349 NTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPEYHYNVVNA----DSLIHVFQ 403 + + I TI + A+ + +L++ +A L Sbjct: 551 CAAMGGKQLLD----IYTIGLGATGDAATMLESMAKMGGGLFKAADDAPTLRGKLDEALV 606 Query: 404 NISQLMVHRKYSVI 417 I+Q + + Sbjct: 607 AINQRSTSFSVATV 620 >gi|78189873|ref|YP_380211.1| hypothetical protein Cag_1920 [Chlorobium chlorochromatii CaD3] gi|78172072|gb|ABB29168.1| hypothetical protein Cag_1920 [Chlorobium chlorochromatii CaD3] Length = 3834 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 42/327 (12%), Positives = 95/327 (29%), Gaps = 18/327 (5%) Query: 27 ALSVMSFLLLIGFLIY-VLDWHYKKNSMESAN--NAAILAGASKMVSNLSRLGDRFESIS 83 A + + LI + + + +E+A +AA L + S S + Sbjct: 3163 ATVISALTTLIAAAMDSTTNAAAAETKVETALGLDAATLGATLSLTSYDPLAEASKTSTT 3222 Query: 84 NHAK-RALIDDAKRFIKNHIKESLSGY--SAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + AK A+ A N+I + +A ++ Q + N S + + +D + Sbjct: 3223 DAAKINAVKVHAATIQLNNIMDVAISVADAAGSTLSKAQIVENVSDSLLAQAGTDTVDVT 3282 Query: 141 NNT---IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEM----GERPIFLI 193 ++ + + T + F + + + + A + I I Sbjct: 3283 SDAVIEVAIKTGLSTGLTTKPNFNDVVAAIADALALANREIATIATNATGTNAVASITDI 3342 Query: 194 ELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVY 253 ++ S + + K T + L A + V Sbjct: 3343 VEAQIVAQSTIVPDAYAAVVADDSSAITTKADTFSSQLGEAAKEVETIFVNHAPTGSVVI 3402 Query: 254 MGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 G++ + + + + Q V + + A YQ++ +D ++ Sbjct: 3403 NGVVM----PGEILTAATDSIADNQGVGAISYQWLRGGEVISGATNATYQLVAADIGKAI 3458 Query: 314 -FTNFFRQGVKIPSLPFQKFIIFLTDG 339 + G I + D Sbjct: 3459 SVKASYTDGAGFSESMNSNATIAVPDA 3485 >gi|224047663|ref|XP_002193801.1| PREDICTED: vitrin [Taeniopygia guttata] Length = 746 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 49/179 (27%), Gaps = 36/179 (20%) Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW--GTEKVRQYVTRD 283 + K L ++ MG++ Y ++ +R + + Sbjct: 375 RRFQLQKRFLGNVAQALG---ISSAGPLMGIVQYGDDPSTEFNLKTYVNSKDLRNAIEKI 431 Query: 284 MDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNN 343 ++ A+ SF F ++ L DG + Sbjct: 432 Q--QKGGLSNVGKAL-------------SFVNKNFFLDANGNRGGAPNVVVVLVDGWPTD 476 Query: 344 FKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL---------KTCVSSPEYHYNVV 393 + A+E+ I I +++ A+ ++ C +S Y++ Sbjct: 477 -----RVEEASRLARESGINIFFVTVAAAAQSEKQNVIEPNFVDKAVCRTSG--FYSIN 528 >gi|218660803|ref|ZP_03516733.1| hypothetical protein RetlI_15107 [Rhizobium etli IE4771] Length = 194 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 46/158 (29%), Gaps = 3/158 (1%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAG-- 64 R + A + FAL + ++L+ + + ++ A+ L G Sbjct: 14 RSRLHRLAQERTAASGVEFALVLPILVMLLFGTVDLGHALTVSRKIDEIASTTGDLIGQQ 73 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 +S S++++L I + + + + + YNT N + Sbjct: 74 SSWTSSDVTKLLSGASFILQPYDTSGLTITVAVNDISKSGNATVNWSAAYNTSALNSGTA 133 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 S I + + T + F Sbjct: 134 SAIDIPSQIQDAGVQVVLTRVQYTLTTPVSAFFSNFTG 171 >gi|313140719|ref|ZP_07802912.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313133229|gb|EFR50846.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 282 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 41/166 (24%), Gaps = 7/166 (4%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR---- 282 + A+ + Y + S G+ + Sbjct: 81 RHDIAPEAIDAATAYFERHGIENYQWQGADGMYISCGYARHLESKGSAVWKNGQFARYWH 140 Query: 283 --DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D ++ T + F + R + +L + + Sbjct: 141 LIDDIAVPDGSTLGHVIHASKGSYFSGADPDVSFDDVSRDLGEWFTLVHGSYDDISPNNG 200 Query: 341 NNNFKSNVNTIKICDKAKENFIKIV-TISINASPNGQRLLKTCVSS 385 +CD AK+ I TI+I S N +LK +S Sbjct: 201 ELLINGVDKGTALCDVAKDLGYSIADTIAIGDSDNDTAMLKAAGTS 246 >gi|167577156|ref|ZP_02370030.1| hypothetical protein BthaT_03412 [Burkholderia thailandensis TXDOH] Length = 621 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 45/142 (31%), Gaps = 18/142 (12%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + ++I A+++ + +G + + + Y++ +++S + A LA A M ++ Sbjct: 57 GSLAVIAAIAIGVVIAALG-AVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATA 115 Query: 80 E--SISNHAKRALIDDAKR---------------FIKNHIKESLSGYSAVFYNTEIQNIV 122 + ++ + A + F + +G A ++ Sbjct: 116 QSAALGDGFDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTLTR 175 Query: 123 NSSRISMTHMANNRLDSSNNTI 144 + S+ Sbjct: 176 VVPYYFLGAQRTVSATSTAQAT 197 >gi|133737082|emb|CAL58684.1| polyketide synthase [Sorangium cellulosum] Length = 5331 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 13/126 (10%), Positives = 31/126 (24%), Gaps = 3/126 (2%) Query: 47 HYKKNSMESAN---NAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIK 103 + + A +AA++A A+ + + R E + A+ + + Sbjct: 687 VSARARLMQALPARDAAMVAIAASEGEVAAVIAARGEGGARATIAAINGPSSTVVAGDRD 746 Query: 104 ESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEH 163 L + + + S +H + LD+ Sbjct: 747 AVLDVAARFEALGRKTSRLRVSHAFHSHHMDGMLDAFRRVAEGLTFHPPRIPIVSNVTGQ 806 Query: 164 LLNQRY 169 Sbjct: 807 RAGDGE 812 >gi|308473011|ref|XP_003098732.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] gi|308268332|gb|EFP12285.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] Length = 396 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 13/127 (10%), Positives = 38/127 (29%), Gaps = 4/127 (3%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 ++ + ++ S+ T A+ + S + + Sbjct: 87 TYNSVASQKADLNQYQSIADAYTGVFDALSTTVDTIQSYLATGLALAERMLVDQTVNSTR 146 Query: 330 QKFIIFLT-DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSP 386 + + N + + + ++ K + I I+T++ + L L S P Sbjct: 147 AHYKRVMIVYASEYNGNGESDPLPLAERLKLSNINIITVAYEQPGSVGLLQGLTQIAS-P 205 Query: 387 EYHYNVV 393 + ++ Sbjct: 206 GFSFSSE 212 >gi|218130608|ref|ZP_03459412.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] gi|217986952|gb|EEC53283.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] Length = 247 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 64/190 (33%), Gaps = 20/190 (10%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKV 276 K+ A+ +A+ L +D +S D + + + +++ + K+ Sbjct: 23 TSGSMAGNKIGAVNDAVENVLPMLDEISASNPDAEIKVAALEFSSGCNWL----YDEPKL 78 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 D T A ++ L+ + + F II L Sbjct: 79 ASEFVW-QDVTASGLTSLGAACQELNTKLSRNGFMQTPSGSFA-----------PAIILL 126 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 +DG + + + +N IKI I+I + L + + E + V N D Sbjct: 127 SDGGPTDDFYGGLSKLKANNWFKNAIKI-AIAIGDDADKDVLTQF-TGTNEAVFTVHNID 184 Query: 397 SLIHVFQNIS 406 +L + + ++ Sbjct: 185 ALKQIIRVVA 194 >gi|218194396|gb|EEC76823.1| hypothetical protein OsI_14963 [Oryza sativa Indica Group] Length = 661 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 5/76 (6%) Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-IHVFQN 404 S+ + IC +A I T N N + + S + + + +L F + Sbjct: 93 SDNHDNSICMEALSTNYNIHTFGFNGMHNVRAMYNIASRSNGMYDLLNDDRNLITEAFIS 152 Query: 405 ISQLMVHRKYSVILKG 420 + S+I+ G Sbjct: 153 CMNKIT----SIIVLG 164 >gi|312129175|ref|YP_003996515.1| hypothetical protein Lbys_0384 [Leadbetterella byssophila DSM 17132] gi|311905721|gb|ADQ16162.1| hypothetical protein Lbys_0384 [Leadbetterella byssophila DSM 17132] Length = 3453 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 16/300 (5%), Positives = 59/300 (19%), Gaps = 6/300 (2%) Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + + L + S + S + + S Sbjct: 1614 NSTTATASNLAAGTYTVTVTDANGCTTSQSFTITQPASALEASISAQTNVSCNGGT-NGS 1672 Query: 141 NNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQ-KIVSFIPALLRIEMGERPIFLIELVVDL 199 +Y + + + I + + Sbjct: 1673 ATVTATGGTAPYTYSWSPTGGTSATASGLSAGTYTVTVTDANGISTSAQVTITQPTALVA 1732 Query: 200 SGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGY 259 S + + SA + + + + L+ Sbjct: 1733 SSGTITHVYAFGASTGSATVMVNGGTAPYTYTWSGTNSTTATASNLAAG-TYTVTVTDAN 1791 Query: 260 TTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFR 319 ++ + + + +++ + T+ + ++ Sbjct: 1792 GCTTSQSFTITQPSSALEAFISAQTNVSCNGGTN---GSATVTATGGTAPYTYSWSPTGG 1848 Query: 320 QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLL 379 L + + +TD + ++ + + + AS ++ Sbjct: 1849 TSATASGLAAGTYTVTVTDANGISTTTSATITQPTALVASSGTITHVYAFGASTGSATVM 1908 >gi|295840162|ref|ZP_06827095.1| von Willebrand factor, type A [Streptomyces sp. SPB74] gi|197698094|gb|EDY45027.1| von Willebrand factor, type A [Streptomyces sp. SPB74] Length = 588 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 21/200 (10%), Positives = 49/200 (24%), Gaps = 24/200 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---------PSW 271 + +++M K +LL L + + + + + S Sbjct: 397 PGRGQSRMDVTKASLLQALAQFTPADDIGLWEFATRLDGDRDYRELVPTARLGARKGSSG 456 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQK 331 + + T AY+ + D F P Sbjct: 457 TQREALTAAFGALQPQPGGATGLYDTTLAAYREASEDYAADKFNAVVLLTDGTNEEP--- 513 Query: 332 FIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 G + + D A + +V I++ + ++ + Sbjct: 514 -------GSMTRGELLTRLRDLADPAH--PLPLVAIAVGPEAAEDDMEAIGGATGGSGFK 564 Query: 392 VVNADSLIHVFQNISQLMVH 411 V + + V I+ +V Sbjct: 565 VDDPAQIHEV---INNAIVE 581 >gi|269126096|ref|YP_003299466.1| hypothetical protein Tcur_1855 [Thermomonospora curvata DSM 43183] gi|268311054|gb|ACY97428.1| hypothetical protein Tcur_1855 [Thermomonospora curvata DSM 43183] Length = 382 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 21/42 (50%), Gaps = 3/42 (7%) Query: 16 ASEKANFSIIFALSV--MSFLLLIGFLIYVLDWHYKKNSMES 55 E A + +FAL + + + ++ ++ + + ++ +++ Sbjct: 17 RDEGAVTT-LFALLLSSLIVVGMLALVVDIGRLYIEREELQN 57 >gi|313471867|ref|ZP_07812359.1| putative LPXTG-motif cell wall anchor domain protein [Lactobacillus jensenii 1153] gi|313449020|gb|EEQ68220.2| putative LPXTG-motif cell wall anchor domain protein [Lactobacillus jensenii 1153] Length = 1490 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 27/273 (9%), Positives = 73/273 (26%), Gaps = 8/273 (2%) Query: 40 LIYVLDWHYKKNSMESANNA--AILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRF 97 + K ++ N+A A KM ++ + A + + Sbjct: 273 ATDISGVDTAKTDAQNLNDAKKAAKDAIDKMEDLSDADKTAAKANVDKATDISGVNTAKT 332 Query: 98 IKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 ++ ++ E + + + + + ++ + Sbjct: 333 DAQNLNDAKKAAKDAIDKMEDLSDADKTAAKANVDKATDISGVDTAKTDAQNLNDAKKAA 392 Query: 158 LQFIEHLLNQRYNQK------IVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDP 211 + I + N QK +V+ P +++ + + DLS + D Sbjct: 393 KKAINEMGNLNNAQKDTALKAVVNATPESAVVKVQNTAQTTDDNMSDLSTDANFKNADDI 452 Query: 212 EDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSW 271 ++ ++ + LD I+ S ++ + + Sbjct: 453 KNSSNYVNADKNLKDAYDKAVENAKALLDKINGSSTGDVSTESAVVKAAKQAVDEALNAL 512 Query: 272 GTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 + S I ++ + A K A + Sbjct: 513 NGDSNLSKAKEAAKSAIDGMSNLSDADKTAAKA 545 >gi|33600818|ref|NP_888378.1| hypothetical protein BB1833 [Bordetella bronchiseptica RB50] gi|33568418|emb|CAE32330.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 747 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 26/73 (35%), Gaps = 1/73 (1%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + I AL+++ + L+G + + Y K ++ + A L G + Sbjct: 15 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 73 Query: 80 ESISNHAKRALID 92 + RA + Sbjct: 74 GQAVRASVRANLG 86 >gi|313106815|ref|ZP_07793028.1| putative membrane protein [Pseudomonas aeruginosa 39016] gi|310879530|gb|EFQ38124.1| putative membrane protein [Pseudomonas aeruginosa 39016] Length = 556 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 35/306 (11%), Positives = 91/306 (29%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E SA + I + N++ + Sbjct: 66 SAQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVH--- 122 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + + F +++ + Sbjct: 123 ----------------------VEVTKSVPGSLVANLGGLFGGGNANVDLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T++ A D+ + + MG Sbjct: 161 NATISAGTGLASVNSGQSALLNPILSGLLGTQIDLSAAAYNGIADAKLSVLDILGPNGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 L+G T + + ++Q + ++ L S A++ + S + Sbjct: 221 LVGVDTSLGTIEQLLNTNVGLQQVLAASVNVLAKNGVASVEALRTQLVGVKSATLKLGDL 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|306823811|ref|ZP_07457185.1| AMT family ammonium or ammonia transporter [Bifidobacterium dentium ATCC 27679] gi|309802086|ref|ZP_07696196.1| ammonium transporter [Bifidobacterium dentium JCVIHMP022] gi|304552809|gb|EFM40722.1| AMT family ammonium or ammonia transporter [Bifidobacterium dentium ATCC 27679] gi|308221287|gb|EFO77589.1| ammonium transporter [Bifidobacterium dentium JCVIHMP022] Length = 431 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 43/187 (22%), Gaps = 11/187 (5%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSM-ESANNAAILAGASKMVSNLSRLG 76 + N + I + + FL+ G + K + AA L+ M Sbjct: 5 DSGNAAWILTSASLVFLMTPGVAFFYGGMVRAKAVLNMMIMEAAALSVTMIMWVLWGWSI 64 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 + D A F+ + G V+ + A Sbjct: 65 AYA---GSSVGGIFGDPATGFLLKDTMVADDGVFGSAALNSNNYPVSVDVAFQSAFAMIT 121 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEH-------LLNQRYNQKIVSFIPALLRIEMGERP 189 + I + T + +I ++ I + + Sbjct: 122 VALICGAIAERVKYSTWMIFVALWITFDYAPLAHMVWNGGLLSADGAISQAIGAAAHDFA 181 Query: 190 IFLIELV 196 + + Sbjct: 182 GGTVVHI 188 >gi|256379383|ref|YP_003103043.1| hypothetical protein Amir_5378 [Actinosynnema mirum DSM 43827] gi|255923686|gb|ACU39197.1| WD40 domain protein beta Propeller [Actinosynnema mirum DSM 43827] Length = 1066 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 24/350 (6%), Positives = 69/350 (19%), Gaps = 43/350 (12%) Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 G V+ + + + + S G + F Sbjct: 149 GTKLAVTTEAGIRVLRVADGAVLADVPAPPTLSARDSEPAWSPDGTTIAFTRRIDGQETP 208 Query: 124 SSRISMTHMANN---RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPAL 180 S+ + + + + + L + + K Sbjct: 209 STGPYAVVGGTSPGGDYRTGVSLTTDEVPNNPDIVFLLDNTGSMGGYIQDAKAKLTGVLA 268 Query: 181 LRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLD 240 ++ + + + D + + ++A + L Sbjct: 269 RISQLQPQARYGLA-------------TYRDIDEPEHYELRQRPTADLSAFQTQLNSV-- 313 Query: 241 SIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQ 300 + ED + GL T + ++ + + + Sbjct: 314 TAGGGGDGPEDWFNGLRQLATDPTV---FTRDGSRIVVLLGDAPPHFDCDNPAAAYCPRY 370 Query: 301 AYQ----ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDG--ENNNFKSNVNTIKIC 354 + + + + I TDG + + + I Sbjct: 371 PTRDQVVSALTGAGAQVLAIPLGTPSSLDAGGHATDIANATDGLLIDGSIPPDATADAIA 430 Query: 355 DKAKENFIKI---------VTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 ++ + + T+ + QR Sbjct: 431 AGIRDLPVTVTPQHSCDHGATVGFDPPGLTQR-------GGGQVSFAETV 473 >gi|152964362|ref|YP_001360146.1| diguanylate cyclase with PAS/PAC sensor [Kineococcus radiotolerans SRS30216] gi|151358879|gb|ABS01882.1| diguanylate cyclase with PAS/PAC sensor [Kineococcus radiotolerans SRS30216] Length = 888 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 36/135 (26%), Gaps = 6/135 (4%) Query: 33 FLLLIGFLIYVLDWHYKKNSMESANNAA--ILAGASKMVSNLSRLGDRFESISNHAKRAL 90 + LIG V + + + A + A +A + + RL +++ RA Sbjct: 558 LVGLIGMATDVSELENSEALLRVAVDTAPVGIALVAVAGPDAGRLLRVNRALARFTARAE 617 Query: 91 IDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN----RLDSSNNTIFY 146 + + + + + + V + + A + + S + Sbjct: 618 EELLAGGFADLVADGERPEVGELFADVLAGAVAEQTVEVRLAAADGRVLDAEVSAALVQL 677 Query: 147 NMDVMTSYDYRLQFI 161 Sbjct: 678 QGADPLLLCLVEDVT 692 >gi|328907361|gb|EGG27127.1| hypothetical protein PA08_1370 [Propionibacterium sp. P08] Length = 123 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 29/122 (23%), Gaps = 8/122 (6%) Query: 22 FSIIFALSVMSFLLLIGFLIYVL-DWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFE 80 S++ + + L+ G + + + A AA + L Sbjct: 1 MSVLITMLLPVLLVSAGIAVDGAERARAVRIAHTVAAEAARAGCEEGSAAQLVGQDGTGA 60 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + A A R S +A + V + + + Sbjct: 61 ARVAAETSARGAKADRL-------SRLVITANGDAVSVATEVTRPTRLLALIGLTEVHGR 113 Query: 141 NN 142 + Sbjct: 114 AS 115 >gi|314957231|gb|EFT01335.1| TadE-like protein [Propionibacterium acnes HL027PA1] Length = 58 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 6/37 (16%), Positives = 12/37 (32%), Gaps = 2/37 (5%) Query: 14 GIASEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHY 48 E+ + S+ L V L++ G + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQIS 37 >gi|262199249|ref|YP_003270458.1| serine/threonine protein kinase [Haliangium ochraceum DSM 14365] gi|262082596|gb|ACY18565.1| serine/threonine protein kinase [Haliangium ochraceum DSM 14365] Length = 706 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 19/244 (7%), Positives = 65/244 (26%), Gaps = 17/244 (6%) Query: 11 FKKGIASEKANFSI-IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAIL--AGASK 67 +K I + F+I + + + + + +S+ + A + A + Sbjct: 435 LQKLIRRNRGVFAIASVVFATLLVVGAAFAAVMIATIENTSDSLRIQDEQAAIVQANTLE 494 Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 + L R + + + D + + + Y+ + ++Q + + + Sbjct: 495 RSIRTNMLQGRADVVRDLVAGLRADPSLSSLNVIRVDRTYAYTNLQTLKQVQQRLENPGV 554 Query: 128 SMT--------HMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPA 179 + N + + N + + + + + + Sbjct: 555 IAQIRDKFPDLVSSINEVQETAFPKIRNSEKTPAVAL-FDYDRS-SWNDIVENLRTVTRI 612 Query: 180 LLRIEMGE----RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNAL 235 + +PI G A + + + ++ + +N Sbjct: 613 DTEAGVPTLTVLKPIENSPKCQACHGGEEEAGYGSNKVRAVLVVKRSREAVEAKIAENTR 672 Query: 236 LLFL 239 + Sbjct: 673 TTMM 676 >gi|33593029|ref|NP_880673.1| hypothetical protein BP2003 [Bordetella pertussis Tohama I] gi|33563404|emb|CAE42283.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382442|gb|AEE67289.1| hypothetical protein BPTD_1973 [Bordetella pertussis CS] Length = 747 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 26/73 (35%), Gaps = 1/73 (1%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + I AL+++ + L+G + + Y K ++ + A L G + Sbjct: 15 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 73 Query: 80 ESISNHAKRALID 92 + RA + Sbjct: 74 GQAVRASVRANLG 86 >gi|57233893|ref|YP_182067.1| hypothetical protein DET1356 [Dehalococcoides ethenogenes 195] gi|57224341|gb|AAW39398.1| hypothetical protein DET1356 [Dehalococcoides ethenogenes 195] Length = 452 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 59/187 (31%), Gaps = 7/187 (3%) Query: 11 FKKGIASEKAN---FSIIF----ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 KK I EK ++IF A+ + L+LIG + ++ + ++A + Sbjct: 4 MKKLIKGEKGASLAIALIFLAIGAIMLPPLLMLIGSGLQQGTAIEERTAGIYDSDAGVEW 63 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 + + + + D +I + + + E Y V E+ + + Sbjct: 64 AINLIKTGGEGVTDSDGNIGLPNQDSPRRYILSALNGDTLEVKLTYHNVGSYYEVVSKAS 123 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + S+T A + + +F N D ++ + + + + Sbjct: 124 LNGKSITTRATLKYQLGGSGLFDNAITSLDGDVIIKNNTTVSSFPNMDNGNIIANGDIIV 183 Query: 184 EMGERPI 190 Sbjct: 184 GSSANID 190 >gi|33596976|ref|NP_884619.1| hypothetical protein BPP2383 [Bordetella parapertussis 12822] gi|33566427|emb|CAE37680.1| putative exported protein [Bordetella parapertussis] Length = 749 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 26/73 (35%), Gaps = 1/73 (1%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + I AL+++ + L+G + + Y K ++ + A L G + Sbjct: 17 GSILIPAALAILIGVALLGAA-QLGYFFYMKRELQKTADLAALTGVQVLSPGDEAACAAA 75 Query: 80 ESISNHAKRALID 92 + RA + Sbjct: 76 GQAVRASVRANLG 88 >gi|269960460|ref|ZP_06174832.1| hypothetical protein VME_12160 [Vibrio harveyi 1DA3] gi|269834537|gb|EEZ88624.1| hypothetical protein VME_12160 [Vibrio harveyi 1DA3] Length = 154 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 16/152 (10%), Positives = 30/152 (19%), Gaps = 5/152 (3%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 S+ + +K I F +++ L+L +I V + + A Sbjct: 3 SQIIQLRIRAFQEQKGLAIIEFIIALPVLLMLSVLIIDVCRAFIQYTEVNKALQNGARYA 62 Query: 65 ASKMVSNLS-----RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQ 119 L + I+ +A T Sbjct: 63 LVDTYGTLDFSAIADEASIKNVVVYGKPAGGGTPVIDHIEVGDITVTQPTAATKQVTLSA 122 Query: 120 NIVNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 S + L S Sbjct: 123 TYDYVPIFSSLPFSGTSLQFSIGASASMRTGP 154 >gi|148258228|ref|YP_001242813.1| hypothetical protein BBta_7023 [Bradyrhizobium sp. BTAi1] gi|146410401|gb|ABQ38907.1| hypothetical protein BBta_7023 [Bradyrhizobium sp. BTAi1] Length = 184 Score = 37.1 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 15/137 (10%), Positives = 36/137 (26%), Gaps = 10/137 (7%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R A ++ + FA V L++ + A + + Sbjct: 3 DRLLRRLGAFGADKRGIAATEFAFIVPLMLVMFFGTVEFCSGI--------AVDRKVTLM 54 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 A + S+ +S ++ A + + + + +Q V Sbjct: 55 ARTLSDLTSQSTSVGDSDMSNFFAASTGIMYPY--STTPVNATITELYVDPKTMQATVMW 112 Query: 125 SRISMTHMANNRLDSSN 141 S+ S + + Sbjct: 113 SKGSAPRSSGTTVGVPA 129 >gi|330828501|ref|YP_004391453.1| RTX protein [Aeromonas veronii B565] gi|328803637|gb|AEB48836.1| RTX protein [Aeromonas veronii B565] Length = 2390 Score = 37.1 bits (84), Expect = 5.7, Method: Composition-based stats. Identities = 21/278 (7%), Positives = 60/278 (21%), Gaps = 28/278 (10%) Query: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIV-------SFIPALLRIEMGERPIFLIELVV 197 + + + + N S + G I L++ Sbjct: 1838 VVLPGQNYNIAFMVDSSGSIGTSAMNTIKNQLAQIFASLKASAGSEGAGTVNIMLVDFDA 1897 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + S+ + + ++ + + L Sbjct: 1898 LSNASISVNLKDSNALNQLQAVIDSMSSGGGTNYEDVFKTVANWFSSSTAQSNSNAKNLT 1957 Query: 258 GYTTRVEKNIEPSWGTEK--------------VRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 + T + + + + ++ + A + Sbjct: 1958 YFITDGQPTYYLGNEGSNPRVLNFSGSSYDVYLNSLLGGTYIPSRVYSSNGYTIIDSAGR 2017 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 + + + G + + DG + + N++ + Sbjct: 2018 VYADGRYSNPIGFMRPDGRGGYT-----YSTLDGDGGTTSSSTINNSVAGFGLLTGMGVT 2072 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHV 401 + I + A+ + L S VNA L + Sbjct: 2073 VEAIGLGANISYNDLKSY--DSDGVIMTGVNASDLANA 2108 >gi|163738632|ref|ZP_02146046.1| hypothetical protein RGBS107_11427 [Phaeobacter gallaeciensis BS107] gi|161387960|gb|EDQ12315.1| hypothetical protein RGBS107_11427 [Phaeobacter gallaeciensis BS107] Length = 199 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 15/37 (40%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGF 39 +L+R R F + + S+ FA + L + Sbjct: 1 MLTRIRTLFYRYRRETDGSVSVEFAFYMPLLLGVFAA 37 >gi|160897506|ref|YP_001563088.1| membrane protein [Delftia acidovorans SPH-1] gi|160363090|gb|ABX34703.1| membrane protein-like protein [Delftia acidovorans SPH-1] Length = 762 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 34/128 (26%), Gaps = 3/128 (2%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 + + + FAL LL+G + + +Y K ++ + A A + + S Sbjct: 27 RRQGGSLLVQFALLGSVLFLLLGV-VDLGYMYYAKRDLQRIADLAATQAAQAINAERSNR 85 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + R + + A S + Sbjct: 86 AACVAAGEDSITANWPVPLTRNSTQTVVSCGNWNPADPARQSSGG--RFFDASSGPLNAA 143 Query: 136 RLDSSNNT 143 R+ + Sbjct: 144 RVTVGGTS 151 >gi|308472851|ref|XP_003098652.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] gi|308268252|gb|EFP12205.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] Length = 396 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 59/214 (27%), Gaps = 34/214 (15%) Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + VVD S M N +V S T++ S Sbjct: 35 SNLWLDVVAVVDNSQGM---TNPGLINVASDIYSVFSSGTRI--------------GSNS 77 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTE---KVRQYVTRDMDSLILKPTDSTPAMKQAYQ 303 +GL+ Y + + + + + + + + ++ A + Sbjct: 78 SEPRTTRVGLVTYNSNATQKADLNKYQSIDDVLNEIYDDISTVVNTADSYLATGLQLAEK 137 Query: 304 ILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 +L Q +++ +I + ++ + + ++ K + I Sbjct: 138 MLID------------QSENTNRAHYKRVVIVY--ASEYKGEGELDPLNVANRLKLSDIN 183 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADS 397 I+T++ + + V NA Sbjct: 184 IITVAYEQKGDDGLFHDLSQIASPGFSFVNNASD 217 >gi|229493940|ref|ZP_04387712.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229319217|gb|EEN85066.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 533 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 37/368 (10%), Positives = 75/368 (20%), Gaps = 44/368 (11%) Query: 56 ANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN 115 A + ++ A S + + + + Sbjct: 180 ATSESVRAALVPHAQAESVRTEEQPVGQKMLTELVSSGKVGVSTEQQVVEFTKAESKISA 239 Query: 116 TEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDV----MTSYDYRLQFIEHLLNQRYNQ 171 T + A+ + S L + Sbjct: 240 TVPATGTMIMDYPLVVSASAPDRHYAAAKKGDAIAELLHSESAQEVLTRSGFRTSGGGAV 299 Query: 172 KIVSFIPALLRIEMGERPIFLIE------------LVVDLSGSMHCAMNSDPEDVNSAPI 219 + L + + + +D+S SM + Sbjct: 300 EGGVGDVTALTVSEQSVADQTLGAWALMAKPIRTLVAIDVSASMDYPAAGGKTRMQLTTA 359 Query: 220 CQDKKRTKM-AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 ++ L F ++ +E V + + Sbjct: 360 AALAGNQVFPDSVAAGLWAFSQGMNGTQDFQELVPIRRYDTIVDGVTQRKLMADQANRLT 419 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + R T A++ + +I LTD Sbjct: 420 ELKR-------GATGLYDTTLAAFRKVQESY----------------DPRAVNSVIILTD 456 Query: 339 GENNNFKSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 G N + S + A+E + IVTI I + L + Y + Sbjct: 457 GANEDPDSISREQLLDVLAREQDPARPVIIVTIGITDDADAAALADISRVTGGSTYIARD 516 Query: 395 ADSLIHVF 402 + VF Sbjct: 517 PSEISEVF 524 >gi|156404067|ref|XP_001640229.1| predicted protein [Nematostella vectensis] gi|156227362|gb|EDO48166.1| predicted protein [Nematostella vectensis] Length = 218 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 55/187 (29%), Gaps = 17/187 (9%) Query: 229 AALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLI 288 K A L + I + + + L ++ + S + + D ++ Sbjct: 39 DNYKLAKTLTKEIISRFTISPDKTRVSLNFFSAHHVIVSKLSDNFS--KSKLMSLTDQMM 96 Query: 289 LKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIP-SLPFQKFIIFLTDGENNNFKSN 347 ++++ L + + + ++G P + + +TDG Sbjct: 97 Y---------EKSFSKLATTLEAVHYELLVKKGGARPKQKGVKMATVLVTDGYGTAG--F 145 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEY---HYNVVNADSLIHVFQN 404 T K+ +++ T+ +++ S P + + V Sbjct: 146 EETSDEAKSMKKYNVEMFTVYPEKGRTYNKIMMHLASKPTKSHLFKLTKDGGARRKVVDQ 205 Query: 405 ISQLMVH 411 I + + Sbjct: 206 IVKQICS 212 >gi|15963887|ref|NP_384240.1| hypothetical protein SMc04118 [Sinorhizobium meliloti 1021] gi|307315735|ref|ZP_07595254.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] gi|307320420|ref|ZP_07599837.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15073062|emb|CAC41521.1| Conserved hypothetical transmembrane protein [Sinorhizobium meliloti 1021] gi|306893986|gb|EFN24755.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306898626|gb|EFN29294.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C] Length = 194 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 11/184 (5%), Positives = 42/184 (22%), Gaps = 21/184 (11%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 F + + ++ FA+ + + + A +S + Sbjct: 13 GAFHGLLRDRRGAGAVEFAIVAPLLIAAYVGAFELSLGFTVARKVGRA--------SSAV 64 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYN------------T 116 +++ ++ + + + + + ++G Sbjct: 65 SDIVTQEQQVSKAFLDGMRNVARNMLVPYDGSDYDLKITGIQVNGTTEGKVAWSRGWSDA 124 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR-LQFIEHLLNQRYNQKIVS 175 V + S+ + + + + + V + R + Sbjct: 125 SDGATVPYAVNSVVSVPADLDAVNAFVVRTELVVNHQLSLFGSDAGGTIPLSRTSYYRQR 184 Query: 176 FIPA 179 F Sbjct: 185 FGTT 188 >gi|304392393|ref|ZP_07374334.1| putative TadE family protein [Ahrensia sp. R2A130] gi|303295497|gb|EFL89856.1| putative TadE family protein [Ahrensia sp. R2A130] Length = 204 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 18/54 (33%), Gaps = 4/54 (7%) Query: 2 HLLSR----FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKN 51 +LL R + ++ E+ + FAL + L + I + Sbjct: 3 NLLKRPSLLSSGFLRRFRKDERGISMVEFALISPALLSMYLGAIVATHMEHAST 56 >gi|300727566|ref|ZP_07060956.1| von Willebrand factor, type A [Prevotella bryantii B14] gi|299775155|gb|EFI71757.1| von Willebrand factor, type A [Prevotella bryantii B14] Length = 245 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 66/191 (34%), Gaps = 22/191 (11%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKED--VYMGLIGYTTRVEKNIEPSWGTEKV 276 K+ A+ +A+ L +D +S D + + + +++ + K+ Sbjct: 17 TSGSMIGNKIGAVNDAVENVLPMLDEISASNPDAEIKVAALEFSSGCNWL----YDEPKL 72 Query: 277 RQYVTRDMDSLILKPTDSTPA-MKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIF 335 D T A ++ ++ S ++ +F II Sbjct: 73 ASEFVW-QDVTASGLTSLGAACLELNSKLSRSGFMQTPSGSFAPA------------IIL 119 Query: 336 LTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNA 395 L+DG + + + +N IKI I+I + L + + E + V N Sbjct: 120 LSDGGPTDDFHGGLSKLKANNWFKNAIKI-AIAIGDDADKDVLTQF-TGTNEAVFTVHNI 177 Query: 396 DSLIHVFQNIS 406 D+L + + ++ Sbjct: 178 DALKQIIRVVA 188 >gi|116623630|ref|YP_825786.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] gi|116226792|gb|ABJ85501.1| TadE family protein [Candidatus Solibacter usitatus Ellin6076] Length = 175 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 47/125 (37%), Gaps = 5/125 (4%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 K + F L+++ L+++ ++ + K+++ A + + +++ Sbjct: 8 KRRKGAELLEFTLALLPLLMMVFVMLDAAWAIFVKSTLAFAVRTGV-----RQGITITKT 62 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 ++ K + ++ F+ + K ++ Y T +V+ + + + N Sbjct: 63 QAGSSDLTTMVKNIVRANSLGFLSDTSKIKVNFYHPPDDPTSTAALVDVTTDATGNNPLN 122 Query: 136 RLDSS 140 + S Sbjct: 123 IMQVS 127 >gi|307943458|ref|ZP_07658802.1| putative TadE family protein [Roseibium sp. TrichSKD4] gi|307773088|gb|EFO32305.1| putative TadE family protein [Roseibium sp. TrichSKD4] Length = 181 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 27/99 (27%), Gaps = 10/99 (10%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLL---IGFLIYVLDWHYKKNSMESANNAAILA 63 F+ I E ++ FA+ L+ + + + A + A A Sbjct: 1 MLRAFRNLILREGGATAVEFAIMFPLMLVFFINVIVMFDGFRANR-------ALSVASHA 53 Query: 64 GASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 G+ + L + A++ I Sbjct: 54 GSDLLSRFQENLSSKDIQNVLATTSAIMGQYADKTDPVI 92 >gi|154499681|ref|ZP_02037719.1| hypothetical protein BACCAP_03338 [Bacteroides capillosus ATCC 29799] gi|150271279|gb|EDM98536.1| hypothetical protein BACCAP_03338 [Bacteroides capillosus ATCC 29799] Length = 154 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 23/57 (40%), Gaps = 7/57 (12%) Query: 7 FRFYFKKGIASEKA--NF--SIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNA 59 F ++ + SE+ N+ +++F + L I I + K ++ A + Sbjct: 7 FLQKVRRTLKSERGEANYFSTVVFIFIAVILLAFI---IDLFGIISTKQELDHAADQ 60 >gi|254506780|ref|ZP_05118920.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] gi|219550361|gb|EED27346.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] Length = 385 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 17/190 (8%), Positives = 45/190 (23%), Gaps = 20/190 (10%) Query: 30 VMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN--------------LSRL 75 +++ + + I V K ++++ +AA LA A + + + Sbjct: 1 MLALIGVAALSIDVNHAMLNKTRLQNSVDAAALAAAIVLDNGGSEAQATAAAKTTLTNVA 60 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + + ++ + + N S+ + + Sbjct: 61 NATGNTEMDFTSAQVVVQFSNDPATFPFSGFDPDEDSYARVAVSNFALSNFFAHVFSVDK 120 Query: 136 RLDSSNNTI-FYNMDVMTSYDYRL----QFIEHLLNQRYNQKIVSFIPALLRIEMGERPI 190 L S+ ++ + I MG Sbjct: 121 NLASTAVAGPSPSVIDSNNIVPMAVCEGDSSGS-AGYVPGDLYALKIADQKLGSMGSGNY 179 Query: 191 FLIELVVDLS 200 L++ S Sbjct: 180 QLLDFGSGAS 189 >gi|330881351|gb|EGH15500.1| type IV pilus-associated protein, putative [Pseudomonas syringae pv. glycinea str. race 4] Length = 1034 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 22/254 (8%), Positives = 51/254 (20%), Gaps = 15/254 (5%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIE-LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + P I +V DL+ + + A D + Sbjct: 195 SNNTSIRIGLATFNDPNNSDLGPGGKIARIVTDLAPVARTIYQPNEVLQDQANKNSDALK 254 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +A+L + L + I+ Sbjct: 255 SAIASLTPSANTPLAETYYEITRYFRGMSPFYQSGSNYVSPIQYRCQKNYGVVITDGLPT 314 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII-------FLTD 338 PT+ + + L + + G + + D Sbjct: 315 YDRTFPTNDPDDVLDTTRSLPNWDLNAANDGDNLSGDGEGDTLYLDDLAKFAYDIDLRRD 374 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP---EYHYNVVNA 395 + AK + TI ++L + ++ ++ Sbjct: 375 AVTTGGDLTGKSWDTAGFAK-QNLSTYTIGF---TASNQMLNDAANDTHGHGKYFQANDS 430 Query: 396 DSLIHVFQNISQLM 409 L + Sbjct: 431 AGLTSALSLALSDI 444 >gi|293342302|ref|XP_001059971.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|293354106|ref|XP_344486.4| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|149034121|gb|EDL88891.1| similar to hypothetical protein 4933430J11 (predicted) [Rattus norvegicus] Length = 535 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 17/124 (13%) Query: 246 SHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 + Y+ + + ++ + + + T + +A + + Sbjct: 72 DKSGSVAKNWIYIYSFAEGLVKKFTKN--EIHKSLLLLKNIEPQGLTHMQKGLIKANEQI 129 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 R I+ LTDG K ++T++ KA++ I Sbjct: 130 QKSAAR--------------GHRAVSVIVALTDGLLL-LKPYLDTMEEAKKARKLGAIIY 174 Query: 366 TISI 369 T+ + Sbjct: 175 TVGV 178 >gi|308472935|ref|XP_003098694.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] gi|308268294|gb|EFP12247.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] Length = 405 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 39/140 (27%), Gaps = 7/140 (5%) Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPF 329 ++ + + S+ A+ S + Sbjct: 94 TYNSVATQNADLNQYQSIEDAYYGIYGALSTTVNTTESYLTTGLNAAVELFSRQSFRSNR 153 Query: 330 QKF-IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCV-SSPE 387 Q + + + N + + + I ++ K + + I+TI+ + L +SP Sbjct: 154 QHYRKVIIVYASEYNGRGEFDPVPIANRLKASGVNIITIAYEQPGSAGLLQGLSQIASPG 213 Query: 388 YHYNVVNAD-----SLIHVF 402 + ++ N + Sbjct: 214 FSFSGDNIAGNLVKEIQTAL 233 >gi|330823813|ref|YP_004387116.1| PilC domain-containing protein [Alicycliphilus denitrificans K601] gi|329309185|gb|AEB83600.1| Neisseria PilC domain protein [Alicycliphilus denitrificans K601] Length = 1245 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 24/277 (8%), Positives = 62/277 (22%), Gaps = 15/277 (5%) Query: 140 SNNTIFYNMDVMTSYDYRLQ--FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 Y + + + + + + + G + Sbjct: 201 GTAATPYLGCRRNYHIFMTDGRWNGSVSGGTQDDNTNNITLPDGTVY-GSTTLANRPKNG 259 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + + + S + + LK + + E+ Sbjct: 260 LYADTYSNTLADWAFKSWSTKLQTATDSNGVTGLKGTPQPTAEYNK--APATENFGKDTT 317 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 G +++ P + VT + + T + + D + Sbjct: 318 GNDAILDRFWNPRYNPANWPHMVTYTIGLSKMAYTWPGNKVSGSKDPAAPDTACTVIQAP 377 Query: 318 FR--QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + + + F D +N + T I + Sbjct: 378 GQGCATGNKATFDTDLMVPFSYDPGSNGSLPDFITGNIKWPDIGSG--------GEPVRS 429 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L ++ Y V + L F++I Q + + Sbjct: 430 LDLWHAALNGRGRFYAVEKGEDLAKAFRDIFQQINTQ 466 >gi|126728315|ref|ZP_01744131.1| hypothetical protein SSE37_20032 [Sagittula stellata E-37] gi|126711280|gb|EBA10330.1| hypothetical protein SSE37_20032 [Sagittula stellata E-37] Length = 332 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 4/38 (10%), Positives = 14/38 (36%), Gaps = 2/38 (5%) Query: 24 IIFALSVMS-FLLLIGFLIYVLDWHYK-KNSMESANNA 59 ++ A+ + + + I +++A +A Sbjct: 61 VLMAIVLPVAMIWVAAMAIRAARVMRTESARLQAAIDA 98 >gi|83716874|ref|YP_438808.1| hypothetical protein BTH_II0608 [Burkholderia thailandensis E264] gi|167615325|ref|ZP_02383960.1| hypothetical protein BthaB_03492 [Burkholderia thailandensis Bt4] gi|257141890|ref|ZP_05590152.1| hypothetical protein BthaA_22154 [Burkholderia thailandensis E264] gi|83650699|gb|ABC34763.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 598 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 45/142 (31%), Gaps = 18/142 (12%) Query: 20 ANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRF 79 + ++I A+++ + +G + + + Y++ +++S + A LA A M ++ Sbjct: 34 GSLAVIAAIAIGVVIAALG-AVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATA 92 Query: 80 E--SISNHAKRALIDDAKR---------------FIKNHIKESLSGYSAVFYNTEIQNIV 122 + ++ N A + F + +G A ++ Sbjct: 93 QSAALGNGFDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTLTR 152 Query: 123 NSSRISMTHMANNRLDSSNNTI 144 + S+ Sbjct: 153 VVPYYFLGAQRTVSATSTAQAT 174 >gi|218662246|ref|ZP_03518176.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli IE4771] Length = 487 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 56/193 (29%), Gaps = 25/193 (12%) Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQ 278 + + +L L + ++ + +V T + + + + Sbjct: 55 NSGSMSGPSIEQARQSLALAISRLNPDD--RFNVIRFDDTMTDYFKGLVAATPD--NREK 110 Query: 279 YVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTD 338 + T+ PA++ A + + + ++FLTD Sbjct: 111 AIAYVRGLTADGGTEMLPALEDALRNQGPVASGAL-----------------RQVVFLTD 153 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL 398 G N + I A ++ T+ I ++PN + K + + D + Sbjct: 154 GAIGNEQQLFQEIT----ANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQV 209 Query: 399 IHVFQNISQLMVH 411 + + + Sbjct: 210 ASRMGELFAKLQN 222 >gi|197106454|ref|YP_002131831.1| hypothetical protein PHZ_c2993 [Phenylobacterium zucineum HLK1] gi|196479874|gb|ACG79402.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 158 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 37/126 (29%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 ++ + ++ A ++ +L++ L+ + + S+E A A GA + Sbjct: 25 RQLRRLARCRRGAAAVEAAFALPLLVLMLTGLVEGGRLAWTRTSLEFAVQEAARQGALDL 84 Query: 69 VSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS 128 S D +++N A + + S + + Sbjct: 85 SRCTSGGADPCGALANARNVAAAKASAAGVPASAFTVASDPACGVRVRAAHRFRFVAAGF 144 Query: 129 MTHMAN 134 + Sbjct: 145 VGAAPT 150 >gi|195044863|ref|XP_001991891.1| GH11815 [Drosophila grimshawi] gi|193901649|gb|EDW00516.1| GH11815 [Drosophila grimshawi] Length = 4021 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 16/259 (6%), Positives = 46/259 (17%), Gaps = 6/259 (2%) Query: 43 VLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHI 102 + ++ + LA A+ V + + D+ S + KR ++ Sbjct: 553 YQSISRTSSRVQEDLDNTTLAAATADVESTTAESDQTNSKVESELTSTGSTGKRNYQSIS 612 Query: 103 KESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIE 162 + S + T + + +Y + Sbjct: 613 RTSSRVQEDLDNTTLAAATADVEST-TAESDQTNSKVESELTSTGSTGKRNYQSISRTSS 671 Query: 163 HLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQD 222 + N + + + S N S ++ Sbjct: 672 RVQEDLDNTTLAAATADVESTTAESDQANSKVESDLKSTGSTGKRNYQSISRTSGGFKEE 731 Query: 223 KKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV-- 280 + + + + + Sbjct: 732 LINKTSTTATADVESTTAESGRKQIGRTSSQTNSKIESDVASNGSTSKRNYHSISRTSGG 791 Query: 281 ---TRDMDSLILKPTDSTP 296 + T + Sbjct: 792 IEEELINKTSKHDSTATAD 810 >gi|299132280|ref|ZP_07025475.1| Flp pilus assembly protein TadG [Afipia sp. 1NLS2] gi|298592417|gb|EFI52617.1| Flp pilus assembly protein TadG [Afipia sp. 1NLS2] Length = 194 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 14/158 (8%), Positives = 42/158 (26%), Gaps = 8/158 (5%) Query: 13 KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 + + +A ++ FA+ + L+L I V A + ++ + + Sbjct: 14 RLVRDVRAVAAVEFAVILPIVLMLFFGTIEVSTGV--------AVDRKVIILTRTLSDLI 65 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 S+ ++ ++A + ++ +S +S S Sbjct: 66 SQATSITDTDISNAFNISSAVMAPYSNAPVQAKISQVFIDTNGIAKVKWSKASNTSARGC 125 Query: 133 ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYN 170 + I + + + + Sbjct: 126 NEVVTTLVPSGIAIGGTYLIMSEVAYDYTPAAGMNGGS 163 >gi|15599493|ref|NP_252987.1| TadG [Pseudomonas aeruginosa PAO1] gi|9950519|gb|AAG07685.1|AE004846_2 TadG [Pseudomonas aeruginosa PAO1] Length = 556 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 36/306 (11%), Positives = 90/306 (29%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E SA + I + N++ + Sbjct: 66 SAQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAVANEAVH--- 122 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + + F +++ + Sbjct: 123 ----------------------VEVTKSVPGSLVANLGGLFGGGNANVDLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T + A D+ + + MG Sbjct: 161 NATISAGTGLASVNSGQSALLNPILSGLLGTHIDLSAAAYNGIADAKLSVLDILGADGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LIG T + + ++Q + ++ L S A++ + S + Sbjct: 221 LIGVDTSLGTIEQLLNTNVGLQQVLAASVNVLAKNGVASVEALRAQLVGVKSATLKLGDL 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|260778542|ref|ZP_05887434.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio coralliilyticus ATCC BAA-450] gi|260604706|gb|EEX31001.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio coralliilyticus ATCC BAA-450] Length = 5696 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 28/271 (10%), Positives = 68/271 (25%), Gaps = 30/271 (11%) Query: 148 MDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL-------RIEMGERPIFLIEL--VVD 198 + + L + + K G I L E Sbjct: 5194 AGQDYNIAFVLDTSGSMGGRVGTAKTEIIEVFDQLLSTANSGSNAGTVNILLTEFATNSG 5253 Query: 199 LSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIG 258 ++ S+ + + D S + + + + + ++ D ++ + + I Sbjct: 5254 VTISVDLSSPTTRADFVSQLSSIEDNGSGLTNYETSFDAAVEWFDANTNDQAENITYFIT 5313 Query: 259 YTTRVEKNI---------EPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK 309 N + + + V D L + + + +Y+ + Sbjct: 5314 DGNPNYNNHLNVTSADFDQLVLDYDASTKDVVTLDDILPD---NYSVGIAVSYKG--EEI 5368 Query: 310 KRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISI 369 S + TDG N + ++ + + + + I Sbjct: 5369 VSSSGAVKSPLTGATLGNLSYSGGLRYTDGSNASVQARHMYQVLALISA-----VEAVGI 5423 Query: 370 NASPNGQRLLKTCVSSPEYHYNVVNADSLIH 400 + N L + S +N D L Sbjct: 5424 GSGVNEATLSQY--DSDGVVVPNINVDELAE 5452 >gi|239627294|ref|ZP_04670325.1| von Willebrand factor [Clostridiales bacterium 1_7_47_FAA] gi|239517440|gb|EEQ57306.1| von Willebrand factor [Clostridiales bacterium 1_7_47FAA] Length = 681 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 63/241 (26%), Gaps = 49/241 (20%) Query: 160 FIEHLLNQRY-NQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAP 218 +LN + + R+ P V+D+SGSM Sbjct: 281 NSGLMLNTGEKENFFLLMVQPPERVPAEAIPPREYIFVLDVSGSMF-------------- 326 Query: 219 ICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV--EKNIEPSWGTEKV 276 + K + + ++ LI ++ E V Sbjct: 327 ------GYPLDTAKELIRNMVSNLRETDTFN------LILFSNDAIRMSARSLPATDENV 374 Query: 277 RQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFL 336 + + T+ PA+++A I S + ++ + Sbjct: 375 ERAINLINRQKGGGGTELAPALEKAVGIPMDSGAGS----------------VSRSVVVI 418 Query: 337 TDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNAD 396 TDG ++ ++ + + + I S N + + + V ++ Sbjct: 419 TDGYMSDEQA--IFDIVAGNL--DTTSFFSFGIGTSVNRYLIEGIARTGGGESFVVTDSS 474 Query: 397 S 397 Sbjct: 475 E 475 >gi|319763923|ref|YP_004127860.1| fg-gap repeat-containing protein [Alicycliphilus denitrificans BC] gi|317118484|gb|ADV00973.1| FG-GAP repeat-containing protein [Alicycliphilus denitrificans BC] Length = 1245 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 24/277 (8%), Positives = 62/277 (22%), Gaps = 15/277 (5%) Query: 140 SNNTIFYNMDVMTSYDYRLQ--FIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVV 197 Y + + + + + + + G + Sbjct: 201 GTAATPYLGCRRNYHIFMTDGRWNGSVSGGTQDDNTNNITLPDGTVY-GSTTLANRPKNG 259 Query: 198 DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLI 257 + + + S + + LK + + E+ Sbjct: 260 LYADTYSNTLADWAFKSWSTKLQTATDSNGVTGLKGTPQPTAEYNK--APATENFGKDTT 317 Query: 258 GYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNF 317 G +++ P + VT + + T + + D + Sbjct: 318 GNDAILDRFWNPRYNPANWPHMVTYTIGLSKMAYTWPGNKVSGSKDPAAPDTACTVIQAP 377 Query: 318 FR--QGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNG 375 + + + F D +N + T I + Sbjct: 378 GQGCATGNKATFDTDLMVPFSYDPGSNGSLPDFITGNIKWPDIGSG--------GEPVRS 429 Query: 376 QRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 L ++ Y V + L F++I Q + + Sbjct: 430 LDLWHAALNGRGRFYAVEKGEDLAKAFRDIFQQINTQ 466 >gi|239906054|ref|YP_002952793.1| hypothetical protein DMR_14160 [Desulfovibrio magneticus RS-1] gi|239795918|dbj|BAH74907.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 133 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 16/104 (15%), Positives = 34/104 (32%), Gaps = 4/104 (3%) Query: 15 IASEKANFSIIFALSVMSFLL--LIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 I + + S+ FAL ++ F + LIG + + ++ + A ++A A Sbjct: 10 IDRQDGSISVEFALMLVLFFMPLLIGI-VDFGQILHAQSVVTRAAREGVVAAAR-NQDIP 67 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 + + ++ A I A + Y Sbjct: 68 TAVDAYIQNAGYDTSLAHIATAGSRVAGEPVMVTVRYDTSAMVI 111 >gi|84495453|ref|ZP_00994572.1| von Willebrand factor, type A [Janibacter sp. HTCC2649] gi|84384946|gb|EAQ00826.1| von Willebrand factor, type A [Janibacter sp. HTCC2649] Length = 560 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 48/142 (33%), Gaps = 12/142 (8%) Query: 272 GTEKVRQYVTRDMDSLILKPTDST-PAMKQAYQILTSDKKRSFFTNFFRQGV-----KIP 325 G K + + T + A+ A + + K Sbjct: 407 GKGKDWAAIAPIARLDEPEGTGTHRDALVAALKKFQTYISGDTGLYDTTAAAFEEMTKTY 466 Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKE-----NFIKIVTISINASPNGQRLLK 380 + ++ +TDG+N++ ++ ++ K+ ++IVTI + + + L Sbjct: 467 DPEYFNSVVIITDGKNDDPGGGLDLDQLLAHIKKGHDPTRPVRIVTIGMG-EADTKVLKT 525 Query: 381 TCVSSPEYHYNVVNADSLIHVF 402 ++ Y +D L+ VF Sbjct: 526 ISTATGGTTYTANTSDDLLTVF 547 >gi|326925129|ref|XP_003208773.1| PREDICTED: hypothetical protein LOC100547685, partial [Meleagris gallopavo] Length = 721 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 39/98 (39%), Gaps = 15/98 (15%) Query: 326 SLPFQKFIIFLTDGENNNFKSNVNTIKIC-DKAKENFIKIVTISINASPNGQRLLKTCVS 384 + I+ LTDGE+++ + +C ++ +E+ I TI++ + L + Sbjct: 7 GTTYGSEIVLLTDGEDSS-------MSVCRERVRESGAIIHTIALG-PAAAKELEEFSNI 58 Query: 385 SPE---YHYNVVNADSLIHVFQNI---SQLMVHRKYSV 416 + Y +V L+ F I S + + V Sbjct: 59 TGGLQLYAVDVDVPSKLVEAFSEITTGSGDISEQSIQV 96 >gi|320326239|gb|EFW82293.1| type IV pilus-associated protein, putative [Pseudomonas syringae pv. glycinea str. B076] gi|320330663|gb|EFW86640.1| type IV pilus-associated protein, putative [Pseudomonas syringae pv. glycinea str. race 4] Length = 1067 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 22/254 (8%), Positives = 51/254 (20%), Gaps = 15/254 (5%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIE-LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + P I +V DL+ + + A D + Sbjct: 195 SNNTSIRIGLATFNDPNNSDLGPGGKIARIVTDLAPVARTIYQPNEVLQDQANKNSDALK 254 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +A+L + L + I+ Sbjct: 255 SAIASLTPSANTPLAETYYEITRYFRGMSPFYQSGSNYVSPIQYRCQKNYGVVITDGLPT 314 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII-------FLTD 338 PT+ + + L + + G + + D Sbjct: 315 YDRTFPTNDPDDVLDTTRSLPNWDLNAANDGDNLSGDGEGDTLYLDDLAKFAYDIDLRRD 374 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP---EYHYNVVNA 395 + AK + TI ++L + ++ ++ Sbjct: 375 AVTTGGDLTGKSWDTAGFAK-QNLSTYTIGF---TASNQMLNDAANDTHGHGKYFQANDS 430 Query: 396 DSLIHVFQNISQLM 409 L + Sbjct: 431 AGLTSALSLALSDI 444 >gi|297198202|ref|ZP_06915599.1| von Willebrand factor [Streptomyces sviceus ATCC 29083] gi|197714651|gb|EDY58685.1| von Willebrand factor [Streptomyces sviceus ATCC 29083] Length = 592 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 62/199 (31%), Gaps = 24/199 (12%) Query: 221 QDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYV 280 R++M + +LL L + +GL ++T ++ + + K+ Sbjct: 399 PGTGRSRMDVTRASLLQALATFTQED------EIGLWEFSTELDGDKDY-----KILVPT 447 Query: 281 TRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQ-----KFIIF 335 R DS T + A+ L + + ++ Sbjct: 448 DRLGDSTAAGTTQRER-LSAAFGGLEPVPGGATGLYDTTLAAYKAATSSYAKGKFNALVV 506 Query: 336 LTDGENNNFKSNVNTIKICDKAK----ENFIKIVTISINASPNGQRLLKTCVSSPEYHYN 391 LTDG N + S I + K + ++ I++ + + ++ Sbjct: 507 LTDGVNQDPGSISRGALISELEKLSSPARPVPLIVIAVGPDADRAEAEQLAEATGGSGQQ 566 Query: 392 VVNADSLIHVFQNISQLMV 410 V + + V I + +V Sbjct: 567 VNDPAQIHTV---ILKAIV 582 >gi|26325991|dbj|BAC26739.1| unnamed protein product [Mus musculus] Length = 415 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 16/157 (10%), Positives = 49/157 (31%), Gaps = 19/157 (12%) Query: 255 GLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFF 314 L+ ++ + + + T++ A+ A + L +++ Sbjct: 74 SLVHVGSQPHTEFTFDQYSSGQAIQDAIRVAPQRMGDTNTGLALAYAKEQLFAEEAG--- 130 Query: 315 TNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASPN 374 K ++++TDG +++ + K+ I +S Sbjct: 131 ----------ARPGVPKVLVWVTDGGSSDPVGPPM-----QELKDLGATIFIVSTGRGNL 175 Query: 375 GQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + L + ++ + V + D L + + + + Sbjct: 176 LELLAAASAPAEKHLHFV-DVDDLPIIARELRGSITD 211 >gi|312127132|ref|YP_003992006.1| hypothetical protein Calhy_0908 [Caldicellulosiruptor hydrothermalis 108] gi|311777151|gb|ADQ06637.1| hypothetical protein Calhy_0908 [Caldicellulosiruptor hydrothermalis 108] Length = 638 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 50/182 (27%), Gaps = 14/182 (7%) Query: 11 FKKGIASEK-ANFSIIFALSVMSFLLLIGFLIYVLDWHYK-------KNSMESANNAAIL 62 K I S + II + LLL+ + + K +A ++A+ Sbjct: 2 MKNFIRSSSKGSALIIVIVFFTVLLLLVAGTMELALTSLIHSKETINKTQTSTAADSAME 61 Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIV 122 + +++ +A+ + + + + +N Sbjct: 62 YILFYISKAIAQAK-----RLTYAQFFDSTGRLIYTGDSFENDYLNTFNSYIADFFENRG 116 Query: 123 NSSRISMTHMANNRLDSSNNTI-FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 N I M N+ + SN + Y + F + + + Sbjct: 117 NRIGIDMKLADNSSVQVSNVSELILLARQSCEYISNISFSRSGNSYILEVEALDSTTKTK 176 Query: 182 RI 183 R+ Sbjct: 177 RV 178 >gi|330928504|ref|XP_003302293.1| hypothetical protein PTT_14044 [Pyrenophora teres f. teres 0-1] gi|311322453|gb|EFQ89612.1| hypothetical protein PTT_14044 [Pyrenophora teres f. teres 0-1] Length = 292 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 55/182 (30%), Gaps = 28/182 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + A D ++ A ++A+ + + + M + E + Sbjct: 4 DNSEASRNGDYVPSRWEAQQDAVNMIFSAKTGANPESSVGLMSMG--GNTPEILTTLTTD 61 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 KV + + I + + A L + +S Q+ Sbjct: 62 IGKV---LDGLHRTKIKGSSHFVTGINVAALALKHRQNKS---------------QKQRI 103 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLK---TCVSSPE 387 +IF + N +K+ K K++ I I I+ + ++L S+ Sbjct: 104 VIFNCSPIE---EDEKNLVKLAKKMKKSGISIDIIAFGELSDDTTRKLQAFSDNVQSAEG 160 Query: 388 YH 389 + Sbjct: 161 SY 162 >gi|219559725|ref|ZP_03538801.1| hypothetical protein MtubT1_21367 [Mycobacterium tuberculosis T17] Length = 75 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 25/48 (52%), Gaps = 1/48 (2%) Query: 26 FALSVM-SFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL 72 FA++++ L + G Y+ ++ ++A + A L A+++ S L Sbjct: 9 FAVAMIAVLLCVTGAGAYLGSAVVARHRAQAAADLASLVAAARLPSGL 56 >gi|84387242|ref|ZP_00990263.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] gi|84377889|gb|EAP94751.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] Length = 142 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 8/132 (6%), Positives = 31/132 (23%), Gaps = 4/132 (3%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 ++ ++ A++ L +G +I + + N + + S++ Sbjct: 6 KHQRGFAAVEMAIATPVLLFFLGLVIELGNVLIHYNVISKSVQNGARYAVSEVYDTKGGT 65 Query: 76 GDRF----ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTH 131 + + + + + + V Sbjct: 66 IAPTLEIQNVVVYGQSSVGTAVLSTLTTADVTVTPPSIDSYVRVSVTYDYVPLFLSIPLS 125 Query: 132 MANNRLDSSNNT 143 + + S + Sbjct: 126 ATSFSIPLSVTS 137 >gi|330988796|gb|EGH86899.1| type IV pilus-associated protein, putative [Pseudomonas syringae pv. lachrymans str. M301315] Length = 1067 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 22/254 (8%), Positives = 51/254 (20%), Gaps = 15/254 (5%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIE-LVVDLSGSMHCAMNSDPEDVNSAPICQDKKR 225 + P I +V DL+ + + A D + Sbjct: 195 SNNTSIRIGLATFNDPNNSDLGPGGKIARIVTDLAPVARTIYQPNEVLQDQANKNSDALK 254 Query: 226 TKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMD 285 + +A+L + L + I+ Sbjct: 255 SAIASLTPSANTPLAETYYEITRYFRGMSPFYQSGSNYVSPIQYRCQKNYGVVITDGLPT 314 Query: 286 SLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFII-------FLTD 338 PT+ + + L + + G + + D Sbjct: 315 YDRTFPTNDPDDVLDTTRSLPNWDLNAANDGDNLSGDGEGDTLYLDDLAKFAYDIDLRRD 374 Query: 339 GENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSP---EYHYNVVNA 395 + AK + TI ++L + ++ ++ Sbjct: 375 AVTTGGDLTGKSWDTAGFAK-QNLSTYTIGF---TASNQMLNDAANDTHGHGKYFQANDS 430 Query: 396 DSLIHVFQNISQLM 409 L + Sbjct: 431 AGLTSALSLALSDI 444 >gi|294011437|ref|YP_003544897.1| tight adherence protein TadE [Sphingobium japonicum UT26S] gi|292674767|dbj|BAI96285.1| tight adherence protein TadE [Sphingobium japonicum UT26S] Length = 239 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 32/102 (31%), Gaps = 1/102 (0%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAGASKMVSNL 72 + I FA LLL+ + ++ + + A A A + Sbjct: 15 LGGDRRGLALIEFAYMAPIMLLLMVGGAELANYSITSMRISALALQVADNASRIGEGDPM 74 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 ++ I++ + AL + + E SG S+ Sbjct: 75 AKKKVSEAQINDLLQGALAQGGNLNVNSTYVEKQSGGSSTIK 116 >gi|218887820|ref|YP_002437141.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758774|gb|ACL09673.1| TadE family protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 145 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 35/137 (25%), Gaps = 9/137 (6%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R +++ ++ AL + L+ L+ + SA A AGA Sbjct: 13 LRHLPSSRALAQRGLAALEVALMLPVIAALLYVLVEGGNTIRA----YSALTEASRAGAR 68 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 +V N + + + + S V +S Sbjct: 69 HVVMNDDVA-----GVQPFVRSLATTLDPNALTATAVKDTSKQMVTVTVKYAYKSVFTSN 123 Query: 127 ISMTHMANNRLDSSNNT 143 + S +T Sbjct: 124 PYNGDPNDALYTFSAST 140 >gi|318062252|ref|ZP_07980973.1| hypothetical protein SSA3_30224 [Streptomyces sp. SA3_actG] gi|318080540|ref|ZP_07987872.1| hypothetical protein SSA3_28586 [Streptomyces sp. SA3_actF] Length = 127 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 33/127 (25%), Gaps = 2/127 (1%) Query: 26 FALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNL--SRLGDRFESIS 83 L + FL L F++ ++ A A + G+ + Sbjct: 1 MILFALVFLSLAAFVVDGGLSISQRERAADLAEQAARYAAQDIDREALYGNEGEAPINAG 60 Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 N R A+ + + A + E++ + + + S Sbjct: 61 NCPARVAAFAAESGMSGADAAASGCVEADADHVEVRIQLTYRPVFTGIFYGGSIHVSGTA 120 Query: 144 IFYNMDV 150 + N Sbjct: 121 VAENKVG 127 >gi|146340336|ref|YP_001205384.1| hypothetical protein BRADO3363 [Bradyrhizobium sp. ORS278] gi|146193142|emb|CAL77154.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 183 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 14/74 (18%), Positives = 27/74 (36%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKM 68 + + + + ++ F L FL L+ LI + + +ES + A + Sbjct: 13 RFCRDFLGDRRGATAVEFGLVAAPFLALVIALIQTFLVFFAQQLLESVAQQSARAVMTGQ 72 Query: 69 VSNLSRLGDRFESI 82 V S D F+ Sbjct: 73 VRASSMTADAFKQN 86 >gi|254822564|ref|ZP_05227565.1| hypothetical protein MintA_21724 [Mycobacterium intracellulare ATCC 13950] Length = 442 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 15/156 (9%), Positives = 41/156 (26%), Gaps = 11/156 (7%) Query: 40 LIYVLDW------HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDD 93 + + K ++ESA + A +A + ++ + L+ + E +S L + Sbjct: 201 AVEGMLARIGKEQQRTKAALESARDFAAVA-SHELRTPLTAMRTNLEVLSTLD---LPPE 256 Query: 94 AKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRIS-MTHMANNRLDSSNNTIFYNMDVMT 152 + + + + S E + + L Sbjct: 257 QRHEVIGDVIRTQSRIEGTLTALERLAQGELTTVDDFVPFDITELLDRAAHDALRSYPAV 316 Query: 153 SYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 ++ ++V ++ G Sbjct: 317 EVSLVPSPTVLMVGLPTGLRLVIDNAIANAVKHGNA 352 >gi|328946175|gb|EGG40320.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1087] Length = 450 Score = 36.8 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 96/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 A +S + + + + A D KN + Sbjct: 53 AVSSPIEAAGAEIAADTTKPVVEESTAKPGDLIDISKNVSPVDVKESQEGNQTVRVETAT 112 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 113 VDVTKTEVAPAKVDKAPEPTTRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 167 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 168 PPTVTKAGSADIVFVVDRSGSMGATIDIVRANINEFVRNITKEGI---TARFGLATFSDE 224 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 225 VFGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 264 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 265 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 324 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + + K + ++ N +L I + + +Y Sbjct: 325 IERTVATV-----KAIEGIYKNFA-TEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 376 Query: 415 SVILK 419 I+K Sbjct: 377 YKIIK 381 >gi|308510751|ref|XP_003117558.1| hypothetical protein CRE_00423 [Caenorhabditis remanei] gi|308238204|gb|EFO82156.1| hypothetical protein CRE_00423 [Caenorhabditis remanei] Length = 177 Score = 36.8 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 51/168 (30%), Gaps = 7/168 (4%) Query: 245 LSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQI 304 E +GL + P + + +LK D+ + Q+ Sbjct: 4 FDVSSETARIGLAVVNGDQNEQPPPVATLNDITSHTNFHSKLELLK--DNYADFDHSGQL 61 Query: 305 LTSDKKRSFFTNFFRQ-GVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIK 363 L + + + + ++++T G N + + K+ Sbjct: 62 LAHNLDVICDDSGYSSTKEGYRTTIKNHLLLYITSGTAFNVDPVPFVQNMLAR-KQYG-- 118 Query: 364 IVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 ++T+ + + LK ++ N L + + I +L++ Sbjct: 119 MITVGYGDGLDFHK-LKNLAGGASCAFSAKNPTELNSLIKPIQRLIMT 165 >gi|225021498|ref|ZP_03710690.1| hypothetical protein CORMATOL_01518 [Corynebacterium matruchotii ATCC 33806] gi|224945880|gb|EEG27089.1| hypothetical protein CORMATOL_01518 [Corynebacterium matruchotii ATCC 33806] Length = 492 Score = 36.8 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 38/348 (10%), Positives = 98/348 (28%), Gaps = 21/348 (6%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A+ L AS +++ + + + D K+FI + + + Y+ + Sbjct: 163 ASALVAASLNNNSVDATKQALTKDKSVTVASAVQDGKKFIVVNETSAPANYTFTEIKDVV 222 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 Q + S + +A + + + + + V + + L + Q+ + Sbjct: 223 QPVRVVSLTATDTVAEDAVRAGAD--LGSSMVNADAGKQATSVTSLAATQALQQFDTEAN 280 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + P + S + + D RT + NA+ Sbjct: 281 SAAPASPAPAPAPEAQPQAAASAVTSANTLFLFDTSAAMAQASDDGRTWFQVVSNAIAQA 340 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + V Y + + S+ + + + Sbjct: 341 VPMVGAQHQVALWNYSSPLNPGVKKGWRENVSFASSMTNEQIGNTAIGFTTGG------- 393 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + ++ +T G ++ + T + AK Sbjct: 394 -------VPQTRAATVAAVNYATSYAKESGQPARVVLITTGTSDLGDMSAVTQAL-SAAK 445 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +++ + + L+K +S V AD++ V + +S Sbjct: 446 AAKVELSVVHVGTGTQDSELMKAAQASE----VVTTADNVADVIEKLS 489 >gi|268573966|ref|XP_002641960.1| C. briggsae CBR-UNC-36 protein [Caenorhabditis briggsae] Length = 1250 Score = 36.8 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 27/257 (10%), Positives = 71/257 (27%), Gaps = 31/257 (12%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + L +L QRY + ++ I + + + Sbjct: 243 NSATNSKNVLIMLDMSGSMLGQRYE--------------VAKQTTEAILETLSHNDYFNI 288 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 S + + K AL +D+ G Y + Sbjct: 289 MTFSKNTFLLDGCNGTNGLLQATMRNKKALRRKMDT---------YQSEGKAEYEKALPL 339 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + T++ S++ T+ + + + + ++ + G Sbjct: 340 AFSVLLDLKGSYALYTKEEMSMMSGNATNDYQFHLELPEHVLAATRQYIDSINNGGGDNN 399 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCV 383 + I+ +TDG N +K + + +++ T + + + + Sbjct: 400 RGAC-ENVIMLITDGAPNAYKKIFDMYNA-----DKKVRVFTFLVGDEAIDFNEVREMAC 453 Query: 384 SSPEYHYNVVNADSLIH 400 ++ Y +V N + Sbjct: 454 NNRGYMVHVANMADVDE 470 >gi|227823965|ref|YP_002827938.1| pilus assembly protein contains TadE domain [Sinorhizobium fredii NGR234] gi|227342967|gb|ACP27185.1| pilus assembly protein contains TadE domain [Sinorhizobium fredii NGR234] Length = 188 Score = 36.8 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 17/154 (11%), Positives = 41/154 (26%), Gaps = 7/154 (4%) Query: 1 MHLLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAA 60 M+L+ + + + FA+ ++ + + A Sbjct: 1 MNLVRGVFGPLWRLSTDRRGVGGVEFAIVAPILIMAYIGAFELSVGLNVVRKVA----RA 56 Query: 61 ILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQN 120 A A + S +S++N A+ L A I + ++ Sbjct: 57 SSAVADLVSQEASVDTAFLDSMNNVAESILAPYAGTDYTLKITGIQVTGTTTGTVLWSRD 116 Query: 121 I---VNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 S T + ++ + + + V Sbjct: 117 QDGGTPYPANSTTTVPSDLEAVNAFVVRTELVVP 150 >gi|221222878|sp|A8XP97|UNC36_CAEBR RecName: Full=Voltage-dependent calcium channel unc-36; AltName: Full=Pharyngeal calcium channel subunit alpha-2; AltName: Full=Uncoordinated protein 36; Flags: Precursor gi|309358190|emb|CAP34577.2| CBR-UNC-36 protein [Caenorhabditis briggsae AF16] Length = 1254 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 27/257 (10%), Positives = 71/257 (27%), Gaps = 31/257 (12%) Query: 146 YNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHC 205 + + L +L QRY + ++ I + + + Sbjct: 243 NSATNSKNVLIMLDMSGSMLGQRYE--------------VAKQTTEAILETLSHNDYFNI 288 Query: 206 AMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEK 265 S + + K AL +D+ G Y + Sbjct: 289 MTFSKNTFLLDGCNGTNGLLQATMRNKKALRRKMDT---------YQSEGKAEYEKALPL 339 Query: 266 NIEPSWGTEKVRQYVTRDMDSLILKP-TDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKI 324 + T++ S++ T+ + + + + ++ + G Sbjct: 340 AFSVLLDLKGSYALYTKEEMSMMSGNATNDYQFHLELPEHVLAATRQYIDSINNGGGDNN 399 Query: 325 PSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP-NGQRLLKTCV 383 + I+ +TDG N +K + + +++ T + + + + Sbjct: 400 RGAC-ENVIMLITDGAPNAYKKIFDMYNA-----DKKVRVFTFLVGDEAIDFNEVREMAC 453 Query: 384 SSPEYHYNVVNADSLIH 400 ++ Y +V N + Sbjct: 454 NNRGYMVHVANMADVDE 470 >gi|163742982|ref|ZP_02150365.1| hypothetical protein RG210_01912 [Phaeobacter gallaeciensis 2.10] gi|161383665|gb|EDQ08051.1| hypothetical protein RG210_01912 [Phaeobacter gallaeciensis 2.10] Length = 199 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 8/37 (21%), Positives = 15/37 (40%) Query: 3 LLSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGF 39 +L+R R F + + S+ FA + L + Sbjct: 1 MLTRIRTLFYRYRRETDGSVSVEFAFYMPLLLGVFAA 37 >gi|85713499|ref|ZP_01044489.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] gi|85699403|gb|EAQ37270.1| Flp pilus assembly protein TadG [Nitrobacter sp. Nb-311A] Length = 193 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 4/64 (6%) Query: 1 MHLLSRFRFYFK---KGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESAN 57 M RF+ + + + + ++ FAL F L+ +I + S+E+A Sbjct: 7 MSPFPRFKATARMAARFQGHKGGSAAVEFALVAPMFFALLFAIIETGLVFFASQSLETAL 66 Query: 58 -NAA 60 ++A Sbjct: 67 QDSA 70 >gi|254239033|ref|ZP_04932356.1| hypothetical protein PACG_05210 [Pseudomonas aeruginosa C3719] gi|126170964|gb|EAZ56475.1| hypothetical protein PACG_05210 [Pseudomonas aeruginosa C3719] Length = 556 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 36/306 (11%), Positives = 90/306 (29%), Gaps = 25/306 (8%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ I+ A +++ L+ + ++ + ++ +++ + A L AS+ + Sbjct: 6 ARQRGAIGILAATTLLLALICLLLVVDTGRLYLEQRNLQRVADVAALESASQGALCGDQS 65 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + S + + DA + E SA + I + N++ + Sbjct: 66 SAQATSFAKASAMLNGFDADAAGSSLSAEVGGVLSAGGLRSFIASASNAAAANEAVH--- 122 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIEL 195 ++ + + F +++ + Sbjct: 123 ----------------------VEVTKSVPGSLVANLGGLFGGGNANVDLRAEAVARRLP 160 Query: 196 VVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMG 255 +S A + + PI T + A D+ + + MG Sbjct: 161 NATISAGTGLASVNSGQSALLNPILSGLLGTHIDLSAAAYNGIADAKLSVLDILGADGMG 220 Query: 256 LIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFT 315 LIG T + + ++Q + ++ L S A++ + S + Sbjct: 221 LIGVDTSLGNIEQLLNTNVGLQQVLAASVNVLAKNGVASVEALRAQLVGVKSATLKLGDL 280 Query: 316 NFFRQG 321 G Sbjct: 281 LGLSAG 286 >gi|325687363|gb|EGD29384.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK72] Length = 450 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 97/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI-QNI 121 A +S + + + + A D KN + + Sbjct: 53 AVSSPIEAAGTEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVHVETTT 112 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 113 VDVTKTEVAPGKVDKAPDPITRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 167 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 168 PPTVTKAGSADIVFVVDRSGSMGSTIDIVRANINEFVRNITKEGI---TARFGLATFSDE 224 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 225 VYGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 264 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 265 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 324 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + + K + ++ N +L I + + +Y Sbjct: 325 IERTVATV-----KAIEGIYKNFA-TEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 376 Query: 415 SVILK 419 I+K Sbjct: 377 YKIIK 381 >gi|256396003|ref|YP_003117567.1| hypothetical protein Caci_6888 [Catenulispora acidiphila DSM 44928] gi|256362229|gb|ACU75726.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 141 Score = 36.8 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 38/143 (26%), Gaps = 15/143 (10%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA--GASKMVSNLSRLGDRF 79 ++ L V +++IG + + A + A+ A AS+ + + D Sbjct: 1 MTVFVTLLVGVLMMVIGLVTDGGGAVAARA---DALDEAMSAGRAASQAIDIEALHRDHV 57 Query: 80 ESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI----------SM 129 + A R+ D +H S+ A T + + Sbjct: 58 IVLDEPAARSAAADFMAATGDHAVISIRQVGAANRVTHVGGDAGPEVTVTVTRTVKTQFL 117 Query: 130 THMANNRLDSSNNTIFYNMDVMT 152 + + + Sbjct: 118 WAVGIDSFAMTATATVTPEPGSL 140 >gi|305681252|ref|ZP_07404059.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659457|gb|EFM48957.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 492 Score = 36.8 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 38/348 (10%), Positives = 98/348 (28%), Gaps = 21/348 (6%) Query: 59 AAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEI 118 A+ L AS +++ + + + D K+FI + + + Y+ + Sbjct: 163 ASALVAASLNNNSVDATKQALTKDKSVTVASAVQDGKKFIVVNETSAPANYTFTEIKDVV 222 Query: 119 QNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIP 178 Q + S + +A + + + + + V + + L + Q+ + Sbjct: 223 QPVRVVSLTATDTVAEDAVRAGAD--LGSSMVNADAGKQATSVTSLAATQALQQFDTEAN 280 Query: 179 ALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLF 238 + P + S + + D RT + NA+ Sbjct: 281 SAAPASPAPAPAPEAQPQAAASAVTSANTLFLFDTSAAMAQASDDGRTWFQVVSNAIAQA 340 Query: 239 LDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAM 298 + + V Y + + S+ + + + Sbjct: 341 VPMVGAQHQVALWNYSSPLNPGVKKGWRENVSFASSMTNEQIGNTAIGFTTGG------- 393 Query: 299 KQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAK 358 + + ++ +T G ++ + T + AK Sbjct: 394 -------VPQTRAATVAAVNYATSYAKESGQPARVVLVTTGTSDLGDMSAVTQAL-SAAK 445 Query: 359 ENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNIS 406 +++ + + L+K +S V AD++ V + +S Sbjct: 446 AAKVELSVVHVGTGTQDSELMKAAQASE----VVTTADNVADVIEKLS 489 >gi|332978757|gb|EGK15449.1| hypothetical protein HMPREF9373_0075 [Psychrobacter sp. 1501(2011)] Length = 2843 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 34/341 (9%), Positives = 74/341 (21%), Gaps = 32/341 (9%) Query: 55 SANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFY 114 ++A+ G+ + + + S + D + Sbjct: 2436 DTTDSAVATGSISF----TDVDNFDASDFSIDLAGPTDITVKGETVSWTWDDLTSQLTGQ 2491 Query: 115 NTEIQNIVNSSRISMTHMANN-------RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQ 167 + + + +A + +++ + +QF + + Sbjct: 2492 VVIEGESLEVMTVDVGPIATDGSGGFVADYNTTLLHAIDHPSNDDEDQLDVQFEASVSDG 2551 Query: 168 RYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTK 227 + + I S S V A + T+ Sbjct: 2552 TQSASTNFAVTVEDDRPAQSESDVDIAAATSDSNITIILDTSGSMSVEVASGGGE---TR 2608 Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 + K A++ LD D+ VK + + + + + Sbjct: 2609 LDLAKAAIVKLLDGYDVAGDVKVQLITFAKEAQAQPQWM--------TIDELKDLLPGIN 2660 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFT----------NFFRQGVKIPSLPFQKFIIFLT 337 T+ A+ A + F QG Sbjct: 2661 ARGGTNYDAALDVAMTSYDNAGGIFGANNYSYFLTDGEPTFGQGTVSDFGSGLNGTGGQN 2720 Query: 338 DGENNNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRL 378 +G +N N IK + S N L Sbjct: 2721 NGNPDNGIQAQEETLWTTFLNANDIKSYAFAFGPSANNNNL 2761 >gi|303248313|ref|ZP_07334575.1| TadE family protein [Desulfovibrio fructosovorans JJ] gi|302490338|gb|EFL50250.1| TadE family protein [Desulfovibrio fructosovorans JJ] Length = 147 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 13/147 (8%), Positives = 41/147 (27%), Gaps = 6/147 (4%) Query: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 K+ ++ ++ A++++ + L+ L+ + + +++A A + Sbjct: 1 MKRLHRDQRGLAAVEMAIAMVLLVPLLLILVEASRALTEYSQLQNAAMEGARMLARQNGE 60 Query: 71 NLSRLGDRFESISNHAKRA----LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS- 125 ++ A + +++ YN ++S Sbjct: 61 TGGVEDYIKNTVLTDASGKSLFDGAEPTVTISPRDANNNVTVQVDHDYNPSFMPQYDASG 120 Query: 126 -RISMTHMANNRLDSSNNTIFYNMDVM 151 ++ L S T Sbjct: 121 NPTPFNLPGSDTLTISAKTTMALPAAN 147 >gi|260463531|ref|ZP_05811730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259030622|gb|EEW31899.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 552 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 9/112 (8%), Positives = 31/112 (27%), Gaps = 6/112 (5%) Query: 294 STPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKI 353 S A+ A L K + + + ++ +TDG + Sbjct: 93 SASAITDAADSLKFLGKTPLTAAVKQAAEALKYTEDKATVVLITDGLETCGGDPC---AL 149 Query: 354 CDKAKENFIKIV--TISINASPNGQRLLK-TCVSSPEYHYNVVNADSLIHVF 402 + + + + + + + + + ++ + + +L Sbjct: 150 GKELEASGVDFTADVVGFGLTADEGKQIACLADNTGGKYIQASDEKALQEAL 201 >gi|326330150|ref|ZP_08196461.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] gi|325951963|gb|EGD43992.1| putative membrane protein [Nocardioidaceae bacterium Broad-1] Length = 129 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 42/136 (30%), Gaps = 16/136 (11%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 E+ + SI + + + ++ + ++ + LA A+ + Sbjct: 7 QDERGSASIELIILLPALFAIMFLGMQAALYYEART----------LAIAAA------QE 50 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 G R N A+ ++ + ++L G T V + S++ + Sbjct: 51 GARAAGAENGTSAEGNRAARSYLADAGGDALEGARTRASRTATSASVTVTGHSLSVVPGW 110 Query: 136 RLDSSNNTIFYNMDVM 151 L + + V Sbjct: 111 HLRVTQSASVATERVT 126 >gi|134295593|ref|YP_001119328.1| TadE family protein [Burkholderia vietnamiensis G4] gi|134138750|gb|ABO54493.1| TadE family protein [Burkholderia vietnamiensis G4] Length = 147 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 12/134 (8%), Positives = 36/134 (26%), Gaps = 1/134 (0%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSME-SANNAAILAGASKMVSNLSR 74 ++ ++ FA+ F ++ +I + S+ +A+ A A Sbjct: 11 RRQRGATAVEFAIIFPLFFVICYAIICFGMIFVIQQSLTFAASEGARAALNYAPDLATRT 70 Query: 75 LGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + + K + Y + ++ + + Sbjct: 71 SKAQSAAQTVVGWLNISAPSVTVQAPACKYDATLYCLSVTVSYSPRAWVTTMPFLGTIVT 130 Query: 135 NRLDSSNNTIFYNM 148 L S+ Sbjct: 131 QPLTSTAVVQISQS 144 >gi|326202881|ref|ZP_08192748.1| hypothetical protein Cpap_2190 [Clostridium papyrosolvens DSM 2782] gi|325986958|gb|EGD47787.1| hypothetical protein Cpap_2190 [Clostridium papyrosolvens DSM 2782] Length = 213 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 39/135 (28%), Gaps = 5/135 (3%) Query: 22 FSIIF---ALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 ++F A++ + L+ + + YK+NS+ A + A+ A + + S G Sbjct: 1 MGVLFIGLAVTFLVLLVFVSAA-DYAVYTYKRNSVSKAMDYAVTAAVQNINKDKSLEGLS 59 Query: 79 FESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLD 138 N R K LS + + N + + + Sbjct: 60 EGFDENTGTRLTDGIEINIDKAETTF-LSVFEGNSSQRQTDIKGNLLICATSVSGGDVDY 118 Query: 139 SSNNTIFYNMDVMTS 153 + + Sbjct: 119 RIKTSSGEVIAGKVE 133 >gi|121534454|ref|ZP_01666277.1| hypothetical protein TcarDRAFT_1555 [Thermosinus carboxydivorans Nor1] gi|121306947|gb|EAX47866.1| hypothetical protein TcarDRAFT_1555 [Thermosinus carboxydivorans Nor1] Length = 366 Score = 36.8 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 32/359 (8%), Positives = 80/359 (22%), Gaps = 35/359 (9%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R ++ + +K + +I+ + ++ +L + + + A+ Sbjct: 1 MRTQLRRYLKGQKGSAAILAIVVMLILAVLGAAYVGLGMTET--------------STAA 46 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 ++ + + A + E S + Sbjct: 47 NFRDGIAAQYLAEAGVKRALIELYNNSAWDPGAGGLTEKQGNGSFTVIVRPGETSSQKKV 106 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMG 186 +S + R + + S + + I + G Sbjct: 107 VSTGQVNRARRQVVLDVNLPSNSGNGSVY---DYSVFAGSNMVVHNKARVIG-AIHSNNG 162 Query: 187 ERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLS 246 + + + A + A A + Sbjct: 163 MELKNGSYISGKVEVHGAFTDHGATLGTPPAITNAPEIP--FPAFNAADYKQGAQVLPWG 220 Query: 247 HVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 Y + I+ WG+ + + D I T Sbjct: 221 RTLSAGNYRGKYYYMGSQLTIDSGWGSIQGDATIFATGDIDIKNGTVIG----------- 269 Query: 307 SDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIV 365 S F + + + K ++ ++ G N N S+V + I + Sbjct: 270 --GDSSAFLIIALGNIDVNNGSILKNVVLISAGGNINLHSDVELTGSA--IAKEEIIVH 324 >gi|226349295|ref|YP_002776409.1| hypothetical protein ROP_pROB01-00580 [Rhodococcus opacus B4] gi|226245210|dbj|BAH55557.1| hypothetical protein [Rhodococcus opacus B4] Length = 150 Score = 36.4 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 13/144 (9%), Positives = 43/144 (29%), Gaps = 5/144 (3%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + + ++ + +I L+ + +LL+G+ + + A GA Sbjct: 2 RGTRWISRARRDDRGDVTIELCLATVILILLLGW----MYAYGVNRQAHQKVEHAATEGA 57 Query: 66 SKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSS 125 + + A ++ + ++ SG+ + + Sbjct: 58 RAASLARTIATATPLAYQ-AAAGSMDGQGLKCATMNVNADTSGFRTRPGVPATVEVTVTC 116 Query: 126 RISMTHMANNRLDSSNNTIFYNMD 149 ++S + ++ + + Sbjct: 117 QVSFDALGWPGVNGARTVTATAIS 140 >gi|255264656|ref|ZP_05343998.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255106991|gb|EET49665.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 317 Score = 36.4 bits (82), Expect = 7.9, Method: Composition-based stats. Identities = 8/123 (6%), Positives = 33/123 (26%), Gaps = 1/123 (0%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKN-SMESANNAAILAGASKMVSNLSRLGDRFE 80 +++ ++ + + + +++ +++A +A + + ++ L E Sbjct: 43 MTLLAIFLPVAMIWVAALAVRSSRIVREESRRLQAAIDAMRQTYVADRQAGVTTLQPTVE 102 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 N +A S ++ + + + Sbjct: 103 RKLNEIAKATQKTESALATFASSRDKVTVSLEPTASDTSAEDTEQTSLALGTSAEDISAP 162 Query: 141 NNT 143 T Sbjct: 163 ITT 165 >gi|221132347|ref|XP_002163331.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 435 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 32/117 (27%), Gaps = 9/117 (7%) Query: 295 TPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKIC 354 A++ A + + K I+FL KS + Sbjct: 62 NYALQTACSTIFYKGEAGRRIGSALSSFHKLFPRVGKKILFL----ITMGKSEDDVNNSA 117 Query: 355 DKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVH 411 + + + I TI I L Y + ++LI V +I + Sbjct: 118 KQLALSGVSIFTIGIGEEVEKNELKSI-----SRFYLTTDWNNLITVVASIQNSLTK 169 >gi|189203839|ref|XP_001938255.1| 26S proteasome non-ATPase regulatory subunit 4 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187985354|gb|EDU50842.1| 26S proteasome non-ATPase regulatory subunit 4 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 299 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 55/182 (30%), Gaps = 28/182 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + A D ++ A ++A+ + + + M + E + Sbjct: 11 DNSEASRNGDYVPSRWEAQQDAVNMIFSAKTGANPESSVGLMSMG--GNTPEILTTLTTD 68 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 KV + + I + + A L + +S Q+ Sbjct: 69 IGKV---LDGLHRTKIKGSSHFVTGINVAALALKHRQNKS---------------QKQRI 110 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLK---TCVSSPE 387 +IF + N +K+ K K++ I I I+ + ++L S+ Sbjct: 111 VIFNCSPIE---EDEKNLVKLAKKMKKSGISIDIIAFGELSDDTTRKLQAFSDNVQSAEG 167 Query: 388 YH 389 + Sbjct: 168 SY 169 >gi|168699474|ref|ZP_02731751.1| hypothetical protein GobsU_08132 [Gemmata obscuriglobus UQM 2246] Length = 417 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 13/172 (7%), Positives = 42/172 (24%), Gaps = 2/172 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + A+ + + ++ + + ++ ++ +AA LA + + +N Sbjct: 10 RAGAVLVKVAICLPVLIGVLALNLDGGRLYDERRRAQATADAAALAAGADLYANYWTNQG 69 Query: 78 RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA--NN 135 + A + A + + + ++ S + V Sbjct: 70 ADPIGTARAAAEQVAVANGYPASAVTVTVPPTSGAYAGQPGHAEVRIDSAVTNSFGRTIT 129 Query: 136 RLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 + + L + + I L + Sbjct: 130 GSATGVSARAVGRGEPVRIGIVLLNPTASGAFKNTAAAFALINKPLVVNSTS 181 >gi|91776868|ref|YP_546624.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Methylobacillus flagellatus KT] gi|91710855|gb|ABE50783.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Methylobacillus flagellatus KT] Length = 1245 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 20/246 (8%), Positives = 50/246 (20%), Gaps = 25/246 (10%) Query: 115 NTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIV 174 + I + A + S+ + V + + L + + Sbjct: 13 VLLLATIWLVGNAWLAEAAVTDISSNPLSTASGTVVRPNVMFVLDDSGSMNFSFMPDFLG 72 Query: 175 SFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKR--------T 226 + + SG A DP + Sbjct: 73 YGSYMAGKCRASNGNMTASCAGTSQSGQNRPATGGDPPVYSPLVNAIYYNPQISYTPPIN 132 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIE---PS-----------WG 272 + +++ + V I + V+ + + + Sbjct: 133 PCDTSQTLPT--MNATNTDGWRAVRVVGHAINNGSCVQNSARANILTEYPERAYCGTVYN 190 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDK-KRSFFTNFFRQGVKIPSLPFQK 331 + R + + T +T +S + S F + + Sbjct: 191 SSTRRNETVCKRNGIDEPATGTTNGYDWPNNTFSSGRTINSHPHYFLISPREYCTTSQLD 250 Query: 332 FIIFLT 337 T Sbjct: 251 VCTLRT 256 >gi|227904522|ref|ZP_04022327.1| surface protein [Lactobacillus acidophilus ATCC 4796] gi|227867731|gb|EEJ75152.1| surface protein [Lactobacillus acidophilus ATCC 4796] Length = 3517 Score = 36.4 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 27/317 (8%), Positives = 70/317 (22%), Gaps = 8/317 (2%) Query: 50 KNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGY 109 K ++ A + A + +N + + A A + I+ + + Sbjct: 924 KTELQDAIDQAS---TVESSNNYTNADPNLQKAYTDAISAGQTVLSKENATQIEVNKALT 980 Query: 110 SAVFYNTEIQNIVNSSRISMTHM-ANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + + V + A + + + + Sbjct: 981 TINNAKDALNGDVKKAASKEALQKAVDEAPTVKSDDAAYYNGSDEAKAAYDKAISAGQTV 1040 Query: 169 YNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPE--DVNSAPICQDKKRT 226 N + + + ++ A+N+ D N+ + ++T Sbjct: 1041 LNNPDATATQITDALNAINTAKGDFDGKTTDKSALETAINNSKTVKDSNNYTNADETQKT 1100 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKN--IEPSWGTEKVRQYVTRDM 284 + + + LD + L + + E + Sbjct: 1101 AYDSAVTSAQIVLDKTNATQAEVNQALQDLETANNNLNGDAKTEAANKAALEAAVKDAPN 1160 Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 + T + Y + + +L GE N Sbjct: 1161 VRNTPAYYNGTNEAQTTYNNAINAGQAVLDQANPSANDVKSALDAINAAKANLKGEATNT 1220 Query: 345 KSNVNTIKICDKAKENF 361 S + + K Sbjct: 1221 TSLETALTNANNVKNTG 1237 >gi|220915781|ref|YP_002491085.1| hypothetical protein A2cp1_0662 [Anaeromyxobacter dehalogenans 2CP-1] gi|219953635|gb|ACL64019.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 167 Score = 36.4 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 8/161 (4%), Positives = 37/161 (22%), Gaps = 12/161 (7%) Query: 9 FYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHY-------KKNSMESANNAAI 61 + + I+ + +M ++ + + +++ + + AA Sbjct: 2 SLSRPLPRRASGSTLILTVILLMVLAVIGVAAVSLGSQERINASGKTQRDQLYACAAAAR 61 Query: 62 LAGASKMVSNLSRLGDRFESIS-----NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT 116 L +++ + + + + + +T Sbjct: 62 LQIWAELARYGRGYLESGNVAGSVTLPDGTVLTAPSHYDSAADVTVSSVVLKNTVTTAST 121 Query: 117 EIQNIVNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + +S M + + ++ Sbjct: 122 PTATDLTNSFNFMQGLNTATGYTVVAKCKDRRGRPLEVEFV 162 >gi|315445474|ref|YP_004078353.1| hypothetical protein Mspyr1_39290 [Mycobacterium sp. Spyr1] gi|315263777|gb|ADU00519.1| hypothetical protein Mspyr1_39290 [Mycobacterium sp. Spyr1] Length = 172 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 14/143 (9%), Positives = 37/143 (25%), Gaps = 9/143 (6%) Query: 21 NFSIIFALSVMSFLLLIGFLI-----YVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 I+ L ++ L GF I + + +A + M + + Sbjct: 18 AAGIVAVLCSVAMLAAGGF-IWSEHRDAVRGQQSRAEFAAAAEQ---VAVTLMSLDHADP 73 Query: 76 GDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANN 135 + I +++ + + + + +K S + + S + + Sbjct: 74 QTGVQRILDNSVDPFLSEFRSTTGDFVKLSKDAEVTTKAVANAAAVQSISGDAAVVLVAA 133 Query: 136 RLDSSNNTIFYNMDVMTSYDYRL 158 +N L Sbjct: 134 TSTVTNADGVTEAPRSWQLKLDL 156 >gi|222628417|gb|EEE60549.1| hypothetical protein OsJ_13899 [Oryza sativa Japonica Group] Length = 1082 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 5/76 (6%) Query: 346 SNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSL-IHVFQN 404 S+ + IC +A I T N N + + S + + + +L F + Sbjct: 567 SDNHDNSICMEALSTNYNIHTFGFNGMHNVRAMYNIASRSNGMYDLLNDDRNLITEAFIS 626 Query: 405 ISQLMVHRKYSVILKG 420 + S+I+ G Sbjct: 627 CMNKIT----SIIVLG 638 >gi|315091028|gb|EFT63004.1| TadE-like protein [Propionibacterium acnes HL110PA4] Length = 74 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 6/37 (16%), Positives = 12/37 (32%), Gaps = 2/37 (5%) Query: 14 GIASEKA--NFSIIFALSVMSFLLLIGFLIYVLDWHY 48 E+ + S+ L V L++ G + Sbjct: 1 MARDERGGGSVSVWMLLMVPVILVMAGLVFDGSRQIS 37 >gi|84496015|ref|ZP_00994869.1| hypothetical protein JNB_00810 [Janibacter sp. HTCC2649] gi|84382783|gb|EAP98664.1| hypothetical protein JNB_00810 [Janibacter sp. HTCC2649] Length = 130 Score = 36.4 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 39/125 (31%), Gaps = 7/125 (5%) Query: 22 FSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG-ASKMVSNLSRLGDRFE 80 +I A S + ++L+G + + + + A LA A++ + G Sbjct: 1 MTIWLATSGFAMIVLVGLAVDLGGQVHAQQR------ARTLAAQAARTGGQQLQPGPAIR 54 Query: 81 SISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSS 140 + A RA A R + S + + + + + + L + Sbjct: 55 GEAASADRARAVQAARAYLAAADVTGSVVVSGGDTVTVTTTDTYATTFLGIIGISTLTVT 114 Query: 141 NNTIF 145 N Sbjct: 115 GNAQA 119 >gi|298676220|ref|YP_003727969.1| fibronectin type III domain-containing protein [Methanohalobium evestigatum Z-7303] gi|298289208|gb|ADI75173.1| Fibronectin type III domain protein [Methanohalobium evestigatum Z-7303] Length = 941 Score = 36.4 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 37/277 (13%), Positives = 84/277 (30%), Gaps = 14/277 (5%) Query: 68 MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRI 127 ++++ + S + A D+ ++ ++ +T+ +I + Sbjct: 264 LITDNFNTTEATISKEDFILNAGTIDSITKTESGSNTTIKITLTSEVDTDTIDININESG 323 Query: 128 SMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGE 187 S+ ANN LD S + ++ M L + + N + L ++ + Sbjct: 324 SIEDSANNVLDDSTVSTAVSVSGMDGVAPTLDNAKSVDKTTINVTVSDSGTGLNISKISK 383 Query: 188 RPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSH 247 L +GS+ ++ D + + +K+ A + L D I+ + Sbjct: 384 NDFSLS------TGSISTIDTNNLNDGDESGNITINLESKVDADTITVSLQNDGINDSAG 437 Query: 248 VKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQIL 305 V I+ + TE +T ++ + T+ + Sbjct: 438 NSNTSGSATTNNMDGVAPTIKEFKAANTEAQNISITLNVSEELGGETNDIN------VTI 491 Query: 306 TSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENN 342 T + N F + TDG+ Sbjct: 492 TGAENAEMDNNNFSTEDSSAPYVYTATYESSTDGDYT 528 >gi|222084463|ref|YP_002542992.1| hypothetical protein Arad_0356 [Agrobacterium radiobacter K84] gi|221721911|gb|ACM25067.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 237 Score = 36.4 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 11/153 (7%), Positives = 41/153 (26%), Gaps = 7/153 (4%) Query: 5 SRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAG 64 R + + + +I FAL + + ++I ++ + + +A + Sbjct: 48 RRAKRGLRAFGRAHDGAAAIEFALLAIPYFMIIFAILETFVAFIAEQVVSNAVD------ 101 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNS 124 + + + ++A ++ I E + + + Sbjct: 102 -TLSRQVRTGQITATNTTGQQFRQAFCNEISVLIACSAAELATPTNLYLDVKSYSTFADM 160 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 ++ D + + + + Sbjct: 161 PTTIPRVSSDPYSDLNTTGFSFAPGGTGTINMV 193 >gi|86139958|ref|ZP_01058523.1| secreted hemolysin-type calcium-binding bacteriocin, putative [Roseobacter sp. MED193] gi|85823376|gb|EAQ43586.1| secreted hemolysin-type calcium-binding bacteriocin, putative [Roseobacter sp. MED193] Length = 3377 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 12/103 (11%), Positives = 33/103 (32%), Gaps = 1/103 (0%) Query: 47 HYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL 106 ++ ++SA + A A+ + + D+ ++ DA + ++ S Sbjct: 171 ELQRQQLQSALDTASAELAAAQNNVGAMTSDKADADQTVTDAQTTLDAAAATMSLLQSSG 230 Query: 107 SGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 A + + + A LD++ + + Sbjct: 231 QVGDAQA-ALSLAQTALIQSQNALNAAQGELDTATTSAASMLS 272 >gi|227875848|ref|ZP_03993974.1| TadE family protein [Mobiluncus mulieris ATCC 35243] gi|306819129|ref|ZP_07452843.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239] gi|307700008|ref|ZP_07637057.1| TadE-like protein [Mobiluncus mulieris FB024-16] gi|227843596|gb|EEJ53779.1| TadE family protein [Mobiluncus mulieris ATCC 35243] gi|304648105|gb|EFM45416.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239] gi|307614769|gb|EFN93989.1| TadE-like protein [Mobiluncus mulieris FB024-16] Length = 150 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 13/138 (9%), Positives = 39/138 (28%), Gaps = 4/138 (2%) Query: 12 KKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSN 71 ++ + E+ + + FAL V + L+ ++ + + K ++ +AA + V Sbjct: 15 RRILKCERGSEPVSFALVVPLVMFLVLGVLQLALSMWVKTTL---IDAASAGAHAAAVVG 71 Query: 72 LSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI-VNSSRISMT 130 A D + + + + + + V++ + Sbjct: 72 APPEVAEETVREVLASTVGRDYVQNVEVSRVNLTTVSNQFGSAPVDALQVSVSAPLPVLG 131 Query: 131 HMANNRLDSSNNTIFYNM 148 + Sbjct: 132 FFGFGSFQVVGHAAVEVA 149 >gi|221070081|ref|ZP_03546186.1| membrane protein-like protein [Comamonas testosteroni KF-1] gi|220715104|gb|EED70472.1| membrane protein-like protein [Comamonas testosteroni KF-1] Length = 520 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 4/52 (7%), Positives = 22/52 (42%), Gaps = 3/52 (5%) Query: 8 RFYFKKGIASEKANFSIIFAL-SVMSFLLLIGFLIYVLDWHYKKNSMESANN 58 + + + +++ AL +++ + L I + + +++ ++ + Sbjct: 6 TGLGRARMRRQSGTVAMLGALWLMIALICLA--TIDIGNVFWQRRELQKIAD 55 >gi|308509664|ref|XP_003117015.1| CRE-CLEC-60 protein [Caenorhabditis remanei] gi|308241929|gb|EFO85881.1| CRE-CLEC-60 protein [Caenorhabditis remanei] Length = 399 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 17/176 (9%), Positives = 48/176 (27%), Gaps = 15/176 (8%) Query: 228 MAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSL 287 +AA ++ I + +GL+ Y + + L Sbjct: 59 IAANIASVFSNNTRIGTNPNEPRTTRLGLVTYNAVANTVADLDQFQS-LDDVYDGIFGGL 117 Query: 288 ILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSN 347 + + G + + ++ + + Sbjct: 118 AQVSSSDESYLAHGLAQAE---------IILEAGQTAVNRSHYERVVIVY-ASTYKGSGS 167 Query: 348 VNTIKICDKAKENFIKIVTISINASPNGQRL--LKTCVSSPEYHYNVVNADSLIHV 401 ++ I + D+ K + I+T++ + +G L L+ + P + + + Sbjct: 168 LDPIPVADRLKTAGVTIITVAYDQDGDGALLHDLQKISTPP--YNFANTDQAGNTI 221 >gi|167584951|ref|ZP_02377339.1| hypothetical protein BuboB_06421 [Burkholderia ubonensis Bu] Length = 56 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 7/38 (18%), Positives = 15/38 (39%), Gaps = 4/38 (10%) Query: 4 LSRFRFYFKKGIASEKANFSI----IFALSVMSFLLLI 37 +S+ K+ + E +I I AL + + + Sbjct: 1 MSKLVQQLKQFVRDEDGVTAIEYGLIAALIAVVIIGAV 38 >gi|116626538|ref|YP_828694.1| hypothetical protein Acid_7501 [Candidatus Solibacter usitatus Ellin6076] gi|116229700|gb|ABJ88409.1| hypothetical protein Acid_7501 [Candidatus Solibacter usitatus Ellin6076] Length = 143 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 26/77 (33%), Gaps = 2/77 (2%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 R + ++ + AL + LL + + Y SM++A A+L +S Sbjct: 1 MRKSVR--ARRKRGGAIMEMALLMPWVFLLFIGALDWGFYAYALISMQAATRTAVLYTSS 58 Query: 67 KMVSNLSRLGDRFESIS 83 + G + Sbjct: 59 TDATKADSAGACTLVVK 75 >gi|324995677|gb|EGC27589.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK678] Length = 450 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 95/365 (26%), Gaps = 44/365 (12%) Query: 63 AGASKMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESL-SGYSAVFYNTEIQNI 121 A +S + + + A D KN + Sbjct: 53 AVSSPIEVAGAEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETAT 112 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALL 181 V+ ++ + ++ N T + K + Sbjct: 113 VDVTKTEVAPAKVDKAPEPITRRTENTLSGTD-----ENGNPYTQYERVDKTTTITYTST 167 Query: 182 RIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 + + I VVD SGSM ++ ++N K+ + L F D Sbjct: 168 PPTVTKAGSADIVFVVDRSGSMGGTIDIVRANINEFVRNITKEGI---TARFGLATFSDE 224 Query: 242 IDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQA 301 + + +D L + + + ++ + + A++ A Sbjct: 225 VFGRNSGSKDEDTVLTRFGSSY-----FTTDPAELEKAL---------------AAIRIA 264 Query: 302 YQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENF 361 T + S +KF++ LTD E S K Sbjct: 265 SGGDTPETPTPALNQIISTYDWSKSSKNKKFVVLLTDAEMKEDPSIPTVADTLAALKAAG 324 Query: 362 I--KIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQN-----ISQLMVHRKY 414 I + T+ + + K + ++ N +L I + + +Y Sbjct: 325 IERTVATV-----KAIEGIYKNFA-TEGRVLDIEN--NLADALTKGTTSWIVESVNEARY 376 Query: 415 SVILK 419 I+K Sbjct: 377 YKIIK 381 >gi|169627635|ref|YP_001701284.1| hypothetical protein MAB_0531c [Mycobacterium abscessus ATCC 19977] gi|169239602|emb|CAM60630.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 522 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 45/134 (33%), Gaps = 21/134 (15%) Query: 285 DSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNF 344 D+L+ T + A++ + S + ++ +TDG N + Sbjct: 405 DALVGGGTGLYDTARAAFRKVQSTY----------------DPHYINSVVIITDGSNEDP 448 Query: 345 KSNVNTIKICDKAKENF----IKIVTISINASPNGQRLLKTCVSSPEYHYN-VVNADSLI 399 + KE + ++T+ I + L + ++P NAD + Sbjct: 449 NGISLEQLLATLKKEQDPARPVVLITLGITEDADATVLKQISDATPGAGSRVARNADEIP 508 Query: 400 HVFQNISQLMVHRK 413 ++ ++ + + Sbjct: 509 NIVIDLIKARTTAR 522 >gi|67537994|ref|XP_662771.1| hypothetical protein AN5167.2 [Aspergillus nidulans FGSC A4] gi|40743158|gb|EAA62348.1| hypothetical protein AN5167.2 [Aspergillus nidulans FGSC A4] gi|259484621|tpe|CBF81002.1| TPA: von Willebrand and RING finger domain protein (AFU_orthologue; AFUA_6G06990) [Aspergillus nidulans FGSC A4] Length = 1166 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 23/233 (9%), Positives = 65/233 (27%), Gaps = 32/233 (13%) Query: 167 QRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRT 226 ++ S + I + + + +++ + V P+ + Sbjct: 609 NGNMKRQASLNSSHGAARSNNTAITDYT-NMGVESVLSPSIHIPVDIVVVIPVSSSMQGL 667 Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDS 286 K+ L+++L + ++ MGL+ + + T K ++ + S Sbjct: 668 KITLLRDSLKFLVQNLGPRD------RMGLVTFGSSGGGV-PLVGMTTKSWGGWSKILSS 720 Query: 287 LILKPT-----DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGEN 341 + D A +L K + ++ S P + Sbjct: 721 IRPVGQKSLRADVVEGANVAMDLLMQRKSSNPVSSILLISDSSTSDPDSVDFVV------ 774 Query: 342 NNFKSNVNTIKICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVN 394 + AK + I + + + +++ + + V + Sbjct: 775 ----------SRAEAAK---VGIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 814 >gi|237653353|ref|YP_002889667.1| hypothetical protein Tmz1t_2688 [Thauera sp. MZ1T] gi|237624600|gb|ACR01290.1| conserved hypothetical protein [Thauera sp. MZ1T] Length = 474 Score = 36.4 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 8/78 (10%), Positives = 25/78 (32%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGD 77 + ++ V + L ++ + +K + +A +A + A+ L+ Sbjct: 19 QHGQALVLGLFLVFATLAVLLLMFNTGRVVDEKMRLTNAADATAWSVATLEARALNYDAY 78 Query: 78 RFESISNHAKRALIDDAK 95 +I + + Sbjct: 79 TNRAIVANEVAIAQAISL 96 >gi|90418065|ref|ZP_01225977.1| hypothetical protein SI859A1_02204 [Aurantimonas manganoxydans SI85-9A1] gi|90337737|gb|EAS51388.1| hypothetical protein SI859A1_02204 [Aurantimonas manganoxydans SI85-9A1] Length = 189 Score = 36.4 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 15/155 (9%), Positives = 34/155 (21%), Gaps = 3/155 (1%) Query: 14 GIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLS 73 ++ FA+ + LL+ +ES A + Sbjct: 19 FGGDRGGVAALEFAIILPGLLLIYLGGFEASKALEASRKVESTAETVGNLIARNRTMTET 78 Query: 74 RLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMA 133 L + + S D K + + + + ++ Sbjct: 79 GLENIYNISSAIMVPFSTDGLKIVVTTVSVDDKGKGTVDWSQANTGPALDKGDRYDVP-- 136 Query: 134 NNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQR 168 + L T + V Y + + Sbjct: 137 -SELVFGTETYLVVVSVSYPYKPVMDYGGFFSGTT 170 >gi|75674505|ref|YP_316926.1| Flp pilus assembly protein TadG [Nitrobacter winogradskyi Nb-255] gi|74419375|gb|ABA03574.1| Flp pilus assembly protein TadG [Nitrobacter winogradskyi Nb-255] Length = 242 Score = 36.4 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 14/152 (9%), Positives = 41/152 (26%), Gaps = 8/152 (5%) Query: 7 FRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGAS 66 + + + FA+ V L+++ I V A N + A Sbjct: 50 LKDMASALRRDTRGLAATEFAMIVPLMLVMLFGTIEVSSGV--------AVNRKVTLVAR 101 Query: 67 KMVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSR 126 + S+ ++ + A + ++ ++S + + + + Sbjct: 102 TLSDLTSQSKVVNDADVTNFLAASYGIMWPYSSAPVQATISELYIDPATSVARVQWSKGK 161 Query: 127 ISMTHMANNRLDSSNNTIFYNMDVMTSYDYRL 158 + + S + V+ + Sbjct: 162 APRGAGSTVSIPSGLIGRDSSGKVLPNQYLIF 193 >gi|310288172|ref|YP_003939431.1| Hydrolase (HAD superfamily) [Bifidobacterium bifidum S17] gi|309252109|gb|ADO53857.1| Hydrolase (HAD superfamily) [Bifidobacterium bifidum S17] Length = 280 Score = 36.4 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 41/166 (24%), Gaps = 7/166 (4%) Query: 227 KMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTR---- 282 + A+ + Y + S G+ + Sbjct: 79 RHDIAPEAIDAATAYFERHGIENYQWQGADGMYISCGYARHLESKGSAVWKNGQFARYWH 138 Query: 283 --DMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGE 340 D ++ T + F + R + +L + + Sbjct: 139 LIDDIAVPDGSTLGHVIHASKGSYFSGADPDVSFDDVSRDLGEWFTLVHGSYDDISPNNG 198 Query: 341 NNNFKSNVNTIKICDKAKENFIKIV-TISINASPNGQRLLKTCVSS 385 +C AK+ IV TI+I S N +LK +S Sbjct: 199 ELLINGVDKGTALCGVAKDLGYSIVDTIAIGDSDNDAAMLKAAGTS 244 >gi|320322711|gb|EFW78804.1| hypothetical protein PsgB076_22337 [Pseudomonas syringae pv. glycinea str. B076] Length = 677 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 73/294 (24%), Gaps = 3/294 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ ++ AL++ LL ++ + +K S++ + A L A + + Sbjct: 12 ARQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGAA 71 Query: 76 GDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + R + G + + ++ H Sbjct: 72 TAPGFATQSATRNGFTPNTGGRTLTTLCGTLTVGLQSQRVFVADSTQALAIQVFAAHPVP 131 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + +F + + I S + + Sbjct: 132 RSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDSTRAAILNPLIGS 191 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L+ S ++ +T + L++ +S + + Sbjct: 192 LLGGTLNLSVANWQGLASTDLSLLSYLNRLKTDLNLTAVGYSEVLNTSVSVSKLIQSAIN 251 Query: 255 GLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L D L ++ + A+ ++L Sbjct: 252 VLDPGATLNGTATIAGLQALKLAAGATTVLLGDVLSIQGSSDLAALNTNLRLLD 305 >gi|71733733|ref|YP_276531.1| hypothetical protein PSPPH_4415 [Pseudomonas syringae pv. phaseolicola 1448A] gi|71554286|gb|AAZ33497.1| conserved hypothetical protein [Pseudomonas syringae pv. phaseolicola 1448A] Length = 677 Score = 36.4 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 73/294 (24%), Gaps = 3/294 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ ++ AL++ LL ++ + +K S++ + A L A + + Sbjct: 12 ARQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGAA 71 Query: 76 GDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + R + G + + ++ H Sbjct: 72 TAPGFATQSATRNGFTPNTGGRTLTTLCGTLTVGLQSQRVFVADSTQALAIQVFAAHPVP 131 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + +F + + I S + + Sbjct: 132 RSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDSTRAAILNPLIGS 191 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L+ S ++ +T + L++ +S + + Sbjct: 192 LLGGTLNLSVANWQGLASTDLSLLSYLNRLKTDLNLTAVGYSEVLNTSVSVSKLIQSAIN 251 Query: 255 GLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L D L ++ + A+ ++L Sbjct: 252 VLDPGATLNGTATIAGLQALKLAAGATTVLLGDVLSIQGSSDLAALNTNLRLLD 305 >gi|330873718|gb|EGH07867.1| hypothetical protein Pgy4_04872 [Pseudomonas syringae pv. glycinea str. race 4] Length = 687 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 73/294 (24%), Gaps = 3/294 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ ++ AL++ LL ++ + +K S++ + A L A + + Sbjct: 22 ARQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGAA 81 Query: 76 GDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + R + G + + ++ H Sbjct: 82 TAPGFATQSATRNGFTPNTGGRTLTTLCGTLTVGLQSQRVFVADSTQALAIQVFAAHPVP 141 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + +F + + I S + + Sbjct: 142 RSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDSTRAAILNPLIGS 201 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L+ S ++ +T + L++ +S + + Sbjct: 202 LLGGTLNLSVANWQGLASTDLSLLSYLNRLKTDLNLTAVGYSEVLNTSVSVSKLIQSAIN 261 Query: 255 GLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L D L ++ + A+ ++L Sbjct: 262 VLDPGATLNGTATIAGLQALKLAAGATTVLLGDVLSIQGSSDLAALNTNLRLLD 315 >gi|320101672|ref|YP_004177263.1| TadE family protein [Isosphaera pallida ATCC 43644] gi|319748954|gb|ADV60714.1| TadE family protein [Isosphaera pallida ATCC 43644] Length = 163 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 27/84 (32%), Gaps = 1/84 (1%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAI-LAGASKMVSNLSRLG 76 + + A+ VM + + +I + ++A A LA S L Sbjct: 20 RRGVTVVEMAIVVMVVFMFLMGIIEFSRILMTRVMWDNAARAGARLAVVSTDSLTDEELI 79 Query: 77 DRFESISNHAKRALIDDAKRFIKN 100 + +S A ++ + N Sbjct: 80 QKIDSFLPAATKSTVVGGFNPNVN 103 >gi|163748341|ref|ZP_02155615.1| hypothetical protein OIHEL45_20501 [Oceanibulbus indolifex HEL-45] gi|161378387|gb|EDQ02882.1| hypothetical protein OIHEL45_20501 [Oceanibulbus indolifex HEL-45] Length = 73 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 10/74 (13%), Positives = 23/74 (31%), Gaps = 3/74 (4%) Query: 4 LSRFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILA 63 + R E + +I+F + + + I ++ D + A +A + Sbjct: 1 MKRANKTMFSFFKDESGSQTIVFVILLPLLVWSILAMLAFTDMFRVRA---IATDATAVI 57 Query: 64 GASKMVSNLSRLGD 77 S + D Sbjct: 58 ADSLSRQTMPIDAD 71 >gi|294495833|ref|YP_003542326.1| cobaltochelatase [Methanohalophilus mahii DSM 5219] gi|292666832|gb|ADE36681.1| Cobaltochelatase [Methanohalophilus mahii DSM 5219] Length = 1982 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 13/284 (4%), Positives = 62/284 (21%), Gaps = 11/284 (3%) Query: 84 NHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNT 143 ++ + + + + + + + + Sbjct: 130 SYFDYHSDGTTSDPVATYYSNMGTEGEGLENAENLLTYLTMKSKVTYISYSANDALQTAS 189 Query: 144 IFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSM 203 + + Y + + + + ++Q S + + + ++ ++ Sbjct: 190 ESNDHSDLIEYTF-IDYSDSGISQEMINASESGFLETQDVIVCQGIYDAFTDNTTVNTTL 248 Query: 204 HCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRV 263 A + + P+ + + D ++ ++ G Sbjct: 249 KSAHDKGTVIYSIDPMGAYTPPSYFDYHSDGTTS-----DPVATYYSNMGTEGEGLENAE 303 Query: 264 EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVK 323 + ++ + + ++ + + Sbjct: 304 NLLTYLTMKSKVTYISYSANDALQTASESNDHSDLIE--YTFIDYSDSGISQEMINASES 361 Query: 324 IPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTI 367 + + + G + F N A + I +I Sbjct: 362 G---FLETQDVIVCQGIYDAFTDNTTVNTTLKSAHDKGTVIYSI 402 >gi|284992066|ref|YP_003410620.1| putative DNA helicase [Geodermatophilus obscurus DSM 43160] gi|284065311|gb|ADB76249.1| putative DNA helicase [Geodermatophilus obscurus DSM 43160] Length = 2005 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 20/278 (7%), Positives = 68/278 (24%), Gaps = 14/278 (5%) Query: 34 LLLIGFLIYV----LDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISNHAKRA 89 + + F++ + + +++A A+ A ++ ++ D + S A Sbjct: 743 VGMGPFVLDLHDRGAKASVVRAQVQAALEHAV-AVDAEGSADEEMDADSDDLHSARRTLA 801 Query: 90 LIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTIFYNMD 149 + +AV + + S + + L + + Sbjct: 802 RYAGRLHARNAAGLSYYTARTAVLAMRDDVPTLPVSPAFVAVASPETLTAVRRALALL-- 859 Query: 150 VMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMHCAMNS 209 ++ + V + A ++ + + L+ ++ Sbjct: 860 --PDIADMVRPSPRHAWAFVDTAQVDLLAAQQTAAAVDQAVRDLPSEPHLARALREVRTP 917 Query: 210 DPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEP 269 D+++ T + L + + + G + + Sbjct: 918 A--DLDALAHLLGGPATTLDVLDEVPTERWNVATTTVLGEVAAFSGAVHPGLELATPAAL 975 Query: 270 SWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTS 307 ++ + + +K L Sbjct: 976 EL---RLADLYVAAQTAQAGRFMARRAGLKAVRDQLRP 1010 >gi|209551754|ref|YP_002283671.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537510|gb|ACI57445.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 211 Score = 36.4 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 17/154 (11%), Positives = 42/154 (27%), Gaps = 7/154 (4%) Query: 6 RFRFYFKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 RF + + S + +I FAL + + L+I ++ + + +A + + Sbjct: 18 RFLRF-RALARSREGAAAIEFALLAIPYFLVIFAILETFVAFAAEELVSNAVDT----MS 72 Query: 66 SKMVSNLSRLGD--RFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 KM + + ++A ++ I+ E + Sbjct: 73 RKMRTGQITYNLGRTTDMNQAQFRQAFCNEISILIRCSATEVATPSKLYVDVQTFSTFSA 132 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 D + Y + + Sbjct: 133 IPTTIPKLSTAKYADINTAAFKYTPGGAGTINMV 166 >gi|257068879|ref|YP_003155134.1| CBS domain-containing protein [Brachybacterium faecium DSM 4810] gi|256559697|gb|ACU85544.1| CBS domain-containing protein [Brachybacterium faecium DSM 4810] Length = 444 Score = 36.4 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 12/140 (8%), Positives = 28/140 (20%), Gaps = 13/140 (9%) Query: 22 FSIIFALSVMSFLLLIGF------LIYVLDWHYKKNSMESANNAAIL---AGASKMVSNL 72 L ++ + L G + + A A+ A V Sbjct: 1 MLTFALLVLIPLIALAGAVSAVLTAVDAAQLSVSRT----ALEKAVAPRPAAVRDRVLAQ 56 Query: 73 SRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHM 132 R + + + + V + + M + Sbjct: 57 QADTARTLASVSLGRMLAETVMVAAVAGVTVAVRDHVGHSGLLLPGLATVVVAGLLMLLV 116 Query: 133 ANNRLDSSNNTIFYNMDVMT 152 + + + V T Sbjct: 117 LAISPRTIGRSRPEAVLVST 136 >gi|113866448|ref|YP_724937.1| Type IV pilus assembly protein, tip-associated adhesin PilY1 [Ralstonia eutropha H16] gi|113525224|emb|CAJ91569.1| Type IV pilus assembly protein, tip-associated adhesin PilY1 [Ralstonia eutropha H16] Length = 1090 Score = 36.4 bits (82), Expect = 9.6, Method: Composition-based stats. Identities = 14/129 (10%), Positives = 32/129 (24%), Gaps = 12/129 (9%) Query: 293 DSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIK 352 A+ + + ++ +P K G+ N + Sbjct: 204 YIANALNNGKPQDSGQDAYNKLKEAGATTAQLTPIPLPKGNNIDNWGDEWARFLNDLDLN 263 Query: 353 ICDKAKE-----NFIKIVTISINASPNGQR-----LLKTCVSSPEYHYNVV--NADSLIH 400 + A I TI + + + ++ V + D+L Sbjct: 264 AANNAGSTTATRQNIITYTIGVTDGSQQSQDYLQFVTSMAQKGGGGNFTVRAGDDDALSK 323 Query: 401 VFQNISQLM 409 ++I M Sbjct: 324 ALKDIFNEM 332 >gi|223938636|ref|ZP_03630527.1| hypothetical protein Cflav_PD2430 [bacterium Ellin514] gi|223892755|gb|EEF59225.1| hypothetical protein Cflav_PD2430 [bacterium Ellin514] Length = 1249 Score = 36.4 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 33/349 (9%), Positives = 87/349 (24%), Gaps = 24/349 (6%) Query: 15 IASEKANFSIIFALSVMSF--LLLIGFLIYVLDWHYK----KNSMES----ANNAAILAG 64 SE+ ++ + + + + I + ++ ++ A +AAI Sbjct: 3 RRSEQGVALVVTLILLSVITFMAVAFLAI--SRHEREGVINRSH-QNDSVFAADAAIEHI 59 Query: 65 ASKMVSNLSRLGDRFESISNHAKRALI-DDAKRFIKNHIKESLSGYSAVFYNTEIQNIVN 123 +K+VS + + F + + + R + + + + T+ + N Sbjct: 60 KAKLVSEMLVNNNGFNVGLLVSTNFINTNGFVRGLASFTNVNYDIRNTGQPLTQNDFLQN 119 Query: 124 SSRISMTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRI 183 + + + + ++ + + + +P + Sbjct: 120 VANLLILPRPPVFVSTNQAQRNGQLASEFRFYLDFNRNGVYDTNGWVTPYYGLVPPKINP 179 Query: 184 EMGERPIFLIELVV--DLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDS 241 I E + D H N + L N + + Sbjct: 180 RAPFFAIGDPEWIGILDRPELPHSRSNQFVARYA-FIALPMGSAMDINYLHNVAKYYTGT 238 Query: 242 IDLLSHVKEDVYMGLIGYTTRV--EKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMK 299 + D +G WG ++ + + Sbjct: 239 GQPYEGFRRDQGVGTWEINVGAFLFALNPTVWGAGGSTYAYDPIGNAQTVGL-----GFQ 293 Query: 300 QAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNV 348 A IL ++ + + P F + DG + + Sbjct: 294 DAADILAYRFNNNWNNPLTFAQMYPGATPSTLFQLDHADGYSLGLLATN 342 >gi|183602317|ref|ZP_02963684.1| possible ammonium ion transporter [Bifidobacterium animalis subsp. lactis HN019] gi|219682673|ref|YP_002469056.1| ammonium transporter [Bifidobacterium animalis subsp. lactis AD011] gi|241190251|ref|YP_002967645.1| ammonium transporter [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195657|ref|YP_002969212.1| ammonium transporter [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218531|gb|EDT89175.1| possible ammonium ion transporter [Bifidobacterium animalis subsp. lactis HN019] gi|219620323|gb|ACL28480.1| ammonium transporter [Bifidobacterium animalis subsp. lactis AD011] gi|240248643|gb|ACS45583.1| ammonium transporter [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250211|gb|ACS47150.1| ammonium transporter [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793238|gb|ADG32773.1| ammonium transporter [Bifidobacterium animalis subsp. lactis V9] Length = 431 Score = 36.4 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 42/187 (22%), Gaps = 11/187 (5%) Query: 18 EKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES-ANNAAILAGASKMVSNLSRLG 76 + N + I + + FL+ G ++ K + AA L+ + Sbjct: 5 DSGNAAWILTSACLVFLMTPGVALFYGGMVRAKAVLNMLIMEAAALSVTMIIWGLWGWSI 64 Query: 77 DRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNR 136 N D A F+ + G + A Sbjct: 65 AYA---GNSIGGVFGDPATGFLLRDTMVAQDGVFTSASLNSNNYPASVDVAFQAAFAMIT 121 Query: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEH-------LLNQRYNQKIVSFIPALLRIEMGERP 189 + I + T + +I ++ I L + Sbjct: 122 VALICGAIAERVKYSTWMIFVALWITFDYAPMAHMVWNGGLLSADGAISQALGAAAHDFA 181 Query: 190 IFLIELV 196 + + Sbjct: 182 GGTVVHI 188 >gi|289706415|ref|ZP_06502773.1| conserved hypothetical protein [Micrococcus luteus SK58] gi|289556910|gb|EFD50243.1| conserved hypothetical protein [Micrococcus luteus SK58] Length = 152 Score = 36.4 bits (82), Expect = 9.8, Method: Composition-based stats. Identities = 14/150 (9%), Positives = 40/150 (26%), Gaps = 10/150 (6%) Query: 8 RFYFKKGIASEKANFSIIFALSVMSF--LLLIGFLIYVLDWHYKKNSMESANNAAILAGA 65 R + +++ L+L+ + + ++S ++A +AGA Sbjct: 3 RPLTARLGRD-DGQTTVLTVGLCAVLVALMLVMLAV--TTVQLQHRRLQSLADSAAVAGA 59 Query: 66 SK----MVSNLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNI 121 + + + + + ++ + A+ + S + Sbjct: 60 EELGFRLGEDPGVVLSDGDVAASASAHLAAVGAREAVPGLGGMSARVAEDGTTVVVTLD- 118 Query: 122 VNSSRISMTHMANNRLDSSNNTIFYNMDVM 151 + +MT L S Sbjct: 119 ARADLFAMTGPFAGTLPLSVPLEATGSSRT 148 >gi|312215813|emb|CBX95765.1| similar to 26S proteasome non-ATPase regulatory subunit 4 [Leptosphaeria maculans] Length = 299 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 54/182 (29%), Gaps = 28/182 (15%) Query: 213 DVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGLIGYTTRVEKNIEPSWG 272 D + A D ++ A ++A+ L + + M + E + + Sbjct: 11 DNSEASRNGDYVPSRWEAQQDAVNLIFSAKTGANPESSVGLMSMG--GNTPEILVTLTTD 68 Query: 273 TEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKF 332 K+ + + I + + A L + + ++ Sbjct: 69 IGKI---LDGLHRTKIKGSSHFVTGINVAALALKHRQ----------------NKSQKQR 109 Query: 333 IIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISIN--ASPNGQRLLK---TCVSSPE 387 II + + N IK+ K K+ I I I+ + ++L S+ Sbjct: 110 IIIFN--CSPIEEDEKNLIKLAKKMKKTGISIDIIAFGELSDDTTRKLQAFSDNVQSAEG 167 Query: 388 YH 389 + Sbjct: 168 SY 169 >gi|103487753|ref|YP_617314.1| hypothetical protein Sala_2272 [Sphingopyxis alaskensis RB2256] gi|98977830|gb|ABF53981.1| conserved hypothetical protein [Sphingopyxis alaskensis RB2256] Length = 219 Score = 36.4 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 12/153 (7%), Positives = 36/153 (23%), Gaps = 11/153 (7%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMES----ANNAAI-LAGASKMVS 70 S +A + A ++ +L+ + + + + + A +A S + Sbjct: 20 QSTRAAVMLEMAFAIPFLILVGFGGLEIANLTLAHTRVSQLGLNTADNAARIAAGSNLTQ 79 Query: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSGYSAVFYNT------EIQNIVNS 124 R D E + A++ + + + ++ Sbjct: 80 PEIREVDINEVFAGAARQVAGMGFENNGRIILSSLQRNNDGGQTIKWQRCFGNLEVASAY 139 Query: 125 SRISMTHMANNRLDSSNNTIFYNMDVMTSYDYR 157 + T+ + Sbjct: 140 GVEGTGATGTDFPGMGPAGREVTAAAGTAIMFV 172 >gi|320330504|gb|EFW86483.1| hypothetical protein PsgRace4_08285 [Pseudomonas syringae pv. glycinea str. race 4] Length = 677 Score = 36.4 bits (82), Expect = 10.0, Method: Composition-based stats. Identities = 26/294 (8%), Positives = 73/294 (24%), Gaps = 3/294 (1%) Query: 16 ASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRL 75 A ++ ++ AL++ LL ++ + +K S++ + A L A + + Sbjct: 12 ARQRGAIGMMAALTLGLALLCTLIVVDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGAA 71 Query: 76 GDRFESISNHAKR-ALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMAN 134 + + + + R + G + + ++ H Sbjct: 72 TAPGFATQSATRNGFTPNTGGRTLTTLCGTLTVGLQSQRVFVADSTQALAIQVFAAHPVP 131 Query: 135 NRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIE 194 + + +F + + I S + + Sbjct: 132 RSIAAGIVALFEKTPSPANITISATAVAASAAPLAALTIRSAAVTMDSTRAAILNPLIGS 191 Query: 195 LVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYM 254 L+ S ++ +T + L++ +S + + Sbjct: 192 LLGGTLNLSVANWQGLASTDLSLLSYLNRLKTDLNLTAVGYSEVLNTSVSVSKLIQSAIN 251 Query: 255 GLIGYT--TRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILT 306 L D L ++ + A+ ++L Sbjct: 252 VLDPGATLNGTATIAGLQALKLAAGATTVLLGDVLSIQGSSDLAALNTNLRLLD 305 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.306 0.119 0.273 Lambda K H 0.267 0.0366 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,469,005,195 Number of Sequences: 14124377 Number of extensions: 124528065 Number of successful extensions: 1057833 Number of sequences better than 10.0: 10000 Number of HSP's better than 10.0 without gapping: 3060 Number of HSP's successfully gapped in prelim test: 7458 Number of HSP's that attempted gapping in prelim test: 1031644 Number of HSP's gapped (non-prelim): 24025 length of query: 420 length of database: 4,842,793,630 effective HSP length: 142 effective length of query: 278 effective length of database: 2,837,132,096 effective search space: 788722722688 effective search space used: 788722722688 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 82 (36.4 bits)